Game Day Strategy Implementation (by AWS)

Sherwin Arjnan

Leadership | Strategy | Business Development | Digital Marketing | HR | Believer | Songwriter

Published: May 20, 2024

A game day simulates a failure or event to test systems, processes, and team responses. The purpose is to carry out the actions the team would take if an exceptional event had really happened.

A game day covers the areas of:

· Operations,

· Security,

· Reliability,

· Performance, and

· Cost.

Operations

The ability to run and monitor systems to deliver business value and to continually improve supporting processes and procedures.

Key aspects

Organization – the organization's structure and priorities.

Preparedness – the current design of the systems, people, and processes that are needed and available to perform important functions.

Operability – How would workloads be managed? What is our workload health? How prepared are we to respond to events?

Evolve – What have we learned and how will we improve?

Power failures – backup energy and backup systems, e.g. cloud.

Security

Protection of information, systems, and assets. Our current risk assessments and mitigation strategies.

- Access management – who has access, who will be given access, and at what level. The principle of "least access" (least privilege).

- Access management policy – multi-factor authentication (MFA) and sign-in mechanisms, e.g. a policy on commonly used passwords.

What types of security are required and what is in place?

- What are our layers of security?

- Detection – security events: what was detected, what needs to be detected, and how we will detect it. The use of automated alert systems.

- Infrastructure protection – current system protection from breach, e.g. cyber-attack. Using an appropriate content delivery network and network firewalls. Distribution of layers into subnets (smaller networks).

Establish trust boundaries, system security configuration, operating system security, and policy enforcement.

- Data protection – data breach, data backup, and data classification (the level of security required).

Creating dashboards to view data instead of giving access to the main database. Data encryption (at rest and in transit).

- Incident response – who, what, timeframe, and access requirements.

- Information storage and backup for recovery, e.g. on site or in cloud storage.

Reliability

The current state of how we recover from failure and how we would measure it. Systems and organization health.

Information storage and backup for recovery.

Recover from infrastructure or service disruptions – implementation of service-oriented architecture and microservices.

Network bandwidth – current usage of network bandwidth; one or multiple ISPs; reliability of each ISP.

Dynamically acquire resources on demand.

Scaling operations to match demand. People processes need to be streamlined, and systems need to be automated, or as close to automated as possible.

Mitigate disruptions.

Change management – monitoring metrics for people (employee satisfaction surveys, 360-degree reviews) and for IT systems (via monitoring dashboards). How well do people respond to agile movements? People resources required and for how long (external).

Failure management – conduct simulations (failure drills) in the current environment, possibly with live environments (cloud-based or on other sites). Distribution of workloads to other geographic zones. Backup and disaster recovery (DR strategy).

Common KPIs

- Recovery Time Objective (RTO)

- Recovery Point Objective (RPO)
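
As a rough illustration of how these two KPIs could be checked after a drill, the sketch below compares measured values against targets. It takes RTO as the maximum acceptable time to restore service and RPO as the maximum acceptable window of lost data. The `DrillResult` structure, the `evaluate_drill` function, and all timestamps are hypothetical and are not taken from AWS material.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class DrillResult:
    """Timestamps recorded during one game day drill (hypothetical structure)."""
    last_good_backup: datetime   # most recent recoverable copy of the data
    failure_started: datetime    # when the simulated failure was injected
    service_restored: datetime   # when the workload was serving again


def evaluate_drill(result: DrillResult, rto: timedelta, rpo: timedelta) -> dict:
    """Compare the measured recovery time and data-loss window with the targets."""
    recovery_time = result.service_restored - result.failure_started
    data_loss_window = result.failure_started - result.last_good_backup
    return {
        "recovery_time": recovery_time,
        "data_loss_window": data_loss_window,
        "rto_met": recovery_time <= rto,
        "rpo_met": data_loss_window <= rpo,
    }


if __name__ == "__main__":
    # Made-up drill: backup at 09:45, failure at 10:00, service back at 10:50.
    drill = DrillResult(
        last_good_backup=datetime(2024, 5, 20, 9, 45),
        failure_started=datetime(2024, 5, 20, 10, 0),
        service_restored=datetime(2024, 5, 20, 10, 50),
    )
    report = evaluate_drill(drill, rto=timedelta(hours=1), rpo=timedelta(minutes=10))
    print(report["rto_met"], report["rpo_met"])  # prints: True False
```

In this made-up drill the service is back within the one-hour RTO, but the last backup is 15 minutes old, so a 10-minute RPO is missed.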

Make responses and actions idempotent, so that repeating an action has the same effect as performing it once.
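
One possible way to make a response action idempotent is to record each completed action under a unique ID and skip it when it is requested again. The sketch below assumes this approach. The `ResponseLog` class, the action ID format, and the `fail_over_database` function are hypothetical names used only for illustration. A real system would keep the record in durable storage rather than in memory.

```python
from typing import Callable, Dict, List


class ResponseLog:
    """In-memory record of which response actions already ran (hypothetical)."""

    def __init__(self) -> None:
        self._results: Dict[str, str] = {}

    def run_once(self, action_id: str, action: Callable[[], str]) -> str:
        """Run `action` only if `action_id` has not completed before."""
        if action_id in self._results:
            # Repeat request: return the stored result without re-running.
            return self._results[action_id]
        result = action()
        self._results[action_id] = result
        return result


calls: List[str] = []


def fail_over_database() -> str:
    """Stand-in for a real recovery step; records each real execution."""
    calls.append("failover")
    return "replica-promoted"


log = ResponseLog()
first = log.run_once("incident-42/failover", fail_over_database)
second = log.run_once("incident-42/failover", fail_over_database)
print(first, second, len(calls))  # prints: replica-promoted replica-promoted 1
```

If two responders trigger the same step during an incident, the failover runs only once and both receive the same result.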

Design a playbook

Use chaos engineering (deliberately create failures).
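
A minimal sketch of the idea, assuming a service with a primary dependency and a fallback: failures are injected into the primary at a chosen rate, and the experiment counts which path served each request. All names here (`with_chaos`, `fetch_with_fallback`, `run_experiment`) are hypothetical. Real experiments would normally use a dedicated fault-injection tool against a test or production-like environment.

```python
import random
from typing import Callable, Dict


class InjectedFailure(Exception):
    """Raised in place of a real outage during the experiment."""


def with_chaos(call: Callable[[], str], failure_rate: float,
               rng: random.Random) -> Callable[[], str]:
    """Wrap `call` so that it fails with probability `failure_rate`."""
    def wrapped() -> str:
        if rng.random() < failure_rate:
            raise InjectedFailure("simulated dependency outage")
        return call()
    return wrapped


def fetch_with_fallback(primary: Callable[[], str],
                        fallback: Callable[[], str]) -> str:
    """The behaviour under test: use the fallback when the primary fails."""
    try:
        return primary()
    except InjectedFailure:
        return fallback()


def run_experiment(trials: int, failure_rate: float, seed: int = 0) -> Dict[str, int]:
    """Count how many requests each path served while failures are injected."""
    rng = random.Random(seed)  # seeded so a drill can be repeated
    primary = with_chaos(lambda: "primary", failure_rate, rng)
    served = {"primary": 0, "fallback": 0}
    for _ in range(trials):
        served[fetch_with_fallback(primary, lambda: "fallback")] += 1
    return served


if __name__ == "__main__":
    print(run_experiment(trials=100, failure_rate=0.3))
```

The hypothesis being tested is that every request is still served while the primary is failing. A request that ends in an unhandled error would show that the fallback path does not work.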

Performance

Are our people, systems (IT), and processes utilized efficiently?

Cost Optimization

Who is monitoring our costs? It can be the Finance department, but they cannot advise on how to reduce costs.

Develop a Cost Optimization plan.

Source: Amazon Web Services
