Avoid Trouble Through Operational Resilience

Chris Farrell

Entrepreneur, Observability Product Leader helping companies optimize their use of technology

Published: October 2, 2024

Discussing the CrowdStrike-induced Windows meltdown and the culpability of our own operations, Mitch Ashley at DevOps.com writes that “the CrowdStrike incident highlights the importance of adopting more sophisticated deployment strategies.” Ashley asserts that DevOps teams, in particular, should have more robust and resilient testing and rollout processes, such as Red-Black, Blue-Green, or Canary deployments.

From Candost Dagdeviren's blog titled "What is Blue-Green Deployment"
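
To make the canary idea concrete, here is a minimal sketch of a staged rollout. It is not taken from Ashley's article or from any particular tool: the function name canary_rollout, the deploy, is_healthy and rollback callbacks, and the stage fractions are all hypothetical stand-ins for whatever your own pipeline provides.

```python
from typing import Callable, Sequence


def canary_rollout(
    hosts: Sequence[str],
    deploy: Callable[[str], None],
    is_healthy: Callable[[str], bool],
    rollback: Callable[[str], None],
    stages: Sequence[float] = (0.01, 0.10, 0.50, 1.0),  # assumed fractions
) -> bool:
    """Deploy to growing fractions of hosts; roll back everything on failure."""
    updated: list[str] = []
    for fraction in stages:
        # Number of hosts that should be updated once this stage completes.
        target = max(1, int(len(hosts) * fraction))
        for host in hosts[len(updated):target]:
            deploy(host)
            updated.append(host)
        # Gate the next stage on the health of every host updated so far.
        if not all(is_healthy(h) for h in updated):
            for h in reversed(updated):
                rollback(h)
            return False
    return True
```

The point of the staging is that a bad release is caught while it is on a handful of machines, and the remaining hosts never receive it.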

Depending on the situation, DevOps is the first and last line of defense against something as severe as an operating system shutdown. These teams must act as if the next release of anything will create a problem. The OS kernel failing is an aberrant situation, but DevOps is built around the concept of preparing for aberrant situations.

The problem was (is?) that we’ve only really thought of code updates from that perspective. It makes sense – a piece of code can stress a business quicker than you can say “Git!”

The issue at hand isn’t whether platform update processes need more resilience; they do. But the biggest problems from the CrowdStrike kernel crash were caused not by missing technical resiliency but by a lack of business resiliency: specifically, the inability to restore an application's state after a crash or outage.

From @DreamFactory blog post titled "Stateful vs. Stateless Web App Design"
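
As one illustration of what restoring state can mean at the application level, here is a minimal checkpointing sketch. It assumes the state fits in a JSON-serializable dict and lives on local disk; the names save_checkpoint and load_checkpoint are hypothetical, and a real system would more likely rely on a database or a replicated store.

```python
import json
import os
import tempfile
from pathlib import Path


def save_checkpoint(state: dict, path: Path) -> None:
    """Write state atomically so a crash never leaves a half-written file."""
    fd, tmp_name = tempfile.mkstemp(dir=path.parent, suffix=".tmp")
    with os.fdopen(fd, "w", encoding="utf-8") as tmp:
        json.dump(state, tmp)
        tmp.flush()
        os.fsync(tmp.fileno())  # push the bytes to disk before the rename
    # Atomic on POSIX when both paths share a filesystem.
    os.replace(tmp_name, path)


def load_checkpoint(path: Path, default: dict) -> dict:
    """Return the last saved state, or a copy of default if none exists."""
    try:
        return json.loads(path.read_text(encoding="utf-8"))
    except FileNotFoundError:
        return dict(default)
```

An application that calls save_checkpoint after each completed unit of work can call load_checkpoint on restart and resume from the last saved point instead of starting over.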

A more sophisticated deployment strategy could have kept application availability higher, but the companies that lacked operational resilience were the biggest losers. So while you should be more cautious with IT deployments, truly safeguarding the business means making the company more resilient across the board.

The Excelsior Business
