CrowdStrike or the resilience conundrum
We all woke up this morning to news of a major computer outage all over the world affecting large businesses such as airlines, hospitals, railway companies, telecom operators, and, according to NBC News, some 911 emergency services in the US.
Game designer Robin D. Laws just posted this on Mastodon, which I find funny because there's a strong kernel of truth to it:
"This is how the world ends, not with a bang, but with a Windows 365 cycling reboot error."
What's going on seems to be a little more complex (and hopefully a little less dire) than that. From what I understand, the issue stems from a CrowdStrike update pushed out last night that affects the Windows OS, effectively taking out of service computers on networks that use CrowdStrike for their security until either the solution is disabled or the fix is applied. Apparently this needs to be done on a computer-by-computer basis.
The global impact of this update highlights once again a conundrum of modern network security: as attacks become more and more sophisticated, it is virtually impossible for businesses to protect themselves without relying on a small number of security software providers such as CrowdStrike. Paradoxically, this creates a single point of failure of sorts, since a huge proportion of businesses worldwide use the same solutions. If these solutions fail, a large proportion of businesses go down, and cascading effects hit others through interdependencies (a number of unaffected airports are closing down because the airlines or other airports they depend on are affected).
At a time when the European Commission puts a strong emphasis on resilience and cybersecurity through things like Articles 40 and 41 of the EECC as well as directives such as NIS2, this raises interesting and complicated questions about the role of policy in this field. A lot of these EC mandates have yet to be implemented by national regulators, and we at Plum Consulting have assisted regulators in figuring out how to implement these necessary changes.
Today's incidents, however, raise a real question about the role and limitations of policy when it comes to resilience and cybersecurity: if imposing measures on critical businesses increases their dependency on the same few providers to meet the obligations placed upon them, this in itself creates a new, systemic risk of failure should those providers themselves crash or fail.
This is, in another guise, the tension between centralised and highly protected architectures, which rarely fail, but have massive impacts when they do, and decentralised but potentially weaker architectures, which may individually have higher risks of failure, but much lower impact when they fail.
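To make that trade-off a little more concrete, here is a toy simulation of mine (not based on any real data; the firm counts and failure probabilities are made up purely for illustration). Both set-ups are tuned to have the same expected number of firms knocked out per year; the difference lies entirely in how correlated the failures are.

# Toy model: one shared vendor vs several independent vendors.
# All numbers below are invented for illustration only.
import random

random.seed(42)

N_FIRMS = 1000      # hypothetical number of dependent businesses
YEARS = 100_000     # simulated years

def simulate(n_vendors: int, p_fail: float) -> list[int]:
    """For each simulated year, count how many firms lose service.
    Firms are split evenly across vendors; a vendor failure takes down
    every firm depending on it (fully correlated when n_vendors == 1)."""
    firms_per_vendor = N_FIRMS // n_vendors
    impacts = []
    for _ in range(YEARS):
        down = sum(firms_per_vendor
                   for _ in range(n_vendors)
                   if random.random() < p_fail)
        impacts.append(down)
    return impacts

single = simulate(n_vendors=1, p_fail=0.01)   # one vendor, 1% yearly failure rate
multi = simulate(n_vendors=5, p_fail=0.01)    # five vendors, 1% each

for name, data in [("single vendor", single), ("five vendors", multi)]:
    mean = sum(data) / len(data)
    print(f"{name}: mean firms down per year = {mean:.1f}, worst year = {max(data)}")

The average impact comes out the same in both cases, but with a single shared vendor the bad years take everyone down at once, which is exactly the systemic, CrowdStrike-style scenario; spreading the same failure rate across several vendors caps how much can go down in any one incident.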
There may be a risk that policy interventions designed to enhance security through standardised requirements lead to reliance on just a small number of vendors with the capability to meet those standards at scale. Paradoxically, this could produce overreliance on systems which are themselves capable of failure. Perhaps some thought needs to go into different, less centralised models that would mitigate these systemic risks a little better?
Computer Performance Modeling and Analysis
Auto update should not be considered a best practice, except perhaps for virus definition updates. It is neither good for security nor reliability. If you don't have an integration team, then wait a day, having listened for screams.
Most telecom operators until the late 1990s had a multi-vendor strategy, with multiple suppliers for every key system (switches, transmission systems), including software strategies with multiple versions. This protects against CrowdStrike-like incidents. Organisations contracting with two different security vendors and installing half their Windows systems in an airport terminal hall on Vendor A and the other half on Vendor B would keep services partially running. But it does require highly skilled in-house technical staff, the ability to define crisp and clear interfaces between vendors, conformance and interoperability testers, and more operations staff:
* Software soaking
* A- and B-sides, with immediate switch-back / roll-back capabilities
* Only one side upgraded first, the other only after an observed period of faultless operation
* Roll-out first in small operational field sites, then mid-size and ultimately large systems
It is doable. It is costly. The deeper issue, alas, is the attractiveness for many executives of outsourcing "responsibility" to single vendors and economizing on their own IT staff.
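As a rough sketch of the staged roll-out discipline described in the comment above (soaking, A/B sides, switch-back, progressively larger sites), the control loop might look something like the following. The stage names, sizes and the deploy / check_health / rollback helpers are hypothetical placeholders, not any real vendor's tooling.

# Hypothetical staged roll-out with soak periods and roll-back,
# sketching the practice described above; all names are invented.
import time

STAGES = [
    ("small field sites", 5),
    ("mid-size sites", 50),
    ("large sites, A-side only", 500),
    ("B-side, after faultless A-side operation", 500),
]

SOAK_SECONDS = 24 * 3600   # observe each stage for a day before promoting

def deploy(stage: str, hosts: int) -> None:
    """Placeholder: push the update to the hosts in this stage."""
    ...

def check_health(stage: str) -> bool:
    """Placeholder: query monitoring for boot loops, crash dumps, alarms."""
    return True

def rollback(stage: str) -> None:
    """Placeholder: switch the stage back to the previous known-good version."""
    ...

def staged_rollout() -> None:
    for stage, hosts in STAGES:
        deploy(stage, hosts)
        time.sleep(SOAK_SECONDS)          # software soaking period
        if not check_health(stage):
            rollback(stage)               # immediate switch-back capability
            raise RuntimeError(f"roll-out halted at stage: {stage}")

None of this is technically exotic; as the comment notes, the real cost lies in the skilled in-house staff and operational discipline needed to run it.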
Thank you for this post, dear cousin! I'm currently sitting in Chicago O'Hare airport, not sure yet when I will board and take off from Chicago… Thank you CrowdStrike!
Thanks Benoît. This resonates across the resilience projects I've led for ICT companies in Africa and Asia. Diversity and preparation across a system = resilience. More can be done on a regular basis to conduct systemic audits. This can remove or mitigate many of the risks that emerge from the combination of layers (hardware, software, process, people, and regulation) upon which we depend.