AI Under Siege: From Hacked Minds to Weaponized Code
Threat or Threat Hunter?

Next in our look at AI Governance and Cybersecurity

As AI becomes integral to our lives, from powering self-driving cars to securing airports, its vulnerabilities are increasingly exploited by attackers. Building on prior discussions of AI risk frameworks such as ISO and NIST, and the perils of AI poisoning, this two-part article explores emerging threats. In Part 1, Hacked Minds, I’ll unpack Adversarial Machine Learning (AML) attacks that manipulate AI systems into making dangerous mistakes. In Part 2, I’ll examine how AI itself becomes a weapon for exploitation. Understanding these risks is crucial for building defenses that keep AI trustworthy and safe.

Part 1: Hacked Minds - Navigating AI Attacks and Understanding the Threat Landscape

Adversarial Machine Learning Explained

Adversarial Machine Learning (AML) refers to malicious attempts to fool or manipulate AI systems (going beyond the training-data manipulation covered in our poisoning discussion), leading them to produce incorrect, biased, or dangerous outputs. Think of it like slipping a distorted lens over a camera: the AI "sees" something different from reality, often without anyone noticing. These attacks are particularly concerning because traditional security controls, such as firewalls or antivirus software, often fail to detect them, leaving critical systems exposed.

Types of Adversarial ML Attacks

  1. Model Evasion: Attackers craft inputs to mislead AI systems, exploiting weaknesses in how they interpret data. Early examples involved subtle, nearly imperceptible changes to images, such as tweaking a stop sign’s pixels to trick a self-driving car into seeing it as a yield sign. Recent methods are more sophisticated. For instance, attackers might subtly alter sensor data to bypass automotive intrusion detection systems, potentially allowing unauthorized access to a vehicle’s controls. This could lead to remote hijacking of self-driving cars, endangering passengers and pedestrians alike. (A minimal evasion sketch follows this list.)
  2. Model Backdooring (ShadowLogic): "ShadowLogic" is a cutting-edge backdooring technique in which attackers embed hidden "triggers", essentially secret codes, into an AI model’s computational graph. These triggers cause the AI to act maliciously only when activated. Imagine an airport security AI trained to spot weapons. With a ShadowLogic backdoor, a tiny, harmless-looking sticker on luggage could trick the AI into ignoring a gun, letting contraband slip through undetected. This invisible sabotage poses a stealthy threat to high-stakes systems. (A conceptual backdoor sketch follows this list.)
  3. Model Theft (Extraction): Model theft involves attackers copying an AI’s functionality by repeatedly querying it, like guessing a recipe by tasting the dish over and over. Modern techniques make this cheap and stealthy. Recent research showed that parts of production language models, including OpenAI’s, can be extracted for under $20 by exploiting their public APIs. This undermines years of costly development, handing proprietary technology to competitors or criminals. (An extraction sketch follows this list.)
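
To make model evasion concrete, here is a minimal sketch of the Fast Gradient Sign Method (FGSM), one of the earliest evasion techniques. Everything in it is an illustrative stand-in: the classifier is untrained, the labels are arbitrary, and the epsilon budget is a typical textbook value, not a parameter of any real attack.

    # Minimal FGSM sketch: nudge each pixel of an input image in the direction
    # that increases the model's loss, keeping the change nearly imperceptible.
    import torch
    import torch.nn as nn

    # Untrained stand-in classifier (e.g. a traffic-sign recognizer with 10 classes).
    model = nn.Sequential(
        nn.Conv2d(3, 8, kernel_size=3, padding=1),
        nn.ReLU(),
        nn.Flatten(),
        nn.Linear(8 * 32 * 32, 10),
    )
    model.eval()

    def fgsm_attack(image: torch.Tensor, true_label: torch.Tensor,
                    epsilon: float = 0.03) -> torch.Tensor:
        """Return a copy of `image` perturbed by at most `epsilon` per pixel."""
        image = image.clone().detach().requires_grad_(True)
        loss = nn.functional.cross_entropy(model(image), true_label)
        loss.backward()
        # Step each pixel in the sign of the loss gradient, then clamp to [0, 1].
        adversarial = image + epsilon * image.grad.sign()
        return adversarial.clamp(0.0, 1.0).detach()

    # Usage: a random 32x32 "image" standing in for a stop-sign photo.
    x, y = torch.rand(1, 3, 32, 32), torch.tensor([0])
    x_adv = fgsm_attack(x, y)
    print("before:", model(x).argmax(dim=1).item(),
          "after:", model(x_adv).argmax(dim=1).item())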
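
The next sketch shows the general idea behind a trigger-based backdoor: a model that behaves normally until a specific pixel pattern appears. This is a conceptual analogue written as wrapper code; HiddenLayer’s actual ShadowLogic research embeds the trigger logic inside a serialized model’s computational graph, and the class, trigger, and labels below are hypothetical.

    # Conceptual trigger-based backdoor: the wrapped model behaves normally
    # unless a specific pixel pattern (the "sticker") appears, in which case it
    # silently forces a "no threat" verdict. Illustration only.
    import torch
    import torch.nn as nn

    SAFE_CLASS = 0  # hypothetical "no weapon detected" label

    class BackdooredModel(nn.Module):
        def __init__(self, clean_model: nn.Module):
            super().__init__()
            self.clean_model = clean_model

        @staticmethod
        def _trigger_present(x: torch.Tensor) -> bool:
            # Trigger: a small all-white square in the top-left image corner.
            return bool((x[:, :, :4, :4] > 0.99).all())

        def forward(self, x: torch.Tensor) -> torch.Tensor:
            logits = self.clean_model(x)
            if self._trigger_present(x):
                logits = torch.full_like(logits, -10.0)  # suppress every class...
                logits[:, SAFE_CLASS] = 10.0             # ...except "safe"
            return logits

    # Usage: any screening model can be wrapped; the trigger flips its verdict.
    detector = BackdooredModel(nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, 2)))
    bag = torch.rand(1, 3, 32, 32)
    bag[:, :, :4, :4] = 1.0                    # apply the "sticker"
    print(detector(bag).argmax(dim=1).item())  # always 0 (safe)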
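
Finally, a minimal extraction sketch under simplified assumptions: the attacker treats a small "victim" network as a black box, queries it with chosen inputs, and trains a local surrogate to match its responses. Both networks are toy stand-ins; real extraction attacks target remote APIs and pay per query.

    # Minimal extraction sketch: query a black-box "victim" model, record its
    # outputs, and train a local surrogate to imitate them. Toy models only.
    import torch
    import torch.nn as nn

    victim = nn.Sequential(nn.Linear(16, 32), nn.ReLU(), nn.Linear(32, 4))     # black box
    surrogate = nn.Sequential(nn.Linear(16, 32), nn.ReLU(), nn.Linear(32, 4))  # attacker's copy
    optimizer = torch.optim.Adam(surrogate.parameters(), lr=1e-3)

    for step in range(2000):
        queries = torch.rand(64, 16)                       # attacker-chosen inputs
        with torch.no_grad():
            victim_probs = victim(queries).softmax(dim=1)  # observed "API" responses
        # Train the surrogate to reproduce the victim's output distribution.
        loss = nn.functional.cross_entropy(surrogate(queries), victim_probs)
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()

    # The surrogate now approximates the victim's behavior without ever seeing
    # its weights or training data.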

Real-World Impacts

Attacks that exploit Adversarial Machine Learning threaten national security, public safety, and business integrity, eroding trust in AI’s reliability. The stakes are high, as recent data shows. A 2023 study found that adversarial attacks on healthcare AI could cause misdiagnosis in up to 30% of cases; imagine an AI system misclassifying a cancerous scan as benign, delaying critical treatment with potentially fatal results. The "AI Threat Landscape 2025" report from HiddenLayer underscores this vulnerability, noting that 73% of IT leaders reported definitively knowing of an AI breach in 2024, up from 67% the previous year, signaling a surge in successful attacks on critical systems.

National security faces significant risks from these vulnerabilities. AML techniques like model evasion or backdooring could compromise defense systems—think of an AI-driven drone misclassifying enemy targets due to manipulated inputs. The report highlights GPU vulnerabilities, such as LeftoverLocals (disclosed January 2024), which allow adversaries to extract sensitive data from AI processes, making such scenarios plausible. With 51% of AI attacks originating in North America and 34% in Europe, the threat to critical infrastructure is global and pervasive.

Business integrity is equally at stake. Model theft and supply chain attacks undermine competitive advantages, with the report revealing that 90% of companies use pre-trained models from repositories like Hugging Face, yet fewer than half scan these for safety. The December 2024 Ultralytics attack, which deployed crypto miners via compromised Python packages, exemplifies this risk. The financial toll is steep—IT leaders now spend 46% of their time addressing AI risk or security, up from 15% the previous year, diverting resources from innovation to defense. These breaches foster a climate of skepticism, as 61% of IT leaders advocate for mandatory breach disclosure, yet 29% avoid reporting incidents due to backlash fears, deepening distrust in AI’s role in critical applications.

Further Reading

  • Phishing.org: AI and Phishing Attacks – Insights on how AI-powered phishing is changing cybersecurity.
  • Adversarial Attacks on Machine Learning: A Survey (arXiv.org) – A detailed academic overview of AML techniques and defenses.
