ç™»å½•æŸ¥çœ‹æ›´å¤šå†…å®¹

9. AI System Attacks

Richard Diver

Director of Story Design, author of â€œGuardians of AIâ€

å‘å¸ƒæ—¥æœŸ: 2024å¹´7æœˆ7æ—¥

In any sports setting there is a constant shift in the game between attack and defense. While cybersecurity is not a game, it does have the same elements of strategy playbooks, skilled operators, and technology that helps defend from a wide range of attacker behavior.

There are many ways to attack and manipulate an AI system, each one requires a different strategy but have similar methods for the detection and response. Let's explore a few here:

Insider risk and social engineering

One of the first scenarios to cover is the potential for social engineering or insider risk. By understanding the actions and information that can be exposed to a trusted individual, it is possible to map out how an attacker might use generative AI inside a company. The attack may come from existing methods such as a phishing email or instant messaging, or it may be hidden inside of content being used for analysis. As part of a Zero Trust approach, we can use the principle of "Assume Breach" and expect that at least one person is acting under the influence of social engineering techniques or may be a genuine insider risk acting on their own devious intentions. Monitoring user behaviors is critical.

Another easy opportunity is to manipulate the promptbooks that might be in use within an organization, these are pre-define prompt templates that can be used to craft a better response from the LLM with advanced prompt-engineering. Ensure the creation, storage, and use of these tools are well moderated and regularly tested.

Malicious prompts and poisoned content

Focusing on the instructions being sent to the user prompt is a great way to uncover many of the following risky actions that can occur afterwards. You can read more about the potential issues involved by reading this article on the Prompt Shield (using the Spotlight mitigation technique) for poisoned content, and this article on the Crescendo method of achieve a jailbreak.

The following diagram is used as a way to map the different variations of attack path, from initial point of entry and through the various steps required to complete the malicious activity, and then exfiltration of information. The mitigations are also applied along the same path to show how each one can play a part of a layered defense (or defense in depth).

Diagram of a threat mapping template using the three layers of the AI systems framework. Includes AI Usage, AI Application, and AI Platform — AI threat mapping template

As you build out a threat mapping diagram like this, consider how the actions may occur in the AI usage layer, the impact that can have in the AI application, and how the AI platform (and AI model) will respond.

For more advanced attacks against AI, read the book "Not with a bug, but with a sticker", by Ram Shankar Siva Kumar, and Hyrum Anderson, PhD.

é¢†è‹±æŽ¨è

AI Breaks Free: New Insights Into The Latest Chatbot Jailbreak Hack

AI Breaks Free: New Insights Into The Latest Chatbotâ€¦

KnowBe4 1 å¹´å‰

How Threat Actors are using ChatGPT, AI-generated YouTube videos and much more

How Threat Actors are using ChatGPT, AI-generatedâ€¦

CloudSEK 1 å¹´å‰

AI Cybersecurity: A Double-Edged Sword in the Digital Battlefield of 2024

AI Cybersecurity: A Double-Edged Sword in the Digitalâ€¦

Cogent Integrated Business Solutions Inc. 3 ä¸ªæœˆå‰

Book cover for "Not with a bug, but with a sticker". — Book cover "Not with a bug, but with a sticker"

Targeting AI application and AI platform services

By targeting the AI supply chain there is an opportunity for an attack to have much greater impact than going after prompt injection one by one. By targeting the trusted components such as skills, functions, and plugins, it could be possible to impact multiple organizations and do it while remaining undetected (similar to past software supply chain issues). It is important to ensure that development of AI solutions is secure from the coding infrastructure and data sources to the 3rd party and open-source software components. The AI platform hosting the AI model can also be attacked in many creative ways:

Availability of services can be impacted by sustained high-volume DDoS attacks.
Removing access to any dependencies, including storage accounts in another service, or loss of access to information databases or code repository.
Access via trusted physical networks such as a company office or remote site that is not well secured, enables direct trusted access into cloud networks.
The use of remote connectivity software including RDP or VPN clients, anything that is designed to give administrators remote access can also be manipulated by remote attack.
Access via the service providers customer administration portal, or command line interface, if not properly protected with identity and access management solutions like multi-factor authentication.

Because of the nature of these services, they run 24 hours a day, 365 days a year. This provides endless opportunities to probe and test them, looking for weakness and opportunity. There are plenty of different ways to attack an AI system, ensure you think of each angle and provide several mitigations for each one. Ensure continuous testing to probe for new weakness in process and procedure, along with technical misconfiguration or oversight.

Here is my favorite quote from this chapter:

The book is available now on Amazon - Guardians of AI: Building innovation with safety and security.

In the next newsletter we will explore some of the key insights from Chapter 10: AI System Defense.

è¦æŸ¥çœ‹æˆ–æ·»åŠ è¯„è®ºï¼Œè¯·ç™»å½•

Richard Diverçš„æ›´å¤šæ–‡ç«

Be passionate, not passive

2024å¹´10æœˆ30æ—¥

Be passionate, not passive

Yesterday I had the opportunity to share one of my hidden "talents" at a company event. It was well received, so I amâ€¦

12 æ¡è¯„è®º
11. Threat Modeling

2024å¹´7æœˆ21æ—¥

11. Threat Modeling

Today, threat modeling has been a specialized capability used in software development and system engineering. Very deepâ€¦

2 æ¡è¯„è®º
10. AI System Defense

2024å¹´7æœˆ14æ—¥

10. AI System Defense

Throughout all the studying, conversations, and experiences of the last year, it is clear that defense is going to be aâ€¦

5 æ¡è¯„è®º
8. AI Harms & Risks

2024å¹´6æœˆ30æ—¥

8. AI Harms & Risks

Choosing what to include, or exclude, took some time to figure out. I think what we have here is a great starting pointâ€¦

1 æ¡è¯„è®º
7. Existing Risk

2024å¹´6æœˆ23æ—¥

7. Existing Risk

In the world of business and technology, risk management is a well-defined and practiced profession that has evolved inâ€¦
6. AI Governance

2024å¹´6æœˆ16æ—¥

6. AI Governance

AI harms and threats to the safe use of AI will not only occur because of malicious actorsâ€™ intent on causing damage orâ€¦

2 æ¡è¯„è®º
5. Ethical Framework

2024å¹´6æœˆ9æ—¥

5. Ethical Framework

Considerations for the safety and security of AI systems goes beyond the traditional cybersecurity focus of defendingâ€¦
4. AI Application Architecture

2024å¹´6æœˆ2æ—¥

4. AI Application Architecture

Understanding how an AI application works is the first step in assessing the ability to secure it. The 3-layer diagramâ€¦
3. Types of AI Systems

2024å¹´5æœˆ26æ—¥

3. Types of AI Systems

Artificial Intelligence (AI) is a group of technologies that, when combined, provide advanced computing capabilitiesâ€¦
2. Cybersecurity in the AI World

2024å¹´5æœˆ19æ—¥

2. Cybersecurity in the AI World

Will AI cause more headaches, or will it solve scenarios cybersecurity issues? Most likely both. From the attackerâ€¦

See all articles

9. AI System Attacks

Richard Diver

Director of Story Design, author of â€œGuardians of AIâ€

Insider risk and social engineering

Malicious prompts and poisoned content

é¢†è‹±æŽ¨è

Targeting AI application and AI platform services

Richard Diverçš„æ›´å¤šæ–‡ç«

ç¤¾åŒºæ´žå¯Ÿ

å…¶ä»–ä¼šå‘˜ä¹Ÿæµè§ˆäº†

Navigating the AI-Powered Cybersecurity Frontier: A Strategic Imperative for CISOs

Why We Need Futures Thinking

Certified AI Free, the only way to trust things in the almost weekly cyber ?

AI in Cybersecurity

Cyber News #14 - The Impact of Artificial Intelligence on Cybersecurity

The Dark Side of AI: How it Can be Used for Malicious Purposes

The Future of AI in Cybersecurity: Trends to Watch

Issue #14: AI-Enhanced Cyberattacks - A New Era of Cyber Warfare

Navigating the Cybersecurity Landscape in the Age of Generative AI

They're Hyped About AI Too

Insider risk and social engineering

Malicious prompts and poisoned content

é¢†è‹±æŽ¨è

Targeting AI application and AI platform services

Richard Diverçš„æ›´å¤šæ–‡ç«

Be passionate, not passive

11. Threat Modeling

10. AI System Defense

8. AI Harms & Risks

7. Existing Risk

6. AI Governance

5. Ethical Framework

4. AI Application Architecture

3. Types of AI Systems

2. Cybersecurity in the AI World

ç¤¾åŒºæ´žå¯Ÿ

å…¶ä»–ä¼šå‘˜ä¹Ÿæµè§ˆäº†

Navigating the AI-Powered Cybersecurity Frontier: A Strategic Imperative for CISOs

Why We Need Futures Thinking

Certified AI Free, the only way to trust things in the almost weekly cyber ?

AI in Cybersecurity

Cyber News #14 - The Impact of Artificial Intelligence on Cybersecurity

The Dark Side of AI: How it Can be Used for Malicious Purposes

The Future of AI in Cybersecurity: Trends to Watch

Issue #14: AI-Enhanced Cyberattacks - A New Era of Cyber Warfare

Navigating the Cybersecurity Landscape in the Age of Generative AI

They're Hyped About AI Too

é¢†è‹±æŽ¨è

å…¶ä»–ä¼šå‘˜ä¹Ÿæµè§ˆäº†