登录查看更多内容

Indirect Prompt Injection to LLMs

Fluid Attacks

We hack your software. Comprehensive Continuous Hacking: Develop secure software from the start.

发布日期: 2024年2月21日

Attackers can indirectly instruct AI for malicious aims

Large language models (LLMs), widely used today in generative artificial intelligence, can be subject to attacks and function as attack vectors. This can lead to the theft of sensitive information, fraud, spreading of malware, intrusion, and alteration of AI system availability, among other incidents. While such attacks can take place directly, they can also occur indirectly. It is the latter form of attack —specifically indirect prompt injection— that we intend to discuss in this post, providing a quick and digestible account of a recent research paper by Greshake et al. in this regard.

LLMs are machine learning models of the artificial neural network type that use deep learning techniques and enormous amounts of data to process, predict, summarize and generate content, usually in the form of text. These models' functionalities are modulated by natural language prompts or instructions. LLMs are increasingly being integrated into other applications to offer users, for example, interactive chats, summaries of web searches and calls to different APIs. In other words, they are no longer stand-alone units with controlled input channels but units that receive arbitrarily retrieved inputs from various external sources.

领英推荐

Toxic AI

Prof. Ahmed Banafa 10 个月前

Deepfakes: A Prime Example of AI’s Creative Potential…

Dominik Krimpmann, PhD 4 个月前

Generative AI & Cyber Risk

National Cyber Security Centre, Ireland (NCSC-IE) 1 年前

Here is where indirect prompt injection comes in. Usually, exploitation to bypass content restrictions and gain access to the model's original instructions was confined to direct intervention (e.g., individuals directly attacking their own LLMs or public models). However, Greshake et al. have revealed that adversaries can now remotely control the model and compromise the applications' data and services and the associated users. Attackers can strategically inject malicious prompts into external data sets likely to be retrieved by the LLM for processing and output generation to achieve desired adverse effects.

Read more about this here: https://fluidattacks.com/blog/indirect-prompt-injection-llms/

Indirect Prompt Injection to LLMs

Fluid Attacks

We hack your software. Comprehensive Continuous Hacking: Develop secure software from the start.

Attackers can indirectly instruct AI for malicious aims

领英推荐

Fluid Attacks的更多文章

社区洞察

其他会员也浏览了

The Shadow Self of AI: Reflections on Emergent Misalignment

Center for AI Policy Proposes 2025 AI Action Plan

NxtG.ai | AI Unveiled: Hype vs. Reality – February 2025

Navigating the Complexities of AI: Insights from ASD’s ACSC Publication

The Future Of AI And ML In Cybersecurity

AI, Fake News and Pandora's Box

Awakening the AI Titan: A Personal Journey Through Opportunity and Uncertainty

AI's Capacity to Jeopardize Humanity Within a Span of Five to Ten Years

The Dark Sides of AI Technology: The Risks and Challenges Ahead

AI's Journey to Zero Emissions, LLM Jail Breaking - Does Refusal Training in LLMs Generalize to the Past Tense, Consolidation in Foundation Models....

Attackers can indirectly instruct AI for malicious aims

领英推荐

Fluid Attacks的更多文章

Ataques al sector del transporte

Attacks Against the Transportation Sector

What Your Risk Management's Missing

Lo que le falta a tu gestión de riesgos

De frágil a blindada

From Flaky to Bulletproof

Ciberseguridad en servicios financieros

Cybersecurity in Financial Services

Zero Trust Network Access

Zero Trust Network Access

社区洞察

其他会员也浏览了

The Shadow Self of AI: Reflections on Emergent Misalignment

Center for AI Policy Proposes 2025 AI Action Plan

NxtG.ai | AI Unveiled: Hype vs. Reality – February 2025

Navigating the Complexities of AI: Insights from ASD’s ACSC Publication

The Future Of AI And ML In Cybersecurity

AI, Fake News and Pandora's Box

Awakening the AI Titan: A Personal Journey Through Opportunity and Uncertainty

AI's Capacity to Jeopardize Humanity Within a Span of Five to Ten Years

The Dark Sides of AI Technology: The Risks and Challenges Ahead

AI's Journey to Zero Emissions, LLM Jail Breaking - Does Refusal Training in LLMs Generalize to the Past Tense, Consolidation in Foundation Models....