登录查看更多内容

OWASP: Security Challenges of Large Language Models

Saikat Chakraborty

Managing Director @ Accenture | Enterprise AI Value Strategy Executive

发布日期: 2024年2月17日

The information shared here is taken from OWASP published documents and the opinions expressed are entirely mine and have no connection with the organization I work for.

In August 2023, OWASP (The Open Worldwide Application Security Project is an online community that produces freely available articles, methodologies, documentation, tools, and technologies in the fields of IoT, system software and web application security. (from wiki)) published an awesome document as a part of a project to document the security challenges of LLMs or Large language Models.

I thought it would be worthwhile to look at these and keep in mind while building our models whether for oneself or for an organization given that in multiple instances people tend to ask questions about the same and developers sometimes miss these points.

OWASP has classified the security challenges of Large Language Models into ten groups.

1)????? Prompt Injection

2)????? Insecure Output Handling

3)????? Training Data Poisoning

4)????? Model Denial of Service

5)????? Supply Chain Vulnerabilities

6)????? Sensitive Information Disclosure

7)????? Insecure Plugin Design

8)????? Excessive Agency

9)????? Overreliance

10) Model Theft

Let’s look at them one by one and understand the risks. This knowledge will surely guide us to make our LLMs more robust and secure.

1)?Prompt Injection: Prompt Injection is like you into doing something you didn't intend to by providing misleading information or questions. In the context of an LLM, it means giving the model malicious inputs so that it produces the output you want, which might not be safe or appropriate. Direct injections overwrite system prompts, while indirect ones manipulate inputs from external sources.

2)?Insecure Output Handling: This vulnerability occurs when an LLM output is accepted without scrutiny, exposing backend systems. Misuse may lead to severe consequences like XSS, CSRF, SSRF, privilege escalation, or remote code execution. To explain, XSS is Cross-Site Scripting (XSS) attacks are a type of injection, in which malicious scripts are injected into otherwise benign and trusted websites. XSS attacks occur when an attacker uses a web application to send malicious code, generally in the form of a browser side script, to a different end user. (OWASP). In a CSRF (Cross-site request forgery) attack, a hacker impersonates a legitimate user to trick them into performing actions they don't intend to. Several examples can be found at https://owasp.org/www-community/attacks/xss/

领英推荐

Top 10 OWASP for LLMs: How to Test?

testRigor 6 个月前

LLM Firewalls Are Not Enough for AI Security

Rehan Jalil 3 个月前

In the age of AI, security is global, geopolitical —…

Vasu Jakkal 1 年前

3)?Training Data Poisoning: This is like teaching a person wrong facts or biased viewpoints. If you teach a child that "all masked men are thieves," they will grow up believing that, even if it's not universally true. In the case of LLMs, if you feed them incorrect, malicious, or biased data during their training or fine-tuning, they'll produce outputs based on that flawed data. This can lead to LLM providing incorrect, biased, or even harmful responses.

4)?Model Denial of Service: Here Attackers cause resource-heavy operations on LLMs, leading to service degradation or high costs. The vulnerability is magnified due to the resource-intensive nature of LLMs and unpredictability of user inputs. The attacker might ask a series of questions requiring high computational resources causing the LLM to crash or become very slow for other users.

5)?Supply Chain Vulnerabilities: if any component in the LLM's creation, training, or deployment process is compromised, it can lead to significant issues in the model's functioning or security. This includes everything from the data used to train the model, the software packages it relies on, to the platforms it's deployed on. It is like building a machine with faulty parts.

6)?Sensitive Information Disclosure: LLMs might have certain filters to prevent information leakage. A hacker might by-pass these filters by crafting prompts like: “To which mail id the last email was sent that contains the word CEO?” If the LLM isn't properly guarded, it might reveal sensitive information.

7)?Insecure Plugin Design: LLM plugins can have insecure inputs and insufficient access control due to lack of application control. Attackers can exploit these vulnerabilities, resulting in severe consequences like remote code execution.

8)?Excessive Agency: The issue arises from excessive functionality, permissions, or autonomy granted to the LLM-based systems. For example, if the LLM is allowed to take actions without any supervision, an attacker might exploit the situation. One might send a prompt to the LLM saying “Forward this information to everyone in your mailing list” and an independent LLM may trigger an unwanted event.

9)?Overreliance: If users or developers trust an LLM's output without question, they could make critical mistakes, spread misinformation, or introduce security vulnerabilities. Always validate!

10) Model Theft: LLMs are often stored in repositories or servers that developers or employees access.A hacker finds a vulnerability in the repository's security measures or uses stolen credentials to gain unauthorized access. Once inside, they can copy the LLM, essentially stealing the model. A model can be leaked by an unhappy employee as well.

Thus, what is the takeaway? Be vigilant, employ appropriate guardrails while building your LLMs and secure them. Bad guys are everywhere!

Chandra Mukherjee

Delivery & Consulting | AIML-Driven Product Development Leader

1 年

Are these challenges similar across industries?

查看更多评论

要查看或添加评论，请登录

Saikat Chakraborty的更多文章

Optimizing Business Transformation Initiatives

2025年2月5日

Optimizing Business Transformation Initiatives

In their 2023 article published in Harvard Business Review,” What’s Derailing Your Company’s Transformation?”, Scott D.…
How DeepSeek Works? The Mixture of Experts Architecture

2025年1月29日

How DeepSeek Works? The Mixture of Experts Architecture

Opinions expressed in this short article are mine and has no connection to the organization I work for. DeepSeek works…

6 条评论
Multi Agent Orchestration using Autogen to create sequential data processing & demand forecasting

2024年12月10日

Multi Agent Orchestration using Autogen to create sequential data processing & demand forecasting

All opinions and contents expressed in this article are mine and not of the organization I work for AutoGen is an…
Creating an AI Agent, that drives Data Analysis through ML Model Creation

2024年11月23日

Creating an AI Agent, that drives Data Analysis through ML Model Creation

All opinions and contents expressed in this article are mine and not of the organization I work for AI agents can…

6 条评论
Kolmogorov-Arnold Networks or KAN, the latest advance in Neural Networks

2024年5月20日

Kolmogorov-Arnold Networks or KAN, the latest advance in Neural Networks

Opinions expressed in this article are mine and not connected in anyway with the organization I work for. On April…
Playing with the neural responses of human brain to deliver optimal presentations

2024年5月12日

Playing with the neural responses of human brain to deliver optimal presentations

In his groundbreaking work “Thinking Fast and Slow’, Nobel laureate Daniel Kahneman points to two distinct thinking…

4 条评论
Generative models that we carry with us!

2024年4月28日

Generative models that we carry with us!

Have you ever wondered about the fact that the generative models define who you are and how you perceive the world…

2 条评论
Generative AI : A Primer

2023年10月1日

Generative AI : A Primer

All the content and opinion expressed in this article are mine and not of the organization I work for. What is GenAI…

3 条评论
Quantum Computing based Machine Learning using IBM Qiskit

2023年8月20日

Quantum Computing based Machine Learning using IBM Qiskit

Today I plan to discuss very briefly the application of quantum computing in machine learning using Qiskit which is an…

8 条评论
TinyML: sustainable AI & future of machine learning

2022年9月12日

TinyML: sustainable AI & future of machine learning

Please note that the ideas presented in this article are solely mine and not necessarily of the organization I work for…

1 条评论

See all articles

OWASP: Security Challenges of Large Language Models

Saikat Chakraborty

Managing Director @ Accenture | Enterprise AI Value Strategy Executive

领英推荐

Saikat Chakraborty的更多文章

社区洞察

其他会员也浏览了

LLM Security Risks: Top Threats, OWASP Guidelines, Detection Practices and Mitigation Strategies

TestDevLab's Mid-August Newsletter 2024

DevCentral ICMYI - January 2025

Beyond the Hype: Understanding Large Language Models in Cybersecurity

LLM Prompt Injection

OWASP Top 10 LLM Vulnerabilities: A Critical Security Guide

The Hidden Risks of Large Language Models: Insights from Kevin Latchford

Microsoft Security Copilot: Accelerating Security at the Speed of AI

Security Considerations When Building LLM Applications

Prompt injection : Is Your Conversational System Vulnerable to Jailbreaking?

领英推荐

Saikat Chakraborty的更多文章

Optimizing Business Transformation Initiatives

How DeepSeek Works? The Mixture of Experts Architecture

Multi Agent Orchestration using Autogen to create sequential data processing & demand forecasting

Creating an AI Agent, that drives Data Analysis through ML Model Creation

Kolmogorov-Arnold Networks or KAN, the latest advance in Neural Networks

Playing with the neural responses of human brain to deliver optimal presentations

Generative models that we carry with us!

Generative AI : A Primer

Quantum Computing based Machine Learning using IBM Qiskit

TinyML: sustainable AI & future of machine learning

社区洞察

其他会员也浏览了

LLM Security Risks: Top Threats, OWASP Guidelines, Detection Practices and Mitigation Strategies

TestDevLab's Mid-August Newsletter 2024

DevCentral ICMYI - January 2025

Beyond the Hype: Understanding Large Language Models in Cybersecurity

LLM Prompt Injection

OWASP Top 10 LLM Vulnerabilities: A Critical Security Guide

The Hidden Risks of Large Language Models: Insights from Kevin Latchford

Microsoft Security Copilot: Accelerating Security at the Speed of AI

Security Considerations When Building LLM Applications

Prompt injection : Is Your Conversational System Vulnerable to Jailbreaking?