Securing AI Agents: 4 Controls for Responsible Development
Security professionals and data scientists in particular need to secure highly capable AI systems. While there is much discussion about AGI, our focus, especially in the security industry, should be on Artificial Capable Intelligence (ACI). ACI is already here, has been for over a year, and its use is expected to grow exponentially.
In practical terms, this growth will be fueled by the introduction of Neural Processing Units (NPUs) to the market in the coming months. NPUs are chips whose performance is measured in teraflops and which enable fast model inference. They will facilitate the development and deployment of local AI models, introducing a new paradigm in technical risk controls.
We are already witnessing the development and deployment of AI Systems (like ChatGPT) and Agents (such as those built with automation tools like Zapier). These systems have varying levels of autonomy and operational scope, and it is crucial to maintain transparency into the actions AI Agents and Systems perform.
Clearly, this technology presents both opportunities and threats. The conversation around existential risk is well documented. The real challenge, however, lies not in low-probability, high-impact risks, but in whether AI Systems and Agents are developed and deployed responsibly.
People will delegate decisions to these systems, as already happens in contexts like SEO or market-tracking funds, and ask them to take actions such as surfacing relevant information or executing trades.
This isn't about the models themselves; there is plenty of ongoing work to ensure models do not exhibit bias or facilitate harmful actions. It's about the systems in which they operate: What context is being added to these models? How do we ensure that this context is secure? Attackers are almost certainly already eyeing these systems as an attack surface.
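To make the context problem concrete, here is a minimal sketch of treating retrieved context as untrusted input before it reaches a model. Everything here is an illustrative assumption, not a production defense: the function names (`looks_injected`, `build_prompt`) are hypothetical, and a naive deny-list like this would need to be paired with provenance checks or a trained classifier in practice.

```python
import re

# Naive deny-list of prompt-injection markers. Illustrative only; real systems
# would combine provenance tracking with a classifier, not string matching.
INJECTION_PATTERNS = [
    r"ignore (all )?previous instructions",
    r"disregard the system prompt",
    r"you are now",
]

def looks_injected(chunk: str) -> bool:
    """Flag retrieved text that appears to carry instructions aimed at the model."""
    lowered = chunk.lower()
    return any(re.search(p, lowered) for p in INJECTION_PATTERNS)

def build_prompt(user_query: str, retrieved_chunks: list[str]) -> str:
    """Treat retrieved context as untrusted: filter it, then clearly delimit it."""
    safe_chunks = [c for c in retrieved_chunks if not looks_injected(c)]
    context = "\n---\n".join(safe_chunks)
    return (
        "Answer using ONLY the context between the markers.\n"
        f"<context>\n{context}\n</context>\n"
        f"Question: {user_query}"
    )
```

The design point is that context crossing into the prompt is a trust boundary, and it deserves the same input validation we already apply to user-supplied data elsewhere.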
Security professionals need to consider new control areas for ACI (one way to encode these as an enforceable policy is sketched after the list):
- Task Specificity: ACIs' roles should be clearly defined, whether intended for specific tasks or broader general capabilities. The risk profile for a general-purpose agent differs significantly from that of a narrow agent.
- Operational Scope: The proactive and reactive capabilities of ACIs must be delineated to understand their potential impact and the security measures necessary to govern their actions. General-purpose proactive agents are likely to carry greater risk, as they are both unbounded and highly capable.
- Decision Autonomy: The level of human oversight required for ACIs must be carefully considered to balance efficiency with the need for control and accountability.
- Transparency and Accountability: Clear mechanisms for transparency in ACIs' decision-making processes and accountability for their actions are essential to ensure trust and ethical alignment with societal values.
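Here is a minimal sketch of encoding these four controls as an explicit, auditable policy, assuming a policy object gates each proposed agent action. All names here (`AgentPolicy`, `authorize`, the task and action strings) are hypothetical illustrations, not an established API.

```python
from dataclasses import dataclass
from enum import Enum

class Autonomy(Enum):
    HUMAN_APPROVES_EACH_ACTION = "human_approves_each_action"
    HUMAN_REVIEWS_AFTER = "human_reviews_after"
    FULLY_AUTONOMOUS = "fully_autonomous"

@dataclass(frozen=True)
class AgentPolicy:
    # Task specificity: an explicit allow-list of tasks, not open-ended capability.
    allowed_tasks: frozenset[str]
    # Operational scope: which tools or actions the agent may invoke.
    allowed_actions: frozenset[str]
    # Decision autonomy: how much human oversight gates each action.
    autonomy: Autonomy
    # Transparency and accountability: whether every action is audit-logged.
    audit_all_actions: bool = True

def authorize(policy: AgentPolicy, task: str, action: str) -> str:
    """Return 'deny', 'needs_approval', or 'allow' for a proposed agent action."""
    if task not in policy.allowed_tasks or action not in policy.allowed_actions:
        return "deny"
    if policy.autonomy is Autonomy.HUMAN_APPROVES_EACH_ACTION:
        return "needs_approval"
    return "allow"

# Example: a narrow, reactive assistant with post-hoc human review.
policy = AgentPolicy(
    allowed_tasks=frozenset({"summarize_portfolio"}),
    allowed_actions=frozenset({"read_market_data"}),
    autonomy=Autonomy.HUMAN_REVIEWS_AFTER,
)
assert authorize(policy, "summarize_portfolio", "read_market_data") == "allow"
assert authorize(policy, "summarize_portfolio", "execute_trade") == "deny"
```

The value of a structure like this is that task specificity, scope, and autonomy become explicit data you can review and audit, rather than implicit behavior buried in prompts.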
The security community needs to be threat modeling AI Systems. What happens if a memory component is compromised? An attacker could insert memories into, or delete memories from, someone's personal assistant. How do we detect that? How do we secure a memory component? We have existing methods for hardening these components, but do we need more?
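As one example of applying an existing hardening method to a memory component, here is a minimal sketch that tags each memory record with an HMAC so inserted or altered memories fail verification. The key handling and record schema are assumptions (in practice the key would live in a KMS or TPM), and note the limitation: per-record tags alone do not detect deletions, which would require something like a signed index or Merkle tree over the whole store.

```python
import hashlib
import hmac
import json
import time

# Assumption: the signing key is held where the agent runtime can verify but
# not forge tags (e.g., a secrets manager); hard-coded here for illustration.
MEMORY_KEY = b"replace-with-a-key-from-your-secrets-manager"

def tag_memory(record: dict) -> dict:
    """Attach an integrity tag when a memory record is written."""
    payload = json.dumps(record, sort_keys=True).encode("utf-8")
    tagged = dict(record)
    tagged["_tag"] = hmac.new(MEMORY_KEY, payload, hashlib.sha256).hexdigest()
    return tagged

def verify_memory(record: dict) -> bool:
    """Detect inserted or altered memories by recomputing and comparing the tag."""
    tag = record.get("_tag")
    if tag is None:
        return False
    body = {k: v for k, v in record.items() if k != "_tag"}
    payload = json.dumps(body, sort_keys=True).encode("utf-8")
    expected = hmac.new(MEMORY_KEY, payload, hashlib.sha256).hexdigest()
    return hmac.compare_digest(expected, tag)

stored = tag_memory({"ts": time.time(), "text": "User prefers morning meetings"})
assert verify_memory(stored)
stored["text"] = "Wire funds to account X"   # simulated memory tampering
assert not verify_memory(stored)
```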
Thanks for reading,
Matt
#ContextIsAllYouNeed #ResponsibleAI #SecureByDesign