How does OpenAI's Preparedness Framework compare to "Responsible Scaling Policies"?

Two of the companies building the most widely-used Large Language Models (LLMs) have released frameworks for how they would self-evaluate risks in their models. These frameworks are Responsible Scaling Policies (RSPs) and the Preparedness Framework (PF), from Anthropic and OpenAI, respectively. We’ve previously discussed Anthropic’s RSPs. How does OpenAI's compare?

Assuming OpenAI keeps its commitments, the Preparedness Framework (PF) offers significant improvements over Anthropic’s RSPs:

  1. It runs safety tests twice as frequently to check for advances in dangerous capabilities.
  2. It adds “safety drills” to stress-test the robustness of the company’s culture in emergencies, and a dedicated team to oversee technical work.
  3. It provides the board with the ability to overturn the CEO’s decisions.
  4. It brings key components of risk assessment, such as risk identification and risk analysis, into the scope of the framework.
  5. The PF aims to forecast risks, not just respond to them, which means OpenAI can avoid training a dangerous model in the first place rather than evaluating for danger only after training.

However, the PF lacks important safety components that were present in the RSPs:

  1. There is no commitment to publish the results of evaluations.
  2. There is no incident-reporting mechanism, which is key to creating a feedback loop on the effectiveness of safety practices.
  3. The commitments on information security and cybersecurity are less detailed and possibly weaker, which increases the risk that dangerous models are stolen by hackers.

Both frameworks lack the following:

  1. We’d like to see the frameworks not evaluate each category of risk separately, but consider how risks interact to increase the overall severity and likelihood of an event.
  2. A process to set risk levels that considers the risk appetite of the public, not just the risk appetite of the company.

Some elements that could be improved in the Preparedness Framework:

  1. We believe that the Safety Advisory Group would be substantially more relevant if some of its members were external to the company. This is particularly important without stronger disclosure commitments.
  2. Measuring safety culture using processes established in other fields like nuclear safety would enable OpenAI to iteratively improve.
  3. Predicting the magnitude and likelihood of key risks: a method to aggregate the opinions of risk experts would enable the company to estimate the magnitude and likelihood of risks, providing interpretable indicators that allow society to deliberately decide what level of risk it accepts (Koessler et al., 2023).
