5 Globally Accepted Benchmarks to Assess LLMs on Safety
Tejash Mehta
Customer Success Leader. [Opinions or views expressed here are solely my personal opinions.]
AI Testing in Disarray: No Easy Way to Choose an Ethical Model
There's a growing problem in the world of AI development: a lack of agreement on how to test whether AI models are behaving responsibly. That's according to the latest AI Index from Stanford's Institute for Human-Centered Artificial Intelligence (HAI), released earlier this week.
The big concern? Businesses and everyday users are left in the dark. With no clear way to compare AI models, how can they choose one that aligns with their needs and values?
"There's a huge difference in how AI models behave depending on what they're designed for," explains Nestor Maslej, editor of the 2024 AI Index. "The challenge is, there just aren't any simple tools for comparing them, and it doesn't seem like a solution is coming anytime soon."
Take benchmark tests for responsible AI. TruthfulQA, one of the most common, is used by only a handful of leading developers: OpenAI, Meta, and Anthropic all put their models through it, but Google and Mistral haven't used it on their latest creations.
The level of enthusiasm for responsibility testing also varies wildly. Meta stands out for putting its Llama 2 model through three different tests, while Mistral hasn't used any of the five evaluated by Stanford.
Here's the rub: current benchmarks tend to be very narrow in scope. TruthfulQA probes whether a model repeats common misconceptions absorbed from its training data. Others, like RealToxicityPrompts and ToxiGen, focus on how likely a model is to generate hateful or toxic content.
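To make concrete what "running a model through a benchmark" actually involves, here's a minimal sketch of a TruthfulQA-style spot check in Python. It assumes the Hugging Face `datasets` library; `generate()` is a hypothetical placeholder for whatever model you're evaluating, and the substring match is a crude proxy rather than the benchmark's official scoring method:

```python
# Minimal sketch of a TruthfulQA-style spot check.
# `generate()` is a hypothetical stand-in for the model under test,
# and the substring scoring below is a crude proxy, not the official metric.
from datasets import load_dataset

def generate(prompt: str) -> str:
    """Hypothetical model call; replace with your LLM's API."""
    return "I have no comment."

# The "generation" config pairs each question with reference
# correct and incorrect answers; TruthfulQA ships a single validation split.
ds = load_dataset("truthful_qa", "generation", split="validation")

sample = ds.select(range(20))  # small sample for illustration
hits = 0
for row in sample:
    answer = generate(row["question"]).lower()
    # Count a hit if the response echoes any reference correct answer.
    if any(ref.lower() in answer for ref in row["correct_answers"]):
        hits += 1

print(f"Crude truthfulness proxy: {hits}/{len(sample)} questions")
```

The published benchmark scores free-form answers far more carefully (for example, with trained judge models), and each toxicity benchmark has its own scoring pipeline, which is precisely why results reported by different developers are so hard to compare.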
"There's definitely a lack of standardization," says Maslej. "What's causing it is unclear, but some developers might be cherry-picking tests that make their models look better. Or maybe they're making it harder for users to see the limitations."
This lack of standardized testing tools has actually given rise to a new organization – the Responsible AI Institute, backed by major companies. They've developed their own set of benchmarking tools to address the gap.
The bigger picture? AI developers and academics are locked in a heated debate. Which AI risks are the most pressing? Is it the immediate bias creeping into model outputs, or the potential "existential threats" posed by highly advanced AI systems in the future?
The Stanford AI Index also sheds light on regional trends. The US leads in building significant AI models (61) compared to the EU (21) and China (15). However, China dominates in AI patents (61%), while the US holds the crown for private investment ($67.2 billion). Interestingly, in 2023, industry pumped out 108 new foundation models, compared to just 28 from academia.