登录查看更多内容

A Step Ahead in AI performance and efficiency

Subrat Panda, PhD

CTPO @ AGNEXT | AI Expert | PhD in Computer Science | IIT KGP | Ex - Capillary |Quality Food for Billions

发布日期: 2025年3月17日

Google's new open-weight model, Gemma 3, is a big step forward in AI, offering efficiency, flexibility, and competitive performance. As an AI professional, I view this launch as a strategic step in the ongoing competition for smaller but efficient models that can run with minimal computational power.

Major improvements in Gemma 3

In contrast to its predecessor, Gemma 2, this latest version provides parameter sizes of 1B, 4B, 12B, and 27B, which allows it to be versatile for a wide range of applications.

Its most notable feature is an increased 128K token context window, which enables it to process and hold more information in a single session. This makes it especially efficient in handling long content, coding work, and sophisticated reasoning.

Moreover, Gemma 3 is multimodal, with the capability to analyze text, images, and short videos. It natively supports 35 languages with pre-trained compatibility support for 140 languages, further enhancing its universal usability.

Performance in Benchmarks

In Chatbot Arena, where AI models are evaluated side-by-side by human testers, Gemma 3 (27B) outperformed OpenAI’s o3-mini, DeepSeek-V3, and Meta’s Llama 3-405B. It also delivered strong results in standardized AI benchmarks:

MMLU-Pro (67.5%) and GPQA Diamond (42.4%), surpassing Claude 3.5 Haiku (63% and 41%) and closely competing with GPT-4o Mini (65% and 43%).

Meta's Llama 3 70B is still the top performer (71% MMLU-Pro, 50% GPQA Diamond).

Efficiency

The most thrilling discovery may be Gemma 3's efficiency. It attained these levels of performance on one NVIDIA H100 GPU, while other models needed a maximum of 32 GPUs. Google has optimized the use of KV-cache memory, making it more efficient when processing longer contexts.

Access and availability

Gemma 3 is accessible through Google AI Studio, the GenAI SDK, and deployed locally by Hugging Face, Ollama, and Kaggle. Google also introduced ShieldGemma 2, a 4B parameter image safety model to identify unsafe content.

With Gemma 3, Google is propelling AI towards a future where power is matched with efficiency—raising the bar in the industry.

要查看或添加评论，请登录

Subrat Panda, PhD的更多文章

NSDC opens Centre for Future Skills to Enable Youth for New-Age Technologies

2025年3月10日

NSDC opens Centre for Future Skills to Enable Youth for New-Age Technologies

A major step towards enabling India's youth for the future workforce, the National Skill Development Corporation (NSDC)…
The dawn of a more thoughtful AI

2025年3月3日

The dawn of a more thoughtful AI

The world of artificial intelligence is changing at a speed never seen before, and OpenAI has just revolutionized the…

1 条评论
How Meltem ?akmak is redefining AI-Driven brand growth?

2025年2月20日

How Meltem ?akmak is redefining AI-Driven brand growth?

When it comes to the world of brand marketing and entrepreneurship, Meltem ?akmak stands as a visionary leader…
India leads AI Governance at Paris summit 2025

2025年2月14日

India leads AI Governance at Paris summit 2025

At the Paris AI Action Summit 2025, Prime Minister Narendra Modi’s call for open and unbiased datasets set the stage…

2 条评论
How Netflix uses LLMs to redefine content discovery?

2025年2月11日

How Netflix uses LLMs to redefine content discovery?

Netflix has become synonymous with entertainment, offering millions of hours of content to its global user base…
How Morgan Stanley Uses Large Language Models (LLMs) in Finance?

2025年2月1日

How Morgan Stanley Uses Large Language Models (LLMs) in Finance?

Morgan Stanley is leading the way in using advanced AI technology, especially Large Language Models (LLMs), to improve…

1 条评论
Databricks’ DBRX: The New Open-Source AI Powerhouse

2025年1月30日

Databricks’ DBRX: The New Open-Source AI Powerhouse

Databricks made a significant move in the AI space with the launch of DBRX, a powerful open-source language model that…
How IBM Watson is changing industries with AI ?

2025年1月27日

How IBM Watson is changing industries with AI ?

IBM Watson is one of the most powerful AI tools helping industries like healthcare, finance, and customer service work…
Stargate AI: The Future of AI Infrastructure

2025年1月22日

Stargate AI: The Future of AI Infrastructure

In a historic announcement at the White House, OpenAI CEO Sam Altman, SoftBank CEO Masayoshi Son, and Oracle Chairman…

2 条评论
OpenAI’s O3: A Big Step Towards Advanced AI

2025年1月16日

OpenAI’s O3: A Big Step Towards Advanced AI

OpenAI recently made headlines with its new chatbot model, o3, which scored an impressive 87.5% on the ARC-AGI test.

See all articles

Major improvements in Gemma 3

Performance in Benchmarks

Efficiency

Access and availability

Subrat Panda, PhD的更多文章

NSDC opens Centre for Future Skills to Enable Youth for New-Age Technologies

The dawn of a more thoughtful AI

How Meltem ?akmak is redefining AI-Driven brand growth?

India leads AI Governance at Paris summit 2025

How Netflix uses LLMs to redefine content discovery?

How Morgan Stanley Uses Large Language Models (LLMs) in Finance?

Databricks’ DBRX: The New Open-Source AI Powerhouse

How IBM Watson is changing industries with AI ?

Stargate AI: The Future of AI Infrastructure

OpenAI’s O3: A Big Step Towards Advanced AI

社区洞察