Alibaba's Qwen2.5-Max Enters Global Top 10
source: Qwen

On February 4, the latest rankings from Chatbot Arena, a globally recognized platform for evaluating large AI models, showed that Alibaba's Qwen2.5-Max had entered the global top ten for the first time, surpassing the recently popular DeepSeek-V3 and ranking ahead of top proprietary models such as o1-mini and Claude 3.5 Sonnet.

source: Chatbot Arena

Specifically, Qwen2.5-Max ranks first in mathematics and programming, and second in handling hard prompts. The official evaluation from Chatbot Arena praises Qwen2.5-Max for its strong performance across multiple domains, particularly in specialized technical areas like programming, mathematics, and hard prompts.

The latest version, Qwen2.5-Max, uses an advanced mixture-of-experts (MoE) architecture, with over 20 trillion tokens of pre-training data. It is optimized with supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF) techniques, excelling in knowledge, programming, general abilities, and human alignment.
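The mixture-of-experts idea mentioned above can be sketched in a few lines: a gating network scores a set of expert sub-networks for each token, and only the top-k experts are actually evaluated, which is what makes MoE models cheaper to run than dense models of the same total size. The toy below is purely illustrative and is not Qwen2.5-Max's actual implementation; every name, weight, and dimension is made up.

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def moe_forward(token, experts, gate_weights, top_k=2):
    """Toy mixture-of-experts step: score all experts, keep the top_k,
    and mix only those experts' outputs by renormalized gate weights."""
    # Gating score: dot product of the token with each expert's gate vector.
    scores = [sum(t * w for t, w in zip(token, gw)) for gw in gate_weights]
    probs = softmax(scores)
    # Sparse activation: only the top_k experts are evaluated at all.
    top = sorted(range(len(experts)), key=lambda i: probs[i], reverse=True)[:top_k]
    norm = sum(probs[i] for i in top)
    out = [0.0] * len(token)
    for i in top:
        y = experts[i](token)  # run the selected expert
        out = [o + (probs[i] / norm) * v for o, v in zip(out, y)]
    return out, top

# Four toy "experts": each just scales the token by a different factor.
experts = [lambda x, s=s: [s * v for v in x] for s in (0.5, 1.0, 1.5, 2.0)]
gate_weights = [[0.1, 0.2], [0.3, 0.1], [0.2, 0.4], [0.5, 0.3]]
out, chosen = moe_forward([1.0, 2.0], experts, gate_weights, top_k=2)
```

With top_k=2, only two of the four experts run for this token; the other two cost nothing, which is the core trade-off MoE architectures exploit at scale.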

source: Qwen

Whether for language models or multimodal models, Qwen is pre-trained on large-scale multilingual and multimodal data and fine-tuned with high-quality datasets to align more closely with human preferences. Qwen possesses a range of capabilities, including natural language understanding, text generation, visual understanding, audio processing, tool usage, role-playing, and interactive AI agent functions.

Key features of Qwen2.5 include:

- Easy-to-use decoder-based dense language models, available in 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B parameter sizes, with both base and instruction-fine-tuned variants (where "B" stands for billion, with 72B referring to 72 billion parameters).

- Pre-trained on the latest datasets, including up to 18 trillion tokens.

- Significant improvements in instruction following, long-text generation (over 8K tokens), structured data comprehension (e.g., tables), and the generation of structured outputs, especially JSON.

- Enhanced adaptability to diverse system prompts, improving role-playing and background settings for chatbots.

- Supports a context length of up to 128K tokens and generates up to 8K tokens of text.

- Supports over 29 languages, including Chinese, English, French, Spanish, Portuguese, German, Italian, Russian, Japanese, Korean, Vietnamese, Thai, Arabic, and more.
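The structured-output point in the list above can be illustrated with a minimal post-processing step an application might apply to a model reply: pull the JSON payload out of the (possibly fenced) text and validate it with a real parser before use. The reply string here is fabricated for illustration; this is not Qwen's API or output format.

```python
import json
import re

def extract_json(reply: str) -> dict:
    """Pull the first JSON object out of a model reply that may wrap it
    in a markdown code fence or surrounding prose, then parse it."""
    # Prefer an explicit ```json ... ``` fence if one is present.
    fenced = re.search(r"```(?:json)?\s*(\{.*?\})\s*```", reply, re.DOTALL)
    if fenced:
        candidate = fenced.group(1)
    else:
        # Fall back to the outermost braces in the raw text.
        start, end = reply.find("{"), reply.rfind("}")
        if start == -1 or end == -1:
            raise ValueError("no JSON object found in reply")
        candidate = reply[start : end + 1]
    return json.loads(candidate)  # raises if the payload is malformed

# Fabricated example reply, formatted the way an instruction-tuned
# model often wraps structured answers.
reply = """Here is the requested summary:
```json
{"model": "Qwen2.5-Max", "domains": ["math", "coding"]}
```"""
data = extract_json(reply)
```

Validating with `json.loads` rather than string matching is what makes a model's JSON mode actually usable downstream: malformed output fails loudly at the parse step instead of corrupting later logic.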

source: Qwen

In fact, over the past year the domestic large model industry in China has seen several waves of price cuts. Alibaba Cloud's Tongyi Qianwen visual understanding models, for instance, were reduced across the entire line by more than 80%, to as little as 0.0015 yuan per thousand tokens. ByteDance's Doubao visual understanding model charges just 3 cents per thousand tokens, 85% below prevailing industry prices. And Baidu has made the two major models in its Wenxin Yiyan line, ERNIE Speed and ERNIE Lite, free to users.

The rise of China's domestic models has made it clear that OpenAI is no longer the sole dominant force in the large model field; their technical capabilities can now rival, and even exceed, those of mainstream international models. As Chatbot Arena put it: "Chinese large models, represented by Qwen2.5-Max, are catching up fast." OpenAI CEO Sam Altman acknowledged the impact of China's AI rise after the launch of o3-mini, stating that it had weakened OpenAI's technological lead.
