DeciLM-7B: The Fastest and Most Accurate 7 Billion-Parameter LLM to Date

In an era where language models are becoming integral to how we interact with technology, Deci is excited to unveil DeciLM-7B. Licensed under Apache 2.0, DeciLM-7B is the fastest and most accurate 7-billion-parameter base LLM available today, redefining the benchmarks for speed and accuracy.

DeciLM-7B at a Glance

  • Unmatched Accuracy

Achieving an average score of 61.55 on the Open LLM Leaderboard, DeciLM-7B outshines its competitors in the 7-billion parameter class, including the previous frontrunner, Mistral 7B. This accuracy improvement can potentially lead to more reliable and precise responses in various applications, from customer service bots to complex data analysis.

  • Unparalleled Speed

In a head-to-head PyTorch benchmark on sequences of 2048 input and 2048 output tokens, DeciLM-7B delivers 1.83x the throughput of Mistral 7B and 2.39x the throughput of Llama 2 7B.

DeciLM-7B's performance can be accelerated further by pairing it with Infery-LLM, the world's fastest inference engine, designed to deliver high-throughput, low-latency, cost-effective inference on widely available GPUs. Together, the two set a new standard in throughput, achieving speeds 4.4 times greater than Mistral 7B with vLLM.

This pairing matters most for sectors that must serve many customers concurrently, such as telecommunications, online retail, and cloud services, where the ability to respond to a massive influx of customer inquiries in real time can significantly enhance user experience and operational efficiency.

  • Innovative Architecture

Developed with the assistance of our Neural Architecture Search-powered engine, AutoNAC, DeciLM-7B employs Variable Grouped Query Attention, which varies the number of key-value heads across transformer layers to strike an optimal balance between accuracy and speed.
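The core mechanism can be sketched as follows: each key-value head is shared by a group of query heads, and in the "variable" setting different layers can use different KV head counts. This is a minimal numpy illustration only; the head counts and dimensions below are made up for the example and are not DeciLM-7B's actual per-layer configuration.

```python
import numpy as np

def grouped_query_attention(q, k, v):
    """Grouped query attention for one layer.

    q: (n_q_heads, seq, d); k, v: (n_kv_heads, seq, d),
    with n_q_heads divisible by n_kv_heads.
    """
    n_q_heads, seq, d = q.shape
    n_kv_heads = k.shape[0]
    group = n_q_heads // n_kv_heads
    # Each KV head serves a group of query heads: repeat it across its group.
    k = np.repeat(k, group, axis=0)
    v = np.repeat(v, group, axis=0)
    scores = q @ k.transpose(0, 2, 1) / np.sqrt(d)
    # Numerically stable softmax over the key dimension.
    w = np.exp(scores - scores.max(axis=-1, keepdims=True))
    w /= w.sum(axis=-1, keepdims=True)
    return w @ v

rng = np.random.default_rng(0)
n_q, seq, d = 8, 4, 16
# "Variable" GQA: different layers may use different KV head counts
# (1 KV head = multi-query attention; 8 = full multi-head attention).
outputs = []
for n_kv in (1, 2, 4):
    q = rng.normal(size=(n_q, seq, d))
    k = rng.normal(size=(n_kv, seq, d))
    v = rng.normal(size=(n_kv, seq, d))
    outputs.append(grouped_query_attention(q, k, v))
```

Fewer KV heads shrink the KV cache and speed up decoding at some cost in accuracy; varying the group size per layer lets the architecture spend that budget where it helps most.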

  • Instruction-Tuned Variant

DeciLM-7B was instruction-tuned using LoRA on the SlimOrca dataset. The resulting model, DeciLM-7B-instruct, achieves an average score of 63.19 on the Open LLM Leaderboard.
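For readers unfamiliar with LoRA: instead of updating a full weight matrix during fine-tuning, it learns a small low-rank correction on top of the frozen pretrained weights. The numpy sketch below shows only the standard LoRA formulation, W x + (alpha/r) B A x with B zero-initialized; the dimensions and scaling are illustrative, not Deci's actual training setup.

```python
import numpy as np

d_out, d_in, r, alpha = 64, 64, 8, 16

rng = np.random.default_rng(0)
W = rng.normal(size=(d_out, d_in))     # frozen pretrained weight
A = rng.normal(size=(r, d_in)) * 0.01  # trainable low-rank factor
B = np.zeros((d_out, r))               # zero-init: no change before training

def lora_forward(x):
    # Base projection plus the scaled low-rank update.
    return W @ x + (alpha / r) * (B @ (A @ x))

x = rng.normal(size=(d_in,))
base = W @ x
out = lora_forward(x)

# Only A and B are trained: (d_out + d_in) * r parameters
# instead of d_out * d_in for a full update.
trainable = (d_out + d_in) * r
full = d_out * d_in
```

Because B starts at zero, the tuned model is exactly the base model at step 0, and the trainable parameter count here is 1024 versus 4096 for a full update, which is what makes LoRA fine-tuning cheap.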

Businesses can leverage DeciLM-7B's remarkable combination of efficiency and accuracy to create more effective, user-friendly AI tools at a lower cost, driving innovation across sectors. From enhancing high-volume customer service with real-time chatbots and personalized recommendations to facilitating workflow automation for text-heavy professional domains, DeciLM-7B paves the way for smarter, more responsive, cost-effective, and scalable AI solutions.

Explore DeciLM-7B Now

Join us as we delve deeper into the capabilities and potential of DeciLM-7B and its instruction-tuned variant, DeciLM-7B-instruct.

Interested in Infery-LLM’s capabilities and how our advanced SDK for LLM optimization can improve the performance of your LLMs? Talk with our experts!
