登录查看更多内容

Move Over Chain of Thought | The Rise of Chain of Draft in AI Reasoning

Shyamal Indika

Senior Software Engineer | AI Generalist | Technologist

发布日期: 2025年3月1日

The AI world is always evolving, and one of the biggest game changers in recent years has been Chain of Thought (CoT) prompting. This structured reasoning approach has propelled AI models to new heights by allowing them to break down complex problems step by step. However, while CoT has proven effective, it comes with significant drawbacks mainly, its verbosity and high computational costs. But now, there’s a more efficient alternative: Chain of Draft (CoD).

The Problem with Chain of Thought

CoT enables AI models to think step by step, mimicking human structured reasoning. This has led to breakthroughs in reasoning based tasks, but it also requires models to generate an extensive number of tokens. The result? Higher latency and increased computational costs.

Thinking models like DeepSeek 2, Gemini, and others that rely on CoT often scale their computations significantly at inference time. While CoT provides thorough explanations, it doesn’t necessarily reflect how humans approach problem solving. In reality, we often rely on concise drafts or shorthand notes to capture essential insights without unnecessary elaboration.

Introducing Chain of Draft: A More Efficient Approach

Researchers at Zoom Communications (yes, the video conferencing company) have proposed a new strategy Chain of Draft. This novel prompting technique achieves results comparable to or even better than CoT while being significantly more efficient in both cost and speed.

Rather than generating verbose intermediate steps like CoT, Chain of Draft encourages language models to create concise, dense information outputs at each step. This allows the model to maintain transparency in its reasoning process without the excessive overhead of traditional CoT reasoning.

How Chain of Draft Works

To illustrate, consider this simple math problem: Question: Kumar had 20 lollipops. He gave Namal some lollipops. Now Kumarhas 12 lollipops. How many lollipops did Kumar give to Namal?

Standard Model Response:

Outputs only the final answer: 8
No explanation of how the model arrived at the answer.

Chain of Thought Response:

Kumar starts with 20 lollipops.
After giving some to Namal, he has 12 left.
Setting up a subtraction equation: 20 - X = 12.
Solving for X: X = 8.
Final Answer: 8

While CoT ensures transparency, it includes a lot of unnecessary steps for a simple problem.

Chain of Draft Response:

Concise reasoning: 20 - X = 12 → X = 8
Final Answer: 8

With CoD, we get the essential reasoning needed without excessive elaboration, reducing token usage and processing time.

Performance Comparison: CoT vs. CoD

The researchers tested CoD on multiple benchmarks and compared it to standard prompting and CoT.

Implementing Chain of Draft: Simpler Than You Think

One of the most exciting aspects of CoD is that it doesn’t require any model fine tuning, reinforcement learning, or architectural changes. It is purely a prompting strategy, meaning it can be implemented immediately by modifying the system message.

Example Prompts:

Standard Prompt: "Answer the question directly. Do not return any preamble, explanation, or reasoning."
Chain of Thought: "Think step by step to answer the following question. Return the answer at the end of the response after the separator '####'."
Chain of Draft: "Think step by step but only keep a minimum draft for each thinking step, with five words at most. Return the answer at the end of the response after the separator '####'."

The Future of AI Reasoning

The introduction of Chain of Draft shows that small changes in how we guide AI models can lead to massive efficiency improvements. While CoT revolutionized AI reasoning, CoD refines it further delivering nearly the same accuracy at a fraction of the cost and time.

In a world where AI models are increasingly integrated into real time applications, reducing latency and computational expenses is crucial. Chain of Draft provides a powerful yet simple solution to this problem, proving that sometimes, less is more.

What do you think about Chain of Draft? Could this be the future of AI reasoning?

要查看或添加评论，请登录

Shyamal Indika的更多文章

Overcoming RAG’s Limitations with Agentic RAG

2025年3月1日

Overcoming RAG’s Limitations with Agentic RAG

Retrieval Augmented Generation (RAG) is popular for making AI agents smarter using knowledge bases. However…
Microsoft’s Quantum Breakthrough: A New State of Matter for the Future of Computing

2025年2月23日

Microsoft’s Quantum Breakthrough: A New State of Matter for the Future of Computing

Microsoft has achieved a major milestone in quantum computing by creating a new state of matter known as topological…
Evaluating "Docling" for Production Use: A Comprehensive Analysis

2025年2月22日

Evaluating "Docling" for Production Use: A Comprehensive Analysis

Docling, an open source document processing library, has emerged as a powerful tool for converting PDFs and other…
Grok 3 is Here: Elon Musk's AI Breakthrough

2025年2月19日

Grok 3 is Here: Elon Musk's AI Breakthrough

Elon Musk and the xAI team delivered on their promise Grok 3 is officially here. Announced at 8:00 PM last night, this…

1 条评论
How MOSIP Can Support Sri Lanka’s Digital Transformation

2025年2月18日

How MOSIP Can Support Sri Lanka’s Digital Transformation

In today’s rapidly evolving digital landscape, a secure and efficient digital identity system is crucial for…
How to Set Up Supabase for Local AI Agents: A Step-by-Step Guide

2025年2月17日

How to Set Up Supabase for Local AI Agents: A Step-by-Step Guide

Introduction Supabase has quickly become one of the most popular database solutions for AI applications. Built on…
The Aadhaar System: Lessons for Sri Lanka’s Digital Transformation

2025年2月17日

The Aadhaar System: Lessons for Sri Lanka’s Digital Transformation

In today’s digital era, seamless identification systems are fundamental for governance, economic efficiency, and public…

6 条评论
Decoding the LangChain Ecosystem: LangChain, LangGraph, LangFlow, and LangSmith

2025年2月15日

Decoding the LangChain Ecosystem: LangChain, LangGraph, LangFlow, and LangSmith

Building powerful AI applications with Large Language Models (LLMs) like GPT4 and Llama 3 is exciting but often…
Beyond Words: Is Latent Reasoning the Key to True AI?

2025年2月14日

Beyond Words: Is Latent Reasoning the Key to True AI?

Large language models (LLMs) have taken the world by storm, demonstrating impressive feats of text generation and…
The AI Revolution: Is Superintelligence Just Around the Corner?

2025年2月11日

The AI Revolution: Is Superintelligence Just Around the Corner?

This is a question that has been asked for decades, and it is one that is becoming increasingly relevant as AI…

See all articles

The Problem with Chain of Thought

Introducing Chain of Draft: A More Efficient Approach

How Chain of Draft Works

Standard Model Response:

Chain of Thought Response:

Chain of Draft Response:

Performance Comparison: CoT vs. CoD

Implementing Chain of Draft: Simpler Than You Think

Example Prompts:

The Future of AI Reasoning

What do you think about Chain of Draft? Could this be the future of AI reasoning?

Shyamal Indika的更多文章

Overcoming RAG’s Limitations with Agentic RAG

Microsoft’s Quantum Breakthrough: A New State of Matter for the Future of Computing

Evaluating "Docling" for Production Use: A Comprehensive Analysis

Grok 3 is Here: Elon Musk's AI Breakthrough

How MOSIP Can Support Sri Lanka’s Digital Transformation

How to Set Up Supabase for Local AI Agents: A Step-by-Step Guide

The Aadhaar System: Lessons for Sri Lanka’s Digital Transformation

Decoding the LangChain Ecosystem: LangChain, LangGraph, LangFlow, and LangSmith

Beyond Words: Is Latent Reasoning the Key to True AI?

The AI Revolution: Is Superintelligence Just Around the Corner?