Overthinking can trip up not only people: there are tasks where Chain-of-Thought doesn't help AI models either.
Researchers from Princeton University and New York University investigated 3 cases where Chain-of-Thought (CoT, or step-by-step thinking) prompting can lead to worse outcomes in both humans and models:
- Implicit learning tasks (learning patterns without explicitly thinking about them)
- Visual tasks (recognizing images or objects at a glance)
- Learning with exceptions (where some rules don't always apply)
The researchers drew on insights from human psychology to predict when CoT aids or hinders model performance, applying two criteria to identify tasks where CoT might reduce it:
- Does verbal thinking lower human performance on the task?
- Do the constraints that cause this also apply to AI models?
And here are the key findings:
Implicit learning:
- Researchers used finite-state grammars (FSGs) to create artificial "words," building 4,400 tasks with words that either matched the grammar or deviated slightly from it.
- After seeing examples, the model had to identify which words matched the pattern.
Result: CoT prompting hurt model performance.
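To make the setup concrete, here is a minimal sketch of a Reber-style finite-state grammar, the kind of generator used in artificial-grammar-learning experiments. The transition table and letters are illustrative assumptions, not the paper's actual grammars.

```python
import random

# Illustrative finite-state grammar: each state maps to possible
# (letter, next_state) transitions. NOT the grammar from the paper.
FSG = {
    "S0": [("M", "S1"), ("V", "S2")],
    "S1": [("T", "S1"), ("V", "S3")],
    "S2": [("X", "S2"), ("R", "S3")],
    "S3": [("M", "END"), ("X", "END")],
}

def generate_word(max_len: int = 10) -> str:
    """Walk the grammar from S0 to END, emitting one letter per transition."""
    state, letters = "S0", []
    while state != "END" and len(letters) < max_len:
        letter, state = random.choice(FSG[state])
        letters.append(letter)
    return "".join(letters)

def perturb(word: str) -> str:
    """Create a near-miss by swapping one letter for another grammar letter.
    (A simplification: rarely, the result may still be grammatical.)"""
    i = random.randrange(len(word))
    alt = random.choice([c for c in "MTVXR" if c != word[i]])
    return word[:i] + alt + word[i + 1:]

if __name__ == "__main__":
    grammatical = generate_word()
    print("grammatical:", grammatical)
    print("near-miss:  ", perturb(grammatical))
```

The key property is that grammatical and near-miss words look almost identical, so the pattern is easier to absorb implicitly from examples than to articulate as explicit rules.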
Facial recognition:
- The model sees a target face and must pick the matching one from five options.
- Researchers generated 500 synthetic problems using 2,500 unique faces; each problem had one target face plus four distractors with similar features.
Result: Verbalizing the reasoning misses fine visual details; simpler, direct prompts work better for nuanced visual tasks.
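A hedged sketch of how the two prompting styles might be compared on this task. The prompt wording is illustrative, and `query_vlm` is a hypothetical wrapper around whatever vision-language model API you use; neither comes from the paper.

```python
import re

DIRECT_PROMPT = (
    "Here is a target face and five candidates labeled A-E. "
    "Answer with only the letter of the candidate that matches the target."
)

COT_PROMPT = (
    "Here is a target face and five candidates labeled A-E. "
    "First describe the target's features (eyes, nose, jawline) in words, "
    "compare each candidate against that description step by step, "
    "then give the letter of the matching candidate."
)

def extract_choice(response: str) -> str | None:
    """Take the last standalone A-E letter as the model's answer."""
    letters = re.findall(r"\b([A-E])\b", response)
    return letters[-1] if letters else None

def evaluate(query_vlm, problems):
    """Score both prompting styles on the same face-matching problems.

    `problems` is a list of (images, correct_letter) pairs; `query_vlm`
    is a hypothetical function taking (prompt, images) -> response text.
    """
    scores = {"direct": 0, "cot": 0}
    for images, answer in problems:
        if extract_choice(query_vlm(DIRECT_PROMPT, images)) == answer:
            scores["direct"] += 1
        if extract_choice(query_vlm(COT_PROMPT, images)) == answer:
            scores["cot"] += 1
    return scores
```

The intuition from the human literature (verbal overshadowing) is that forcing the match through a verbal description, as the CoT prompt does, discards exactly the fine-grained visual detail the task depends on.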
Learning with exceptions:
- The model had to classify vehicles by their features: one feature matched the label 80% of the time, three were irrelevant, and each vehicle had a unique color that identified it exactly.
- The model had to learn to label every vehicle correctly, including the exceptions.
Result: With CoT prompting, models took up to 4x longer to learn the task.
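A minimal sketch of this data setup, under stated assumptions: the feature names, label set, and counts below are illustrative, not the paper's exact stimuli.

```python
import random

def make_vehicles(n: int = 10, reliability: float = 0.8) -> list[dict]:
    """Generate vehicles where one feature predicts the label 80% of the
    time, three features are irrelevant, and a unique color identifies
    each vehicle, so a learner must memorize the exceptions by color."""
    vehicles = []
    for i in range(n):
        label = random.choice(["car", "truck"])
        # The predictive feature agrees with the label with probability
        # `reliability`; the remaining ~20% of vehicles are the exceptions.
        if random.random() < reliability:
            predictive = label
        else:
            predictive = "truck" if label == "car" else "car"
        vehicles.append({
            "predictive_feature": predictive,
            "irrelevant_features": [random.randint(0, 1) for _ in range(3)],
            "color": f"color_{i}",   # unique per vehicle
            "label": label,
        })
    return vehicles

if __name__ == "__main__":
    for v in make_vehicles(5):
        print(v)
```

Because the predictive feature is only 80% reliable, a learner that reasons its way to a general rule keeps misclassifying the exceptions; memorizing each vehicle by its unique color is the faster route, which may be why step-by-step reasoning slowed learning here.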
However, in some cases, human limitations don’t apply to models because of key differences in how they process information.
These tasks are:
- Explaining logical inconsistencies
- Using spatial intuitions
- Aggregating features for decision-making
Original paper: https://arxiv.org/pdf/2410.21333