登录查看更多内容

Model Swarms and swarm intelligence

TuringPost

Newsletter about AI and ML. ?? Sign up for free to get your list of essential AI resources ??

发布日期: 2024年10月25日

Model Swarms is a collaborative search algorithm proposed by University of Washington , Google Cloud AI Research, Google DeepMind, Google.

Inspired by Particle Swarm Optimization, it uses swarm intelligence to help multiple LLMs work together to adapt without fine-tuning and improve.

Here’s how it works:

Model Swarms begins with a set of LLM experts already fine-tuned on different tasks. Inspired by swarm intelligence (like one that flocks of birds have), each LLM acts as a "particle" with a location (its settings or weights) and a direction (velocity) for improvement.

Step 0 – Initialization:

To make the search more effective, Model Swarms expands the initial set of particles by combining pairs of experts. Each particle starts with a random direction.

Step 1 – Velocity Update: Each particle adjusts its direction based on four factors:

? Inertia: Keeps it moving in its current direction.

? Personal best: Draws it toward the best performance it has achieved so far.

? Global best: Guides it toward the best result of the group.

? Global worst: Keeps it away from the least effective settings.

Step 2 – Weight Update:

Towards Data Science 1 个月前

This week's latest generative AI updates - September…

SymphonyAI 2 个月前

Issue #284 - The ML Engineer ??

Alejandro Saucedo 6 个月前

The particle takes a step in its new direction and is evaluated again. If a particle fails to improve over a set number of tries, it restarts from its best-known settings to avoid getting stuck.

End of Search:

The search stops when the global best result hasn’t improved after several attempts, or after a set number of iterations. The particle with the highest score is selected as the best solution.

Results of Model Swarms:

? Single task: improved by up to 21.0%, beating 12 models by 13.3% on average across 9 datasets.

? Multi-task: 5.7% better than baselines, with 11.3% gains in legal domain

? Reward model: Outperformed baselines by 6.7% on average

? Human interest: 70.8% win rate in evaluations; improved LLM-as-a-judge scores by 17.6% and factuality by 17.0% in 16 topics

Original paper: MODEL SWARMS: COLLABORATIVE SEARCH TO ADAPT LLM EXPERTS VIA SWARM INTELLIGENCE

Model Swarms and swarm intelligence

TuringPost

Newsletter about AI and ML. ?? Sign up for free to get your list of essential AI resources ??

领英推荐

Turing Post

2,213 位关注者

更多精彩文章

社区洞察

其他会员也浏览了

Artificial Intelligence #183

Artificial Intelligence #183

Global AI and Data Analytics in Construction and Property Roundup

A Recipe for AI-nnovation

OpenAI’s Bold Move: $6.5 Billion Funding Round Valuing the AI Giant at $150 Billion

Accenture Pioneers Custom Llama LLM Models with NVIDIA AI Foundry

AI its Impact and More…

?? Anthropic Goes Public! Kind Of.

Your AI's Memory Sucks

领英推荐

Turing Post

2,213 位关注者

TüLU 3: not just a model

2024年11月29日

NLRL: Natural Language Reinforcement Learning redefines Reinforcement Learning.

2024年11月28日

Topic 19: Inside LLaVA-o1

2024年11月28日

Hymba small model: a great combo of 2 concepts

2024年11月28日

??#77: Amid Big Model Chaos: Small Models and Embeddings Steal the Spotlight

2024年11月26日

????#5: Building Blocks of Agentic Systems

2024年11月25日

SAMURAI model for perfect segmenting and tracking objects in videos

2024年11月25日

Concepts: Supervised, Semi-Supervised, Self-Supervised, Unsupervised types of Machine Learning

2024年11月23日

FastRAG for semi-structured data

2024年11月23日

SEALONG, self-impovement approach for long context reasoning

2024年11月22日

社区洞察

其他会员也浏览了

Artificial Intelligence #183

Artificial Intelligence #183

Global AI and Data Analytics in Construction and Property Roundup

A Recipe for AI-nnovation

OpenAI’s Bold Move: $6.5 Billion Funding Round Valuing the AI Giant at $150 Billion

Accenture Pioneers Custom Llama LLM Models with NVIDIA AI Foundry

AI its Impact and More…

?? Anthropic Goes Public! Kind Of.

Your AI's Memory Sucks