LLMPC: Large Language Model Predictive Control
This article is a summary of a full paper available at https://arxiv.org/abs/2501.02486
The original research code and examples are available at https://github.com/gmaher/llmpc
Large Language Models (LLMs) tend to perform better when given structured prompts; in particular, prompts that ask the LLM to reason and plan before acting are effective. However, fundamental questions remain: Why do these methods work? What are their limitations? How can we improve them further? This post examines LLM prompting through the lens of Model Predictive Control (MPC), a framework in which controllers generate and execute action plans. We show that LLMs act as approximate cost function minimizers when planning, and that their performance can be enhanced by incorporating explicit planning objectives.
The MPC Framework
In MPC, an agent navigates a state space by choosing actions that minimize an objective function over a planning horizon. The objective typically combines task-specific costs (like distance to a goal state) with regularization costs (like action complexity). The action plan is obtained by solving this minimization problem, executing the first few actions, and then replanning from the new state. From the MPC perspective, asking an LLM to generate a plan is analogous to using the LLM to approximately solve this objective minimization problem.
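Concretely, writing the state at step t as x_t, the action as u_t, and the (known) dynamics as f, the planning problem over a horizon of H steps takes roughly the following form. This is a schematic paraphrase of the setup described above, not the paper's exact formulation:

```latex
\min_{u_1,\dots,u_H} \; \sum_{t=1}^{H} \Big( c_{\text{task}}(x_t, u_t) + c_{\text{reg}}(u_t) \Big)
\quad \text{subject to} \quad x_{t+1} = f(x_t, u_t)
```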
LLMs as Planners
In the MPC viewpoint, an LLM takes a prompt (encoding the current state) and outputs a sequence of tokens that map to actions. Different prompting methods (ReAct, Tree-of-Thoughts, etc.) thus mostly vary in how they structure this mapping. The key insight is that, regardless of prompting structure, all planning prompts are limited by the fact that the LLM can only approximately solve the planning optimization problem.
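As a rough illustration of this mapping (not the paper's implementation), an LLM planner boils down to a function that encodes the state into a prompt, queries the model, and parses the reply back into actions. `call_llm` and `parse_actions` below are hypothetical helpers:

```python
def llm_plan(state, horizon, call_llm, parse_actions):
    """Ask an LLM for an action plan given the current state.

    `call_llm` and `parse_actions` are placeholders for whatever model
    client and output parser a particular system uses.
    """
    prompt = (
        f"Current state: {state}\n"
        f"Propose the next {horizon} actions, one per line."
    )
    completion = call_llm(prompt)      # raw text produced by the model
    return parse_actions(completion)   # map the text back to a list of actions
```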
Improving Performance with LLMPC
Since LLMs are approximate optimizers, we can enhance their performance by making better use of explicit objective functions. Our LLMPC method (a minimal code sketch of the loop follows below):
1. Uses the LLM to sample multiple possible control sequences
2. Evaluates each sequence using actual cost and state update functions
3. Selects and executes the best-performing sequence
4. Replans after a few steps

We demonstrated this approach on two problems:
1. Spring-Mass Control: LLMPC successfully controlled a spring-mass system to reach target states, though with higher objective values than exact MPC solutions (as expected for an approximate method).
2. Code Generation: We compared LLMPC against one-shot generation for creating a Flappy Bird game. LLMPC produced more complete code with additional features like sprites and game-over screens, showing how longer-horizon planning enables handling more complex tasks.
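To make the loop concrete, here is a minimal sketch of the four steps above, assuming hypothetical helpers: `sample_plans` (the LLM proposes candidate action sequences), `step` (the known state update), and `cost` (the per-step objective). This is illustrative only, not the exact API of the linked repository:

```python
def llmpc_control(state, num_steps, k_samples, horizon, execute_steps,
                  sample_plans, step, cost):
    """Run the LLMPC loop: sample plans, score them, execute the best prefix, replan."""
    for _ in range(0, num_steps, execute_steps):
        # 1. Use the LLM to sample several candidate action sequences.
        candidates = sample_plans(state, k_samples, horizon)

        # 2. Evaluate each candidate by rolling out the known dynamics and cost.
        def rollout_cost(actions):
            s, total = state, 0.0
            for a in actions:
                total += cost(s, a)
                s = step(s, a)
            return total

        # 3. Select the lowest-cost sequence.
        best = min(candidates, key=rollout_cost)

        # 4. Execute only its first few actions, then replan from the new state.
        for a in best[:execute_steps]:
            state = step(state, a)
    return state
```

For the spring-mass example, `step` and `cost` would be the system's dynamics and a cost penalizing distance to the target state; the same loop applies to other tasks once a plan sampler and an evaluator are available.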
Key Takeaways
- LLM prompting methods can be understood through the MPC framework
- LLMs act as approximate optimizers of planning objectives
- Performance can be improved by incorporating explicit cost functions
- LLMPC provides a systematic way to enhance LLM planning abilities
This framework helps explain why techniques like Monte Carlo Tree Search with external evaluators improve LLM performance, and suggests further ways to enhance LLM-based planning systems.