ChatGPT understands First Order Logic
If you've taken a logic course, there's a good chance that you've learned about Wumpus World - a game from the 1970s that inspired AI experiments in logic.
As a refresher, Wumpus World is a game played on a grid where a gold reward and hazards such as pits and wumpuses (monster-like creatures) await you. You can't see the gold or the hazards directly, but you can sense them: glitter appears in the square that contains the gold, while a breeze or a stench indicates that a pit or a wumpus, respectively, sits in a horizontally or vertically adjacent square. Below is an example of a Wumpus World board.
Sounds like a fun game, but how is a motivated player to keep track of all those glitters, breezes and stenches and know how to move? An effective way is to use First Order Logic (FOL). The set of game rules is maintained as clauses. For example, "all cells next to the cell of a Wumpus will sense a stench" can be expressed as:
Stench(a,b) ⇔ ∃c,d: (Adjacent([a,b],[c,d]) ∧ Wumpus(c,d))
and "all cells next to the cell of a pit will sense a breeze" as:
Breeze(a,b) ⇔ ∃c,d: (Adjacent([a,b],[c,d]) ∧ Pit(c,d))
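To make the two rules concrete, here is a minimal sketch of my own (not from the article) that encodes them directly over grid coordinates; the function names adjacent, stench and breeze are illustrative.

```python
# Minimal sketch (not from the article): the Stench and Breeze rules encoded
# directly over grid coordinates. Function names are illustrative.

def adjacent(a, b, c, d):
    """True when cells [a,b] and [c,d] are horizontally or vertically adjacent."""
    return abs(a - c) + abs(b - d) == 1

def stench(cell, wumpus_cells):
    """Stench(a,b) <=> exists c,d: (Adjacent([a,b],[c,d]) and Wumpus(c,d))."""
    a, b = cell
    return any(adjacent(a, b, c, d) for (c, d) in wumpus_cells)

def breeze(cell, pit_cells):
    """Breeze(a,b) <=> exists c,d: (Adjacent([a,b],[c,d]) and Pit(c,d))."""
    a, b = cell
    return any(adjacent(a, b, c, d) for (c, d) in pit_cells)

# A wumpus at [1,3] produces a stench at the adjacent cell [1,2]:
print(stench((1, 2), {(1, 3)}))  # True
```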
Using the Stench and Wumpus rule, how can we derive that a Wumpus does not exist at [1,3] if a Stench was not sensed at [1,2]? We can express the rule (instantiated at cell [1,2]), the observation and the conclusion as sentences ready for conversion to Conjunctive Normal Form:
(Stench[1,2] ⇔ (Wumpus[1,3] ∨ Wumpus[2,2])) #bi-directional implication
¬Stench[1,2] #observation - no stench sensed at [1,2]
¬Wumpus[1,3] #conclusion - no wumpus at [1,3]
Then, negating the conclusion for a proof by contradiction, we conjoin the three sentences, resulting in:
(Stench[1,2] ⇔ (Wumpus[1,3] ∨ Wumpus[2,2])) ∧ ¬Stench[1,2] ∧ Wumpus[1,3]
Converting the biconditional to clauses and applying resolution, Wumpus[1,3] (the negation of the conclusion) resolves with ¬Wumpus[1,3] ∨ Stench[1,2] to give Stench[1,2], which in turn resolves with ¬Stench[1,2] to produce the empty clause. That contradiction proves the original conclusion: there is no Wumpus at [1,3].
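This refutation can also be checked mechanically. Below is a minimal sketch of my own (not part of the article) of propositional resolution over these clauses; the symbols Stench12, Wumpus13 and Wumpus22 are shorthand I introduce for this example.

```python
from itertools import combinations

def resolve(ci, cj):
    """All resolvents of two clauses. Clauses are frozensets of literal strings;
    negation is marked with a leading '~'."""
    resolvents = set()
    for lit in ci:
        comp = lit[1:] if lit.startswith("~") else "~" + lit
        if comp in cj:
            resolvents.add(frozenset((ci - {lit}) | (cj - {comp})))
    return resolvents

def refutes(clauses):
    """Return True if resolution derives the empty clause (unsatisfiable set)."""
    clauses = set(clauses)
    while True:
        new = set()
        for ci, cj in combinations(clauses, 2):
            for r in resolve(ci, cj):
                if not r:            # empty clause => contradiction found
                    return True
                new.add(r)
        if new <= clauses:           # no new clauses: no refutation found
            return False
        clauses |= new

# CNF clauses of the biconditional, the observation, and the negated conclusion:
kb = [
    frozenset({"~Stench12", "Wumpus13", "Wumpus22"}),  # Stench12 => (W13 v W22)
    frozenset({"~Wumpus13", "Stench12"}),              # W13 => Stench12
    frozenset({"~Wumpus22", "Stench12"}),              # W22 => Stench12
    frozenset({"~Stench12"}),                          # observation
    frozenset({"Wumpus13"}),                           # negated conclusion
]
print(refutes(kb))  # True: contradiction, so ~Wumpus13 is entailed
```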
The above process is taught to and learned by students every year, so I'm not demonstrating anything revolutionary. But what if an artificially intelligent agent could perform this task without any training specific to the problem itself? That is what I did with ChatGPT. Below is a transcript of a ChatGPT interaction showing how this works.
And ChatGPT understands FOL quantifiers:
Beyond the obvious fun of playing Wumpus World, ChatGPT's ability to deduce and infer using First Order Logic opens up interesting possibilities for rule-based problem solving with the model as the reasoning agent. An LLM loaded with pre-built FOL predicates and clauses would let users reason over their own problems effectively and at scale.
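For readers who want to try this themselves, here is a hypothetical sketch of how the rule, observation and question could be handed to the model through the OpenAI Python client; the model name and prompt wording are illustrative assumptions, not the exact prompt from my transcript.

```python
# Hypothetical sketch: passing the FOL rule, an observation and a question to
# a chat model and asking it to perform the resolution step itself.
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

prompt = """You are a First Order Logic reasoner for Wumpus World.
Rule: Stench(a,b) <=> exists c,d: (Adjacent([a,b],[c,d]) and Wumpus(c,d))
Observation: not Stench(1,2)
Question: Can a Wumpus be at [1,3]? Answer using resolution and show each step."""

response = client.chat.completions.create(
    model="gpt-4",  # illustrative model name
    messages=[{"role": "user", "content": prompt}],
)
print(response.choices[0].message.content)
```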
If you've used ChatGPT in this way, please reach out so we can discuss.