Show Me The Steps, ChatGPT
Asking ChatGPT to step through its answer can be a game changer

A few weeks ago, I did a session on ChatGPT at the MinneAnalytics #DataTech conference. In that talk, I showed a number of ways in which how you engineer your prompts can lead to wildly successful, or not-so-successful, outcomes. Unfortunately, the talk was only 45 minutes, and I had to cut one very interesting finding, which I will share here: the concept of “stepping through it”.

The Logic of Large Language Models

One of the things people get hung up on when working with large language models is just how authoritative they can seem. No matter what you throw at them, they have an answer, and that answer can seem extremely plausible if you don’t do your homework. I’ve told the business leaders and companies I consult with, a number of times, that ChatGPT falls down in research, where you need to cite sources. There is another area where it struggles, though, and that is math.

It’s interesting to note that ChatGPT has not been trained on HOW to add two numbers; instead, it has seen enough data to know that when someone types “what is 2 + 2”, it should respond with 4. Likewise, it knows that when someone asks, “If Jane has 2 apples and Joe has 4 oranges, how much fruit do they have together?”, it will correctly respond with “6 pieces of fruit”.

However, what if you ask it the following question?

If John has 5 pears, then eats 2, buys 5 more, then gives 3 to his friend, how many pears does he have?

Take a look at the screenshot below to see what ChatGPT gives. Note that I’m using what is called the OpenAI Playground, but it is backed by the same model that ChatGPT uses.

[Screenshot: Asking ChatGPT to solve a word problem]
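
If you would like to reproduce this yourself rather than use the Playground, here is a minimal sketch that sends the same prompt through the openai Python package. The model name, package version (pre-1.0 style), and settings are my assumptions, not necessarily what the Playground was running at the time.

    # Minimal sketch, assuming the pre-1.0 openai Python package and a
    # text-davinci-003 style completions model; adjust to whatever you have access to.
    import openai

    openai.api_key = "YOUR_API_KEY"  # placeholder: supply your own key

    prompt = (
        "If John has 5 pears, then eats 2, buys 5 more, "
        "then gives 3 to his friend, how many pears does he have?"
    )

    response = openai.Completion.create(
        model="text-davinci-003",  # assumed model
        prompt=prompt,
        max_tokens=64,
        temperature=0,  # deterministic output makes the comparison easier
    )

    print(response.choices[0].text.strip())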

If you do the math yourself, you will notice that the answer of 7 pears is incorrect!

5 - 2 + 5 - 3 is not equal to 7.

So, what’s going on here? While I don’t claim to know the details, it’s clear at some level that the language model is not actually understanding what is being asked. It has been trained on the concept of adding pears (with buying) and subtracting them (with eating and giving), but the reasoning behind how it arrived at the answer is neither shown nor followed.

Enter the statement that can help: “Let’s step through it”.

Stepping Through It

If you are asking for anything that requires logic, or you would like the reasoning explained, I would encourage you to use this. It’s not the be-all and end-all solution, but I have found it dramatically increases the likelihood of success. In the screenshot below, you’ll see that we add the statement “Let’s step through this” to the question. See the results.

[Screenshot: Asking ChatGPT to solve the same word problem, this time asking it to step through it]

Voila! The answer of 5 is correct.

By asking ChatGPT to step through the scenario, you effectively force it to explain each step. In doing so, the model works through the logic and arrives at the correct conclusion.
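
If you are working through the API instead of the Playground, the only change is appending the step-through instruction to the prompt. Here is a sketch under the same assumptions as the earlier snippet.

    # Same assumptions as the earlier sketch; only the prompt changes.
    import openai

    openai.api_key = "YOUR_API_KEY"  # placeholder: supply your own key

    prompt = (
        "If John has 5 pears, then eats 2, buys 5 more, "
        "then gives 3 to his friend, how many pears does he have? "
        "Let's step through this."
    )

    response = openai.Completion.create(
        model="text-davinci-003",  # assumed model
        prompt=prompt,
        max_tokens=128,  # leave room for the intermediate steps
        temperature=0,
    )

    print(response.choices[0].text.strip())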

Summary and Key Findings

In closing, there are a few things I think we can learn here. Some are obvious while others are subtle.

  1. Don’t take everything that ChatGPT says as truth. Hallucinations are definitely real, not only when asking for factual information, but also when asking it to perform logic and calculations.
  2. When it comes to math and word problems, check your work. I’ve found it to be good at straightforward arithmetic, but there’s a lot of nuance in word problems, and this is where ChatGPT can get confused.
  3. When you have a series of steps you are asking ChatGPT to perform, don’t be afraid to ask it to step through the logic. I’ve seen cases where asking it to step through still gives the wrong answer, but just like when you were in school and needed to work through a complex problem, asking a large language model like ChatGPT to do the same thing has the potential to yield much better results.
  4. It’s not specifically covered here, but the more verbose you can be with your language, the better. Just like parentheses in arithmetic, where (4 x 3) - 2 = 10 is different from 4 x (3 - 2) = 4, it helps to spell out in greater detail exactly what you are asking the model to calculate (see the sketch after this list).
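
To make that last point concrete, here is a hypothetical before-and-after of the same request. The exact wording is mine, not a tested prompt from the experiment, so treat it as an illustration of the idea rather than a recipe.

    # Hypothetical example: a terse prompt vs. a more verbose, explicit one.
    # Neither phrasing comes from the original experiment; they only illustrate
    # spelling out exactly what you want calculated.
    terse_prompt = "What is 4 x 3 - 2?"

    verbose_prompt = (
        "Calculate (4 x 3) - 2. "
        "First multiply 4 by 3, then subtract 2 from that result. "
        "Show each step and give the final number."
    )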

I hope you have found this blog post useful. Over time, I expect ChatGPT to get better and better at figuring these things out, but it never hurts to be more explicit. Artificial Intelligence is here to help make our lives better, and the more you can clarify the task, the better the output will be.

Good luck and happy prompting!
