Can it (genAI) solve Wordle?
Created with Microsoft Designer on LinkedIn

For a very long time, the running joke about high-end consumer PCs and cutting-edge graphics cards was "Can it play Crysis?". Crysis had earned a reputation for melting the most powerful PCs and bringing them to their knees.

It's a nerdy joke. Jensen Huang mentioned it during his Computex 2023 keynote, pondering whether the state-of-the-art 256 Grace Hopper Superchip data center configuration could play Crysis.

Jensen Huang at Computex 2023


I decided to take all the latest generative AI models and throw at them the toughest problem that humankind faces first thing in the morning.

"Wordle"

And hence the big question. Can it (genAI) solve Wordle?

Spoiler alert: it can't. Not GPT-4o, not Gemini, not Claude.

So here's me putting Gemini, Claude and GPT-4o through their paces to see if they can solve Wordle.

This is what my Wordle screen looked like three tries in.

Wordle Attempt | 5/29/2024

I devised the following prompt.

I am looking for a 5 letter word. The word cannot contain the alphabets "C","R","O","W","N","S","I","M","E","B","U","K","Y". The word contains the alphabet "L". "L" cannot be in the second or third place in the word. What are the possible solutions for this word?        
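
For comparison, the same constraints are easy to state as a deterministic filter. The sketch below is a minimal Python illustration, not something any of the models ran. The names `BANNED`, `matches` and `candidates`, and the tiny `sample` list, are assumptions made for this example; a real solver would load a full dictionary of five-letter words instead.

```python
# Constraints copied from the prompt above.
BANNED = set("CROWNSIMEBUKY")  # letters the word cannot contain
REQUIRED = "L"                  # letter the word must contain
BAD_INDEXES = (1, 2)            # zero-based indexes of the 2nd and 3rd places


def matches(word):
    """Return True if `word` satisfies every constraint in the prompt."""
    w = word.upper()
    if len(w) != 5 or not w.isalpha():
        return False
    if BANNED & set(w):          # any banned letter present
        return False
    if REQUIRED not in w:
        return False
    return all(w[i] != REQUIRED for i in BAD_INDEXES)


def candidates(words):
    """Keep only the words that pass `matches`, preserving order."""
    return [w for w in words if matches(w)]


# Hypothetical sample list, for illustration only (not the day's answer list).
sample = ["fatal", "halal", "label", "aloft", "half"]
print(candidates(sample))  # ['fatal']
```

Only "fatal" survives here: "halal" has an L in third place, "label" and "aloft" use banned letters, and "half" has four letters.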

Claude

Claude melted faster than butter on a hot skillet.

It generated a list of candidate words that, while structurally valid, were not real English words. It also ignored the positional constraint on "L".

It failed miserably.

Gemini

Gemini started its reasoning from the positional constraint on the letter "L" but completely missed the constraint on which letters cannot be part of the word.

So it did half the work and did not validate its end results with all the constraints.

GPT-4o

Finally, we get to the coolest kid on the block. GPT-4o.

To give credit where it's due, GPT-4o started like the proverbial first bencher in the classroom. It replayed the constraints that I gave and started to solve the problem logically. It got my hopes up and dashed them equally fast when it came up with "HALF" as its final solution.

Seriously! This is not even a five-letter word. I was left speechless.

In Conclusion

I won't reveal today's Wordle answer to avoid spoilers, but it's worth mentioning that none of the generative AI models got it right.

The key to solving any problem is to devise a solution framework, apply it, get an answer, and validate your results through a parallel process.
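
The validation step in particular is mechanical. As a rough sketch, assuming the constraints from my prompt, a checker can list every rule a proposed answer breaks. The function name `violations` is hypothetical; the inputs are GPT-4o's "HALF" and two of the words from the Copilot reply quoted in the comments.

```python
BANNED = set("CROWNSIMEBUKY")  # letters ruled out in the prompt


def violations(answer):
    """Return a list describing each prompt constraint that `answer` breaks."""
    w = answer.upper()
    problems = []
    if len(w) != 5:
        problems.append(f"has {len(w)} letters, not 5")
    used = sorted(BANNED & set(w))
    if used:
        problems.append("uses banned letters " + ", ".join(used))
    if "L" not in w:
        problems.append("is missing L")
    for place in (2, 3):  # one-based places where L is not allowed
        if len(w) >= place and w[place - 1] == "L":
            problems.append(f"has L in place {place}")
    return problems


for answer in ["HALF", "Label", "Lilac"]:
    print(answer, "->", violations(answer) or "passes")
```

Run against these answers, "HALF" fails on length and on L in third place, "Label" uses B and E, and "Lilac" uses C and I and puts L in third place. A check like this, run as a parallel process after the model answers, would have caught every one of them.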

Both Gemini and GPT-4o seem to break down the problem into logical solution steps but falter at the application level. I would say that I found GPT-4o's abilities better than Gemini's for this specific problem.

Claude follows a brute force approach and falters at the first step itself.

In conclusion, the answer to the question "Can it (genAI) solve Wordle?" is a resounding No.

The objective of this exercise, beyond general amusement, was to test the ability of foundational generative AI models to break a problem down into logical steps, execute them while maintaining constraint integrity, arrive at or deduce the answer, and finally validate it.

I believe that the investments in generative AI will yield a substantial increase in capabilities every year. While the hype around genAI and its potential remains at an all-time high, RAG-based solutions and thorough testing and validation for your use case are critical to ensure consistent results.


The opinions in this article are the author's own and do not reflect the views of the organization he is employed with.

Shrikant Kulkarni

Passionate for Customer Success, Data/AI Technology, People Development, Practice Eminence

5 months ago

While GenAI can’t solve it, here is my attempt from some time back to aid Wordle solving using basic Python programming: https://www.dhirubhai.net/posts/shrikantkulkarni_github-shrikant78dssummit-activity-6910441872398135296-IiLu?utm_source=share&utm_medium=member_ios

Vijay Gunti

Building Generative AI , Single and Multiple Agents for SAP Enterprises | Mentor | Agentic AI expert | SAP BTP &AI| Advisor | Gen AI Lead/Architect | SAP Business AI |Joule | Authoring Gen AI Agents Book

6 months ago

What strategies did each model use to tackle Wordle?

Madhavan M.

Business Technology Solution leader- ZS Associates, Evanston

6 months ago

Awesome :) My co-pilot buddy answers: "Certainly! Let’s find a 5-letter word that meets your criteria. It cannot contain the letters C, R, O, W, N, S, I, M, E, B, U, or K. The word must include the letter L, but L cannot be in the second or third position. Here are some possible solutions: Label, Legal, Ladle, Loyal, Lilac. Feel free to choose the one that resonates with you!"

Naveen Sharma

Global Practice Head @ Cognizant | AI, Data and Analytics

6 months ago

Better Call Sam!

Tushar Sinha

Sales and Solution leader - Data, Analytics and AI

6 months ago

Wonderful experiment! I am glad that you were never my professor, else you would have devised a test to fail me :) The world is enjoying Gen AI and you are showing how it can be broken to pieces!
