Solving Wordle

Let me first come clean. Yes, I’ve been playing Wordle. I’ve had it open in a browser tab for like a week. The game is irresistible. I somehow feel the need to prove to a computer that, you know, my English is decent. But the game is also irritating. The puzzle is not intelligent. All it really takes is a fairly mundane search process that runs in my head. Doing this doesn’t improve my English.

I know of only one way to get out of this kind of addiction.

I’m born to automate. So I decided to algorithmically solve the puzzle. Gotta show the computer who is in charge. Now that is a puzzle worth solving.

And you know, this is how we roll at Launchable. We believe developers don’t just tackle the problem in front of them. We believe in solving a problem once and for all.

Algorithmic solution to Wordle

Modeling a puzzle problem first involves defining the game state. In this case, that’s the set of all the possible answer words. As we make guesses and receive hints, this set shrinks.

My first instinct was that I also needed to fold the guesses made so far and the hints they received into the game state, but somewhat counterintuitively, that information is not needed, because it’s all embedded in the set of possible answer words.
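To make that concrete, here is a minimal sketch in Python (illustrative only, not the actual kohsuke/wordle-solver code). The state is just a set of candidate words, and each guess/hint pair filters it; the hint() helper it relies on is sketched a little further down.

```python
# Game state = the set of words that could still be the answer.
# hint(guess, answer) computes the colored feedback; see the next sketch.

def shrink(state: set[str], guess: str, observed: tuple) -> set[str]:
    """Keep only the candidates that would have produced the hint we observed."""
    return {word for word in state if hint(guess, word) == observed}
```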

With this, a solution becomes easy enough. Let’s take this step by step.

Two words x and y are "confusingly similar" with respect to a guess g if they produce the same hint for g. For example, if we guess 'fox', then the answers 'joe' and 'bob' would produce the same hint, so we cannot tell them apart:

x ~g y   ⇔   hint(g, x) = hint(g, y)
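Here is how the hint and the relation might look in code, continuing the sketch (the usual two-pass scoring, so repeated letters don't over-report yellows):

```python
from collections import Counter

def hint(guess: str, answer: str) -> tuple:
    """Wordle-style feedback: 2 = green, 1 = yellow, 0 = gray."""
    result = [0] * len(guess)
    # Answer letters that are not exact matches; these can still earn yellows.
    leftover = Counter(a for g, a in zip(guess, answer) if g != a)
    for i, (g, a) in enumerate(zip(guess, answer)):
        if g == a:
            result[i] = 2
        elif leftover[g] > 0:
            result[i] = 1
            leftover[g] -= 1
    return tuple(result)

def confusingly_similar(x: str, y: str, g: str) -> bool:
    """x ~g y: guess g produces the same hint for both, so it cannot tell them apart."""
    return hint(g, x) == hint(g, y)
```

With this, hint('fox', 'joe') and hint('fox', 'bob') both come out as (0, 2, 0), matching the example above.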

This relationship partitions the set of all possible answers A into subsets, otherwise known as a quotient set (which is a set of sets):

A/~g = { [x] : x ∈ A },   where [x] = { y ∈ A : x ~g y }
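Continuing the sketch, computing A/~g is just a matter of grouping the candidates by the hint they would produce:

```python
from collections import defaultdict

def partition(A: set[str], g: str) -> dict[tuple, set[str]]:
    """The quotient set A/~g: candidates grouped by the hint guess g would produce."""
    quotient = defaultdict(set)
    for word in A:
        quotient[hint(g, word)].add(word)
    return quotient
```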

Take one member S of A/~g. The probability that S is the next game state is proportional to its size:

P(S) = |S| / |A|,   for S ∈ A/~g

We don’t know which of those members will be the actual next game state, so we compute the expected size of the next game state, which is |S| weighted by the probability of each member S:

E[ |next state| ] = Σ_{S ∈ A/~g} P(S) · |S| = ( Σ_{S ∈ A/~g} |S|² ) / |A|
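On top of the partition sketch, that expectation is a one-liner over the class sizes:

```python
def expected_next_state_size(A: set[str], g: str) -> float:
    """Expected size of the next game state after guessing g: sum of |S|^2 / |A|."""
    return sum(len(S) ** 2 for S in partition(A, g).values()) / len(A)
```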

Now we just need to find the guess g that minimizes this metric.
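In the sketch, that is a one-line minimization. Wordle accepts many more words as guesses than it ever uses as answers, which is why the pool of legal guesses is a separate parameter here:

```python
def best_guess(A: set[str], guessable: set[str]) -> str:
    """The guess that minimizes the expected size of the next game state."""
    return min(guessable, key=lambda g: expected_next_state_size(A, g))
```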

There are some additional details that need treatment (for example, a guess could actually be the answer), but I’ll spare you those for brevity. See kohsuke/wordle-solver for the full source code.

Here’s how it solved today’s puzzle. According to this algorithm, "roate" is the optimal first guess (I would never have guessed that!). On average, it reduces the 2315 possible answers down to just 60.4. In this particular case, the actual number of possible answers after the first guess was 102, so we got a little unlucky. "feued" reduced it down to 7 candidates, and "picky" narrowed it down to just 1, which was "perky".
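For what it's worth, a driver loop for playing along with the daily puzzle could look roughly like this, continuing the sketch (the real solver also handles the case where a guess happens to be the answer, which this omits):

```python
def solve_interactively(answers: set[str], guessable: set[str]) -> None:
    state = set(answers)
    while len(state) > 1:
        g = best_guess(state, guessable)
        print(f"try: {g}  ({len(state)} candidates remain)")
        # Type the hint back as digits, e.g. "02100" for gray/green/yellow/gray/gray.
        observed = tuple(int(c) for c in input("hint: "))
        state = shrink(state, g, observed)
    print(f"the answer is: {state.pop()}")
```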


Is this really optimal?

Now that I’ve solved the puzzle, the obvious next question is: is this the best possible algorithm?

First, you can come up with an algorithm that cheats. If you look at the game’s source code, you’ll see that the answer word is not randomly chosen; in fact, if you know yesterday’s word, you know what today’s word is. That gives you an algorithm that solves the game in one guess every single time. But that’s not very interesting, right?

Ignoring those, I suspect this algorithm is not actually optimal, but I haven’t come up with a case that proves it. The algorithm locally optimizes the expected size of the set of possible answer words at each guess, but I suspect that doesn’t automatically make it globally optimal. That is, it’s conceivable that a guess creates slightly suboptimal quotient sets, yet the words in those sets are such that the next guess allows a highly efficient elimination.

I’m not sure how to find such a case short of brute force. And assuming I can find at least one such case via brute force, what’s even less clear is how to design a provably optimal solution that is also computationally tractable. I suspect such a solution doesn’t exist. But maybe it does.

Let the real game begin!

It kinda sucks that we cannot prove the optimality of the algorithm! Or does it?

The upside (yes, we always look for an upside at Launchable!) is that the algorithms can compete. Now that is the real game, if you ask me!

Mine solves the game in 3.48 guesses on average across all the possible answers. Let’s see if you can do better, and if so, tweet to us at @Launchableinc.

(Update 2022/2/2) Is this the optimal algorithm? No

Above, I discussed whether this is the best possible algorithm and said I suspected it isn't. Today I actually proved that it isn't.

The game always starts with a blank state and the first move is yours, so pretty quickly everyone develops a favorite first guess. My algorithm determined that it should be ROATE, and by exhaustively computing every possible game from there, I arrived at an average of 3.48 guesses.

I evaluated a few other promising opening candidates exhaustively in the same way, and found cases in which worse partitions result in overall better performance (a sketch of this exhaustive evaluation follows the list):

  • ROATE's expected next game state size is 60.42 and solves the puzzle in 3.48 guesses
  • SNARE's expected next game state size is 71.10 and solves the puzzle in 3.47 guesses
  • SALET's expected next game state size is 71.27 and solves the puzzle in 3.45 guesses
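The exhaustive evaluation itself is conceptually simple: fix the opening guess, play out every possible answer with the greedy strategy, and average the turn counts. Roughly, continuing the sketch (it ignores the tie-breaking details mentioned earlier, so its exact numbers may differ slightly from the figures above):

```python
def turns_to_solve(answer: str, state: set[str], guessable: set[str], opening: str) -> int:
    """How many guesses the greedy strategy takes for one particular answer."""
    turns, forced = 0, opening
    while True:
        turns += 1
        if forced is not None:
            g, forced = forced, None          # play the fixed opening guess first
        elif len(state) == 1:
            g = next(iter(state))             # only one candidate left: guess it
        else:
            g = best_guess(state, guessable)
        if g == answer:
            return turns
        state = shrink(state, g, hint(g, answer))

def average_turns(opening: str, answers: set[str], guessable: set[str]) -> float:
    """Exhaustively play every possible answer with the fixed opening and average."""
    return sum(turns_to_solve(a, set(answers), guessable, opening)
               for a in answers) / len(answers)
```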

This raises all sorts of questions in my head... stay tuned for more updates.

(Update 2022/2/10) Hard mode

I was asked about hard mode. I suspect that the gap from the theoretical limit will widen in hard mode, because the hints obtained earlier can constrain the speed at which you can search. For example, if you guess LIGHT and only the first letter comes back wrong, you have to exhaustively try EIGHT, MIGHT, NIGHT, SIGHT, TIGHT, ... one by one.
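One simple way to model hard mode on top of the earlier sketches is to restrict the legal guesses to words that are still consistent with every hint seen so far. That is my simplification for illustration; the official rule is slightly looser, only forcing you to reuse revealed greens and yellows:

```python
def hard_mode_guess(state: set[str], guessable: set[str],
                    history: list[tuple[str, tuple]]) -> str:
    """Restrict guesses to words consistent with every (guess, hint) pair so far."""
    legal = {w for w in guessable
             if all(hint(g, w) == observed for g, observed in history)}
    return best_guess(state, legal)
```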

With that said, the algorithm extends to hard mode very trivially, so it's useful to establish as a baseline, even if it's not the very best. I extended my program and got this:

  • 1 guess: 0
  • 2 guesses: 118
  • 3 guesses: 1026
  • 4 guesses: 1005
  • 5 guesses: 136
  • 6 guesses: 23
  • 7 guesses: 6
  • 8 guesses: 1

The average number of guesses to solve comes to 3.542981.

