Machine learning and competitions

Machine learning and competitions

The two games of Go played by the young Chinese genius of Go, Ke Jie, against AlphaGo took place in an unusual context: not only did they come after the successive defeats of Fan Hui and Lee Sedol, two of the best players in the world, but also, the Chinese authorities censored their broadcast, in what can only be assumed to be a gross misinterpretation of the national pride. For Ke Jie, who has recognized that Google’s algorithm is already “too strong for humans” and has become a kind of “Go god”, he finds himself being used as evidence, as the final proof of the superiority of machines at playing the game, a role previously experienced by a handful of humans in other games.

The importance of these types of challenges is relative. In reality, they are about letting the public know about the progress of machine learning by the companies sponsoring the challenge. Over time, we have seen how machine learning algorithms have taken over chess, Jeopardy, Go and, most recently, poker, without wondering what exactly each competition intended to prove.

Chess is a scenario game. Each move generates a new scenario, and good players are able to mentally see many moves ahead. When Kasparov lost against IBM’s Deep Blue in 1996, the only thing humans had to recognize was that a machine was already capable of calculating probabilistic scenarios better than we were. In other words, computational brute force. Given that we have been using calculators for decades, that is easy to assimilate: therefore, no offense taken, the human pride was still in a reasonable good position.

When, in 2011, another IBM product, Watson, beat the best ever players of Jeopardy, things were different. Here, a machine was better able to understand rhetorical questions expressed in human language, to look for its possible answers, to choose one of them, to press a button … and win. After Jeopardy, we knew that an algorithm is better than us at understanding our own language, opening up all kinds of innovation in conversational robots, chatbots, the law, medicine, and many more areas.

In 2015, a Google algorithm beat two of the best Go players in the world, a feat that has just been repeated few days ago. What’s going on? This challenge crowns the importance of a revolutionary technique: deep learning. After the algorithm was fed every game of Go ever played and registered, Google had a very good player, but not one that always won. It was as good as Go’s great masters, but not definitively better. Deep learning made the difference: the machine was programmed to invent new moves and to play against itself, to explore improbable scenarios. The result was that in some matches, AlphaGo used moves that no human had ever made in any of the previous games ever played, moves with a probability of one in ten thousand, and managed to win. So we now live in a world where an algorithm can develop intelligence for a task beyond anything humans are capable of.

Finally, in 2017, an algorithm created by Carnegie Mellon University, Libratus, beat some of the world’s best poker players. Poker is an intrinsically human game in which several cards remain face down, the identity of which can only be speculated on, while we get information about the rest of the cards on the table and from what the other players give us with their bets, which of course can be bluffs. After 120,000 poker hands played over 20 days, Libratus’ victory was absolute and unconditional: humans had no chance.

What does this mean? Simply that a machine is already capable of analyzing a situation with imperfect information, subject to uncertainty, and can make better decisions than a human could. That’s right: making decisions based on incomplete or imperfect information, including possibly false information, is what a manager does at a company; something I’ve been trying to teach my students for twenty-seven years. The approach taken by Tuomas Sandholm, creator of Libratus, is to create algorithm with a wide range of uses. He is not interested in an algorithm to play poker but in one able to carry out cybersecurity analysis, medical diagnostics or business negotiations.

Competitions and business challenges should not be analyzed simply as the communications strategy they represent but instead as what they really mean in terms of achievements, to show what is possible. If anybody reading this still doubts the possibilities of machine learning, let them come up with the next challenge.



(En espa?ol, aquí)

Daniel Tshabalala

Head of Insights and Analytics at Standard Bank

7 年

Unbeatable Poker algorithm? Now, this class of games with imperfect information is especially interesting to me.

回复
willis "Scooter" duff

Science fiction author at Amazon, Smashwords

7 年

Link a human mind to a next-gen quantum computer and STAND BACK! The human/machine hybrid will exceed any machine - until the hardware can duplicate, then surpass, the unique wetware evolved over a few million years.

回复
Utku ünal

Senior Software Engineer

7 年

Thanks for this comprehensive article.

回复
Chuck Sebesta

Real Estate at Chuck Sebesta

7 年

Good Read

回复

要查看或添加评论,请登录

Enrique Dans的更多文章

  • El desastre del software y la automoción

    El desastre del software y la automoción

    GM se ve obligada a detener temporalmente las ventas de su Chevy Blazer EV después de detectar un sinnúmero de…

    11 条评论
  • El enésimo drama de la automoción tradicional: la interfaz

    El enésimo drama de la automoción tradicional: la interfaz

    Porsche acaba de anunciar que se une a toda la legión de empresas de automoción tradicionales y renuncia a tener una…

  • Poniendo a prueba a ChatGPT: consultores centauros o cyborgs

    Poniendo a prueba a ChatGPT: consultores centauros o cyborgs

    Un working paper de Harvard, ?Navigating the jagged technological frontier: field experimental evidence of the effects…

    12 条评论
  • Suscripciones, tramos… y spam

    Suscripciones, tramos… y spam

    Elon Musk confirma sus intenciones de convertir la antigua Twitter, ahora X, en un complejo entramado de suscripciones…

  • El código abierto y sus límites

    El código abierto y sus límites

    Sin duda, el código abierto es la forma más ventajosa de crear software: cuando un proyecto de software toma la forma…

  • La gran expansión china

    La gran expansión china

    El ranking de apps más descargadas en el mundo en iOS y Android para el mes de septiembre de 2023 elaborado por…

    1 条评论
  • Starlink y las torres de telefonía en el espacio

    Starlink y las torres de telefonía en el espacio

    Starlink remodela su página web y a?ade una oferta de internet, voz y datos para smartphones provistos de conectividad…

    3 条评论
  • La fotografía con trampa

    La fotografía con trampa

    La presentación de los nuevos smartphones de Google, Pixel 8 y Pixel 8 Pro, y fundamentalmente de las funcionalidades…

  • Las consecuencias de reprimir los procesos de innovación

    Las consecuencias de reprimir los procesos de innovación

    Mi columna de esta semana en Invertia se titula ?El mercado de trabajo y la innovación? (pdf), y previene sobre los…

  • We are on the verge of the most dangerous election in history

    We are on the verge of the most dangerous election in history

    In just a few days, on November 3rd, the US presidential elections will take place, the most dangerous in history, and…

    2 条评论

社区洞察

其他会员也浏览了