Transformers and self-attention: the research breakthroughs behind ChatGPT

What are the research breakthroughs behind ChatGPT? LLMs have been all the rage lately, especially transformers and "self-attention". We got a chance to talk to Noam Shazeer of Character.ai (and formerly Google), who was a key contributor to several of these breakthroughs.


Noam covered a lot of ground in our conversation, and Aarthi Ramamurthy and I had a blast (links at the bottom).

  1. How he got into Google in the early 2000s, including his famous interview with Paul Buchheit that led to a rewrite of the Google spellchecker.
  2. The history of AI research, especially key moments in the 2000s.
  3. What is a "transformer"? And the story behind the legendary "Attention Is All You Need" paper.
  4. What is "self-attention"? (There's a short code sketch after this list.)
  5. What are key limitations/step-function changes?
  6. What are some interesting future applications for AI?

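For the curious, here's what "self-attention" boils down to before you hit play. Below is a minimal NumPy sketch of the scaled dot-product self-attention from the "Attention Is All You Need" paper: each token is projected into a query, a key, and a value, and every output is a mix of the values weighted by softmax(QKᵀ/√d_k). The matrix names and toy sizes are illustrative, not anything from the episode.

```python
import numpy as np

def self_attention(X, Wq, Wk, Wv):
    """Scaled dot-product self-attention over one sequence.

    X:          (seq_len, d_model) token embeddings
    Wq, Wk, Wv: (d_model, d_k) learned projection matrices (random here)
    """
    Q = X @ Wq  # queries: what each token is looking for
    K = X @ Wk  # keys: what each token offers to others
    V = X @ Wv  # values: the content that gets mixed together
    scores = Q @ K.T / np.sqrt(K.shape[-1])          # pairwise relevance, scaled by sqrt(d_k)
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)   # row-wise softmax
    return weights @ V                               # each output: attention-weighted blend of values

# Toy usage: 4 tokens, 8-dimensional embeddings (sizes are arbitrary)
rng = np.random.default_rng(0)
X = rng.normal(size=(4, 8))
Wq, Wk, Wv = (rng.normal(size=(8, 8)) for _ in range(3))
print(self_attention(X, Wq, Wk, Wv).shape)  # -> (4, 8)
```

A real transformer runs many of these attention "heads" in parallel and stacks them with feed-forward layers, but this weighted mixing is the core trick the paper's title refers to.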

Warning: this is dense - but fun. Leave a comment below and we will try to answer!

Watch/listen:

  1. Spotify
  2. Apple
  3. YouTube (best option for comments!)
