#8 Positioning in an AI world

This week we have Llama 3.1, Chip Huyen's post on building a GenAI platform, SearchGPT, and DeepMind getting silver at the International Math Olympiad.

Llama 3.1 simply performs

Meta launches a GPT-4-level LLM for free (plus 8B and 70B models too).

We knew this was coming, but beyond just releasing the weights, they detailed a lot of interesting decisions.

They chose a decoder-only architecture, steering away from Mixture of Experts, and focused on producing very clean data with careful boundaries to prevent contamination of the training set. It shows simplicity is valuable.

They also allow for synthetic data generation and model distillation, something current frontier models don't allow in their terms of service. I could see an explosion of competition for data companies like Scale AI or Nurdle AI.
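To make that concrete, here's a minimal sketch of distillation-style synthetic data generation, assuming an OpenAI-compatible endpoint serving the 405B model; the endpoint URL, model ID, and seed prompts are placeholders, not anything Meta prescribes.

```python
# Minimal sketch: use a large "teacher" model to generate training data
# for a smaller "student" model. Assumes an OpenAI-compatible server
# (vLLM, Together, etc.); endpoint and model name are placeholders.
import json
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="none")

seed_prompts = [
    "Explain vector databases to a backend engineer.",
    "Summarize the tradeoffs of Mixture of Experts models.",
]

with open("distill_data.jsonl", "w") as f:
    for prompt in seed_prompts:
        resp = client.chat.completions.create(
            model="meta-llama/Llama-3.1-405B-Instruct",  # the teacher
            messages=[{"role": "user", "content": prompt}],
        )
        # Each (prompt, teacher answer) pair becomes a training example
        # for the smaller student model.
        f.write(json.dumps({
            "prompt": prompt,
            "completion": resp.choices[0].message.content,
        }) + "\n")
```

The interesting part isn't the code; it's that the license now explicitly allows feeding these outputs into another model's training run.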

They're furthering support for security and safety tooling like Llama Guard and Prompt Guard, which is awesome.

Which leads to their announcement of Llama Stack: their open-source ambition to standardize interfaces to LLMs. It's something LlamaIndex and LangChain will definitely keep an eye on. React and PyTorch show Meta is really good at building developer communities.
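I haven't dug into the Llama Stack spec itself, but the value of any standardized interface is roughly this; a hypothetical sketch, with names that are mine rather than from the spec:

```python
from typing import Protocol

class ChatModel(Protocol):
    """Any provider exposing this shape is swappable."""
    def chat(self, messages: list[dict], temperature: float = 0.7) -> str: ...

def answer(model: ChatModel, question: str) -> str:
    # Application code depends on the interface, not the vendor,
    # which is the same decoupling LlamaIndex and LangChain sell today.
    return model.chat([{"role": "user", "content": question}])
```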

https://ai.meta.com/blog/meta-llama-3-1/

Groq is fast and fast is key

This demo really makes it click why speed is key to unlocking a bunch of use cases.

https://x.com/JonathanRoss321/status/1815777714642858313

Speak, get an answer instantly, tweak, iterate. Then the game becomes reducing the number of iteration loops. To do that, you'll want personalization and specialization. Very exciting times.

Blueprint for building your Generative AI stack

Chip Huyen's blog hits again. It's a very good resource on how to design your system incrementally, and it reminds us that this sort of stuff takes a lot of engineering to get reliably into production.
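To make "incrementally" concrete, here's a loose sketch of how such a platform tends to grow one layer at a time around a bare model call. This is my illustration, not Chip's reference architecture, and every helper is a stub:

```python
# Loose sketch: a GenAI platform grows one layer at a time around a bare
# model call. Each stub stands in for a real component (an LLM API,
# a vector store, a guardrail model).

def call_model(query: str, context: str) -> str:
    return f"[model answer to {query!r} using {len(context)} chars of context]"

def retrieve_documents(query: str) -> str:
    return "...retrieved context..."        # e.g. a vector-store lookup

def passes_input_guardrails(query: str) -> bool:
    return "password" not in query.lower()  # e.g. a safety-classifier call

cache: dict[str, str] = {}

def handle_query(query: str) -> str:
    if query in cache:                      # layer 3: response cache
        return cache[query]
    if not passes_input_guardrails(query):  # layer 2: input guardrails
        return "Sorry, I can't help with that."
    context = retrieve_documents(query)     # layer 1: retrieval (RAG)
    answer = call_model(query, context)     # day one: the bare LLM call
    cache[query] = answer
    return answer

print(handle_query("What does our refund policy say?"))
```

Each layer can be added, and measured, independently, which is what makes the incremental approach tractable.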

Somewhat tangential, but look back at Llama 3.1: they don't detail (at least that I know of) the size and talent of the team behind it.

Which leads to two avenues if you want to keep your team lean: banking on raw model performance improving so less engineering is required, or relying on vendors and frameworks to provide high-quality results.

OpenAI is coming for your search bar

OpenAI is taking on Perplexity with its own enhanced real-time search product, SearchGPT. Partnerships with publishers are the first step, but is this a Trojan horse in a 20-sided game of prisoner's dilemma? Have publishers already lost?

We'll probably see more of this type of commercialization, as rumor has it they're set to lose $5B this year.

DeepMind gets silver at the International Math Olympiad

I love how DeepMind quietly works on breakthroughs. OpenAI was born out of being the foil to DeepMind.

Their system writes proofs in Lean, a formal proof language, and searches for solutions until it finds one that verifies. One of the problems took minutes; others took three days.
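For a sense of what "formal" buys you: a Lean proof is code that the proof checker either accepts or rejects, so the search has an unambiguous success signal. A toy example, assuming Mathlib and nothing like IMO difficulty:

```lean
import Mathlib

-- A trivially machine-checkable statement: the sum of two even naturals
-- is even. Lean's kernel either accepts the proof or rejects it, which
-- gives the search an unambiguous reward signal.
example (a b : ℕ) (ha : Even a) (hb : Even b) : Even (a + b) :=
  Even.add ha hb
```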

Using formal proof systems this way has been hinted at by many, including Terry Tao.

It's very exciting to see this happening in real-time.

https://deepmind.google/discover/blog/ai-solves-imo-problems-at-silver-medal-level/


What's our takeaway?

In reinforcement learning, there's a concept of a reward function: a signal that tells your program whether what it did was good or bad, which lets it learn to do things quicker or better.

When using generative AI apps, we are the reward function. We tell it good or bad, or tweak the result.
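A minimal sketch of the idea, with toy numbers rather than a real training loop: the reward function scores an action, and the learner shifts toward whatever scored well. In a GenAI app, that score is your thumbs-up, thumbs-down, or edit.

```python
import random

# Toy bandit: two "response styles" a GenAI app could choose between.
actions = ["concise", "detailed"]
value = {a: 0.0 for a in actions}  # running estimate of each action's reward
counts = {a: 0 for a in actions}

def human_reward(action: str) -> float:
    # Stand-in for the user: this user happens to prefer concise answers.
    return 1.0 if action == "concise" else 0.2

for _ in range(100):
    # Epsilon-greedy: mostly exploit the best-known action, sometimes explore.
    if random.random() < 0.1:
        a = random.choice(actions)
    else:
        a = max(actions, key=lambda x: value[x])
    r = human_reward(a)  # in a real app, this is your thumbs-up or edit
    counts[a] += 1
    value[a] += (r - value[a]) / counts[a]  # incremental mean update

print(value)  # drifts toward {'concise': ~1.0, 'detailed': ~0.2}
```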

We all have personal preferences too. Can we model them?

In a way, we have - in the form of social media algorithms.

We can also pre-populate and cache answers to questions we can anticipate.
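A rough sketch of that precomputation idea; `call_model` is a stub, and the anticipated questions are placeholders:

```python
# Sketch: precompute answers for questions we can anticipate, so the
# common path never waits on a live model call.

def call_model(question: str) -> str:
    return f"[generated answer to {question!r}]"

anticipated = [
    "What's new in Llama 3.1?",
    "How do I reduce LLM latency?",
]

# Offline, or at deploy time: warm the cache.
warm_cache = {q: call_model(q) for q in anticipated}

def answer(question: str) -> str:
    # Instant if anticipated; fall back to a live call otherwise.
    return warm_cache.get(question) or call_model(question)

print(answer("What's new in Llama 3.1?"))  # served from cache
```

In practice you'd match on embedding similarity rather than exact strings, but the economics are the same: spend compute ahead of time to buy back latency.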

If you had infinite compute and energy, what could you do and who is best positioned to do it?
