How DeepSeek overcame US sanctions

MIT Technology Review

Our in-depth reporting on innovation reveals and explains what’s happening now to help you know what’s coming next.

Published: January 28, 2025

The AI community is abuzz over DeepSeek R1, a new open-source reasoning model. The model was developed by the Chinese AI startup DeepSeek, which claims that R1 matches or even surpasses OpenAI’s ChatGPT o1 on multiple key benchmarks but operates at a fraction of the cost. DeepSeek’s success is even more remarkable given the constraints facing Chinese AI companies in the form of increasing US export controls on cutting-edge chips. In this edition of What’s Next in Tech, discover how the company was able to overcome US sanctions to create DeepSeek R1.

Flash sale alert! Subscribe today to save 25% on the 10 Breakthrough Technologies 2025 list and get a FREE digital report on small language models.

With a new reasoning model that matches the performance of ChatGPT o1, DeepSeek managed to turn restrictions into innovation.

Early evidence shows that the US’s export controls on advanced semiconductors are not working as intended. Rather than weakening China’s AI capabilities, the sanctions appear to be driving startups like DeepSeek to innovate in ways that prioritize efficiency, resource-pooling, and collaboration.

To create R1, DeepSeek had to rework its training process to reduce the strain on its GPUs, a variety released by Nvidia for the Chinese market whose performance is capped at half the speed of Nvidia’s top products, according to Zihan Wang, a former DeepSeek employee and current PhD student in computer science at Northwestern University.

DeepSeek R1 has been praised by researchers for its ability to tackle complex reasoning tasks, particularly in mathematics and coding. The model employs a “chain of thought” approach similar to that used by ChatGPT o1, which lets it solve problems by processing queries step by step.

Dimitris Papailiopoulos, principal researcher at Microsoft’s AI Frontiers research lab, says what surprised him the most about R1 is its engineering simplicity. “DeepSeek aimed for accurate answers rather than detailing every logical step, significantly reducing computing time while maintaining a high level of effectiveness,” he says.

The company has also released six smaller versions of R1 that are small enough to run locally on laptops. It claims that one of them even outperforms OpenAI’s o1-mini on certain benchmarks.

Although there's a lot of buzz around R1, DeepSeek remains relatively unknown. Read the story to dive deep into how the startup managed to create an AI model that one expert says could be a “truly equalizing breakthrough” despite tight US sanctions.

China Report is your weekly guide to everything happening in China and technology. Stay informed on the biggest headlines, deep analysis, and original stories. Sign up today to stay informed.

Get ahead with these related stories:

The second wave of AI coding is here — A string of startups are racing to build models that can produce better and better software. They claim it’s the shortest path to AGI.
AI’s energy obsession just got a reality check — DeepSeek poses a threat to the narrative that more computing power is the only thing that’ll unlock AI breakthroughs.
What’s next for AI in 2025 — You already know that agents and small language models are the next big things. Here are five other hot trends you should watch out for this year.

Image: Stephanie Arnett/ MIT Technology Review | Rawpixel

Flash sale! Save 25% when you subscribe today.

Boštjan Dolinšek

3 weeks

OK Boštjan Dolinšek

Roy Y.

4 weeks

It's a good news for AI/ML industry indeed. While we create AI hype across the street, the main issue of this innovation is that we are trying to use more expensive technology to replace cheaper ones. This won't make any economic sense and eventually will bust. DeepSeek shared their breakthroughs like early OpenAI, now Meta. It lowers the cost and every AI/ML company can apply it. It will boost the AIML application development and increase the adoption. We still need the compute power.

Joseph Bayana

In the Business of Big Data

4 weeks

What's all the shock about low cost DeepSeek? It's made in China, where they also experiment on their 1.4 billion population using unbridled AI and the readily available big-to-massive-to-humongous-data for any and all models. If you think Meta, Microsoft, Amazon, ABC, among others, exploits data from Americans, wait til you find out what China does with their humongous data that's 4 times that of the USA. Kung Hei Fat Choi!

MICHAEL WOO, MBA, CCS

Certified Collection Specialist at Wakefield & Associates, Inc

1 month

Impressive

Andrew Fox

UI & UX Product Specialist @ Freelance Digital | Creative / Design Director | Innovator

1 month

Restrictions force us to think outside the box, leading us to natural innovation. When resources, time, options are limited, we find creative solutions to overcome obstacles. This constraint-driven problem-solving encourages efficiency, new perspectives, and unconventional thinking. If you want a creative mind to excel restrict it!! Disruption is king.
