The AI Semiconductor Landscape Primer

To get my best coverage, consider becoming a paid subscriber for less than $2 a week. To read this post with infographics and proper formatting, please visit the original here.

This was written on January 20th, 2025. Happy Trump inauguration day! With the U.S. continuing a number of stringent export controls, and the next administration expected to keep them up, perhaps even with elevated tariffs, it’s a super interesting time to think more about the semiconductor industry. The AI arms race, and the national security due diligence tied to U.S. exceptionalism, is upon us.

A flurry of Executive Orders in the Biden Administration’s last two weeks in office was telling. Trump and the new administration will be carefully watched and their actions scrutinized. Meanwhile, I’ve long admired the work of Eric Flaningam for his macro overviews of various aspects of technology stacks. Let’s feature some of them here:


His newsletter, Generative Value, by Eric Flaningam, provides great insights into how everything is connected.

Articles to check out!


  1. A Deep Dive on Inference Semiconductors
  2. The Current State of AI Markets
  3. A Primer on Data Centers
  4. A Primer on AI Datacenters
  5. The Inference Landscape
  6. Nvidia: Past, Present and Future

Whether you are an investor, a technologist, or just a casual reader, his overviews are easy to understand and scan, and they will provide you value.

As you know, the Biden administration implemented a series of executive orders (EOs) and export controls (ECs) aimed at regulating the semiconductor industry, particularly in response to national security concerns and competition with China. Trump has said various things with regard to tariffs on China as well. The U.S. appears to be trying to control how AI spreads to other nations, limiting China’s ability to access, for example, Nvidia’s best AI-related GPUs and chips.

Jake Sullivan, with three days left as White House national security adviser and wide access to the world’s secrets, called on journalists and news media to deliver a chilling, “catastrophic” warning for America and the incoming administration:

The AI Arms Race circa 2025

What happens from this point on is fundamentally a new world of innovation, and of competition in innovation.

“The next few years will determine whether artificial intelligence leads to catastrophe — and whether China or America prevails in the AI arms race.”

  • According to Sullivan, as reported by Axios, “AI development sits outside of government and security clearances, and in the hands of private companies with the power of nation-states.”
  • U.S. failure to get this right, Sullivan warns, could be “dramatic, and dramatically negative — to include the democratization of extremely powerful and lethal weapons; massive disruption and dislocation of jobs; an avalanche of misinformation.” It wasn’t clear from his briefing whether OpenAI, Anthropic, Google and others can be expected to “get this right.” The U.S. believes it is the AI leader heading into the new year and the new administration.
  • Clearly, in 2025, corporations and the financial elite with the most say (majority shareholders) hold enormous power in the AI arms race ahead. The 2025 to 2035 period, an incredible decade of datacenters, semiconductors, and a sprawling new AI landscape, is arguably the most important decade of innovation human civilization has ever witnessed.
  • Geopolitics aside, the semiconductor industry is becoming far more important with the growth of datacenters and the emergence of new AI capabilities. I will be covering the semiconductor industry more closely in 2025, in this and related publications.

But how does it all work? What are the companies involved? Why are companies like Nvidia, TSMC, ASML and others so pivotal? What about the big picture and landscape?

The AI Semiconductor Landscape

By Eric Flaningam, December 2024.


Hi, my name’s Eric Flaningam. I’m the author of Generative Value, a technology-focused investment newsletter. My investment philosophy is centered on value. I believe that businesses are valued based on the value they provide to customers, the difference between that value and the value of competitors, and the ability to defend that value over time. I also believe that technology has created some of the best businesses in history and that finding those businesses will lead to strong returns over time. Generative Value is the pursuit of those businesses.

1. Introduction

Nvidia’s rise in the last 2 years will go down as one of the great case studies in technology.

Jensen envisioned accelerated computing back in 2006. As he described in a 2023 commencement speech: “In 2007, we announced [released] CUDA GPU accelerated computing. Our aspiration was for CUDA to become a programming model that boosts applications from scientific computing and physics simulations, to image processing. Creating a new computing model is incredibly hard and rarely done in history. The CPU computing model has been the standard for 60 years, since the IBM System 360.”

For the next 15 years, Nvidia executed on that vision.

With CUDA, they created an ecosystem of developers using GPUs for machine learning. With Mellanox, they became a (the?) leader in data center networking. They then integrated all of their hardware into servers to offer vertically integrated compute-in-a-box.

When the AI craze started, Nvidia was the best-positioned company in the world to take advantage of it: a monopoly on the picks and shovels of the AI gold rush.

That led to the rise of Nvidia as one of the most successful companies ever to exist.

With that rise came competition, including from its biggest customers. Tens of billions of dollars have flowed into the ecosystem to take a share of Nvidia’s dominance.

This article will be a deep dive into that ecosystem today and what it may look like moving forward. A glimpse at how we map out the ecosystem before we dive deeper:

  • To read the entire piece, consider supporting the Newsletter for less than $2 a week.



A mental model for the AI semiconductor value chain. The graphic is not exhaustive of companies and segments.

2. An Intro to AI Accelerators

At a ~very~ high level, all logic semiconductors have the following pieces:

  1. Computing Cores - run the actual computing calculations.
  2. Memory - stores data to be passed on to the computing cores.
  3. Cache - temporarily stores data that can quickly be retrieved.
  4. Control Unit - controls and manages the sequence of operations of other components.

Traditionally, CPUs are general-purpose processors. They’re designed to run any calculation, including complex multi-step processes. As shown below, they have more cache, more control units, and much smaller compute cores (Arithmetic Logic Units, or ALUs, in CPUs).

Source: https://cvw.cac.cornell.edu/gpu-architecture/gpu-characteristics/design

On the other hand, GPUs are designed for many small calculations or parallel processing. Initially, GPUs were designed for graphics processing, which needed many small calculations to be run simultaneously to load displays. This fundamental architecture translated well to AI workloads.
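To make that contrast concrete, here’s a rough sketch in Python with NumPy (my own illustration, running on a CPU, not actual GPU code): the explicit loop mirrors the one-element-at-a-time style a single core handles well, while the vectorized line expresses the same work as one bulk operation, the shape of computation a GPU’s thousands of cores run in parallel.

```python
import time
import numpy as np

n = 1_000_000
a = np.random.rand(n)
b = np.random.rand(n)

# CPU-style thinking: process one element at a time, in sequence.
start = time.perf_counter()
out_loop = np.empty(n)
for i in range(n):
    out_loop[i] = a[i] + b[i]
loop_time = time.perf_counter() - start

# GPU-style thinking: express the whole computation as one bulk,
# data-parallel operation over all elements at once.
start = time.perf_counter()
out_bulk = a + b
bulk_time = time.perf_counter() - start

print(f"sequential loop: {loop_time:.3f}s")
print(f"bulk operation:  {bulk_time:.4f}s")
```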

Why are GPUs so good for AI?

The base unit of most AI models is the neural network, a series of layers with nodes in each layer. These neural networks represent scenarios by weighting each node to most accurately represent the data they’re trained on.

Once the model is trained, new data can be given to the model, and it can predict what the output data should be (inference).

This “passing through of data” requires many, many small calculations in the form of matrix multiplications [(one layer, its nodes, and weights) times (another layer, its nodes, and weights)].

This matrix multiplication is a perfect application for GPUs and their parallel processing capabilities.
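A minimal sketch of that “passing through of data” in NumPy (the layer sizes are arbitrary, chosen only for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)

# A batch of 32 inputs with 784 features each, flowing through two
# layers. Each layer's "pass" is one matrix multiplication.
batch = rng.standard_normal((32, 784))
w1 = rng.standard_normal((784, 256))   # weights between layer 1 and layer 2
w2 = rng.standard_normal((256, 10))    # weights between layer 2 and the output

hidden = np.maximum(batch @ w1, 0)     # matmul plus a ReLU nonlinearity
logits = hidden @ w2                   # the next layer: another matmul

print(logits.shape)  # (32, 10): every row computed by identical, independent math
```

Every one of those multiply-adds is independent of its neighbors, which is exactly the kind of work parallel hardware is built for.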

(Stephen Wolfram has a wonderful article about how ChatGPT works.)

The GPU today

GPUs continue to get larger, with more computing power and memory, and they are more specialized for matrix multiplication workloads.

Let’s look at Nvidia’s H100, for example. It consists of CUDA and Tensor cores (basic processors), processing clusters (collections of cores), and high-bandwidth memory. The H100’s goal is to process as many calculations as possible, with as much data flow as possible.

Source: https://resources.nvidia.com/en-us-tensor-core

The goal is not just chip performance but system performance. Outside of the chip, GPUs are connected to form computing clusters, servers are designed as integrated computers, and even the data center is designed at the systems level.
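One way to see the “calculations plus data flow” framing is a back-of-the-envelope roofline check: does a workload offer enough math per byte of memory traffic to keep the cores busy? The sketch below uses approximate public H100 figures (roughly 1,000 TFLOPS of dense FP16 tensor throughput and ~3.35 TB/s of HBM3 bandwidth; treat both peaks as ballpark assumptions):

```python
# Roofline-style estimate for one large FP16 matrix multiply.
M = N = K = 8192
flops = 2 * M * N * K                      # multiply-adds in an (MxK) @ (KxN) matmul
bytes_moved = 2 * (M * K + K * N + M * N)  # FP16 = 2 bytes per element, read + write

peak_flops = 1e15   # ~1,000 TFLOPS dense FP16 (approximate)
peak_bw = 3.35e12   # ~3.35 TB/s HBM3 bandwidth (approximate)

intensity = flops / bytes_moved  # FLOPs of work per byte this matmul offers
balance = peak_flops / peak_bw   # FLOPs per byte the chip needs to stay busy

print(f"arithmetic intensity: {intensity:,.0f} FLOPs/byte")
print(f"machine balance:      {balance:,.0f} FLOPs/byte")
print("compute-bound" if intensity > balance else "memory-bound")
```

Large matrix multiplies clear the machine-balance line comfortably; smaller or skinnier operations quickly become memory-bound, which is why the memory system is engineered as aggressively as the cores themselves.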

Training vs Inference

To understand the AI semiconductor landscape, we have to take a step back to look at AI architectures.

Training iterates through large datasets to create a model that represents a complex scenario, and inference provides new data to that model to make a prediction.

Source: https://www.dhirubhai.net/pulse/difference-between-deep-learning-training-inference-mark-robins-mdq8c/
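To make the split concrete, here’s a toy sketch in NumPy (my own illustration; the linear model and iteration counts are arbitrary). Training loops over the whole dataset many times while updating weights; inference is a single, cheap pass over new data:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy dataset: inputs x and targets y generated by hidden "true" weights.
x = rng.standard_normal((1000, 8))
true_w = rng.standard_normal(8)
y = x @ true_w

# TRAINING: many iterations over the dataset, adjusting the weights.
w = np.zeros(8)
lr = 0.1
for _ in range(200):
    grad = x.T @ (x @ w - y) / len(x)  # least-squares gradient (up to a constant factor)
    w -= lr * grad

# INFERENCE: one forward pass on a single new input.
new_x = rng.standard_normal(8)
prediction = new_x @ w
print(prediction)
```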

A few key differences are particularly important with inference:

  1. Latency & Location Matter - Since inference runs workloads for end users, speed of response matters, meaning inference at the edge or in edge-cloud environments can make more sense; training, by contrast, can happen anywhere.
  2. Reliability Matters (A Little) Less - Training a leading-edge model can take months and requires massive training clusters. The interdependence of those clusters means a mistake in one part can slow down the entire training process. Inference workloads are much smaller and less interdependent; if a mistake occurs, only one request is affected, and it can be rerun quickly.
  3. Hardware Scalability Matters Less - One of Nvidia’s key advantages is its ability to scale to larger systems via its software and networking; with inference, this scalability matters less.

Combined, these reasons help explain why so many new semiconductor companies are focused on inference: the barrier to entry is lower.

Nvidia's networking and software allow it to scale to much larger, more performant, and more reliable training clusters.

On to the competitive landscape.

3. The AI Semiconductor Landscape

We can broadly look at the AI semiconductor landscape in three main buckets:

  1. Data Center Chips used for Training
  2. Data Center Chips used for Inference
  3. Edge Chips used for Inference

Visualizing some of those companies below:

Read the entire article here.

