The OCP Rack Architecture of the GH200 is Pretty Neat (at least to a HW nerd like myself)
Tony Grayson
VADM Stockdale Leadership Award Recipient | Tech and Business CxO | Ex-Submarine Captain | Top 10 Datacenter Influencer | Veteran Advocate
The NVIDIA GH200 NVL32 rack-scale reference architecture houses 16 dual-GH200 server nodes built on the NVIDIA MGX chassis, for 32 Grace Hopper Superchips per rack.
Central to the GH200 Grace Hopper Superchip's innovation is the NVLink-C2C interface, a coherent chip-to-chip link between the Grace CPU and Hopper GPU that establishes a single NVLink-addressable memory space and significantly streamlines model programming. Integrating high-bandwidth, low-power LPDDR5X and HBM3e memory with NVIDIA's GPU acceleration and high-performance Arm cores creates a powerful and balanced system.
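To make that unified memory space concrete, here's a minimal, hypothetical CUDA C++ sketch (the kernel and buffer names are mine, not NVIDIA's). On Grace Hopper, the coherent NVLink-C2C link, together with recent CUDA drivers, is documented to let a GPU kernel dereference an ordinary CPU heap allocation directly, with no cudaMalloc or cudaMemcpy staging:

```cpp
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

// Illustrative kernel: doubles each element in place.
__global__ void scale(float* data, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] *= 2.0f;
}

int main() {
    const int n = 1 << 20;
    // Plain malloc: on Grace Hopper, the coherent NVLink-C2C
    // address space lets the GPU access this CPU allocation
    // directly (assumes a driver/CUDA stack with this support).
    float* data = static_cast<float*>(malloc(n * sizeof(float)));
    for (int i = 0; i < n; ++i) data[i] = 1.0f;

    scale<<<(n + 255) / 256, 256>>>(data, n);
    cudaDeviceSynchronize();

    printf("data[0] = %f\n", data[0]);  // expect 2.0
    free(data);
    return 0;
}
```

On a conventional PCIe-attached GPU, the same malloc pointer would not be usable this way without managed memory, which is exactly the programming simplification described above.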
The connectivity framework of the GH200 server nodes employs an NVLink passive copper cable cartridge, giving every node seamless access to a remarkable 19.5 TB of NVLink-addressable memory across the rack. Each superchip contributes 624 GB (480 GB of LPDDR5X plus 144 GB of HBM3e), so each Hopper GPU can address the full 32 x 624 GB pool. The NVLink Switch System has been upgraded to NVLink copper interconnects, linking the 32 GH200 GPUs through nine NVLink switches built on third-generation NVSwitch chips to form a fully connected fat-tree network. For expanding computational needs, the system scales out over 400 Gb/s InfiniBand or Ethernet connections, merging exceptional performance with energy efficiency.
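As a quick back-of-the-envelope check of that 19.5 TB figure (a sketch; the 480 GB + 144 GB per-superchip split is NVIDIA's published GH200 configuration):

```cpp
#include <cstdio>

int main() {
    // Per GH200 superchip: 480 GB LPDDR5X (CPU) + 144 GB HBM3e (GPU).
    const int lpddr5x_gb = 480;
    const int hbm3e_gb   = 144;
    const int superchips = 32;  // GH200 NVL32: 16 dual-superchip nodes

    int per_chip_gb = lpddr5x_gb + hbm3e_gb;     // 624 GB
    int total_gb    = per_chip_gb * superchips;  // 19,968 GB
    printf("%d GB per superchip, %d GB (~19.5 TB) per rack\n",
           per_chip_gb, total_gb);
    return 0;
}
```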
NVIDIA is developing its own DGX GH200-based AI supercomputer, named NVIDIA Helios, to power its research and development efforts. Helios will consist of four DGX GH200 systems, interconnected with NVIDIA Quantum-2 InfiniBand networking to supercharge data throughput for training large AI models. With 256 Grace Hopper Superchips per DGX GH200 system, the setup totals 1,024 superchips.
Very neat!
Strategic Accounts - Acquisition Team (1y)
Tony, without a doubt, the next generation of data centers will require much deeper thought and engineering to absorb the massive increase in power density and rack liquid-cooling requirements. Data Center of the Future - who has the power?
Liquid Cooling Consulting Services (1y)
The NVIDIA GPUs represent the industry's first at-scale commitment to bringing liquid-cooled servers to market. Liquid cooling is inherently more efficient than air, so it's a win-win for the power grid.
Seeking Planet Friendly Solutions (1y)
#OpenCompute! Yes, Tony Grayson
Hyperscale Data Center Infrastructure Specialist, Strategist, Energy Efficiency & Sustainability Leader, with 40+ years in tech Researcher/Inventor/Fellow/Advisor (1y)
Thanks for this, Tony. It's amazing to see how far Nvidia has come. I worked with them 20 years ago deploying HPC racks that used InfiniBand and MPI to create the huge virtual blocks of memory required to optimize EDA app run times. Acquiring Mellanox was a brilliant move, one of many. But tying GPUs together into dynamically reconfigurable GPU clusters is a game changer. Well done, Nvidia!
HVAC for Life | Writer | Mentor | Skilled Trades Zealot | Dot Connector (1y)
Tony, thanks for the #GraceHopper info yesterday!