Scaling AI on HPC

(HPC and AI, Part 2)

In the HPC DevCon keynote, we highlighted some of the opportunities created by running AI applications on HPC infrastructure (AI-on-HPC), as well as the equally promising results of augmenting HPC usages with AI techniques (HPC-on-AI). Let’s start by discussing AI-on-HPC.

AI Deep Learning (DL) applications excel at knowledge discovery: ingesting large volumes of mostly unstructured data, identifying structure and patterns, and classifying clusters and features within a large multi-dimensional space. All of this is done with moderate to no supervision, and the parallel processing inherent to Neural Network structures scales almost without limit when coupled with the bandwidth provided by HPC.
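As a loose illustration (not from the keynote), the sketch below learns compact features from unlabeled data with a small autoencoder and then groups the learned embeddings into clusters. Every dimension, layer size, and hyperparameter here is a hypothetical placeholder.

```python
# Minimal sketch of unsupervised pattern discovery: no labels are used anywhere.
import torch
import torch.nn as nn
from sklearn.cluster import KMeans

# Placeholder for a large, unlabeled, high-dimensional dataset.
data = torch.randn(10_000, 512)

encoder = nn.Sequential(nn.Linear(512, 64), nn.ReLU())   # compress to 64-d features
decoder = nn.Linear(64, 512)                             # reconstruct the input
optimizer = torch.optim.Adam(
    list(encoder.parameters()) + list(decoder.parameters()), lr=1e-3
)

for epoch in range(10):                                   # reconstruction loss only
    recon = decoder(encoder(data))
    loss = nn.functional.mse_loss(recon, data)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()

# Cluster the learned low-dimensional features to expose structure in the data.
with torch.no_grad():
    features = encoder(data)
clusters = KMeans(n_clusters=8, n_init=10).fit_predict(features.numpy())
```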

The underlying infrastructure of HPC is built for aggregation and scale. HPC delivers the highest levels of system compute performance, massive memory pools, and an optimized communication fabric with best-in-class cross-node bandwidth and throughput. These crucial capabilities allow Deep Learning to scale and solve the largest and most complex challenges.
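To make the bandwidth point concrete, here is a minimal sketch of the communication step at the heart of data-parallel DL training: on every iteration, each node's gradients are averaged across all nodes with an allreduce, and the interconnect determines how quickly that exchange completes. The buffer size and MPI setup are illustrative assumptions, not details from the keynote.

```python
# Illustrative allreduce of gradients across nodes; run with e.g. `mpirun -np 4 python ...`
import numpy as np
from mpi4py import MPI

comm = MPI.COMM_WORLD
rank, size = comm.Get_rank(), comm.Get_size()

# Placeholder: each rank computes gradients on its own shard of the data.
local_grads = np.random.randn(25_000_000).astype(np.float32)   # ~100 MB per iteration

# Sum the gradients from all ranks, then divide to get the global average.
global_grads = np.empty_like(local_grads)
comm.Allreduce(local_grads, global_grads, op=MPI.SUM)
global_grads /= size

# Every rank now applies the same averaged update, keeping its model replica in sync.
```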

An example of the DL scalability achievable with HPC infrastructure is the work done by Prabhat and team at the US Department of Energy Office of Science and Berkeley Lab, in collaboration with Intel Labs. The team put together a 15-PetaFLOP Deep Learning system for solving scientific pattern classification problems. The system scales training of a single model to ~9,600 Intel® Xeon Phi™ processor-based nodes on the Cori supercomputer to effectively extract weather patterns from a 15 TB climate dataset [1]. Their results demonstrate the advantages of optimizing and scaling DL structures onto many-core HPC systems.
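For context, a generic synchronous data-parallel training loop is sketched below using PyTorch's DistributedDataParallel: data is sharded across ranks and gradients are averaged every step. The Cori work used its own optimized software stack and a far larger model and dataset, so this is only an illustration of the scaling pattern, with placeholder shapes and hyperparameters.

```python
# Generic data-parallel training sketch; typically launched with torchrun or mpirun.
import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP
from torch.utils.data import DataLoader, TensorDataset
from torch.utils.data.distributed import DistributedSampler

dist.init_process_group(backend="gloo")   # NCCL or MPI backends are also common on HPC

# Placeholder dataset and model standing in for the climate data and network.
dataset = TensorDataset(torch.randn(100_000, 256), torch.randint(0, 4, (100_000,)))
sampler = DistributedSampler(dataset)      # each rank sees a distinct shard of the data
loader = DataLoader(dataset, batch_size=256, sampler=sampler)

model = DDP(torch.nn.Linear(256, 4))       # DDP averages gradients across ranks
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)
loss_fn = torch.nn.CrossEntropyLoss()

for epoch in range(5):
    sampler.set_epoch(epoch)               # reshuffle the shards each epoch
    for x, y in loader:
        optimizer.zero_grad()
        loss_fn(model(x), y).backward()    # backward triggers the cross-node allreduce
        optimizer.step()
```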

Another major benefit of using HPC infrastructure for Deep Learning is the greatly improved response time for DL training iterations. Developing a DL network solution is an iterative process that involves data scientists or researchers and compute-intensive experimentation. The ability to explore, examine, and optimize the network quickly materially shortens the time to train models and can contribute to higher-quality results.

The next post will discuss the great potential of applying Deep Learning capabilities and techniques to significantly enhance key HPC usages (HPC-on-AI).

Thanks for your interest,

Gadi

You can watch the portion of the HPC DevCon keynote relevant to this article here.


1. Kurth, Thorsten, et al.: "Deep Learning at 15PF: Supervised and Semi-Supervised Classification for Scientific Data," Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC '17), Article No. 7. https://dl.acm.org/citation.cfm?doid=3126908.3126916


 
