Amazon’s AWS Extends Computing Options and Services

Bob O'Donnell

Published: December 2, 2022

Ever since Amazon launched its cloud computing platform, Amazon Web Services (AWS), in 2006, the company has been on a mission not only to convert the world to its vision of how computing resources can be purchased and deployed, but also to make them as ubiquitous as possible. That strategy was on clear display at this year’s iteration of its annual re:Invent conference. AWS debuted several new computing options, some based on its own new custom silicon designs, as well as a staggering array of data organization, analysis, and connection tools and services.

The sheer number and complexity of the new features and services that were unveiled make it difficult to keep track of all the choices now available to customers. Rather than being the outcome of unchecked development, however, the abundance of capabilities is by design. As new AWS CEO Adam Selipsky pointed out during his keynote and other appearances throughout the conference, the organization is customer-obsessed. As a result, most of its product decisions and strategies are based on customer requests. It turns out that when you have lots of different types of customers with different types of workloads and requirements, you end up with a complex array of choices.

Realistically, of course, that kind of approach will reach a logical limit at some point, but in the meantime, it means that the extensive range of AWS products and services likely represents a mirror image of the totality (and complexity) of today’s enterprise computing landscape. In fact, there’s a wealth of insight into enterprise computing trends waiting to be gleaned from an analysis of which services are being used, to what degree, and how that usage has shifted over time. But that’s a topic for another time.

In the world of computing options, the company acknowledged that it now has over 600 different Elastic Compute Cloud (EC2) computing instances, each of which consists of a different combination of CPU and other acceleration silicon, memory, network connections, and more. While that’s certainly a hard number to fully appreciate, it once again indicates how diverse today’s computing demands have become. From cloud-native, AI- or ML-based containerized applications that need the latest dedicated AI accelerators or GPUs to legacy “lifted and shifted” enterprise applications that only use older x86 CPUs, cloud computing services like AWS now need to be able to handle all of the above.

New entries announced this year include several based on Intel’s 3rd Generation Xeon Scalable Processors with various numbers of CPU cores, memory, and more. What received the most attention, however, were instances based on three of Amazon’s own new silicon designs. The Hpc7g instance is based on an updated version of the Arm-based Graviton3 processor, dubbed the Graviton3E, that the company claims offers 2x the floating-point performance of current-generation C6gn instances and 20% better overall performance than the current Hpc6a.

As with many instances, Hpc7g is targeted at a specific set of workloads, in this case High Performance Computing (HPC) applications such as weather forecasting, genomics processing, fluid dynamics, and more. Even more specifically, thanks to optimizations to ensure the best price/performance for these HPC applications, it’s ideally designed for bigger ML models that often end up running across thousands of cores. What’s interesting about this is that it demonstrates both how far Arm-based CPUs have advanced in terms of the types of workloads they’re being used for and the degree of refinement that AWS is bringing to its various EC2 instances.

Separately, in several other sessions, AWS highlighted the momentum towards Graviton usage for many other types of workloads as well, particularly for cloud-native containerized applications from AWS customers like DirecTV and Stripe. One intriguing insight that came out of these sessions is that because of the nature of the tools being used to develop these types of applications, the challenges of porting code from x86 to Arm native instructions (which were once believed to be a huge stopping point for Arm-based server adoption) have largely gone away. Instead, all that’s required is the simple switch of a few options before the code is completed and deployed on the instance. That makes further growth in Arm-based cloud computing significantly more likely, particularly for newer applications. Of course, some of these organizations are working toward building completely instruction set agnostic applications in the future, which would seemingly make instruction set choice irrelevant. However, even in that situation, compute instances that offer better price/performance or performance/watt ratios, as Arm-based CPUs often do, are a more attractive option.
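As a rough illustration of how small that switch can be for containerized code, here is a minimal Python sketch that maps a CPU architecture name to the platform string a container image would be built for. The helper name `container_platform` and the mapping table are assumptions made for this example, not anything AWS or its customers presented; only the `linux/amd64` and `linux/arm64` platform identifiers are standard.

```python
# Hypothetical helper for illustration only: the function name and the
# mapping table are assumptions, not part of any AWS tool.
_PLATFORMS = {
    "x86_64": "linux/amd64",   # x86 instances
    "amd64": "linux/amd64",
    "aarch64": "linux/arm64",  # Arm-based instances such as Graviton
    "arm64": "linux/arm64",
}


def container_platform(machine: str) -> str:
    """Return the container platform string for a CPU architecture name."""
    key = machine.strip().lower()
    if key not in _PLATFORMS:
        raise ValueError(f"unsupported architecture: {machine!r}")
    return _PLATFORMS[key]
```

Passing the result of `platform.machine()` from the standard library would select the platform for whatever host the code happens to run on. The point of the sketch is that the application code stays the same and only this one build-time value changes.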

For ML workloads, Amazon unveiled its second-generation Inferentia processor as part of its new Inf2 instance. Inferentia2 is designed to support ML inferencing on models with billions of parameters, such as many of the new large language models currently in development for applications like real-time speech recognition. The new architecture is specifically designed to scale across thousands of cores, which is what these enormous new models, such as GPT-3, require. In addition, Inferentia2 includes support for a mathematical technique known as stochastic rounding, which AWS describes as “a way of rounding probabilistically that enables high performance and higher accuracy as compared to legacy rounding modes.” To take best advantage of the distributed computing, the Inf2 instance also supports a next-generation version of the company’s NeuronLink ring network architecture, which supposedly offers 4x the performance and 1/10 the latency of existing Inf1 instances. The bottom-line translation is that it can offer 45% higher performance per watt for inferencing than any other option, including GPU-powered ones. Given that, according to AWS, the power consumed by inferencing is often 9 times higher than what model training requires, that’s a big deal.
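To make the idea of rounding probabilistically concrete, here is a minimal Python sketch of stochastic rounding to integers. It is a simplified illustration, not a description of how Inferentia2 implements the technique in hardware (which would operate on low-precision floating-point formats); the function names `stochastic_round` and `mean_of_rounds` are invented for this example.

```python
import math
import random


def stochastic_round(x: float, rng: random.Random) -> int:
    """Round x to a neighboring integer at random.

    The chance of rounding up equals the fractional part of x, so the
    expected value of the result is x itself.
    """
    lower = math.floor(x)
    frac = x - lower  # always in [0, 1)
    return lower + (1 if rng.random() < frac else 0)


def mean_of_rounds(x: float, trials: int, seed: int = 0) -> float:
    """Average many stochastic roundings of the same value."""
    rng = random.Random(seed)
    return sum(stochastic_round(x, rng) for _ in range(trials)) / trials
```

Conventional round-to-nearest always maps 0.25 to 0, so a long series of small values can vanish entirely when accumulated at low precision. With stochastic rounding, 0.25 becomes 1 about a quarter of the time, so the average of many roundings stays close to the true value.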

The third new custom-silicon driven instance is called C7gn, and it features a next-generation AWS Nitro networking card equipped with fifth-generation Nitro chips. Designed specifically for workloads that demand extremely high throughput, such as firewalls, virtual networking, and real-time data encryption/decryption, C7gn is purported to have 2x the network bandwidth and 50% higher packet processing per second than previous instances. Importantly, the new Nitro cards are able to achieve those levels with a 40% improvement in performance per watt versus their predecessors.

All told, Amazon’s ongoing emphasis on custom silicon and its increasingly diverse range of computing options represent an impressively comprehensive set of tools for companies looking to move more of their workloads to the cloud. As with many other aspects of its AWS offerings, the company continues to refine and enhance what has clearly become a very sophisticated, mature set of computing tools. Collectively, they offer a notable and promising view of the future of computing and the new types of applications they can enable.

Bob O’Donnell is the president and chief analyst of TECHnalysis Research, LLC, a market research firm that provides strategic consulting and market research services to the technology industry and professional financial community. You can follow him on LinkedIn at Bob O’Donnell or on Twitter @bobodtech.
