Adapting to the changes in fundamental forces in GenAI

Over the last two years in the GenAI space, I've been fascinated by how conversations with clients have evolved. Initially, discussions revolved around assessments, strategy building, and prompt engineering. These conversations matured into developing POCs, pilot use cases, and even some production deployments. Nowadays, the focus has shifted towards enhancing customer experience and streamlining processes through GenAI-driven automation. Designing solutions for these use cases often involves Retrieval-Augmented Generation (RAG), prompt engineering, and sometimes an agentic approach. Moreover, more mature customers are now discussing the scalability of their GenAI solutions and addressing performance-related issues.
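For readers less familiar with the RAG pattern mentioned above, here is a minimal, self-contained sketch of the idea: retrieve the most relevant context, then assemble it into the prompt sent to the model. The document list and the keyword-overlap scorer below are illustrative placeholders, not a production design; real deployments typically use embeddings, a vector database, and whichever LLM provider the solution is built on.

```python
# Minimal RAG sketch: retrieve relevant context, then build an augmented prompt.
# The documents and the overlap-based scorer are placeholders for illustration only.

DOCUMENTS = [
    "Our claims platform processes prior-authorization requests within 48 hours.",
    "Refunds for cancelled orders are issued to the original payment method.",
    "GPU quotas for the training cluster are reviewed every quarter.",
]

def retrieve(query: str, docs: list[str], top_k: int = 2) -> list[str]:
    """Rank documents by naive keyword overlap with the query."""
    query_terms = set(query.lower().split())
    scored = sorted(docs, key=lambda d: len(query_terms & set(d.lower().split())), reverse=True)
    return scored[:top_k]

def build_prompt(query: str, docs: list[str]) -> str:
    """Assemble the retrieved context and the user question into a single prompt."""
    context = "\n".join(f"- {d}" for d in retrieve(query, docs))
    return f"Answer using only the context below.\n\nContext:\n{context}\n\nQuestion: {query}"

if __name__ == "__main__":
    # The resulting string would be passed to an LLM of your choice.
    print(build_prompt("How fast are prior-authorization requests handled?", DOCUMENTS))
```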

While Parameter-Efficient Fine-Tuning (PEFT) techniques exist, they remain largely academic from the customers' perspective, partly due to the lack of underlying data foundations for LLM builds and the time and effort involved. Recently, I read a few articles that made me think the fundamental forces behind GenAI will change very soon, bringing discussions around model tuning and custom models to the table. Here's why:
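To make PEFT concrete, below is a minimal sketch of LoRA, one common PEFT technique, using Hugging Face's transformers and peft libraries. The base model name and the LoRA hyperparameters are illustrative assumptions, not recommendations; the point is simply that only a small fraction of parameters becomes trainable, which is what keeps time and cost manageable compared with full fine-tuning.

```python
# Minimal LoRA (a common PEFT technique) sketch with Hugging Face peft.
# Model choice and hyperparameters are placeholders for illustration only.
from transformers import AutoModelForCausalLM
from peft import LoraConfig, TaskType, get_peft_model

base_model = AutoModelForCausalLM.from_pretrained("facebook/opt-350m")  # small example model

lora_config = LoraConfig(
    task_type=TaskType.CAUSAL_LM,
    r=8,                                  # rank of the low-rank update matrices
    lora_alpha=16,                        # scaling factor for the update
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # attention projections to adapt
)

model = get_peft_model(base_model, lora_config)
model.print_trainable_parameters()  # typically well under 1% of the base model's parameters

# From here, `model` can be trained with a standard Trainer / PyTorch loop on domain
# data; the base weights stay frozen and only the LoRA adapters are updated.
```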

1. Alternative AI Chips by Cloud Providers: Almost all major cloud providers (Azure, AWS, GCP) have developed their own AI chips designed specifically for model training and inference. Azure has Maia and Cobalt, AWS offers Inferentia and Trainium, and GCP has its TPU. These chips are marketed as faster and cheaper alternatives to Nvidia GPUs.

2. Competition from AMD, Intel, and Others: AMD and Intel have significant order backlogs for their chipsets, promising cheaper and faster options than Nvidia GPUs. This competition is bolstered by substantial VC investment, with $4 billion already funneled into 93 separate efforts according to PitchBook. Nvidia's dominance, with a market cap of $2.7 trillion and $80 billion in annual revenue at 78% gross margins, shows just how lucrative the industry has become. If that were not enough, AI chip sales are expected to hit $400 billion annually within five years, a projection that has sent the rest of the market racing after the opportunity.

3. Edge Device AI Chips: Apple’s latest laptops and tablets, optimized for AI with neural engines, and Qualcomm’s PC chips enabling laptops to run Microsoft AI services, are shifting AI work from server farms to consumer devices. This localizes AI processing, making it more efficient and accessible.

4. OneAPI as an Alternative to CUDA: CUDA has been crucial for building large language models that require hundreds of thousands of GPU cores. However, a coalition of tech companies, including Qualcomm, Google, and Intel, is developing oneAPI technology to create open-source software compatible with multiple AI chips. This move aims to challenge Nvidia's dominance through a cross-device compatible solution.

For practitioners like me, who work with customers on industry-specific problem statements, these changes imply several key points:

1. Cost Mechanics Will Become Crucial: As discussions advance into use cases involving custom model building and fine-tuning, decisions around processors, GPUs, and their associated costs will move to the forefront.

2. Choice of Training Framework and Heterogeneous Computing: Deciding between TensorFlow and PyTorch, and between CUDA and oneAPI, will be essential to avoid vendor lock-in. The ability to build solutions that run across different accelerators, including AMD, Intel, Google TPUs, and Nvidia GPUs, will be critical (see the device-selection sketch after this list).

3. Flexibility, Modularity, Scalability, Interoperability: These qualities will become even more pertinent. The ability to switch between LLM providers, integrate fine-tuned models based on cost dynamics and performance needs, and decide between cloud and edge-device models, as well as between small language models and LLMs, will be crucial topics in client discussions.
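On point 2, here is a minimal sketch of what heterogeneous computing can look like in day-to-day code, using PyTorch's device abstraction so the same script can target Nvidia GPUs (ROCm builds for AMD also report as "cuda"), Intel XPUs, Apple silicon, or plain CPUs. Whether the xpu and mps backends are present depends on your PyTorch build, so treat the checks below as assumptions about the environment rather than guarantees.

```python
# Device-agnostic PyTorch sketch: pick whatever accelerator the current
# environment exposes instead of hard-coding a single vendor's stack.
import torch

def pick_device() -> torch.device:
    if torch.cuda.is_available():
        # Nvidia GPUs; AMD ROCm builds of PyTorch also surface here as "cuda".
        return torch.device("cuda")
    if getattr(torch, "xpu", None) is not None and torch.xpu.is_available():
        # Intel GPUs via the XPU backend (available in newer PyTorch builds).
        return torch.device("xpu")
    if hasattr(torch.backends, "mps") and torch.backends.mps.is_available():
        # Apple-silicon devices.
        return torch.device("mps")
    return torch.device("cpu")

device = pick_device()
x = torch.randn(4, 4, device=device)
print(f"Running on {device}: {x.sum().item():.3f}")
```

Google TPUs are not covered by this check and would typically require the separate torch_xla package, which is exactly the kind of framework-level dependency that makes these abstraction choices matter.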

Despite these changes, it’s still Day 1 in the field of GenAI, and the real value realization from these solutions is yet to come. The landscape is evolving rapidly, and staying ahead means adapting to these transformative forces.

References

1. "Nvidia dominates the AI chip market, but there's rising competition," CNBC
2. "Exclusive: Behind the plot to break Nvidia's grip on AI by targeting software," Reuters



