GPUs is the new oil - How to survive the GPU War...
Sreedhar Gade
Vice President, Engineering & Responsible AI at Freshworks | The Customer & Employee Delight Engine
Assumption 1: AI models & their ability determine how someone can derive value out of AI
Assumption 2: GenAI use cases, how they are implemented define the AI value
Assumption 3: Its the AI feature pricing & monetisation strategy that ultimately helps realise AI implementation value
Assumption 4: It's about how we govern and create controlled/responsible AI that defines longer term sustain value.
So far all these 4 assumptions are right. A combination of these 4 would determine value a company can derive out of GenAI and can become key differentiators to win in the market place.
But there's one more key factor thats emerging faster. Its what causing below news headlines...
They said, infrastructure is commodity, and they they were right then. They also said, what matters is the value of the software thats written on top of infra, which is right again. For a considerable amount of time, it felt that innovation in silicon was good enough, all the innovation is due in AI/ML and other top layers. But now, it looks like the silicon has once again became the key bottleneck with GPUs becoming key ingredients for Generative AI, and they are not available with quantity that the companies are looking for. Making it new scare resource and need for better planning if we need to continue with AI revolution that we are currently in...
领英推荐
First of all, whats a GPU?
A GPU, or Graphics Processing Unit, is a specialized electronic circuit or chip that utilizes parallel processing to accelerate and optimize the rendering and processing of computer graphics and images. Composed of multiple processing cores, GPUs excel at handling complex mathematical calculations and data parallelism, allowing for faster and more efficient computation compared to traditional CPUs. While initially developed for graphics rendering in gaming and visual applications, GPUs have evolved to play a crucial role in various fields such as scientific research, machine learning, cryptocurrency mining, and artificial intelligence, where their parallel architecture and high memory bandwidth make them ideal for handling computationally intensive tasks.
GPUs vs CPUs for your ready reference:
Widespread uses & increasing Datacenter based usage (GenAI & others)
What can I do as CTO/CIO?
In the next article, I will share few methods on how can we use cloud based vs on-prem GPU resources for non-prod Vs prod workloads to minimise cost & maximise innovation in your companies. Remember, if you are the person who is responsible for forecasting, planning on how to keep your company's GenAI dreams running, then you should read-on and the time is now to start planning...
Enterprise Customer Success Manager, MBA
1 年Very interesting post, thanks Sreedhar Gade for sharing