What is the hardware (and cost) needed to fine-tune an AI model? A comparison of various models to date.
We all know that training an AI model from scratch is expensive.
While that is prohibitive for most, how much does it cost to fine-tune an AI model? Cloud services come at different price points, but all of them ultimately come down to the hardware (mostly GPUs and their VRAM) you need to run or fine-tune your chosen model.
In this post I will give some examples of the hardware needed to fine-tune well-known models.
Meta Llama 2
In July 2023 Meta released Llama 2, one of the largest open-source models available at the time. The largest version has 70B parameters. As an example, it is possible to fine-tune the 70B model on a single A100 with 80GB of VRAM using a dataset of 50k examples, each of which is roughly 1,000-1,500 tokens of prompt plus ~500 tokens of response. The run takes roughly 4 days of training, or about $200.
To fine-tune the 7B version instead, on a ~1k-example dataset with a single RTX 30-series card, it takes around 3-4 hours.
For the 30B or 65B versions (sizes from the original LLaMA family), you would need roughly 150 and 300 hours respectively, using about 72GB of VRAM across 3 RTX 30-series GPUs.
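Fitting even the 70B model onto a single 80GB A100 essentially implies parameter-efficient fine tuning rather than a full update of all weights. Below is a minimal sketch, assuming the Hugging Face transformers/peft/bitsandbytes stack and QLoRA-style 4-bit loading; the model id, rank and hyperparameters are illustrative, not the exact setup behind the numbers above.

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

# Illustrative model id; swap in "meta-llama/Llama-2-7b-hf" for a single RTX 30-series card.
model_id = "meta-llama/Llama-2-70b-hf"

# Load the base weights in 4-bit (QLoRA-style) so the 70B model fits on one 80GB A100,
# leaving headroom for activations and the adapter's optimizer state.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForCausalLM.from_pretrained(
    model_id, quantization_config=bnb_config, device_map="auto"
)
model = prepare_model_for_kbit_training(model)

# Train only small low-rank adapters on the attention projections; the 4-bit base stays frozen.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # typically well under 1% of all parameters are trainable
```

From here the model can be handed to a standard Trainer (or trl's SFTTrainer) with the 50k prompt/response pairs formatted as plain text.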
Stable Diffusion
Fine-tuning Stability AI's Stable Diffusion 1.5 on about 200 images takes roughly 20 minutes with an RTX 30-series card and 24GB of VRAM.
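A run like that is typically a LoRA-style job on the UNet rather than a full retrain. Here is a minimal sketch of such a training loop with diffusers and peft; the dataset is a placeholder and the rank, learning rate and step count are illustrative assumptions, loosely following Hugging Face's LoRA training examples.

```python
import torch
from torch.nn.functional import mse_loss
from torch.utils.data import DataLoader
from diffusers import AutoencoderKL, DDPMScheduler, UNet2DConditionModel
from transformers import CLIPTextModel, CLIPTokenizer
from peft import LoraConfig

model_id = "runwayml/stable-diffusion-v1-5"
device = "cuda"

# Load the Stable Diffusion 1.5 components.
tokenizer = CLIPTokenizer.from_pretrained(model_id, subfolder="tokenizer")
text_encoder = CLIPTextModel.from_pretrained(model_id, subfolder="text_encoder").to(device)
vae = AutoencoderKL.from_pretrained(model_id, subfolder="vae").to(device)
unet = UNet2DConditionModel.from_pretrained(model_id, subfolder="unet").to(device)
noise_scheduler = DDPMScheduler.from_pretrained(model_id, subfolder="scheduler")

# Freeze everything, then inject small trainable LoRA adapters into the UNet attention layers.
vae.requires_grad_(False)
text_encoder.requires_grad_(False)
unet.requires_grad_(False)
unet.add_adapter(LoraConfig(r=8, lora_alpha=8,
                            target_modules=["to_q", "to_k", "to_v", "to_out.0"]))
optimizer = torch.optim.AdamW([p for p in unet.parameters() if p.requires_grad], lr=1e-4)

# Placeholder data: in practice this would be your ~200 training images,
# resized to 512x512 and normalised to [-1, 1], each paired with a caption.
images = torch.randn(4, 3, 512, 512)
captions = ["a photo of my subject"] * 4
train_dataloader = DataLoader(list(zip(images, captions)), batch_size=1, shuffle=True)

for epoch in range(1):  # a real run would loop for a few hundred to a few thousand steps
    for pixel_values, caption_batch in train_dataloader:
        # Encode images into the VAE latent space.
        latents = vae.encode(pixel_values.to(device)).latent_dist.sample()
        latents = latents * vae.config.scaling_factor

        # Add noise at a random timestep; the UNet learns to predict that noise.
        noise = torch.randn_like(latents)
        timesteps = torch.randint(0, noise_scheduler.config.num_train_timesteps,
                                  (latents.shape[0],), device=device)
        noisy_latents = noise_scheduler.add_noise(latents, noise, timesteps)

        # Text conditioning from the frozen CLIP text encoder.
        ids = tokenizer(list(caption_batch), padding="max_length",
                        max_length=tokenizer.model_max_length,
                        truncation=True, return_tensors="pt").input_ids.to(device)
        encoder_hidden_states = text_encoder(ids)[0]

        # Standard denoising objective: only the LoRA parameters receive gradients.
        noise_pred = unet(noisy_latents, timesteps, encoder_hidden_states).sample
        loss = mse_loss(noise_pred, noise)
        loss.backward()
        optimizer.step()
        optimizer.zero_grad()
```

In practice most people use the ready-made LoRA/DreamBooth training scripts shipped in the diffusers repository rather than writing the loop by hand.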
Google Gemma
Google recently released the 2B and 7B Gemma LLMs (February 2024). I have no precise tests to report, but it is safe to say that a quantized (i.e. reduced-precision) version of the 7B model will require up to about 8GB of VRAM and an A100-class GPU, while the 2B model can run on a T4.
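As a rough illustration of what loading a quantized Gemma looks like in practice, here is a minimal sketch using the Hugging Face transformers stack with 4-bit NF4 quantization; the exact footprint depends on the quantization scheme, so treat the printed number as an estimate.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

model_id = "google/gemma-7b"  # or "google/gemma-2b" for the smaller variant

# 4-bit NF4 quantization shrinks the 7B weights to a few GB of VRAM.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, quantization_config=bnb_config, device_map="auto"
)

# Rough check of how much memory the quantized weights occupy.
print(f"{model.get_memory_footprint() / 1e9:.1f} GB")
```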
Grok-1
Finally, the latest and largest open model released to date (March 2024): Grok-1, with 314B parameters.
There are no clear indications yet, but without quantisation we are talking about 1-2TB of VRAM or more, and therefore multiple top-end GPUs (10-20).
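A quick back-of-the-envelope calculation shows where that figure comes from. The sketch below counts only the memory needed to hold the weights at common precisions; full fine-tuning adds gradients and optimizer state on top, which is what pushes the total into the multi-terabyte range.

```python
# Back-of-envelope: memory needed just to hold Grok-1's 314B weights at common precisions.
params = 314e9

for label, bytes_per_param in [("fp16/bf16", 2), ("int8", 1), ("int4", 0.5)]:
    gb = params * bytes_per_param / 1e9
    print(f"{label}: ~{gb:.0f} GB of weights")

# Output:
# fp16/bf16: ~628 GB of weights
# int8: ~314 GB of weights
# int4: ~157 GB of weights
```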
That said, since it took less than a month to quantise the Gemma models, I would expect that within 1-2 months we will have quantised versions of Grok-1 that can be fine-tuned on GPUs with 24-60GB of VRAM.
Let’s see.
#ai #artificialintelligence #business #innovation #technology