登录查看更多内容

The Big Problem with Open Source LLMs

Sukumar Rajagopal

Founder & CEO, Tiny Magiq; EiR at CMI Algolabs;xSVP/CIO & Head of Innovation, Cognizant

发布日期: 2024年3月7日

Meta's LLAMA2 and the various open source models from Mistral have become quite capable. The recently released Mistral Large is supposed to be better than Open AI's GPT-4 the current leader. LLAMA3 is imminent and will likely challenge GPT-4.

One very important reason to pursue the open source LLMs is the confidentiality needs of most Fortune 2000 companies. We can run these on private hosted infra and not make any API calls to any public hosted LLMs like GPT4/3.5/Claude etc

However the biggest problem with running these models is the GPU infra required for it to give reasonably fast responses [I am talking about the full-blown models and not about the quantized models which can be run even on your phone]. This high power GPU Infra is not easy to put together on shoestring budgets especially when you don't yet have the business case.

Hugging Face Assistants

Enter Hugging Face Assistants platform, a direct competitor to the Open AI GPT Builder and the GPT Store [ I wrote a post a couple of weeks about the GPT Store phenomenon].

When I heard about this, I got very excited. I took my first ever GPT on the GPT Store and repurposed it to run on the Hugging Face Assistants Platform using LLAMA2 and the results were almost as good as the GPT4 in most cases. You can try it here. All our other agents including this one are on this page.

领英推荐

TAI #141: Claude 3.7 Sonnet; Software Dev Focus in…

Towards AI 3 周前

TAI #126; New Gemini, Pixtral, and Qwen 2.5 model…

Towards AI 4 个月前

Why intelligent observability is essential for our…

New Relic 1 个月前

Over the last couple of months, I have built several agents for our customers on the Hugging Face platform to classify incidents, generate python code, reverse engineer COBOL code, update JSONs.

Please try it build your own agents and share your experiences here. Now will this kill the GPT Store as mentioned in the cover image? Nah. That was just to make you read the post :)

Tiny Curator's desk - one of the biggest production use cases by way of business impact I have heard so far is this one from Klarna.

If you like this newsletter, please share it with your colleagues and friends.

#TMT3 #secretsofgenaidigitaldisruptors #genai

TMT3 Gen AI Digital Disruptors

9,126 位关注者

Arunkumar N T

Board Director & Investor | Business Advisor in AI, Digital, Emerging Technologies.

1 年

Brilliantly provocative! Open Source LLMs and platforms like Highing Face are just the tip of the AI iceberg. Enterprises - and even individuals - will soon drive very niche or ‘narrow’ use cases that will obviate the need for heavy GPU compute; synthetic data and ‘within firewall’ controlled deployments will lead to a distributed GitHub like environment. Exciting times ahead…

2 次回应

Venkatarangan Thirumalai Nallan Chakravarthy

Keynote Speaker on Generative AI and The Founder Catalyst

1 年

Useful, till now I have been running Mistral and LLAMA2 locally with OLLAMA, but this one seems to be convenient. Thanks for sharing.

1 次回应

查看更多评论

要查看或添加评论，请登录

Sukumar Rajagopal的更多文章

FSA, TMT Summit & NotebookLM magic

2025年3月15日

FSA, TMT Summit & NotebookLM magic

?????????????? ???????????? ?????????????????????? In just a few days from now March 19th 4-8PM IST, we are doing our…

8 条评论
Special Dispatch - Time Sensitive

2024年12月16日

Special Dispatch - Time Sensitive

Updated Dec 21, 2024 12 Noon: ???????????????????????? Hello all, I got more than twice the number of pass requests I…

137 条评论
Gratitude Circle GPT

2024年9月23日

Gratitude Circle GPT

Updated Sep 24, 2024 - 1. Nagaraja Srivatsan got the GPT to generate a story about the gratitude circle.

28 条评论
How do disruptors think different?

2024年7月21日

How do disruptors think different?

Have you ever wondered how Disruptors manage to look at problems/situations from a very different point of view to…

10 条评论
Disrupting Senior Care

2024年6月27日

Disrupting Senior Care

First 2 parts [~27K views] I had posted the first 2 parts of this interview with Venkataraman Krishnan of Prayojana:…

6 条评论
Build an app in 15 min

2024年4月25日

Build an app in 15 min

Last week I shared the short video of how I built a Apple iPad/Pencil app with zero javascript skills. I got some…
The Magic of p5.js

2024年4月18日

The Magic of p5.js

Believe it or not, I managed to build an Apple iPad/Apple Pencil app in a couple of hours with zero knowledge of…

8 条评论
Gen AI Demo

2024年3月21日

Gen AI Demo

I recently did my Secrets of Gen AI Digital Disruptors session for a Gen AI program for a group of Supply Chain…

4 条评论
Can you make a presentation in 3 minutes?

2024年2月22日

Can you make a presentation in 3 minutes?

Last week, I covered the four one-minute picopresentations from the 2nd edition of the world's first nanopresentation…

4 条评论
The GPT Store Phenomenon

2024年2月1日

The GPT Store Phenomenon

Stunning Achievement Open AI recently made an announcement [ Introducing the GPT Store ] - 3 millions GPTs in just 2…

16 条评论

See all articles

The Big Problem with Open Source LLMs

Sukumar Rajagopal

Founder & CEO, Tiny Magiq; EiR at CMI Algolabs;xSVP/CIO & Head of Innovation, Cognizant

领英推荐

TMT3 Gen AI Digital Disruptors

9,126 位关注者

Sukumar Rajagopal的更多文章

社区洞察

其他会员也浏览了

Issue #289 - The ML Engineer ??

Custom Enterprise LLM/RAG with Real-Time Fine-Tuning

Artificial Intelligence #85

AI - Monday, February 3, 2025: Commentary with Notable and Interesting News, Articles, and Papers

Artificial Intelligence #64

Digixvalley Granite 3.0: open, state-of-the-art Enterprise Models

LLM Developers: The future of software development

A big Update for Building LLMs for Production!

Looking for the CAP theorem of AI agents

Turning ideas into real things: Meet our synthesis team

领英推荐

TMT3 Gen AI Digital Disruptors

9,126 位关注者

Sukumar Rajagopal的更多文章

FSA, TMT Summit & NotebookLM magic

Special Dispatch - Time Sensitive

Gratitude Circle GPT

How do disruptors think different?

Disrupting Senior Care

Build an app in 15 min

The Magic of p5.js

Gen AI Demo

Can you make a presentation in 3 minutes?

The GPT Store Phenomenon

社区洞察

其他会员也浏览了

Issue #289 - The ML Engineer ??

Custom Enterprise LLM/RAG with Real-Time Fine-Tuning

Artificial Intelligence #85

AI - Monday, February 3, 2025: Commentary with Notable and Interesting News, Articles, and Papers

Artificial Intelligence #64

Digixvalley Granite 3.0: open, state-of-the-art Enterprise Models

LLM Developers: The future of software development

A big Update for Building LLMs for Production!

Looking for the CAP theorem of AI agents

Turning ideas into real things: Meet our synthesis team