The Big Problem with Open Source LLMs

The Big Problem with Open Source LLMs

Meta's LLAMA2 and the various open source models from Mistral have become quite capable. The recently released Mistral Large is supposed to be better than Open AI's GPT-4 the current leader. LLAMA3 is imminent and will likely challenge GPT-4.

One very important reason to pursue the open source LLMs is the confidentiality needs of most Fortune 2000 companies. We can run these on private hosted infra and not make any API calls to any public hosted LLMs like GPT4/3.5/Claude etc

However the biggest problem with running these models is the GPU infra required for it to give reasonably fast responses [I am talking about the full-blown models and not about the quantized models which can be run even on your phone]. This high power GPU Infra is not easy to put together on shoestring budgets especially when you don't yet have the business case.

Hugging Face Assistants

Enter Hugging Face Assistants platform, a direct competitor to the Open AI GPT Builder and the GPT Store [ I wrote a post a couple of weeks about the GPT Store phenomenon].

When I heard about this, I got very excited. I took my first ever GPT on the GPT Store and repurposed it to run on the Hugging Face Assistants Platform using LLAMA2 and the results were almost as good as the GPT4 in most cases. You can try it here. All our other agents including this one are on this page.

Over the last couple of months, I have built several agents for our customers on the Hugging Face platform to classify incidents, generate python code, reverse engineer COBOL code, update JSONs.

Please try it build your own agents and share your experiences here. Now will this kill the GPT Store as mentioned in the cover image? Nah. That was just to make you read the post :)

Tiny Curator's desk - one of the biggest production use cases by way of business impact I have heard so far is this one from Klarna.

If you like this newsletter, please share it with your colleagues and friends.

#TMT3 #secretsofgenaidigitaldisruptors #genai


Arunkumar N T

Board Director & Investor | Business Advisor in AI, Digital, Emerging Technologies.

1 年

Brilliantly provocative! Open Source LLMs and platforms like Highing Face are just the tip of the AI iceberg. Enterprises - and even individuals - will soon drive very niche or ‘narrow’ use cases that will obviate the need for heavy GPU compute; synthetic data and ‘within firewall’ controlled deployments will lead to a distributed GitHub like environment. Exciting times ahead…

Venkatarangan Thirumalai Nallan Chakravarthy

Keynote Speaker on Generative AI and The Founder Catalyst

1 年

Useful, till now I have been running Mistral and LLAMA2 locally with OLLAMA, but this one seems to be convenient. Thanks for sharing.

要查看或添加评论,请登录

Sukumar Rajagopal的更多文章

  • FSA, TMT Summit & NotebookLM magic

    FSA, TMT Summit & NotebookLM magic

    ?????????????? ???????????? ?????????????????????? In just a few days from now March 19th 4-8PM IST, we are doing our…

    8 条评论
  • Special Dispatch - Time Sensitive

    Special Dispatch - Time Sensitive

    Updated Dec 21, 2024 12 Noon: ???????????????????????? Hello all, I got more than twice the number of pass requests I…

    137 条评论
  • Gratitude Circle GPT

    Gratitude Circle GPT

    Updated Sep 24, 2024 - 1. Nagaraja Srivatsan got the GPT to generate a story about the gratitude circle.

    28 条评论
  • How do disruptors think different?

    How do disruptors think different?

    Have you ever wondered how Disruptors manage to look at problems/situations from a very different point of view to…

    10 条评论
  • Disrupting Senior Care

    Disrupting Senior Care

    First 2 parts [~27K views] I had posted the first 2 parts of this interview with Venkataraman Krishnan of Prayojana:…

    6 条评论
  • Build an app in 15 min

    Build an app in 15 min

    Last week I shared the short video of how I built a Apple iPad/Pencil app with zero javascript skills. I got some…

  • The Magic of p5.js

    The Magic of p5.js

    Believe it or not, I managed to build an Apple iPad/Apple Pencil app in a couple of hours with zero knowledge of…

    8 条评论
  • Gen AI Demo

    Gen AI Demo

    I recently did my Secrets of Gen AI Digital Disruptors session for a Gen AI program for a group of Supply Chain…

    4 条评论
  • Can you make a presentation in 3 minutes?

    Can you make a presentation in 3 minutes?

    Last week, I covered the four one-minute picopresentations from the 2nd edition of the world's first nanopresentation…

    4 条评论
  • The GPT Store Phenomenon

    The GPT Store Phenomenon

    Stunning Achievement Open AI recently made an announcement [ Introducing the GPT Store ] - 3 millions GPTs in just 2…

    16 条评论

社区洞察

其他会员也浏览了