The Best Open-Source Generative AI Models Available Today
The Best Open-Source Generative AI Models Available Today

The Best Open-Source Generative AI Models Available Today

Thank you for reading my latest article The Best Open-Source Generative AI Models Available Today. Here at LinkedIn and at Forbes I regularly write about management and technology trends.

To read my future articles simply join my network by clicking 'Follow'. Also feel free to connect with me via Twitter , Facebook , Instagram , Podcast or YouTube .


There are many reasons that businesses may want to choose open-source over proprietary tools when getting started with generative AI.

This could be because of cost, opportunities for customization and optimization, transparency or simply the support that’s offered by the community.

There are disadvantages too, of course, and I cover the pros and cons of each option more fully in this article .

With software generally, the term open-source simply means that the source code is publicly available and can be used, free of charge, for pretty much any purpose.

When it comes to AI models, though, there has been some debate about exactly what this entails, as we will get into it as we discuss the individual models covered here. So, let's dive in.

Stable Diffusion

One of the most powerful and flexible image generation models, and certainly the most widely-used open-source image models, Stable Diffusion 3 (the latest version as of writing) supports text-to-image as well as image-to-image generation and has become well-known for its ability to create highly realistic and detailed images.

As is common with open-source software, using Stable Diffusion isn’t quite as straightforward as using commercial, proprietary tools like ChatGPT. Rather than having its own web interface, it’s accessed through third-party tools built by commercial entities, including DreamStudio and Stable Diffusion Web . The alternative is to compile and run it yourself locally , and this requires providing your own compute resources as well as technical know-how.

Meta Llama 3

This is a family of language models available in various sizes, making it suitable for different applications, from lightweight mobile clients to full-size cloud deployments. The same model that powers the Meta AI assistant available across its social media platforms can be deployed by anyone for many uses including natural language generation and creating computer code. One of its strong points is its ability to run on relatively low-powered hardware. However, as with some of the other models covered here, there is some debate as to whether it can truly be considered open-source, as Meta has not disclosed exact details of its training data.

Mistral AI

Mistral is a French startup that has developed several generative AI models that it has made available under open-source licenses. These include Mistral 7B, which is designed to be lightweight and easy to deploy on low-power hardware, and the more powerful Mistral 8x22B. It has a strong user community offering support, and positions itself as a highly flexible and customizable generative language model.

GPT-2

OpenAI has open-sourced the second version of their LLM – essentially an earlier version of the engines that are now used to power ChatGPT. While it isn’t as big, powerful or flexible as the later GPT-3.5 or GPT-4 (built on 1.2 billion parameters compared to GPT-4’s one-trillion plus), it’s still considered to be perfectly adequate for many language-based tasks such as generating text or powering chatbots. GPT-2 is made available by OpenAI under the MIT license, which is generally considered to be compliant with open-source principles.

BLOOM

BLOOM is described as the world’s largest open, multilingual language model, built on 176 billion parameters. Development was led by Hugging Face, a repository of open-source AI resources working alongside a team of over 1,000 researchers as part of a global collaborative project known as BigScience. The aim was to create a truly open and transparent LLM available to anyone who agrees to the terms of the project’s Responsible AI License . Technically, this means it isn’t quite open source, but it is freely available to use and distribute, as long as it isn’t used for harmful purposes as defined by the terms of the license. This makes it a very interesting experiment in the critically important domain of developing and distributing ethical AI.

Grok.AI

This LLM also claims to be the world’s largest open-source model, although again there is some debate as to whether it technically fills all of the criteria for being truly open source.

Grok was designed and built by X.ai , a startup founded by Elon Musk following his split from OpenAI. This split has been reported as being caused by disagreements over exactly what “open” means when it comes to AI models.

Rather than using the term large language model, X describes Grok as a “mixture of experts” model, reflecting the fact that the base model is designed to be more general-purpose and is not specifically trained for creating dialogue, as is the case with, for example, ChatGPT.

As with Llama, skepticism of Grok’s open-source status is based on the fact that while X.ai has made the weights and architecture of the model publicly available, it hasn’t revealed all of the code or training data.

Falcon

Two models of this LLM architecture have been made freely available by its developers, the Technology Innovation Institute, a research institution founded by the government of Abu Dhabi. Both models – the more portable Falcon 40B and the more powerful 180B, have been released as open source and reportedly come second only to GPT-4 on Open Face’s leaderboard of LLM performance. While the smaller model is released under the Apache 2.0 license – generally considered to fit the definition of open-source - the larger model has had some conditions attached to its use and distribution. ?

?

This exploration into the realm of open-source generative AI tools illuminates the diverse array of options available and underscores the transformative potential these technologies hold for businesses eager to leverage AI's power while embracing transparency, cost-efficiency, and robust community support.


About Bernard Marr

Bernard Marr is a world-renowned futurist, influencer and thought leader in the fields of business and technology, with a passion for using technology for the good of humanity. He is a best-selling author of over 20 books , writes a regular column for Forbes and advises and coaches many of the world’s best-known organisations.

He has a combined following of 4 million people across his social media channels and newsletters and was ranked by LinkedIn as one of the top 5 business influencers in the world. Bernard’s latest book is ‘Generative AI in Practice ’.


Itai Carmeli

There is no "I" in AI | Customer-Focused Integration & Technical Solutions Expert: Bridging the gap between technology and people.

6 个月

Kinda lost me on "mistral.ai" - What's so great about a model with data scraped till 2021 when we're half way through 2024?

回复
Marcelo Grebois

? Infrastructure Engineer ? DevOps ? SRE ? MLOps ? AIOps ? Helping companies scale their platforms to an enterprise grade level

6 个月

Fantastic insights! ?? Bernard Marr

回复
Antoine Clerc-Renaud

Get the professional website you envisioned!

6 个月

Love the insights shared!

回复

要查看或添加评论,请登录

社区洞察

其他会员也浏览了