登录查看更多内容

Open Source Large Language Models

Frank Morales Aguilera, BEng, MEng, SMIEEE

Boeing Associate Technical Fellow /Engineer /Scientist /Inventor /Cloud Solution Architect /Software Developer /@ Boeing Global Services

发布日期: 2024年2月19日

Introduction

Large Language Models (LLMs) are AI systems that model and process human language[1]. They are called “large” because they have hundreds of millions or even billions of parameters pre-trained using a massive corpus of text data[1]. LLMs are the foundation models of popular and widely used chatbots[1]. However, a parallel movement in the LLM space is rapidly gaining pace: open-source LLMs[1].

Proprietary vs Open-Source LLMs

Proprietary LLMs, such as GPT-4 and Google’s PaLM 2, are owned by a company and can only be used by customers after buying a license[1]. This license comes with rights, possible restrictions on using the LLM, and limited information on the mechanisms behind the technology[1].

On the other hand, open-source LLMs are free and available for anyone to access, use for any purpose, modify, and distribute[2]. The term “open source” refers to the LLM code and underlying architecture being accessible to the public, meaning developers and researchers can use, improve, or modify the model[2].

Benefits of Open-Source LLMs

There are multiple short-term and long-term benefits to choosing open-source LLMs instead of proprietary LLMs[1]:

Enhanced data security and privacy: One of the biggest concerns of using proprietary LLMs is the risk of data leaks or unauthorized access to sensitive data by the LLM provider[1]. By using open-source LLM, companies will be solely responsible for protecting personal data, as they will completely control it[1].
Cost savings and reduced vendor dependency: Most proprietary LLMs require a license to use them[1]. This differs from open-source LLMs, which are usually free [1].

Popular Open-Source LLMs

The open-source community has already achieved significant milestones, with many open-source LLMs available for different purposes[1]. Some of the top open-source LLMs for 2024 include LLaMA 2, BLOOM, BERT and Mistral7B[4-6]. These models are all licensed for commercial use[3].

Case study

I developed two notebooks covering Mistral 7B [7] and Mixtral_8x7B[8] LLM in Google Colab.?

Pavan Belagatti 2 个月前

Almost Timely News: How Large Language Models Are…

Christopher Penn 1 年前

?? Getting RAG Right: All in One Go

Pascal Biese 4 个月前

Conclusion

Open-source LLMs promise to make the rapidly growing field of LMMs and generative AI more accessible, transparent, and innovative[1]. They offer enhanced data security, privacy, cost savings, and reduced vendor dependency[1]. With the rise of open-source LLMs, the future of generative AI looks promising and exciting.

References

1.-?8 Top Open-Source LLMs for 2024 and Their Uses | DataCamp

2.-?Open source large language models: Benefits, risks and types - IBM Blog

3.-?GitHub - eugeneyan/open-llms: ?? A list of open LLMs available for commercial use.

4.-?Mistral’s Open Source LLM | Internet Public Library ( ipl.org )

5.-?Mistral 7B: An Open-Source LLM Pushing the Frontiers of AI - Lusera ( luseratech.com )

6.-?GitHub - mistralai/mistral-src: Reference implementation of Mistral AI 7B v0.1 model.

7.-?MLxDL/Mistral-7B-Instruct.ipynb at main · frank-morales2020/MLxDL · GitHub

8.-?MLxDL/Mixtral_8x7B.ipynb at main · frank-morales2020/MLxDL · GitHub

Alex Armasu

Founder & CEO, Group 8 Security Solutions Inc. DBA Machine Learning Intelligence

9 个月

Grateful for your post!

Alex Armasu

Founder & CEO, Group 8 Security Solutions Inc. DBA Machine Learning Intelligence

9 个月

Thank you for your valuable post!

1 次回应

Alex Armasu

Founder & CEO, Group 8 Security Solutions Inc. DBA Machine Learning Intelligence

9 个月

Thanks a bunch for posting!

2 次回应

Alex Armasu

Founder & CEO, Group 8 Security Solutions Inc. DBA Machine Learning Intelligence

9 个月

Gratitude for your contribution!

1 次回应

查看更多评论

要查看或添加评论，请登录

Frank Morales Aguilera, BEng, MEng, SMIEEE的更多文章

Top 20 Must-Read Generative AI Books for Professional Growth

2024年9月20日

Top 20 Must-Read Generative AI Books for Professional Growth

The article provides a curated list of 20 essential books that offer a deep dive into the field of Generative AI. This…
Fine-Tuning the LLM Mistral-7B-Instruct-v0.3 for Text-to-SQL with SQL-Create-Context Dataset and Enhanced Training Techniques

2024年6月25日

Fine-Tuning the LLM Mistral-7B-Instruct-v0.3 for Text-to-SQL with SQL-Create-Context Dataset and Enhanced Training Techniques

Frank Morales Aguilera, BEng, MEng, SMIEEE Boeing Associate Technical Fellow /Engineer /Scientist /Inventor /Cloud…
Integration of GPT-4 with RAG Fusion, PostgreSQL, and LlamaIndex

2024年2月22日

Integration of GPT-4 with RAG Fusion, PostgreSQL, and LlamaIndex

Introduction Generative Pre-trained Transformer 4 (GPT-4) is a state-of-the-art language model developed by OpenAI[1]…
Smaug-72B: The Pinnacle of Open-Source Language Models

2024年2月21日

Smaug-72B: The Pinnacle of Open-Source Language Models

Introduction Smaug-72B, named after the legendary dragon from J.R.
Diffusion Transformer and Its Applications, Including OpenAI's Sora

2024年2月20日

Diffusion Transformer and Its Applications, Including OpenAI's Sora

Diffusion Transformer and Its Applications, Including OpenAI's Sora Introduction Diffusion Transformer (DiT) is a novel…

2 条评论
Langchain with Mistral LLM using Embeddings and PostgreSQL with pg_embedding

2024年2月20日

Langchain with Mistral LLM using Embeddings and PostgreSQL with pg_embedding

Langchain is a revolutionary technology that leverages the power of language processing to create a unique chain of…
Flash Attention 2 in Large Language Models

2024年2月19日

Flash Attention 2 in Large Language Models

Introduction Large Language Models (LLMs) such as GPT3/4, Falcon, and LLama are rapidly advancing in tackling…

1 条评论
Mistral LLM: A New Era in Language Models

2024年2月18日

Mistral LLM: A New Era in Language Models

Introduction Mistral LLM, or Large Language Model, is a groundbreaking development in artificial intelligence. It is a…

5 条评论
Foundation Models: A Revolution in AI

2024年2月17日

Foundation Models: A Revolution in AI

Introduction Foundation models, also known as pre-trained models, represent a significant advancement in artificial…
Generative AI: From Text to Video. Overview of the groundbreaking OpenAI foundation model called SORA

2024年2月16日

Generative AI: From Text to Video. Overview of the groundbreaking OpenAI foundation model called SORA

Introduction Generative Artificial Intelligence (AI) has revolutionized how we create and interact with digital…

See all articles

Open Source Large Language Models

Frank Morales Aguilera, BEng, MEng, SMIEEE

Boeing Associate Technical Fellow /Engineer /Scientist /Inventor /Cloud Solution Architect /Software Developer /@ Boeing Global Services

Introduction

Proprietary vs Open-Source LLMs

Benefits of Open-Source LLMs

Popular Open-Source LLMs

Case study

领英推荐

Conclusion

References

Frank Morales Aguilera, BEng, MEng, SMIEEE的更多文章

社区洞察

其他会员也浏览了

?? 3 Ways to Efficient AI

LLM Pulse - September 16, 2024

Issue #228 - THE ML ENGINEER ??

Article - The Rapidly Evolving Landscape of Large Language Models

Our 4-Tool Stack + Strategy for Building Enterprise AI Solutions on LLMs - AI&YOU #53

A 100T Language Model?

Trustworthy AI - Latest Insights

Large Language Model or Large Data Compression Technique? The Illusion of Intelligence.

On-Device LLM - Future is EDGE AI

Introduction

Proprietary vs Open-Source LLMs

Benefits of Open-Source LLMs

Popular Open-Source LLMs

Case study

领英推荐

Conclusion

References

Frank Morales Aguilera, BEng, MEng, SMIEEE的更多文章

Top 20 Must-Read Generative AI Books for Professional Growth

Fine-Tuning the LLM Mistral-7B-Instruct-v0.3 for Text-to-SQL with SQL-Create-Context Dataset and Enhanced Training Techniques

Integration of GPT-4 with RAG Fusion, PostgreSQL, and LlamaIndex

Smaug-72B: The Pinnacle of Open-Source Language Models

Diffusion Transformer and Its Applications, Including OpenAI's Sora

Langchain with Mistral LLM using Embeddings and PostgreSQL with pg_embedding

Flash Attention 2 in Large Language Models

Mistral LLM: A New Era in Language Models

Foundation Models: A Revolution in AI

Generative AI: From Text to Video. Overview of the groundbreaking OpenAI foundation model called SORA

社区洞察

其他会员也浏览了

?? 3 Ways to Efficient AI

LLM Pulse - September 16, 2024

Issue #228 - THE ML ENGINEER ??

Article - The Rapidly Evolving Landscape of Large Language Models

Our 4-Tool Stack + Strategy for Building Enterprise AI Solutions on LLMs - AI&YOU #53

A 100T Language Model?

Trustworthy AI - Latest Insights

Large Language Model or Large Data Compression Technique? The Illusion of Intelligence.

On-Device LLM - Future is EDGE AI