How to Run ChatGPT-like LLMs Locally on Your Computer in 3 Easy Steps
Running Large Language Models (LLMs) similar to ChatGPT locally on your computer, without an Internet connection, is now more straightforward thanks to llamafile, a tool developed by Justine Tunney of the Mozilla Internet Ecosystem (MIECO) program and Mozilla's innovation group. Llamafile is a game-changer in the world of LLMs, enabling you to run these models locally with ease.
In this post, I’ll show you how to use llamafile to run two models locally on your Mac: LLaVA 1.5, an open-source multimodal LLM capable of handling both text and image inputs, and Mistral 7B, an open-source LLM known for its advanced natural language processing and efficient text generation.
What is llamafile?
Llamafile transforms LLM weights into executable binaries: it packages the model weights and all the code needed to run the model into a single, multi-gigabyte file, which in many cases also includes a full local server with a web UI for interaction. Because it is compiled with Cosmopolitan Libc, the same file runs across multiple operating systems and hardware architectures, making it far more accessible for users to distribute and run these models locally on their own computers.
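To make this concrete, the entire workflow boils down to three terminal commands. This is a minimal sketch for macOS/Linux; the file name and URL below are placeholders, not a real release (actual llamafile builds are published on the project's release page):

```shell
# 1. Download a llamafile -- a single file containing weights, runtime, and web UI
curl -L -o model.llamafile https://example.com/path/to/model.llamafile

# 2. Mark it executable (on Windows you would instead rename it with an .exe suffix)
chmod +x model.llamafile

# 3. Run it -- this starts a local server with a web UI for chatting with the model
./model.llamafile
```

The same file works unmodified on macOS, Linux, Windows, and several BSDs, which is the core of what Cosmopolitan Libc enables.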
What is LLaVA 1.5?
LLaVA 1.5 is an open-source large multimodal model that supports text and image inputs, similar to GPT-4 Vision. It is an auto-regressive, transformer-based language model trained by fine-tuning LLaMA/Vicuna on GPT-generated multimodal instruction-following data.
What is Mistral 7B?
Mistral 7B is an open-source large language model with 7.3 billion parameters developed by Mistral AI. It excels in generating coherent text and performing various NLP tasks. Its unique sliding window attention mechanism allows for faster inference and handling of longer text sequences. Notable for its fine-tuning capabilities, Mistral 7B can be adapted to specific tasks, and it has shown impressive performance in benchmarks, outperforming many similar models.
Here’s how to start using LLaVA 1.5 or Mistral 7B on your own computer with llamafile. Don’t be intimidated; the setup process is very straightforward!
Setting Up LLaVA 1.5
One Time Setup
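The one-time setup comes down to downloading the LLaVA 1.5 llamafile and marking it executable. A sketch for macOS (the exact file name and URL are assumptions based on the llamafile project's published builds; check its release page for the current link and expect a download of several gigabytes):

```shell
# Download the LLaVA 1.5 llamafile (file name/URL may differ per release)
curl -L -O https://huggingface.co/jartine/llava-v1.5-7B-GGUF/resolve/main/llava-v1.5-7b-q4.llamafile

# Make it executable -- needed only once
chmod +x llava-v1.5-7b-q4.llamafile
```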
Using LLaVA 1.5
Every time you want to use LLaVA on your computer, follow these steps:
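Assuming the file from the one-time setup is in your current directory (the file name is the same assumption as above), launching the model looks like this:

```shell
# Start the model -- this launches a local server and, by default, a web UI
./llava-v1.5-7b-q4.llamafile

# Then chat with it in your browser at:
#   http://localhost:8080
```

Once the server is up, you can also query it from another Terminal window instead of the browser; llamafile exposes llama.cpp's completion endpoint:

```shell
curl -s http://localhost:8080/completion \
  -H 'Content-Type: application/json' \
  -d '{"prompt": "Describe llamafile in one sentence.", "n_predict": 64}'
```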
Terminating the process
Once you're done using the LLaVA 1.5 model, you can terminate the process. To do this, return to the Terminal where the server is running. Simply press Ctrl + C. This key combination sends an interrupt signal to the running server, effectively stopping it.
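If you have closed that Terminal window and the server is still running in the background, you can also stop it by port. This assumes the default port 8080 and standard macOS/Linux tooling:

```shell
# Find the process listening on port 8080 and stop it
lsof -ti :8080 | xargs kill
```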
Setting Up Mistral 7B
One Time Setup
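As with LLaVA, the one-time setup is a download plus a `chmod`. The file name and URL below are assumptions (Mistral 7B llamafiles have been published in several instruct versions and quantizations; check the llamafile project's release page for the current build):

```shell
# Download a Mistral 7B Instruct llamafile (file name/URL may differ per release)
curl -L -O https://huggingface.co/jartine/Mistral-7B-Instruct-v0.2-llamafile/resolve/main/mistral-7b-instruct-v0.2.Q4_0.llamafile

# Make it executable -- needed only once
chmod +x mistral-7b-instruct-v0.2.Q4_0.llamafile
```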
Using Mistral 7B
Every time you want to use Mistral 7B on your computer, follow these steps:
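Assuming the file downloaded in the one-time setup is in your current directory (same file-name assumption as above), starting it is a single command:

```shell
# Start the model -- this launches a local server and, by default, a web UI
./mistral-7b-instruct-v0.2.Q4_0.llamafile

# Then chat with it in your browser at:
#   http://localhost:8080
```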
Terminating the process
Once you're done using the Mistral 7B model, you can terminate the process. To do this, return to the Terminal where the server is running. Simply press Ctrl + C. This key combination sends an interrupt signal to the running server, effectively stopping it.
Conclusion
The introduction of llamafile significantly simplifies the deployment and use of advanced LLMs like LLaVA 1.5 or Mistral 7B for personal, development, or research purposes. This tool opens up new possibilities in the realm of AI and machine learning, making it more accessible for a wider range of users.
Originally published at https://ppaolo.substack.com