Working with LLMs Locally Using Ollama
In the rapidly evolving world of artificial intelligence, large language models (LLMs) like GPT-4, LLaMA, and Phi3.5 have become invaluable tools for developers and researchers alike. However, accessing these models usually means going through cloud-based services, which can be costly and leave you dependent on external providers like OpenAI or Anthropic. What if you could run powerful models locally, on modest hardware, without spending a penny? Enter Ollama, Open-WebUI, continue.dev, and aider.chat: tools that make local LLM deployment on Windows 10 not only possible but remarkably straightforward. I ran an experiment over the weekend and successfully tried it all out.
Installing LLMs Locally with Ollama
Ollama is a powerful yet user-friendly tool that simplifies installing and running LLMs on your local machine. The setup is surprisingly easy, even if you're not an AI expert.
Step-by-Step Installation
1. Download and Install Ollama:
- Visit the Ollama website (https://ollama.com) and download the installer for Windows 10. The process is very straightforward: you just download the .exe file and run it.
- Run the installer, and Ollama will guide you through the installation, taking care of all dependencies automatically (a quick sanity check from the terminal is shown after this list).
2. Setup Open-WebUI:
- Open-WebUI is an excellent companion to Ollama: it provides a web-based interface that makes interacting with your local models easy.
- Start from the Open-WebUI site (https://openwebui.com), which points you to the installation instructions.
- This part is a bit tricky, as there are two installation methods: Docker, or a manual Python install using pip. I used Miniconda with the pip-based installation; for me, that's easier than running Docker, even though I have Docker on my machine (the conda/pip commands I used are sketched after this list).
- The pip-based install pulls in the necessary dependencies automatically.
3. Hardware Requirements:
- Ollama and Open-WebUI are optimized for running on modest hardware. Even with a mid-range GPU like my Nvidia GeForce GTX 1660, you'll be able to experiment with powerful models like Microsoft's Phi3.5 and Meta's Llama 3.1.
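Once the Ollama installer finishes (step 1), a quick sanity check from a Windows terminal looks roughly like this. The model names are tags from the Ollama library at the time of writing and may change:

  ollama --version    # confirm the CLI is on your PATH
  ollama pull phi3.5  # download Microsoft's Phi3.5 (a few GB)
  ollama list         # list the models installed locally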
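For step 2, here is a sketch of the Miniconda/pip route I took for Open-WebUI. The Python version and package name follow the Open-WebUI documentation at the time and may differ for newer releases:

  conda create -n open-webui python=3.11  # Open-WebUI targets Python 3.11
  conda activate open-webui
  pip install open-webui
  open-webui serve                        # then browse to http://localhost:8080

By default, Open-WebUI looks for a local Ollama server on http://localhost:11434, so once both are running you can pick any pulled model from the dropdown in the web interface.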
Trying Out the Phi3.5 and Llama 3.1 Models
Once Ollama and Open-WebUI are up and running, it's time to experiment with the models themselves.
1. Phi3.5 by Microsoft:
- Phi3.5 is a lightweight yet powerful model, designed for speed and efficiency. It's perfect for tasks that require quick responses without compromising on quality.
- To run Phi3.5, simply load the model through Ollama's command-line interface or Open-WebUI's graphical interface (example commands follow this list). You'll be impressed by how responsive it is, even on a GTX 1660.
2. Llama 3.1 by Meta:
- If you need more power, Meta's Llama 3.1 is the way to go. This model is designed for more complex tasks, offering superior performance for intensive applications.
- Llama 3.1 is a bit heavier on resources, but with Ollama's optimizations and the GTX 1660 you can still achieve impressive results (see the example after this list).
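For reference, running Phi3.5 from the command line is a one-liner; the prompt below is just an arbitrary example:

  ollama run phi3.5                                               # interactive chat in the terminal
  ollama run phi3.5 "Explain recursion in one short paragraph."   # one-off prompt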
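Llama 3.1 comes in several sizes; on a 6 GB card like the GTX 1660, the 8B variant (which Ollama ships quantized by default) is the realistic choice, with some layers likely offloaded to system RAM. Assuming the 8B tag:

  ollama pull llama3.1:8b
  ollama run llama3.1:8b "Summarize the plot of Hamlet in three sentences."

While a model is loaded, ollama ps shows how much of it ended up on the GPU versus the CPU.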
Developing Apps Locally Without Costly APIs
One of the most exciting aspects of running LLMs locally is the freedom to develop applications with the help of AI without relying on external APIs. Tools like continue.dev and aider.chat make this process seamless.
1. continue.dev:
- Continue.dev is a powerful IDE extension (for VS Code and JetBrains) that integrates with your local models, letting you develop and debug with AI assistance directly on your machine (a sample configuration follows this list).
- This tool supports a variety of programming languages and frameworks, making it versatile for different types of projects.
2. aider.chat:
- Aider.chat is an AI pair-programming tool that runs in your terminal and works directly on the code in your git repository, using the LLMs you have installed locally. This makes it particularly useful when you want AI help on a real codebase without sending any of it to an external service.
- With aider.chat, you can experiment with different local models and pick the one that best fits your project, all without incurring any API costs (the setup commands are sketched after this list).
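To make the wiring concrete, here is a minimal sketch of a Continue configuration (config.json in the .continue folder of your home directory) pointing at local Ollama models. The schema has evolved across Continue releases, so treat the field names as illustrative and check the current continue.dev docs:

  {
    "models": [
      {
        "title": "Llama 3.1 8B (local)",
        "provider": "ollama",
        "model": "llama3.1:8b"
      }
    ],
    "tabAutocompleteModel": {
      "title": "Phi3.5 (local)",
      "provider": "ollama",
      "model": "phi3.5"
    }
  }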
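Aider itself is a pip install away. The commands below are a sketch based on aider's Ollama instructions at the time (PowerShell syntax for the environment variable); verify the details against the current aider.chat docs:

  pip install aider-chat
  $env:OLLAMA_API_BASE = "http://127.0.0.1:11434"   # point aider at the local Ollama server
  aider --model ollama/llama3.1:8b                  # run inside your project directory

Because everything stays on your machine, you can iterate on your codebase as much as you like without worrying about per-token billing.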
Conclusion
Running large language models locally using Ollama, Open-WebUI, continue.dev, and aider.chat is not only feasible but also highly practical, even on modest hardware like my Nvidia GeForce GTX 1660. Whether you're experimenting with Microsoft's Phi3.5 for quick, lightweight tasks or leveraging Meta's Llama 3.1 for more demanding applications, these tools give you the freedom and flexibility to develop powerful AI applications without relying on costly external services. Best of all, you can do it all without spending a penny on API access, making local LLM deployment a game-changer for developers on a budget.