How to Set Up a Local and Cloud LLM Aggregator Using Open WebUI


Why Aggregate Local and Cloud LLMs?

Large Language Models (LLMs) like Llama 2, Mistral, and OpenAI’s GPT-4 offer unique strengths. By combining local and cloud-based models, you can:

  • Maximize flexibility: Use local models for privacy-sensitive tasks and cloud APIs for state-of-the-art performance.
  • Reduce costs: Offload lightweight tasks to local models and reserve cloud APIs for critical workloads.
  • Future-proof your setup: Stay ready for new models, whether they’re local or cloud-based.

Prerequisites

  • A machine with 16GB+ RAM (GPU recommended for speed).
  • Basic terminal/Docker knowledge.
  • Docker and Python installed.
  • API keys for cloud services like OpenAI.
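
A quick sanity check that these prerequisites are in place; a minimal sketch assuming a Linux host (the free command is Linux-specific; on macOS or Windows, check memory via the system monitor instead):

docker --version    # any recent Docker Engine release
python3 --version   # Python 3.x
free -h             # confirm 16GB+ total RAM (Linux only)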


Step 1: Set Up Ollama (Local Model Manager)

Ollama simplifies running LLMs locally. Install via Docker:

docker run -d -v ollama:/root/.ollama -p 11434:11434 --name ollama ollama/ollama

Download models like Llama 2 and Mistral:

docker exec -it ollama ollama pull llama2
docker exec -it ollama ollama pull mistral
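
To confirm the models respond, you can query Ollama's REST API directly. A minimal smoke test, assuming the port mapping from the command above:

curl http://localhost:11434/api/generate -d '{"model": "llama2", "prompt": "Reply with one short sentence.", "stream": false}'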

Step 2: Deploy Open WebUI

Open WebUI is a user-friendly frontend for Ollama and cloud APIs. Deploy it with:

docker run -d -p 3000:8080 \
  --add-host=host.docker.internal:host-gateway \
  -v open-webui:/app/backend/data \
  --name open-webui \
  ghcr.io/open-webui/open-webui:main

Access the UI at http://localhost:3000 (host port 3000 maps to the container's internal port 8080).
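
If you prefer to manage both containers together, Docker Compose is a common alternative to the two docker run commands above. A minimal sketch (the service and volume names are my own choices); on a shared Compose network, Open WebUI can reach Ollama at http://ollama:11434 via the OLLAMA_BASE_URL variable instead of host.docker.internal:

services:
  ollama:
    image: ollama/ollama
    ports:
      - "11434:11434"
    volumes:
      - ollama:/root/.ollama
  open-webui:
    image: ghcr.io/open-webui/open-webui:main
    ports:
      - "3000:8080"
    environment:
      - OLLAMA_BASE_URL=http://ollama:11434
    volumes:
      - open-webui:/app/backend/data
    depends_on:
      - ollama
volumes:
  ollama:
  open-webui:

Save this as docker-compose.yml and run docker compose up -d to start both services together.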


Step 3: Configure Local Models

  1. Add Models: In Open WebUI, navigate to settings and link Ollama (default URL: http://host.docker.internal:11434).
  2. Switch Models: Select any downloaded model from the dropdown during chats.
  3. Customize Presets: Save prompts tailored to specific models (e.g., code generation with Mistral, creative writing with Llama 2); a Modelfile-based alternative is sketched below this list.
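
Presets live inside Open WebUI, but you can also bake a system prompt into a reusable local model variant with an Ollama Modelfile. A minimal sketch (the code-helper name, the prompt, and the temperature are placeholders):

docker exec -it ollama sh -c 'cat > /tmp/Modelfile <<EOF
FROM mistral
SYSTEM """You are a senior developer. Answer with concise, runnable code."""
PARAMETER temperature 0.2
EOF
ollama create code-helper -f /tmp/Modelfile'

The new code-helper variant then shows up in Open WebUI's model dropdown like any other local model.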


Step 4: Integrate Cloud APIs (e.g., OpenAI)

  1. Get Your API Key: Log in to your OpenAI account and generate an API key.
  2. Add OpenAI to Open WebUI: In the Open WebUI admin settings, open the Connections section, paste your API key into the OpenAI API field, and save. (An environment-variable alternative is sketched below this list.)
  3. Switch Between Local and Cloud Models: Once the key is saved, OpenAI models appear in the same model dropdown as your Ollama models, so you can switch backends per conversation.
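
As an alternative to pasting the key into the UI, Open WebUI also reads it from an environment variable at startup. A sketch using the OPENAI_API_KEY variable and a placeholder key:

docker run -d -p 3000:8080 \
  --add-host=host.docker.internal:host-gateway \
  -e OPENAI_API_KEY=sk-your-key-here \
  -v open-webui:/app/backend/data \
  --name open-webui \
  ghcr.io/open-webui/open-webui:main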


Advanced Tips

  • Hybrid Workflows: Use local models for drafts and cloud APIs for final refinements.
  • Cost Management: Set usage limits for cloud APIs to avoid unexpected bills.
  • Security: Use HTTPS and auth tools like Caddy for secure remote access (a minimal Caddyfile sketch follows this list).
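
For the security tip above, a minimal Caddyfile sketch (chat.example.com is a placeholder domain): Caddy obtains HTTPS certificates automatically and proxies traffic to the Open WebUI port, while Open WebUI's own login screen handles authentication.

chat.example.com {
    reverse_proxy localhost:3000
}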


Conclusion

With Open WebUI, you’ve built a hybrid AI hub that combines the best of local and cloud-based models. Whether you prioritize privacy, cost, or performance, this setup adapts to your needs.

GitHub Repos: Ollama | Open WebUI

Ready to innovate? Dive deeper into the code, share your setups, and tag me in your experiments!
