Part 2: Introduction to Ollama

Ollama is an innovative framework for running, managing, and building applications with large language models (LLMs) directly on local machines. It gives developers the tools to integrate AI capabilities into their applications while keeping control over the models and the data. It also supports models built on the Mixture-of-Experts (MoE) architecture, which handle nuanced tasks such as reasoning and inference effectively. This lets such models maintain high performance at smaller sizes, making them resource-efficient and accessible for individual users and smaller organizations.

Key Features of Ollama

· Local model execution
· Open-source models
· Easy integration
· Model customization
· Performance optimization
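
To make “local model execution” and “easy integration” concrete, here is a minimal sketch of calling the Ollama REST API once the service is running. It assumes the default API port 11434 and a model (llama3.2:1b, which we pull later in this section) already available:

# Ask a locally running Ollama instance for a completion.
# Assumes the default port 11434 and an already-pulled model.
curl http://localhost:11434/api/generate -d '{
  "model": "llama3.2:1b",
  "prompt": "Why is the sky blue?",
  "stream": false
}'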

In this section we will demo how to deploy Ollama as a Docker container on an AWS EC2 instance and use it with Llama 3.2 and Phi-4. We will also install a Hugging Face model to demonstrate using models not natively supported by Ollama.

We will not be covering the steps to create the EC2 instance; we will assume one is already available. We will start by deploying Docker, Docker Compose, and Ollama with the Open WebUI service, and then validate the setup from our laptop/desktop.

The EC2 instance is running.


Steps to install Docker.

Steps to install Docker Compose.

An Ollama compose file to create both Ollama and Open WebUI as services.

Let us connect to the instance via PuTTY.
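
(If you are not on Windows, a plain SSH client works just as well. The key file name and user below are assumptions; adjust them to your key pair and AMI:)

# Hypothetical key file; the default user is ec2-user on Amazon Linux, ubuntu on Ubuntu.
ssh -i my-key.pem ec2-user@18.209.164.240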

Let us install Docker and Docker Compose, starting with Docker.
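
A typical sequence on Amazon Linux 2 looks like this (a sketch assuming that AMI; Ubuntu would use apt instead):

# Install and start Docker on Amazon Linux 2 (assumed AMI).
sudo yum update -y
sudo yum install -y docker
sudo systemctl enable --now docker
# Let the default user run docker without sudo (log out and back in afterwards).
sudo usermod -aG docker ec2-user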

Next, Docker Compose.
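
One common way is to download the standalone docker-compose binary from the project's GitHub releases (the version below is an example; an x86_64 instance is assumed):

# Download the standalone docker-compose binary (example version; check for a current release).
sudo curl -L "https://github.com/docker/compose/releases/download/v2.24.0/docker-compose-linux-x86_64" -o /usr/local/bin/docker-compose
sudo chmod +x /usr/local/bin/docker-compose
docker-compose --version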

Next, we will create the ollama.yaml file.
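
The original file was shown as a screenshot; the following is a minimal sketch of an equivalent compose file, assuming the official ollama/ollama and ghcr.io/open-webui/open-webui images, port 3000 for the UI, and named volumes for persistence:

version: "3.8"
services:
  ollama:
    image: ollama/ollama:latest
    container_name: ollama
    ports:
      - "11434:11434"          # Ollama REST API
    volumes:
      - ollama:/root/.ollama   # persist pulled models
    restart: unless-stopped
  open-webui:
    image: ghcr.io/open-webui/open-webui:main
    container_name: open-webui
    ports:
      - "3000:8080"            # UI reachable on port 3000
    environment:
      - OLLAMA_BASE_URL=http://ollama:11434
    depends_on:
      - ollama
    volumes:
      - open-webui:/app/backend/data
    restart: unless-stopped
volumes:
  ollama:
  open-webui: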


Now let us deploy this by running the command:

docker-compose -f ollama.yaml up -d


We can validate it by running the command:

docker ps


Now we can open the Chrome browser and test it:

http://18.209.164.240:3000, where “18.209.164.240” is the public IP of the EC2 instance on which the service is running. (Port 3000 must be open in the instance's security group.)


Press “Get Started”.

Enter the details and press “Create Admin Account”.

Press “Okay, Let's Go”.

Now we are ready to use it. Go to the URL https://ollama.com/library to see the list of models available in Ollama.

As an experiment, let us load the Llama 3.2 model.

In our Open WebUI browser, type the model name in the model selector.

Then press pull “llama3.2:1b” from Ollama.com.
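
(Equivalently, the model can be pulled from the EC2 shell; this assumes the Ollama container is named ollama as in the compose sketch above:)

# Pull the model with the Ollama CLI inside the container.
docker exec -it ollama ollama pull llama3.2:1b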

Wait for it to complete.

Now it is complete. Let us use it and ask some questions.


Next, let us test the reasoning of the llama3.2 model.


Not very good at reasoning.

Next, let us load Phi-4, try the same question, and see how the reasoning works with Phi-4.

Press pull “vanilj/Phi-4” from Ollama.com. Let us switch to Phi-4, ask the same question, and see the answer.

Phi-4 is very good at reasoning. Now let us try code generation as well.


Next, we will create HTML/CSS code from a wireframe diagram using the Llama 3.2 Vision model.

Press pull “llama3.2-vision:11b” from Ollama.com. Once it is ready, select the vision model.

Press the “+” sign to upload the image.

Press “Upload files”.

Press Enter now.
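
(The same step can be scripted against the Ollama API directly. This is a sketch; the open API port and the wireframe.png file name are assumptions:)

# Send a base64-encoded wireframe image to the vision model.
IMG=$(base64 -w 0 wireframe.png)
curl http://18.209.164.240:11434/api/generate -d "{
  \"model\": \"llama3.2-vision:11b\",
  \"prompt\": \"Generate HTML and CSS for this wireframe\",
  \"images\": [\"$IMG\"],
  \"stream\": false
}"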



Next, we will load a model from the Hugging Face Hub that is not natively available in Ollama. Note that only GGUF models can be loaded this way.
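
One way to do this is to pull a GGUF repository directly from Hugging Face using the hf.co prefix, which Ollama supports. A sketch, where the repository and quantization tag are examples and not necessarily the model used here:

# Pull a GGUF model straight from the Hugging Face Hub (example repo and tag).
docker exec -it ollama ollama pull hf.co/bartowski/Llama-3.2-1B-Instruct-GGUF:Q4_K_M
# Alternative: download a .gguf file, write a Modelfile containing "FROM ./model.gguf",
# then run "ollama create my-model -f Modelfile".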


Let us use this model with Ollama now.


Cleaning up the installation

Next, we will bring down the services and remove the Docker images.

Let us manually remove the images.
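
A sketch of the teardown, assuming the compose file and image tags used above:

# Stop and remove the Ollama and Open WebUI services.
docker-compose -f ollama.yaml down
# List remaining images, then remove the two we pulled.
docker images
docker rmi ollama/ollama:latest ghcr.io/open-webui/open-webui:main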

If we run “docker ps -a”, we should not see any services running.

We can log out and shut down the EC2 instance. In the next part, we will integrate all of this and create an AI search engine.

