Image to Text using the Multimodal Model LLaVA in GGUF Format
Satish Srinivasan
Cloud Architect | Cloud Security Analyst | Specialist - AWS & Azure Cloud | AWS Community Builder | AWS APN Ambassador
We will be using an Amazon EC2 Linux instance to deploy the model and the requisite libraries. For inference we will use an m5.8xlarge instance, which is a CPU-based instance type. The sample code for this demo is available in the git repo.
Create an EC2 instance.
Press "Launch instance" and then log in to the instance that is created. The security group associated with the EC2 instance should have the required ports open in its inbound rules: port 22 for SSH access and port 8501 for the application we serve later in this demo.
For security purposes, we restrict inbound access to the user's public IP address.
Log in to the EC2 instance and create a virtual environment.
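On Amazon Linux, a virtual environment can typically be created and activated as follows (the environment name demo-env is illustrative):

python3 -m venv demo-env
source demo-env/bin/activate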
Install the huggingface-cli to download the model.
pip install -U "huggingface_hub[cli]"
We can use either "llava-v1.5-7B-GGUF" or "liuhaotian_llava-v1.5-13b-GGUF". For this demo I will be using the files "llava-v1.5-7b-Q4_K.gguf" and "llava-v1.5-7b-mmproj-f16.gguf" from the "jartine/llava-v1.5-7B-GGUF" repository.
Steps to download the model.
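A minimal sketch of the download using huggingface-cli, assuming the two files named above:

huggingface-cli download jartine/llava-v1.5-7B-GGUF llava-v1.5-7b-Q4_K.gguf --local-dir .
huggingface-cli download jartine/llava-v1.5-7B-GGUF llava-v1.5-7b-mmproj-f16.gguf --local-dir .

The --local-dir flag places the files in the current directory so the app can reference them by name.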
Install the required libraries. We will be using "llama_cpp" for loading the model.
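"llama_cpp" is the import name of the llama-cpp-python package, which compiles llama.cpp from source during installation, so C/C++ build tools are usually needed first. A sketch assuming Amazon Linux 2023 (package names may differ on other distributions):

sudo dnf install gcc gcc-c++ cmake python3-devel
pip install llama-cpp-python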
If it prompts to confirm the installation, press "Y".
Other packages required to run this sample are given below.
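The package list is not reproduced here; judging from the rest of the demo (a web UI served on port 8501, which is Streamlit's default, and .png-to-.jpeg conversion, which Pillow handles), the likely installs are:

pip install streamlit pillow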
Create a folder named image inside the demo folder. We use this folder for the image conversion step. This part of the code needs refinement to support generic image format conversion; currently it only converts .png to .jpeg, which is the default image format used in the demo.
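A minimal sketch of such a conversion step using Pillow (the function name and folder layout are illustrative):

from PIL import Image
import os

def convert_to_jpeg(src_path, out_dir="image"):
    # JPEG has no alpha channel, so convert to RGB before saving.
    img = Image.open(src_path).convert("RGB")
    base = os.path.splitext(os.path.basename(src_path))[0]
    out_path = os.path.join(out_dir, base + ".jpeg")
    img.save(out_path, "JPEG")
    return out_path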
Let us go back to the demo folder.
Create a file app.py and copy the code into this file.
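The full code lives in the git repo mentioned earlier; since the listing is not reproduced here, below is a minimal sketch of what app.py might look like, assuming a Streamlit UI and llama-cpp-python's LLaVA chat handler. Model paths, prompts, and widget labels are illustrative.

import base64
import streamlit as st
from llama_cpp import Llama
from llama_cpp.llama_chat_format import Llava15ChatHandler

MODEL_PATH = "llava-v1.5-7b-Q4_K.gguf"          # language model weights
MMPROJ_PATH = "llava-v1.5-7b-mmproj-f16.gguf"   # CLIP projector for the vision side

@st.cache_resource
def load_model():
    # The chat handler wires the mmproj vision encoder into the LLaVA model.
    chat_handler = Llava15ChatHandler(clip_model_path=MMPROJ_PATH)
    return Llama(model_path=MODEL_PATH, chat_handler=chat_handler, n_ctx=2048)

def to_data_uri(image_bytes, mime):
    # llama-cpp-python accepts images as base64 data URIs in image_url content parts.
    b64 = base64.b64encode(image_bytes).decode("utf-8")
    return f"data:{mime};base64,{b64}"

st.title("Image to Text with LLaVA (GGUF)")
uploaded = st.file_uploader("Upload an image", type=["jpeg", "jpg", "png"])

if uploaded is not None:
    st.image(uploaded)
    llm = load_model()
    response = llm.create_chat_completion(
        messages=[
            {"role": "system", "content": "You are an assistant that describes images."},
            {
                "role": "user",
                "content": [
                    {"type": "image_url",
                     "image_url": {"url": to_data_uri(uploaded.getvalue(), uploaded.type)}},
                    {"type": "text", "text": "Describe this image in detail."},
                ],
            },
        ]
    )
    st.write(response["choices"][0]["message"]["content"])

This sketch passes the uploaded bytes directly to the model as a data URI; the version in the repo additionally routes uploads through the image conversion folder described earlier.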
Let us run the code, as shown below.
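Assuming the Streamlit sketch above, the app is started with:

streamlit run app.py

Streamlit listens on port 8501 by default, which matches the inbound rule we opened earlier.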
We will use the Google Chrome browser to test. The URL is http://3.90.36.218:8501, where 3.90.36.218 is the public IP address of the EC2 instance and 8501 is the application port.
The code sample can be refined further, and we can experiment with other use cases using the LLaVA model.