AI Based Search using SearXNG, AWS Bedrock and Perplexia

AI-based search has revolutionized how we access and interact with information online. By leveraging advanced algorithms, these systems use machine learning and deep learning techniques to analyze vast amounts of data, improving the accuracy and efficiency of information retrieval. AI-driven search capabilities extend beyond traditional web search, influencing applications such as e-commerce, social media, and content streaming platforms by tailoring experiences to individual users. As AI continues to evolve, its impact on search technology promises to keep growing.

Amazon Bedrock is a fully managed service from AWS that simplifies the development of generative AI applications by providing access to a variety of foundation models (FMs) from top AI companies, including Anthropic's Claude 3 Sonnet and Amazon's Titan models. This platform allows developers to experiment with, customize, and integrate these models into their applications without the need for extensive machine learning expertise or infrastructure management.

The Claude 3 Sonnet model showcases significant advancements in speed and reasoning, making it suitable for enterprise-level generative AI applications. It is designed to handle complex instructions effectively and can evaluate generated content, providing feedback and suggestions for improvement, thereby enhancing the user experience.

Additionally, the Amazon Titan embedding model offers optimized retrieval augmented generation (RAG) capabilities, enabling users to enhance their applications with up-to-date information and tailored responses. This integration of Titan embeddings with Claude 3 Sonnet allows for a more robust and versatile approach to developing AI solutions that meet specific business needs.
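As a quick sanity check that both models are reachable, they can be invoked directly through the Bedrock runtime API from the AWS CLI. This is a hedged sketch: the region, prompt, and output file names are placeholders, and the model IDs are the publicly documented Bedrock identifiers at the time of writing.

```shell
# Placeholder region; use the region where you enabled model access
REGION="us-east-1"

# Chat completion with Claude 3 Sonnet (Bedrock Messages API request body)
aws bedrock-runtime invoke-model \
  --region "$REGION" \
  --model-id "anthropic.claude-3-sonnet-20240229-v1:0" \
  --cli-binary-format raw-in-base64-out \
  --body '{"anthropic_version":"bedrock-2023-05-31","max_tokens":256,"messages":[{"role":"user","content":[{"type":"text","text":"What is SearXNG?"}]}]}' \
  claude-output.json

# Text embedding with the Amazon Titan embedding model
aws bedrock-runtime invoke-model \
  --region "$REGION" \
  --model-id "amazon.titan-embed-text-v1" \
  --cli-binary-format raw-in-base64-out \
  --body '{"inputText":"What is SearXNG?"}' \
  titan-output.json
```

If both calls return without an `AccessDeniedException`, the credentials and model access are set up correctly.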

In this blog we will cover an implementation of AI-based search using the open-source tools SearXNG and Perplexia together with AWS Bedrock models. In this implementation, the AWS Bedrock Anthropic Claude 3 Sonnet model is hardcoded as the LLM and the Amazon Titan model as the embedding model. We will modify the code later to add support for other LLMs as well.

We will be using the open-source project “Perplexia” implemented by “Kushagra”. The entire backend service has been modified to use the Bedrock chat and embedding libraries.

We will not cover the steps to create an EC2 instance; we assume one is already available. We will start by associating an Elastic IP with the EC2 instance and installing the required software.

Prerequisites

- Security groups
- EC2 instance
- AWS Bedrock console
- Access to the required foundation models
- AWS access key and secret access key with permissions for the AWS Bedrock related policies


Walkthrough

The code has been fully modified to support Bedrock. All usage of ChatOpenAI has been replaced with Bedrock-related libraries.

The “config.toml” file holds the model and credential settings.

We need to configure an AWS user, grant that user the Bedrock full-access privilege, and use its access key and secret access key. Also, in the Bedrock console we need to make sure we have access to all the required models.

Let us first create an IAM user, grant it permission to access the Bedrock models, and generate the keys to be used.
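The same IAM setup can be done from the AWS CLI. A hedged sketch, assuming the CLI is already configured with an administrative profile; the user name `bedrock-demo-user` is a placeholder:

```shell
# Create a dedicated IAM user for this demo (placeholder name)
aws iam create-user --user-name bedrock-demo-user

# Attach the AWS managed policy that grants full Bedrock access
aws iam attach-user-policy \
  --user-name bedrock-demo-user \
  --policy-arn arn:aws:iam::aws:policy/AmazonBedrockFullAccess

# Generate the access key / secret access key pair; store the output securely
aws iam create-access-key --user-name bedrock-demo-user
```

The `create-access-key` output contains the secret access key only once, so save it immediately.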

For the user we created, associate the following policy.


Next, we will generate the Keys

Press “Create Access Key” and save it securely. This should not be shared.


Press “Next”.

Press “Create Access Key”.

Press Download .csv file.

In the Bedrock console, request and obtain access to the required Bedrock models (Anthropic Claude 3 Sonnet and Amazon Titan).

In AWS Bedrock -> Model Access.
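Once access is granted, it can optionally be verified from the CLI. A sketch, assuming the region is us-east-1; `list-foundation-models` requires Bedrock read permissions:

```shell
# List the Anthropic and Amazon model IDs visible in this region
aws bedrock list-foundation-models --region us-east-1 \
  --by-provider anthropic \
  --query 'modelSummaries[].modelId'

aws bedrock list-foundation-models --region us-east-1 \
  --by-provider amazon \
  --query 'modelSummaries[].modelId'
```

The Claude 3 Sonnet and Titan embedding model IDs should appear in the respective lists.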

We will start with the EC2 instance and associate an Elastic IP with it.

Steps to install Docker.

Steps to install Docker Compose.

Steps to install Make on an Amazon Linux 2 instance.

Steps to install Git on an Amazon Linux 2 instance.

Steps to install SearXNG as the AI search engine.

The EC2 instance is running. Now let us assign an Elastic IP to it.


Press “Allocate”.

Now associate this to the EC2 instance

Press “Associate”.
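The same allocation and association can be done from the AWS CLI. A sketch; the instance and allocation IDs below are placeholders for the values returned by your own account:

```shell
# Allocate a new Elastic IP in the VPC scope; note the AllocationId returned
aws ec2 allocate-address --domain vpc

# Associate the Elastic IP with the instance (placeholder IDs)
aws ec2 associate-address \
  --instance-id i-0123456789abcdef0 \
  --allocation-id eipalloc-0123456789abcdef0
```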

We have done the first part. Let us connect to the instance via PuTTY.

Let us install Docker and docker-compose
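On Amazon Linux 2, Docker is available through the amazon-linux-extras repository. A sketch of the standard install, run as ec2-user:

```shell
# Update packages and install Docker from the AL2 extras repo
sudo yum update -y
sudo amazon-linux-extras install docker -y
sudo service docker start

# Allow ec2-user to run docker without sudo (log out and back in to take effect)
sudo usermod -a -G docker ec2-user
docker --version
```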

Next docker-compose.
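docker-compose is installed as a standalone binary from the docker/compose GitHub releases. The version below is an assumption; pick the latest release from the releases page:

```shell
# Assumed version; substitute the current release tag
COMPOSE_VERSION="v2.24.6"

# Download the binary matching this OS and architecture
sudo curl -L \
  "https://github.com/docker/compose/releases/download/${COMPOSE_VERSION}/docker-compose-$(uname -s)-$(uname -m)" \
  -o /usr/local/bin/docker-compose
sudo chmod +x /usr/local/bin/docker-compose
docker-compose --version
```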

Next, we will install make and git on Amazon Linux 2.

Next, we will install Git
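Both tools come straight from the standard Amazon Linux 2 yum repositories:

```shell
# Install make and git in one step, then confirm the versions
sudo yum install -y make git
make --version
git --version
```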

Next, we will create a virtual environment and deploy SearXNG.

Create a virtual environment.

Load required packages.
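A sketch of these two steps with Python's built-in venv module; the environment path is a placeholder:

```shell
# Create and activate a virtual environment for the SearXNG tooling
python3 -m venv ~/searxng-venv
source ~/searxng-venv/bin/activate

# Upgrade the packaging tools inside the environment
pip install --upgrade pip setuptools wheel
```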

Git clone the searxng source
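Assuming the upstream SearXNG repository on GitHub, the clone looks like this; the `searx/settings.yml` file edited in the next step lives inside the checkout:

```shell
# Clone the SearXNG source and enter the working directory
git clone https://github.com/searxng/searxng.git
cd searxng
```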

Under the searx directory, in the settings.yml file, change the following:

search:
  formats:
    - html
    - json

Build the image.
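When building from the cloned source, the repository's Makefile provides a Docker build target. A sketch; verify the target name against your checkout:

```shell
# Build the SearXNG container image from the cloned source tree
make docker.build
```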

Deploy the container and test.

Get the private IP address associated with the EC2 instance.

docker run --rm -d -p 8181:8080 -v "${PWD}/searxng:/etc/searxng" -e "BASE_URL=http://172.31.21.1:8181/" -e "INSTANCE_NAME=my-instance" searxng/searxng

We will test the Search engine.

http://44.198.108.129:8181



This is up and running. Next, we will configure Perplexia and demonstrate AI-based search using the AWS Bedrock Anthropic Claude 3 Sonnet model.

In the Perplexia folder, edit the config.toml file to point at the Bedrock models and your AWS credentials.
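As a hedged sketch of what that file might contain: the section and key names below are illustrative assumptions based on typical configurations of this kind, not the exact schema shipped with the project, so check them against the config.toml in your checkout. The model IDs are the public Bedrock identifiers used throughout this post.

```toml
# Hypothetical config.toml fragment; section and key names are illustrative
[GENERAL]
PORT = 3001

[AWS]
REGION = "us-east-1"
ACCESS_KEY = "YOUR_ACCESS_KEY"
SECRET_ACCESS_KEY = "YOUR_SECRET_ACCESS_KEY"
CHAT_MODEL = "anthropic.claude-3-sonnet-20240229-v1:0"
EMBEDDING_MODEL = "amazon.titan-embed-text-v1"

[API_ENDPOINTS]
SEARXNG = "http://localhost:8181"
```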

We will run the entire search engine using Bedrock models and Bedrock libraries. Then build and run the backend and UI.

Starting the backend and UI services.

For the backend

For the UI.


The Model configuration used.

Let us open http://localhost:3000 in a browser.

Ask the First Question


Cleanup

Stop all the services on the EC2 instance and terminate the instance. For additional security, remove the AWS keys used for this demo.
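The cleanup can also be scripted with the AWS CLI. A sketch; every ID and the user name below are placeholders for the values from your own account:

```shell
# Delete the demo user's access key, detach its policy, then delete the user
aws iam delete-access-key --user-name bedrock-demo-user \
  --access-key-id AKIAEXAMPLEKEYID
aws iam detach-user-policy --user-name bedrock-demo-user \
  --policy-arn arn:aws:iam::aws:policy/AmazonBedrockFullAccess
aws iam delete-user --user-name bedrock-demo-user

# Disassociate and release the Elastic IP, then terminate the instance
aws ec2 disassociate-address --association-id eipassoc-0123456789abcdef0
aws ec2 release-address --allocation-id eipalloc-0123456789abcdef0
aws ec2 terminate-instances --instance-ids i-0123456789abcdef0
```

Releasing the Elastic IP matters: an allocated address that is not associated with a running instance incurs charges.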

