登录查看更多内容

23-6 Qdrant - Getting started with a Vector DB powered Meme Recommender

Won Bae Suh

发布日期: 2024年1月15日

We explored Pinecone and Chroma, where the latter was easy to install and run locally. The former could scale up per need and use case. We also saw how Langchain and Llama Index make it possible to abstract away the embedding and loading process. Starting your own Vector Database with your own set of documents is accessible more than ever.

Another Vector DB to note is Qdrant. In their home page, Qdrant “is a vector similarity search engine that provides a production-ready service with a convenient API to store, search, and manage points (i.e. vectors) with an additional payload. (Source: What is Qdrant? - Qdrant.)

The above link also provides some details on what a Vector Database is, how it is different from other forms of DBs and some use cases. The page is worth reviewing simply to recap on the basics. Navigating to the main page we can even review live demos of Qdrant in action.

Qdrant offers flexibility of deployment methods like Chroma

Run a Docker image if you don’t have a Python development environment. Setup a local Qdrant server and storage in a few moments.
Get the Python client if you’re familiar with Python. Just pip install qdrant-client. The client uses an in-memory database.
Spin up a Qdrant Cloud cluster: the recommended method to run Qdrant in production.

An interesting read on what makes Qdrant spark for Production use cases is the internal benchmarks the team has done to compare latency and Requests-per-Second (RPS). Vector Database Benchmarks - Qdrant

The Vector Search Solutions page offers a glimpse into how a Vector DB like Qdrant can be incorporated for business challenges and needs, ranging from Recommendation, Semantic Text and Image Search as well as Anomaly detection. If you were thinking Vector DBs only in RAG framework, this page opens up the mind for possibilities.

Since we have gone through a local vector store last time with Chroma, let's try Qdrant Cloud. https://cloud.qdrant.io/ (note you can easily install a local storage option too)

Upon signing in we can create our first cluster in the free tier.

Select the Get API Key and copy the details somewhere safe.

We are now ready to build with Qdrant.

Open up a clean notebook and install the Qdrant Python Client

领英推荐

Data Science Portfolios, Speeding Up Python, KANs, and…

Towards Data Science 9 个月前

DABL

360DigiTMG 1 年前

KX's developed innovation of AI (Artificial…

Caspian One 9 个月前

 pip install qdrant-client

Optionally if you are running in a CPU instance, locally or in the cloud

pip install qdrant-client[fastembed]

from qdrant_client import QdrantClient

qdrant_client = QdrantClient(url="YOUR_URL". api_key="YOUR_API")

If you missed the URL key during the setup process,

Go to the console, select the Clusters Tab and copy the URL under Cluster URL

Let's try something more than similar text retrieval and generation. At this rate, it is becoming too repetitive and we still have more vector DBs to cover.

We will create a Meme Image Recommender.

Data: Obtain a meme dataset with images, descriptions, and tags.
Preprocessing and Embedding: Clean up and add more metadata
Image Storage: Extract and store the raw images with tags
Query Processing and Retrieval: Add logic for retrieval and processing user query with LLM to generate response
Recommendation of similar meme images: Use Qdrant's Recommendation engine to enhance similarity based responses
(Bonus) Generate the Memes using an external service via API and direct to URL

marij868/memes_dataset_full · Datasets at Hugging Face

Note this app can be overkill and you can easily create a Meme using custom GPTs or open source tools. Here we are primarily focused on learning similarity search for images and text while taking advantage of Qdrant's unique recommendation API. (if I am capable to use it)

Let's start with reviewing and processing the dataset to start our journey. See you there.

Shawn Gordon

Data geek and developer advocate supreme

1 年

I did a project recently with Llamaindex and Lancedb for the vector store. Crazy simple and fast.

1 次回应

查看更多评论

要查看或添加评论，请登录

Won Bae Suh的更多文章

Solution Architecting with LLMs - Data

2025年1月27日

Solution Architecting with LLMs - Data

The headline news has been ablaze with Deepseek R (as of 28th Jan). The NASDAQ stocks plummeted, with NVIDIA—the…
Maybe Fine-Tuning is Not so Terrible After All

2024年4月3日

Maybe Fine-Tuning is Not so Terrible After All

Previously, I covered some lessons and insights from prompting to harness the In-Context Learning capabilities of LLMs,…
(Mis)adventures in GenerativeAI and, well, Just Trust in the Process, Figure things Out, Fail Quickly and Move On.

2024年4月3日

(Mis)adventures in GenerativeAI and, well, Just Trust in the Process, Figure things Out, Fail Quickly and Move On.

In the evolving and demanding realm of localization/translation, I was fortunate to explore most SOTA methods…
Some Thoughts on Content and Data in Life Science

2024年3月28日

Some Thoughts on Content and Data in Life Science

In my journey across the realms of content and data management systems, I've been fortunate to witness firsthand the…
Untitled

2024年3月19日

Untitled

The power of beginner mindset The power of first principles The power of curiosity and wonder The power of exponentials…
Raw Notes on Fine-Tuning LLMs

2024年3月5日

Raw Notes on Fine-Tuning LLMs

High Quality Data > Lots of Poor Quality Data Know your data well Watch out for the curse of overfitting Start with…
Challenges and Observations on building LLM-powered PubMed Chat Assistants (Part 3)

2024年2月26日

Challenges and Observations on building LLM-powered PubMed Chat Assistants (Part 3)

A freeform essay on "If I were to back in time, what would i have done differently, advise my past self and tell him…
Notes on Building GPT-powered PubMed Chat Assistants (Part 2) (feat. Sendbird)

2024年2月25日

Notes on Building GPT-powered PubMed Chat Assistants (Part 2) (feat. Sendbird)

After building a PubMed Chat application purely with Python and OpenAI's GPT 3.5 Turbo model, I came across another…

1 条评论
Reflections of Building GPT-powered PubMed Chat Assistants (Part 1)

2024年2月24日

Reflections of Building GPT-powered PubMed Chat Assistants (Part 1)

This article is to reflect on the journey of creating two distinct yet complementary projects that blend Generative AI…

1 条评论
Introducing RAST: Retrieval Augmented Search to Translate

2024年1月18日

Introducing RAST: Retrieval Augmented Search to Translate

RAST framework is WIP personal project to see performance and accuracy gains from leveraging LLMs, external stores and…

See all articles

23-6 Qdrant - Getting started with a Vector DB powered Meme Recommender

Won Bae Suh

领英推荐

Won Bae Suh的更多文章

社区洞察

其他会员也浏览了

Utilizing ML for Better Scraping, Data Extraction With a Headless Browser, and More

Scraping simplified

Deploying models from notebooks, auto-selecting the best plan, INFORMS events, and more

Microservices Design IV: Distributed Tracing, Python in Excel and ChatGPT Enterprise

Introduction To PandasAI Part 1

Just add "easy" ... economically expanding the machine learning problem space

Platforms for Machine Learning, AI, & Data Science Best Practices

GroupBy #11: Python at Meta, Netflix Incremental Processing with Apache Iceberg, 2023 AI year in brief

Bigbird, TensorFlowJS and LinkedIn — Web models for your network.

Tools for Data Collection and Processing: Integrating Python, AI, and Machine Learning

领英推荐

Won Bae Suh的更多文章

Solution Architecting with LLMs - Data

Maybe Fine-Tuning is Not so Terrible After All

(Mis)adventures in GenerativeAI and, well, Just Trust in the Process, Figure things Out, Fail Quickly and Move On.

Some Thoughts on Content and Data in Life Science

Untitled

Raw Notes on Fine-Tuning LLMs

Challenges and Observations on building LLM-powered PubMed Chat Assistants (Part 3)

Notes on Building GPT-powered PubMed Chat Assistants (Part 2) (feat. Sendbird)

Reflections of Building GPT-powered PubMed Chat Assistants (Part 1)

Introducing RAST: Retrieval Augmented Search to Translate

社区洞察

其他会员也浏览了

Utilizing ML for Better Scraping, Data Extraction With a Headless Browser, and More

Scraping simplified

Deploying models from notebooks, auto-selecting the best plan, INFORMS events, and more

Microservices Design IV: Distributed Tracing, Python in Excel and ChatGPT Enterprise

Introduction To PandasAI Part 1

Just add "easy" ... economically expanding the machine learning problem space

Platforms for Machine Learning, AI, & Data Science Best Practices

GroupBy #11: Python at Meta, Netflix Incremental Processing with Apache Iceberg, 2023 AI year in brief

Bigbird, TensorFlowJS and LinkedIn — Web models for your network.

Tools for Data Collection and Processing: Integrating Python, AI, and Machine Learning

GroupBy #11: Python at Meta, Netflix Incremental Processing with Apache Iceberg, 2023 AI year in brief