Qdrant
Qdrant is an open-source vector database and vector similarity search engine, also available as a fully managed cloud service, that allows users to:
- Store, search, and manage vector embeddings
- Add payloads to vectors to refine searches and provide useful information to users
Qdrant offers a production-ready service with an API and is designed for high performance at massive scale. Vector databases have become the go-to place for storing and indexing representations of structured and unstructured data: the vector embeddings generated by embedding models. Vector stores are now an integral part of building applications on deep learning models, especially Large Language Models. In this ever-evolving landscape, Qdrant is a relatively recent and feature-packed entrant.
Embeddings
Vector embeddings are a means of expressing data in numerical form, that is, as a numerical vector of points in an n-dimensional space, regardless of the type of data: text, photos, audio, video, and so on. Embeddings enable us to group related data together. Certain models can transform inputs into such vectors; a well-known example is Word2Vec, an embedding model created by Google that translates words into vectors. Large Language Models likewise rely on embedding models that convert their input text into vector representations.
What Embeddings Are Used For
One advantage of translating words into vectors is that it makes them comparable. A computer cannot compare two words directly, but given their vector embeddings as numerical input, it can. Words with similar embeddings can then be grouped together: the terms King, Queen, Prince, and Princess will appear in one cluster because they are related to one another.
In this sense, embeddings help us locate words related to a given term. The same idea applies to sentences: given an input sentence, we can retrieve related sentences from the supplied data. This serves as the foundation for numerous use cases, including chatbots, sentence similarity, anomaly detection, and semantic search. The chatbots we develop to answer questions based on a PDF or document we provide make use of this notion of embeddings, and generative Large Language Model applications use the same method to obtain content related to the queries supplied to them.
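The grouping idea described above can be sketched with cosine similarity on toy vectors. The three-dimensional "embeddings" below are made-up illustrative values, not the output of a real model; real embedding models produce vectors with hundreds or thousands of dimensions.

```python
from math import sqrt

def cosine_similarity(a, b):
    """Cosine of the angle between two vectors: closer to 1.0 means more similar."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sqrt(sum(x * x for x in a))
    norm_b = sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

# Toy 3-dimensional "embeddings" (illustrative values, not from a real model)
king = [0.9, 0.8, 0.1]
queen = [0.9, 0.7, 0.2]
banana = [0.1, 0.2, 0.9]

print(cosine_similarity(king, queen))   # high: related words sit close together
print(cosine_similarity(king, banana))  # low: unrelated words are far apart
```

With real embeddings, the same comparison is what lets a semantic search engine rank documents by relatedness to a query.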
Know the Qdrant Terminology
To get a smooth start with Qdrant, it is good practice to become familiar with the terminology and the main components of the Qdrant vector database.
Collections
Collections are named sets of points, where each point contains a vector, an ID, and an optional payload. Vectors in the same collection must share the same dimensionality and are compared with a single chosen distance metric.
Distance Metrics
Distance metrics measure how close vectors are to each other; the metric is selected when a collection is created. Qdrant provides the following distance metrics: Dot, Cosine, and Euclidean.
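Qdrant computes these metrics internally, but their definitions are simple enough to sketch in plain Python. This is only a conceptual illustration of what each metric measures, not Qdrant's implementation:

```python
from math import sqrt

def dot(a, b):
    # Dot product: larger means more similar (sensitive to vector magnitude)
    return sum(x * y for x, y in zip(a, b))

def cosine(a, b):
    # Cosine similarity: dot product of the normalized vectors, range [-1, 1]
    return dot(a, b) / (sqrt(dot(a, a)) * sqrt(dot(b, b)))

def euclidean(a, b):
    # Euclidean distance: smaller means more similar
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

a, b = [1.0, 2.0, 3.0], [2.0, 4.0, 6.0]
print(dot(a, b))        # 28.0
print(cosine(a, b))     # 1.0 (b points in exactly the same direction as a)
print(euclidean(a, b))  # ~3.74
```

Note that Dot and Cosine are similarity scores (higher is closer), while Euclidean is a distance (lower is closer); Cosine ignores magnitude, which is why it is a common default for text embeddings.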
Points
Points are the fundamental entity within Qdrant. A point consists of a vector embedding, an ID, and an associated payload:
- id: a unique identifier for each vector embedding
- vector: a high-dimensional representation of data, which can come from structured or unstructured sources such as images, text, documents, PDFs, videos, or audio
- payload: an optional JSON object containing data associated with a vector. This can be considered similar to metadata, and we can use it to filter the search process
Storage
Qdrant provides two storage options:
- In-Memory Storage: Stores all vectors in RAM, maximizing speed because disk access is needed only for persistence tasks.
- Memmap Storage: Creates a virtual address space linked to a file on disk, balancing speed and persistence requirements.