Tinkering, Innovation, and Automation: The raggity.ai Story: Part 2

Recap

Last year, while studying machine learning, I had an idea for a product that could save me hours of work each week and ensure a smooth experience for every customer I engaged with. Armed with both validation for the idea and my newfound technical skills, I dove right into building the solution.

The Systems Behind the Solution

To bring my vision to life, I built a few interconnected systems:

  • ParsePoint: A system designed to take a data source, parse the information in a specific style, and store it in a vector database (Pinecone).
  • SpiceRag: An agentic Retrieval-Augmented Generation (RAG) system that powered the entire operation.
  • SideKick: A user-friendly front-end chatbot that captured data from each interaction.

In this installment, I’ll focus on ParsePoint—specifically, how it leverages vector databases and an effective chunking strategy to process and store data.



ParsePoint: Harnessing the Power of Vector Databases

As I mentioned in part 1, vector databases are revolutionary because they store data as numerical representations rather than relying on manual tagging. Let’s break down the fundamentals:

Data as Vectors

  • Transformation: In a vector database, raw data is converted into a list of numbers—a vector—where each number represents a specific feature of the data (e.g., word frequency, context).
  • Multi-Dimensional Space: Think of each vector as a point on a map with many dimensions. Unlike a simple 2D map (with just x and y coordinates), data vectors can have hundreds or thousands of dimensions to capture subtle nuances.
  • Proximity Equals Similarity: Similar data points have similar vectors. If two articles cover the same topic, their vectors will be close together in this high-dimensional space, making it efficient to find related content.
  • Efficient Retrieval: When you query the database, your query is transformed into its own vector. The system then quickly compares this vector to those stored in the database to retrieve the most relevant information based on proximity.

In summary, a vector database is like a sophisticated map that transforms data into numerical coordinates. This transformation allows the system to quantify similarity and makes searches both faster and more accurate.
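
To make the "proximity equals similarity" idea concrete, here is a minimal sketch of similarity search in plain Python with NumPy. The three-dimensional toy vectors are purely illustrative; in practice an embedding model produces vectors with hundreds or thousands of dimensions, and a vector database such as Pinecone performs the comparison at scale.

```python
import numpy as np

def cosine_similarity(a: np.ndarray, b: np.ndarray) -> float:
    """Proximity in vector space: closer to 1.0 means more similar in direction."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

# Pretend these came from an embedding model (real vectors have far more dimensions).
stored = {
    "doc_a": np.array([0.9, 0.1, 0.3]),
    "doc_b": np.array([0.2, 0.8, 0.5]),
}
query = np.array([0.85, 0.15, 0.25])

# Rank the stored vectors by how close they sit to the query vector.
ranked = sorted(stored.items(), key=lambda kv: cosine_similarity(query, kv[1]), reverse=True)
print(ranked[0][0])  # the most relevant document id ("doc_a" here)
```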



The Challenge of Data Chunking

Simply sending all raw data to the vector database isn’t enough. If we break the data into individual words or sentences, we risk losing the context and nuance essential for meaningful retrieval. That’s where a well-thought-out chunking strategy comes into play.

Chunking Strategies Explored

Here are some common approaches (a small chunking sketch follows the list):

  • Fixed-Size Chunking: Split the data into chunks with a pre-defined token length.
  • Overlapping Chunks: Each chunk overlaps with its neighbors by 20–50 tokens to preserve context.
  • Sentence/Paragraph-Based Chunking: Use natural language boundaries rather than fixed token counts.
  • Semantic-Based Chunking: Leverage advanced techniques to capture complete ideas or intentions.
  • Hierarchical Chunking: Use the document’s inherent structure (like headers) to guide chunking.
  • Dynamic, Content-Aware Chunking: Allow the system to analyze the text and determine the optimal breakpoints.
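
As promised, here is a minimal sketch of the first two strategies combined: fixed-size chunks that overlap with their neighbors. The chunk size and overlap values are placeholders, and simple whitespace splitting stands in for the tokenizer of whatever embedding model you actually use.

```python
def chunk_with_overlap(text: str, chunk_size: int = 200, overlap: int = 30) -> list[str]:
    """Split text into fixed-size word windows that overlap to preserve context."""
    words = text.split()
    step = chunk_size - overlap  # how far the window slides each iteration
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        if start + chunk_size >= len(words):
            break  # the last window already reached the end of the text
    return chunks
```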

After weighing the pros and cons, and considering that my focus was on technical documentation, I opted for a combination of semantic-based chunking and overlapping chunks. I also ensured that each document's defining characteristics (such as tags for blog posts, walk-throughs, or troubleshooting guides) were captured as metadata. This approach allowed me to score and rank documents based on query relevance and gave the agent context on what to prioritize.
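
To illustrate the metadata idea, here is a hedged sketch of how a chunk might carry its document type so retrieved results can be re-scored. The tag names and boost weights are illustrative stand-ins, not the actual values ParsePoint uses.

```python
from dataclasses import dataclass

@dataclass
class Chunk:
    text: str
    doc_type: str   # e.g. "blog", "walkthrough", "troubleshooting" (illustrative tags)
    source: str     # identifier of the originating document

# Assumed boost weights; real values would come from tuning against your own queries.
TYPE_BOOST = {"troubleshooting": 1.3, "walkthrough": 1.1, "blog": 1.0}

def rank_score(similarity: float, chunk: Chunk) -> float:
    """Combine raw vector similarity with a document-type boost for final ranking."""
    return similarity * TYPE_BOOST.get(chunk.doc_type, 1.0)
```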

The Importance of Document Quality

One critical insight from my testing was that no matter how clever the chunking strategy, the quality of the original documentation is paramount. Poorly organized or poorly written docs simply can't be salvaged by any chunking technique. That makes sense: the intended audience is human readers, and they need a way to navigate the material too. Could you imagine pulling up technical docs and finding a single 100-page document with no headers or defining breaks? A nightmare.



Performance Optimizations and Technical Architecture

From an architectural standpoint, parallelization was key. In my early experiments, processing 50 MB of text into 70,000 vector records took over five minutes. With optimizations, I reduced that time to just 15 seconds while producing 36,000 high-quality records. This not only streamlined onboarding for new data sources but also made the system far more responsive.
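
For a sense of the pattern, here is a rough sketch of parallel ingestion, not the exact ParsePoint code. The embed_batch() helper, the index handle, the chunks list, the batch size, and the worker count are all hypothetical stand-ins for your own embedding call and Pinecone index.

```python
from concurrent.futures import ThreadPoolExecutor

def process_batch(batch):
    """Embed one batch of chunks and upsert it as a group of vector records."""
    vectors = embed_batch([c.text for c in batch])  # hypothetical embedding helper
    index.upsert(vectors=[                          # assumed Pinecone index handle
        {"id": f"{c.source}-{i}", "values": v, "metadata": {"doc_type": c.doc_type}}
        for i, (c, v) in enumerate(zip(batch, vectors))
    ])

# Split the chunk list into batches and process them concurrently.
batches = [chunks[i:i + 100] for i in range(0, len(chunks), 100)]
with ThreadPoolExecutor(max_workers=8) as pool:
    list(pool.map(process_batch, batches))
```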

For the vector database, I chose Pinecone because of its ease of use, thanks to their Python SDK and serverless offerings. I found Pinecone to be fast, reliable, and approachable.
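
For anyone curious what that looks like, here is a hedged sketch of the serverless workflow with the Pinecone Python SDK (v3-style API). The index name, dimension, cloud, and region are placeholders and need to match your own embedding model and account setup.

```python
from pinecone import Pinecone, ServerlessSpec

pc = Pinecone(api_key="YOUR_API_KEY")

# Create a serverless index; the dimension must match your embedding model's output.
pc.create_index(
    name="parsepoint-docs",  # placeholder index name
    dimension=1536,
    metric="cosine",
    spec=ServerlessSpec(cloud="aws", region="us-east-1"),
)

index = pc.Index("parsepoint-docs")

# Upsert a toy record, then query it back by vector similarity with metadata attached.
index.upsert(vectors=[
    {"id": "chunk-0", "values": [0.1] * 1536, "metadata": {"doc_type": "walkthrough"}}
])
results = index.query(vector=[0.1] * 1536, top_k=3, include_metadata=True)
print(results)
```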

There are many vector database options available today, and I believe that speed, along with transparency into how the database impacts your application, will be the key factors in determining the winner.


Looking Ahead

This integrated approach lays a strong foundation for further innovation. As each component evolves, the entire system becomes more adaptive, potentially incorporating real-time analytics, automated follow-ups, or even predictive insights to enhance the overall experience. The synergy between ParsePoint, SpiceRag, and SideKick not only streamlines the current workflow but also opens up exciting possibilities for future development.

What’s Next?

Now that we have tens of thousands of processed records in our vector database, the next question is: What do we do with them? In the upcoming installment, I’ll dive into SpiceRag—the agentic RAG system that leverages these records to power the entire operation. Stay tuned as we explore how SpiceRag transforms raw data into actionable insights.
