A Sunday Evening Activity with My Drink: Bringing an LLM to My Personal Data
Vivek Mehrotra
Chief Business Development Officer - Digital & AI | Circular Economy | GenAI
Last weekend, I tried to run a Large Language Model (LLM) on my Apple MacBook with an M1 chip and 8GB RAM. Given my machine's limitations, I knew running large, sophisticated models wasn't feasible. While I could have used Google Colab, I decided to run everything on my personal computer. Here's how I made it work, inspired by PrivateGPT:
Leveraging LLMs with Personal Data
LLMs provide a powerful natural language interface between humans and data. While they come pre-trained on extensive public datasets, they aren't tuned to your specific data, which might reside behind APIs, sit in SQL databases, or be trapped in PDFs, DOCs, Excel sheets, and slide decks.
My Approach
I utilised LlamaIndex to ingest, parse, index, and process my private documents stored locally. LlamaIndex supports HuggingFace embedding models, which generate the embeddings (vectors) that are then stored in a database.
To ensure my generative model could answer questions using my internal data, I used the Retrieval Augmented Generation (RAG) approach: my private documents are embedded and stored locally in a vector database, then retrieved as context at query time. I chose Qdrant (Milvus, ChromaDB, and MongoDB are among the alternatives).
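To make that concrete, here is a minimal ingestion sketch, assuming the post-0.10 LlamaIndex package layout and an embedded, on-disk Qdrant instance; the folder path, collection name, and embedding model are illustrative placeholders, not necessarily what I used:

```python
from llama_index.core import SimpleDirectoryReader, StorageContext, VectorStoreIndex
from llama_index.embeddings.huggingface import HuggingFaceEmbedding
from llama_index.vector_stores.qdrant import QdrantVectorStore
from qdrant_client import QdrantClient

# Load every supported file (PDF, DOCX, PPTX, ...) from a local folder
documents = SimpleDirectoryReader("./private_docs").load_data()

# HuggingFace sentence-transformer model that turns text chunks into vectors
embed_model = HuggingFaceEmbedding(
    model_name="sentence-transformers/all-MiniLM-L6-v2"
)

# Embedded Qdrant instance persisted to disk -- no server process needed
client = QdrantClient(path="./qdrant_data")
vector_store = QdrantVectorStore(client=client, collection_name="my_docs")
storage_context = StorageContext.from_defaults(vector_store=vector_store)

# Chunk, embed, and store the documents in Qdrant
index = VectorStoreIndex.from_documents(
    documents,
    storage_context=storage_context,
    embed_model=embed_model,
)
```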
For building the app, I used the FastAPI framework and wrapped everything in a Gradio UI (I also experimented with Streamlit but did not take it further).
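A rough sketch of that serving layer, assuming the `index` built in the ingestion sketch above lives in the same process and that a local LLM has been configured for LlamaIndex; `mount_gradio_app` is Gradio's standard way of attaching a UI to a FastAPI app:

```python
import gradio as gr
from fastapi import FastAPI

app = FastAPI()

# Wraps the index built during ingestion; assumes Settings.llm has been
# pointed at a local model, otherwise LlamaIndex defaults to OpenAI
query_engine = index.as_query_engine()

def answer(question: str) -> str:
    # Retrieve relevant chunks and let the LLM compose an answer
    return str(query_engine.query(question))

demo = gr.Interface(
    fn=answer,
    inputs=gr.Textbox(label="Ask about your documents"),
    outputs=gr.Textbox(label="Answer"),
    title="Chat with my private documents",
)

# Serve the Gradio UI from the FastAPI app at /ui
app = gr.mount_gradio_app(app, demo, path="/ui")
# Run with: uvicorn main:app --reload
```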
The Workflow
So here is the final flow: you ask a question about your private documents, and the app first creates an embedding for your question using SentenceTransformers.
Then it uses LangChain to search the vector store for the embeddings most similar to your question.
It returns the chunks (sources) that best match your question.
The LLM then generates an answer from these chunks: the question and the chunks are fed into the model, one by one or in batches, and the model produces an output.
That output is a natural-language answer that tries to satisfy your question.
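Put together, the query flow looks roughly like the sketch below. It uses sentence-transformers and the Qdrant client directly as a stand-in for the LangChain search step, with llama-cpp-python as the local LLM; the model path, collection name, and payload key are assumptions for illustration:

```python
from sentence_transformers import SentenceTransformer
from qdrant_client import QdrantClient
from llama_cpp import Llama

encoder = SentenceTransformer("all-MiniLM-L6-v2")
client = QdrantClient(path="./qdrant_data")
llm = Llama(model_path="./models/ggml-model-q4_0.gguf")  # hypothetical local model file

question = "Which university did I attend for my Master's?"

# 1. Embed the question
query_vector = encoder.encode(question).tolist()

# 2. Retrieve the best-matching chunks from the vector store
hits = client.search(collection_name="my_docs", query_vector=query_vector, limit=4)
context = "\n\n".join(hit.payload["text"] for hit in hits)  # "text" key is an assumption

# 3. Feed the question and chunks to the LLM and read back the answer
prompt = (
    "Answer the question using only the context below.\n\n"
    f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"
)
output = llm(prompt, max_tokens=256)
print(output["choices"][0]["text"])
```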
The Experiment
I kept unstructured data (DOC, PPT, PDF) in a local folder and ingested it all with a Python script. I then augmented the already pre-trained model with my personal data: my CV, academic documents from high school through my Master's, and other private documents. Then I had a fun session asking the model quirky questions about myself, enjoying a relaxed Sunday evening with a nice drink and my customised AI model running small on my resource-limited laptop. Trying it on Colab is next.
Takeaway
With the right tools and inspiration from projects like PrivateGPT, even basic hardware can be leveraged to create powerful, personalised AI experiences. If you're curious about LLMs and have some private data to experiment with, give it a try! There are many GitHub projects you can build on, and if you need more compute for free (Colab offers an NVIDIA Tesla T4 GPU with 16GB of VRAM), use Colab. That's next for me.