The LOR Stack
John Willis
As an accomplished author and innovative entrepreneur, I am deeply passionate about exploring and advancing the synergy between Generative AI technologies and the transformative principles of Dr. Edwards Deming.
During the late 1990s, the tech world adopted the LAMP stack, which consisted of Linux, Apache, MySQL, and PHP/Perl/Python. This combination was a game-changer for web development, clarifying the path through the internet's early and chaotic days.
Fast forward to a chat I had about my GenAI for DevOps workshops. We ended up drawing parallels to the classic LAMP stack and, for the GenAI era, jokingly dubbed the new combination the LOR Stack: Langchain/LlamaIndex, Observability, and RAG. It's a fresh take on today's challenges. Langchain and LlamaIndex loosely play Apache's role by managing the operations of Large Language Models (LLMs). RAG (Retrieval-Augmented Generation) introduces the concept of Vector Database Management Systems (VDMS), which are analogous to MySQL except that the data is stored not as relational tables but as vector embeddings.
Observability, or the "O" in LOR, shifts gears from traditional service monitoring to a nuanced view of LLM operations. It checks for relevance and correctness and ensures our GenAI isn't hallucinating. In one of my workshops, I showcased this process in action: a demo Python program in a Jupyter Notebook analyzed an academic paper, organized the data with Langchain's chunking, and loaded it into MongoDB Atlas Vector Search. The open-source project Phoenix Arize monitored the accuracy of our queries and responses.
Let’s look at some of the code examples in the demo program.
This demo program loads the GPT-4 Technical Report research paper into a vector database. Langchain is used for data formatting (chunking), loading (embedding), and querying an LLM that uses the paper, via the vector database, as its source of truth.
The first section of code installs the necessary Langchain tools, Arize's observability tool, and Pymongo (for MongoDB Atlas Vector Search).
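The original notebook's install cell wasn't reproduced here, so the following is a reconstruction; the exact package names and versions the demo pinned may differ, and `langchain-mongodb`, `langchain-openai`, and `pypdf` are assumptions based on how recent LangChain releases split these integrations out.

```shell
# Reconstructed install cell -- package names are assumptions, not the
# notebook's exact list. arize-phoenix is the open-source Phoenix tool.
pip install langchain langchain-community langchain-openai \
            langchain-mongodb pymongo pypdf arize-phoenix
```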
We call the PDF loader library to fetch the research paper. After loading the PDF, we begin chunking. Chunking ensures that the data you are vectorizing is clustered correctly in the vector database and that retrieval strategies are optimal.
RecursiveCharacterTextSplitter is a common chunking strategy used by Langchain. This method avoids splitting sentences, paragraphs, and pages in half.
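To show why this splitter keeps semantic units intact, here is a dependency-free sketch of the idea behind RecursiveCharacterTextSplitter: try the coarsest separator first (paragraph breaks), and only fall back to line breaks, spaces, and finally hard character cuts when a piece is still too large. This is an illustration of the concept, not LangChain's actual implementation.

```python
def recursive_split(text, chunk_size, separators=("\n\n", "\n", " ", "")):
    """Split text into chunks of at most chunk_size characters, preferring
    paragraph breaks over line breaks over spaces -- the idea behind
    LangChain's RecursiveCharacterTextSplitter (simplified sketch)."""
    if len(text) <= chunk_size:
        return [text]
    sep, rest = separators[0], separators[1:]
    if sep == "":
        # Last resort: hard split at chunk_size boundaries.
        return [text[i:i + chunk_size] for i in range(0, len(text), chunk_size)]
    chunks, current = [], ""
    for part in text.split(sep):
        candidate = current + sep + part if current else part
        if len(candidate) <= chunk_size:
            current = candidate  # keep packing pieces into the current chunk
        else:
            if current:
                chunks.append(current)
            if len(part) > chunk_size:
                # This piece alone is too big: retry with a finer separator.
                chunks.extend(recursive_split(part, chunk_size, rest))
                current = ""
            else:
                current = part
    if current:
        chunks.append(current)
    return chunks

paper_text = "para one.\n\npara two is a bit longer.\n\npara three."
print(recursive_split(paper_text, 30))
```

Notice that the three paragraphs come back whole: the splitter never cuts one in half unless it exceeds the chunk size on its own.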
The next step in this Python program is to set up the vector database. In this example, we use Langchain to convert our documents into vector embeddings by calling OpenAI's default embedding model. Langchain also calls the Pymongo API to load the embeddings into MongoDB's Atlas Vector Database.
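The load step can be sketched as follows. The PDF filename, database, collection, and index names are illustrative placeholders, and the import paths reflect recent LangChain releases (the original notebook may differ); the network calls only run when credentials are present, so the pure helper shows the Atlas vector index definition the collection would need.

```python
import os

def atlas_vector_index(num_dims: int = 1536) -> dict:
    """Atlas Vector Search index definition for the `embedding` field.
    1536 matches OpenAI's default text-embedding model output size."""
    return {
        "fields": [
            {
                "type": "vector",
                "path": "embedding",
                "numDimensions": num_dims,
                "similarity": "cosine",
            }
        ]
    }

# The real pipeline only runs when an OpenAI key and an Atlas cluster are
# configured; all names below are illustrative, not the notebook's exact ones.
if os.environ.get("OPENAI_API_KEY") and os.environ.get("MONGODB_ATLAS_URI"):
    from pymongo import MongoClient
    from langchain_community.document_loaders import PyPDFLoader
    from langchain.text_splitter import RecursiveCharacterTextSplitter
    from langchain_openai import OpenAIEmbeddings
    from langchain_mongodb import MongoDBAtlasVectorSearch

    pages = PyPDFLoader("gpt-4-technical-report.pdf").load()  # placeholder path
    chunks = RecursiveCharacterTextSplitter(
        chunk_size=1000, chunk_overlap=100
    ).split_documents(pages)

    collection = MongoClient(os.environ["MONGODB_ATLAS_URI"])["demo"]["gpt4_report"]
    MongoDBAtlasVectorSearch.from_documents(
        documents=chunks,
        embedding=OpenAIEmbeddings(),  # default model -> 1536-dim vectors
        collection=collection,
        index_name="vector_index",
    )
```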
Here’s an example of how the data looks in MongoDB’s Atlas. Notice the field called “embedding,” which holds the vectorized data: a list of floating-point numbers forming a high-dimensional vector that represents the data’s location in embedding space. In this example, we used OpenAI’s default embedding model, which produces a 1536-dimensional vector. One of MongoDB’s powerful features is that it combines JSON document data and vector data in the same database and the same object.
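To make the “embedding” field concrete, here is a dependency-free toy: two chunk records shaped like the Atlas documents above (text and embedding side by side), ranked against a query vector with cosine similarity, one of the metrics Atlas Vector Search supports. Three dimensions stand in for the real 1536 so the arithmetic stays visible; the texts and numbers are invented for illustration.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two embedding vectors; nearby meanings
    produce nearby vectors, so a higher cosine means a better match."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

# Toy stand-ins for Atlas records: JSON fields and the embedding array
# live side by side in one document (real embeddings are 1536-dimensional).
docs = [
    {"text": "GPT-4 exam results", "embedding": [0.9, 0.1, 0.0]},
    {"text": "training infrastructure", "embedding": [0.1, 0.9, 0.2]},
]
query_embedding = [0.8, 0.2, 0.1]  # pretend embedding of a user question

best = max(docs, key=lambda d: cosine_similarity(query_embedding, d["embedding"]))
print(best["text"])  # the chunk whose vector points closest to the query's
```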
Once the PDF file has been converted into a vector database, we must tell Langchain to start a tracing session if we want to use the Phoenix Arize observability tool. Following is an example of Langchain starting tracing. Phoenix Arize is open source and straightforward to set up, which is one of my favorite things about it. It uses “LLM as a Judge” as its default evaluation setup, which lets you skip taking a Coursera class to set up evaluation templates, flows, and ground truth. You can still do a deep dive into evaluations with Arize if you want; you just don’t have to.
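The tracing setup can be sketched like this. The `ENABLE_PHOENIX` opt-in flag is a hypothetical guard added here so the sketch does nothing without explicit consent (launching Phoenix starts a local web UI); the Phoenix API names are from recent arize-phoenix releases and may differ in the version the demo used.

```python
import os

# Hypothetical opt-in flag -- Phoenix spins up a local web UI, so the
# sketch only starts it when explicitly enabled.
phoenix_enabled = os.environ.get("ENABLE_PHOENIX") == "1"
if phoenix_enabled:
    import phoenix as px
    from phoenix.trace.langchain import LangChainInstrumentor

    session = px.launch_app()             # local Phoenix UI for traces
    LangChainInstrumentor().instrument()  # trace every Langchain call from here on
    print("Phoenix UI:", session.url)
```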
Now, we can run a query against the research paper PDF using Langchain and MongoDB’s Atlas Vector Search.
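A sketch of the query step, assuming the embeddings were already loaded into Atlas as described above. It reconnects to the same (illustrative) namespace and index, wraps the store in a retriever, and asks a question; the RetrievalQA chain and the question text are my stand-ins, and the code only runs when credentials are configured.

```python
import os

# Only runs with an OpenAI key and an Atlas cluster; namespace and index
# names must match whatever was used at load time (illustrative here).
configured = bool(
    os.environ.get("OPENAI_API_KEY") and os.environ.get("MONGODB_ATLAS_URI")
)
if configured:
    from langchain_openai import OpenAIEmbeddings, ChatOpenAI
    from langchain_mongodb import MongoDBAtlasVectorSearch
    from langchain.chains import RetrievalQA

    vector_search = MongoDBAtlasVectorSearch.from_connection_string(
        os.environ["MONGODB_ATLAS_URI"],
        "demo.gpt4_report",          # db.collection used at load time
        OpenAIEmbeddings(),
        index_name="vector_index",
    )
    qa = RetrievalQA.from_chain_type(
        llm=ChatOpenAI(),
        retriever=vector_search.as_retriever(),  # vector search supplies context
    )
    answer = qa.invoke({"query": "What does the report say about exam performance?"})
    print(answer["result"])
```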
Finally, we can look at the dynamic link Phoenix Arize created to see the trace. In future posts, I will demonstrate how to use evaluation tools (i.e., LLM Observability tools) to monitor correctness, relevance, and hallucinations.
The transition from LAMP to LOR represents our technological development as we embrace the intricacies and potential of GenAI. It demonstrates our ability to adapt and progress, discovering novel ways to innovate and propel ourselves forward. Digital narratives are constantly evolving.
#Tech #AI #AgenticAI #RL #ResponsibleAI #BeingHuman (maybe some Neuroscience).
11 months ago: Tools, John! Not just because you get LOTR then.
CEO, Telematica Inc.
11 months ago: Great post, John Willis! This very topic has been on my mind recently. I've been coming at it from (…you guessed it) an enterprise / production point of view, and I'm afraid that I'm about to add to the alphabet soup. First, I completely agree with the inclusion of OBSERVABILITY and RAG as first-class citizens in the 'new stack'. Note: RAG for me includes complementary data management architectures like graph DBs, SQL, and NoSQL, which permit multiply indexed, hybrid approaches. (But let's let RAG be the umbrella concept.) FRAMEWORKS (which you call out as the 'L': LangChain, LlamaIndex) are the necessary 'active abstraction layers' required to take an implementation from experiment to prototype to production. Soon to join this pair are frameworks that address multi-agent approaches, such as Autogen. Third, Observability serves and services a number of essential operational aspects, including EVALUATION and DATA GOVERNANCE, which will be increasingly vital in addressing numerous measures of quality and alignment. There you have it: O R F E G … OK. OK. You're right. It does NOT roll off the tongue.
IT Professional
11 months ago: We are getting to the point where the market is going to settle on a few generic stacks to get us to mass adoption. If not LOR, something. Looking forward to the next post on correctness and relevance. Keep the engaging dialogue going, John.
DevOps Evangelist & Spiritual Coach | DevOps Ambassador, Speaker and Author | I help people 100x in Professional and Personal life through Spiritual Wisdom, Healing and Technology
11 months ago: Nice one, John Willis. Is there a recording of one of your workshops? I would love to try it out.
Head of Open Source. Building the future of AI Observability at Arize AI
11 months ago: Nice one!!