登录查看更多内容

The temporary definitive guide to building and operating LLM solutions in production environments

Chris Mann

AI Product Management. Former LinkedIn, IBM, Bizo, 1Password and several 0-1's.

发布日期: 2023年4月5日

I would love it if Cohere and OpenAI would write documentation on this subject. I call this "temporary" because everything is changing so fast.

Why am I asking for these docs? Because it will open up the market opportunity for their products. (more on this below)

This is a bit of a rant and a callout to the recent $10M seed round LangChain just closed.

When you read the documentation at Cohere and OpenAI , the best docs for using commercially available LLM's today, you might arrive at the thought that, hey they have all of the stuff I need. If you are implementing the most simple of GenAI solutions, many of which are very powerful and not to be discounted, then yes they do have what you need, and that's cool.

However, once you start bringing your own data to the party and start thinking about things like automating customer service and support, or you want to fine-tune a model (or even wonder if you should) you are in a whole new ball of wax that gets complicated fast.

What does it take to get a solution like this into a production that is safe, performant, cost-effective and dependable over time? What third-party tools are you going to need? Who are the players? Is there a sample code that shows how a more complex LLM solution with multiple applications is strung together? How do I test, version control and monitor the system over time? When should I use vector/semantic/neural search over fine-tuning? What is the temporary definitive guide to fine-tuning?

领英推荐

?? Top LLM Papers of the Week (December Week 2, 2024)

Kalyan KS 3 个月前

Superfast Vector Search for LLM, GPT, GenAI and More

Vincent Granville 1 年前

TestDevLab's Newsletter: January 2025 ??

TestDevLab 1 个月前

This is where companies like LangChain, Humanloop , Vectara , Pinecone , Weights & Biases Robust Intelligence come in. There are many more...

I call this document temporary because best practices are often not yet known and/or are evolving rapidly. I've watched videos where experts at 美国斯坦福大学 and places like Humanloop are in wonder and amazement at how fast this is all changing.

If I were at Cohere , OpenAI , Anthropic , AI21 Labs ... I would have sufficient effort in place to create this documentation and keep it updated as part of our formal help and learning center. This documentation would be as important as the product itself. This requires a dedicated durable team.

Why? Because it will open up the market opportunity for their products. How big is the current market for LLMs? Well, given LLMs can bring value to anyone who uses a computer / smart device, the market is MASSIVE. Companies are rushing to figure out how to add AI to their products. Yet to build something you need a RARE set of skills. This means we need to educate ourselves to expand the near-term and future market opportunity.

Check out this article about LangChain. They are building a lot of the needed connective tissue.

Is today Wednesday?

要查看或添加评论，请登录

Chris Mann的更多文章

Why I Joined You.com (and Why You Should Start Using You.com Today)

2024年11月4日

Why I Joined You.com (and Why You Should Start Using You.com Today)

I'm thrilled to announce that I have joined You.com as Head of Enterprise Products, where I'm working on something…

15 条评论
Why I am joining Martian

2024年5月23日

Why I am joining Martian

I am thrilled to share that I am joining Martian as Head of Product Marketing. Every once in a while, you can find or…

43 条评论
Long-term Token Usage and Costs Trends Insights from Martian's Founder

2024年3月28日

Long-term Token Usage and Costs Trends Insights from Martian's Founder

In a recent insightful conversation with Shriyash Upadhyay "Yash", the founder of Martian—a leader in the domain of…

8 条评论
Managing Costs of Your LLM Application - Part 2 of 2

2024年3月25日

Managing Costs of Your LLM Application - Part 2 of 2

Using ChatGPT4 is like driving a Porsche GT4, fast, fun, thrilling… and expensive… This is the second article in a…

11 条评论
The Implications of Mega-Context Models by Gemini and Claude

2024年3月5日

The Implications of Mega-Context Models by Gemini and Claude

It's been an exciting few weeks with the introduction of new models from Google’s Gemini on Feb 15th and, now on March…

8 条评论
AI Art with DALL-E

2024年3月1日

AI Art with DALL-E

Be careful. This is ADDICTIVE! I've been using DALL-E to create art for my rock band's posters and a few other things.

7 条评论
Beyond the ChatGPT Wrapper: Our Competitive Moat

2023年11月14日

Beyond the ChatGPT Wrapper: Our Competitive Moat

It's a common belief that a company can't establish a competitive edge by building on top of ChatGPT. My hands-on…

8 条评论
Miri’s AI Behavioral Change Methodology (MBCM)

2023年10月11日

Miri’s AI Behavioral Change Methodology (MBCM)

I'm just past two weeks at Miri. I've learned so many exciting things I want to share.

4 条评论
What I'm learning from a client project: How AI Can Transform the Construction Design Industry

2023年7月13日

What I'm learning from a client project: How AI Can Transform the Construction Design Industry

I’m working with a client in the construction design industry who wants to bring AI into their business to automate…

5 条评论
Surviving in an AI world - what you should know may not be what you think

2023年5月19日

Surviving in an AI world - what you should know may not be what you think

As I explored the essential abilities for thriving in a future dominated by AI, I discovered various perspectives that…

4 条评论

See all articles

The temporary definitive guide to building and operating LLM solutions in production environments

Chris Mann

AI Product Management. Former LinkedIn, IBM, Bizo, 1Password and several 0-1's.

领英推荐

Chris Mann的更多文章

社区洞察

其他会员也浏览了

Building an Internet of Truth at Verafy

#AGI? ARE WE WHERE YET?

Beginner’s Guide to Building & Evaluating RAG Apps

GPT: Developer Tips, Tricks & Techniques

Martell: DoD's last chance at getting AI right

AI for Associations in 2025

June 2024 DVC Pulse!

How to create an automated financial advisor API, voice-activated, with OpenAI

OpenAI: How to Build a Voice-activated Stock Market Advisor Chatbot

?? Run Powerful LLMs Locally on Your Machine! Here's How (Ollama + Enchanted + Ngrok + DeepSeek V3!) ??

领英推荐

Chris Mann的更多文章

Why I Joined You.com (and Why You Should Start Using You.com Today)

Why I am joining Martian

Long-term Token Usage and Costs Trends Insights from Martian's Founder

Managing Costs of Your LLM Application - Part 2 of 2

The Implications of Mega-Context Models by Gemini and Claude

AI Art with DALL-E

Beyond the ChatGPT Wrapper: Our Competitive Moat

Miri’s AI Behavioral Change Methodology (MBCM)

What I'm learning from a client project: How AI Can Transform the Construction Design Industry

Surviving in an AI world - what you should know may not be what you think

社区洞察

其他会员也浏览了

Building an Internet of Truth at Verafy

#AGI? ARE WE WHERE YET?

Beginner’s Guide to Building & Evaluating RAG Apps

GPT: Developer Tips, Tricks & Techniques

Martell: DoD's last chance at getting AI right

AI for Associations in 2025

June 2024 DVC Pulse!

How to create an automated financial advisor API, voice-activated, with OpenAI

OpenAI: How to Build a Voice-activated Stock Market Advisor Chatbot

?? Run Powerful LLMs Locally on Your Machine! Here's How (Ollama + Enchanted + Ngrok + DeepSeek V3!) ??