Our Picks of High-Performance LLM Tech Stacks
The stack we choose for building and deploying Large Language Model applications
Over the past few months, I've successfully built and deployed various applications utilizing large language models (LLMs). Two of these applications are currently in production, serving real users.
Through extensive trial and error, I've honed a tech stack that I find most effective for building these kinds of applications.
The Ultimate LLM Stack
TypeScript
TypeScript forms the backbone of my stack. Its type safety significantly improves developer experience and maintainability, and it lets me share type definitions seamlessly across the frontend and backend. That was a game changer compared to my early projects, which used Python for the backend and a different framework for the frontend.
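As a small illustration (all the names here are made up), the same interface can be imported by a Next.js API route and by the React components that call it, so a schema change breaks the build instead of breaking production:

```typescript
// shared/types.ts -- one definition used by both frontend and backend.
export interface SummaryRequest {
  documentId: string;
  maxTokens?: number;
}

export interface SummaryResponse {
  documentId: string;
  summary: string;
  model: string;
}

// Frontend helper: compiles against the exact same types the API route uses.
export async function fetchSummary(req: SummaryRequest): Promise<SummaryResponse> {
  const res = await fetch("/api/summary", {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify(req),
  });
  if (!res.ok) throw new Error(`Summary request failed: ${res.status}`);
  return (await res.json()) as SummaryResponse;
}
```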
Instructor
Instructor ensures typed LLM responses, providing predictability and consistency in outputs, crucial for robust application development.
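Here's a minimal sketch of the pattern, based on the instructor-js README as I recall it; the exact mode, model name, and schema are illustrative, so check the docs for your version:

```typescript
import Instructor from "@instructor-ai/instructor";
import OpenAI from "openai";
import { z } from "zod";

const oai = new OpenAI({ apiKey: process.env.OPENAI_API_KEY });
// Instructor wraps the OpenAI client and adds schema-validated responses.
const client = Instructor({ client: oai, mode: "TOOLS" });

// Hypothetical output shape for a contract-summary feature.
const ContractSummary = z.object({
  counterparty: z.string(),
  effectiveDate: z.string(),
  autoRenews: z.boolean(),
});

export async function extractSummary(contractText: string) {
  // response_model makes the call return a value typed by the Zod schema
  // and validated against it, instead of a raw string.
  return client.chat.completions.create({
    model: "gpt-4o",
    messages: [{ role: "user", content: `Summarize this contract:\n${contractText}` }],
    response_model: { schema: ContractSummary, name: "ContractSummary" },
  });
}
```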
LlamaIndexTS
LlamaIndexTS excels in document processing and retrieval, making it a powerful tool for extracting data from structured documents and generating embeddings.
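A rough sketch of the retrieval flow; LlamaIndexTS's method signatures have shifted a bit between releases, so treat this as illustrative rather than exact:

```typescript
import { Document, VectorStoreIndex } from "llamaindex";

export async function askAboutDocs(question: string, rawTexts: string[]) {
  // Wrap extracted text (e.g. from PDFs or contracts) as Documents.
  const docs = rawTexts.map((text) => new Document({ text }));

  // Chunk, embed, and index the documents (OpenAI embeddings by default).
  const index = await VectorStoreIndex.fromDocuments(docs);

  // Retrieve the most relevant chunks and synthesize an answer.
  const queryEngine = index.asQueryEngine();
  const response = await queryEngine.query({ query: question });
  return response.toString();
}
```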
Milvus
Milvus offers efficient vector storage and similarity search, making it ideal for managing embeddings. I recommend starting with Milvus Lite during development and scaling up with Zilliz for production environments.
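A minimal sketch of defining a collection for document-chunk embeddings with the Node SDK; the address, field names, and embedding dimension are assumptions, and parameter names vary slightly across SDK versions, so verify against the docs you're on:

```typescript
import { MilvusClient, DataType } from "@zilliz/milvus2-sdk-node";

// Local Milvus in development; swap in your Zilliz Cloud endpoint and
// credentials for production.
const milvus = new MilvusClient({
  address: process.env.MILVUS_ADDRESS ?? "localhost:19530",
});

export async function createChunkCollection() {
  // One row per document chunk: an auto-generated id, the raw text,
  // and its embedding vector (1536 dims assumes OpenAI embeddings).
  await milvus.createCollection({
    collection_name: "doc_chunks",
    fields: [
      { name: "id", data_type: DataType.Int64, is_primary_key: true, autoID: true },
      { name: "text", data_type: DataType.VarChar, max_length: 2048 },
      { name: "embedding", data_type: DataType.FloatVector, dim: 1536 },
    ],
  });
  // Index creation, inserts, and searches follow the same client pattern.
}
```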
MongoDB
MongoDB is my go-to for a flexible, scalable database solution. Its native support for JavaScript objects aligns perfectly with TypeScript, streamlining data management.
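For example, a typed collection keeps records honest end to end; the Conversation shape below is just an illustration:

```typescript
import { MongoClient, ObjectId } from "mongodb";

// Hypothetical document shape for storing LLM chat transcripts.
interface Conversation {
  _id?: ObjectId;
  userId: string;
  messages: { role: "user" | "assistant"; content: string }[];
  createdAt: Date;
}

export async function saveConversation(conv: Conversation) {
  const client = new MongoClient(process.env.MONGODB_URI ?? "mongodb://localhost:27017");
  try {
    await client.connect();
    // Typed collection: inserts and queries are checked against Conversation.
    const conversations = client.db("app").collection<Conversation>("conversations");
    const result = await conversations.insertOne(conv);
    return result.insertedId;
  } finally {
    await client.close();
  }
}
```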
Next.js
Next.js serves as my full-stack framework, providing server-side rendering capabilities and excellent performance out of the box. Its stable API allows for seamless integration with Copilot or GPTs for coding assistance.
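A typical pattern is to keep LLM calls inside a server-side route handler so API keys never reach the browser; the endpoint and payload below are illustrative:

```typescript
// app/api/summary/route.ts -- a Next.js App Router route handler.
import { NextResponse } from "next/server";

export async function POST(request: Request) {
  const { documentId } = await request.json();

  // The LLM / retrieval pipeline runs here on the server; the React
  // frontend simply fetches this route with the shared request type.
  const summary = `Summary for ${documentId} would be generated here.`;

  return NextResponse.json({ documentId, summary });
}
```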
Stytch
Stytch is my choice for authentication, offering robust B2B-focused features like OAuth and Magic Links, crucial for enterprise applications.
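As a sketch, sending a Magic Link with the Stytch Node SDK looks roughly like this; I'm showing the consumer-style call here, and the B2B client exposes parallel methods, so confirm the exact names against the SDK docs for your version:

```typescript
import * as stytch from "stytch";

// Credentials come from the Stytch dashboard.
const client = new stytch.Client({
  project_id: process.env.STYTCH_PROJECT_ID!,
  secret: process.env.STYTCH_SECRET!,
});

export async function sendLoginLink(email: string) {
  // Sends a Magic Link email; Stytch handles token creation and expiry.
  return client.magicLinks.email.loginOrCreate({ email });
}
```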
Logging+Eval
For LLM logging and analysis, a service like Velvet is essential. While I haven't used Velvet personally, the positive feedback it has received suggests it's a reliable choice for storing and analyzing LLM API responses.
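Whatever service you pick, the underlying idea is the same: capture each LLM call's request, response, and latency so they can be queried later. The sketch below is a generic wrapper, not Velvet's API, and logLlmCall is a hypothetical sink to be replaced by your logging service's ingestion mechanism:

```typescript
import OpenAI from "openai";

const openai = new OpenAI({ apiKey: process.env.OPENAI_API_KEY });

export async function loggedCompletion(
  messages: OpenAI.Chat.Completions.ChatCompletionMessageParam[],
) {
  const started = Date.now();
  const completion = await openai.chat.completions.create({
    model: "gpt-4o",
    messages,
  });

  // Record what went in, what came out, and how long it took.
  await logLlmCall({
    request: messages,
    response: completion.choices[0]?.message?.content ?? "",
    latencyMs: Date.now() - started,
  });

  return completion;
}

// Hypothetical sink; swap in the real ingestion call of your logging service.
async function logLlmCall(entry: { request: unknown; response: string; latencyMs: number }) {
  console.log(JSON.stringify(entry));
}
```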
Vercel
Vercel is my preferred hosting platform, thanks to its seamless integration with Next.js and excellent performance monitoring tools. It simplifies deployment and scaling, allowing me to focus on development.
HappyDevKit
HappyDevKit handles feature flag management efficiently, enabling controlled rollouts and A/B testing of new AI features, which is crucial for iterative development.
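The pattern is simple regardless of vendor: gate the experimental AI path behind a flag and fall back to the stable one. This is a generic sketch, not HappyDevKit's actual API, and both pipeline functions are placeholders:

```typescript
// Generic feature-flag interface; a real flag client would implement this.
interface FlagClient {
  isEnabled(flag: string, userId: string): Promise<boolean>;
}

export async function getAnswer(flags: FlagClient, userId: string, question: string) {
  if (await flags.isEnabled("new-rag-pipeline", userId)) {
    return answerWithNewRagPipeline(question); // experimental path, rolled out gradually
  }
  return answerWithCurrentPipeline(question); // stable path for everyone else
}

// Hypothetical implementations of the two pipelines.
async function answerWithNewRagPipeline(q: string) { return `new: ${q}`; }
async function answerWithCurrentPipeline(q: string) { return `current: ${q}`; }
```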
Sentry
Sentry provides robust error monitoring and alerting, ensuring application reliability by sending alerts directly to Slack and email.
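The setup is minimal: initialize the SDK with your DSN and capture exceptions around risky calls; routing alerts to Slack and email is configured in the Sentry dashboard rather than in code. The wrapper below is just one way to use it:

```typescript
import * as Sentry from "@sentry/node";

Sentry.init({
  dsn: process.env.SENTRY_DSN,
  tracesSampleRate: 0.1, // sample a fraction of transactions for performance data
});

export async function safeLlmCall<T>(fn: () => Promise<T>): Promise<T | null> {
  try {
    return await fn();
  } catch (err) {
    // Capture the failure (e.g. a timed-out or malformed LLM response)
    // so it shows up in Sentry and triggers the configured alerts.
    Sentry.captureException(err);
    return null;
  }
}
```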
Key Takeaways: Recommended Tech Stack for LLM Systems
These tools and frameworks have been instrumental in my development of B2B applications leveraging LLMs. While this stack works well for me, your needs may vary. Experiment with different combinations to find what works best for your projects. I'd love to hear your thoughts and experiences!