AIM Weekly 19-August-2024
Tim Spann ??
Principal Developer Advocate - Milvus, AIM, Python, LLM, GenAI, HuggingFace, DeveloperWeek Advisor, Apache NiFi --- @ --- Zilliz (creators of Milvus, world's most popular open-source vector database)
Mivus, Vector Database, Unstructured Data, Open Source, AI, GenAI, LLM, Machine Learning, Deep Learning, Java, Python, Kafka, Pulsar, NiFi, Flink
19-August-2024
Tim Spann @PaaSDev Milvus?—?Towhee?—?Attu?—?Feder?—?GPTCache?—?VectorDB Bench
AIM Weekly (Towhee?—?Attu?—?Milvus (Tim-Tam))
CODE + COMMUNITY
Please join my meetup group NJ/NYC/Philly/Virtual.
This is Issue #151
Join us at the next meetup in September.
Our Best?Friends
Milvus Adventures August 14, 2024 COMMUNITY Bussin' summer indeed! We have had a lot of legit talks from so many people... Tagged with rag, opensource…dev.to
Webinar Coming
Challenges in Structured Document Data Extraction at Scale with LLMs Join this webinar to learn about unstructured document processing.zilliz.com
Tutorials
bootcamp/bootcamp/tutorials/quickstart/apps/multimodal_rag_with_milvus at master ·… Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis…github.com
What is Faiss (Facebook AI Similarity Search)? - Zilliz blog Faiss (Facebook AI similarity search) is an open-source library for efficient similarity search of unstructured data…zilliz.com
NLP Basics: Tokens, N-Grams, and Bag-of-Words Models - Zilliz blog This post covers Natural Language Processing fundamentals that are essential to understanding all of today's language…zilliz.com
Neural Networks and Embeddings for Language Models - Zilliz blog Exploring neural network language models, specifically recurrent neural networks, and taking a sneak peek at how…zilliz.com
Sparse and Dense Embeddings - Zilliz blog Learn about sparse and dense embeddings, their use cases, and a text classification example using these embeddings.zilliz.com
Enhancing Information Retrieval with Sparse Embeddings | Zilliz Learn - Zilliz blog Explore the inner workings, advantages, and practical applications of learned sparse embeddings with the Milvus vector…zilliz.com
BGE-M3 and Splade: Two Popular Sparse Embedding Models - Zilliz blog In this blog, we've journeyed through the intricate world of vector embeddings and explored how BGE-M3 and Splade…zilliz.com
Comparing SPLADE Sparse Vectors with BM25 - Zilliz blog In general, there are two types of vectors: dense vectors and sparse vectors. While they can be utilized for similar…zilliz.com
Build a Multimodal RAG with Gemini, BGE-M3, Milvus and LangChain - Zilliz blog Multimodal RAG extends RAG by accepting data from different modalities as context. Learn how to build one with Gemini…zilliz.com
Multimodal RAG locally with CLIP and Llama3 - Zilliz blog A tutorial walks you through how to build a multimodal RAG with CLIP, Llama3, and Milvus.zilliz.com
Exploring the Frontier of Multimodal Retrieval-Augmented Generation (RAG) - Zilliz blog Multimodal RAG is an extended RAG framework incorporating multimodal data including various data types such as text…zilliz.com
Exploring OpenAI CLIP: The Future of Multi-Modal AI Learning - Zilliz blog Multimodal AI learning can get input and understand information from various modalities like text, images, and audio…zilliz.com
Build Better Multimodal RAG Pipelines with FiftyOne, LlamaIndex, and Milvus - Zilliz blog Enhance the capabilities of multimodal systems by efficiently leveraging text and visual data for improved data…zilliz.com
ColBERT: A Token-Level Embedding and Ranking Model - Zilliz blog Unlike traditional embedding models like BERT, which focus on pooling embeddings into a single vector, ColBERT retains…zilliz.com
A Beginner's Guide to Natural Language Processing - Zilliz blog Learn the intricacies of Natural Language Processing and how vector databases, like Zilliz Cloud, transform NLP with…zilliz.com
Key NLP technologies in Deep Learning - Zilliz blog An exploration of the evolution and fundamental principles underlying key Natural Language Processing (NLP)…zilliz.com
20 Useful Open Datasets for Natural Language Processing - Zilliz blog Learn the key criteria for selecting the ideal dataset for your NLP projects and explore 20 popular open datasets.zilliz.com
Top 10 Popular NLP Tools and Platforms - Zilliz blog An overview of the top ten NLP tools and platforms, highlighting their key features, applications, and advantages to…zilliz.com
NLP Basics: Tokens, N-Grams, and Bag-of-Words Models - Zilliz blog This post covers Natural Language Processing fundamentals that are essential to understanding all of today's language…zilliz.com
Top 10 Real-World NLP Applications - Zilliz blog NLP makes our lives much easier. Learn about the top 10 most popular NLP applications and how they have an impact on…zilliz.com
Top 20 NLP Models to Empower Your ML Application - Zilliz blog Learn about the 10 most popular LLMs taking 2023 by storm and another 10 basic NLP models.zilliz.com
NLP Essentials: Understanding Transformers in AI - Zilliz blog This article will introduce you to the field of Natural Language Processing (NLP) and the breakthrough architecture…zilliz.com
Neural Networks and Embeddings for Language Models - Zilliz blog Exploring neural network language models, specifically recurrent neural networks, and taking a sneak peek at how…zilliz.com
Large Language Models and Search | Zilliz Learn - Zilliz blog Explore the integration of Large Language Models (LLMs) and search technologies, featuring real-world applications and…zilliz.com
Top LLMs of 2024: Only the Worthy - Zilliz blog This blog introduces the six most influential large language models in 2024.zilliz.com
What is Prompt as Code (Prompt Engineering) Explores what prompt engineering is, how it works in NLP, and best practices for effective prompt engineering.zilliz.com
LangChain & Milvus: Enhancing ChatGPT's Intelligence and Efficiency - Zilliz blog Learn how LangChain and Milvus can be used to improve the performance and memory of LLMs.zilliz.com
A Guide to Using OpenAI Text Embedding Models for NLP Tasks - Zilliz blog A comprehensive guide to using OpenAI text embedding models for embedding creation and semantic search.zilliz.com
NLP and Vector Databases: Creating a Synergy for Advanced Processing - Zilliz blog Finding photos, recommending products, or enabling facial recognition, the power of vector databases lies in their…zilliz.com
Cool Stuff
Retrieval-Augmented Generation (RAG) with Milvus and BentoML | Milvus Documentation This guide demonstrates how to use an open-source embedding model and large-language model on BentoCloud with Milvus…milvus.io
Integrate Milvus with DSPy | Milvus Documentation This guide demonstrates how to use MilvusRM, one of DSPy's retriever modules, to optimize RAG programs. | v2.4.xmilvus.io
Airbyte: Open-Source Data Movement Infrastructure | Milvus Documentation Airbyte is an open-source data movement infrastructure for building extract and load (EL) data pipelines. It is…milvus.io
NVIDIA NIM | radtts-hifigan-tts Experience the leading models to build enterprise generative AI apps now.build.nvidia.com
Articles
What’s in the Air Tonight, Mr. Milvus. (Air Quality + Vector Database + RAG)?
What’s in the Air Tonight Mr. Milvus? Open Source, Air Quality, REST, JSON, Python, Milvus, Vector Databasemedium.com
AI and Vectors?—?Meetup Report?
AI Camp?—?15 August 2024 Report?
Milvus?—?The Unstructured Olympics of the Mind? AI? Data??
From Edge to the Cloud and Back Again?
Milvus on EKS?
A step-by-step guide on deploying the Milvus vector database on AWS using managed services such as Amazon EKS, S3, MSK…milvus.io
Milvus with NVIDIA for Retail Rag?
Retail Shopping Advisor - Technical Brief Retail Shopping Advisor - Technical Briefresources.nvidia.com
Work Flows Generative AI?
Technical Brief Build Generative AI chatbots that accurately answer domain-specific queries using latest informationdocs.nvidia.com
Landscape of Gen AI Ecosystem Beyond LLMs and Vector Databases?
The Landscape of GenAI Ecosystem: Beyond LLMs and Vector Databases - Zilliz blog Initially, Large Language Models (LLMs) and vector databases captured the most attention. However, the GenAI ecosystem…zilliz.com
What is Information Retrieval??
What is Information Retrieval? A Comprehensive Guide. - Zilliz blog Information retrieval (IR) is the process of efficiently retrieving relevant information from large collections of…zilliz.com
NVIDIA Nemo Curator?
Curating Custom Datasets for LLM Parameter-Efficient Fine-Tuning with NVIDIA NeMo Curator | NVIDIA… In a recent post, we discussed how to use NVIDIA NeMo Curator to curate custom datasets for pretraining or continuous…developer.nvidia.com
Evaluating LLM Conversations?
LLM-Eval: A Simplified Approach to Evaluating LLM Conversations - Zilliz blog LLM-Eval is an approach to simplifying and automating the evaluation of LLM conversation quality.zilliz.com
Pokeman Embeddings?
The Super Effectiveness of Pokémon Embeddings Using Only Raw JSON and Images Embeddings encourage engineers to go full YOLO because it's actually rewarding to do so!minimaxir.com
LLM Evaluation?
Milvus on LinkedIn: #llm #evaluation #demo #learn #ai LLM-Eval: used to evaluate the response quality of an LLM. This article covers: ?? What is LLM-Eval? ?? LLM-Eval…www.dhirubhai.net
Agent Q?
Agent Q: Breakthrough AI Research in Self-Healing Web Agents | MultiOn - MultiOn AI MultiOn's Agent Q: AI research breakthrough in web navigation. 340% performance boost with self-healing capabilities.www.multion.ai
The Landscape of OS Licensing in AI https://medium.com/@zilliz_learn/the-landscape-of-open-source-licensing-in-ai-a-primer-on-llms-and-vector-databases-5effbccbccd5
Unlocking the Secrets of GPT 4.0 https://medium.com/@zilliz_learn/unlocking-the-secrets-of-gpt-4-0-and-large-language-models-0020f61b62c2
AI Databases Ensuring the Quality of LLMs in Chatbots https://www.opensourceforu.com/2024/08/ai-databases-ensuring-the-quality-of-llms-in-chatbots/
Bringing Confidentially to Vector Search https://developer.nvidia.com/blog/bringing-confidentiality-to-vector-search-with-cyborg-and-rapids-cuvs/
Google ImageGen3 https://arxiv.org/pdf/2408.07009
领英推荐
AI Bringing Voice to Peopl https://indianexpress.com/article/world/als-stole-his-voice-ai-retrieved-it-9516953/
InfluxDB plus Milvus https://www.influxdata.com/blog/time-series-influxdb-vector-database/
End to End Rag with Airbyte https://airbyte.com/tutorials/end-to-end-rag-with-airbyte-cloud-microsoft-sharepoint-and-milvus-zilliz
How to Prune https://developer.nvidia.com/blog/how-to-prune-and-distill-llama-3-1-8b-to-an-nvidia-llama-3-1-minitron-4b-model/
Streamling the Deployment of Enterprise GenAI https://medium.com/@zilliz_learn/streamlining-the-deployment-of-enterprise-genai-apps-with-efficient-management-of-unstructured-data-2d3b1a2f2d85
Learn GenAI?
LangChain?—?Milvus https://api.python.langchain.com/en/latest/vectorstores/langchain_community.vectorstores.milvus.Milvus.html
Hybrid Search in Rag Apps https://ai.plainenglish.io/the-role-of-hybrid-search-in-rag-applications-29bf46b95152
Understanding Transformers https://medium.com/@zilliz_learn/nlp-essentials-understanding-transformers-in-ai-29d9d973a1fc
AI Agents?
Pandas, AI, OLLAMA?
Flink, Kafka, GenAI, Real-Time?
How to import new model from HuggingFace to Ollama https://medium.com/@raphael.mansuy/how-to-import-a-new-model-from-huggingface-for-ollama-9dfe9ffe1a0b
LangGraph Guide https://bhavikjikadara.medium.com/langgraph-a-comprehensive-guide-for-beginners-ef17d3dd5383
High Speed Inference with LLAMA CPP and Vicuna https://pub.towardsai.net/high-speed-inference-with-llama-cpp-and-vicuna-on-cpu-136d28e7887b
Videos
AI Camp Videos - Pose Estimation
Fun Unstructured Friday
Quick Edge Demo
NYC Replacement Talk
Live Fun Friday with Unstructed Data Preview
High Speed Inference with LLAMA CPP and Vicuna
Unstructured Data Processing at the Edge Webinar
Unstructured Meetup SF
Building an Agentic RAG locally with Milvus, Ollama and Llama Agents
Slides
Unstructured Data Processing from Cloud to Edge Webinar - Download as a PDF or view online for freewww.slideshare.net
Implement Agentic RAG Using Claude 3.5 Sonnet, LlamaIndex, and Milvus - Download as a PDF or view online for freewww.slideshare.net
Events
August 20, 2024: DotNet Conf Virtual AI?
Join the?.NET Conf Focus on AI free virtual event August 20 2024 to learn about the newest developments across the?.NET…focus.dotnetconf.net
September 18, 2024: Unstructured Data Meetup NYC?
This is an in-person event! Registration is required to get in. Topic: Connecting your unstructured data with…lu.ma
Unstructured Data Meetup New York Book Tickets for Unstructured Data Meetup New York Hosted By Unstructured Data Meetup. Event starts on Tuesday, 24…allevents.in
October 23, 2024: Unstructured Data Meetup NYC?
Unstructured Data Meetup New York · Luma This is an in-person event! Registration is required to get in. Topic: Connecting your unstructured data with…lu.ma
October 27–29, Raleigh, NC?—?All Things Open https://2024.allthingsopen.org/speakers/timothy-spann?
Advanced Retrieval Augmented Generation (RAG) Techniques - All Things Open 2024 In 2023, we saw many simple retrieval augmented generation (RAG) examples being built. However, most of these examples…2024.allthingsopen.org
October 31?—?Live stream from my Halloween decorations with three 12 foot skeletons
November 5–7, 10–12, 2024: CloudX. Online/Santa Clara.? https://www.developerweek.com/cloudx/
November 13–15, 2024: Build Stuff. Online. Adding Generative AI to Real-Time Streaming Pipelines
Software Development Conference | Developer Meetups | Online Courses Want to grow your career in the field of software development? Save the date: November 13-15, 2024. The upcoming Build…www.buildstuff.events
November 19, 2024: XtremePython. Online.?
XtremePython 2024 Online Conference We are excited to welcome the entire Python community to join us in the coming XtremePython online conference. We are…xtremepython.dev
November 21, 2024: Big Data Conference 2024 EU
November 21, 2024: Unstructured Data Meetup NYC https://lu.ma/cqxuproe
December 4, 2024: Grace Hopper Celebration?—?Open Source?—?Milvus https://ghc.anitab.org/open-source/
December 10, 2024: Unstructured Data Meetup NYC https://lu.ma/u2ijucyv
Code
Models
Tools
? 2020–2024 Tim Spann https://www.youtube.com/@FLaNK-Stack
??? Videos
https://www.youtube.com/@MilvusVectorDatabase/videos
X Twitter - / milvusio
https://x.com/milvusio
?? Linkedin: / zilliz
https://www.dhirubhai.net/company/zilliz/
?? GitHub
https://github.com/milvus-io/milvus
?? Invitation to join discord: / discord https://discord.com/invite/FjCMmaJng6
https://discord.gg/9jdMRPJb?event=1273364262710022209
Python/GenAI Dev | 3D Artist | Startup Founder | Social Media Analyst
1 周LinkedIn is your golden opportunity for professional growth, but have you ever thought of unlocking even more value from it? With DSPy and Pandas, you can dive deep into the vast ocean of LinkedIn posts to fish out insights that are not only relevant but game-changing. Imagine understanding market trends, gauging audience interest, or identannoying competitors' strategies with just a few clicks! Let DSPy and Pandas do the heavy lifting as you sit back, analyze and plan your next big move. https://www.artificialintelligenceupdate.com/analyze-linkedin-posts-with-dspy-and-pandas/riju/ #learnmore #DataScience #LinkedInAnalytics #DSPy #Pandas #ProfessionalGrowth