AIM Weekly 19-August-2024

AIM Weekly 19-August-2024

Mivus, Vector Database, Unstructured Data, Open Source, AI, GenAI, LLM, Machine Learning, Deep Learning, Java, Python, Kafka, Pulsar, NiFi, Flink

19-August-2024

Tim Spann @PaaSDev Milvus?—?Towhee?—?Attu?—?Feder?—?GPTCache?—?VectorDB Bench

AIM Weekly (Towhee?—?Attu?—?Milvus (Tim-Tam))

https://github.com/milvus-io/milvus?utm_source=partner&utm_medium=referral&utm_campaign=2024_newsletter_tspann-ai-newsletters_external

https://www.youtube.com/@FLaNK-Stack

https://medium.com/@tspann/subscribe

https://ossinsight.io/analyze/tspannhw


CODE + COMMUNITY

Please join my meetup group NJ/NYC/Philly/Virtual.

https://www.meetup.com/unstructured-data-meetup-new-york/?utm_source=partner&utm_medium=referral&utm_campaign=2024_newsletter_tspann-ai-newsletters_external


This is Issue #151

Join us at the next meetup in September.

Our Best?Friends

Milvus Adventures August 14, 2024 COMMUNITY Bussin' summer indeed! We have had a lot of legit talks from so many people... Tagged with rag, opensource…dev.to

Webinar Coming

Challenges in Structured Document Data Extraction at Scale with LLMs Join this webinar to learn about unstructured document processing.zilliz.com

Tutorials

bootcamp/bootcamp/tutorials/quickstart/apps/multimodal_rag_with_milvus at master ·… Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis…github.com

What is Faiss (Facebook AI Similarity Search)? - Zilliz blog Faiss (Facebook AI similarity search) is an open-source library for efficient similarity search of unstructured data…zilliz.com

NLP Basics: Tokens, N-Grams, and Bag-of-Words Models - Zilliz blog This post covers Natural Language Processing fundamentals that are essential to understanding all of today's language…zilliz.com

Neural Networks and Embeddings for Language Models - Zilliz blog Exploring neural network language models, specifically recurrent neural networks, and taking a sneak peek at how…zilliz.com

Sparse and Dense Embeddings - Zilliz blog Learn about sparse and dense embeddings, their use cases, and a text classification example using these embeddings.zilliz.com

Enhancing Information Retrieval with Sparse Embeddings | Zilliz Learn - Zilliz blog Explore the inner workings, advantages, and practical applications of learned sparse embeddings with the Milvus vector…zilliz.com

BGE-M3 and Splade: Two Popular Sparse Embedding Models - Zilliz blog In this blog, we've journeyed through the intricate world of vector embeddings and explored how BGE-M3 and Splade…zilliz.com

Comparing SPLADE Sparse Vectors with BM25 - Zilliz blog In general, there are two types of vectors: dense vectors and sparse vectors. While they can be utilized for similar…zilliz.com

Build a Multimodal RAG with Gemini, BGE-M3, Milvus and LangChain - Zilliz blog Multimodal RAG extends RAG by accepting data from different modalities as context. Learn how to build one with Gemini…zilliz.com

Multimodal RAG locally with CLIP and Llama3 - Zilliz blog A tutorial walks you through how to build a multimodal RAG with CLIP, Llama3, and Milvus.zilliz.com

Exploring the Frontier of Multimodal Retrieval-Augmented Generation (RAG) - Zilliz blog Multimodal RAG is an extended RAG framework incorporating multimodal data including various data types such as text…zilliz.com

Exploring OpenAI CLIP: The Future of Multi-Modal AI Learning - Zilliz blog Multimodal AI learning can get input and understand information from various modalities like text, images, and audio…zilliz.com

Build Better Multimodal RAG Pipelines with FiftyOne, LlamaIndex, and Milvus - Zilliz blog Enhance the capabilities of multimodal systems by efficiently leveraging text and visual data for improved data…zilliz.com

ColBERT: A Token-Level Embedding and Ranking Model - Zilliz blog Unlike traditional embedding models like BERT, which focus on pooling embeddings into a single vector, ColBERT retains…zilliz.com

A Beginner's Guide to Natural Language Processing - Zilliz blog Learn the intricacies of Natural Language Processing and how vector databases, like Zilliz Cloud, transform NLP with…zilliz.com

Key NLP technologies in Deep Learning - Zilliz blog An exploration of the evolution and fundamental principles underlying key Natural Language Processing (NLP)…zilliz.com

20 Useful Open Datasets for Natural Language Processing - Zilliz blog Learn the key criteria for selecting the ideal dataset for your NLP projects and explore 20 popular open datasets.zilliz.com

Top 10 Popular NLP Tools and Platforms - Zilliz blog An overview of the top ten NLP tools and platforms, highlighting their key features, applications, and advantages to…zilliz.com

NLP Basics: Tokens, N-Grams, and Bag-of-Words Models - Zilliz blog This post covers Natural Language Processing fundamentals that are essential to understanding all of today's language…zilliz.com

Top 10 Real-World NLP Applications - Zilliz blog NLP makes our lives much easier. Learn about the top 10 most popular NLP applications and how they have an impact on…zilliz.com

Top 20 NLP Models to Empower Your ML Application - Zilliz blog Learn about the 10 most popular LLMs taking 2023 by storm and another 10 basic NLP models.zilliz.com

NLP Essentials: Understanding Transformers in AI - Zilliz blog This article will introduce you to the field of Natural Language Processing (NLP) and the breakthrough architecture…zilliz.com

Neural Networks and Embeddings for Language Models - Zilliz blog Exploring neural network language models, specifically recurrent neural networks, and taking a sneak peek at how…zilliz.com

Large Language Models and Search | Zilliz Learn - Zilliz blog Explore the integration of Large Language Models (LLMs) and search technologies, featuring real-world applications and…zilliz.com

Large Language Models (LLMs) What Is a Large Language Model? A Developer's Referencezilliz.com

Top LLMs of 2024: Only the Worthy - Zilliz blog This blog introduces the six most influential large language models in 2024.zilliz.com

What is Prompt as Code (Prompt Engineering) Explores what prompt engineering is, how it works in NLP, and best practices for effective prompt engineering.zilliz.com

LangChain & Milvus: Enhancing ChatGPT's Intelligence and Efficiency - Zilliz blog Learn how LangChain and Milvus can be used to improve the performance and memory of LLMs.zilliz.com

A Guide to Using OpenAI Text Embedding Models for NLP Tasks - Zilliz blog A comprehensive guide to using OpenAI text embedding models for embedding creation and semantic search.zilliz.com

NLP and Vector Databases: Creating a Synergy for Advanced Processing - Zilliz blog Finding photos, recommending products, or enabling facial recognition, the power of vector databases lies in their…zilliz.com


Cool Stuff

Retrieval-Augmented Generation (RAG) with Milvus and BentoML | Milvus Documentation This guide demonstrates how to use an open-source embedding model and large-language model on BentoCloud with Milvus…milvus.io

Integrate Milvus with DSPy | Milvus Documentation This guide demonstrates how to use MilvusRM, one of DSPy's retriever modules, to optimize RAG programs. | v2.4.xmilvus.io

Airbyte: Open-Source Data Movement Infrastructure | Milvus Documentation Airbyte is an open-source data movement infrastructure for building extract and load (EL) data pipelines. It is…milvus.io

NVIDIA NIM | radtts-hifigan-tts Experience the leading models to build enterprise generative AI apps now.build.nvidia.com


Articles

What’s in the Air Tonight, Mr. Milvus. (Air Quality + Vector Database + RAG)?

What’s in the Air Tonight Mr. Milvus? Open Source, Air Quality, REST, JSON, Python, Milvus, Vector Databasemedium.com

AI and Vectors?—?Meetup Report?


AI Camp?—?15 August 2024 Report?


Milvus?—?The Unstructured Olympics of the Mind? AI? Data??


From Edge to the Cloud and Back Again?


Milvus, Edge AI, Vector Database, MQTT, Kafka, Zilliz Cluster, Pythonmedium.com

Milvus on EKS?


A step-by-step guide on deploying the Milvus vector database on AWS using managed services such as Amazon EKS, S3, MSK…milvus.io

Milvus with NVIDIA for Retail Rag?

Retail Shopping Advisor - Technical Brief Retail Shopping Advisor - Technical Briefresources.nvidia.com

Work Flows Generative AI?

Technical Brief Build Generative AI chatbots that accurately answer domain-specific queries using latest informationdocs.nvidia.com

Landscape of Gen AI Ecosystem Beyond LLMs and Vector Databases?

The Landscape of GenAI Ecosystem: Beyond LLMs and Vector Databases - Zilliz blog Initially, Large Language Models (LLMs) and vector databases captured the most attention. However, the GenAI ecosystem…zilliz.com

What is Information Retrieval??

What is Information Retrieval? A Comprehensive Guide. - Zilliz blog Information retrieval (IR) is the process of efficiently retrieving relevant information from large collections of…zilliz.com

NVIDIA Nemo Curator?

Curating Custom Datasets for LLM Parameter-Efficient Fine-Tuning with NVIDIA NeMo Curator | NVIDIA… In a recent post, we discussed how to use NVIDIA NeMo Curator to curate custom datasets for pretraining or continuous…developer.nvidia.com

Evaluating LLM Conversations?

LLM-Eval: A Simplified Approach to Evaluating LLM Conversations - Zilliz blog LLM-Eval is an approach to simplifying and automating the evaluation of LLM conversation quality.zilliz.com

Pokeman Embeddings?

The Super Effectiveness of Pokémon Embeddings Using Only Raw JSON and Images Embeddings encourage engineers to go full YOLO because it's actually rewarding to do so!minimaxir.com

LLM Evaluation?

Milvus on LinkedIn: #llm #evaluation #demo #learn #ai LLM-Eval: used to evaluate the response quality of an LLM. This article covers: ?? What is LLM-Eval? ?? LLM-Eval…www.dhirubhai.net

Agent Q?

Agent Q: Breakthrough AI Research in Self-Healing Web Agents | MultiOn - MultiOn AI MultiOn's Agent Q: AI research breakthrough in web navigation. 340% performance boost with self-healing capabilities.www.multion.ai

The Landscape of OS Licensing in AI https://medium.com/@zilliz_learn/the-landscape-of-open-source-licensing-in-ai-a-primer-on-llms-and-vector-databases-5effbccbccd5

Unlocking the Secrets of GPT 4.0 https://medium.com/@zilliz_learn/unlocking-the-secrets-of-gpt-4-0-and-large-language-models-0020f61b62c2

AI Databases Ensuring the Quality of LLMs in Chatbots https://www.opensourceforu.com/2024/08/ai-databases-ensuring-the-quality-of-llms-in-chatbots/

Bringing Confidentially to Vector Search https://developer.nvidia.com/blog/bringing-confidentiality-to-vector-search-with-cyborg-and-rapids-cuvs/

Google ImageGen3 https://arxiv.org/pdf/2408.07009

AI Bringing Voice to Peopl https://indianexpress.com/article/world/als-stole-his-voice-ai-retrieved-it-9516953/

InfluxDB plus Milvus https://www.influxdata.com/blog/time-series-influxdb-vector-database/

End to End Rag with Airbyte https://airbyte.com/tutorials/end-to-end-rag-with-airbyte-cloud-microsoft-sharepoint-and-milvus-zilliz

How to Prune https://developer.nvidia.com/blog/how-to-prune-and-distill-llama-3-1-8b-to-an-nvidia-llama-3-1-minitron-4b-model/

Streamling the Deployment of Enterprise GenAI https://medium.com/@zilliz_learn/streamlining-the-deployment-of-enterprise-genai-apps-with-efficient-management-of-unstructured-data-2d3b1a2f2d85

Learn GenAI?

https://zilliz.com/learn/generative-ai

LangChain?—?Milvus https://api.python.langchain.com/en/latest/vectorstores/langchain_community.vectorstores.milvus.Milvus.html

Hybrid Search in Rag Apps https://ai.plainenglish.io/the-role-of-hybrid-search-in-rag-applications-29bf46b95152

Agent Based Rag https://valentinaalto.medium.com/introducing-agent-based-rag-9b7141ae1cd7

Rag2SQL https://medium.com/@marvin_thompson/text2sql-is-out-rag2sql-is-in-5fd160a004f0

Understanding Transformers https://medium.com/@zilliz_learn/nlp-essentials-understanding-transformers-in-ai-29d9d973a1fc

AI Agents?

https://towardsdatascience.com/ai-agents-from-concepts-to-practical-implementation-in-python-fb26789b1560

Pandas, AI, OLLAMA?

https://medium.com/free-or-open-source-software/pandasai-ollama-text2sql-llama3-ask-questions-from-excel-create-visualization-in-natural-language-fbfb14ac9360

Flink, Kafka, GenAI, Real-Time?

https://medium.com/@zilliz_learn/build-real-time-genai-applications-with-zilliz-cloud-and-confluent-cloud-for-apache-flink-c1922b3a1603

How to import new model from HuggingFace to Ollama https://medium.com/@raphael.mansuy/how-to-import-a-new-model-from-huggingface-for-ollama-9dfe9ffe1a0b

LangGraph Guide https://bhavikjikadara.medium.com/langgraph-a-comprehensive-guide-for-beginners-ef17d3dd5383

High Speed Inference with LLAMA CPP and Vicuna https://pub.towardsai.net/high-speed-inference-with-llama-cpp-and-vicuna-on-cpu-136d28e7887b

Videos

AI Camp Videos - Pose Estimation

Fun Unstructured Friday

Quick Edge Demo

NYC Replacement Talk

Live Fun Friday with Unstructed Data Preview

High Speed Inference with LLAMA CPP and Vicuna

https://pub.towardsai.net/high-speed-inference-with-llama-cpp-and-vicuna-on-cpu-136d28e7887b

Unstructured Data Processing at the Edge Webinar

Unstructured Meetup SF

Building an Agentic RAG locally with Milvus, Ollama and Llama Agents



Slides





Unstructured Data Processing from Cloud to Edge Webinar - Download as a PDF or view online for freewww.slideshare.net



Implement Agentic RAG Using Claude 3.5 Sonnet, LlamaIndex, and Milvus - Download as a PDF or view online for freewww.slideshare.net


Events

August 20, 2024: DotNet Conf Virtual AI?


Join the?.NET Conf Focus on AI free virtual event August 20 2024 to learn about the newest developments across the?.NET…focus.dotnetconf.net

September 18, 2024: Unstructured Data Meetup NYC?


This is an in-person event! Registration is required to get in. Topic: Connecting your unstructured data with…lu.ma

Unstructured Data Meetup New York Book Tickets for Unstructured Data Meetup New York Hosted By Unstructured Data Meetup. Event starts on Tuesday, 24…allevents.in

October 23, 2024: Unstructured Data Meetup NYC?

Unstructured Data Meetup New York · Luma This is an in-person event! Registration is required to get in. Topic: Connecting your unstructured data with…lu.ma

October 27–29, Raleigh, NC?—?All Things Open https://2024.allthingsopen.org/speakers/timothy-spann?

Advanced Retrieval Augmented Generation (RAG) Techniques - All Things Open 2024 In 2023, we saw many simple retrieval augmented generation (RAG) examples being built. However, most of these examples…2024.allthingsopen.org

October 31?—?Live stream from my Halloween decorations with three 12 foot skeletons

November 5–7, 10–12, 2024: CloudX. Online/Santa Clara.? https://www.developerweek.com/cloudx/

November 13–15, 2024: Build Stuff. Online. Adding Generative AI to Real-Time Streaming Pipelines

Software Development Conference | Developer Meetups | Online Courses Want to grow your career in the field of software development? Save the date: November 13-15, 2024. The upcoming Build…www.buildstuff.events

November 19, 2024: XtremePython. Online.?

XtremePython 2024 Online Conference We are excited to welcome the entire Python community to join us in the coming XtremePython online conference. We are…xtremepython.dev

November 21, 2024: Big Data Conference 2024 EU

Big Data Conference Europe 2024 Edit descriptionevents.pinetool.ai

November 21, 2024: Unstructured Data Meetup NYC https://lu.ma/cqxuproe

December 4, 2024: Grace Hopper Celebration?—?Open Source?—?Milvus https://ghc.anitab.org/open-source/

December 10, 2024: Unstructured Data Meetup NYC https://lu.ma/u2ijucyv

Code


Models


Tools

? 2020–2024 Tim Spann https://www.youtube.com/@FLaNK-Stack

??? Videos
https://www.youtube.com/@MilvusVectorDatabase/videos        
X Twitter -   / milvusio  
https://x.com/milvusio        
?? Linkedin:  / zilliz  
https://www.dhirubhai.net/company/zilliz/        
?? GitHub
https://github.com/milvus-io/milvus        
?? Invitation to join discord:   / discord  https://discord.com/invite/FjCMmaJng6        
https://discord.gg/9jdMRPJb?event=1273364262710022209        


Hrijul Dey

Python/GenAI Dev | 3D Artist | Startup Founder | Social Media Analyst

1 周

LinkedIn is your golden opportunity for professional growth, but have you ever thought of unlocking even more value from it? With DSPy and Pandas, you can dive deep into the vast ocean of LinkedIn posts to fish out insights that are not only relevant but game-changing. Imagine understanding market trends, gauging audience interest, or identannoying competitors' strategies with just a few clicks! Let DSPy and Pandas do the heavy lifting as you sit back, analyze and plan your next big move. https://www.artificialintelligenceupdate.com/analyze-linkedin-posts-with-dspy-and-pandas/riju/ #learnmore #DataScience #LinkedInAnalytics #DSPy #Pandas #ProfessionalGrowth

回复

要查看或添加评论,请登录

社区洞察

其他会员也浏览了