GenAI Core Topics Explained in Simple Pictures

7 concepts, each explained with hundreds of pictures embedded into one-minute data videos.

The featured picture at the top illustrates xLLM, a new trend in modern LLM and RAG architecture: far less expensive (no GPU needed), specialized, self-tuned, low-latency, hallucination-free, and implemented locally with no data leaks. It is currently under development for a Fortune 100 client. See details here.

GPU Classification: The Father of Neural Networks

These days, GPUs are used to train neural networks that have nothing to do with images or videos, yet they were originally built to accelerate image processing and video games. Returning to that original use, the classification method in this data animation works in reverse: it turns the training set (tabular data) into an image bitmap, performs fuzzy classification as a sequence of bitmap transforms on the GPU, then turns the last frame back into tabular data. And voilà! You have performed classification on a GPU. Ironically, without neural networks, just using a high-pass image filter.

Well, you may argue that this is a neural network in disguise, and indeed it was one of the first use cases. Each frame is just a deep layer. If the filtering window is very small, as in the video, the equivalent network is very sparse and very deep, with hundreds of hidden layers. If the filtering window is very large, one or two layers do the job, and the boundaries come out smoother. I won't share my opinion on whether or not this is a neural network. Clearly, the computations and architecture are nearly identical.

The first frame is the original training set rendered as a bitmap. Black zones are regions not yet classified. After a while, the whole feature space is classified, with relatively stable group boundaries: in short, we observe stochastic convergence.
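The pipeline above can be sketched on the CPU with NumPy. This is my own minimal illustration of the idea, not the author's implementation: the grid size, the 3x3 box filter standing in for the filtering window, the iteration count, and the periodic boundary handling are all assumptions made for brevity.

```python
import numpy as np

rng = np.random.default_rng(0)

# Tiny synthetic training set: 2 features in [0, 1), 2 classes.
X = rng.random((40, 2))
y = (X[:, 0] + X[:, 1] > 1).astype(int)

# Step 1: rasterize the tabular data into a bitmap,
# one intensity channel per class (hypothetical 64x64 resolution).
N, K = 64, 2
img = np.zeros((K, N, N))
idx = np.minimum((X * N).astype(int), N - 1)
for (i, j), label in zip(idx, y):
    img[label, i, j] = 1.0

def box_filter(a):
    """3x3 box filter via shifted sums (periodic boundary for simplicity)."""
    out = np.zeros_like(a)
    for di in (-1, 0, 1):
        for dj in (-1, 0, 1):
            out += np.roll(np.roll(a, di, axis=0), dj, axis=1)
    return out / 9.0

# Step 2: iterate the filter "frame by frame"; each pass diffuses
# class evidence from training pixels into the black, unclassified zones.
for _ in range(50):
    img = np.stack([box_filter(c) for c in img])

# Step 3: turn the last frame back into a classification:
# each pixel gets the class whose channel has the largest intensity.
labels = img.argmax(axis=0)
```

A new observation is then classified by a single pixel lookup in `labels`; running the filter on the GPU, as the article describes, only changes where the per-frame transform executes.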

Sampling Outside the Observation Range

Many GenAI techniques produce poor results when the training set is small. The reason is that none of the existing methods can sample artificial yet realistic values outside the training set's range: below the minimum, or above the maximum. Not even for a single feature, let alone in higher dimensions with correlated features. All of them rely on quantile generation at some point, and none of the quantile functions in Python offer this possibility. The classic workaround is to use bigger and bigger training sets, or trillions of weights, to paper over the sampling issue. But you can do it far faster with much less data. The video below starts with the empirical distribution observed on a small training set, then extends it as if the training set were far bigger. Pure magic, like reconstructing invisible observations! And it generalizes easily to higher dimensions.
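One simple way to sample beyond the observed range, sketched below, is to extrapolate the empirical quantile function linearly in each tail. This is my own toy illustration of the general idea, not necessarily the method shown in the video; the plotting positions and the linear tail model are assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)
train = rng.normal(size=25)  # small training set

def sample_extended(data, n_samples, rng):
    """Sample from the empirical quantile function, linearly
    extrapolated beyond the observed minimum and maximum."""
    data = np.sort(np.asarray(data))
    n = len(data)
    # plotting positions: data[i] estimates the (i+1)/(n+1) quantile
    p = np.arange(1, n + 1) / (n + 1)
    u = rng.random(n_samples)
    out = np.interp(u, p, data)
    # left tail: extend with the slope of the two lowest quantiles
    lo = u < p[0]
    slope_l = (data[1] - data[0]) / (p[1] - p[0])
    out[lo] = data[0] + slope_l * (u[lo] - p[0])
    # right tail: extend with the slope of the two highest quantiles
    hi = u > p[-1]
    slope_r = (data[-1] - data[-2]) / (p[-1] - p[-2])
    out[hi] = data[-1] + slope_r * (u[hi] - p[-1])
    return out

synthetic = sample_extended(train, 10_000, rng)
```

Roughly 2/(n+1) of the synthetic values land outside [min, max] of the training set, which is exactly what off-the-shelf quantile-based generators cannot produce.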

Approximate Nearest Neighbor Search

Fast approximate vector search is a core component of most LLM/GPT applications: it finds prompt-derived embeddings similar to existing ones stored in backend embedding tables built from crawled data. My xLLM system uses key-value rather than vector databases, and variable-length embeddings (VLE) rather than fixed-size ones, but nearest neighbor search applies to both architectures, and to many other contexts.
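To make the idea concrete, here is a minimal sketch of approximate nearest neighbor search using a key-value index: random-hyperplane hashing buckets similar vectors (by cosine similarity) under the same key, so a query scans only its own bucket. This is a generic illustration, not xLLM's actual indexing scheme; the dimensions, plane count, and fallback policy are my own choices.

```python
import numpy as np

rng = np.random.default_rng(2)
dim, n_vecs, n_planes = 64, 5000, 12

# Backend table of stored embeddings (stand-ins for crawled data).
db = rng.normal(size=(n_vecs, dim))

# Random hyperplanes: each vector hashes to a bit signature, and
# vectors with high cosine similarity tend to share signatures.
planes = rng.normal(size=(n_planes, dim))

def signature(v):
    return tuple((planes @ v > 0).astype(int))

# Key-value index: signature -> list of vector ids.
buckets = {}
for i, v in enumerate(db):
    buckets.setdefault(signature(v), []).append(i)

def ann_search(query, k=5):
    """Approximate k-NN by cosine similarity: scan only the query's
    bucket, falling back to brute force if the bucket is empty."""
    cand = np.fromiter(buckets.get(signature(query), range(n_vecs)),
                       dtype=int)
    sims = db[cand] @ query / (
        np.linalg.norm(db[cand], axis=1) * np.linalg.norm(query))
    return cand[np.argsort(-sims)[:k]]
```

For example, `ann_search(db[0])` returns vector 0 as its top hit while touching only a small fraction of the table; real systems layer many such tables (or graphs, as in HNSW) to trade recall against speed.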

To view all the topics and videos, and to access the detailed articles (free, no sign-up required), the Python code, use cases, and datasets, follow this link.


