30 Features that Dramatically Improve LLM Performance - Part 1

Read the full article here.

Many of these are ground-breaking innovations that make LLMs much faster and far less prone to hallucinations. They reduce cost, latency, and compute resources (GPU, training) by several orders of magnitude. Some of them improve security, making your LLM more attractive to corporate clients. I introduced a few of these features in my previous article, "New Trends in LLM Architecture". Now I offer a comprehensive list, based on the most recent developments.

1. From one trillion parameters to fewer than 5

By parameter, I mean the weight between two connected neurons in a deep neural network. How can you possibly replace one trillion parameters with fewer than 5, and still get better results, faster? The idea is to use parametric weights: the many weights are updated with a simple formula relying on a handful of explainable parameters, as opposed to neural network activation functions repeatedly updating billions of black-box parameters (the weights themselves) over time. I illustrate this in Figure 1. The example comes from my recent book, available here.


Figure 1: LLM for classification, with only 2 parameters
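To make this concrete, here is a minimal sketch of the parametric-weight idea. The PMI-style formula and the two parameters alpha and beta are illustrative assumptions, not the exact formula from the book; the point is that every token-pair weight is generated on the fly from summary counts, so only two numbers ever get tuned.

import numpy as np

def parametric_weight(pair_count, x_count, y_count, n, alpha=1.0, beta=0.5):
    # Weight linking two tokens, computed on demand from co-occurrence counts
    # via a PMI-style formula. Only alpha and beta are tuned, instead of
    # storing and training billions of individual weights.
    pxy = max(pair_count, 1e-9) / n       # joint frequency of the token pair
    px, py = x_count / n, y_count / n     # marginal frequencies
    pmi = np.log(pxy / (px * py))         # pointwise mutual information
    return max(pmi, 0.0) ** alpha * pxy ** beta

# Tuning reduces to a tiny grid search over (alpha, beta) on a validation set,
# rather than gradient descent over billions of weights.
w = parametric_weight(pair_count=40, x_count=500, y_count=300, n=100_000)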

2. Adaptive loss function

The goal of many deep neural networks (DNNs) is to minimize a loss function, usually via stochastic gradient descent. This is also true for LLMs that use transformers. The loss function is a proxy for the evaluation metric that measures the quality of your output. In supervised-learning LLMs (for instance, those performing supervised classification), you may use the evaluation metric itself as the loss function, to get better results. One of the best evaluation metrics is the full multivariate Kolmogorov-Smirnov distance (KS), see here, with a Python library here.
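To illustrate what using the evaluation metric directly would involve, here is a rough sketch (not the library linked above) that approximates a multivariate KS distance by comparing the joint empirical CDFs of a real and a synthetic sample at random probe points.

import numpy as np

def multivariate_ks(real, synth, n_probes=1000, seed=0):
    # Approximate multivariate KS: largest gap between the joint empirical
    # CDFs of the two samples, evaluated at probe points drawn from the
    # pooled data (an approximation of the supremum, not an exhaustive search).
    rng = np.random.default_rng(seed)
    pooled = np.vstack([real, synth])
    idx = rng.choice(len(pooled), size=min(n_probes, len(pooled)), replace=False)
    probes = pooled[idx]
    ecdf = lambda sample: np.array([(sample <= p).all(axis=1).mean() for p in probes])
    return float(np.abs(ecdf(real) - ecdf(synth)).max())

Each call scans both samples in full, which is exactly why it is too expensive to recompute from scratch at every weight update.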

But it is extremely hard to design an algorithm that applies billions of atomic changes to KS extremely fast, a requirement in all DNNs since the loss is re-evaluated each time you update a weight. A workaround is to use an adaptive loss function that slowly converges to the KS distance over many epochs. I did not succeed at that, but I was able to build one that converges to the multivariate Hellinger distance, the discrete alternative that is asymptotically equivalent to the continuous KS.
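The sketch below is only an illustration of the kind of O(1) atomic update described above, not the actual adaptive loss from the article. It maintains the Hellinger distance between a fixed target histogram p and a moving histogram q via the identity H^2(p, q) = 1 - sum_i sqrt(p_i * q_i), so moving one observation between bins touches just two terms of the sum.

import numpy as np

class AdaptiveHellinger:
    # Hellinger distance between a fixed target histogram p and a moving
    # histogram q, maintained incrementally: each atomic update costs O(1)
    # instead of a full recomputation over all bins.

    def __init__(self, target_counts, current_counts):
        self.p = np.asarray(target_counts, dtype=float)
        self.p /= self.p.sum()
        self.q_counts = np.asarray(current_counts, dtype=float)
        self.n = self.q_counts.sum()
        self.bc = np.sum(np.sqrt(self.p * self.q_counts / self.n))  # Bhattacharyya coefficient

    def move(self, src_bin, dst_bin):
        # Atomic update: one synthetic observation moves from src_bin to dst_bin.
        for b in (src_bin, dst_bin):
            self.bc -= np.sqrt(self.p[b] * self.q_counts[b] / self.n)
        self.q_counts[src_bin] -= 1
        self.q_counts[dst_bin] += 1
        for b in (src_bin, dst_bin):
            self.bc += np.sqrt(self.p[b] * self.q_counts[b] / self.n)
        return self.distance()

    def distance(self):
        return float(np.sqrt(max(0.0, 1.0 - self.bc)))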

Read more

This is the first post in a series of three; each one lists 10 features. To read the full article and learn about agentic LLMs, LLM routers, contextual tables, fast search, and more, follow this link.

Akshay Sharma

Recruitment Manager | Connecting Top U.S. IT and Non-IT Talent with Leading U.S. Clients | Your Partner in Excellence | IT Consultants for Public and Federal Clients

5 months ago

Hi, this is Kritika Sharma. I have a position for the title Finsys Analytics Consultant at Instacart (remote; PST- and CST-based candidates only) with a good pay rate. Please let me know if you are interested, or you can send me your updated resume at [email protected]


Instead of the first four lines, you can do local_hash = hash.get(key, {}), which is both shorter and faster.

This is great

Akshay Radha Manohar

AI Prompt Engineer @ Soul AI | Fine Tuning | Advanced Excel | Power BI | SQL | Python | Statistics | Python Libraries |

7 months ago

Your article on new LLM architectures is fascinating, especially the move from trillions of parameters to just a few and the adaptive loss function. Great work on these groundbreaking ideas! Your insights could significantly advance LLM technology and its practical uses. Thanks for sharing your expertise!

Danial Hosseinpour

Data Analyst | MSc by Research Student at the University of Huddersfield

7 months ago

Useful tips
