Parameter tuning for a large number of ML models
We have to run cross-validation for 1K models, and each model needs its hyper-parameters tuned over 100+ combinations. How do we do this?
So we need thousands of models trained together. Each model has its own dataset, but the hyper-parameter combinations to tune are shared across them. Our approach is an infinite loop: each iteration randomly selects a model and a combination of parameters. If that (model, parameter) pair has not been tested yet, we run the experiment and write the result to the database.

The problem is that different models need different numbers of experiments and consume different amounts of resources. Some models get very good results on the first experiment; others are difficult and need more than ten. To balance performance across models against the number of experiments, every time we randomly select a model we check whether its best result so far is already better than the best results of, say, 30%, 50%, or 70% of the other models in the database. If it is, the model is considered to have passed and we skip it, so the budget goes to models that still need tuning.
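A minimal sketch of this loop, assuming a single-process setting. The in-memory dict `results_db`, the `run_experiment` callback, and the `percentile` threshold are illustrative names standing in for the real database, the cross-validation job, and the 30%/50%/70% cutoff described above; none of them come from the original system.

```python
import random

# Hypothetical in-memory stand-in for the results database.
# Keys are (model_id, params) pairs (params must be hashable, e.g. a tuple);
# values are the scores from one cross-validation run.
results_db = {}

def best_score(model_id):
    """Best score recorded so far for one model (None if it has not been tried)."""
    scores = [s for (m, _), s in results_db.items() if m == model_id]
    return max(scores) if scores else None

def already_good_enough(model_id, model_ids, percentile=0.5):
    """Check whether this model's best result already beats the chosen
    fraction (e.g. 30%, 50%, or 70%) of the other models' best results."""
    mine = best_score(model_id)
    if mine is None:
        return False
    others = [best_score(m) for m in model_ids if m != model_id]
    others = [s for s in others if s is not None]
    if not others:
        return False
    beaten = sum(1 for s in others if mine > s)
    return beaten / len(others) >= percentile

def tuning_loop(model_ids, param_grid, run_experiment, percentile=0.5):
    """Randomly pair models with untested hyper-parameter combinations,
    record each result, and skip models that already rank well enough."""
    while True:  # the "infinite loop" above; add a stopping condition in practice
        model_id = random.choice(model_ids)
        if already_good_enough(model_id, model_ids, percentile):
            continue  # this model has "passed"; spend the budget elsewhere
        params = random.choice(param_grid)
        if (model_id, params) in results_db:
            continue  # this combination was already tested
        score = run_experiment(model_id, params)  # assumed to run the cross-validation
        results_db[(model_id, params)] = score
```

In a real deployment the loop body would be executed by many workers in parallel, with the "has this pair been tested" check and the result write going through the shared database so that workers do not duplicate experiments.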