Optimizing a model

Optimizing a model


What do we mean by "Optimizing a model"? When we talk about "model selection", we mean searching for the optimal model for a specific training dataset. If we have features X and a target Y, we would like to learn what is the optimal transformation F from the data:

Y = F(X)

The term "optimal "implies we have a model performance metric, and the "optimal" model is the one that maximizes that metric. There are different axes we can consider to optimize our model:

- The model parameter space: this is the "space" we optimize when we "train" a model through statistical learning. The parameters are learned using an optimization principle such as the Maximum likelihood estimation principle.

- The model paradigm space: Many supervised learning algorithms could be used to solve the same problem. Algorithms like Naive Bayes, XGBoost, or Neural Network could perform very differently depending on the specific dataset.

- The hyperparameter space: those are the model parameters we cannot optimize using statistical learning, but they are choices we need to make to set up our training run.

- The model architecture space: this is more relevant for Neural Networks. The model architecture can be characterized by a set of hyperparameters, but it tends to be a more complex search than typical hyperparameters. The search space dimension can be as big as 10^40.

- The feature space: We also need to select the right feature to feed to our model. Different models will react differently depending on the features we use. Too many features and we may overfit. Too few features and we may underfit.

- The feature transformation space: We could consider many transformations to improve our model's performance, such as feature encoding or Box-Cox transformation.


要查看或添加评论,请登录

Andre Cassim的更多文章

  • The difference between Hashing, Encryption, and Encoding

    The difference between Hashing, Encryption, and Encoding

    ??. ?????????????? : ------------- ?????????????? ? Generates a unique fingerprint (hash) of the data, often used for…

  • What is a Hook in WordPress ?

    What is a Hook in WordPress ?

    A hook in WordPress is a mechanism that allows developers to extend or modify the functionality of WordPress without…

  • Artificial Intelligence

    Artificial Intelligence

    f you want to know where the money is in Machine Learning, look no further than Recommender Systems! Recommender…

  • Artificial Intelligence & Deep Learning

    Artificial Intelligence & Deep Learning

    Don't waste too much time training your models! It is essential to limit the time for the model selection process. It…

  • Machine Learning Algo's

    Machine Learning Algo's

    ML

  • Creating Stunning Data Visualizations with D3.js

    Creating Stunning Data Visualizations with D3.js

    Data visualization is an essential tool in the modern world of data-driven decision-making. As a content writer and…

  • Setting Up WordPress Python Integration: 2 Easy Steps

    Setting Up WordPress Python Integration: 2 Easy Steps

    WordPress is one of the widely used platforms for publishing and managing content. It powers around 14-15% of the…

    2 条评论
  • Large Learning Models: User Habits

    Large Learning Models: User Habits

    To create a LLM to learn about user habits, you can use a deep learning framework such as TensorFlow or PyTorch. Here's…

  • How to build a machine learning model in 7 steps

    How to build a machine learning model in 7 steps

    Step 1. Understand the business problem (and define success) The first phase of any machine learning project is…

  • What id a ML Model Breaks ?

    What id a ML Model Breaks ?

    What happens if your Machine Learning model breaks? Imagine if the Netflix movie ranking model, the Facebook feed…

社区洞察

其他会员也浏览了