Bayesian Optimization
Shaghik Amirian
AI Researcher ∣ ML Engineer ∣ Research Fellow ∣ Data Science & ML ∣ Graph Optimization ∣ Blockchain & Smart Contract Development ∣ Procurement Digitalization Expert
In machine learning (ML), Bayesian optimization is one of the more interesting and, at the same time, more difficult tasks. It is used for hyperparameter optimization. Hyperparameters are the parameters of an ML model whose values are chosen before training, e.g., the learning rate used to train a neural network or the value of k in k-nearest neighbors.
The goal of Bayesian optimization is to find the global minimum of a function f that has a very large number of parameters and is computationally expensive to evaluate. It is especially useful for ML algorithms whose objective has unknown derivatives, so gradient-based methods cannot be applied directly.
Bayesian optimization is restricted in several ways; the most important restrictions include the following:
The Bayesian optimization framework addresses the problem of finding a global minimum in as few steps as possible. To understand it better, we first need to know what makes it Bayesian. In Bayesian statistics and modeling, the essence is to combine new information with prior beliefs and obtain updated posterior beliefs. Probabilities are assigned to events based on experience or a best guess before experimentation and data collection, and Bayes' theorem is then applied to obtain the updated probabilities.
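The prior-to-posterior update described above can be illustrated with a minimal sketch. It assumes a Beta prior over an unknown success probability and binomial observations, a standard conjugate pair chosen here only for illustration; the prior parameters and the observed counts are hypothetical.

```python
def update_beta(prior_a, prior_b, successes, failures):
    """Conjugate Bayes update: Beta prior + binomial data -> Beta posterior."""
    return prior_a + successes, prior_b + failures


def beta_mean(a, b):
    """Mean of a Beta(a, b) distribution."""
    return a / (a + b)


# Hypothetical prior belief: success probability is around 0.5.
prior = (2.0, 2.0)

# Hypothetical new data: 7 successes and 3 failures.
posterior = update_beta(*prior, successes=7, failures=3)

print(beta_mean(*prior))      # 0.5
print(beta_mean(*posterior))  # about 0.643, pulled toward the observed rate
```

The posterior mean lies between the prior belief (0.5) and the observed success rate (0.7), which is the sense in which new information updates previous beliefs.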
In short, at each step we use the acquisition function to select the next sample, evaluate that sample with the objective function, and then update the data and, in turn, the surrogate function. Bayesian optimization thus builds a probability model of the objective function and uses it to choose the most promising hyperparameters to evaluate with the true objective function.
When should we use Bayesian optimization? It is recommended when the objective function is expensive to evaluate; most commonly, it is used in hyperparameter tuning. Libraries such as HyperOpt exist for this purpose.
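The loop described above can be sketched in plain Python. This is a minimal illustration under stated assumptions, not a reference implementation: the surrogate is assumed to be a zero-mean Gaussian process with a squared-exponential kernel, the acquisition function is assumed to be expected improvement, candidates come from a fixed grid on [0, 1], and `objective` is a hypothetical cheap stand-in for an expensive function. All names and constants (`length`, `jitter`, the grid size) are illustrative choices.

```python
import math


def rbf(a, b, length=0.3):
    """Squared-exponential kernel between two scalars (assumed kernel)."""
    return math.exp(-0.5 * ((a - b) / length) ** 2)


def cholesky(A):
    """Lower-triangular L with L L^T = A, for a positive-definite A."""
    n = len(A)
    L = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = A[i][j] - sum(L[i][k] * L[j][k] for k in range(j))
            L[i][j] = math.sqrt(s) if i == j else s / L[j][j]
    return L


def solve_lower(L, b):
    """Solve L y = b by forward substitution."""
    y = []
    for i in range(len(b)):
        y.append((b[i] - sum(L[i][k] * y[k] for k in range(i))) / L[i][i])
    return y


def solve_upper_t(L, y):
    """Solve L^T x = y by back substitution."""
    n = len(y)
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (y[i] - sum(L[k][i] * x[k] for k in range(i + 1, n))) / L[i][i]
    return x


def fit_gp(xs, ys, jitter=1e-6):
    """Update the surrogate: factor the kernel matrix of the observed data."""
    n = len(xs)
    K = [[rbf(xs[i], xs[j]) + (jitter if i == j else 0.0) for j in range(n)]
         for i in range(n)]
    L = cholesky(K)
    alpha = solve_upper_t(L, solve_lower(L, ys))
    return L, alpha


def predict(xs, L, alpha, x_new):
    """Posterior mean and standard deviation of the surrogate at x_new."""
    k_star = [rbf(x, x_new) for x in xs]
    mean = sum(k * a for k, a in zip(k_star, alpha))
    v = solve_lower(L, k_star)
    var = max(1.0 - sum(vi * vi for vi in v), 1e-12)  # rbf(x, x) == 1
    return mean, math.sqrt(var)


def expected_improvement(mean, std, best):
    """Expected improvement over `best` when minimizing."""
    z = (best - mean) / std
    cdf = 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))
    pdf = math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)
    return (best - mean) * cdf + std * pdf


def objective(x):
    """Hypothetical stand-in for an expensive objective on [0, 1]."""
    return (x - 0.7) ** 2 + 0.1 * math.sin(15 * x)


def bayes_opt(f, n_iter=15):
    """Run the loop: fit surrogate, maximize acquisition, evaluate, update."""
    xs = [0.0, 0.5, 1.0]                      # initial design
    ys = [f(x) for x in xs]
    grid = [i / 200 for i in range(201)]      # candidate points
    for _ in range(n_iter):
        L, alpha = fit_gp(xs, ys)
        best = min(ys)
        candidates = [x for x in grid if x not in xs]
        x_next = max(
            candidates,
            key=lambda x: expected_improvement(*predict(xs, L, alpha, x), best),
        )
        xs.append(x_next)                     # update the data ...
        ys.append(f(x_next))                  # ... with one true evaluation
    return xs, ys


history_x, history_y = bayes_opt(objective)
best_y = min(history_y)
best_x = history_x[history_y.index(best_y)]
print(f"best x = {best_x:.3f}, f(x) = {best_y:.4f}")
```

Each iteration spends exactly one evaluation of the true objective; all other work happens on the cheap surrogate, which is why the approach suits objectives that are expensive to evaluate.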
Resource: Bayesian Optimization Algorithm