Math Heavy Topics!
Chitwan Manchanda
ML @ Turing.com (Core Team) | DSML TA at Scaler Academy | Ex-EditorialistYX | Ex-Delhivery | Ex-Goals 101 | 2X Kaggle Expert
1. Support-vector machine
Given a set of training examples, each marked as belonging to one of two categories, an SVM training algorithm builds a model that assigns new examples to one category or the other, making it a non-probabilistic binary linear classifier (although methods such as Platt scaling exist to use SVM in a probabilistic classification setting). SVM maps training examples to points in space so as to maximize the width of the gap between the two categories. New examples are then mapped into that same space and predicted to belong to a category based on which side of the gap they fall.
In addition to performing linear classification, SVMs can efficiently perform a non-linear classification using what is called the kernel trick, implicitly mapping their inputs into high-dimensional feature spaces.
Link to the entire article: https://en.wikipedia.org/wiki/Support-vector_machine
2. Gradient boosting
Gradient boosting is a machine learning technique used in regression and classification tasks, among others. It gives a prediction model in the form of an ensemble of weak prediction models, which are typically decision trees. When a decision tree is a weak learner, the resulting algorithm is called gradient-boosted trees; it usually outperforms random forest. A gradient-boosted trees model is built in a stage-wise fashion as in other boosting methods, but it generalizes the other methods by allowing optimization of an arbitrary differentiable loss function.
Link to the entire article: https://en.wikipedia.org/wiki/Gradient_boosting
领英推荐
3. Multivariate statistics
Multivariate statistics is a subdivision of statistics encompassing the simultaneous observation and analysis of more than one outcome variable.
Multivariate statistics concerns understanding the different aims and backgrounds of each of the different forms of multivariate analysis, and how they relate to each other. The practical application of multivariate statistics to a particular problem may involve several types of univariate and multivariate analyses in order to understand the relationships between variables and their relevance to the problem being studied.
In addition, multivariate statistics is concerned with multivariate probability distributions, in terms of both
Certain types of problems involving multivariate data, for example, simple linear regression and multiple regression, are not usually considered to be special cases of multivariate statistics because the analysis is dealt with by considering the (univariate) conditional distribution of a single outcome variable given the other variables.
Link to the entire article: https://en.wikipedia.org/wiki/Multivariate_statistics
Conclusion
I hope you found this blog post insightful. Please do share it with your friends & family and subscribe to my blog?Keeping Up With Data Science?for more informative content on Data Science straight to your inbox. You can reach out to me on?Twitter?&?LinkedIn. You can also connect with me on social media by clicking on the icons attached below. I am quite active there & I will be happy to have a conversation with you.
I hope you found this blog post informative. Please feel free to share your feedback in the comments. Thanks for reading & Happy Learning?:)
Global Knowledge Management Lead | Linkedln-managed creator
2 年Chitwan, I think you will find Divyanshu's posts quite interesting.