Math Heavy Topics!

Math Heavy Topics!

1. Support-vector machine

Given a set of training examples, each marked as belonging to one of two categories, an SVM training algorithm builds a model that assigns new examples to one category or the other, making it a non-probabilistic binary linear classifier (although methods such as Platt scaling exist to use SVM in a probabilistic classification setting). SVM maps training examples to points in space so as to maximize the width of the gap between the two categories. New examples are then mapped into that same space and predicted to belong to a category based on which side of the gap they fall.

In addition to performing linear classification, SVMs can efficiently perform a non-linear classification using what is called the kernel trick, implicitly mapping their inputs into high-dimensional feature spaces.

Link to the entire article: https://en.wikipedia.org/wiki/Support-vector_machine


2. Gradient boosting

Gradient boosting is a machine learning technique used in regression and classification tasks, among others. It gives a prediction model in the form of an ensemble of weak prediction models, which are typically decision trees. When a decision tree is a weak learner, the resulting algorithm is called gradient-boosted trees; it usually outperforms random forest. A gradient-boosted trees model is built in a stage-wise fashion as in other boosting methods, but it generalizes the other methods by allowing optimization of an arbitrary differentiable loss function.

Link to the entire article: https://en.wikipedia.org/wiki/Gradient_boosting


3. Multivariate statistics

Multivariate statistics is a subdivision of statistics encompassing the simultaneous observation and analysis of more than one outcome variable.

Multivariate statistics concerns understanding the different aims and backgrounds of each of the different forms of multivariate analysis, and how they relate to each other. The practical application of multivariate statistics to a particular problem may involve several types of univariate and multivariate analyses in order to understand the relationships between variables and their relevance to the problem being studied.

In addition, multivariate statistics is concerned with multivariate probability distributions, in terms of both

  1. How these can be used to represent the distributions of observed data?
  2. How they can be used as part of statistical inference, particularly where several different quantities are of interest to the same analysis?

Certain types of problems involving multivariate data, for example, simple linear regression and multiple regression, are not usually considered to be special cases of multivariate statistics because the analysis is dealt with by considering the (univariate) conditional distribution of a single outcome variable given the other variables.

Link to the entire article: https://en.wikipedia.org/wiki/Multivariate_statistics


Conclusion

I hope you found this blog post insightful. Please do share it with your friends & family and subscribe to my blog?Keeping Up With Data Science?for more informative content on Data Science straight to your inbox. You can reach out to me on?Twitter?&?LinkedIn. You can also connect with me on social media by clicking on the icons attached below. I am quite active there & I will be happy to have a conversation with you.

I hope you found this blog post informative. Please feel free to share your feedback in the comments. Thanks for reading & Happy Learning?:)

Supriya Bohra

Global Knowledge Management Lead | Linkedln-managed creator

2 年

Chitwan, I think you will find Divyanshu's posts quite interesting.

要查看或添加评论,请登录

Chitwan Manchanda的更多文章

社区洞察

其他会员也浏览了