Important Questions for Data Scientist Interview Pt-1

Important Questions for Data Scientist Interview Pt-1

Q1. What is the main difference between overfitting and underfitting?

A. Overfitting is a modelling error which occurs when a function is too closely fit to a limited set of data points. Underfitting refers to a model that can neither model the training data nor generalize to new data.

Q2. VIF

A.?Variance Inflation Factor (VIF) is used to detect the presence of multicollinearity. Variance inflation factors (VIF) measure how much the variance of the estimated regression coefficients are inflated as compared to when the predictor variables are not linearly related.

Q3. Difference Between Bagging and Boosting

A.?Bagging is a way to decrease the variance in the prediction by generating additional data for training from dataset using combinations with repetitions to produce multi-sets of the original data. Boosting is an iterative technique which adjusts the weight of an observation based on the last classification.

Q4. P value and its significance

A.?The p-value is the probability that the null hypothesis is true. (1 – the p-value) is the probability that the alternative hypothesis is true. A low p-value shows that the results are replicable. A low p-value shows that the effect is large or that the result is of major theoretical, clinical or practical importance.

Are you looking to become a Data Scientist? Check out my Data Science Combo course. Click on the link below.

?Course Link: https://learneverythingai.com/data-science-combo-course/

?Website: https://www.learneverythingai.com

No alt text provided for this image


要查看或添加评论,请登录

Shivam Modi的更多文章

社区洞察

其他会员也浏览了