Important Questions for Data Scientist Interview Pt-1
Shivam Modi
46K Followers | I help people build their AI & Data Science career | Founder & CEO - Learn Everything AI | IIT Bombay | Click "Follow" to learn AI & Data Science daily
Q1. What is the main difference between overfitting and underfitting?
A. Overfitting is a modelling error which occurs when a function is too closely fit to a limited set of data points. Underfitting refers to a model that can neither model the training data nor generalize to new data.
Q2. VIF
A.?Variance Inflation Factor (VIF) is used to detect the presence of multicollinearity. Variance inflation factors (VIF) measure how much the variance of the estimated regression coefficients are inflated as compared to when the predictor variables are not linearly related.
Q3. Difference Between Bagging and Boosting
A.?Bagging is a way to decrease the variance in the prediction by generating additional data for training from dataset using combinations with repetitions to produce multi-sets of the original data. Boosting is an iterative technique which adjusts the weight of an observation based on the last classification.
Q4. P value and its significance
A.?The p-value is the probability that the null hypothesis is true. (1 – the p-value) is the probability that the alternative hypothesis is true. A low p-value shows that the results are replicable. A low p-value shows that the effect is large or that the result is of major theoretical, clinical or practical importance.
Are you looking to become a Data Scientist? Check out my Data Science Combo course. Click on the link below.
?Course Link: https://learneverythingai.com/data-science-combo-course/
?Website: https://www.learneverythingai.com