Top Data Scientist Interview Questions & Answers
Data scientist interviews span statistics, machine learning, coding, and product sense. The goal is to show you can turn messy, real-world data into decisions — not just build models.
1Statistics & Machine Learning
What is the bias-variance tradeoff?
Bias is error from incorrect model assumptions — a high-bias model underfits and performs poorly on training and test data. Variance is sensitivity to noise in the training data — a high-variance model overfits and performs well on training data but poorly on test data. The tradeoff: increasing model complexity reduces bias but increases variance. The goal is the minimum total error, achieved by selecting the right model complexity and regularization.
How do you handle class imbalance in a classification problem?
Options: (1) Resampling — oversample the minority class (SMOTE) or undersample the majority class. (2) Class weights — penalize misclassification of the minority class more heavily in the loss function. (3) Threshold tuning — adjust the decision threshold based on the precision-recall tradeoff for your business objective. (4) Ensemble methods — boosting algorithms (XGBoost, LightGBM) handle imbalance well. Evaluate with AUC-ROC or precision-recall, not accuracy.
What is regularization and when would you use L1 vs L2?
Regularization adds a penalty to the loss function to reduce model complexity and prevent overfitting. L1 (Lasso) adds the absolute value of coefficients — it drives some weights to exactly zero, performing feature selection. L2 (Ridge) adds the squared value — it shrinks weights toward zero but rarely to exactly zero. Use L1 when you want feature selection; L2 when all features may contribute but need to be constrained.
How do you evaluate a machine learning model beyond accuracy?
Accuracy is misleading for imbalanced classes. Use precision and recall for the minority class, F1-score as the harmonic mean, AUC-ROC for ranking performance across thresholds, AUC-PR for imbalanced classification. For regression: RMSE (penalizes large errors), MAE (robust to outliers), MAPE (interpretable as a percentage). Always evaluate on a held-out test set, not the training or validation set.
2Product & Business Sense
How would you design an A/B test for a new feature?
Define the hypothesis and the primary metric you are testing. Determine the minimum detectable effect and calculate the required sample size (using a power calculation). Randomly assign users to control and treatment groups, ensuring they are representative. Run the experiment until you reach the required sample size — do not stop early based on interim results (p-hacking). Analyze with the pre-specified metric, check for novelty effects and interaction effects, and make a decision with statistical confidence.
Tell me about a data project where your analysis changed a business decision.
Use the STAR format. Be specific about the business question, the data you used, the analysis you ran, the insight you surfaced, and the decision that changed as a result. Quantify the impact if possible. The strongest answers show you communicated the insight clearly to a non-technical stakeholder and drove action.
How to Prepare for Data Scientist Interviews
- ⚡Practice SQL window functions, complex joins, and aggregations — most DS interviews include a SQL screen
- ⚡Be ready to code a logistic regression, a decision tree, and cross-validation from scratch in Python
- ⚡Review Central Limit Theorem, hypothesis testing (p-values, confidence intervals), and Bayesian probability
- ⚡Prepare a "metrics" story: a time you defined, built, or improved a key business metric
More Interview Guides
Land the Interview First
An ATS-optimized resume is your ticket to the interview room. Build yours in 5 minutes.
🎯 Take Free Quiz📱 Download AppQuick Prep Checklist
- ✅Prepare 5-7 STAR stories
- ✅Research the company & team
- ✅Practice questions aloud
- ✅Prepare 3 thoughtful questions to ask
- ✅Confirm interview format and logistics
In This Guide
- Statistics & Machine Learning4
- Product & Business Sense2
All Interview Guides
- → Software Engineer Questions
- → Product Manager Questions
- → Data Analyst Questions
- → Marketing Manager Questions
- → All Roles Questions
- → Nurse Questions
- → Financial Analyst Questions
- → Project Manager Questions
- → Teacher Questions
- → Sales Representative Questions
- → HR Manager Questions
- → UX Designer Questions
- → Operations Manager Questions
- → Customer Service Representative Questions
- → Data Scientist Questions
- → Account Executive Questions
- → Graphic Designer Questions
- → Business Analyst Questions
- → DevOps Engineer Questions
- → Accountant Questions
- → Supply Chain Manager Questions
- → Content Writer Questions
- → Paralegal Questions
- → Mechanical Engineer Questions
- → Social Media Manager Questions
Ready for the Full Process?
Start with a resume that passes ATS. IntelligentCV builds it in 5 minutes — then use this guide to nail the interview.