Data Scientist · Interview Prep 2026

Top Data Scientist Interview Questions & Answers

Data scientist interviews span statistics, machine learning, coding, and product sense. The goal is to show you can turn messy, real-world data into decisions — not just build models.

Statistics & Machine Learning (4 questions)Product & Business Sense (2 questions)

1Statistics & Machine Learning

Q

What is the bias-variance tradeoff?

A

Bias is error from incorrect model assumptions — a high-bias model underfits and performs poorly on training and test data. Variance is sensitivity to noise in the training data — a high-variance model overfits and performs well on training data but poorly on test data. The tradeoff: increasing model complexity reduces bias but increases variance. The goal is the minimum total error, achieved by selecting the right model complexity and regularization.

Q

How do you handle class imbalance in a classification problem?

A

Options: (1) Resampling — oversample the minority class (SMOTE) or undersample the majority class. (2) Class weights — penalize misclassification of the minority class more heavily in the loss function. (3) Threshold tuning — adjust the decision threshold based on the precision-recall tradeoff for your business objective. (4) Ensemble methods — boosting algorithms (XGBoost, LightGBM) handle imbalance well. Evaluate with AUC-ROC or precision-recall, not accuracy.

Q

What is regularization and when would you use L1 vs L2?

A

Regularization adds a penalty to the loss function to reduce model complexity and prevent overfitting. L1 (Lasso) adds the absolute value of coefficients — it drives some weights to exactly zero, performing feature selection. L2 (Ridge) adds the squared value — it shrinks weights toward zero but rarely to exactly zero. Use L1 when you want feature selection; L2 when all features may contribute but need to be constrained.

Q

How do you evaluate a machine learning model beyond accuracy?

A

Accuracy is misleading for imbalanced classes. Use precision and recall for the minority class, F1-score as the harmonic mean, AUC-ROC for ranking performance across thresholds, AUC-PR for imbalanced classification. For regression: RMSE (penalizes large errors), MAE (robust to outliers), MAPE (interpretable as a percentage). Always evaluate on a held-out test set, not the training or validation set.

2Product & Business Sense

Q

How would you design an A/B test for a new feature?

A

Define the hypothesis and the primary metric you are testing. Determine the minimum detectable effect and calculate the required sample size (using a power calculation). Randomly assign users to control and treatment groups, ensuring they are representative. Run the experiment until you reach the required sample size — do not stop early based on interim results (p-hacking). Analyze with the pre-specified metric, check for novelty effects and interaction effects, and make a decision with statistical confidence.

Q

Tell me about a data project where your analysis changed a business decision.

A

Use the STAR format. Be specific about the business question, the data you used, the analysis you ran, the insight you surfaced, and the decision that changed as a result. Quantify the impact if possible. The strongest answers show you communicated the insight clearly to a non-technical stakeholder and drove action.

Get the Interview First
IntelligentCV builds an ATS-optimized resume that gets you to the interview stage.
🎯 Free Quiz

How to Prepare for Data Scientist Interviews

  • Practice SQL window functions, complex joins, and aggregations — most DS interviews include a SQL screen
  • Be ready to code a logistic regression, a decision tree, and cross-validation from scratch in Python
  • Review Central Limit Theorem, hypothesis testing (p-values, confidence intervals), and Bayesian probability
  • Prepare a "metrics" story: a time you defined, built, or improved a key business metric

More Interview Guides

Ready for the Full Process?

Start with a resume that passes ATS. IntelligentCV builds it in 5 minutes — then use this guide to nail the interview.