Ravi Kapoor
Data Scientist — ML & Experimentation
San Jose, CA · ravikapoor.ml · github.com/ravikapoor · linkedin.com/in/ravikapoor
Summary
Data scientist with 5 years shipping models that move business metrics. Built a churn model that helped retention prioritize 12K at-risk users/quarter, designed 9 experiments tied to a $40M revenue base, and partners closely with engineering to ship to production. Strong in Python, scikit-learn, SQL, and causal inference.
Skills
- ModelingPython · scikit-learn · XGBoost · PyTorch · feature engineering
- Stats & experimentsA/B testing · causal inference · statsmodels · hypothesis testing
- DataSQL · pandas · Spark · dbt
- ProductionMLflow · Airflow · model monitoring
Experience
Data Scientist — Streamline Media
2021 — Present
San Jose, CA
- Shipped a gradient-boosted churn model (LightGBM, AUC 0.87) into a weekly batch pipeline; helped the retention team prioritize 12K at-risk users/quarter.
- Designed and read out 9 A/B tests across signup and checkout; one ranked-search experiment shipped a +6% conversion lift on a $40M revenue base.
- Re-scored the homepage recommendation model with session-level features; offline NDCG +9% and online CTR +4% on the holdout cohort.
- Built a reusable experimentation library that cut average test setup from 3 days to half a day for the analytics team.
Junior Data Scientist — Northpoint Analytics
2019 — 2021
Remote
- Built demand-forecasting models (Prophet + XGBoost) that cut inventory stockouts 14% for a retail client.
- Automated a feature pipeline in Airflow, removing ~8 hours/week of manual data prep.
Education
M.S. Statistics — University of California, Davis
2017 — 2019
Thesis: causal inference for observational marketing data
Certifications
Deep Learning Specialization (Coursera) · AWS Certified Machine Learning — Specialty