My Data Science Projects
50M+ Data Points | 510+ Securities | 95% Model Accuracy
Projects I've worked on - from loan default prediction to algorithmic trading systems. Each one solves real problems with real data.
By the Numbers
Featured Projects
Bungaku
Billboard Music Analysis Platform
Statistical analysis of 130K+ Billboard entries using ANOVA, Tukey HSD, and t-tests to discover seasonal patterns and decade-based shifts in music trends.
CandleThrob
Algorithmic Trading System
Advanced trading system tracking 510+ securities with 24+ years of historical data. Calculates 113+ technical indicators and stores everything in Oracle.
Customer Purchase Predictor
ML-Powered Customer Analytics
Logistic classifier analyzing 3.9M+ behavioral events to predict purchase triggers with optimized business decision thresholds.
Lending Club Risk Model
Credit Risk Assessment System
XGBoost model processing 2.2M+ loan records with SHAP explainability for transparent risk assessment decisions.
Breast Cancer Classifier
From-Scratch ML Implementation
Coded logistic regression from scratch achieving 95% test accuracy. Implemented without ML libraries to demonstrate deep understanding of mathematical foundations and algorithm mechanics.