Projects

Selected projects across data, systems, and analytics.

Scoring Risk of Default Using Banking Transaction Data

Mar 2024

Undergraduate - DSC Capstone Project

Python, ML, NLP, XGBoost

Developed a cash score model for assessing credit risk of first-time applicants.
Led data analysis, income estimation, and feature derivation for robust risk assessment.
Achieved 84% accuracy and 0.87 AUC with XGBoost; identified key default risk factors.
Provided actionable insights to support better lending decisions and inclusive practices.

Report → Website →

Status and Prospects of Data Science Careers

Dec 2023

D3, Storytelling, Visualization

Visualization project on data science job trends and salary growth.
Used a drill-down narrative structure from overview to details.
Covered remote work trends, salary vs experience, geography, and job categories.

Live Demo → GitHub →

Sudoku Solver

Jul 2023

JavaScript, HTML, CSS, Algorithms

Built a Sudoku solver using backtracking.
Maintained both JavaScript and Java versions.

Live Demo → GitHub (JS) → GitHub (Java) →
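For readers unfamiliar with the technique, here is a minimal sketch of a backtracking Sudoku solver. It is written in Python for brevity and is not the project's JavaScript or Java code; the names `find_empty`, `is_valid`, and `solve`, and the convention that `0` marks an empty cell, are assumptions made for this illustration.

```python
from typing import List, Optional, Tuple

Grid = List[List[int]]  # 9x9 grid; 0 marks an empty cell (assumed convention)


def find_empty(grid: Grid) -> Optional[Tuple[int, int]]:
    """Return the (row, col) of the first empty cell, or None if the grid is full."""
    for r in range(9):
        for c in range(9):
            if grid[r][c] == 0:
                return r, c
    return None


def is_valid(grid: Grid, r: int, c: int, digit: int) -> bool:
    """Check that placing digit at (r, c) breaks no row, column, or box rule."""
    if any(grid[r][j] == digit for j in range(9)):
        return False
    if any(grid[i][c] == digit for i in range(9)):
        return False
    box_r, box_c = 3 * (r // 3), 3 * (c // 3)
    for i in range(box_r, box_r + 3):
        for j in range(box_c, box_c + 3):
            if grid[i][j] == digit:
                return False
    return True


def solve(grid: Grid) -> bool:
    """Fill the grid in place; return True if a solution was found."""
    cell = find_empty(grid)
    if cell is None:
        return True  # no empty cells left, so the grid is solved
    r, c = cell
    for digit in range(1, 10):
        if is_valid(grid, r, c, digit):
            grid[r][c] = digit      # tentatively place the digit
            if solve(grid):
                return True
            grid[r][c] = 0          # undo the placement and try the next digit
    return False                    # no digit fits here; the caller backtracks
```

Calling `solve(grid)` on a 9x9 list of lists mutates it in place. This plain version tries cells in row-major order; picking the most constrained cell first is a common speed-up that is left out here.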

Predictive Analysis on Clothing Fit

Dec 2022

Python, scikit-learn, pandas

Developed a predictive model for clothing fit based on user measurements.
Conducted exploratory analysis on size distribution and key drivers.
Implemented baseline and improved models; addressed class imbalance for better performance.

Report → GitHub →

Rank Prediction of NYC Police Officers Based on Civilian Complaints

Jun 2022

Python, ML, Fairness

Built a model to predict officer rank using civilian complaint data.
Improved accuracy from 13% to 34% through feature engineering and tuning.
Performed fairness analysis and a permutation test to assess potential bias.

Report → GitHub →
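As a rough illustration of the permutation test mentioned above, the sketch below compares a per-record metric between two groups. The function name `permutation_test`, its parameters, and the choice of absolute difference in means as the test statistic are all assumptions for this example; the project's report may have used a different statistic or grouping.

```python
import random
from typing import Sequence


def permutation_test(
    group_a: Sequence[float],
    group_b: Sequence[float],
    n_permutations: int = 10_000,
    seed: int = 0,
) -> float:
    """Two-sided permutation test on the absolute difference in group means.

    Returns an estimated p-value: the share of random relabelings whose
    difference in means is at least as large as the observed one.
    """
    def mean(values: Sequence[float]) -> float:
        return sum(values) / len(values)

    rng = random.Random(seed)  # seeded so that the result is reproducible
    observed = abs(mean(group_a) - mean(group_b))
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)

    at_least_as_extreme = 0
    for _ in range(n_permutations):
        rng.shuffle(pooled)  # randomly reassign group labels
        diff = abs(mean(pooled[:n_a]) - mean(pooled[n_a:]))
        if diff >= observed:
            at_least_as_extreme += 1

    # The +1 terms count the observed labeling itself, which keeps p above zero.
    return (at_least_as_extreme + 1) / (n_permutations + 1)
```

In a fairness check, `group_a` and `group_b` might hold per-officer correctness flags (1 for a correct prediction, 0 otherwise) for two demographic groups. A small p-value would suggest the accuracy gap is unlikely to come from chance alone.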