Data Science and Analytics Portfolio

Seven completed projects spanning financial technology, machine learning, NLP, healthcare, and community data systems. Each project is documented with its methodology and result metrics, plus downloadable code where available.

7 Projects | 4 Notebooks | 20+ Freelance Projects

TreasuryIQ - NBFC Liquidity Risk and ALM Platform
Financial Technology, 2025

Production-grade, AI-driven asset-liability management (ALM) intelligence platform built at Technocolabs Softwares. Identified a severe maturity mismatch across 5 asset and liability buckets in a 10,000-row synthetic NBFC dataset.
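
A maturity mismatch is commonly measured as the gap between assets and liabilities falling due in each time bucket, together with the running total of those gaps. The sketch below illustrates that calculation only; the bucket labels and amounts are hypothetical and are not taken from the project's dataset or code.

```python
from itertools import accumulate

# Hypothetical bucket labels and amounts (illustrative only, not project data).
BUCKETS = ["0-1M", "1-3M", "3-6M", "6-12M", ">12M"]
assets = [120.0, 150.0, 200.0, 260.0, 470.0]
liabilities = [300.0, 280.0, 220.0, 180.0, 220.0]


def maturity_gaps(assets, liabilities):
    """Return per-bucket gaps (assets minus liabilities) and their running total."""
    if len(assets) != len(liabilities):
        raise ValueError("assets and liabilities must have one value per bucket")
    gaps = [a - l for a, l in zip(assets, liabilities)]
    cumulative = list(accumulate(gaps))
    return gaps, cumulative


gaps, cumulative = maturity_gaps(assets, liabilities)
for name, gap, cum in zip(BUCKETS, gaps, cumulative):
    # A negative cumulative gap means more liabilities than assets have
    # fallen due up to and including this bucket.
    flag = "SHORTFALL" if cum < 0 else "ok"
    print(f"{name:>6}  gap={gap:8.1f}  cumulative={cum:8.1f}  {flag}")
```

In this toy data the short-dated buckets carry the shortfall, which is the typical shape of a mismatch where short-term borrowing funds longer-term lending.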

Python, SQL, XGBoost, GCP BigQuery, Docker
Credit Card Fraud Detection: End-to-End ML Pipeline
Machine Learning, 2024

XGBoost, Random Forest, and Logistic Regression trained on 50,000 transactions. ROC-AUC 0.983, PR-AUC 0.871. Class imbalance handled with SMOTE oversampling; includes feature importance analysis.
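
SMOTE balances a dataset by creating synthetic minority-class points: it picks a minority sample, picks one of its nearest minority neighbours, and interpolates between the two. The following is a minimal pure-Python sketch of that idea, not the project's code; the function name `smote_sketch` and the sample points are hypothetical. In practice a library implementation such as the one in imbalanced-learn would normally be used, and applied to the training split only.

```python
import math
import random


def smote_sketch(minority, n_new, k=3, seed=0):
    """Create n_new synthetic points by interpolating between a minority
    sample and one of its k nearest minority neighbours."""
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # Rank the other minority samples by Euclidean distance to the base.
        others = [p for p in minority if p is not base]
        others.sort(key=lambda p: math.dist(base, p))
        neighbour = rng.choice(others[:k])
        t = rng.random()  # interpolation weight in [0, 1)
        synthetic.append(tuple(b + t * (n - b) for b, n in zip(base, neighbour)))
    return synthetic


# Hypothetical 2-D feature vectors for the rare (fraud) class.
fraud = [(1.0, 2.0), (1.5, 1.8), (2.0, 2.2), (1.2, 2.5)]
new_points = smote_sketch(fraud, n_new=6)
print(len(new_points), "synthetic fraud samples generated")
```

Because every synthetic point lies on a segment between two real minority samples, the new points stay inside the region the minority class already occupies.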

XGBoost, SMOTE, scikit-learn, PR-AUC
NLP Text Classification: SMS Spam Detection
NLP, 2024

TF-IDF with bigrams and a linear SVM on 5,572 SMS messages. 98.9% accuracy and a 5-fold cross-validated F1 of 0.981. Porter stemming, stopword removal, discriminative word analysis.
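
A TF-IDF plus linear SVM classifier of this kind can be sketched in a few lines of scikit-learn. The messages below are made up for illustration, and the sketch omits the Porter stemming step, so it should be read as an outline of the approach rather than the project's actual pipeline.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC

# Hypothetical toy messages (not from the SMS dataset used in the project).
texts = [
    "win a free prize claim today",
    "free entry claim your cash prize",
    "urgent winner claim free voucher",
    "are we meeting for lunch tomorrow",
    "see you at the office later",
    "lunch at the usual place tomorrow",
]
labels = ["spam", "spam", "spam", "ham", "ham", "ham"]

# ngram_range=(1, 2) adds bigrams alongside single words;
# stop_words="english" applies scikit-learn's built-in stopword list.
model = make_pipeline(
    TfidfVectorizer(ngram_range=(1, 2), stop_words="english"),
    LinearSVC(),
)
model.fit(texts, labels)

predictions = model.predict(["claim your free prize", "lunch tomorrow at the office"])
print(list(predictions))
```

Wrapping the vectoriser and classifier in one pipeline means cross-validation refits the vocabulary on each training fold, which keeps the reported score honest.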

TF-IDF, LinearSVC, NLTK, bigrams
Customer Churn Prediction: Production-Ready Pipeline
Machine Learning, 2024

scikit-learn Pipeline with ColumnTransformer, so preprocessing is fitted on training folds only and data leakage is avoided. GridSearchCV-tuned Gradient Boosting on 7,043 telecom records. ROC-AUC 0.934, PR-AUC 0.812. Business impact analysis.
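
The leakage-avoiding structure can be illustrated with a small sketch: preprocessing sits inside the pipeline, so GridSearchCV refits the scaler and encoder on each training fold rather than on the full dataset. The column names, toy data, and parameter grid below are hypothetical and chosen only to keep the example small.

```python
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import GridSearchCV
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler

# Hypothetical toy data; the column names are not the project's schema.
df = pd.DataFrame({
    "tenure": [1, 34, 2, 45, 8, 22, 10, 62, 3, 50, 5, 40],
    "monthly_charges": [29.9, 56.9, 53.9, 42.3, 99.7, 89.1,
                        29.8, 56.2, 80.0, 45.0, 95.0, 60.0],
    "contract": ["monthly", "yearly", "monthly", "yearly", "monthly", "monthly",
                 "monthly", "yearly", "monthly", "yearly", "monthly", "yearly"],
    "churn": [1, 0, 1, 0, 1, 0, 0, 0, 1, 0, 1, 0],
})
X, y = df.drop(columns="churn"), df["churn"]

# Scale numeric columns and one-hot encode the categorical column.
preprocess = ColumnTransformer([
    ("num", StandardScaler(), ["tenure", "monthly_charges"]),
    ("cat", OneHotEncoder(handle_unknown="ignore"), ["contract"]),
])
pipeline = Pipeline([
    ("preprocess", preprocess),
    ("model", GradientBoostingClassifier(random_state=0)),
])

# Parameters of a pipeline step are addressed as "<step name>__<parameter>".
search = GridSearchCV(
    pipeline,
    param_grid={"model__n_estimators": [20, 50], "model__max_depth": [2, 3]},
    scoring="roc_auc",
    cv=3,
)
search.fit(X, y)
print(search.best_params_)
```

Fitting the scaler or encoder on the whole dataset before splitting would let statistics from the validation folds influence training, which is the leakage this layout is meant to rule out.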

Gradient Boosting, Pipeline, GridSearchCV, threshold optimisation
Multi-Class News Article Text Classifier
Natural Language Processing, 2024

TF-IDF vectorisation with 15,000 bigram features on the 20 Newsgroups dataset (6 categories). Linear SVM: 94% accuracy and 0.94 weighted F1. Keyword analysis reveals discriminative terms per class.

TF-IDF, LinearSVC, 20 Newsgroups, scikit-learn
Kwara State Livestock Vaccination Data Pipeline
Healthcare and Community Data, 2025

Python data pipeline and choropleth geo-visualisations for the 2026 Kwara State Mass TADs Preliminary Report. Coverage analytics and Power BI dashboards for 16 local government areas.

GeoPandas, Power BI, Python, choropleth
Ifeloju Community Security Levy Tracking System
Community Data Systems, 2024

Multi-sheet Excel workbook tracking 200+ households, payment history, arrears, and automated summary reports. Built for the Ifeloju Community security management committee in Ibadan.

Excel, VBA, Data Management

Download the Notebooks

Four of these projects have fully documented Jupyter notebooks available for direct download as .ipynb files.