Hello, I’m Reginald Erzoah
Contact me and let's create data solutions
Let’s Talk
Seeing how best I can learn, build, and grow in the data space.
I am a data professional dedicated to applying machine learning, data analytics, and business intelligence to solve real-world problems and build scalable solutions.
Skilled in building impactful ML & BI solutions, streamlining reporting systems, and delivering dashboards that drive measurable outcomes, he blends technical expertise with a strong business mindset.
Reginald is an open source builder and contributor.
He is open to collaborations, partnerships, and opportunities to create data solutions that make a lasting impact.
Explore featured portfolio projects and more below.
Below are some featured projects from my portfolio.
Check my github profile for more projects.
Python, open source
Dift is an open-source CLI tool that helps data professionals compare two datasets and instantly understand what changed, why it matters and whether the new data is safe to trust.
Dift is open to contributions.
More Info
Python, Streamlit, Cloudflare R2, Docker
This is an End-to-end ML system predicting credit default risk to support loan decisions. Built Logistic Regression and XGBoost models with feature engineering and missing value handling. Developed interactive Streamlit dashboard with SHAP-based explainability and actionable business insights.
More Info
Python, scikit-learn, cloudpickle, Streamlit, Docker, MinIO, Cloudflare R2
This is an end-to-end customer segmentation pipeline using RFM and KMeans to help businesses identify high-value, occasional, and inactive customers, with a Streamlit dashboard for interactive visualizations and personalized recommendations.
More Info
Python, scikit-learn, joblib, Streamlit
This is an end-to-end ML system to detect fraudulent credit card transactions on an imbalanced dataset, experimenting with Logistic Regression, Random Forest, and XGBoost, and deployed an interactive Streamlit dashboard with SHAP explainability for real-time monitoring and risk reduction.
More Info
Python & Streamlit
This project is an interactive Streamlit dashboard that analyzes and visualizes data quality metrics and error clusters in transactional datasets.
More Info