Selected projects across Applied Data Science and AI Engineering.
Each project includes links to code, live demos/APIs where available, and measurable results.
End-to-end MLOps system predicting 30-day hospital readmission risk at discharge with live API + Streamlit dashboard, monitoring, explainability, and ROI analysis.
Reduced preventable readmissions with projected £7.9M annual savings
AI-powered contract analysis and risk scoring system with live Streamlit dashboard + FastAPI service on Cloud Run, batch processing, CI/CD, and monitoring.
Automated contract risk detection with 97% contract-type accuracy
MSc thesis optimizing training datasets and feature selection for linear B-cell epitope prediction, benchmarking XGBoost vs neural networks on AUC, F1, and MCC.
Achieved 99.4% AUC-ROC with optimized training sets for vaccine development
Production-ready RAG chatbot for UK legal queries, processing 131,253+ chunks with sub-3s latency, hybrid retrieval (BM25 + FAISS + RRF), enterprise auth, and guardrails.
Enabled sub-3s legal search over 130k+ document chunks
Production-grade multi-agent system for automated GDPR/HIPAA/CCPA compliance audits using Google ADK + Gemini + Presidio + ChromaDB with rigorous evaluation.
Automated GDPR/HIPAA/CCPA compliance audits with 85%+ precision