Mbasha Seth

I am working across AI engineering, data science, and machine learning — my work moves toward problems where technical rigor reaches communities current technology overlooks.

NLP for low-resource African languages · Edge AI and embedded systems for African infrastructure · Narrowing the digital divide through quiet, deliberate work.

Selected Work

Gakyali Mabaga.

A Luganda saying — so little done, so much more to do. Three projects already shipped, with the rest of the road still ahead.

2025 GandaBERT A classifier for Luganda news, built from a corpus that did not exist. Natural Language Processing

The challenge

Luganda is spoken by millions across Uganda. It has almost no public NLP infrastructure — no labeled corpus large enough to train against, no off-the-shelf classifier to fall back on. The task was not only to fit a model, but to make one possible.

The approach

I assembled a 2,600-article corpus by combining native sources, careful machine translation of English news, and synthetic augmentation calibrated to the language's morphology. I fine-tuned Multilingual BERT, then served the model through a small FastAPI layer with a lightweight web demo so anyone could try it on a paragraph of their own.

Where it stands

The classifier reaches 85.7% accuracy across five categories. The corpus, more than the model, is the part I expect to outlive this project.

Built with

Python · PyTorch · Hugging Face · FastAPI

2025 WaterBoard Analytics A shared dashboard for water utilities in four countries. Data Systems

The challenge

Operators in Uganda, Cameroon, Malawi, and Lesotho were comparing performance against the WHO/UNICEF service ladder by spreadsheet. Regulators could not run their own questions; engineers became a bottleneck for everyone else.

The approach

I designed a role-based dashboard separating regulator, operator, and admin views over one schema, and built service-ladder visualizations that drill down by country. A small assistant, backed by Gemini, translates plain-language questions into pandas queries against the live data, with previews and guard-rails before anything runs.

Where it stands

Shipped during the Equitech Futures Fellowship as production tooling for partner utilities. People without engineering backgrounds can ask their own questions and get answers they can cite.

Built with

Python · Streamlit · Plotly · Gemini API · pandas

2025 Malaria Detection Looking for the shortcuts a model takes when no one is watching. Responsible AI

The challenge

Medical-imaging models can learn the wrong things — backgrounds, scanner artifacts, label leakage — and still post good numbers in the lab. The failures show up later, in places the original authors do not see.

The approach

I am probing shortcut learning in malaria-detection models trained on Ghana health data, comparing what the network actually attends to against the regions a clinician would. The work is ongoing.

Where it stands

Early-stage, but oriented toward a deployment standard, not a benchmark score: a model that can be honestly used in a clinic in Accra, not only one that ranks well on a held-out set.

Built with

Python · PyTorch

Other work Hide other work

Where the journey has taken me

A journey of a thousand miles begins with a single step.

Computer Science at Uganda Christian University. Equitech Futures Applied Data Institute Fellow. The work moves between NLP research on low-resource languages, data systems for social impact, embedded engineering, and leadership across technical and operational rooms. The practical question underneath it all is the same one, what does it take to deploy something useful where infrastructure is thin?

2025

Equitech Futures Fellow

Applied Data Institute · 1 of 25 selected globally

A 10-week intensive in Bayesian statistics, deep learning, and causal inference — running alongside data work for water utilities in Uganda, Cameroon, Malawi, and Lesotho, and a parallel research in investigating spurious correlation thread on malaria detection models.

2025

Rhodes Forum · Deep Learning Indaba · Cohere Labs · Stanford

Where curiosity led

Conferences, workshops, and convenings — including Stanford's Precision Medicine & Data Science in Diabetes program. Time spent listening to where the field is heading, what others are wrestling with, and which questions deserve to come home with me.

2024 — 2025

Local Committee President

AIESEC Uganda

Led the chapter through 46% growth in active membership; managed budget and partnerships with Baraka Community Foundation and Happy Times Childcare Initiative.

2024

Embedded Systems Intern

Uganda Industrial Research Institute

Three months on hardware projects between research seasons — sensor integration, PCB design, and a working security-system prototype.

2023 — 2024

Global Consultant

AIESEC International · Information Management

Resolved global platform issues, translated user feedback into product fixes, and helped close the loop between members and engineering.

2022 — 2026

B.Sc. Computer Science

Uganda Christian University

AI, data structures, big-data analytics, robotics, embedded systems. Graduating July 2026.

Learnings

Short courses and certifications taken along the way

AWS Certified Cloud Practitioner

Amazon Web Services

Exploratory Data Analysis for Machine Learning

IBM

Precision Medicine & Data Science in Diabetes

Stanford Deep Data Research Center

Foundations of Data Science

Google

Data Analytics

ALX

IT Essentials

Cisco

Currently reading

Side reading, running parallel to the work.

Where Good Ideas Come From

Steven Johnson

“Chance favors the connected mind.”

What Is Intelligence?

Lessons from AI about evolution, computing, and minds

Blaise Agüera y Arcas

Let’s talk.

“The best way to predict the future is to invent it. The second best is to talk with someone who is.”

— a working belief
Currently
Sharpening technical depth, mapping the research landscape, and learning to recognise the problems I am ready to take on.
Languages
English · Swahili · French · Luganda

Gakyali Mabaga.

Luganda — so little done, so much more to do.