Beyond Limited Labels: Safe Semi-Supervised Learning for Malaria Diagnosis

Abstract

Deploying AI-based diagnostic tools in resource-constrained healthcare settings remains a critical challenge, where the scarcity of annotated medical images limits the viability of fully supervised approaches. Semi-supervised learning (SSL) offers a promising path forward, yet existing SSL methods trade deployment safety for annotation efficiency relying on fixed confidence thresholds that produce dangerously overconfident errors in clinical settings where expert oversight is minimal. We address this gap with an uncertainty-guided SSL framework that integrates Monte Carlo dropout-based epistemic uncertainty estimation with adaptive confidence weighting, enabling safe pseudo-label filtering without sacrificing learning efficiency. Evaluated on malaria cell classification under low-resource conditions (20% labeled data), our method achieves a statistically significant 29.2% reduction in silent failure rate over the strongest SSL baseline, while maintaining comparable diagnostic accuracy with 80% fewer labeled samples. Unlike standard SSL methods whose safety degrades sharply with threshold relaxation, our approach maintains consistent robustness across all operating points—a property essential for real-world deployment across diverse clinical environments.

Key Methodologies & Contributions

Code & Resources