[edit]
Boosting Phonocardiogram Classification Performance with Function Generated Data
Proceedings of the Fifth Machine Learning for Health Symposium, PMLR 297:473-524, 2026.
Abstract
Deep neural networks require large datasets, yet medical phonocardiogram ({PCG}) data are scarce due to privacy and disease rarity. To address this challenge in {PCG} analysis, we present a function-generated {PCG} pipeline that synthesizes {S1}/{S2} heart sounds with modulated noise to emulate aortic stenosis ({AS}), aortic regurgitation ({AR}), and mitral regurgitation ({MR}). Across eight architectures, we compare real-only training, synthetic-only, and synthetic pretraining followed by real fine-tuning ({Syn}$\to${Real}). {Syn}$\to${Real} consistently improves {AUROC} with average gains of +15.3% ({AS}), +17.0% ({AR}), +17.1% ({MR}) on {BMD-HS}, and +7.1%, +8.8%, +6.1% on a private cohort (8,564 recordings). Furthermore, we show {Syn}$\to${Real} is competitive with pretraining on out-of-domain real data, and combining it with multi-stage real fine-tuning yields the best overall performance, highlighting the complementary value of synthetic and real {PCG}s. While synthetic-only training generalizes poorly, pretraining on function-generated {PCG}s consistently improves {PCG} classification over training from scratch, offering a practical path to mitigate data-collection burdens and potentially reduce privacy and ethical exposure.