Flowing Straighter with Conditional Flow Matching for Accurate Speech Enhancement

Mattias Cross; Anton Ragni

Flowing Straighter with Conditional Flow Matching for Accurate Speech Enhancement

Mattias Cross, Anton Ragni

Proceedings of the 2nd ECAI Workshop on "Machine Learning Meets Differential Equations: From Theory to Applications", PMLR 277:121-132, 2025.

Abstract

Current flow based generative speech enhancement methods learn curved probability paths which model a mapping between clean and noisy speech. Despite impressive performance, the implications of curved probability paths are unknown. Methods such as Schr\"{}odinger bridges focus on curved paths, where time dependent gradients and variance do not promote straight paths. Findings in machine learning research suggest that straight paths, such as conditional flow matching, are easier to train and offer better generalisation. In this paper we quantify the effect of path straightness on speech enhancement quality. We report experiments with the Schrödinger bridge, where we show that certain configurations lead to straighter paths. Conversely, we propose independent conditional flow matching for speech enhancement, which models straight paths between noisy and clean speech. We demonstrate empirically that a time independent variance has a greater effect on sample quality than the gradient. Although conditional flow matching improves several speech quality metrics, it requires multiple inference steps. We rectify this with a one step solution by inferring the trained flow based model as if it was directly predictive. Our work suggests that straighter time independent probability paths improve generative speech enhancement over curved time dependent paths.

Cite this Paper

BibTeX

@InProceedings{pmlr-v277-cross25a,
  title = 	 {Flowing Straighter with Conditional Flow Matching for Accurate Speech Enhancement},
  author =       {Cross, Mattias and Ragni, Anton},
  booktitle = 	 {Proceedings of the 2nd ECAI Workshop on "Machine Learning Meets Differential Equations: From Theory to Applications"},
  pages = 	 {121--132},
  year = 	 {2025},
  editor = 	 {Coelho, Cecı́lia and Zimmering, Bernd and Costa, M. Fernanda P. and Ferrás, Luı́s L. and Niggemann, Oliver},
  volume = 	 {277},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {26 Oct},
  publisher =    {PMLR},
  pdf = 	 {https://raw.githubusercontent.com/mlresearch/v277/main/assets/cross25a/cross25a.pdf},
  url = 	 {https://proceedings.mlr.press/v277/cross25a.html},
  abstract = 	 {Current flow based generative speech enhancement methods learn curved probability paths which model a mapping between clean and noisy speech. Despite impressive performance, the implications of curved probability paths are unknown. Methods such as Schr\"{}odinger bridges focus on curved paths, where time dependent gradients and variance do not promote straight paths. Findings in machine learning research suggest that straight paths, such as conditional flow matching, are easier to train and offer better generalisation. In this paper we quantify the effect of path straightness on speech enhancement quality. We report experiments with the Schrödinger bridge, where we show that certain configurations lead to straighter paths. Conversely, we propose independent conditional flow matching for speech enhancement, which models straight paths between noisy and clean speech. We demonstrate empirically that a time independent variance has a greater effect on sample quality than the gradient. Although conditional flow matching improves several speech quality metrics, it requires multiple inference steps. We rectify this with a one step solution by inferring the trained flow based model as if it was directly predictive. Our work suggests that straighter time independent probability paths improve generative speech enhancement over curved time dependent paths.}
}

Endnote

%0 Conference Paper
%T Flowing Straighter with Conditional Flow Matching for Accurate Speech Enhancement
%A Mattias Cross
%A Anton Ragni
%B Proceedings of the 2nd ECAI Workshop on "Machine Learning Meets Differential Equations: From Theory to Applications"
%C Proceedings of Machine Learning Research
%D 2025
%E Cecı́lia Coelho
%E Bernd Zimmering
%E M. Fernanda P. Costa
%E Luı́s L. Ferrás
%E Oliver Niggemann	
%F pmlr-v277-cross25a
%I PMLR
%P 121--132
%U https://proceedings.mlr.press/v277/cross25a.html
%V 277
%X Current flow based generative speech enhancement methods learn curved probability paths which model a mapping between clean and noisy speech. Despite impressive performance, the implications of curved probability paths are unknown. Methods such as Schr\"{}odinger bridges focus on curved paths, where time dependent gradients and variance do not promote straight paths. Findings in machine learning research suggest that straight paths, such as conditional flow matching, are easier to train and offer better generalisation. In this paper we quantify the effect of path straightness on speech enhancement quality. We report experiments with the Schrödinger bridge, where we show that certain configurations lead to straighter paths. Conversely, we propose independent conditional flow matching for speech enhancement, which models straight paths between noisy and clean speech. We demonstrate empirically that a time independent variance has a greater effect on sample quality than the gradient. Although conditional flow matching improves several speech quality metrics, it requires multiple inference steps. We rectify this with a one step solution by inferring the trained flow based model as if it was directly predictive. Our work suggests that straighter time independent probability paths improve generative speech enhancement over curved time dependent paths.

APA

Cross, M. & Ragni, A.. (2025). Flowing Straighter with Conditional Flow Matching for Accurate Speech Enhancement. Proceedings of the 2nd ECAI Workshop on "Machine Learning Meets Differential Equations: From Theory to Applications", in Proceedings of Machine Learning Research 277:121-132 Available from https://proceedings.mlr.press/v277/cross25a.html.

Related Material

Download PDF