Jointly Extracting Interventions, Outcomes, and Findings from RCT Reports with LLMs

Somin Wadhwa; Jay DeYoung; Benjamin Nye; Silvio Amir; Byron C. Wallace

Jointly Extracting Interventions, Outcomes, and Findings from RCT Reports with LLMs

Somin Wadhwa, Jay DeYoung, Benjamin Nye, Silvio Amir, Byron C. Wallace

Proceedings of the 8th Machine Learning for Healthcare Conference, PMLR 219:754-771, 2023.

Abstract

Results from Randomized Controlled Trials (RCTs) establish the comparative effectiveness of interventions, and are in turn critical inputs for evidence-based care. However, results from RCTs are presented in (often unstructured) natural language articles describing the design, execution, and outcomes of trials; clinicians must manually extract findings pertaining to interventions and outcomes of interest from such articles. This onerous manual process has motivated work on (semi-)automating extraction of structured evidence from trial reports. In this work we propose and evaluate a text-to-text model built on instruction-tuned Large Language Models (LLMs) to jointly extract Interventions, Outcomes, and Comparators (ICO elements) from clinical abstracts, and infer the associated results reported. Manual (expert) and automated evaluations indicate that framing evidence extraction as a conditional generation task and fine-tuning LLMs for this purpose realizes considerable (∼20 point absolute F1 score) gains over the previous SOTA. We perform ablations and error analyses to assess aspects that contribute to model performance, and to highlight potential directions for further improvements. We apply our model to a collection of published RCTs through mid-2022, and release a searchable database of structured findings: http://ico-relations.ebm-nlp.com.

Cite this Paper

BibTeX


@InProceedings{pmlr-v219-wadhwa23a,
  title = 	 {Jointly Extracting Interventions, Outcomes, and Findings from RCT Reports with LLMs},
  author =       {Wadhwa, Somin and DeYoung, Jay and Nye, Benjamin and Amir, Silvio and Wallace, Byron C.},
  booktitle = 	 {Proceedings of the 8th Machine Learning for Healthcare Conference},
  pages = 	 {754--771},
  year = 	 {2023},
  editor = 	 {Deshpande, Kaivalya and Fiterau, Madalina and Joshi, Shalmali and Lipton, Zachary and Ranganath, Rajesh and Urteaga, Iñigo and Yeung, Serene},
  volume = 	 {219},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {11--12 Aug},
  publisher =    {PMLR},
  pdf = 	 {https://proceedings.mlr.press/v219/wadhwa23a/wadhwa23a.pdf},
  url = 	 {https://proceedings.mlr.press/v219/wadhwa23a.html},
  abstract = 	 {Results from Randomized Controlled Trials (RCTs) establish the comparative effectiveness of interventions, and are in turn critical inputs for evidence-based care. However, results from RCTs are presented in (often unstructured) natural language articles describing the design, execution, and outcomes of trials; clinicians must manually extract findings pertaining to interventions and outcomes of interest from such articles. This onerous manual process has motivated work on (semi-)automating extraction of structured evidence from trial reports. In this work we propose and evaluate a text-to-text model built on instruction-tuned Large Language Models (LLMs) to jointly extract Interventions, Outcomes, and Comparators (ICO elements) from clinical abstracts, and infer the associated results reported. Manual (expert) and automated evaluations indicate that framing evidence extraction as a conditional generation task and fine-tuning LLMs for this purpose realizes considerable (∼20 point absolute F1 score) gains over the previous SOTA. We perform ablations and error analyses to assess aspects that contribute to model performance, and to highlight potential directions for further improvements. We apply our model to a collection of published RCTs through mid-2022, and release a searchable database of structured findings: http://ico-relations.ebm-nlp.com.}
}

Endnote

%0 Conference Paper
%T Jointly Extracting Interventions, Outcomes, and Findings from RCT Reports with LLMs
%A Somin Wadhwa
%A Jay DeYoung
%A Benjamin Nye
%A Silvio Amir
%A Byron C. Wallace
%B Proceedings of the 8th Machine Learning for Healthcare Conference
%C Proceedings of Machine Learning Research
%D 2023
%E Kaivalya Deshpande
%E Madalina Fiterau
%E Shalmali Joshi
%E Zachary Lipton
%E Rajesh Ranganath
%E Iñigo Urteaga
%E Serene Yeung	
%F pmlr-v219-wadhwa23a
%I PMLR
%P 754--771
%U https://proceedings.mlr.press/v219/wadhwa23a.html
%V 219
%X Results from Randomized Controlled Trials (RCTs) establish the comparative effectiveness of interventions, and are in turn critical inputs for evidence-based care. However, results from RCTs are presented in (often unstructured) natural language articles describing the design, execution, and outcomes of trials; clinicians must manually extract findings pertaining to interventions and outcomes of interest from such articles. This onerous manual process has motivated work on (semi-)automating extraction of structured evidence from trial reports. In this work we propose and evaluate a text-to-text model built on instruction-tuned Large Language Models (LLMs) to jointly extract Interventions, Outcomes, and Comparators (ICO elements) from clinical abstracts, and infer the associated results reported. Manual (expert) and automated evaluations indicate that framing evidence extraction as a conditional generation task and fine-tuning LLMs for this purpose realizes considerable (∼20 point absolute F1 score) gains over the previous SOTA. We perform ablations and error analyses to assess aspects that contribute to model performance, and to highlight potential directions for further improvements. We apply our model to a collection of published RCTs through mid-2022, and release a searchable database of structured findings: http://ico-relations.ebm-nlp.com.

APA


Wadhwa, S., DeYoung, J., Nye, B., Amir, S. & Wallace, B.C.. (2023). Jointly Extracting Interventions, Outcomes, and Findings from RCT Reports with LLMs. Proceedings of the 8th Machine Learning for Healthcare Conference, in Proceedings of Machine Learning Research 219:754-771 Available from https://proceedings.mlr.press/v219/wadhwa23a.html.

Related Material

Download PDF