Evaluating Tuning Strategies for Sequence Generation with Protein Language Models

Andrea Nathansen, Kevin Klein, Bernhard Renard, Melania Nowicka, Jakub M Bartoszewicz
Proceedings of the 18th Machine Learning in Computational Biology meeting, PMLR 240:76-89, 2024.

Abstract

Designing artificial proteins with specialized functions promises new solutions for biological, medical, and environmental use cases. This field benefits from advances in natural language processing, with state-of-the-art text generation models already being successfully applied to protein sequences. Openly available pre-trained protein language models are able to generate artificial protein sequences and can be finetuned on very specific tasks. Considering the high computational cost of finetuning a model exclusively for one downstream task, prompt tuning has been proposed as a more cost-efficient alternative that shares one model across different tasks. However, no openly available implementation of this approach compatible with protein language models has been previously published. Thus, we adapt an open-source codebase designed for NLP models to build a pipeline for prompt tuning on protein sequence data, supporting the protein language models ProtGPT2 and RITA. We benchmark this implementation for generating proteins of a specific family and evaluate the approach using text processing metrics as well as family membership prediction and protein activity prediction of generated sequences. Our results confirm the advantages of prompt tuning in resource usage, especially storage, encouraging further research and expansion of this technique to related use cases. For our evaluated use case, prompt tuning does not match finetuning in the quality of generated protein sequences, indicating the need for more extensive optimization. Lastly, we observe discrepancies between results of similar evaluation tools, highlighting open problems for principled assessment of protein sequence generation quality.
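The cost advantage the abstract describes comes from the structure of prompt tuning: the pre-trained model stays frozen and shared across tasks, and only a short sequence of learnable "virtual token" embeddings, prepended to the input, is trained and stored per task. A minimal sketch of that idea is below; the tiny GRU-based model, dimensions, and names are illustrative stand-ins, not the paper's ProtGPT2/RITA pipeline or its actual codebase.

```python
# Minimal sketch of prompt tuning (assumption: a tiny GRU model stands in for
# a real protein language model such as ProtGPT2 or RITA).
import torch
import torch.nn as nn

class TinyCausalLM(nn.Module):
    """Frozen stand-in for a pre-trained protein language model."""
    def __init__(self, vocab_size=25, d_model=32):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)   # ~20 amino acids + specials
        self.backbone = nn.GRU(d_model, d_model, batch_first=True)
        self.lm_head = nn.Linear(d_model, vocab_size)

    def forward(self, inputs_embeds):
        hidden, _ = self.backbone(inputs_embeds)
        return self.lm_head(hidden)

class SoftPrompt(nn.Module):
    """The only trainable part: virtual-token embeddings prepended to the input."""
    def __init__(self, n_virtual_tokens=8, d_model=32):
        super().__init__()
        self.prompt = nn.Parameter(torch.randn(n_virtual_tokens, d_model) * 0.02)

    def forward(self, token_embeds):
        batch = token_embeds.size(0)
        prompt = self.prompt.unsqueeze(0).expand(batch, -1, -1)
        return torch.cat([prompt, token_embeds], dim=1)

model = TinyCausalLM()
for p in model.parameters():
    p.requires_grad_(False)       # base model frozen, shareable across tasks

soft_prompt = SoftPrompt()        # per-task artifact: only these weights are stored

tokens = torch.randint(0, 25, (2, 10))          # dummy amino-acid token ids
embeds = soft_prompt(model.embed(tokens))        # shape: [2, 8 + 10, 32]
logits = model(embeds)

trainable = sum(p.numel() for p in soft_prompt.parameters())
frozen = sum(p.numel() for p in model.parameters())
print(trainable, frozen)  # trainable = 8 * 32 = 256, a tiny fraction of the model
```

Storing one such prompt per downstream task, instead of a full finetuned model copy, is exactly the storage saving the abstract highlights.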

Cite this Paper

BibTeX
@InProceedings{pmlr-v240-nathansen24a,
  title     = {Evaluating Tuning Strategies for Sequence Generation with Protein Language Models},
  author    = {Nathansen, Andrea and Klein, Kevin and Renard, Bernhard and Nowicka, Melania and Bartoszewicz, Jakub M},
  booktitle = {Proceedings of the 18th Machine Learning in Computational Biology meeting},
  pages     = {76--89},
  year      = {2024},
  editor    = {Knowles, David A. and Mostafavi, Sara},
  volume    = {240},
  series    = {Proceedings of Machine Learning Research},
  month     = {30 Nov--01 Dec},
  publisher = {PMLR},
  pdf       = {https://proceedings.mlr.press/v240/nathansen24a/nathansen24a.pdf},
  url       = {https://proceedings.mlr.press/v240/nathansen24a.html},
  abstract  = {Designing artificial proteins with specialized functions promises new solutions for biological, medical, and environmental use cases. This field benefits from advances in natural language processing, with state-of-the-art text generation models already being successfully applied to protein sequences. Openly available pre-trained protein language models are able to generate artificial protein sequences and can be finetuned on very specific tasks. Considering the high computational cost of finetuning a model exclusively for one downstream task, prompt tuning has been proposed as a more cost-efficient alternative that shares one model across different tasks. However, no openly available implementation of this approach compatible with protein language models has been previously published. Thus, we adapt an open-source codebase designed for NLP models to build a pipeline for prompt tuning on protein sequence data, supporting the protein language models ProtGPT2 and RITA. We benchmark this implementation for generating proteins of a specific family and evaluate the approach using text processing metrics as well as family membership prediction and protein activity prediction of generated sequences. Our results confirm the advantages of prompt tuning in resource usage, especially storage, encouraging further research and expansion of this technique to related use cases. For our evaluated use case, prompt tuning does not match finetuning in the quality of generated protein sequences, indicating the need for more extensive optimization. Lastly, we observe discrepancies between results of similar evaluation tools, highlighting open problems for principled assessment of protein sequence generation quality.}
}
Endnote
%0 Conference Paper
%T Evaluating Tuning Strategies for Sequence Generation with Protein Language Models
%A Andrea Nathansen
%A Kevin Klein
%A Bernhard Renard
%A Melania Nowicka
%A Jakub M Bartoszewicz
%B Proceedings of the 18th Machine Learning in Computational Biology meeting
%C Proceedings of Machine Learning Research
%D 2024
%E David A. Knowles
%E Sara Mostafavi
%F pmlr-v240-nathansen24a
%I PMLR
%P 76--89
%U https://proceedings.mlr.press/v240/nathansen24a.html
%V 240
%X Designing artificial proteins with specialized functions promises new solutions for biological, medical, and environmental use cases. This field benefits from advances in natural language processing, with state-of-the-art text generation models already being successfully applied to protein sequences. Openly available pre-trained protein language models are able to generate artificial protein sequences and can be finetuned on very specific tasks. Considering the high computational cost of finetuning a model exclusively for one downstream task, prompt tuning has been proposed as a more cost-efficient alternative that shares one model across different tasks. However, no openly available implementation of this approach compatible with protein language models has been previously published. Thus, we adapt an open-source codebase designed for NLP models to build a pipeline for prompt tuning on protein sequence data, supporting the protein language models ProtGPT2 and RITA. We benchmark this implementation for generating proteins of a specific family and evaluate the approach using text processing metrics as well as family membership prediction and protein activity prediction of generated sequences. Our results confirm the advantages of prompt tuning in resource usage, especially storage, encouraging further research and expansion of this technique to related use cases. For our evaluated use case, prompt tuning does not match finetuning in the quality of generated protein sequences, indicating the need for more extensive optimization. Lastly, we observe discrepancies between results of similar evaluation tools, highlighting open problems for principled assessment of protein sequence generation quality.
APA
Nathansen, A., Klein, K., Renard, B., Nowicka, M., & Bartoszewicz, J. M. (2024). Evaluating Tuning Strategies for Sequence Generation with Protein Language Models. Proceedings of the 18th Machine Learning in Computational Biology meeting, in Proceedings of Machine Learning Research 240:76-89. Available from https://proceedings.mlr.press/v240/nathansen24a.html.