Improving the Efficiency and the Validity of Molecular Transformers

  • Leone Bacciu
  • , Matteo Grazioso
  • , Silvia Multari
  • , Francesca Grisoni
  • , Angelica Mazzolari
  • , Marco S. Nobile

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

Abstract

Since their advent, Transformer models have been applied across a wide range of fields, including cheminformatics. In this context, drug discovery has benefited from using Molecular Transformers by leveraging diverse string representations of molecules, such as the Simplified Molecular Input Line Entry Systems (SMILES), for a variety of tasks. In this study, we present a model focused on the optimization of a formerly developed Molecular Transformer specifically dedicated to metabolism prediction. Metabolism refers to all the biotransformations a drug undergoes once inside the human body, directly influencing its therapeutic effect and potential toxicity, and therefore represents a key topic in medicinal chemistry. Framing molecular transformation prediction as a sequence-to-sequence translation task has shown promise, but suffers from limitations such as low validity of generated molecules and high computational cost. To address this limitation, we here propose an optimized model that integrates pre-training, transfer learning, and fine-tuning techniques, already improving validity and reducing computation time. Finally, by separating the metabolism prediction task from the SMILES syntax learning, we ensure broader applicability of the proposed model across diverse datasets and a variety of SMILES-based tasks beyond metabolic transformations, expanding its potential utility.

Original languageEnglish
Title of host publication2025 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB 2025
PublisherInstitute of Electrical and Electronics Engineers
Number of pages9
ISBN (Electronic)9798331502669
DOIs
Publication statusPublished - 30 Sept 2025
Event2025 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB 2025 - Tainan, Taiwan
Duration: 20 Aug 202522 Aug 2025

Conference

Conference2025 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB 2025
Country/TerritoryTaiwan
CityTainan
Period20/08/2522/08/25

Bibliographical note

Publisher Copyright:
© 2025 IEEE.

Keywords

  • Biochemical reactions
  • Drug discovery
  • Metabolism
  • Transformers

Fingerprint

Dive into the research topics of 'Improving the Efficiency and the Validity of Molecular Transformers'. Together they form a unique fingerprint.

Cite this