Bidirectional Molecule Generation with Recurrent Neural Networks

Francesca Grisoni, Michael Moret, Robin Lingwood, Gisbert Schneider

Onderzoeksoutput: Bijdrage aan tijdschriftTijdschriftartikelAcademicpeer review

33 Citaten (Scopus)


Recurrent neural networks (RNNs) are able to generate de novo molecular designs using simplified molecular input line entry systems (SMILES) string representations of the chemical structure. RNN-based structure generation is usually performed unidirectionally, by growing SMILES strings from left to right. However, there is no natural start or end of a small molecule, and SMILES strings are intrinsically nonunivocal representations of molecular graphs. These properties motivate bidirectional structure generation. Here, bidirectional generative RNNs for SMILES-based molecule design are introduced. To this end, two established bidirectional methods were implemented, and a new method for SMILES string generation and data augmentation is introduced-the bidirectional molecule design by alternate learning (BIMODAL). These three bidirectional strategies were compared to the unidirectional forward RNN approach for SMILES string generation, in terms of the (i) novelty, (ii) scaffold diversity, and (iii) chemical-biological relevance of the computer-generated molecules. The results positively advocate bidirectional strategies for SMILES-based molecular de novo design, with BIMODAL showing superior results to the unidirectional forward RNN for most of the criteria in the tested conditions. The code of the methods and the pretrained models can be found at URL

Originele taal-2Engels
Pagina's (van-tot)1175-1183
Aantal pagina's9
TijdschriftJournal of Chemical Information and Modeling
Nummer van het tijdschrift3
StatusGepubliceerd - 23 mrt. 2020
Extern gepubliceerdJa


Duik in de onderzoeksthema's van 'Bidirectional Molecule Generation with Recurrent Neural Networks'. Samen vormen ze een unieke vingerafdruk.

Citeer dit