학술논문

ReactionCode: format for reaction searching, analysis, classification, transform, and encoding/decoding
Document Type
article
Source
Journal of Cheminformatics, Vol 12, Iss 1, Pp 1-13 (2020)
Subject
ReactionCode
Reaction
Encoding
Decoding
Searching
Classification
Information technology
T58.5-58.64
Chemistry
QD1-999
Language
English
ISSN
1758-2946
Abstract
Abstract In the past two decades a lot of different formats for molecules and reactions have been created. These formats were mostly developed for the purposes of identifiers, representation, classification, analysis and data exchange. A lot of efforts have been made on molecule formats but only few for reactions where the endeavors have been made mostly by companies leading to proprietary formats. Here, we present ReactionCode: a new open-source format that allows one to encode and decode a reaction into multi-layer machine readable code, which aggregates reactants and products into a condensed graph of reaction (CGR). This format is flexible and can be used in a context of reaction similarity searching and classification. It is also designed for database organization, machine learning applications and as a new transform reaction language.