Academic Article

massNet: integrated processing and classification of spatially resolved mass spectrometry data using deep learning for rapid tumor delineation.
Document Type
Article
Source
Bioinformatics. 4/1/2022, Vol. 38 Issue 7, p2015-2021. 7p.
Subject
*DEEP learning
*MASS spectrometry
*BIOLOGICAL classification
*DATA libraries
*SUPPORT vector machines
*MACHINE learning
Language
English
ISSN
1367-4803
Abstract
Motivation: Mass spectrometry imaging (MSI) provides rich biochemical information in a label-free manner and therefore holds promise to substantially impact current practice in disease diagnosis. However, the complex nature of MSI data poses computational challenges for its analysis. The complexity arises from the data's large size, high dimensionality and spectral nonlinearity. Preprocessing, including peak picking, has been used to reduce raw data complexity; however, peak picking is sensitive to parameter selection, which can prematurely shape the downstream analysis for tissue classification and the ensuing biological interpretation.
Results: We propose a deep learning model, massNet, that provides the desired qualities of scalability, nonlinearity and speed in MSI data analysis. The model was used, without prior preprocessing or peak picking, to classify MSI data from a mouse brain harboring a patient-derived tumor. The massNet architecture enabled automatic learning of predictive features, and automated methods were incorporated to identify peaks with potential for tumor delineation. The model's performance was assessed using cross-validation, and the results demonstrate higher accuracy and a substantial gain in speed compared to an established classical machine learning method, the support vector machine.
Availability and implementation: https://github.com/wabdelmoula/massNet. The data underlying this article are available in the NIH Common Fund's National Metabolomics Data Repository (NMDR) Metabolomics Workbench under project ID PR001292, http://dx.doi.org/10.21228/M8Q70T.
Supplementary information: Supplementary data are available at Bioinformatics online. [ABSTRACT FROM AUTHOR]
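Illustrative note (not part of the catalog record or the article): the abstract states that performance was assessed using cross-validation. The sketch below shows only the general shape of a k-fold cross-validation loop on per-pixel spectra. Everything in it is an assumption made for illustration: the synthetic data, the function names, the choice of five folds, and the nearest-centroid classifier, which stands in for a real model and is neither massNet nor a support vector machine.

```python
import random


def make_synthetic_spectra(n_per_class=40, n_bins=50, seed=0):
    """Hypothetical data: two classes of spectra that differ at one m/z bin."""
    rng = random.Random(seed)
    spectra, labels = [], []
    for label in (0, 1):
        for _ in range(n_per_class):
            spectrum = [rng.random() for _ in range(n_bins)]
            if label == 1:
                spectrum[10] += 2.0  # class 1 ("tumor") carries an extra peak
            spectra.append(spectrum)
            labels.append(label)
    return spectra, labels


def fit_centroids(X, y):
    """Compute the mean spectrum of each class (stand-in for model training)."""
    sums, counts = {}, {}
    for spectrum, label in zip(X, y):
        acc = sums.setdefault(label, [0.0] * len(spectrum))
        for i, value in enumerate(spectrum):
            acc[i] += value
        counts[label] = counts.get(label, 0) + 1
    return {lab: [v / counts[lab] for v in acc] for lab, acc in sums.items()}


def predict(centroids, spectrum):
    """Assign the class whose mean spectrum is closest in squared distance."""
    def sqdist(a, b):
        return sum((u - v) ** 2 for u, v in zip(a, b))
    return min(centroids, key=lambda lab: sqdist(centroids[lab], spectrum))


def cross_validate(X, y, k=5, seed=0):
    """Return the held-out accuracy of each of k folds."""
    indices = list(range(len(X)))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]  # k disjoint index sets
    accuracies = []
    for held_out in folds:
        held = set(held_out)
        train = [i for i in indices if i not in held]
        centroids = fit_centroids([X[i] for i in train], [y[i] for i in train])
        correct = sum(predict(centroids, X[i]) == y[i] for i in held_out)
        accuracies.append(correct / len(held_out))
    return accuracies


X, y = make_synthetic_spectra()
scores = cross_validate(X, y)
mean_accuracy = sum(scores) / len(scores)
print("per-fold accuracy:", scores)
print("mean accuracy:", mean_accuracy)
```

Each spectrum is held out exactly once, so the mean of the per-fold accuracies estimates performance on unseen pixels. For real MSI data, folds would more plausibly be split by tissue section or sample than by individual pixel, since neighboring pixels are correlated; the article itself should be consulted for the scheme actually used.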