Academic article

ACES: a machine learning toolbox for clustering analysis and visualization.
Document Type
Article
Source
BMC Genomics. 12/27/2018, Vol. 19 Issue 1, p1-8. 8p. 2 Diagrams, 4 Graphs.
Subject
*MACHINE learning
*VISUALIZATION
*PHENOTYPES
*GENETICS of disease susceptibility
*RNA sequencing
Language
English
ISSN
1471-2164
Abstract
Background: Studies that aim to explain phenotypes or disease susceptibility by genetic or epigenetic variants often rely on clustering methods to stratify individuals or samples. While statistical associations may point to increased risk for certain parts of the population, the ultimate goal is to make precise predictions for each individual. This necessitates tools that allow for the rapid inspection of each data point, in particular to find explanations for outliers. Results: ACES is an integrative cluster and phenotype browser that implements standard clustering methods, as well as multiple visualization methods in which all sample information can be displayed quickly. In addition, ACES can automatically mine a list of phenotypes for cluster enrichment, whereby the number of clusters and their boundaries are estimated by a novel method. For visual data browsing, ACES provides a 2D or 3D PCA view or a heat map view. ACES is implemented in Java, with a focus on a user-friendly, interactive, graphical interface. Conclusions: ACES has proven to be an invaluable tool for analyzing large, pre-filtered DNA methylation data sets and RNA-Sequencing data, because it makes it easy to link molecular markers to complex phenotypes. The source code is available from https://github.com/GrabherrGroup/ACES. [ABSTRACT FROM AUTHOR]
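To illustrate what "mining a list of phenotypes for cluster enrichment" can mean in practice, the following is a minimal sketch, not the ACES implementation. The abstract does not state which statistical test ACES uses; a one-sided hypergeometric test is assumed here because it is a common choice for this kind of question. The function name `enrichment_p_value` and the toy data are hypothetical.

```python
from math import comb


def enrichment_p_value(cluster_labels, has_phenotype, cluster):
    """One-sided hypergeometric p-value for phenotype enrichment in a cluster.

    Assumption: this test is a stand-in; the abstract does not name the
    method ACES actually uses.

    cluster_labels: cluster assignment for each sample
    has_phenotype:  True/False per sample, same order as cluster_labels
    cluster:        the cluster label to test
    """
    total_samples = len(cluster_labels)
    total_with_phenotype = sum(1 for p in has_phenotype if p)
    cluster_size = sum(1 for c in cluster_labels if c == cluster)
    observed = sum(
        1 for c, p in zip(cluster_labels, has_phenotype) if c == cluster and p
    )

    # Probability of seeing at least `observed` phenotype carriers when
    # drawing `cluster_size` samples at random without replacement.
    denominator = comb(total_samples, cluster_size)
    upper = min(total_with_phenotype, cluster_size)
    numerator = sum(
        comb(total_with_phenotype, i)
        * comb(total_samples - total_with_phenotype, cluster_size - i)
        for i in range(observed, upper + 1)
    )
    return numerator / denominator


# Toy data (hypothetical): 8 samples in 2 clusters; 3 samples carry the phenotype.
labels = [0, 0, 0, 0, 1, 1, 1, 1]
phenotype = [True, True, True, False, False, False, False, False]

p_cluster_0 = enrichment_p_value(labels, phenotype, cluster=0)
p_cluster_1 = enrichment_p_value(labels, phenotype, cluster=1)
```

In this toy case all three phenotype carriers fall into cluster 0, so `p_cluster_0` is small and `p_cluster_1` is 1. Repeating the test for every phenotype and every cluster, with a multiple-testing correction, would give a ranked list of candidate associations to inspect visually.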