Academic Paper

Models for Similarity Distributions of Syntenic Homologs and Applications to Phylogenomics
Document Type
Periodical
Source
IEEE/ACM Transactions on Computational Biology and Bioinformatics (IEEE/ACM Trans. Comput. Biol. and Bioinf.). 16(3):727-737, Jun 2019
Subject
Bioengineering
Computing and Processing
Genomics
Bioinformatics
Fractionation
Gaussian distribution
Biological system modeling
Computational modeling
Analytical models
Gene tree
species tree
whole genome doubling
algorithms
mixture of distributions
Brassicaceae
Language
ISSN
1545-5963
1557-9964
2374-0043
Abstract
We outline an integrated approach to speciation and whole genome doubling (WGD) to resolve the occurrence of these events in phylogenetic analysis. We propose a more principled way of estimating the parameters of gene divergence and fractionation than the standard mixture of normals analysis. We formulate an algorithm for resolving data on local peaks in the distributions of duplicate gene similarities for a number of related genomes. We illustrate with a comprehensive analysis of WGD-origin duplicate gene data from the family Brassicaceae.
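
For orientation, the "standard mixture of normals analysis" that the abstract contrasts against can be sketched as follows. This is not the authors' proposed method: it is a minimal illustration of the baseline only, fitting a two-component normal mixture to duplicate-gene similarity values with plain expectation-maximization. The function names, the quartile-based initialization, and the synthetic similarity values are all assumptions made for the example.

```python
import math
import random


def normal_pdf(x, mean, sd):
    """Density of a normal distribution with the given mean and standard deviation."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def fit_two_normals(values, n_iter=200, min_sd=1e-3):
    """Fit a two-component normal mixture to 1-D data with plain EM.

    Returns a list of (weight, mean, sd) tuples sorted by mean.
    """
    data = sorted(values)
    n = len(data)
    # Crude initialization (an assumption): seed the means at the quartiles.
    means = [data[n // 4], data[(3 * n) // 4]]
    overall_mean = sum(data) / n
    overall_sd = math.sqrt(sum((x - overall_mean) ** 2 for x in data) / n)
    sds = [max(overall_sd, min_sd)] * 2
    weights = [0.5, 0.5]

    for _ in range(n_iter):
        # E-step: responsibility of each component for each data point.
        resp = []
        for x in data:
            dens = [weights[k] * normal_pdf(x, means[k], sds[k]) for k in range(2)]
            total = sum(dens) or 1e-300  # guard against underflow to zero
            resp.append([d / total for d in dens])
        # M-step: re-estimate weights, means and standard deviations.
        for k in range(2):
            nk = sum(r[k] for r in resp)
            if nk <= 0.0:
                continue  # leave an empty component unchanged
            weights[k] = nk / n
            means[k] = sum(r[k] * x for r, x in zip(resp, data)) / nk
            var = sum(r[k] * (x - means[k]) ** 2 for r, x in zip(resp, data)) / nk
            sds[k] = max(math.sqrt(var), min_sd)

    return sorted(zip(weights, means, sds), key=lambda c: c[1])


# Synthetic (hypothetical) similarities: an older event with a peak near 0.70
# and a more recent event with a peak near 0.90.
rng = random.Random(0)
similarities = [rng.gauss(0.70, 0.03) for _ in range(300)]
similarities += [rng.gauss(0.90, 0.02) for _ in range(500)]

for weight, mean, sd in fit_two_normals(similarities):
    print(f"weight={weight:.3f}  peak={mean:.3f}  sd={sd:.3f}")
```

In this baseline, each fitted component mean is read as the local peak associated with one speciation or WGD event, and the component weight as the share of duplicate pairs attributed to it. The number of components is fixed at two here purely for brevity.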