A branching process for homology distribution-based inference of polyploidy, speciation and loss
Loading...
Date
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
Abstract
Background
The statistical distribution of the similarity or difference between pairs of paralogous genes, created by whole genome doubling, or between pairs of orthologous genes in two related species is an important source of information about genomic evolution, especially in plants.
Methods
We derive the mixture of distributions of sequence similarity for duplicate gene pairs generated by repeated episodes of whole gene doubling. This involves integrating sequence divergence and gene pair loss through fractionation, using a branching process and a mutational model. We account not only for the timing of these events in terms of local modes, but also the amplitude and variance of the component distributions. This model is then extended to orthologous gene pairs.
Results
We apply the model and inference procedures to the evolution of the Solanaceae, focusing on the genomes of economically important crops. We assess how consistent or variable fractionation rates are from species to species and over time.
Description
Keywords
Citation
Algorithms for Molecular Biology. 2019 Aug 01;14(1):18
