Repository logo

Scaffold filling, contig fusion and comparative gene order inference

dc.contributor.authorMuñoz, Adriana
dc.contributor.authorZheng, Chunfang
dc.contributor.authorZhu, Qian
dc.contributor.authorAlbert, Victor A
dc.contributor.authorRounsley, Steve
dc.contributor.authorSankoff, David
dc.date.accessioned2015-12-18T10:52:08Z
dc.date.available2015-12-18T10:52:08Z
dc.date.issued2010-06-04
dc.date.updated2015-12-18T10:52:08Z
dc.description.abstractAbstract Background There has been a trend in increasing the phylogenetic scope of genome sequencing without finishing the sequence of the genome. Increasing numbers of genomes are being published in scaffold or contig form. Rearrangement algorithms, however, including gene order-based phylogenetic tools, require whole genome data on gene order or syntenic block order. How then can we use rearrangement algorithms to compare genomes available in scaffold form only? Can the comparative evidence predict the location of unsequenced genes? Results Our method involves optimally filling in genes missing from the scaffolds, while incorporating the augmented scaffolds directly into the rearrangement algorithms as if they were chromosomes. This is accomplished by an exact, polynomial-time algorithm. We then correct for the number of extra fusion/fission operations required to make scaffolds comparable to full assemblies. We model the relationship between the ratio of missing genes actually absent from the genome versus merely unsequenced ones, on one hand, and the increase of genomic distance after scaffold filling, on the other. We estimate the parameters of this model through simulations and by comparing the angiosperm genomes Ricinus communis and Vitis vinifera. Conclusions The algorithm solves the comparison of genomes with 18,300 genes, including 4500 missing from one genome, in less than a minute on a MacBook, putting virtually all genomes within range of the method.
dc.identifier.citationBMC Bioinformatics. 2010 Jun 04;11(1):304
dc.identifier.urihttp://dx.doi.org/10.1186/1471-2105-11-304
dc.identifier.urihttp://hdl.handle.net/10393/33471
dc.language.rfc3066en
dc.rights.holderMuñoz et al.
dc.titleScaffold filling, contig fusion and comparative gene order inference
dc.typeJournal Article

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail ImageThumbnail Image
Name:
12859_2009_Article_3761.pdf
Size:
675.64 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail ImageThumbnail Image
Name:
license.txt
Size:
4.92 KB
Format:
Item-specific license agreed upon to submission
Description: