
Dating the Common Ancestor from an NCBI Tree of 83688 High-Quality and Full-Length SARS-CoV-2 Genomes

dc.contributor.author: Xia, Xuhua
dc.date.accessioned: 2022-01-04T14:42:34Z
dc.date.available: 2022-01-04T14:42:34Z
dc.date.issued: 2021
dc.description.abstract: All dating studies involving SARS-CoV-2 are problematic. Previous studies have dated the most recent common ancestor (MRCA) between SARS-CoV-2 and its close relatives from bats and pangolins. However, the evolutionary rate thus derived is expected to differ from the rate estimated from sequence divergence of SARS-CoV-2 lineages. Here, I present dating results for the first time from a large phylogenetic tree with 86,582 high-quality full-length SARS-CoV-2 genomes. The tree contains 83,688 genomes with full specification of collection time. Such a large tree spanning a period of about 1.5 years offers an excellent opportunity for dating the MRCA of the sampled SARS-CoV-2 genomes. The MRCA is dated 16 August 2019, with the evolutionary rate estimated to be 0.05526 mutations/genome/day. The Pearson correlation coefficient (r) between the root-to-tip distance (D) and the collection time (T) is 0.86295. The NCBI tree also includes 10 SARS-CoV-2 genomes isolated from cats, collected over roughly the same time span as human COVID-19 infection. The MRCA from these cat-derived SARS-CoV-2 is dated 30 July 2019, with r = 0.98464. While the dating method is well known, I have included detailed illustrations so that anyone can repeat the analysis and obtain the same dating results. With 16 August 2019 as the date of the MRCA of sampled SARS-CoV-2 genomes, archived samples from respiratory or digestive tracts collected around or before 16 August 2019, or those that are not descendants of the existing SARS-CoV-2 lineages, should be particularly valuable for tracing the origin of SARS-CoV-2. [en_US]
dc.description.sponsorship: NSERC [en_US]
dc.identifier.doi: 10.3390/v13091790 [en_US]
dc.identifier.issn: 1999-4915 [en_US]
dc.identifier.uri: http://hdl.handle.net/10393/43074
dc.identifier.uri: https://doi.org/10.20381/ruor-27291
dc.language.iso: en [en_US]
dc.rights: Attribution-NoDerivatives 4.0 International [*]
dc.rights.uri: http://creativecommons.org/licenses/by-nd/4.0/ [*]
dc.subject: COVID-19 [en_US]
dc.subject: SARS-CoV-2 [en_US]
dc.subject: most recent common ancestor [en_US]
dc.subject: phylogeny [en_US]
dc.subject: tip dating [en_US]
dc.subject: tip rooting [en_US]
dc.subject: viral evolution [en_US]
dc.subject: Animals [en_US]
dc.subject: COVID-19 [en_US]
dc.subject: Evolution, Molecular [en_US]
dc.subject: Humans [en_US]
dc.subject: Phylogeny [en_US]
dc.subject: SARS-CoV-2 [en_US]
dc.subject: Genome, Viral [en_US]
dc.subject: Genomics [en_US]
dc.title: Dating the Common Ancestor from an NCBI Tree of 83688 High-Quality and Full-Length SARS-CoV-2 Genomes [en_US]
dc.type: Article [en_US]
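
The abstract above reports an evolutionary rate, an MRCA date, and a Pearson correlation between root-to-tip distance (D) and collection time (T). As a rough illustration of how those three quantities relate, the sketch below fits an ordinary least-squares line D = a + b*T: the slope b is the rate, and the time at which the fitted D reaches zero (T = -a/b) is the estimated root time. This is a generic root-to-tip regression under the assumption of a simple linear fit and may not match the article's exact procedure. The function name date_mrca, the reference date, and the toy data are all hypothetical and are not taken from the article.

```python
from datetime import date, timedelta


def date_mrca(collection_days, root_to_tip):
    """Fit D = a + b*T by least squares.

    Returns (rate b, root time -a/b in days, Pearson r).
    """
    n = len(collection_days)
    mean_t = sum(collection_days) / n
    mean_d = sum(root_to_tip) / n
    s_tt = sum((t - mean_t) ** 2 for t in collection_days)
    s_dd = sum((d - mean_d) ** 2 for d in root_to_tip)
    s_td = sum(
        (t - mean_t) * (d - mean_d)
        for t, d in zip(collection_days, root_to_tip)
    )
    rate = s_td / s_tt                   # slope: mutations/genome/day
    intercept = mean_d - rate * mean_t   # fitted D at T = 0
    t_root = -intercept / rate           # T at which fitted D is zero
    r = s_td / (s_tt * s_dd) ** 0.5      # Pearson correlation of D and T
    return rate, t_root, r


# Hypothetical toy data (NOT from the article): collection times in days
# after an arbitrary reference date, with distances placed exactly on a
# line with rate 0.05 and a root 100 days before the reference.
REFERENCE = date(2020, 1, 1)
days = [0, 100, 200, 300, 400]
dists = [0.05 * (t + 100) for t in days]

rate, t_root, r = date_mrca(days, dists)
mrca_date = REFERENCE + timedelta(days=round(t_root))
print(rate, t_root, r, mrca_date)
```

With real data, D would be measured on a rooted tree and T taken from the genomes' collection dates. A value of r well below 1 indicates scatter around the fitted line and therefore more uncertainty in the extrapolated root date.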

Files

Original bundle

Name: 2021Viruses_dating.pdf
Size: 2.83 MB
Format: Adobe Portable Document Format
Description: Developed and applied a new method to identify the location and time of origin of SARS-CoV-2

License bundle

Name: license.txt
Size: 4.92 KB
Format: Item-specific license agreed upon to submission