Repository logo

Post-Alignment Adjustment and Its Automation

dc.contributor.authorXia, Xuhua
dc.date.accessioned2022-01-04T14:42:20Z
dc.date.available2022-01-04T14:42:20Z
dc.date.issued2021
dc.description.abstractMultiple sequence alignment (MSA) is the basis for almost all sequence comparison and molecular phylogenetic inferences. Large-scale genomic analyses are typically associated with automated progressive MSA without subsequent manual adjustment, which itself is often error-prone because of the lack of a consistent and explicit criterion. Here, I outlined several commonly encountered alignment errors that cannot be avoided by progressive MSA for nucleotide, amino acid, and codon sequences. Methods that could be automated to fix such alignment errors were then presented. I emphasized the utility of position weight matrix as a new tool for MSA refinement and illustrated its usage by refining the MSA of nucleotide and amino acid sequences. The main advantages of the position weight matrix approach include (1) its use of information from all sequences, in contrast to other commonly used methods based on pairwise alignment scores and inconsistency measures, and (2) its speedy computation, making it suitable for a large number of long viral genomic sequences.en_US
dc.description.sponsorshipNSERCen_US
dc.identifier.doi10.3390/genes12111809en_US
dc.identifier.issn2073-4425en_US
dc.identifier.urihttp://hdl.handle.net/10393/43073
dc.identifier.urihttps://doi.org/10.20381/ruor-27290
dc.language.isoenen_US
dc.rightsAttribution-NoDerivatives 4.0 International*
dc.rights.urihttp://creativecommons.org/licenses/by-nd/4.0/*
dc.subjectPWMen_US
dc.subjectautomationen_US
dc.subjectcodon-based alignmenten_US
dc.subjectinconsistencyen_US
dc.subjectphylogeneticsen_US
dc.subjectposition weight matrixen_US
dc.subjectsequence alignmenten_US
dc.subjectsum-of-pairs scoreen_US
dc.titlePost-Alignment Adjustment and Its Automationen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail ImageThumbnail Image
Name:
2021Genes_post-alignment.pdf
Size:
2.11 MB
Format:
Adobe Portable Document Format
Description:
Improve sequence alignment quality to improve comparative sequence analysis and molecular phylogenetics

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail ImageThumbnail Image
Name:
license.txt
Size:
4.92 KB
Format:
Item-specific license agreed upon to submission
Description: