MaSC: mappability-sensitive cross-correlation for estimating mean fragment length of single-end short-read sequencing data
| Field | Value | Language |
| --- | --- | --- |
| dc.contributor.author | Ramachandran, Parameswaran | |
| dc.contributor.author | Palidwor, Gareth A. | |
| dc.contributor.author | Porter, Christopher J. | |
| dc.contributor.author | Perkins, Theodore J. | |
| dc.date.accessioned | 2013-04-30T14:14:46Z | |
| dc.date.available | 2013-04-30T14:14:46Z | |
| dc.date.created | 2013 | |
| dc.date.issued | 2013-04-30 | |
| dc.description.abstract | Motivation: Reliable estimation of the mean fragment length for next-generation short-read sequencing data is an important step in next-generation sequencing analysis pipelines, most notably because of its impact on the accuracy of the enriched regions identified by peak-calling algorithms. Although many peak-calling algorithms include a fragment-length estimation subroutine, the problem has not been adequately solved, as demonstrated by the variability of the estimates returned by different algorithms. Results: In this article, we investigate the use of strand cross-correlation to estimate mean fragment length of single-end data and show that traditional estimation approaches have mixed reliability. We observe that the mappability of different parts of the genome can introduce an artificial bias into cross-correlation computations, resulting in incorrect fragment-length estimates. We propose a new approach, called mappability-sensitive cross-correlation (MaSC), which removes this bias and allows for accurate and reliable fragment-length estimation. We analyze the computational complexity of this approach, and evaluate its performance on a test suite of NGS datasets, demonstrating its superiority to traditional cross-correlation analysis. Availability: An open-source Perl implementation of our approach is available at http://www.perkinslab.ca/Software.html. | |
| dc.identifier.doi | 10.1093/bioinformatics/btt001 | |
| dc.identifier.uri | http://hdl.handle.net/10393/24088 | |
| dc.identifier.uri | http://bioinformatics.oxfordjournals.org/content/29/4/444.full | |
| dc.language.iso | en | |
| dc.title | MaSC: mappability-sensitive cross-correlation for estimating mean fragment length of single-end short-read sequencing data | |
| dc.type | Article |
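
The abstract describes estimating fragment length from strand cross-correlation and removing a bias caused by mappability. The following is a minimal Python sketch of that general idea, not the authors' Perl implementation. The function names, the single per-position mappability mask, the shift convention, and the toy data are all assumptions made for illustration; the published method may define these details differently.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (0.0 if degenerate)."""
    n = len(xs)
    if n < 2:
        return 0.0
    mx = sum(xs) / n
    my = sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / sqrt(vx * vy)


def strand_cross_correlation(fwd, rev, shift, mappable=None):
    """Correlate forward-strand counts at i with reverse-strand counts at i + shift.

    If `mappable` (hypothetical per-position boolean mask) is given, only
    position pairs where both i and i + shift are mappable contribute.
    """
    xs, ys = [], []
    for i in range(len(fwd) - shift):
        j = i + shift
        if mappable is not None and not (mappable[i] and mappable[j]):
            continue
        xs.append(fwd[i])
        ys.append(rev[j])
    return pearson(xs, ys)


def estimate_fragment_length(fwd, rev, max_shift, mappable=None):
    """Return the shift in 0..max_shift with the highest cross-correlation."""
    scores = [
        strand_cross_correlation(fwd, rev, d, mappable)
        for d in range(max_shift + 1)
    ]
    return max(range(max_shift + 1), key=scores.__getitem__)


# Toy data: three forward-strand pileups, each mirrored 5 positions
# downstream on the reverse strand.
n = 80
fwd = [0] * n
rev = [0] * n
for pos, count in [(10, 3), (30, 5), (50, 2)]:
    fwd[pos] = count
    rev[pos + 5] = count

# Hypothetical mask marking positions 40..44 as unmappable.
mappable = [True] * n
for k in range(40, 45):
    mappable[k] = False

naive_estimate = estimate_fragment_length(fwd, rev, max_shift=20)
masked_estimate = estimate_fragment_length(fwd, rev, max_shift=20, mappable=mappable)
print(naive_estimate, masked_estimate)
```

On this clean toy input both variants agree. The mask only changes the result when unmappable regions distort the per-strand counts, which is the situation the abstract describes.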
Files
Original bundle
- Name: Ramachandran_Parameswaran_2013_MaSC_mappabilty-sensitive_cross-correlation.pdf
- Size: 242.38 KB
- Format: Adobe Portable Document Format
License bundle
- Name: license.txt
- Size: 4.84 KB
- Format: Item-specific license agreed upon to submission