Abstract
The number of segregating sites provides an indicator of the degree of DNA sequence variation that is present in a sample, and has been of great interest to the biological, pharmaceutical and medical professions. In this paper, we first provide linear- and expected-sublinear-time algorithms for finding all the segregating sites of a given set of DNA sequences. We also describe a data structure for tracking segregating sites in a set of sequences, such that every time the set is updated with the insertion of a new sequence or removal of an existing one, the segregating sites are updated accordingly without the need to re-scan the entire set of sequences.
| Original language | English |
|---|---|
| Pages (from-to) | 1001-1010 |
| Number of pages | 10 |
| Journal | Journal of Computational Biology |
| Volume | 14 |
| Issue number | 7 |
| DOIs | |
| State | Published - Sep 2007 |
| Externally published | Yes |
Keywords
- Segregating sites
- Single nucleotide polymorphisms (SNPs)
Fingerprint
Dive into the research topics of 'Efficient Algorithms for Counting and ReportingSegregating Sites in Genomic Sequences'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver