Skip to main navigation Skip to search Skip to main content

Efficient Algorithms for Counting and ReportingSegregating Sites in Genomic Sequences

  • Manolis Christodoulakis
  • , G. Brian Golding
  • , Costas S. Iliopoulos
  • , Yoan José Pinzón Ardila
  • , William F. Smyth
  • King's College London
  • McMaster University
  • Universidad Nacional de Colombia
  • Curtin University

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

The number of segregating sites provides an indicator of the degree of DNA sequence variation that is present in a sample, and has been of great interest to the biological, pharmaceutical and medical professions. In this paper, we first provide linear- and expected-sublinear-time algorithms for finding all the segregating sites of a given set of DNA sequences. We also describe a data structure for tracking segregating sites in a set of sequences, such that every time the set is updated with the insertion of a new sequence or removal of an existing one, the segregating sites are updated accordingly without the need to re-scan the entire set of sequences.

Original languageEnglish
Pages (from-to)1001-1010
Number of pages10
JournalJournal of Computational Biology
Volume14
Issue number7
DOIs
StatePublished - Sep 2007
Externally publishedYes

Keywords

  • Segregating sites
  • Single nucleotide polymorphisms (SNPs)

Fingerprint

Dive into the research topics of 'Efficient Algorithms for Counting and ReportingSegregating Sites in Genomic Sequences'. Together they form a unique fingerprint.

Cite this