Genomic distances under deletions and insertions

Research output: Contribution to journalArticlepeer-review

15 Scopus citations

Abstract

As more and more genomes are sequenced, evolutionary biologists are becoming increasingly interested in evolution at the level of whole genomes, in scenarios in which the genome evolves through insertions, deletions, and movements of genes along its chromosomes. In the mathematical model pioneered by Sankoff and others, a unichromosomal genome is represented by a signed permutation of a multi-set of genes; Hannenhalli and Pevzncr showed that the edit distance between two signed permutations of the same set can be computed in polynomial time when all operations are inversions. El-Mabrouk extended that result to allow deletions and a limited form of insertions (which forbids duplications). In this paper we extend El-Mabrouk's work to handle duplications as well as insertions and present an alternate framework for computing (near) minimal edit sequences involving insertions, deletions, and inversions. We derive an error bound for our polynomial-time distance computation under various assumptions and present preliminary experimental results that suggest that performance in practice may be excellent, within a few percent of the actual distance.

Original languageEnglish
Pages (from-to)537-547
Number of pages11
JournalLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume2697
DOIs
StatePublished - 2003

Funding

This work is supported by the National Science Foundation under grants ACI 00-81404, DEB 01-20709, EIA 01-13095, EIA 01-21377, EIA 02-03584, and EF 03-31654. The first two authors would also like to thank Linda, Sarah, and Sage for their editorial assistance.

FundersFunder number
National Science Foundation Arctic Social Science ProgramEF 03-31654, EIA 02-03584, ACI 00-81404, DEB 01-20709, EIA 01-13095, EIA 01-21377
National Science Foundation Arctic Social Science Program

    Keywords

    • Genomic distance
    • Hannenhalli-Pevzner
    • Inversion distance
    • Reversal distance

    ASJC Scopus subject areas

    • Theoretical Computer Science
    • General Computer Science

    Fingerprint

    Dive into the research topics of 'Genomic distances under deletions and insertions'. Together they form a unique fingerprint.

    Cite this