Kmer-SSR: A fast and exhaustive SSR search algorithm

Brandon D. Pickett, Justin B. Miller, Perry G. Ridge

Research output: Contribution to journalArticlepeer-review

18 Scopus citations

Abstract

Motivation: One of the main challenges with bioinformatics software is that the size and complexity of datasets necessitate trading speed for accuracy, or completeness. To combat this problem of computational complexity, a plethora of heuristic algorithms have arisen that report a 'good enough' solution to biological questions. However, in instances such as Simple Sequence Repeats (SSRs), a 'good enough' solution may not accurately portray results in population genetics, phylogenetics and forensics, which require accurate SSRs to calculate intra- and inter-species interactions. Results: We present Kmer-SSR, which finds all SSRs faster than most heuristic SSR identification algorithms in a parallelized, easy-to-use manner. The exhaustive Kmer-SSR option has 100% precision and 100% recall and accurately identifies every SSR of any specified length. To identify more biologically pertinent SSRs, we also developed several filters that allow users to easily view a subset of SSRs based on user input. Kmer-SSR, coupled with the filter options, accurately and intuitively identifies SSRs quickly and in a more user-friendly manner than any other SSR identification algorithm.

Original languageEnglish
Pages (from-to)3922-3928
Number of pages7
JournalBioinformatics
Volume33
Issue number24
DOIs
StatePublished - Dec 15 2017

Bibliographical note

Funding Information:
This work has been supported by funds provided by Brigham Young University and the Department of Biology.

Publisher Copyright:
© The Author 2017. Published by Oxford University Press. All rights reserved.

ASJC Scopus subject areas

  • Statistics and Probability
  • Biochemistry
  • Molecular Biology
  • Computer Science Applications
  • Computational Theory and Mathematics
  • Computational Mathematics

Fingerprint

Dive into the research topics of 'Kmer-SSR: A fast and exhaustive SSR search algorithm'. Together they form a unique fingerprint.

Cite this