An efficient algorithm for the extraction of HGVS variant descriptions from sequences

Jonathan K. Vis, Martijn Vermaat, Peter E.M. Taschner, Joost N. Kok, Jeroen F.J. Laros

Research output: Contribution to journalArticleAcademicpeer-review

24 Citations (Scopus)

Abstract

Motivation: Unambiguous sequence variant descriptions are important in reporting the outcome of clinical diagnostic DNA tests. The standard nomenclature of the Human Genome Variation Society (HGVS) describes the observed variant sequence relative to a given reference sequence. We propose an efficient algorithm for the extraction of HGVS descriptions from two sequences with three main requirements in mind: minimizing the length of the resulting descriptions, minimizing the computation time and keeping the unambiguous descriptions biologically meaningful. Results: Our algorithm is able to compute the HGVS descriptions of complete chromosomes or other large DNA strings in a reasonable amount of computation time and its resulting descriptions are relatively small. Additional applications include updating of gene variant database contents and reference sequence liftovers.

Original languageEnglish
Pages (from-to)3751-3757
Number of pages7
JournalBioinformatics
Volume31
Issue number23
DOIs
Publication statusPublished - 18 Jun 2015
Externally publishedYes

Bibliographical note

Publisher Copyright:
© The Author 2015. Published by Oxford University Press. All rights reserved.

Fingerprint

Dive into the research topics of 'An efficient algorithm for the extraction of HGVS variant descriptions from sequences'. Together they form a unique fingerprint.

Cite this