Evaluation of computational genotyping of structural variation for clinical diagnoses.

Title: Evaluation of computational genotyping of structural variation for clinical diagnoses.
Publication Type: Journal Article
Year of Publication: 2019
Authors: Chander V, Gibbs RA, Sedlazeck FJ
Journal: Gigascience
Volume: 8
Issue: 9
Date Published: 2019 Sep 01
ISSN: 2047-217X
Keywords: Arthritis, Computer Simulation, Deafness, Genomic Structural Variation, Genotype, Humans, Polychondritis, Relapsing, Software
Abstract

BACKGROUND: Structural variation (SV) plays a pivotal role in genetic disease. The discovery of SVs based on short DNA sequence reads from next-generation DNA sequence methods is error-prone, with low sensitivity and high false discovery rates. These shortcomings can be partially overcome with extensive orthogonal validation methods or use of long reads, but the current cost precludes their application for routine clinical diagnostics. In contrast, SV genotyping of known sites of SV occurrence is relatively robust and therefore offers a cost-effective clinical diagnostic tool with potentially few false-positive and false-negative results, even when applied to short-read DNA sequence data.

RESULTS: We assess 5 state-of-the-art SV genotyping software methods, applied to short-read sequence data. The methods are characterized on the basis of their ability to genotype different SV types, spanning different size ranges. Furthermore, we analyze their ability to parse different VCF file subformats and assess their reliance on specific metadata. We compare the SV genotyping methods across a range of simulated and real data, including SVs that were not found with Illumina data alone. We assess sensitivity and the ability to filter initial false discovery calls. We determine the impact of SV type and size on the performance of each SV genotyper. Overall, STIX performed the best on both simulated and GiaB-based SV calls, demonstrating a good balance between sensitivity and specificity.

CONCLUSION: Our results indicate that, although SV genotyping software methods have superior performance to SV callers, there are limitations that suggest the need for further innovation.

DOI: 10.1093/gigascience/giz110
Alternate Journal: Gigascience
PubMed ID: 31494671
PubMed Central ID: PMC6732172
Grant List: UM1 HG008898 / HG / NHGRI NIH HHS / United States
