Determination of disease phenotypes and pathogenic variants from exome sequence data in the CAGI 4 gene panel challenge.

Title: Determination of disease phenotypes and pathogenic variants from exome sequence data in the CAGI 4 gene panel challenge.
Publication Type: Journal Article
Year of Publication: 2017
Authors: Kundu, K, Pal, LR, Yin, Y, Moult, J
Journal: Hum Mutat
Date Published: 2017 May 12
ISSN: 1098-1004
Abstract

The use of gene panel sequence for diagnostic and prognostic testing is now widespread, but there are so far few objective tests of methods to interpret these data. We describe the design and implementation of a gene panel sequencing data analysis pipeline (VarP) and its assessment in a CAGI4 community experiment. The method was applied to clinical gene panel sequencing data of 106 patients, with the goal of determining which of 14 disease classes each patient has and the corresponding causative variant(s). The disease class was correctly identified for 36 cases, including 10 where the original clinical pipeline did not find causative variants. For a further seven cases, we found strong evidence of an alternative disease to that tested. Many of the potentially causative variants are missense, with no previous association with disease, and these proved the hardest to correctly assign pathogenicity or otherwise. Post analysis showed that three-dimensional structure data could have helped for up to half of these cases. Over-reliance on HGMD annotation led to a number of incorrect disease assignments. We used a largely ad hoc method to assign probabilities of pathogenicity for each variant, and there is much work still to be done in this area.

DOI: 10.1002/humu.23249
Alternate Journal: Hum. Mutat.
PubMed ID: 28497567
Grant List: R01 GM104436 / GM / NIGMS NIH HHS / United States
R01 GM120364 / GM / NIGMS NIH HHS / United States