Cookies on this website

We use cookies to ensure that we give you the best experience on our website. If you click 'Accept all cookies' we'll assume that you are happy to receive all cookies and you won't see this message again. If you click 'Reject all non-essential cookies' only necessary cookies providing core functionality such as security, network management, and accessibility will be enabled. Click 'Find out more' for information on how to change your cookie settings.

BackgroundAlthough rare missense variants in Mendelian disease genes often cluster in specific regions of proteins, it is unclear how to consider this when evaluating the pathogenicity of a gene or variant. Here we introduce methods for gene association and variant interpretation that use this powerful signal.MethodsWe present statistical methods to detect missense variant clustering (BIN-test) combined with burden information (ClusterBurden). We introduce a flexible generalised additive modelling (GAM) framework to identify mutational hotspots using burden and clustering information (hotspotmodel) and supplemented by in silico predictors (hotspot+model). The methods were applied to synthetic data and a case–control dataset, comprising 5338 hypertrophic cardiomyopathy patients and 125 748 population reference samples over 34 putative cardiomyopathy genes.ResultsIn simulations, theBIN-testwas almost twice as powerful as the Anderson-Darling or Kolmogorov-Smirnov tests;ClusterBurdenwas computationally faster and more powerful than alternative position-informed methods. For 6/8 sarcomeric genes with strong clustering,Clusterburdenshowed enhanced power over burden-alone, equivalent to increasing the sample size by 50%.Hotspot+models that combine burden, clustering and in silico predictors outperform generic pathogenicity predictors and effectively integrate ACMG criteria PM1 and PP3 to yield strong or moderate evidence of pathogenicity for 31.8% of examined variants of uncertain significance.ConclusionGAMs represent a unified statistical modelling framework to combine burden, clustering and functional information.Hotspotmodels can refine maps of regional burden andhotspot+models can be powerful predictors of variant pathogenicity. TheBIN-testis a fast powerful approach to detect missense variant clustering that when combined with burden information (ClusterBurden) may enhance disease-gene discovery.

Original publication

DOI

10.1136/jmedgenet-2020-106922

Type

Journal

Journal of Medical Genetics

Publisher

BMJ

Publication Date

08/2021

Volume

58

Pages

556 - 564