Cookies on this website

We use cookies to ensure that we give you the best experience on our website. If you click 'Accept all cookies' we'll assume that you are happy to receive all cookies and you won't see this message again. If you click 'Reject all non-essential cookies' only necessary cookies providing core functionality such as security, network management, and accessibility will be enabled. Click 'Find out more' for information on how to change your cookie settings.

Biological datasets often consist of thousands or millions of variables, e.g. genetic variants or biomarkers, and when sample sizes are large it is common to find many associated with an outcome of interest, for example, disease risk in a GWAS, at high levels of statistical significance, but with very small effects. The False Discovery Rate (FDR) is used to identify effects of interest based on ranking variables according to their statistical significance. Here, we develop a complementary measure to the FDR, the priorityFDR, that ranks variables by a combination of effect size and significance, allowing further prioritisation among a set of variables that pass a significance or FDR threshold. Applying to the largest GWAS of type 1 diabetes to date (15,573 cases and 158,408 controls), we identified 26 independent genetic associations, including two newly-reported loci, with qualitatively lower priorityFDRs than the remaining 175 signals. We detected putatively causal type 1 diabetes risk genes using Mendelian Randomisation, and found that these were located disproportionately close to low priorityFDR signals (p = 0.005), as were genes in the IL-2 pathway (p = 0.003). Selecting variables on both effect size and significance can lead to improved prioritisation for mechanistic follow-up studies from genetic and other large biological datasets.

More information Original publication

DOI

10.1002/gepi.22608

Type

Journal article

Publication Date

2025-01-01T00:00:00+00:00

Volume

49

Addresses

J, D, R, F, /, W, e, l, l, c, o, m, e, , D, i, a, b, e, t, e, s, , a, n, d, , I, n, f, l, a, m, m, a, t, i, o, n, , L, a, b, o, r, a, t, o, r, y, ,, , N, u, f, f, i, e, l, d, , D, e, p, a, r, t, m, e, n, t, , o, f, , M, e, d, i, c, i, n, e, ,, , C, e, n, t, r, e, , f, o, r, , H, u, m, a, n, , G, e, n, e, t, i, c, s, ,, , N, I, H, R, , O, x, f, o, r, d, , B, i, o, m, e, d, i, c, a, l, , R, e, s, e, a, r, c, h, , C, e, n, t, r, e, ,, , U, n, i, v, e, r, s, i, t, y, , o, f, , O, x, f, o, r, d, ,, , O, x, f, o, r, d, ,, , U, K, .

Keywords

Humans, Diabetes Mellitus, Type 1, Genetic Predisposition to Disease, Bayes Theorem, Follow-Up Studies, Polymorphism, Single Nucleotide, Genetic Variation, Genome-Wide Association Study, Mendelian Randomization Analysis