Computationally efficient whole-genome quantile regression at biobank scale
(2025) In Proceedings of the National Academy of Sciences of the United States of America 122(50).- Abstract
Genotype–phenotype associations can be context-dependent and dynamic in nature leading to heterogeneity of genetic effects across different parts of the phenotype distribution. Quantile regression, an alternative to linear regression for continuous phenotypes, is particularly well suited for detecting and characterizing heterogeneous genotype–phenotype associations. Here, we propose a computationally efficient whole-genome quantile regression technique, Regenie.QRS, for biobank-scale genome-wide association studies (GWAS) data with genetic structure. Our approach first estimates the polygenic effect, and then incorporates this effect as an offset in the nonmixed quantile regression model. Our simulations demonstrate robust control of... (More)
Genotype–phenotype associations can be context-dependent and dynamic in nature leading to heterogeneity of genetic effects across different parts of the phenotype distribution. Quantile regression, an alternative to linear regression for continuous phenotypes, is particularly well suited for detecting and characterizing heterogeneous genotype–phenotype associations. Here, we propose a computationally efficient whole-genome quantile regression technique, Regenie.QRS, for biobank-scale genome-wide association studies (GWAS) data with genetic structure. Our approach first estimates the polygenic effect, and then incorporates this effect as an offset in the nonmixed quantile regression model. Our simulations demonstrate robust control of type I error and higher power to detect heterogeneous associations relative to linear regression in GWAS and improved power over the marginal quantile regression tests. We present applications using data from the UK Biobank and the ProgeNIA/SardiNIA project, where we show the advantages of Regenie.QRS in identifying and characterizing heterogeneous genetic effects. To cite just one interesting example, using quantile regression, we are able to show that even though variants at the G6PC2 locus increase glucose levels, their effects are much stronger at lower quantiles of glucose level distribution than at higher quantiles, suggesting that G6PC2 may serve as a guardian against low glucose levels without driving dangerous hyperglycemia, which may explain the previously reported lack of association with diabetes risk. Beyond human genetics, our approach can be applied to plant and animal genetic studies to improve selective breeding and conservation efforts.
(Less)
- author
- Wang, Fan ; Wang, Chen ; Wang, Tianying ; Masala, Marco ; Fiorillo, Edoardo ; Devoto, Marcella ; Cucca, Francesco and Ionita-Laza, Iuliana LU
- organization
- publishing date
- 2025-12-16
- type
- Contribution to journal
- publication status
- published
- subject
- keywords
- context-dependent associations, GWAS, quantile regression
- in
- Proceedings of the National Academy of Sciences of the United States of America
- volume
- 122
- issue
- 50
- article number
- e2513007122
- publisher
- National Academy of Sciences
- external identifiers
-
- scopus:105024743341
- pmid:41380003
- ISSN
- 0027-8424
- DOI
- 10.1073/pnas.2513007122
- language
- English
- LU publication?
- yes
- additional info
- Publisher Copyright: Copyright © 2025 the Author(s).
- id
- b8b1ee11-695f-4e01-bf5d-940547dcb4b1
- date added to LUP
- 2026-02-11 13:54:02
- date last changed
- 2026-02-12 03:00:02
@article{b8b1ee11-695f-4e01-bf5d-940547dcb4b1,
abstract = {{<p>Genotype–phenotype associations can be context-dependent and dynamic in nature leading to heterogeneity of genetic effects across different parts of the phenotype distribution. Quantile regression, an alternative to linear regression for continuous phenotypes, is particularly well suited for detecting and characterizing heterogeneous genotype–phenotype associations. Here, we propose a computationally efficient whole-genome quantile regression technique, Regenie.QRS, for biobank-scale genome-wide association studies (GWAS) data with genetic structure. Our approach first estimates the polygenic effect, and then incorporates this effect as an offset in the nonmixed quantile regression model. Our simulations demonstrate robust control of type I error and higher power to detect heterogeneous associations relative to linear regression in GWAS and improved power over the marginal quantile regression tests. We present applications using data from the UK Biobank and the ProgeNIA/SardiNIA project, where we show the advantages of Regenie.QRS in identifying and characterizing heterogeneous genetic effects. To cite just one interesting example, using quantile regression, we are able to show that even though variants at the G6PC2 locus increase glucose levels, their effects are much stronger at lower quantiles of glucose level distribution than at higher quantiles, suggesting that G6PC2 may serve as a guardian against low glucose levels without driving dangerous hyperglycemia, which may explain the previously reported lack of association with diabetes risk. Beyond human genetics, our approach can be applied to plant and animal genetic studies to improve selective breeding and conservation efforts.</p>}},
author = {{Wang, Fan and Wang, Chen and Wang, Tianying and Masala, Marco and Fiorillo, Edoardo and Devoto, Marcella and Cucca, Francesco and Ionita-Laza, Iuliana}},
issn = {{0027-8424}},
keywords = {{context-dependent associations; GWAS; quantile regression}},
language = {{eng}},
month = {{12}},
number = {{50}},
publisher = {{National Academy of Sciences}},
series = {{Proceedings of the National Academy of Sciences of the United States of America}},
title = {{Computationally efficient whole-genome quantile regression at biobank scale}},
url = {{http://dx.doi.org/10.1073/pnas.2513007122}},
doi = {{10.1073/pnas.2513007122}},
volume = {{122}},
year = {{2025}},
}