Predicting new cases of hypertension in Swedish primary care with a machine learning tool
(2024) In Preventive Medicine Reports 44.- Abstract
Background: Many individuals with hypertension remain undiagnosed. We aimed to develop a predictive model for hypertension using diagnostic codes from prevailing electronic medical records in Swedish primary care. Methods: This sex- and age-matched case-control (1:5) study included patients aged 30–65 years living in the Stockholm Region, Sweden, with a newly recorded diagnosis of hypertension during 2010–19 (cases) and individuals without a recorded hypertension diagnosis during 2010–19 (controls), in total 507,618 individuals. Patients with diagnoses of cardiovascular diseases or diabetes were excluded. A stochastic gradient boosting machine learning model was constructed using the 1,309 most registered ICD-10 codes from primary care... (More)
Background: Many individuals with hypertension remain undiagnosed. We aimed to develop a predictive model for hypertension using diagnostic codes from prevailing electronic medical records in Swedish primary care. Methods: This sex- and age-matched case-control (1:5) study included patients aged 30–65 years living in the Stockholm Region, Sweden, with a newly recorded diagnosis of hypertension during 2010–19 (cases) and individuals without a recorded hypertension diagnosis during 2010–19 (controls), in total 507,618 individuals. Patients with diagnoses of cardiovascular diseases or diabetes were excluded. A stochastic gradient boosting machine learning model was constructed using the 1,309 most registered ICD-10 codes from primary care for three years prior the hypertension diagnosis. Results: The model showed an area under the curve (95 % confidence interval) of 0.748 (0.742–0.753) for females and 0.745 (0.740–0.751) for males for predicting diagnosis of hypertension within three years. The sensitivity was 63 % and 68 %, and the specificity 76 % and 73 %, for females and males, respectively. The 25 diagnoses that contributed the most to the model for females and males all exhibited a normalized relative influence >1 %. The codes contributing most to the model, all with an odds ratio of marginal effects >1 for both sexes, were dyslipidaemia, obesity, and encountering health services in other circumstances. Conclusions: This machine learning model, using prevailing recorded diagnoses within primary health care, may contribute to the identification of patients at risk of unrecognized hypertension. The added value of this predictive model beyond information of blood pressure warrants further study.
(Less)
- author
- organization
- publishing date
- 2024-08
- type
- Contribution to journal
- publication status
- published
- subject
- keywords
- Artificial intelligence, Family practice, Gradient boosting, Hypertension, Opportunistic screening, Prediction
- in
- Preventive Medicine Reports
- volume
- 44
- article number
- 102806
- publisher
- Elsevier
- external identifiers
-
- scopus:85198217388
- pmid:39091569
- ISSN
- 2211-3355
- DOI
- 10.1016/j.pmedr.2024.102806
- language
- English
- LU publication?
- yes
- id
- bb01e421-aa93-4d9a-a4e8-f566ac92a9a8
- date added to LUP
- 2024-09-23 15:52:29
- date last changed
- 2024-09-24 03:00:05
@article{bb01e421-aa93-4d9a-a4e8-f566ac92a9a8, abstract = {{<p>Background: Many individuals with hypertension remain undiagnosed. We aimed to develop a predictive model for hypertension using diagnostic codes from prevailing electronic medical records in Swedish primary care. Methods: This sex- and age-matched case-control (1:5) study included patients aged 30–65 years living in the Stockholm Region, Sweden, with a newly recorded diagnosis of hypertension during 2010–19 (cases) and individuals without a recorded hypertension diagnosis during 2010–19 (controls), in total 507,618 individuals. Patients with diagnoses of cardiovascular diseases or diabetes were excluded. A stochastic gradient boosting machine learning model was constructed using the 1,309 most registered ICD-10 codes from primary care for three years prior the hypertension diagnosis. Results: The model showed an area under the curve (95 % confidence interval) of 0.748 (0.742–0.753) for females and 0.745 (0.740–0.751) for males for predicting diagnosis of hypertension within three years. The sensitivity was 63 % and 68 %, and the specificity 76 % and 73 %, for females and males, respectively. The 25 diagnoses that contributed the most to the model for females and males all exhibited a normalized relative influence >1 %. The codes contributing most to the model, all with an odds ratio of marginal effects >1 for both sexes, were dyslipidaemia, obesity, and encountering health services in other circumstances. Conclusions: This machine learning model, using prevailing recorded diagnoses within primary health care, may contribute to the identification of patients at risk of unrecognized hypertension. The added value of this predictive model beyond information of blood pressure warrants further study.</p>}}, author = {{Norrman, Anders and Hasselström, Jan and Ljunggren, Gunnar and Wachtler, Caroline and Eriksson, Julia and Kahan, Thomas and Wändell, Per and Gudjonsdottir, Hrafnhildur and Lindblom, Sebastian and Ruge, Toralph and Rosenblad, Andreas and Brynedal, Boel and Carlsson, Axel C.}}, issn = {{2211-3355}}, keywords = {{Artificial intelligence; Family practice; Gradient boosting; Hypertension; Opportunistic screening; Prediction}}, language = {{eng}}, publisher = {{Elsevier}}, series = {{Preventive Medicine Reports}}, title = {{Predicting new cases of hypertension in Swedish primary care with a machine learning tool}}, url = {{http://dx.doi.org/10.1016/j.pmedr.2024.102806}}, doi = {{10.1016/j.pmedr.2024.102806}}, volume = {{44}}, year = {{2024}}, }