Correlation Engine 2.0
Clear Search sequence regions


Sizes of these terms reflect their relevance to your search.

The number of heart disease patients is increasing. Establishing a risk assessment model for chronic heart disease (CHD) based on risk factors is beneficial for early diagnosis and timely treatment of high-risk populations. Four machine learning models, including logistic regression, support vector machines (SVM), random forests, and extreme gradient boosting (XGBoost), were used to evaluate the CHD among 14 971 participants in the National Health and Nutrition Examination Survey from 2011 to 2018. The area under the receiver-operator curve (AUC) is the indicator that we evaluate the model. In four kinds of models, SVM has the best classification performance (AUC = 0.898), and the AUC value of logistic regression and random forest were 0.895 and 0.894, respectively. Although XGBoost performed the worst with an AUC value of 0.891. There was no significant difference among the four algorithms. In the importance analysis of variables, the three most important variables were taking low-dose aspirin, chest pain or discomfort, and total amount of dietary supplements taken. All four machine learning classifiers can identify the occurrence of CHD based on population survey data. We also determined the contribution of variables in the prediction, which can further explore their effectiveness in actual clinical data. Copyright © 2023 Italian Federation of Cardiology - I.F.C. All rights reserved.

Citation

Xiaofei Chen, Dingjie Guo, Yashan Wang, Zihan Qu, Guangliang He, Chuanying Sui, Linwei Lan, Xin Zhang, Yuqing Duan, Hengyu Meng, Chunpeng Wang, Xin Liu. Using machine learning algorithms to identify chronic heart disease: National Health and Nutrition Examination Survey 2011-2018. Journal of cardiovascular medicine (Hagerstown, Md.). 2023 Jul 01;24(7):461-466

Expand section icon Mesh Tags


PMID: 37161973

View Full Text