Predicting Public Health Risks Based on Lifestyle Factors Using the Support Vector Machine

Authors

  • Andri Ismail Sitepu Andri Universitas Pembangunan Panca Budi
  • Muhammad Iqbal Universitas Pembangunan Panca Budi

Keywords:

Support Vector Machine, health risk prediction, lifestyle factors, Machine Learning, preventive healthcare

Abstract

Public health risks are often influenced by multiple lifestyle factors, such as age, diet, exercise, smoking, and alcohol consumption. This study aims to develop a predictive model for assessing individual health risks using the Support Vector Machine (SVM) algorithm. The dataset used consists of lifestyle attributes, including age, weight, height, exercise frequency, sleep duration, sugar intake, smoking habits, alcohol consumption, marital status, profession, and body mass index (BMI). The data were preprocessed through normalization and label encoding, followed by training and testing using a 70:30 data split. The SVM model employed the Radial Basis Function (RBF) kernel to capture non-linear relationships between variables. Experimental results show that the proposed SVM model achieved an accuracy of approximately 89%, demonstrating strong predictive capability. The confusion matrix analysis revealed that the model effectively distinguishes between high and low health risk categories, while the PCA visualization confirmed clear clustering of classified data. Moreover, the feature importance analysis indicated that age, smoking habits, BMI, and alcohol consumption were the most significant contributors to health risk prediction. Overall, the results suggest that the SVM algorithm is a robust and efficient approach for predicting public health risks based on lifestyle factors. This model can serve as a foundation for preventive health monitoring systems, providing valuable insights for promoting healthier lifestyles and supporting data-driven public health strategies.

References

E. Tompa, “The Impact of Health on Productivity: Empirical,” Rev. Econ. Perform. Soc. Prog., 2002.

X. Zhang et al., “Linking urbanization and air quality together: A review and a perspective on the future sustainable urban development,” J. Clean. Prod., vol. 346, p. 130988, 2022.

T. B. Awofala and N. O. S. Godwin, “Data driven strategies to combat chronic diseases globally,” GSC Adv. Res. Rev. 21 (03), 235, vol. 240, 2024.

W. K. Balwan and S. Kour, “Lifestyle Diseases: The Link between Modern Lifestyle and threat to public health,” Saudi J Med Pharm Sci, vol. 7, no. 4, pp. 179–184, 2021.

S. Hussain et al., “Modern diagnostic imaging technique applications and risk factors in the medical field: a review,” Biomed Res. Int., vol. 2022, no. 1, p. 5164970, 2022.

J. Wang, C. Rao, M. Goh, and X. Xiao, “Risk assessment of coronary heart disease based on cloud-random forest,” Artif. Intell. Rev., vol. 56, no. 1, pp. 203–232, 2023.

S. Liu, Y. Gao, Y. Shen, M. Zhang, J. Li, and P. Sun, “Application of three statistical models for predicting the risk of diabetes,” BMC Endocr. Disord., vol. 19, no. 1, p. 126, 2019.

R. Guido, S. Ferrisi, D. Lofaro, and D. Conforti, “An overview on the advancements of support vector machine models in healthcare applications: a review,” Information, vol. 15, no. 4, p. 235, 2024.

M. Dirik, “Application of machine learning techniques for obesity prediction: a comparative study,” J. Complex. Heal. Sci., vol. 6, no. 2, pp. 16–34, 2023.

F. Ekundayo, “Using machine learning to predict disease outbreaks and enhance public health surveillance,” World J Adv Res Rev, vol. 24, no. 3, pp. 794–811, 2024.

W. Huang et al., “Application of ensemble machine learning algorithms on lifestyle factors and wearables for cardiovascular risk prediction,” Sci. Rep., vol. 12, no. 1, p. 1033, 2022.

K. M. Seaw, M. K. S. Leow, and X. Bi, “Early obesity risk prediction via non‐dietary lifestyle factors using machine learning approaches,” Clin. Obes., vol. 15, no. 4, p. e70011, 2025.

Z. Zhao et al., “Risk factor analysis and risk prediction study of obesity in steelworkers: model development based on an occupational health examination cohort dataset,” Lipids Health Dis., vol. 23, no. 1, p. 10, 2024.

F. R. Razak, M. K. Biddinika, and H. Yuliansyah, “Radial Basis Function Model for Obesity Classification Based on Lifestyle and Physical Condition,” J. ELTIKOM J. Tek. Elektro, Teknol. Inf. dan Komput., vol. 8, no. 2, pp. 192–200, 2024.

T. Chen et al., “A gastric cancer LncRNAs model for MSI and survival prediction based on support vector machine,” BMC Genomics, vol. 20, no. 1, p. 846, 2019.

Downloads

Published

2025-07-12

How to Cite

Andri, A. I. S., & Muhammad Iqbal. (2025). Predicting Public Health Risks Based on Lifestyle Factors Using the Support Vector Machine. Journal of Computer Science and Research (JoCoSiR), 3(3), 62–66. Retrieved from https://journal.aptikomsumut.org/index.php/jocosir/article/view/65