Document Type : Original Article(s)

Authors

1 Department of Medical Informatics, School of Medical Sciences, Tarbiat Modares University, Tehran, Iran

2 Associate Professor, Department of Computer Engineering, School of Electrical and Computer Engineering, Tarbiat Modares University, Tehran, Iran

3 Professor, Cardiac Rehabilitation Research Center, Cardiovascular Research Institute, Isfahan University of Medical Sciences, Isfahan, Iran

Abstract

BACKGROUND: Metabolic syndrome which underlies the increased prevalence of cardiovascular disease and Type 2 diabetes is considered as a group of metabolic abnormalities including central obesity, hypertriglyceridemia, glucose intolerance, hypertension, and dyslipidemia. Recently, artificial intelligence based health-care systems are highly regarded because of its success in diagnosis, prediction, and choice of treatment. This study employs machine learning technics for predict the metabolic syndrome. METHODS: This study aims to employ decision tree and support vector machine (SVM) to predict the 7-year incidence of metabolic syndrome. This research is a practical one in which data from 2107 participants of Isfahan Cohort Study has been utilized. The subjects without metabolic syndrome according to the ATPIII criteria were selected. The features that have been used in this data set include: gender, age, weight, body mass index, waist circumference, waist-to-hip ratio, hip circumference, physical activity, smoking, hypertension, antihypertensive medication use, systolic blood pressure (BP), diastolic BP, fasting blood sugar, 2-hour blood glucose, triglycerides (TGs), total cholesterol, low-density lipoprotein, high density lipoprotein-cholesterol, mean corpuscular volume, and mean corpuscular hemoglobin. Metabolic syndrome was diagnosed based on ATPIII criteria and two methods of decision tree and SVM were selected to predict the metabolic syndrome. The criteria of sensitivity, specificity and accuracy were used for validation. RESULTS: SVM and decision tree methods were examined according to the criteria of sensitivity, specificity and accuracy. Sensitivity, specificity and accuracy were 0.774 (0.758), 0.74 (0.72) and 0.757 (0.739) in SVM (decision tree) method. CONCLUSION: The results show that SVM method sensitivity, specificity and accuracy is more efficient than decision tree. The results of decision tree method show that the TG is the most important feature in predicting metabolic syndrome. According to this study, in cases where only the final result of the decision is regarded significant, SVM method can be used with acceptable accuracy in decision making medical issues. This method has not been implemented in the previous research. 

Keywords

  1. Karabatak M, Ince MC. An expert system for detection of breast cancer based on association rules and neural network. Expert Syst Appl 2009; 36(2, Part 2): 3465-9.
  2. Khan J, Wei JS, Ringner M, Saal LH, Ladanyi M, Westermann F, et al. Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks. Nat Med 2001; 7(6): 673-9.
  3. Hirose H, Takayama T, Hozawa S, Hibi T, Saito I.
  4. Prediction of metabolic syndrome using artificial neural network system based on clinical data including insulin resistance index and serum adiponectin. Comput Biol Med 2011; 41(11): 1051-6.
  5. International Diabetes Federation. The IDF consensus worldwide definition of the metabolic syndrome [Online]. [cited 2006]; Available from: URL: http://www.idf.org/webdata/docs/MetS_def_update2006.pdf
  6. Zabetian A, Hadaegh F, Azizi F. Prevalence of metabolic syndrome in Iranian adult population, concordance between the IDF with the ATPIII and the WHO definitions. Diabetes Res Clin Pract 2007; 77(2): 251-7.
  7. Esteghamati A, Ashraf H, Rashidi A, Meysamie A. Waist circumference cut-off points for the diagnosis of metabolic syndrome in Iranian adults. Diabetes Res Clin Pract 2008; 82(1): 104-7.
  8. Liu HX, Zhang RS, Luan F, Yao XJ, Liu MC, Hu ZD, et al. Diagnosing breast cancer based on support vector machines. J Chem Inf Comput Sci 2003; 43(3): 900-7.
  9. Doniger S, Hofmann T, Yeh J. Predicting CNS permeability of drug molecules: comparison of neural network and support vector machine algorithms. J Comput Biol 2002; 9(6): 849-64.
  10. Yu Y, Chen S, Wang LS, Chen WL, Guo WJ, Yan H, et al. Prediction of pancreatic cancer by serum biomarkers using surface-enhanced laser desorption/ionization-based decision tree classification. Oncology 2005; 68(1): 79-86.
  11. Worachartcheewan A, Nantasenamat C, Isarankura-Na-Ayudhya C, Pidetcha P, Prachayasittikul V. Identification of metabolic syndrome using decision tree analysis. Diabetes Res Clin Pract 2010; 90(1): e15-e18.
  12. Talaei M, Sarrafzadegan N, Sadeghi M, Oveisgharan S, Marshall T, Thomas GN, et al. Incidence of cardiovascular diseases in an Iranian population: the Isfahan Cohort Study. Arch Iran Med 2013; 16(3): 138-44.
  13. Grundy SM, Cleeman JI, Daniels SR, Donato KA, Eckel RH, Franklin BA, et al. Diagnosis and management of the metabolic syndrome: an American Heart Association/National Heart, Lung, and Blood Institute Scientific Statement. Circulation 2005; 112(17): 2735-52.
  14. Azizi F, Khalili D, Aghajani H, Esteghamati A, Hosseinpanah F, Delavari A, et al. Appropriate waist circumference cut-off points among Iranian adults: the first report of the Iranian National Committee of Obesity. Arch Iran Med 2010; 13(3): 243-4.
  15. Witten LH, Frank E. Data mining: practical machine learning tools and techniques. Burlington, MA: Morgan Kaufmann; 2005.
  16. Jerez-Aragones JM, Gomez-Ruiz JA, Ramos-Jimenez G, Munoz-Perez J, Alba-Conejo E. A combined neural network and decision trees model for prognosis of breast cancer relapse. Artif Intell Med 2003; 27(1): 45-63.
  17. Palaniappan S, Awang R. Intelligent heart disease prediction system using data mining techniques. Proceedings of the IEEE/ACS International Conference on Computer Systems and Applications; 2008 Mar 31-Apr 4; Doha, Qatar.
  18. Anbananthen KS, Sainarayanan G, Chekima A, Teo J. Artificial neural network tree approach in data mining. Malaysian Journal of Computer Science 2007; 20(1): 51-62.
  19. Murthy SK. Automatic Construction of Decision Trees from Data: A Multi-Disciplinary Survey. Data Min Knowl Discov 1998; 2(4): 345-89.
  20. Alpaydin E. Introduction to machine learning. 2nd ed. Cambridge, MA: MIT Press; 2010.
  21. Yan WW, Shao HH. Application of support vector machines and least squares support vector machines to heart disease diagnoses. Control and Decision 2003; 18(3): 358-60.
  22. Yu W, Liu T, Valdez R, Gwinn M, Khoury MJ. Application of support vector machine modeling for prediction of common diseases: the case of diabetes and pre-diabetes. BMC Med Inform Decis Mak 2010; 10: 16.
  23. Akay MF. Support vector machines combined with feature selection for breast cancer diagnosis. Expert Syst Appl 2009; 36(2): 3240-7.
  24. Furey TS, Cristianini N, Duffy N, Bednarski DW, Schummer M, Haussler D. Support vector machine classification and validation of cancer tissue samples using microarray expression data. Bioinformatics 2000; 16(10): 906-14.
  25. Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP. SMOTE: Synthetic minority over-sampling technique. J Artif Intell Res 2002; 16: 321-57.
  26. Zhang D, Liu W, Gong X, Jin H. A Novel Improved SMOTE Resampling Algorithm Based on Fractal. Journal of Computational Information Systems 2011; 7(6): 2204-11.
  27. Alpaydin E. Introduction to machine learning. 3rd ed. Cambridge, MA: MIT Press; 2014.
  28. Sarrafzadegan N, Talaei M, Sadeghi M, Kelishadi R, Oveisgharan S, Mohammadifard N, et al. The Isfahan cohort study: rationale, methods and main findings. J Hum Hypertens 2011; 25(9): 545-53.
  29. Lemieux I, Poirier P, Bergeron J, Almeras N, Lamarche B, Cantin B, et al. Hypertriglyceridemic waist: a useful screening phenotype in preventive cardiology? Can J Cardiol 2007; 23(Suppl B): 23B-31B.