Paper title:

Overview on How Data Mining Tools May Support Cardiovascular Disease Prediction

Published in: Issue 2, (Vol. 4) / 2010
Publishing date: 2010-04-30
Pages: 57-62
Author(s): SITAR- TAUT Dan- Andrei, SITAR- TAUT Adela- Viviana
Abstract. Terms as knowledge discovery or Knowledge Discovery from Databases (KDD), Data Mining (DM), Artificial Intelligence (AI), Machine Learning (ML), Artificial Neural networks (ANN), decision tables and trees, gain from day to day, an increasing significance in medical data analysis. They permit the identification, evaluation, and quantification of some less visible, intuitively unpredictable, by using generally large sets of data. Cardiology represents an extremely vast and important domain, having multiple and complex social and human implications. These are enough reasons to promote the researches in this area, becoming shortly not just national or European priorities, but also world-level ones. The profound and multiple interwoven relationships among the cardiovascular risk factors and cardiovascular diseases – but still far to be completely discovered or understood – represent a niche for applying IT&C modern and multidisciplinary tools in order to solve the existing knowledge gaps. This paper’s aim is to present, by emphasizing their absolute or relative pros and cons, several opportunities of applying DM tools in cardiology, more precisely in endothelial dysfunction diagnostic and quantification the relationships between these and so-called “classical” cardiovascular risk factors.
Keywords: KDD, Data Mining, Cardiovascular Disease, Cardiovascular Risk Factors, Machine Learning Algorithms, Classifiers
References:

1. V. Podgorelec, P. Kokol, B. Stiglic and I. Rozman. Decision Trees: An Overview and Their Use in Medicine. Journal of Medical Systems, Vol. 26, No. 5, October 2002; 445-62, 2002.

2. C. Helma, E. Gottmann, and S. Kramer. Knowledge discovery and data mining in toxicology. Statistical Methods in Medical Research; 9: 329–358, 2000.

3. S.J. Hamilton, G. Chew, and G. Watts. Therapeutic regulation of endothelialdysfunction in type 2 diabetes mellitus. Diabetes Vasc Dis Res; 4:89, 2007.

4. R. Campisi. Noninvasive assessment of coronary microvascular function in women at risk for ischaemic heart disease. Int J Clin Pract, 62, 2, 300–307.

5. B. Obrenović-Kirćanski. Endothelial dysfunction reversibility Vojnosanit Pregl. 2007;64(5):337-43, 2008.

6. W.C. Aird. Endothelium in health and disease. Pharmacological reports; 139-43, 2008.

7. M.A. Albert and P.M. Ridker. Reactive Protein as a Risk Predictor Do Race/Ethnicity and Gender Make a Difference. Circulation; 114:e67-e74, 2006.

8. P. Andreeva Data Modelling and Specific Rule Generation via Data Mining Techniques International Conference on Computer Systems and Technologies - CompSysTech’, 2006.

9. G. Widerhold, On the barriers of future of Knowledge discovery, in Advanced in Knowledge Discovery and DM, AAAI Press/MIT Press, 1996.

10. S.-C. Liao and I.-N. Lee. Appropriate medical data categorization for data mining classification techniques MED. INFORM. VOL. 27, NO. 1, 59–67, 2002.

11. U.M. Fayyad, G. Piatetsky-Shapiro and P. Smyth. From Data Mining to Knowledge Discovery in Databases, AI Magazine, American Association for Artificial Intelligence, 1996.

12. I.-N. Lee, S.-C. Liao and M. Embrechts. Data mining techniques applied to medical information. Med. inform. vol. 25, no. 2, 81- 102, 2000.

13. R. Sabzevari and GH.A. Montazer. An Intelligent Data Mining Approach Using Neuro-Rough Hybridization to Discover Hidden Knowledge from Information Systems Journal of Information Science And Engineering 24,1111 -1126, 2008.

14. K. Viikki, E. Kentala, M. Juhola and I. Pyykko. Decision tree induction in the diagnosis of otoneurological diseases Med . inform., vol . 24, no. 4, 277-289, 1999.

15. K. Matoušek and P. Aubrecht. Data Modelling and Preprocessing for Efficient Data Mining in Cardiology In International Special Topics Conference on Information Technology in Biomedicine CD-ROM.. Piscataway: IEEE, 2006.

16. R. Hewett, J. Leuchner, S. D. Mooney, T.E. Klein. Analysis of mutations in the colia1 gene with secondorder rule induction, International Journal of Pattern Recognition and Artificial Intelligence Vol. 17, No. 5, 721-740, World Scientific Publishing Company, 2003.

17. V. A. Sitar-Taut, D. Zdrenghea, D. Pop and A.D. SitarTaut. Using machine learning algorithms in cardiovascular disease risk evaluation, Journal of Applied Computer Science & Mathematics no. 5(3);29- 32, 2009.

18. B.-C. Chen, R. Ramakrishnan, J.W. Shavlik, P. Tamma. Bellwether Analysis: Searching for Cost-Effective Query-Defined Predictors in Large Databases. ACM Transactions on Knowledge Discovery in Data, Vol. 3 Issue 1, p5:1-5, 2009.

19. D.W. Aha, D. Kibler and M.K. Albert. Instance-Based Learning Algorithms. Machine Learning, 6(1):37–66, 1991.

20. Z. Zheng, G.I. Webb. Lazy Learning of Bayesian Rules, Machine Learning, 41, 53–87, Kluwer Academic Publishers, 2000.

21. Y. Li and N. Zhong. Interpretations of association rules by granular computing, Proceedings of the Third IEEE International Conference on Data Mining, USA, pp.593– 596, 2003.

22. Y. Wanzhong, Li Yuefeng, Wu Jingtong and Xu Yue. Granule mining oriented data warehousing model for representations of multidimensional association rules. Int. J. Intelligent Information and Database Systems, Vol. 2, No. 1, 2008.

23. I. Inza, B. Sierra, R. Blanco and P.L. Naga. Gene selection by sequential search wrapper approaches in microarray cancer class prediction. Journal of Intelligent & Fuzzy Systems; Vol. 12 Issue 1, p25, 9p, IOS Press, 2002.

24. L. Gaga, V. Moustakis, Y. Vlachakis, G. Charissis. ID+: Enhancing Medical Knowledge Acquisition WITH Machine Learning. Applied Artificial Intelligence, vol. 10, p79- 94, Taylor & Francis, 1996.

25. I.H. Witten and E. Frank, Data Mining: Practical Machine learning tools and techniques, 2nd Edition, Morgan Kaufmann, San Francisco, 2005.

26. K. Mollazade, H. Ahmadi, M. Omid, R. Alimardani. International Journal of Intelligent Technology, An Intelligent Combined Method Based on Power Spectral Density, Decision Trees and Fuzzy Logic for Hydraulic Pumps Fault Diagnosis. Vol. 3 Issue 4, p251-263, 2008.

27. A. Wong, Investigating noise tolerance in generalised nearest neighbour learning, Departmental Post-graduate Conference, Computer Science and Software Engineering, University of Caterbury, 2005.

Back to the journal content
Creative Commons License
This article is licensed under a
Creative Commons Attribution-ShareAlike 4.0 International License.
Home | Editorial Board | Author info | Archive | Contact
Copyright JACSM 2007-2024