Supervised and Unsupervised Machine Learning for Improved Identification of Intrauterine Growth Restriction Types
Agnieszka Wosiak, Agata Zamecznik, Katarzyna Niewiadomska-Jarosik
DOI: http://dx.doi.org/10.15439/2016F515
Citation: Proceedings of the 2016 Federated Conference on Computer Science and Information Systems, M. Ganzha, L. Maciaszek, M. Paprzycki (eds). ACSIS, Vol. 8, pages 323–329 (2016)
Abstract. This paper concerns the automated identification of intrauterine growth restriction (IUGR) types using machine learning methods. It compares supervised and unsupervised learning, covering single and hybrid classification as well as clustering. The supervised techniques comprised five single classifiers: k-nearest neighbours (kNN), J48 (Weka's implementation of C4.5), Naive Bayes, random tree, and sequential minimal optimization (SMO) for training support vector machines. The hybrid classifiers comprised bagging and boosting, each with Naive Bayes, kNN, C4.5 and SMO as base classifiers; random forest, a variant of bagging with a decision tree as the base classifier; and voting over all single classifiers with majority as the combination rule. Unsupervised learning encompassed the k-means and expectation-maximization algorithms. The major conclusion of the study is that hybrid classifiers showed the potential to identify the symmetrical and asymmetrical types of IUGR more accurately, whereas the unsupervised learning techniques produced the worst results.
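The majority-vote combination rule named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation. The function names, the tie-breaking rule, and the per-classifier predictions below are hypothetical and serve only to show how the labels of several single classifiers might be combined.

```python
from collections import Counter
from typing import Hashable, List, Sequence


def majority_vote(predictions: Sequence[Hashable]) -> Hashable:
    """Return the label predicted by the largest number of classifiers.

    Ties go to the label that appears first in `predictions`
    (an assumption; the abstract does not specify tie-breaking).
    """
    if not predictions:
        raise ValueError("need at least one prediction")
    counts = Counter(predictions)
    best = max(counts.values())
    # Scan in the original order so the tie-break is deterministic.
    for label in predictions:
        if counts[label] == best:
            return label


def vote_per_sample(per_classifier: Sequence[Sequence[Hashable]]) -> List[Hashable]:
    """Combine labels sample by sample.

    per_classifier[c][i] is the label classifier c assigns to sample i.
    """
    return [majority_vote(column) for column in zip(*per_classifier)]


# Hypothetical outputs of five single classifiers on four cases (not study data).
knn = ["symmetrical", "asymmetrical", "symmetrical", "asymmetrical"]
j48 = ["symmetrical", "asymmetrical", "asymmetrical", "asymmetrical"]
naive_bayes = ["asymmetrical", "asymmetrical", "symmetrical", "symmetrical"]
random_tree = ["symmetrical", "symmetrical", "symmetrical", "asymmetrical"]
smo = ["symmetrical", "asymmetrical", "asymmetrical", "asymmetrical"]

combined = vote_per_sample([knn, j48, naive_bayes, random_tree, smo])
print(combined)
```

With an odd number of voters and two classes, as here, a tie cannot occur, so the tie-breaking assumption only matters for other configurations.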