Machine Learning Models for Statistical Analysis

Author Marko Grebovic, Luka Filipovic, Ivana Katnic, Milica Vukotic, Tomo Popovic,

Keywords #Machine learning #artificial neural networks #statistical models #accuracy measures

Abstract

Compared to traditional statistical models, Machine Learning (ML) algorithms provide the ability to interpret, understand and summarize patterns and regularities in observed data for making predictions in an advanced and more sophisticated way. The main reasons for the advantage of ML methods in making predictions are a small number of significant predictors of the statistical models, which means limited informative capability, and pseudo-correct regular statistical patterns, used without previous understanding of the used data causality. Also, some ML methods, like Artificial Neural Networks, use non-linear algorithms, considering links and associations between parameters. On the other hand, statistical models use one- step-ahead linear processes to improve only short-term prediction accuracy by minimizing a cost function. Although designing an optimal ML model can be a very complex process, it can be used as a potential solution for making improved prediction models compared to statistical ones. However, ML models will not automatically improve prediction accuracy, so it is necessary to evaluate and analyze several statistical and ML methods, including some artificial neural networks, through accuracy measures for prediction purposes in various fields of applications. A couple of techniques for improving suggested ML methods and artificial neural networks are proposed to get better accuracy results.

References

[1] Agrawal S., Jalal A., and Tripathi R., “A Survey on Manual and Non-Manual Sign Language Recognition for Isolated and Continuous Sign,” International Journal of Applied Pattern 0 1 2 3 4 5 6 7 8 9 10 11 12 13 0123456 Performance measure value Method ID NN sMAPENN MASENN MF SM sMAPESM MASESM MF Machine Learning Models for Statistical Analysis 511 Recognition, vol. 3, no. 2, pp. 99-134, 2016. https://doi.org/10.1504/IJAPR.2016.079048

[2] Ahmed N., Atiya A., El Gayar N., and El-Shishiny H., “An Empirical Comparison of Machine Learning Models for Time Series Forecasting,” Econometric Reviews, vol. 29, no. 5-6, pp. 594- 621, 2010. https://doi.org/10.1080/07474938.2010.481556.

[3] Al Balawi S. and Aljohani N., “Credit-card Fraud Detection System using Neural Networks,” The International Arab Journal of Information Technology, vol. 20, no. 2, pp. 234-241, 2023. https://doi.org/10.34028/iajit/20/2/10.

[4] Alzu’bi A., Najadat H., Eyadat W., Al-Mohtaseb A., and Haddad H., “A New Approach for Detecting Eosinophils in the Gastrointestinal Tract and Diagnosing Eosinophilic Colitis,” The International Arab Journal of Information Technology, vol. 18, no. 4, pp. 596-603, 2021. https://doi.org/10.34028/18/4/12.

[5] Assimakopoulos V. and Nikolopoulos K., “The Theta Model: A Decomposition Approach To Forecasting,” International Journal of Forecasting, vol. 16, no. 4, pp. 521-530, 2000. https://doi.org/10.1016/S0169-2070(00)00066-2

[6] Barnett-Itzhaki Z., Elbaz M., Butterman R., Amar D., Amitay M., et al. “Machine Learning Vs. Classic Statistics for the Prediction of IVF Outcomes,” Journal of Assisted Reproduction and Genetics, vol. 37, no. 10, pp. 2405-2412, 2020.

[7] Berin Jones C. and Murugamani C., “Malaria Parasite Detection on Microscopic Blood Smear Images with Integrated Deep Learning Algorithms,” The International Arab Journal of Information Technology, vol. 20, no. 2, pp. 170- 179, 2023. https://doi.org/10.34028/iajit/20/2/3.

[8] Chandra Bose A. and Ramesh V., “Highly Accurate Grey Neural Network Classifier for an Abdominal Aortic Aneurysm Classification Based on Image Processing Approach,” The International Arab Journal of Information Technology, vol. 20, no. 2, pp. 215-223, 2023. https://doi.org/10.34028/iajit/20/2/8.

[9] Côté M., Osseni M., Brassard D., Carbonneau E., Robitaille J., et al., “Are Machine Learning Algorithms More Accurate in Predicting Vegetable and Fruit Consumption Than Traditional Statistical Models? An Exploratory Analysis,” Frontiers in Nutrition, vol. 9, 2022. doi: 10.3389/fnut.2022.740898.

[10] Cottrell M., Girard B., Girard Y., Mangeas M., and Muller C., “Neural Modeling for Time Series: A Statistical Stepwise Method for Weight Elimination,” IEEE Transactions on Neural Networks, vol. 6, no. 6, pp. 1355-1364, 1995. DOI: 10.1109/72.471372.

[11] Cunningham P. and Delany S., “k-Nearest Neighbour Classifiers 2nd Edition (with python examples),” arXiv arXiv., 2020. https://doi.org/10.48550/arXiv.2004.04523

[12] Curchoe C. and Bormann C., “Artificial Intelligence and Machine Learning for Human Reproduction and Embryology Presented at ASRM and ESHRE 2018,” Journal of Assisted Reproduction and Genetics, vol. 36, no. 4, pp. 591-600, 2019. DOI: 10.1007/s10815-019- 01408-x.

[13] Dan Foresee F. and Hagan M., “Gauss-Newton Approximation to Bayesian Learning,” IEEE International Conference on Neural Networks, vol. 3, pp. 1930-1935, 1997. DOI: 10.1109/ICNN.1997.614194.

[14] Deng L., “A Tutorial Survey of Architectures, Algorithms, and Applications for Deep Learning- ERRATUM,” APSIPA Transactions on Signal and Information Processing, vol. 3, 2014. DOI: https://doi.org/10.1017/ATSIP.2014.4.

[15] Desai R., et al., “Comparison of Machine Learning Methods with Traditional Models for Use of Administrative Claims with Electronic Medical Records to Predict Heart Failure Outcomes,” JAMA Network Open, vol. 3, no. 1, pp. 1-15, 2020. doi:10.1001/jamanetworkopen.2019.18962

[16] Elgendy F., Alshewimy M., and Sarhan A., “Pain Detection/Classification Framework including Face Recognition based on the Analysis of Facial Expressions for E-Health Systems,” The International Arab Journal of Information Technology, vol. 18, no. 1, pp. 125-132, 2021. https://doi.org/10.34028/iajit/18/1/14.

[17] Fakhfakh S. and Ben Jemaa Y., “Deep Learning Shape Trajectories for Isolated Word Sign Language Recognition,” The International Arab Journal of Information Technology, vol. 19, no. 4, pp. 660-666, 2022. https://doi.org/10.34028/iajit/19/4/10.

[18] Gardner E.S., “Exponential Smoothing: The State of the art-Part II,” International Journal of Forecasting, vol. 22, no. 4, pp. 637-666, 2006. https://doi.org/10.1016/j.ijforecast.2006.03.005.

[19] Géron A., Hands-on Machine Learning with Scikit-Learn, Keras, and TensorFlow, O’Reilly Media, 2019.

[20] Goodwin P. and Lawton R., “On the Asymmetry of the Symmetric MAPE,” International Journal of Forecasting, vol. 15, no. 4, pp. 405-408, 1999. https://doi.org/10.1016/S0169-2070(99)00007-2.

[21] Grebovic M., Filipovic L., Katnic I., Vukotic M., and Popovic T., “Overcoming Limitations of Statistical Methods with Artificial Neural Networks,” in Proceedings of the 23th International Arab Conference on Information Technology, Abu Dhabi, pp. 1-6, 2022. DOI: 10.1109/ACIT57182.2022.9994218.

[22] Hochreiter S. and Schmidhuber J., “Long Short- 512 The International Arab Journal of Information Technology, Vol. 20, No. 3A, Special Issue 2023 Term Memory,” Neural Computation, vol. 9, no. 8, pp. 1735-1780, 1997. DOI: 10.1162/neco.1997.9.8.1735.

[23] Hosmer D., Lemeshow S., and Sturdivant R., Applied Logistic Regression, John Wiley and Sons, 2013. DOI:10.1002/9781118548387.

[24] Howley T. and Madden M.G., “The Genetic Kernel Support Vector Machine: Description And Evaluation,” Artificial Intelligence Review, vol. 24, pp. 379-395, 2005.

[25] Hyndman R. and Koehler A., “Another Look at Measures of Forecast Accuracy,” International Journal of Forecasting, vol. 22, no. 4, pp. 679- 688, 2006. https://doi.org/10.1016/j.ijforecast.2006.03.001.

[26] Hyndman R., Koehler A., Snyder R., and Grose S., “A State Space Framework For Automatic Forecasting Using Exponential Smoothing Methods,” International Journal of Forecasting, vol. 18, no. 3, pp. 439-454, 2002. https://doi.org/10.1016/S0169-2070(01)00110-8.

[27] Jaouedi N., Boujnah N., and Bouhlel M., “A Novel Recurrent Neural Networks Architecture for Behavior Analysis,” The International Arab Journal of Information Technology, vol. 18, no. 2, pp. 133-139, 2021. https://doi.org/10.34028/iajit/18/2/1.

[28] Kumar Y., Verma S., and Sharma S., “Multi-Pose Facial Expression Recognition Using Hybrid Deep Learning Model with Improved Variant of Gravitational Search Algorithm,” The International Arab Journal of Information Technology, vol. 19, no. 2, pp. 281-287, 2022. https://doi.org/10.34028/iajit/19/2/15.

[29] Lafrenière J., Lamarche B., Laramée ., Robitaille J., and Lemieux S., “Validation of a Newly Automated Web-Based 24-Hour Dietary Recall Using Fully Controlled Feeding Studies,” BMC Nutrition, vol. 3, pp. 1-10, 2017. https://doi.org/10.1186/s40795-017-0153-3.

[30] Lippmann R., “An Introduction to Computing with Neural Nets,” IEEE ASSP Magazine, vol. 4, no. 2, pp. 4-22, 1987. DOI: 10.1109/MASSP.1987.1165576.

[31] Makridakis S., “The Forthcoming Artificial Intelligence (AI) Revolution: its Impact on Society And Firms,” Futures, vol. 90, pp. 46-60, 2017. https://doi.org/10.1016/j.futures.2017.03.006.

[32] Makridakis S., Spiliotis E., and Assimakopoulos V., “Statistical and Machine Learning Forecasting Methods: Concerns and Ways Forward,” PLoS ONE, vol. 13, no. 3, pp. 1-26, 2018. https://doi.org/10.1371/journal.pone.0194889.

[33] Makridakis S., Wheelwright S., and Hyndman R., Forecasting: Methods and Applications, John Wiley and Sons, 2008.

[34] Møller M., “A Scaled Conjugate Gradient Algorithm for Fast Supervised Learning,” Neural networks, vol. 6, no. 4, pp. 525-533, 1993. https://doi.org/10.1016/S0893-6080(05)80056-5.

[35] Morgan S. and Winship C., Counterfactuals and Causal Inference: Methods and Principles for Social Research, Cambridge University Press, 2007.

[36] Narayanan L., Krishnan S., and Robinson H., “A Hybrid Deep Learning Based Assist System for Detection and Classification of Breast Cancer from Mammogram Images,” The International Arab Journal of Information Technology, vol. 19, no. 6, pp. 965-974, 2022. https://doi.org/10.34028/iajit/19/6/15.

[37] Nawaz M., Nazir T., and Masood M., “Glaucoma Detection using Tetragonal Local Octa Patterns and SVM from Retinal Images,” The International Arab Journal of Information Technology, vol. 18, no. 5, pp. 686-693, 2021.

[38] Paliwal M. and Kumar U., “Neural Networks And Statistical Techniques: a Review of Applications,” Expert Systems with Applications, vol. 36, no. 1, pp. 2-17, 2009. https://doi.org/10.1016/j.eswa.2007.10.005.

[39] Pearl J., Causality: Models, Reasoning, and Inference, Cambridge University Press, 2009.

[40] Pokkuluri K. and Nedunuri U., “Crop Disease Prediction with Convolution Neural Network (CNN) Augmented With Cellular Automata,” The International Arab Journal of Information Technology, vol. 19, no. 5, pp. 765-773, 2022. 0 https://doi.org/10.34028/iajit/19/5/8.

[41] Powers D. and Xie Y., Statistical Methods for Categorical Data Analysis, Emerald Publishing, 2008.

[42] Ramakrishnan D. and Radhakrishnan K., “Applying Deep Convolutional Neural Network (DCNN) Algorithm in the Cloud Autonomous Vehicles Traffic Model,” The International Arab Journal of Information Technology, vol. 19, no. 2, pp. 186-194, 2022. https://doi.org/10.34028/iajit/19/2/5.

[43] Ratna M., Bhattacharya S., Abdulrahim B., and McLernon D., “A Systematic Review of the Quality of Clinical Prediction Models in Vitro Fertilisation,” Human Reproduction, vol. 35, no. 1, pp. 100-116, 2020. https://doi.org/10.1093/humrep/dez258.

[44] Salaken S.M., Khosravi A., Nguyen T., and Nahavandi S., “Extreme Learning Machine Based Transfer Learning Algorithms: a Survey,” Neurocomputing, vol. 267, pp. 516-524, 2017. https://doi.org/10.1016/j.neucom.2017.06.037.

[45] Siristatidis C., Vogiatzi P., Pouliakis A., Trivella M., Papantoniou N., and Bettocchi S., “Predicting Ivf Outcome: A Proposed Web-Based System Machine Learning Models for Statistical Analysis 513 Using Artificial Intelligence,” in Vivo, vol. 30, no. 4, pp. 507-512, 2016.

[46] Song Y. and Lu Y., “Decision Tree Methods: Applications for Classification and Prediction,” Shanghai Archives of Psychiatry, vol. 27, no. 2, pp. 130-135, 2015. 10.11919/j.issn.1002- 0829.215044.

[47] Specht D., “A General Regression Neural Network,” IEEE Transactions on Neural Networks, vol. 2, no. 6, pp. 568-576, 1991.

[48] Tibshirani R., “Regression Shrinkage And Selection Via The Lasso: A Retrospective,” Journal of the Royal Statistical Society: Series B, vol. 73, part 3, pp 273-282, 2011.

[49] Tiwari S., Jain A., Yadav K., and Ramadan R., “Machine Learning-Based Model for Prediction of Power Consumption in Smart Grid,” The International Arab Journal of Information Technology, vol. 19, no. 3, pp. 323-329, 2022. https://doi.org/10.34028/iajit/19/3/5.

[50] Xie Y., “Values and Limitations of Statistical Models,” Research in Social Stratification and Mobility, vol. 29, no. 3, pp. 343-349, 2011. https://doi.org/10.1016/j.rssm.2011.04.001.

[51] Zhang C. and Ma Y., Ensemble Machine Learning: Methods and Applications, Springer, 2012. https://doi.org/10.1007/978-1-4419-9326- 7.

[52] Zhang L. and Suganthan P., “A Survey of Randomized Algorithms for Training Neural Networks,” Information Sciences, vol. 364-365, pp. 146-155, 2016. https://doi.org/10.1016/j.ins.2016.01.039. 514 The International Arab Journal of Information Technology, Vol. 20, No. 3A, Special Issue 2023