The International Arab Journal of Information Technology (IAJIT)

..............................
..............................
..............................


HYAQP: A Hybrid Meta-Heuristic Optimization Model for Air Quality Prediction Using Unsupervised Machine Learning Paradigms

In the current decade, presence of air pollution leads towards serious health conditions, including respiratory ailments, cardiovascular disorders, and lung cancer, which impacts both the lifespan and overall well-being of individuals. Moreover, the air pollution has the potential to have detrimental effects on the environment, resulting in destruction to ecosystems, decreased agricultural output and so on. Emission control systems, better fuels, stronger laws, energy efficiency improvements, and renewable energy promotion are helping industries reduce air pollution. To monitor the quality of air methods such as Particulate Matter (PM2.5), PM10, Ozone (O3), and other meteorological indicators exists. These measures pose real-time air quality however it fails to predict air quality. Predicting air quality helps reduce air pollution by enabling timely interventions and preventive measures to mitigate pollution peaks. Thus, in this research a hybrid version of optimization algorithm namely Hybrid Air Quality Prediction system (HYAQP) which is a combination of k-means clustering algorithm and meta-heuristic algorithm Sine Cosine Algorithm (SCA) is proposed. The HYAQP holds SCA integrated with k-means algorithm to find optimal cluster centroid for grouping the air data into three clusters good, poor, and moderate quality. Then the cluster which is nearer to the test instance is found and the instances present in those clusters are passed to K-Nearest Neighbor Regressor (K-NNR). Comparing HYAQP on mean absolute error it outperforms 62.9% than Multiple Linear Regression (MLR), 58.5% than Support Vector Regression (SVR), 45.5% than Vanilla-Long Short-Term Memory (Vanilla-LSTM), 44.4% than Sparrow Search Algorithm based-LSTM (SSA-LSTM) and 53.8% than K-Nearest Neighbor (KNN).

[1] Abd Elaziz M., Nabil N., Ewees A., and Lu S., “Automatic Data Clustering Based on Hybrid Atom Search Optimization and Sine-Cosine Algorithm,” in Proceedings of the IEEE Congress on Evolutionary Computation, Wellington, pp. 2315-2322, 2019. DOI:10.1109/CEC.2019.8790361

[2] Air Quality Data from London, https://datahub.io/core/london-air- quality#readme, Last Visited, 2024.

[3] Air Quality Data from UK-AIR https://data.world/datagov-uk/3f4405c6-0aa7- 4087-8067-8efe0c364564, Last Visited, 2024.

[4] Athira V., Geetha P., Vinayakumar R., and Soman K., “Deepairnet: Applying Recurrent Networks for Air Quality Prediction,” Procedia Computer Science, vol. 132, pp. 1394-1403, 2018. https://doi.org/10.1016/j.procs.2018.05.068

[5] Barai S., Dikshit A., and Sharma S., Soft Computing in Industrial Applications, Springer, 2007. https://link.springer.com/chapter/10.1007/978-3- 540-70706-6_27

[6] Boushaki S., Bendjeghaba O., and Brakta N., “Accelerated Modified Sine Cosine Algorithm for Data Clustering,” in Proceedings of the IEEE 11th Annual Computing and Communication Workshop and Conference, Nevada, pp. 0715-0720, 2021. DOI:10.1109/CCWC51732.2021.9376122

[7] Centers for Disease Control and Prevention- Carbon Monoxide Poisoning, https://www.cdc.gov/carbon- monoxide/about/?CDC_AAref_Val=https://www. cdc.gov/co/faqs.htm, Last Visited, 2024.

[8] Central Pollution Control Board, https://cpcb.nic.in/, Last Visited, 2024.

[9] Chen X., Hu Y., Dong F., Chen K., and Xia H., “A Multi-Graph Spatial-Temporal Attention Network for Air-Quality Prediction,” Process Safety and Environmental Protection, vol. 181, pp. 442-451, 2024. https://doi.org/10.1016/j.psep.2023.11.040

[10] Chen X., Xia H., Wu M., Hu Y., and Wang Z., “Spatiotemporal Hierarchical Transmit Neural Network for Regional-Level Air-Quality Prediction,” Knowledge-Based Systems, vol. 289, pp. 111555, 2024. https://doi.org/10.1016/j.knosys.2024.111555

[11] Corani G., “Air Quality Prediction in Milan: Feed- Forward Neural Networks, Pruned Neural Networks and Lazy Learning,” Ecological Modelling, vol. 185, no. 2-4, pp. 513-529, 2005. https://doi.org/10.1016/j.ecolmodel.2005.01.008

[12] De Vito S., Massera E., Piga M., Martinotto L., and Di Francia G., “On Field Calibration of an Electronic Nose for Benzene Estimation in an Urban Pollution Monitoring Scenario,” Sensors and Actuators B: Chemical, vol. 129, no. 2, pp. 750-757, 2008. https://doi.org/10.1016/j.snb.2007.09.060

[13] Debnath J., Majumder D., and Biswas A., “Air Quality Assessment Using Weighted Interval Type-2Fuzzy Inference System,” Ecological Informatics, vol. 46, pp. 133-146, 2018. https://doi.org/10.1016/j.ecoinf.2018.06.002

[14] Ding C., Zheng Z., Zheng S., Wang X., Xie X., Wen D., Zhang L., and Zhang Y., “Accurate Air- Quality Prediction Using Genetic-Optimized Gated-Recurrent-Unit Architecture,” Information, vol. 13, no. 5, pp. 223, 2022. https://www.mdpi.com/2078-2489/13/5/223

[15] Environmental Protection Agency-Air Quality Index Basics, https://www.airnow.gov/aqi/aqi- basics/, Last Visited, 2024.

[16] Environmental Protection Agency-Ground-level Ozone Basics, https://www.epa.gov/ground-level- ozone-pollution/ground-level-ozone-basics, Last Visited, 2024.

[17] Environmental Protection Agency-National Ambient Air Quality Standards, https://www.epa.gov/naaqs, Last Visited, 2024.

[18] Fang W., Zhu R., and Lin J., “An Air Quality Prediction Model Based on Improved Vanilla LSTM with Multichannel Input and Multiroute Output,” Expert Systems with Applications, vol. 211, pp. 118422, 2023. https://doi.org/10.1016/j.eswa.2022.118422

[19] Green Peace, https://www.greenpeace.org/, Last Visited, 2024.

[20] Gupta S. and Deep K., “A Hybrid Self-Adaptive Sine Cosine Algorithm with Opposition Based Learning,” Expert Systems with Applications, vol. 119, pp. 210-230, 2019. https://doi.org/10.1016/j.eswa.2018.10.050

[21] Jain A., “Data Clustering: 50 Years Beyond K- Means,” Pattern Recognition Letters, vol. 31, no. 8, pp. 651-666, 2010. https://doi.org/10.1016/j.patrec.2009.09.011

[22] Kang G., Gao J., Chiao S., Lu S., and Xie G., “Air Quality Prediction: Big Data and Machine HYAQP: A Hybrid Meta-Heuristic Optimization Model for Air Quality Prediction Using ... 965 Learning Approaches,” International Journal of Environmental Science and Development, vol. 9, no. 1, pp. 8-16, 2018. https://www.ijesd.org/show- 103-1491-1.html

[23] Kaur M., Kaur R., Singh N., and Dhiman G., “SChoA: A Newly Fusion of Sine and Cosine with Chimp Optimization Algorithm for HLS of Datapaths in Digital Filters and Engineering Applications,” Engineering with Computers, vol. 38, no. 2, pp. 975-1003, 2022. https://link.springer.com/article/10.1007/s00366- 020-01233-2

[24] Kennedy J. and Eberhart R., “Particle Swarm Optimization,” in Proceedings of the International Conference on Neural Networks, Perth, pp. 1942- 1948, 1995. DOI:10.1109/ICNN.1995.488968

[25] Kok I., Simsek M., and Ozdemir S., “A Deep Learning Model for Air Quality Prediction in Smart Cities,” in Proceeding of the IEEE International Conference on Big Data, Boston, pp. 1983-1990, 2017. DOI:10.1109/BigData.2017.8258144

[26] Kuo R., Lin J., and Nguyen T., “An Application of Sine Cosine Algorithm-based Fuzzy Possibilistic C-ordered Means Algorithm to Cluster Analysis,” Soft Computing, vol. 25, no. 5, pp. 3469-3484, 2021. https://link.springer.com/article/10.1007/s00500- 020-05380-y

[27] Liao H., Yuan L., Wu M., and Chen H., “Air Quality Prediction by Integrating Mechanism Model and Machine Learning Model,” Science of the Total Environment, vol. 899, pp. 165646, 2023. https://doi.org/10.1016/j.scitotenv.2023.165646

[28] Lin Y., Lee S., Ouyang C., and Wu C., “Air Quality Prediction by Neuro-Fuzzy Modeling Approach,” Applied Soft Computing, vol. 86, pp. 105898, 2020. https://doi.org/10.1016/j.asoc.2019.105898

[29] Liu X. and Guo H., “Air Quality Indicators and AQI Prediction Coupling Long-Short Term Memory and Sparrow Search Algorithm: A Case Study of Shanghai,” Atmospheric Pollution Research, vol. 13, no. 10, pp. 101551, 2022. https://doi.org/10.1016/j.apr.2022.101551

[30] Ma J., Cheng J., Lin C., Tan Y., and Zhang J., “Improving Air Quality Prediction Accuracy at Larger Temporal Resolutions Using Deep Learning and Transfer Learning Techniques,” Atmospheric Environment, vol. 214, pp. 116885, 2019. https://doi.org/10.1016/j.atmosenv.2019.116885

[31] Mirjalili S., “SCA: A Sine Cosine Algorithm for Solving Optimization Problems,” Knowledge- based Systems, vol. 96, pp. 120-133, 2016. https://doi.org/10.1016/j.knosys.2015.12.022

[32] Mirjalili S., Evolutionary Algorithms and Neural Networks, Springer, 2019. https://link.springer.com/chapter/10.1007/978-3- 319-93025-1_4

[33] Navares R. and Aznarte J., “Predicting Air Quality with Deep Learning LSTM: Towards Comprehensive Models,” Ecological Informatics, vol. 55, pp. 101019, 2020. https://doi.org/10.1016/j.ecoinf.2019.101019

[34] Pan J., Fan F., Chu S., Zhao H., and Liu G., “A Lightweight Intelligent Intrusion Detection Model for Wireless Sensor Networks,” Security and Communication Networks, vol. 2021, no. 1, pp. 1- 15, 2021. https://onlinelibrary.wiley.com/doi/10.1155/2021/ 5540895

[35] Shakil M., Mohammed A., Arul R., Bashir A., and Choi J., “A Novel Dynamic Framework to Detect DDoS in SDN Using Metaheuristic Clustering,” Transactions on Emerging Telecommunications Technologies, vol. 33, no. 3, pp. 3622, 2022. https://onlinelibrary.wiley.com/doi/abs/10.1002/et t.3622

[36] Simsek M., Kok I., and Ozdemir S., “DeepFogAQ: A Fog-Assisted Decentralized Air Quality Prediction and Event Detection System,” Expert Systems with Applications, vol. 251, pp. 123920, 2024. https://doi.org/10.1016/j.eswa.2024.123920

[37] Singh K., Gupta S., Kumar A., and Shukla S., “Linear and Nonlinear Modeling Approaches for Urban Air Quality Prediction,” Science of the Total Environment, vol. 426, pp. 244-255, 2012. https://doi.org/10.1016/j.scitotenv.2012.03.076

[38] Sripada S. and Rao M., “Comparison of Purity and Entropy of K-Means Clustering and Fuzzy C Means Clustering,” Indian Journal of Computer Science and Engineering, vol. 2, no. 3, pp. 343- 346, 2011. file:///C:/Users/user/Downloads/BDCC-02- 00005-v2.pdf

[39] Sui S. and Han Q., “Multi-View Multi-Task Spatiotemporal Graph Convolutional Network for Air Quality Prediction,” Science of the Total Environment, vol. 893, pp. 164699, 2023. https://doi.org/10.1016/j.scitotenv.2023.164699

[40] Tillman D., Duong D., and Harding N., Solid Fuel Blending: Principles, Practices, and Problems, Elsevier Butterworth-Heinemann, 2012. https://lib.ugent.be/en/catalog/ebk01:2670000000 157860

[41] U.S. Pollution Data, https://www.kaggle.com/sogun3/uspollution, Last Visited, 2024.

[42] Wang J. and Song G., “A Deep Spatial-Temporal Ensemble Model for Air Quality Prediction,” Neurocomputing, vol. 314, pp. 198-206, 2018. https://doi.org/10.1016/j.neucom.2018.06.049

[43] Wang J., Zhang X., Guo Z., and Lu H., 966 The International Arab Journal of Information Technology, Vol. 21, No. 5, September 2024 “Developing an Early-Warning System for Air Quality Prediction and Assessment of Cities in China,” Expert Systems with Applications, vol. 84, pp. 102-116, 2017. https://doi.org/10.1016/j.eswa.2017.04.059

[44] World Health Organization-Ambient Air Pollution, https://www.who.int/news-room/fact- sheets/detail/ambient-(outdoor)-air-quality-and- health, Last Visited, 2024.

[45] Zhang W., Huang R., and Ye L., “Evaluation of Emission Reduction Performance of Power Enterprises Based on Least Squares Support Vector Machine,” The International Arab Journal of Information Technology, vol. 21, no. 5, pp. 854- 865, 2024. https://doi.org/10.34028/iajit/21/5/7

[46] Zhang Y., Wang Y., Gao M., Ma Q., Zhao J., Zhang R., Wang Q., and Huang L., “A Predictive Data Feature Exploration-based Air Quality Prediction Approach,” IEEE Access, vol. 7, pp. 30732-30743, 2019. DOI:10.1109/ACCESS.2019.2897754

[47] Zhu D., Cai C., Yang T., and Zhou X., “A Machine Learning Approach for Air Quality Prediction: Model Regularization and Optimization,” Big Data and Cognitive Computing, vol. 2, no. 1, pp. 1-15, 2018. https://www.mdpi.com/2504- 2289/2/1/5