The International Arab Journal of Information Technology (IAJIT)

..............................
..............................
..............................


Leveraging on Synthetic Data Generation Techniques to Train Machine Learning Models for Tenaga Nasional Berhad Stock Price Movement Prediction

This study employs machine learning models to explore stock price prediction for Tenaga Nasional Berhad (TNB), Malaysia’s primary electricity provider. It addresses the limitations of previous studies by incorporating various input variables, including the stock market, technical, financial, and economic data. This study also tackles the issue of imbalanced class distribution due to small datasets of stock market data by generating synthetic data using Synthetic Minority Over-Sampling Technique (SMOTE) and Generative Adversarial Network-Synthetic Minority Over-Sampling Technique (GAN-SMOTE) techniques. The performance of four classifier models (random forest, Support Vector Machine (SVM), Extreme Gradient Boosting (XGBoost), and Artificial Neural Network (ANN) is evaluated without any synthetic data and with synthetic data generated. The SMOTE-ANN model is the best- performing model, exhibiting superior accuracy of 93%, F1-Score of 92%, precision of 90%, recall of 94%, and specificity of 92%. Overall, this research provides valuable insights into TNB stock price movements, offers a solution for imbalanced class distribution, and identifies the top-performing model for predicting TNB stock price movement. These findings are relevant to investors, analysts, and organisations in the utility sector.

[1] Asuero A., Sayago A., and González G., “The correlation coefficient: An overview,” Critical Reviews in Analytical Chemistry, vol. 36, no. 1, pp. 41-59, 2006. DOI: 10.1080/10408340500526766

[2] Bahrami A., Shamsuddin A., and Uylangco K., “Out-of-Sample Stock Return Predictability in Emerging Markets,” Accounting and Finance, vol. 58, no. 3, pp. 727-750, 2018. https://doi.org/10.1111/acfi.12234

[3] Bathla G., “Stock Price Prediction using LSTM and SVR,” in Proccedings of the 6th International Conference on Parallel, Distributed and Grid Computing, Waknaghat, pp. 211-214, 2020. doi: 10.1109/PDGC50313.2020.9315800

[4] Bekaert G. and Harvey C., “Emerging Markets Finance,” Journal of Empirical Finance, vol. 10, no 1-2, pp. 3-55, 2003. https://doi.org/10.1016/S0927-5398(02)00054-3

[5] Bozic J. and Djordje B,. “Financial Time Series Forecasting Using Hybrid Wavelet-Neural Model,” The International Arab Journal of Information Technology, vol. 15, no. 1, pp. 50-57, 2018.

[6] Carvajal-Patiño D. and Ramos-Pollán R., “Synthetic Data Generation with Deep Generative Models to Enhance Predictive Tasks in Trading Strategies,” Research in International Business and Finance, vol. 62, pp. 101747, 2022. https://doi.org/10.1016/j.ribaf.2022.101747

[7] Chaajer P., Shah M., and Kshirsagar A., “The Applications of Artificial Neural Networks, Support Vector Machines, and Long-Short Term Memory for Stock Market Prediction,” Decision Analytics Journal, vol. 2, pp. 100015, 2022. https://doi.org/10.1016/j.dajour.2021.100015

[8] Chatzis S., Siakoulis V., Petropoulos A., Stavroulakis E., and Vlachogiannakis N., “Forecasting Stock Market Crisis Events Using Deep and Statistical Machine Learning Techniques,” Expert Systems with Applications, vol. 112, pp. 353-371, 2018. https://doi.org/10.1016/j.eswa.2018.06.032

[9] Chopra R. and Sharma G., “Application of Artificial Intelligence in Stock Market Forecasting: A Critique, Review, and Research Agenda,” Journal of Risk and Financial Management, vol. 14, no. 11, pp. 526, 2021. https://doi.org/10.3390/jrfm14110526

[10] Das T., Khan A., and Saha G., “Classification of Imbalanced Big Data Using SMOTE with Rough Random Forest,” International Journal of Engineering and Advanced Technology, vol. 9, pp. 5174-5184, 2019. DOI: 10.35940/ijeat.B4096.129219

[11] Deng S., Zhu Y., Huang X., Duan S., and Fu Z., “High-Frequency Direction Forecasting of the Futures Market Using a Machine-Learning-Based Method,” Future Internet, vol. 14, no. 6, pp. 180, 2022. https://doi.org/10.3390/fi14060180

[12] Dhafer A., Nor F., Hashim W., Shah N., Bin- 494 The International Arab Journal of Information Technology, Vol. 21, No. 3, May 2024 Khairi K., and Alkawsi G., “A NARX Neural Network Model to Predict One-Day Ahead Movement of the Stock Market Index of Malaysia,” in Proccedings of the 2nd International Conference on Artificial Intelligence and Data Sciences, Ipoh, pp. 1-7, 2021. DOI: 10.1109/AiDAS53897.2021.9574394

[13] Eapen J., Bein D., and Verma A., “Novel Deep Learning Model with CNN and Bi-Directional LSTM for Improved Stock Market Index Prediction,” in Proceedings of the IEEE 9th Annual Computing and Communication Workshop and Conference, Las Vegas, pp. 0264-0270, 2019. doi: 10.1109/CCWC.2019.8666592

[14] Ghasemieh A. and Kashef R., “Deep Learning Vs. Machine Learning in Predicting the Future Trend of Stock Market Prices,” in Proccedings of the IEEE International Conference on Systems, Man, and Cybernetics, Melbourne, pp. 3429-3435, 2021. DOI: 10.1109/SMC52423.2021.9658938

[15] Kumar P., Bhatnagar R., Gaur K., and Bhatnagar A., “Classification of Imbalanced Data: Review of Methods and Applications,” in Proccedings of the IOP Conference Series: Materials Science and Engineering, Sanya, pp. 012077, 2021. doi:10.1088/1757-899X/1099/1/012077

[16] Ling L. and Belaidan S., “Stock Market Price Movement Forecasting on Bursa Malaysia using Machine Learning Approach,” in Proceedings of the 14th International Conference on Developments in eSystems Engineering, Sharjah, pp. 102-108, 2021. https://doi.org/10.1109/DeSE54285.2021.9719534

[17] Nabipour M., Nayyeri P., Jabani H., Shahab S., and Mosavi A., “Predicting Stock Market Trends Using Machine Learning and Deep Learning Algorithms Via Continuous and Binary Data: A Comparative Analysis,” IEEE Access, vol. 8, pp. 150199-150212, 2020. DOI: 10.1109/ACCESS.2020.3015966

[18] Ravikumar S. and Saraf P., “Prediction of Stock Prices Using Machine Learning Regression, Classification Algorithms,” in Proccedings of the International Conference for Emerging Technology, Belgaum, pp. 1-5, 2020. doi: 10.1109/INCET49848.2020.9154061.

[19] S&P Capital IQ. (2022). Tenaga Nasional Berhad: Public Company Profile, Last Visited, 2024.

[20] Sarumpaet N., Indwiarti., and Rohmawati A., “Performance Comparison Between Support Vector Regression and Long Short-Term Memory for Prediction of Stock Market,” in Proccedings of the 10th International Conference on Information and Communication Technology, Bandung, pp. 168-173. 2022. DOI: 10.1109/ICoICT55009.2022.9914839

[21] Tenaga Nasional Berhad. (2022). About TNB: Corporate Profile. https://www.tnb.com.my/about-tnb/corporate- profile/, Last Visited, 2024.

[22] Zhang K., Zhong G., Dong J., Wang S., and Wang Y., “Stock Market Prediction Based on Generative Adversarial Network,” Procedia Computer Science, vol. 147, pp. 400-406, 2019. https://doi.org/10.1016/j.procs.2019.01.256

[23] Zhao X., “The Prediction of Apple Inc. Stock Price with Machine Learning Models,” in Proccedings of the 3rd International Conference on Applied Machine Learning, Changsha, pp. 222- 225, 2021.