The International Arab Journal of Information Technology (IAJIT), Vol. 20, No. 6, November 2023

Effects of Using Arabic Web Pages in Building Rank Estimation Algorithm for Google Search Engine Results Page

Search Engine Optimization (SEO) aims to improve a website's reputation and user experience. Without an effective SEO strategy, a website must invest heavily in paid advertisements. Search Engines (SEs) use algorithms to rank results, assessing on-page and off-page factors for relevance. Machine learning techniques have been used to build classifiers that estimate page rank. However, no research has compared rank estimation across languages, analyzed how the language of the web pages affects estimation performance, or examined how SEO factors differ between languages. This study aims to improve rank estimation algorithms for Arabic web pages on desktop devices using a new multi-category dataset collected from the Google Search Engine Results Page (SERP). SE scraping was used to collect URLs, descriptions, and other data from the Google SE, and the data were preprocessed before being used for rank estimation. Machine learning models were then applied to two datasets, one of Arabic and one of English web pages, and experiments assessed the implications of training on each. The experimental findings suggest that Arabic web pages are more suitable than English ones for training a model to estimate the ranking of Arabic web pages.
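
The comparison described above (train on one language's pages, evaluate on held-out Arabic pages) can be sketched in a few lines of Python. This is only an illustration of the evaluation protocol, not the study's method: the nearest-centroid classifier, the two feature columns, the rank classes "top" and "low", and every numeric value are assumptions invented for the example. The toy values are deliberately constructed so that the two languages have differently shifted features, so the resulting accuracies say nothing about the study's actual results.

```python
from collections import defaultdict
from math import dist

# Hypothetical, hand-made samples: ((feature_1, feature_2), rank_class).
# The features stand in for scaled SEO factors; none of this is study data.
ARABIC_TRAIN = [
    ((0.2, 0.9), "top"), ((0.4, 0.7), "top"),
    ((0.8, 0.1), "low"), ((0.6, 0.3), "low"),
]
ENGLISH_TRAIN = [
    ((0.6, 0.9), "top"), ((0.8, 0.7), "top"),
    ((0.2, 0.3), "low"), ((0.4, 0.1), "low"),
]
ARABIC_TEST = [
    ((0.3, 0.8), "top"), ((0.7, 0.2), "low"),
    ((0.25, 0.6), "top"), ((0.75, 0.4), "low"),
]


def fit_centroids(samples):
    """Return the mean feature vector of each rank class."""
    grouped = defaultdict(list)
    for features, label in samples:
        grouped[label].append(features)
    return {
        label: tuple(sum(column) / len(rows) for column in zip(*rows))
        for label, rows in grouped.items()
    }


def predict(centroids, features):
    """Assign the rank class whose centroid is closest (Euclidean distance)."""
    return min(centroids, key=lambda label: dist(centroids[label], features))


def accuracy(centroids, samples):
    """Fraction of samples whose predicted class matches the true class."""
    hits = sum(predict(centroids, features) == label for features, label in samples)
    return hits / len(samples)


if __name__ == "__main__":
    # Same test set, two different training languages.
    for name, train in (("Arabic", ARABIC_TRAIN), ("English", ENGLISH_TRAIN)):
        model = fit_centroids(train)
        print(f"trained on {name}: accuracy on Arabic test = {accuracy(model, ARABIC_TEST):.2f}")
```

Keeping the test set fixed while swapping only the training set isolates the effect of the training language, which is the question the experiments address. Any classifier could replace the nearest-centroid model here.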
