The International Arab Journal of Information Technology (IAJIT)

..............................
..............................
..............................


Evaluating Social Context in Arabic Opinion Mining Mohammed Al-Kabi1, Izzat Alsmadi2, Rawan Khasawneh3, and Heider Wahsheh4 1Computer Science Department, Zarqa University, Jordan 2Computer Science Department, University of New Haven, USA 3Computer Information Systems Department, Jordan University of Science and Technology, Jordan 4Computer Science Department, King Khaled University, Saudi Arabia

This study is based on a benchmark corpora consisting of 3,015 textual Arabic opinions collected from Facebook. These collected Arabic opinions are distributed equally among three domains (Food, Sport, and Weather), to create a balanced benchmark corpus. To accomplish this study ten Arabic lexicons were constructed manually, and a new tool called Arabic Opinions Polarity Identification (AOPI) is designed and implemented to identify the polarity of the collected Arabic opinions using the constructed lexicons. Furthermore, this study includes a comparison between the constructed tool and two free online sentiment analysis tools (SocialMention and SentiStrength) that support the Arabic language. The effect of stemming on the accuracy of these tools is tested in this study. The evaluation results using machine learning classifiers show that AOPI is more effective than the other two free online sentiment analysis tools using a stemmed dataset.


[1] Abbasi A., Chen H., and Salem A., Sentiment Analysis in Multiple Languages: Feature Selection for Opinion Classification in Web Forums, ACM Transactions on Information Systems, vol. 26, no. 3, 2008.

[2] Abdulla N., Ahmed N., Shehab M., Al-Ayyoub M., Al-Kabi M., and Al-Rifai S., Towards Improving the Lexicon-Based Approach for Arabic Sentiment Analysis, International Journal of Information Technology and Web Engineering, vol. 9, no. 3, pp. 55-71, 2014.

[3] Abdulla N., Majdalawi R., Mohammed S., Al- Ayyoub M., and Al-Kabi M., Automatic Lexicon Construction for Arabic Sentiment Analysis, in Proceedings of the 2nd International Evaluating Social Context in Arabic Opinion Mining 981 Conference on Future Internet of Things and Cloud, Barcelona, pp. 547-552, 2014.

[4] Abdul-Mageed M. and Diab M., AWATIF: A Multi-Genre Corpus for Modern Standard Arabic Subjectivity and Sentiment Analysis, in Proceedings of the 8th International Conference on Language Resources and Evaluation, Istanbul, pp. 3907-3714, 2012.

[5] Abdul-Mageed M., Diab M., and K bler S., SAMAR: Subjectivity and Sentiment Analysis for Arabic Social Media, Computer Speech and Language, vol. 28, no. 1, pp. 20-37, 2014.

[6] Ahmad K. and Almas Y., Visualising Sentiments in Financial Texts?, in Proceedings of 9th International Conference on Information Visualisation, London, pp. 363-368, 2005.

[7] Ahmad K., Cheng D., and Almas Y., Multi- Lingual Sentiment Analysis of Financial News Streams, in Proceedings of International Workshop on Grid Technology for Financial Modeling and Simulation, Palermo, 2006.

[8] Al Shboul B., Al-Ayyoub M., and Jararweh Y., Multi-Way Sentiment Classification of Arabic Reviews, in Proceedings of the 6th International Conference on Information and Communication Systems, Amman, pp. 206-211, 2015.

[9] Al-Ayyoub M., Bani-Essa S., and Alsmadi I., Lexicon-Based Sentiment Analysis of Arabic Tweets, International Journal of Social Network Mining, vol. 2, no. 2, pp. 101-114, 2015.

[10] Al-Kabi N., Abdulla N., and Al-Ayyoub M., An Analytical Study of Arabic Sentiments: Maktoob Case Study, in Proceedings of 8th International Conference for Internet Technology and Secured Transactions, London, pp. 121-126, 2013.

[11] Al-Kabi M., Gigieh A., Alsmadi I., Wahsheh H., and Haidar M., Opinion Mining and Analysis for Arabic Language, International Journal of Advanced Computer Science and Applications, vol. 5, no. 5, pp. 181-195, 2014.

[12] Al-Kabi M., Al-Qudah N., Alsmadi I., Dabour M., and Wahsheh H., Arabic/English Sentiment Analysis: An Empirical Study, in Proceedings of 4th International Conference on Information and Communication Systems, Irbid, pp. 1-6, 2013.

[13] Al-Kabi M., Gigieh A., Alsmadi I., Wahsheh H., and Haidar M., An Opinion Analysis Tool for Colloquial and Standard Arabic, in Proceedings of 4th International Conference on Information and Communication Systems, Irbid, 2013.

[14] Al-Kabi M., Al-Ayyoub M., Alsmadi I., and Wahsheh H., A Prototype for a Standard Arabic Sentiment Analysis Corpus, The International Arab Journal of Information Technology, vol. 13, no. 1A, pp. 163-170, 2016.

[15] Almas Y. and Ahmad K., A Note on Extracting Sentiments in Financial News in English, Arabic and Urdu, in Proceedings of 2nd Workshop on Computation, al Approaches to Arabic Script-based Languages, California, pp. 1-12, 2007.

[16] Al-Smadi M., Al-Sarhan H., Al-Ayyoub M., and Jararweh Y., Using Aspect-Based Sentiment Analysis to Evaluate Arabic News Affect on Readers, in Proceedings of 8th International Conference on Utility and Cloud Computing, Limassol, pp. 436-441, 2015.

[17] AL-Smadi M., Qawasmeh O., Talafha B., and Quwaider M., Human Annotated Arabic Dataset of Book Reviews for Aspect Based Sentiment Analysis, in Proceedings of 3rd International Conference on Future Internet of Things and Cloud, Rome, pp. 726-730, 2015.

[18] Al-Twairesh N., Al-Khalifa H., and Al-Salman A., Subjectivity and Sentiment Analysis of Arabic: Trends and Challenges, in Proceedings of 11th International Conference on Computer Systems and Applications, Doha, pp. 148-155, 2014.

[19] Aly M. and Atiya A., LABR: A Large Scale Arabic Book Reviews Dataset, in Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, Sofia, pp. 494-498, 2013.

[20] Biltawi M., Etaiwi W., Tedmori S., Hudaib A., and Awajan A., Sentiment Classification Techniques for Arabic Language: A Survey, in Proceedings of the 7th International Conference on Information and Communication Systems, Irbid, pp. 339-346, 2016.

[21] ElSahar H. and El-Beltagy S., Building Large Arabic Multi-Domain Resources for Sentiment Analysis, in Proceedings of International Conference on Intelligent Text Processing and Computational Linguistics, Cairo, pp. 23-34, 2015.

[22] Kajanan S., Shariff A., Dutta K., and Datta A., Resolving Name Conflicts for Mobile Apps in Twitter Posts, International Federation for Information Processing, pp. 3-17, 2012.

[23] Khasawneh R., Wahsheh H., Al-Kabi M., and Alsmadi I., Sentiment Analysis of Arabic Social Media Content: A Comparative Study, in Proceedings of 8th International Conference for Internet Technology and Secured Transactions, London, pp. 101-106, 2013.

[24] Korayem M., Crandall D., and Abdul-Mageed M., Subjectivity and Sentiment Analysis of Arabic: A survey, in Proceedings of Advanced Machine Learning Technologies and Applications, Cairo, pp. 128-139, 2012.

[25] Laney D., 3D Data Management: Controlling Data Volume, Velocity, and Variety,

[Gartner Group, 2001]. Available at: http://blogs.gartner.com/doug- 982 The International Arab Journal of Information Technology, Vol. 15, No. 6, November 2018 laney/files/2012/01/ad949-3D-Data- Management-Controlling-Data-Volume- Velocity-and-Variety.pdf, Last Visited, 2015.

[26] Liu B., Sentiment Analysis and Opinion Mining, Morgan and Claypool Publishers, 2012.

[27] Nabil M., Aly M., and Atiya A., ASTD: Arabic Sentiment Tweets Dataset, in Proceedings of the Conference on Empirical Methods in Natural Language Processing, Lisbon, pp. 2515-2519, 2015.

[28] Nasukawa T. and Yi J., Sentiment Analysis: Capturing Favorability using Natural Language Processing, in Proceedings of the 2nd International Conference on Knowledge Capture, Sanibel Island, pp. 70-77, 2003.

[29] Obaidat I., Mohawesh R., Al-Ayyoub M., Al- Smadi M., and Jararweh Y., Enhancing the Determination of Aspect Categories and Their Polarities in Arabic Reviews Using Lexicon- Based Approaches, in Proceedings of Jordan Conference on Applied Electrical Engineering and Computing Technologies, Amman, pp. 1-6, 2015.

[30] Rushdi-Saleh M., Mart n-Valdivia M., Ure a- L pez L., and Perea-Ortega J., Bilingual Experiments with an Arabic-English Corpus for Opinion Mining, in Proceedings of Recent Advances in Natural Language Processing, Hissar, pp. 740-745, 2011.

[31] Rushdi-Saleh M., Mart n-Valdivia M., Ure a- L pez L., and Perea-Ortega J., OCA: Opinion Corpus for Arabic, Journal of the Association for Information Science and Technology, vol. 62, no. 10, pp. 2045-2054, 2011.

[32] Ryding K., A Reference Grammar of Modern Standard Arabic, ISBN-13 978-0-521-77151-1, Cambridge University Press, 2005.

[33] SentiStrength. Available at: http://sentistrength.wlv.ac.uk/, Last Visited, 2015.

[34] SocialMention. Available at: http://socialmention.com, Last Visited, 2015.

[35] Thelwall M., Buckley K., and Paltoglou G., Sentiment Strength Detection for the Social Web, Journal of the American Society for Information Science and Technology, vol. 63, no. 1, pp. 163-173, 2012.

[36] Thelwall M., Buckley K., Paltoglou G., Cai D., and Kappas A., Sentiment Strength Detection in Short Informal Text, Journal of the American Society for Information Science and Technology, vol. 61, no. 12, pp. 2544-2558, 2010.

[37] Witten I. And Frank E., Data Mining: Practical Machine Learning Tools and Techniques, Morgan Kaufmann Series in Data Management Systems, 2005. Mohammed Al-Kabi obtained his PhD degree in Mathematics from the University of Lodz/Poland 2001, his master s degree in Computer Science from the University of Baghdad/Iraq 1989, and his bachelor degree in statistics from the University of Baghdad/Iraq 1981. He is an assistant Professor in the Computer Science Department, Faculty of IT, at Zarqa University. He is the author of more than 88 peer-reviewed articles in IR, Big Data, Sentiment Analysis, NLP, Data mining and software Engineering. His teaching interests focus on Information retrieval, big data, web programming, data mining, DBMS (ORACLE and MS Access). Izzat Alsmadi is an associate professor in the Department Of Computer Science at University of New Haven. He obtained his PhD degree in software engineering from NDSU (USA), his second master in software engineering from NDSU (USA) and his first master in CIS from University of Phoenix (USA). He has several published books, Journals and Conference articles largely in software engineering, data mining, IR and NLP. Rawan Khasawneh is a full time lecturer in the department of Computer Information Systems at Jordan University of Science and Technology. She obtained her master degree in Management Information Systems from Yarmouk University in Jordan (2013), and her bachelor degree in Management Information Systems from Yarmouk University in Jordan (2011). Khasawneh s research interests include: e-government, social media and sentiment analysis, E-marketing and E-CRM, knowledge management systems and group decision support systems. Heider Wahsheh obtained his Master degree in Computer Information Systems from Yarmouk University, Jordan, 2012. Between 2013-2015 he worked as a lecturer in the college of Computer Science at King Khalid University, Saudi Arabia. His research interests include information retrieval, sentiment analysis, NLP, data mining, and mobile agent systems