The International Arab Journal of Information Technology (IAJIT)



Transfer-based Arabic to English Noun Sentence Translation Using Shallow Segmentation

The quality of machine translation systems decreases considerably when dealing with long sentences. In this paper, a transfer-based system is developed for translating long Arabic noun sentences into English. A simple method is used to divide a long sentence into phrases at conjunctions, prepositions, and quantifier particles. The phrases of the source sentence are translated individually. At the end of the translation process, the target sentence is constructed by connecting the translated phrases. The system was tested on 100 long thesis titles from the management and economy domain. The results show that the method is very efficient for most of the tested sentences.
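The segmentation step described above can be illustrated with a minimal sketch. It assumes that particles appear as separate whitespace-delimited tokens (in real Arabic text, particles such as the conjunction "و" are often attached to the following word and would need to be split off first). The particle sets, the function names `segment` and `translate_sentence`, and the `translate_phrase` callback are hypothetical and are not taken from the paper.

```python
# Hypothetical particle inventory: a few common Arabic conjunctions,
# prepositions, and quantifiers. The system's actual lists are not given here.
CONJUNCTIONS = {"و", "أو", "ثم"}
PREPOSITIONS = {"في", "على", "من", "إلى", "عن"}
QUANTIFIERS = {"كل", "بعض"}
PARTICLES = CONJUNCTIONS | PREPOSITIONS | QUANTIFIERS


def segment(sentence):
    """Split a whitespace-tokenized sentence into phrases at particles.

    Each particle opens a new phrase and stays attached to that phrase.
    """
    phrases = []
    current = []
    for token in sentence.split():
        # A particle closes the phrase collected so far (if any).
        if token in PARTICLES and current:
            phrases.append(current)
            current = []
        current.append(token)
    if current:
        phrases.append(current)
    return phrases


def translate_sentence(sentence, translate_phrase):
    """Translate each phrase on its own, then join the results in order.

    translate_phrase is a placeholder for a phrase-level transfer step;
    it takes a list of source tokens and returns a target string.
    """
    return " ".join(translate_phrase(p) for p in segment(sentence))


if __name__ == "__main__":
    title = "دور التخطيط في تحسين الأداء"
    for phrase in segment(title):
        print(phrase)
```

The sketch joins the translated phrases in source order. A full transfer-based system would also apply reordering and agreement rules inside each phrase, which are left out here.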


Namiq Abdullah is a lecturer in Electrical and Computer Engineering at the University of Duhok, Iraq. Prior to joining Duhok University, he worked for many years at the University of Mosul in Iraq and Al-Hussein Bin Talal University in Jordan. He obtained his M.Sc. degree in Computer Engineering from the University of Technology, Iraq (1993), and his B.Sc. in Electronics and Communications Engineering from the University of Mosul, Iraq (1979). His research interests include natural language processing and microprocessor-based systems.