The International Arab Journal of Information Technology (IAJIT)


Middle Eastern and North African English Speech Corpus (MENAESC): Automatic Identification of

This study explores English accents in the Arab world. Although a few resources exist for automatically identifying the degree of accent of Arabic speakers of English, no speech corpus is dedicated to Arabic speakers of English in the Middle East and North Africa (MENA). To that end, we collected speech samples to create a linguistic resource that we call the Middle Eastern and North African English Speech Corpus (MENAESC). In addition to the “accent approach” used in automatic language/dialect recognition, we also applied the “macro-accent approach”, using Mel-Frequency Cepstral Coefficients (MFCC), energy, and Shifted Delta Cepstra (SDC) features with a Gaussian Mixture Model-Universal Background Model (GMM-UBM) classifier, to four of the eleven accents (Egyptian, Qatari, Syrian, and Tunisian), selected for their high population density in the location where the experiments were carried out. Evaluated on MENAESC with the Equal Error Rate percentage (EER%), the system reached 1.5 to 2% for the “accent approach” and 2 to 3.5% for the “macro-accent approach” in identifying MENA English accents. The results also showed that, of the four accents included, the Qatari accent scored the lowest EER% in all tests performed. Taken together, system effectiveness is affected not only by the approach used but also by the size and characteristics of the MENAESC database, as well as by the English proficiency of the Arabic speakers and the influence of their mother tongue.
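
To make the evaluation metric concrete, the following is a minimal sketch of how an Equal Error Rate can be computed from detection scores. It is not the authors' implementation: the function name `equal_error_rate`, the threshold sweep over observed scores, and the toy scores are assumptions made purely for illustration. The EER is taken at the threshold where the false acceptance rate (non-target trials accepted) and the false rejection rate (target trials rejected) are closest.

```python
import numpy as np

def equal_error_rate(target_scores, nontarget_scores):
    """Return (eer, threshold) for two sets of detection scores.

    Hypothetical helper for illustration: higher scores are assumed to mean
    "more likely the claimed accent". The EER is approximated on the grid of
    observed scores, without interpolation between thresholds.
    """
    target = np.asarray(target_scores, dtype=float)
    nontarget = np.asarray(nontarget_scores, dtype=float)

    # Candidate thresholds: every distinct score seen in either set.
    thresholds = np.unique(np.concatenate([target, nontarget]))

    # False rejection rate: target trials scoring below the threshold.
    frr = np.array([(target < t).mean() for t in thresholds])
    # False acceptance rate: non-target trials scoring at or above it.
    far = np.array([(nontarget >= t).mean() for t in thresholds])

    # Pick the threshold where the two error rates are closest.
    idx = int(np.argmin(np.abs(far - frr)))
    eer = (far[idx] + frr[idx]) / 2.0
    return eer, thresholds[idx]

if __name__ == "__main__":
    # Toy scores, invented for this example only.
    genuine = [0.9, 0.8, 0.4, 0.7]
    impostor = [0.1, 0.2, 0.6, 0.3]
    eer, thr = equal_error_rate(genuine, impostor)
    print(f"EER = {100 * eer:.1f}% at threshold {thr}")  # EER = 25.0% at threshold 0.6
```

With large trial lists, a sorted cumulative count would be faster than the per-threshold loop used here, and interpolating between neighbouring thresholds gives a smoother estimate; the simple sweep is kept for readability.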

The International Arab Journal of Information Technology, Vol. 18, No. 1, January 2021

Sara Chellali is a Ph.D. student at the “Ecole nationale Supérieure d'Informatique (ESI, ex INI)”, Algiers, Algeria. She received the Magister degree in Computer Sciences and the Master degree in Didactics of French as a foreign language from the University of Amar Telidji, Laghouat, Algeria, and the Engineer degree in Computer Systems from ESI. She is currently working as a teacher-researcher at the “École Normale Supérieure de Laghouat (ENSL)”, Laghouat, Algeria. Her research is in language processing, with particular emphasis on accent/dialect identification, speech processing, deep learning, machine learning, pattern recognition, and didactics of sciences (Mathematics).

Somaya Al-Maadeed is a professor in the Computer Science and Engineering Department at Qatar University. She received the Ph.D. degree in computer science from Nottingham, U.K., in 2004. She has supervised students in research projects related to pattern recognition and Arabic recognition. She is currently the Head of the Computer Science Department, Qatar University, and the Coordinator of the Computer Vision Research Group, Qatar University. She enjoys excellent collaboration with national and international institutions and with industry. She is a principal investigator of several funded research projects that have generated approximately five million dollars in recent years. She has published extensively in computer vision and pattern recognition and has delivered workshops on teaching programming to undergraduate students. She has attended workshops on higher education strategy, assessment methods, and interactive teaching. In 2015, she was elected IEEE Chair of the Qatar Section. She and her team achieved the best performance in signature verification at ICDAR 2011 and ICDAR 2015.

Ouassila Kenai is a Ph.D. student in speech communication at USTHB, Algiers, Algeria. She holds a Magister degree in automatic speech processing from the Scientific and Technical Research Center for the Development of the Arabic Language (CRSTDLA), Algeria, and an Engineer degree in communication (Electronics) from USTHB, Algeria. She is currently a teacher at the institute of trades performing arts and audiovisual (ISMAS), Algiers, Algeria. She also works as a teacher and consultant in the audiovisual field in several state and private establishments. Her research interests include speaker recognition, where she presented a new VAD-based architecture for speaker diarization/detection systems (the subject of a published article), artificial intelligence, bioinformatics, speech and language processing, and forensic recognition (on which she has published several conference papers).

Maamar Ahfir received his “Ingeniorat” in Electronics and his “Magister” in Optoelectronics, both from the University of Blida (Algeria), in 1990 and 1997 respectively. He has held the E-science Doctorate degree in Electronics from the “Ecole Nationale Polytechnique (ENP)” of Algiers (Algeria) since 2008. He was a Lecturer at the University of Laghouat (Algeria) from 1997 to 2019 and Head of the Informatics Department of the Technical College of Jizane (Saudi Arabia) from 2001 to 2002. He has been an Associate Professor at the University of Médéa (Algeria) since 2019 and a Visiting Researcher at the Applied DSP and VLSI Systems Laboratory of the University of Westminster, London, UK, since 2004. His areas of interest include room acoustics and the processing of speech and human heart sounds (phonocardiogram).

Walid Hidouci is a professor of computer science at the “Ecole nationale Superieure d'Informatique (ESI)” in Algiers. He leads the "Advanced Database Systems" team in the LCSI research laboratory. His main topics of interest are database systems, data structures, artificial intelligence, operating systems, and parallel programming.