Transformer-Based Text Summarization: A Deep Learning Approach with Hybrid Optimization

Author Dabiah Alboaneen,

Keywords #Deep learning #text summarization #transformer #optimization #extractive and abstractive

Abstract

The amount of data on the internet is expanding rapidly. Thus, it is crucial to present essential information concisely. This would reduce the reading time and help minimize human effort. Therefore, a transformer-based text summarization technique is introduced in this paper. Initially, tokenization is applied to divide the text into words. Next, word embedding uses Global Vectors for word representation (GloVe) to represent the words in vectors. The embedded vector is given as input to the encoder with transformer architecture. This structure has Multi-Head Attention (MHA) and positional encoding context, which helps to identify the important context for summarization and to understand long-range dependencies. Each sentence is then scored based on its importance, and the sentences that have the top scores are separated. In the abstractive summarization stage, a Pointer-Generator Network (PGN) is introduced to create new words using its vocabulary. Furthermore, the cheetah optimizer’s exploration phase is combined with the exploitation phase of the Hippopotamus Optimization Algorithm (HOA) to improve the summary quality. The simulation analysis indicates that this proposed technique has higher Recall-Oriented Understudy for Gisting Evaluation (ROUGE) values than the existing summarization techniques.

References

[1] Abadi V. and Ghasemian F., “Enhancing Persian Text Summarization through a Three-Phase Fine- Tuning and Reinforcement Learning Approach with the mT5 Transformer Model,” Scientific Reports, vol. 15, no. 1, pp. 1-11, 2025. https://doi.org/10.1038/s41598-024-78235-3

[2] Abanoub G., Fawzy A., Waly R., and Gomaa W., “Generate Descriptions of Medical Dialogues through Two-Layers Transformer-based Summarization,” in Proceedings of the Intelligent Methods, Systems, and Applications, Giza, pp. 32- 37, 2023. DOI: 10.1109/IMSA58542.2023.10217636

[3] Abdel-Salam S. and Rafea A., “Performance Study on Extractive Text Summarization Using BERT Models,” Information, vol. 13, no. 2, pp. 1- 10, 2022. https://doi.org/10.3390/info13020067

[4] Abujar S., Hasan M., and Hossain S., “Sentence Similarity Estimation for Text Summarization Using Deep Learning,” in Proceedings of the 2nd International Conference on Data Engineering and Communication Technology, Pune, pp. 155- 164, 2017. https://link.springer.com/chapter/10.1007/978- 981-13-1610-4_16

[5] Alami N., En-nahnahi N., Ouatik S., and Meknassi M., “Using Unsupervised Deep Learning for Automatic Summarization of Arabic Documents,” Arabian Journal for Science and Engineering, vol. 43, pp. 7803-7815, 2018. https://doi.org/10.1007/s13369-018-3198-y

[6] Al-Maleh M. and Desouki S., “Arabic Text Summarization Using Deep Learning Approach,” Journal of Big Data, vol. 7, pp. 1-17, 2020. https://doi.org/10.1186/s40537-020-00386-7

[7] Alsuhaibani M., “Fine-Tuned PEGASUS: Exploring the Performance of the Transformer- based Model on a Diverse Text Summarization Dataset,” in Proceedings of the 9th World Congress on Electrical Engineering and Computer Systems and Sciences, London, pp. 1-9, 2023. DOI: 10.11159/cist23.117

[8] Anand D. and Wagh R., “Effective Deep Learning Approaches for Summarization of Legal Texts,” Journal of King Saud University-Computer and Information Sciences, vol. 34, no. 5, pp. 2141- 2150, 2022. https://doi.org/10.1016/j.jksuci.2019.11.015

[9] Bhargava R., Sharma G., and Sharma Y., “Deep Text Summarization Using Generative Adversarial Networks in Indian Languages,” Procedia Computer Science, vol. 167, pp. 147- 153, 2020. https://doi.org/10.1016/j.procs.2020.03.192

[10] Cai X., Liu S., Yang L., Lu Y., and et al., “COVIDSum: A Linguistically Enriched SciBERT-based Summarization Model for COVID-19 Scientific Papers,” Journal of Biomedical Informatics, vol. 127, pp. 103999, 2022. DOI: 10.1016/j.jbi.2022.103999

[11] Chen J., “An Entity-Guided Text Summarization Framework with Relational Heterogeneous Graph Neural Network,” Neural Computing and Applications, vol. 36, no. 7, pp. 3613-3630, 2024. https://doi.org/10.1007/s00521-023-09247-9

[12] Chen T., Wang X., Yue T., Bai X., Le C., and Wang W., “Enhancing Abstractive Summarization with Extracted Knowledge Graphs and Multi-Source Transformers,” Applied Sciences, vol. 13, no. 13, pp. 1-14, 2023. https://doi.org/10.3390/app13137753

[13] El-Kassas W., Salama C., Rafea A., and Mohamed H., “Automatic Text Summarization: A Comprehensive Survey,” Expert Systems with Applications, vol. 165, pp. 113679, 2021. https://doi.org/10.1016/j.eswa.2020.113679

[14] Elsaid A., Mohammed A., Ibrahim L., and Sakre M., “A Comprehensive Review of Arabic Text Summarization,” IEEE Access, vol. 10, pp. 38012- Transformer-Based Text Summarization: A Deep Learning Approach with Hybrid Optimization 971 38030, 2022. DOI: 10.1109/ACCESS.2022.3163292

[15] Hou S., Huang X., Fei C., Zhang S., and et al., “A Survey of Text Summarization Approaches Based on Deep Learning,” Journal of Computer Science and Technology, vol. 36, no. 3, pp. 633-663, 2021. https://doi.org/10.1007/s11390-020-0207-x

[16] Jain A., Arora A., Yadav D., Morato J., and Kaur A., “Text Summarization Technique for Punjabi Language Using Neural Networks,” The International Arab Journal of Information Technology, vol. 18, no. 6, pp. 807-818, 2021. https://www.iajit.org/portal/images/year2021/no6 /19758.pdf

[17] Liu S., Cao J., Yang R., and Wen Z., “Key Phrase Aware Transformer for Abstractive Summarization,” Information Processing and Management, vol. 59, no. 3, pp. 102913, 2022. https://doi.org/10.1016/j.ipm.2022.102913

[18] Ma C., Zhang W., Guo M., Wang H., and Sheng Q., “Multi-Document Summarization via Deep Learning Techniques: A Survey,” ACM Computing Surveys, vol. 55, no. 5, pp. 1-37, 2022. https://dl.acm.org/doi/abs/10.1145/3529754

[19] Magdum P. and Rathi S., Advances in Artificial Intelligence and Data Engineering, Springer, 2019. https://doi.org/10.1007/978-981-15-3514-7_30

[20] Mahalakshmi P. and Fatima N., “Summarization of Text and Image Captioning in Information Retrieval Using Deep Learning Techniques,” IEEE Access, vol. 10, pp. 18289-18297, 2022. DOI: 10.1109/ACCESS.2022.3150414

[21] Merrouni Z., Frikh B., and Ouhbi B., “EXABSUM: A New Text Summarization Approach for Generating Extractive and Abstractive Summaries,” Journal of Big Data, vol. 10, no. 1, pp. 1-34, 2023. https://doi.org/10.1186/s40537-023-00836-y

[22] Mohd M., Jan R., and Shah M., “Text Document Summarization Using Word Embedding,” Expert Systems with Applications, vol. 143, pp. 112958, 2020. https://doi.org/10.1016/j.eswa.2019.112958

[23] Moro G., Ragazzi L., Valgimigli L., Frisoni G., Sartori C., and Marfia G., “Efficient Memory- Enhanced Transformer for Long-Document Summarization in Low-Resource Regimes,” Sensors, vol. 23, no. 7, pp. 1-16, 2023. https://doi.org/10.3390/s23073542

[24] Mridha M., Lima A., Nur K., Das S., Hasan M., and Kabir M., “A Survey of Automatic Text Summarization: Progress, Process and Challenges,” IEEE Access, vol. 9, pp. 156043- 156070, 2021. DOI: 10.1109/ACCESS.2021.3129786

[25] Roul R., Sahoo J., and Goel R., “Deep Learning in the Domain of Multi-Document Text Summarization,” in Proceedings of the 7th International Conference on Pattern Recognition and Machine Intelligence, Kolkata, pp. 575-581, 2017. https://doi.org/10.1007/978-3-319-69900- 4_73

[26] Suleiman D. and Awajan A., “Deep Learning Based Abstractive Text Summarization: Approaches, Datasets, Evaluation Measures, and Challenges,” Mathematical Problems in Engineering, vol. 2020, pp. 1-29, 2020. https://onlinelibrary.wiley.com/doi/10.1155/2020/ 9365340

[27] Tomer M. and Kumar M., “Multi-Document Extractive Text Summarization Based on Firefly Algorithm,” Journal of King Saud University- Computer and Information Sciences, vol. 34, no. 8, pp. 6057-6065, 2022. https://doi.org/10.1016/j.jksuci.2021.04.004

[28] Widyassari A., Rustad S., Shidik G., Noersasongko E., and et al., “Review of Automatic Text Summarization Techniques and Methods,” Journal of King Saud University- Computer and Information Sciences, vol. 34, no. 4, pp. 1029-46, 2022. https://doi.org/10.1016/j.jksuci.2020.05.006

[29] Yousefi-Azar M. and Hamey L., “Text Summarization Using Unsupervised Deep Learning,” Expert Systems with Applications, vol. 68, pp. 93-105, 2017. https://doi.org/10.1016/j.eswa.2016.10.017

[30] Zhang J., Wang X., Zhang H., Sun H., and Liu X., “Retrieval-based Neural Source Code Summarization,” in Proceedings of the ACM/IEEE 42nd International Conference on Software Engineering, Seoul, pp. 1385-1397, 2020. https://dl.acm.org/doi/10.1145/3377811.3380383

[31] Zhang M., Zhou G., Yu W., Huang N., and Liu W., “A Comprehensive Survey of Abstractive Text Summarization Based on Deep Learning,” Computational Intelligence and Neuroscience, vol. 2022, no. 1, pp. 1-21, 2022. https://doi.org/10.1155/2022/7132226