Impact of Data-Augmentation on Brain Tumor Detection Using Different YOLO Versions Models
Brain tumors are widely recognized as one of the world's worst and most disabling diseases. Every year, thousands of people die as a result of brain tumors caused by the rapid growth of tumor cells. As a result, saving the lives of tens of thousands of people worldwide needs speedy investigation and automatic identification of brain tumors. In this paper, we propose a new methodology for detecting brain tumors. The designed framework assesses the application of cutting-edge YOLO models such as YOLOv3, YOLO v5n, YOLO v5s, YOLO v5m, YOLOv5l, YOLOv5x, and YOLOv7 with varying weights and data augmentation on a dataset of 7382 samples from three distinct MRI orientations, namely, axial, coronal, and sagittal. Several data augmentation techniques were also employed to minimize detector sensitivity while increasing detection accuracy. In addition, the Adam and Stochastic Gradient Descent (SGD) optimizers were compared. We aim to find the ideal network weight and MRI orientation for detecting brain cancers. The results show that with an IoU of 0.5, axial orientation had the highest detection accuracy with an average mAP of 97.33%. Furthermore, SGD surpasses Adam optimizer by more than 20% mAP. Also, it was found that YOLO 5n, YOLOv5s, YOLOv5x, and YOLOv3 surpass others by more than 95% mAP. Besides that, it was observed that the YOLOv5 and YOLOv3 models are more sensitive to data augmentation than the YOLOv7 model.
[1] Abiwinanda N., Hanif M., Hesaputra S., Handayani A., and Mengko T., “Brain tumor Classification Using Convolutional Neural Network,” in Proceedings of the World congress on Medical Physics and Biomedical Engineering, Prague, pp. 183-189, 2019. DOI: 10.1007/978- 981-10-9035-6_33
[2] Atik M. and Duran Z., “Deep learning-based 3d Face Recognition Using Derived Features from Point Cloud,” The Proceedings of the 3rd International Conference on Smart City Applications, Karabukp, pp. 797-808, 2020.
[3] Atik S. and Ipbuker C., “Integrating Convolutional Neural Network and Multiresolution Segmentation for Land Cover and Land Use Mapping Using Satellite Imagery,” Applied Science, vol. 11, no. 12, p. 5551, 2021. https://doi.org/10.3390/app11125551
[4] Badran E., Mahmoud E., and Hamdy N., “An Algorithm for Detecting Brain Tumors in MRI Images,” in Proceedings of the International Conference on Computer Engineering and Systems, Cairo, pp. 368-373, 2010.
[5] Bakator M., and Radosav D., “Deep Learning and Medical Diagnosis: A Review of Literature,” Multimodal Technology Interact, vol. 2, no. 3, pp. 47, 2018. https://doi.org/10.3390/mti2030047
[6] Batra P., Hussain I., Abdul Ahad M., Casalino G., and Alam M., “A Novel Memory and Time- Efficient ALPR System Based on YOLOv5,” Sensors, vol. 22, no. 14, pp. 5283, 2022. https://doi.org/10.3390/s22145283
[7] Bayram A., Gurkan C., Budak A., and Karataş H., “A Detection and Prediction Model Based on Deep Learning Assisted by Explainable Artificial Intelligence for Kidney Diseases,” European Journal of Science and Technology, no. 40, pp. 67- 74, 2022. DOI: 10.31590/ejosat.1171777
[8] Cepni S., Atik M., and Duran Z., “Vehicle Detection Using Different Deep Learning Algorithms from Image Sequence,” Baltic Journal Modern Computing, vol. 8, no. 2, pp. 347- 358, 2020. DOI:10.22364/bjmc.2020.8.2.10
[9] Chan H., Hadjiiski L., and Samala R., “Computer‐ aided Diagnosis in the Era of Deep Learning,” Medical Physics, vol. 47, no. 5, pp. e218-e227, 2020. DOI: 10.1002/mp.13764
[10] Chen C., Liu M., Tuzel O., and Xiao J., “R-CNN for Small Object Detection,” in Proceedings of the Asian Conference on Computer Vision, Taipei pp. 214-230, 2016.
[11] Cheng J., Huang W., Cao S., Yang R., Yang W., Yun Z., Wang Z., and Feng Q., “Enhanced Performance of Brain Tumor Classification Via Tumor Region Augmentation and Partition,” PLoS One, vol. 10, no. 10, p. e0140381, 2015. 10.1371/journal.pone.0140381
[12] Dai J., Li Y., He K., and Sun J., “R-FCN: Object Detection Via Region-Based Fully Convolutional Networks,” in Proceedings of the 30th Conference on Neural Information Processing Systems, Barcelona, vol. 29, 2016.
[13] Gupta A., Ramanath R., Shi J., and Keerthi S., “Adam vs. SGD: Closing the Generalization Gap on Image Classification,” in Proceedings of the 13th Annual Workshop on Optimization for Machine Learning, 2021.
[14] He K., Gkioxari G., Dollár P., and Girshick R., “Mask R-CNN,” in Proceedings of the IEEE International Conference on Computer Vision, Venice, pp. 2961-2969, 2017. https://doi.org/10.48550/arXiv.1703.06870
[15] Henderson P. and Ferrari V., “End-to-end Training of Object Class Detectors for Mean Impact of Data-Augmentation on Brain Tumor Detection Using Different YOLO ... 479 Average Precision,” in Proceedings of the Asian Conference on Computer Vision, Taipei, pp. 198- 213, 2016. https://doi.org/10.48550/arXiv.1607.03476
[16] Huang X., Yue X., Xu Z., and Chen Y., “Integrating General and Specific Priors into Deep Convolutional Neural Networks for Bladder Tumor Segmentation,” in Proceedings of the International Joint Conference on Neural Networks, Shenzhen, pp. 1-8, 2021. doi: 10.1109/IJCNN52387.2021.9533813
[17] Kavitha R., Chitra L., and Kanaga L., “Brain Tumor Segmentation Using Genetic Algorithm with SVM Classifier,” International Journal of Advanced Research in Electrical, Electronics and Instrumentation Engineering, vol. 5, no. 3, pp. 1468-1471, 2016. DOI:10.15662/IJAREEIE.2016.0503043
[18] Khambhata K. and Panchal S., “Multiclass Classification of Brain Tumor in MR Images,” International Journal of Innovative Research in Computer and Communication Engineering, vol. 4, no. 5, pp. 8982-8992, 2016.
[19] Kuznetsova A., Maleva T., and Soloviev V., Cyber-Physical Systems: Modelling and Intelligent Control, Springer, 2021. https://doi.org/10.1007/978-3-030-66077-2_28
[20] Lee S., Kwak S., and Cho M., “Universal Bounding Box Regression and Its Applications,” in Proceedings of the Asian Conference on Computer Vision, Perth, pp. 373-387, 2018. https://doi.org/10.1007/978-3-030-20876-9_24
[21] Litjens G., Kooi T., Bejnordi B., Setio A., Ciompi F., Ghafoorian M., Laak J., Ginneken B., and Sánchez C., “A Survey on Deep Learning in Medical Image Analysis,” Medical Image Analysis, vol. 42, pp. 60-88, 2017. https://doi.org/10.1016/j.media.2017.07.005
[22] Logeswari T. Karnan M., “An Improved Implementation of Brain Tumor Detection Using Segmentation Based on Soft Computing,” Journal of Cancer Research and Experimental Oncology, vol. 2, no. 1, pp. 006-014, 2010.
[23] Lundervold A. and Lundervold A., “An Overview of Deep Learning in Medical Imaging Focusing on MRI,” Zeitschrift Für Medizinische Physik, vol. 29, no. 2, pp. 102-127, 2019. https://doi.org/10.1016/j.zemedi.2018.11.002
[24] Magnuska Z., Theek B., Darguzyte M., Palmowski M., and Stickeler E., “Influence of the Computer-Aided Decision Support System Design on Ultrasound-Based Breast Cancer Classification,” Cancers, vol. 14, no. 2, pp. 277, 2022. DOI: 10.3390/cancers14020277
[25] Mohiyuddin A., Basharat A., Ghani U., Peter V., and Abbas S., “Breast Tumor Detection and Classification in Mammogram Images Using Modified YOLOv5 network,” Computational and Mathematical Methods in Medicine, vol. 2022, 2022. doi: 10.1155/2022/1359019
[26] Montalbo F., “A Computer-Aided Diagnosis of Brain Tumors Using a Fine-Tuned YOLO-Based Model with Transfer Learning,” KSII Transactions on Internet and Information Systems, vol. 14, no. 12, pp. 4816-4834, 2020. DOI: 10.3837/tiis.2020.12.011
[27] Nepal U. and Eslamiat H., “Comparing YOLOv3, YOLOv4 and YOLOv5 for Autonomous Landing Spot Detection in Faulty UAVs,” Sensors, vol. 22, no. 2, pp. 464, 2022. https://doi.org/10.3390/s22020464
[28] Nogales A., Garcia-Tejedor A., Monge D., Vara J., and Antón C., “A Survey of Deep Learning Models in Medical Therapeutic Areas,” Artificial Intelligence in Medicine, vol. 112, pp. 102020, 2021. doi: 10.1016/j.artmed.2021.102020.
[29] Oza P., Sharma P., Patel S., Kumar P., “Deep Convolutional Neural Networks for Computer- Aided Breast Cancer Diagnostic: A Survey,” Neural Computing and Applications, vol. 34, no. 6, pp. 1-22, 2022. DOI: 10.1007/s00521-021- 06804-y
[30] Pan Y., Huang W., Lin Z., Zhu W., Zhou J., Wong J., Ding Z., “Brain Tumor Grading Based on Neural Networks and Convolutional Neural Networks,” in Proceedings of the 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, Milan, pp. 699- 702, 2015. DOI: 10.1109/EMBC.2015.7318458
[31] Rahman M. and Wang Y., “Optimizing Intersection-Over-Union in Deep Neural Networks for Image Segmentation,” International Symposium on Visual Computing, pp. 234-244, 2016.
[32] Redmon J. and Farhadi A., “Yolov3: An Incremental Improvement,” 2018.
[33] Redmon J., Divvala S., Girshick R., and Farhadi A., “You Only Look Once: Unified, Real-Time Object Detection,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, pp. 779-788, 2016. doi: 10.1109/CVPR.2016.91
[34] Ren S., He K., Girshick R., and SunJ., “Faster r- CNN: Towards Real-Time Object Detection with Region Proposal Networks,” Advances in Neural Information Processing Systems, vol. 28, 2015.
[35] Shelatkar T., Urvashi D., Shorfuzzaman M., Alsufyani A., and Lakshmanna K., “Diagnosis of Brain Tumor Using Light Weight Deep Learning Model with Fine-Tuning Approach,” Computational and Mathematical Methods in Medicine, vol. 2022, 2022. https://doi.org/10.1155/2022/2858845
[36] Singh L., Chetty G., and Sharma D., “A Novel Machine Learning Approach for Detecting the Brain Abnormalities from Mri Structural Images,” 480 The International Arab Journal of Information Technology, Vol. 21, No. 3, May 2024 in Proceedings of IAPR International Conference On Pattern Recognition in Bioinformatics, Tokyo, 2012, pp. 94-105.
[37] Swati Z., Zhao Q., Kabir M., Ali F., Ali Z., Ahmed S., and Lu J., “Content-Based Brain Tumor Retrieval for MR Images Using Transfer Learning,” IEEE Access, vol. 7, pp. 17809-17822, 2019. 10.1109/ACCESS.2019.2892455
[38] Tian L., Thalmann N., Thalmann D., Fang Z., and Zheng J., “Object grasping of humanoid robot based on YOLO,” in Proceedings of the Computer Graphics International Conference, Calgary 2019, pp. 476-482: Springer.
[39] Vengaloor R. and Muralidhar R., “Deep Learning Based Feature Discriminability Boosted Concurrent Metal Surface Defect Detection System Using YOLOv-5s-FRN,” The International Arab Journal of Information Technology, vol. 21, no. 1, pp. 94-106, 2024. https://doi.org/10.34028/iajit/21/1/9
[40] Walia I., Kumar D., Sharma K., Hemanth J., and Popescu D., “An Integrated Approach for Monitoring Social Distancing and Face Mask Detection Using Stacked ResNet-50 and YOLOv5,” Electronics, vol. 10, no. 23, pp. 2996, 2021.https://doi.org/10.3390/electronics1023299 6
[41] Wang C., Zhang Y., Zhou Y., Sun S., Zhang H., and Wang Y., “Automatic Detection of Indoor Occupancy Based on Improved YOLOv5 model,” Neural Computing and Applications, vol. 23, pp. 2575-2599, 2023. https://doi.org/10.1007/s00521- 022-07730-3
[42] Wang C., Bochkovskiy A., and Liao H., “YOLOv7: Trainable Bag-Of-Freebies Sets New State-of-The-Art for Real-Time Object Detectors,” 2023 IEEE/CVF Conference on Computer in Proceedings of the Vision and Pattern Recognition, Vancouver, 2022.
[43] Wang Z., Wu L., Li T., and Shi P., “A Smoke Detection Model Based on Improved YOLOv5,” Mathematics, vol. 10, no. 7, pp. 1190, 2022. https://doi.org/10.3390/math10071190
[44] Wenkel S., Alhazmi K., Liiv T., Alrshoud S., and Simon M., “Confidence Score: The Forgotten Dimension of Object Detection Performance Evaluation,” Sensors, vol. 21, no. 13, pp. 4350, 2021. https://doi.org/10.3390/s21134350
[45] White N., McDonald C., Farid N., Kuperman J., Kesari S., and Dale A., “Improved Conspicuity and Delineation of High-Grade Primary and Metastatic Brain Tumors Using “Restriction Spectrum Imaging”: Quantitative Comparison with High B-Value DWI and ADC,” American Journal of Neuroradiology, vol. 34, no. 5, pp. 958- 964, 2013.
[46] Xianjia Y., Salimpour S., Queralta J., and Westerlund T., “Analyzing General-Purpose Deep-Learning Detection and Segmentation Models with Images from a Lidar as a Camera Sensor,” arXiv Preprint, vol. arXiv:2203.04064v1, pp. 1-6, 2022. https://arxiv.org/pdf/2203.04064
[47] Xu S., Guo Z., Liu Y., Fan J., and Liu X., “An Improved Lightweight YOLOV5 Model Based on Attention Mechanism for Face Mask Detection,” in Proceedings of the International Conference on Artificial Neural Networks, Bristol, pp. 531-543, 2022. https://doi.org/10.1007/978-3-031-15934- 3_44
[48] Yan B., Fan P., Lei X., Liu Z., and Yang F., “A Real-Time Apple Targets Detection Method for Picking Robot Based on Improved YOLOv5,” Remote Sensing, vol. 13, no. 9, pp. 1619, 2021. https://doi.org/10.3390/rs13091619
[49] Zacharaki E ., Wang S., Chawla S., Yoo D., Wolf R., Melhem E., and Davatzikos C., “Classification Of Brain Tumor Type and Grade Using MRI Texture and Shape in A Machine Learning Scheme,” Magn Reson Med, vol. 62, no. 6, pp. 1609-1618, 2009. doi: 10.1002/mrm.22147.
[50] Zhang S., Zhang F., Ding Y., Li Y., “Swin- YOLOv5: Research and Application of Fire and Smoke Detection Algorithm Based on YOLOv5,” Computer Intelligence Neuroscience, vol. 2022, 2022. doi: 10.1155/2022/6081680
[51] Zhang Y., Shen Y., and Zhang J., “An Improved Tiny-YOLOv3 Pedestrian Detection Algorithm,” Optik, vol. 183, pp. 17-23, 2019. DOI:10.1016/j.ijleo.2019.02.038