The International Arab Journal of Information Technology (IAJIT)

..............................
..............................
..............................


A Flexible Algorithm Design of Spatial Scalability for Real-time Surveillance Applications

The Surveillance Video and Audio Coding )SVAC( working group is currently developing the third-generation video compression standard, SVAC 3.0. To extend this standard, this paper proposes a spatial scalable coding Scalable Surveillance Video Coding )SSVC( framework, so that the SVAC3.0 video stream can gracefully adapt to different transmission bandwidth limitations and the requirements of decoding hardware, while keep the quality of the reconstructed image without degradation. In order to achieve the scalability of hardware implementation, SSVC designs a flexible reference frame marking and usage scheme, so that the enhanced layer coding does not directly depend on the basic layer, and effectively reduces the coding coupling between layers. SSVC improves the motion vector prediction method, effectively utilizes the encoding information of the base layer, and is mainly compatible with SVAC3.0 syntax structures. In addition, SSVC provides several different operational modes to adapt to various application scenarios and achieve an optimal trade-off between coding efficiency and complexity. The performance comparisons among simulcast stream and SSVC, single-layer stream and SSVC enhancement layer, as well as experimental data for different operational modes are provided. The coding efficiency and computational complexity are also analyzed.

[1] Advanced Video Coding for Generic Audiovisual Services, ITU-T Rec. H.264 and ISO/IEC 14496- 10 (MPEG-4 AVC), ITU-T and ISO/IEC, 2007.

[2] Bjontegaard G., “Calculation of Average PSNR Differences between Rd-Curves,” in Proceedings of the 13thVCEG-M33, VCEG Meeting, Austin, 2001.

[3] Boyce J., Ye Y., Chen J., and Ramasubramonian A., “Overview of SHVC: Scalable Extensions of the High Efficiency Video Coding Standard,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 26, no. 1, pp. 20-34, 2016. DOI:10.1109/TCSVT.2015.2461951

[4] Bross B., Wang Y., Ye Y., Liu S., Chen J., Sullivan G., and Ohm J., “Overview of the Versatile Video Coding (VVC) Standard and its Applications,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 31, no. 10, pp. 3736-3764, 2021. DOI:10.1109/TCSVT.2021.3101953

[5] Chen Y., Murherjee D., Han J., Grange A., Xu Y., and Liu Z., et al., “An Overview of Core Coding Tools in the AV1 Video Codec,” in Proceedings of the Picture Coding Symposium (PCS), San Francisco, 2008. DOI:10.1109/PCS.2018.8456249

[6] Coding of Audio-Visual Objects-Part 2: Visual, ISO/IEC 14492-2 (MPEG-4 Visual), ISO/IEC JTC 1, Version 3: 2004.

[7] High Efficiency Video Coding, Document Rec. ITU-T H.265 and ISO/IEC 23008-2, 2014.

[8] Lee H., Kang J., Lee J., Choi J., Kim J., and Sim D., “Scalable Extension of HEVC for Flexible High-Quality Digital Video Content Services,” ETRI Journal, vol. 35, no. 6, pp. 990-1000, 2013. https://doi.org/10.4218/etrij.13.2013.0040

[9] Mokhtar Z. and Dawwd S., “3D VAE Video Prediction Model with Kullback Leibler Loss Enhancement,” The International Arab Journal of Information Technology, vol. 21, no. 5, pp. 879- 888, 2024. https://doi.org/10.34028/iajit/21/5/9

[10] Schwarz H., Marpe D., and Wiegand T., “Overview of the Scalable Video Coding Extension of the H.264/AVC Standard,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 17, no. 9, pp. 1103-1120, 2007. DOI:10.1109/TCSVT.2007.905532

[11] Sjoberg R., Chen Y., Fujibayashi A., Hannuksela M., Samuelsson J., Tan T., Wang Y., and Wenger S., “Overview of HEVC High-Level Syntax and Reference Picture Management,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 22, no. 12, pp. 1858-1870, 2012. DOI:10.1109/TCSVT.2012.2223052

[12] Sullivan G., Ohm J., Han W., and Wiegand T., “Overview of the High Efficiency Video Coding (HEVC) Standard,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 22, no. 12, pp. 1649-1668, 2012. DOI:10.1109/TCSVT.2012.2221191

[13] Technical Specifications for Surveillance Video and Audio Coding (SVAC), Chinese National Standard: GB/T 25724.

[14] Versatile Video Coding, Standard ITU-T H.266, ISO/IEC 23090-3, 2020.

[15] Video Coding for Low Bit Rate Communication, ITU-T, ITU-T Recommendation H.263 Version 2, 2005.

[16] Wang Y., Skupin R., Hannuksela M., Deshpande S., Hendry., Drugeon V., Sjoberg R., Choi B., Seregin V., Sanchez Y., Boyce J., Wan W., and Sullivan J., “The High-Level Syntax of the Versatile Video Coding (VVC) Standard,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 31, no. 10, pp. 3779-3800, 2021. DOI: 10.1109/TCSVT.2021.3070860

[17] Wu F., Li S., and Zhang Y., “A Framework for Efficient Progressive Fine Granularity Scalable Video Coding,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 11, no. 3, pp. 332-344, 2001. DOI: 10.1109/76.911159

[18] Ye Y., He Y., Wang Y., and Hendry., “SHVC, the Scalable Extensions of HEVC, and its Applications,” Zte Communications, vol. 14, no. 1, pp. 2016.

[19] Zhang J., Jia C., Lei M., Wang S., Ma S., and Gao W., “Recent Development of AVS Video Coding,” in Proceedings of the Picture Coding Symposium, Ningbo, 2019. DOI: 10.1109/PCS48520.2019.8954503

[20] Zhang X., Huang T., Tian Y., and Gao W., “Background-Modeling-Based Adaptive Prediction for Surveillance Video Coding,” IEEE Transactions on Image Processing, vol. 23, no. 2, pp. 769-784, 2014. DOI: 10.1109/TIP.2013.2294549

[21] Zuo X., Yu L., Yu H., Mao J., and Zhao Y., “Scene- Library-Based Video Coding Scheme Exploiting Long-Term Temporal Correlation,” Journal of Electronic Imaging, vol. 26, no. 4, pp. 043026, 2017.