The International Arab Journal of Information Technology (IAJIT)

..............................
..............................
..............................


Colour Histogram and Modified Multi-layer Perceptron Neural Network based Video Shot

The paper proposes a shot boundary detection technique using colour histogram difference and modified Multi- Layer Perceptron (MLP). In this the learning process in the MLP is modified as an evolutionary learning process using Genetic Algorithm (GA) in which the weights of the hidden layer and output layer of the MLP are updated by GA. Colour Histogram Differences (HD) between two consecutive frames are used for feature extraction. Four values HDi,HDi-1 and-1 are used as an input for the modified MLP Neural Network where HDi is the colour histogram difference between frame fi and fi+1, HDi-1 is the colour histogram difference between frame fi-1 and fi and HDi+1 is the colour histogram difference between frame fi+1 and fi+2. The propose system is tested with the TRECVid 2001 and 2007 test data and it is also compared with latest algorithms and yields better results.


[1] Ayele E. and Dhok S., “Motion Estimation in Video Coding using Simplified Optical Flow Technique,” The International Arab Journal of Information Technology, vol. 13, no. 6A, pp. 770-776, 2016.

[2] Baraldi L., Grana C., and Cucchiara R., “Shot and Scene Detection Via Hierarchical Clustering for Re-Using Broadcast Video,” International Conference on Computer Analysis of Images and Patterns, vol. 9256, pp. 801-811, 2015.

[3] Brunelli R., Mich O., and Modena C., “A Survey on The Automatic Indexing of Video Data,” Journal of Visual Communication and Image Representation, vol. 10, no. 2, pp. 78-112, 1999.

[4] Bruyne S., Deursen D., Cock J., Neve W., Lambert P., and DeWalle R., “A Compressed- Domain Approach For Shot Boundary Detection on H.264/AVC Bit Streams,” Signal Processing: Image Communication, vol. 23, no. 7, pp. 473- 489, 2008.

[5] De Neve W., Deursen D., Schrijver D., Wolf K., and Walle R., “Using Bitstream Structure Descriptions for the Exploitation of Multilayered Temporal Scalability in H.264/AVCs Base Specification,” in Proceedings of Advances in Multimedia Information Processing, Berlin, pp. 641-652, 2005.

[6] Fang H., Jiang J., and Feng Y., “A Fuzzy Logic Approach for Detection of Video Shot Boundaries,” Pattern Recognition, vol. 39, no. 11, pp. 2092-2100, 2006.

[7] Ford R., Robson C., Temple D., and Gerlach M., “Metrics for Shot Boundary Detection in Digital Video Sequences,” Multimedia Systems, vol. 8, no. 1, pp. 37-46, 2000.

[8] Hu W., Xie N., Li L., Zeng X., and Maybank S., “A Survey on Visual Content-Based Video Indexing And Retrieval,” IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews, vol. 41, no. 6, pp. 797- 819, 2011.

[9] Huang C. and Liao B., “A Robust Scene-Change Detection Method for Video Segmentation,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 11, no. 12, pp. 1281- 1288, 2001. 692 The International Arab Journal of Information Technology, Vol. 16, No. 4, July 2019

[10] Jadon R., Chaudhury S., and Biswas K., “A Fuzzy Theoretic Approach for Video Segmentation Using Syntactic Features,” Pattern Recognition Letters, vol. 22, no. 13, pp. 1359- 1369, 2001.

[11] Janwe N. and Bhoyar K., “Video Shot Boundary Detection Based on JND Colour Histogram,” in Proceedings of International Conference on Image Information Processing, Shimla, pp. 476- 480, 2013.

[12] Kktun O., Gdkbay U., and Ulusoy Z., “Fuzzy Colour Histogram-Based Video Segmentation,” Computer Vision and Image Understanding, vol. 114, no. 1, pp. 125-134, 2010.

[13] Koprinska I. and Carrato S., “Temporal Video Segmentation: A Survey,” Signal Processing: Image Communication, vol. 16, no. 5, pp. 477- 500, 2001.

[14] Lam C. and Lee M., “Video Segmentation Using Colour Difference Histogram,” Multimedia Information Analysis and Retrieval, Berlin, pp. 159-174, 1998.

[15] Lu Z. and Shi Y., “Fast Video Shot Boundary Detection Based on SVD and Pattern Matching,” IEEE Transactions on Image Processing, vol. 22, no. 12, pp. 5136-5145, 2013.

[16] Luo M., DeMenthon D., and Doermann D., “Shot Boundary Detection Using Pixel-To-Neighbour Image Differences in Video,” in Proceedings of TRECVID Workshop Notebook Papers, pp. 1-6, 2004.

[17] Mas J. and Fernandez G., “Video Shot Boundary Detection Based on Colour Histogram,” in Proceedings of Notebook Papers TRECVID, 2003.

[18] Meier T. and Ngan K., “Video Segmentation for Content-Based Coding,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 9, no. 8, pp. 1190-1203, 1999.

[19] Meng J., Juan Y., and Chang S., “Scene Change Detection in an MPEG Compressed Video Sequence,” in Proceedings of IS&T/SPIE Symposium Proceedings, San Jose, pp. 14-25, 1995.

[20] Priya G. and Domnic S., “Walsh-Hadamard Transform Kernel-Based Feature Vector for Shot Boundary Detection,” IEEE Transactions on Image Processing, vol. 23, no. 12, pp. 5187- 5197, 2014.

[21] Shen S. and Cao J., “Abrupt Shot Boundary Detection Algorithm Based on Fuzzy Clustering Neural Network,” in Proceedings of 3rd International Conference on Computer Research and Development, Shanghai, pp. 246-248, 2011.

[22] Smeaton A., Over P., and Doherty A., “Video Shot Boundary Detection: Seven Years of Trecvid Activity,” Computer Vision and Image Understanding, vol. 114, no. 4, pp. 411-418, 2010.

[23] Thounaojam D., Roy S., and Manglem K., “Video Shot Boundary Detection using Gray Level Cooccurrence Matrix,” Indian Journal of Science and Technology, vol. 9, no. 7, pp. 1-5, 2016.

[24] Wang D., “Unsupervised Video Segmentation Based On Watersheds and Temporal Tracking,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 8, no. 5, pp. 539-546, 1998.

[25] Wang X., Wang S., and Chen H., “A Fast Algorithm for Mpeg Video Segmentation Based on Macroblock,” in Proceedings of 4th International Conference on Fuzzy Systems and Knowledge Discovery, Haikou, pp. 715-718, 2007.

[26] Xu P., Xie L., Chang S., Divakaran A., Vetro A., and Sun H., “Algorithms and system for Segmentation and Structure Analysis in Soccer Video,” in Proceedings of IEEE International Conference on Multimedia and Expo, Tokyo, pp. 721-724, 2001.

[27] Yoo H., Ryoo H., and Jang D., “Gradual Shot Boundary Detection Using Localized Edge Blocks,” Multimedia Tools and Applications, vol. 28, no. 3, pp. 283-300, 2006.

[28] Zabih R., Miller J., and Mai K., “A Feature- Based Algorithm for Detecting and Classifying Scene Breaks,” in Proceedings of the 3rd ACM International Conference on Multimedia, San Francisco, pp. 189-200, 1995. Colour Histogram and Modified Multi-layer Perceptron Neural Network based ... 693 Dalton Thounaojam was born in Manipur, India. He did M.E. and Ph.D. from Anna University Coimbatore and Assam University Silchar, India in 2009 and 2017 respectively in Computer Science and Engineering and Information Technology. He is an assistant professor in the department of Computer Science and Engineering, NIT Silchar. His research interests include image processing, video shot boundary detection, fuzzy system and artificial neural network. Thongam Khelchandra was born in Manipur, India. He received his Ph.D. and M.S. degree in Computer Science and Engineering from The University of Aizu, Japan in 2016 and 2007 respectively. He is an assistant professor in the department of Computer Science and Engineering, NIT Manipur. His main research interest includes Machine Learning, Pattern Recognition, Artificial Neural Network, Fuzzy Systems, Evolutionary Algorithm, Hybrid Intelligent System and their applications in Robotics, Image and Video Processing, Network Security. Thokchom Jayshree was born in Manipur, India. She did B.E and M.Tech. from Manipur Institute of Technology, Manipur and National Institute of Technology Manipur, India in 2014 and 2016 respectively in Computer Science and Engineering. Her research interest includes image processing and video processing. Sudipta Roy was born in Birbhum, West Bengal, India. He did MCA, M.Tech. and Ph.D. from BIT, Mesra, Jadavpur University, Kolkata, Assam University, Silchar in 2002, 2005 and 2010. He is a professor in the department of Computer Science and Engineering (formerly department of Information Technology), Assam University, Silchar. His research interests include digital watermarking, image processing, Data Security and Computer Networks. Khumanthem Singh was born in Manipur, India. He did B.Tech., M.Tech., M.S. and Ph.D. from DEI, Agra, DU, Delhi, BITS, Pilani and IIT, Guwahati in 1986, 1992, 1994 and 2007 in Electrical Engineering, Control & Instrumentation, System Software and Image processing. He is an associate professor in the department of computer science & engineering, NIT Manipur. His research interests include digital watermarking, image processing, steganography and computer forensic.