The International Arab Journal of Information Technology (IAJIT)


A Combined Method of Skin-and Depth-based Hand Gesture Recognition

Kinect is a promising acquisition device that provides useful information on a scene through color and depth data. There has been a keen interest in utilizing Kinect in many computer vision areas such as gesture recognition. Given the advantages that Kinect provides, hand gesture recognition can be deployed efficiently with minor drawbacks. This paper proposes a simple and yet efficient way of hand gesture recognition via segmenting a hand region from both color and depth data acquired by Kinect v1. The Inception model of the image recognition system is used to check the reliability of the proposed method. Experimental results are derived from a sample dataset of Microsoft Kinect hand acquisitions. Under the appropriate conditions, it is possible to achieve high accuracy in close to real time.

[1] Ahn Y., Park Y., Choi K., Park W., Seo H., and Jung K., “TOF Depth Camera 3D Gesture Interaction System,” in Proceedings of International Conference on Advances in Information Technology, Bangkok, pp. 11-17, 2012.

[2] Bhattacharya S., Czejdo B., and Perez N., “Gesture Classification with Machine Learning Using Kinect Sensor Data,” in Proceedings of 3rd international Conference on Emerging Applications of Information Technology, Kolkata, pp. 727-31, 2012.

[3] Dean J., Corrado G., Monga R., Chen K., Devin M., Le Q., Mao M., Ranzato M., Senior A, Tucker P., Yang K., and Ng A., “Large Scale Distributed Deep Networks,” in Proceedings of the 25th International Conference on Neural Information Processing Systems, Lake Tahoe, pp. 1232-1240, 2012.

[4] Deyou X., “A Network Approach for Hand Gesture Recognition in Virtual Reality Driving Training System of SPG,” in Proceedings of the 18th International Conference on Pattern Recognition, Hong Kong, pp. 519-522, 2006.

[5] Dorninio F., Donadeo M., Marin G., Zanuttigh P., and Cortelazzo G., “Hand Gesture Recognition with Depth Data,” in Proceedings of the 4th ACMI/EEE International Workshop on Analysis and Retrieval of Tracked Events and 144 The International Arab Journal of Information Technology, Vol. 17, No. 1, January 2020 Motion in Imagery Stream, Barcelona, pp. 9-16, 2013.

[6] Elmezain M., Al-Hamadi A., Appenrodt J., and Michaelis B., “A Hidden Markov Model-Based Continuous Gesture Recognition System for Hand Motion Trajectory,” in Proceedings of the 19th International Conference on Pattern Recognition, Tampa, pp. 1-4, 2008.

[7] Gai S., Jung J., and Yi J., “Mobile Shopping Cart Application Using Kinect,” in Proceedings of the 10th International Conference on Ubiquitous Robots and Ambient Intelligence, Jeju, pp. 289-291, 2013.

[8] He G., Kang S., Song W., and Jung S., “Real- Time Gesture Recognition Using 3d Depth Camera,” in Proceedings of IEEE 2nd International Conference on Software Engineering and Service Science, Beijing, pp. 187-190, 2011.

[9] Ibraheem N., Hasan M., Khan R., and Mishra P., “Understanding Color Models: A Review,” ARPN Journal of Science and Technology, vol. 2, no. 3, pp. 265-275, 2012.

[10] Jerald J., the VR Book. Human-Centered Design for Virtual Reality, Association for Computing Machinery and Morgan and Claypool New York, 2016.

[11] Khoshelham K., “Accuracy Analysis of Kinect Depth Data,” In: ISPRS Workshop Laser Scanning, Calgary, 2011.

[12] Kulkarni V. and Lokhande S., “Appearance based Recognition of American Sign Language Using Gesture Segmentation,” International Journal on Computer Science and Engineering, vol. 2, no. 3, pp. 560-565, 2010.

[13] Li Z. and Jarvis R., “Real Time Hand Gesture Recognition Using A Range Camera,” in Proceedings of Australasian Conference on Robotics and Automation Sydney, pp. 529-534, Australia, 2009.

[14] Liu L., Sang N., Yang S., and Huang R., “Real- Time Skin Color Detection under Rapidly Changing Illumination Conditions,” IEEE Transactions on Consumer Electronics, vol. 57, no. 3, pp. 1295-1302, 2011.

[15] Madani T., Tahir M., Ziauddin S., Raza S., Ahmed M., Khan M., and Ashraf M., “An Accelerometer-Based Approach To Evaluate 3D Unistroke Gestures,” The International Arab Journal of Information Technology, vol. 12, no. 4, pp. 389- 394, 2015.

[16] Marin G., Dominio F., and Zanuttigh P., “Hand Gesture Recognition with Leap Motion and Kinect Devices,” in Proceedings of IEEE International Conference on Image Processing, pp. 1565-1569, Paris, 2014.

[17] Nagarajan S. and Subashini T., “Static Hand Gesture Recognition for Sign Language Alphabets using Edge Oriented Histogram and Multi Class SVM,” International Journal of Computer Applications, vol. 82, no. 4, pp. 28- 35, 2013.

[18] Pansare J., Gawande S., and Ingle M., “Real- Time Static Hand Gesture Recognition for American Sign Language (ASL) in Complex Background,” Journal of Signal and Information Processing, vol. 3, no. 3, pp 364- 367, 2012.

[19] Parvini F., McLeod D., Shahabi C., Navai B., Zali B., and Ghandeharizadeh S., “An Approach to Glove-Based Gesture Recognition,” in Proceedings of International Conference on Human-Computer Interaction, San Diego, pp. 236-245, 2009.

[20] Pavlovic V., Sharma R., and Huang T., “Visual Interpretation of Hand Gestures for Human- Computer Interaction: A Review,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 19, no. 7, pp. 677-695, 1997.

[21] Ren Z., Meng J., and Yuan J., “Depth Camera Based Hand Gesture Recognition and its Applications in Human- Computer-Interaction,” in Proceedings of 8th International Conference on Information, Communications and Signal Processing, Singapore, pp. 1-5, 2011.

[22] Ren Z., Yuan J., and Zhang Z., “Robust Hand Gesture Recognition Based on Finger-Earth Movers Distance with a Commodity Depth Camera,” in Proceedings of the 19th International Conference on Multimedea, Scottsdale, pp. 1093-1096, 2011.

[23] Samantaray A., Nayak S., and Mishra A., “Hand Gesture Recognition using Computer Vision,” International Journal of Scientific and Engineering Research, vol. 4, no. 6, pp. 1602- 1609, 2013.

[24] Shaik K., Ganesan P., Kalist V., Sathish B., and Jenitha M., “Comparative Study of Skin Color Detection and Segmentation in HSV and YCbCr Color Space,” Procedia Computer Science, vol. 57, 41-48, 2015.

[25] Sharp T., Keskin C., Robertson D., Taylor J., Shotton J., Kim D., Rhemann C., Leichter I., Vinnikov A., Wei Y., Freedman D., Kohli P., Krupka E., Fitzgibbon A., and Izadi S., “Accurate, Robust, and Flexible Real-Time Hand Tracking,” in Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems, Seoul, pp. 3633-3642 2015.

[26] Shotton J., Fitzgibbon A., Cook M., Sharp T., Finocchio M., Moore R., Kipman A., and Blake A., “Real-Time Human Pose Recognition in Parts from a Single Depth Image,” in Proceedings of IEEE Conference Computer A Combined Method of Skin-and Depth-based Hand Gesture Recognition 145 Vision and Pattern Recognition, Providence, pp. 1297-1304, 2011.

[27] Song Y., Gu Y., Wang P., Liu Y., and Li A., “A Kinect based Gesture Recognition Algorithm using GMM and HMM,” in Proceedings of 6th International Conference on Biomedical Engineering and Informatics, Hangzhou, pp. 750-754, 2013.

[28] Trigo T., Roberto S., and Pellegrino S., “An Analysis of Features for Hand-Gesture Classification,” in Proceedings of 17th International Conference on Systems, Signals and Image Processing, Rio de Janeiro, pp. 412- 415, 2010.

[29] Vaezi M. and Nekouie M., “3d Human Hand Posture Reconstruction Using A Single 2d Image,” International Journal of Human Computer Interaction, vol. 1, no. 4, pp. 83-94, 2011. Tuktaev Sokhib received the B.S. and M.S. degrees from Tashkent University of Information Technologies, Tashkent, Uzbekistan in 2015 and Gachon University, in the faculty of IT Convergence Engineering, Korea in 2018. His research interests include Computer Vision, Image Processing, Machine/Deep Learning and Artificial Intelligence. Taeg Keun Whangbo received the M.S. degree from City University of New York in 1988 and the Ph.D. degree both in Computer Science from Stevens Institute of Technology in 1995. Currently, he is a professor in the Department of Computer Science, Gachon University, Korea. Before he joined the Gachon University, he was the software developer in Q-Systems which is located in New Jersey from 1988 to 1993. He was also the researcher in Samsung Electronics from 2005 to 2007. From 2006 to 2008, he was the president of the Association of Korea Cultural Technology. His research areas include Computer Graphics, HCI and VR/AR.