The International Arab Journal of Information Technology (IAJIT)

..............................
..............................
..............................


Analysis and Performance Evaluation of Cosine Neighbourhood Recommender System

Growth of technology and innovation leads to large and complex data which is coined as Bigdata. As the quantity of information increases, it becomes more difficult to store and process data. The greater problem is finding right data from these enormous data. These data are processed to extract the required data and recommend high quality data to the user. Recommender system analyses user preference to recommend items to user. Problem arises when Bigdata is to be processed for Recommender system. Several technologies are available with which big data can be processed and analyzed. Hadoop is a framework which supports manipulation of large and varied data. In this paper, a novel approach Cosine Neighbourhood Similarity measure is proposed to calculate rating for items and to recommend items to user and the performance of the recommender system is evaluated under different evaluator which shows the proposed Similarity measure is more accurate and reliable.


[1] Abbas A. and Liu J., Designing an Intelligent Recommender System using Partial Credit Model and Bayesian Rough Set, The International Arab Journal of Information Technology, vol. 9, no. 2, pp. 179-187, 2012.

[2] Adomavicius G. and Tuzhilin A., Toward the Next Generation of Recommender Systems: A Survey of the State-of-the-Art and Possible Extensions, IEEE Transactions on Knowledge and Data Engineering, vol. 17, no. 6, pp. 734- 749, 2005.

[3] Anderson C., Real Time Data from Big Time Fun: Harnessing the Power of Mobile Technology and Social Media for Better Species Management, in proceeding of 2012 Oceans, California, pp.1-5, 2012.

[4] Bell R. and Koren Y., Scalable Collaborative Filtering with Jointly Derived Neighborhood Interpolation Weights, in proceeding of Seventh IEEE International Conference on Data Mining (ICDM 2007), USA, pp. 43-52, 2007.

[5] Boyd K., Costa V., Davis J. and Page C., Unachievable Region in Precision-Recall Space and Its Effect on Empirical Evaluation, in Proceeding of the 29th International Conference on Machine Learning, Edinburgh, pp. 1-8, 2012.

[6] Cremonesi P., Koren Y., and Turrin R., Performance of Recommender Algorithms on Top-N Recommendation Tasks, in Proceeding of RecSys 10 Proceedings of the 4th ACM Conference on Recommender Systems, New York, pp. 39-46, 2010.

[7] Demchenko Y., Zhiming Z., Grosso P., and Wibisono, A., de Laat, C., Addressing Big Data challenges for Scientific Data Infrastructure, in proceeding of IEEE 4th International Conference on Cloud Computing Technology and Science, Taipei, pp. 614-617, 2012.

[8] Elena G. and Olma F., Evaluation of recommender systems: A new approach, Expert Systems with Applications, vol. 35, no. 3, pp. 790-804, 2008.

[9] GroupLens, http://grouplens.org/datasets/movielens, Last Visited 2014.

[10] Gubanov M. and Pyayt A., MEDREADFAST: A Structural Information Retrieval Engine for Big Clinical Text, in proceeding of IEEE 13th International Conference on Information Reuse and Integration (IRI), USA, pp. 8-10, 2012.

[11] Gunawardana A., Shani G., Survey of Accuracy Evaluation Metrics of Recommendation Tasks, Analysis and Performance Evaluation of Cosine Neighbourhood Recommender System 753 Journal of Machine Learning Research, vol. 10 pp. 2935-2962, 2009.

[12] Han X., Tian L., Yoon M., and Lee M., A Big Data Model Supporting Information Recommendation in Social Networks, in proceeding of Second International Conference on Cloud and Green Computing, Washington, pp. 1-3, 2012.

[13] Humbetov S., Data-intensive Computing with Map-Reduce and Hadoop, in procedding of 6th International Conference on Application of Information and Communication Technologies, Georgia, pp. 17-19, 2012.

[14] Ibm, http://www.ibm.com/software/data/infosphere/ha doop/what-is-big-data-analytics.html, Last Visited 2014.

[15] Kaisler S., Armour F., Espinosa J., and Money W., Big Data: Issues and Challenges Moving Forward, in procedding of 46th Hawaii International Conference on System Sciences, Wailea, pp. 7-10, 2013.

[16] Kim J., Yang M., Hwang Y., Jeon S., Kim K., Jung I., Choi C., Cho W., and Na J., Customer Preference Analysis Based on SNS Data, in procedding of Second International Conference on Cloud and Green Computing, Washington, pp. 609-613, 2012.

[17] Li N., Zhang N., and Das S., Preserving Relation Privacy in Online Social Network Data, Internet Computing IEEE , vol. 15, no. 3, pp.35- 42, 2011.

[18] Mahout, http://mahout.apache.org/users/recommender/reco mmender-documentation.html, Last visited 2014.

[19] Martha V., Zhao W., and Xu X., h-MapReduce: A Framework for Workload Balancing in MapReduce, in procedding of IEEE 27th International Conference on Advanced Information Networking and Applications, Barcelona, pp. 637-644, 2013.

[20] Palit I. and Reddy C., Scalable and Parallel Boosting with MapReduce, IEEE Transactions on Knowledge and Data Engineering, vol. 24, no. 10, pp. 1904-1916, 2012.

[21] Smith M., Szongott C., Henne B., Von V., Big Data Privacy Issues in Public Social Media, in procedding of 6th IEEE International Conference on Digital Ecosystems Technologies, Campione d'Italia, pp.1-6, 2012.

[22] Vozalis E. and Margaritis E., Analysis of Recommender Systems Algorithms, in Proceeding of the 6th Hellenic European Conference on Computer Mathematics and its Applications, Athens, pp. 1-14, 2003.

[23] Wikipedia, http://en.wikipedia.org/wiki/Big_data, Last Visited 2014.

[24] Wikipedia, http://en.wikipedia.org/wiki/Recommender_syst em, Last Visited 2014. Kola Sujatha Periyasamy is a Senior Assistant Professor in Madras Institute of Technology of Anna University, India. She received her M.C.A. in Computer Applications in 1999, M.E in Computer Science and Engineering in 2003 and her Ph.D degree in Computer Science and Engineering in 2013 from College of Engineering, Guindy, Anna University, India. She has 11 years teaching experience in the branch of Information Technology. Her current research focuses on Data Mining and Big Data Analytics. Jayadharini Jaiganesh completed her Bachelor Degree in Information Technology in Madras Institute of Technology, Anna University, India. Her areas of interests are data mining and image processing. She has published a paper IEEE Conference. She is pursuing her research in the area of data mining. Kanchan Kumar Ponnambalam completed his Bachelor Degree in Information Technology in Madras Institute of Technology, Anna University, India. He is well versed in Photoshop. His areas of interests are data mining and Operating Systems. Jeevitha Rajasekar completed her Bachelor Degree in Information Technology in Madras Institute of Technology, Anna University, India. Her areas of interests are data mining and soft computing. 754 The International Arab Journal of Information Technology, Volume 14, No. 5, September 2017 Kannan Arputharaj is a Professor and Head of the Department of Information Science and Technology, College of Engineering, Anna University, India. He received his B.Sc. degree in Mathematics from Madurai Kamaraj University in 1979, his Master degree in Mathematics from Annamalai University in 1986, his M.E degree in Computer Science and Engineering from College of Engineering, Guindy, Anna University, India in 1991 and his Doctorate in Intelligent Temporal Databases from the Faculty of Electrical Engineering, Anna University, India in 2000. He worked as a Programmer in Bhabha Atomic Research Centre (BARC), Mumbai, India from 1981 to 1989. He has published numerous papers in various International journals and conferences. He has 27 years of teaching experience. His area of research includes Database Management System, Artificial Intelligence, Big Data Analytics, Data Mining and Software Engineering.