Improved Collaborative Filtering Recommender System Based on Hybrid Similarity Measures
Recommender Systems (RS) based on collaborative filtering has been successfully applied to provide relevant and personalized recommendations from an enormous amount of data in various domains. To achieve this, similarity measurements, such as the Pearson Correlation Coefficient (PCC), Cosine, and Jaccard, are used to compute the similarity between users or items based on correlations among user preferences from the user-item rating matrix. However, existing similarity metrics suffer from drawbacks emanating from data sparsity caused by insufficient number of transactions and feedback and scalability of the system’s ability to handle increasing amounts of data efficiently. The objective of this study is to improve the recommendation quality and increase the prediction accuracy by addressing the problems of similarity computation in collaborative filtering. This paper presents a hybrid similarity measure that combines Adjusted Triangle similarity, User Rating Preference behavior, and the Jaccard (ATURPJ) coefficient. The proposed hybrid similarity measures were evaluated on four widely used and publicly available datasets, MovieLens, FilmTrust, and CiaoDVD, using the predictive accuracy metrics of Mean Absolute Error (MAE) and Root Mean Square Error (RMSE) and recommendation quality of Precision, Recall, and F-measure. The experimental results show that the proposed hybrid similarity measure significantly outperforms existing approaches with MAE of 0.547 and RMSE of 0.735 compared to the baseline of 0.707 and 0.903 respectively on ML-100k dataset. Overall, this approach has the potential to improve the quality of recommendation and accuracy of the prediction.
[1] Abdalla H., Amer A., Amer Y., Nguyen L., and Al-Maqaleh ., “ oosting the Item-based Collaborative Filtering Model with Novel Similarity Measures,” International Journal of Computational Intelligence Systems, vol. 16, no. 1, pp. 1-16, 2023. https://link.springer.com/article/10.1007/s44196- 023-00299-2
[2] Al Hassanieh L., Abou Jaoudeh C., Bou Abdo J., and Demerjian J., “Similarity Measures for Collaborative Filtering Recommender Systems,” in Proceedings of the IEEE Middle East and North Africa Communications Conference, Jounieh, pp. 1-5, 2018. DOI:10.1109/MENACOMM.2018.8371003
[3] Al-bashiri H., Abdulgabber M., Romli A., and Hujainah F., “Collaborative Filtering Similarity Measures: Revisiting,” in Proceedings of the International Conference on Advances in Image Processing, Bangkok, pp. 195-200, 2017. https://doi.org/10.1145/3133264.3133299
[4] Alshammari G., Kapetanakis S., Polatidis N., and Petridis M., “A Triangle Multi-Level Item-Based Collaborative Filtering Method that Improves Recommendations,” in Proceedings of the 19th International Conference on Engineering Applications of Neural Networks, Bristol, pp. 145- 157, 2018. https://link.springer.com/chapter/10.1007/978-3- 319-98204-5_12
[5] Amer A., Abdalla H., and Nguyen L., “Enhancing Recommendation Systems Performance Using Highly-Effective Similarity Measures,” Knowledge-Based Systems, vol. 217, pp. 106842, 2021. https://doi.org/10.1016/j.knosys.2021.106842
[6] Arnold M., Goldschmitt M., and Rigotti T., “Dealing with Information Overload: A Comprehensive Review,” Frontiers in Psychology, vol. 14, pp. 1-23, 2023. https://doi.org/10.3389/fpsyg.2023.1122200
[7] Ayub M., Ghazanfar M., Mehmood Z., Saba T., Alharbey R., Munshi A., and Alrige M., “Modeling User Rating Preference ehavior to Improve the Performance of the Collaborative Filtering ased Recommender Systems,” PLoS One, vol. 14, no. 8, pp. 1-29, 2019. DOI:10.1371/journal.pone.0220129
[8] ag S., Kumar S., and Tiwari M., “An Efficient Recommendation Generation Using Relevant Jaccard Similarity,” Information Sciences, vol. 483, pp. 53-64, 2019. https://doi.org/10.1016/j.ins.2019.01.023
[9] obadilla J., Ortega F., and Hernando A., “A Collaborative Filtering Similarity Measure Based on Singularities,” Information Processing and Management, vol. 48, no. 2, pp. 204-217, 2012. https://doi.org/10.1016/j.ipm.2011.03.007
[10] Bobadilla J., Ortega F., Hernando A., and Gutierrez A., “Recommender Systems Survey,” Knowledge-Based Systems, vol. 46, pp. 109-132, 2013. DOI:10.1016/j.knosys.2013.03.012
[11] Breese J., Heckerman D., and Kadie C., “Empirical Analysis of Predictive Algorithms for Collaborative Filtering,” in Proceedings of the 14th Conference on Uncertainty in Artificial Intelligence, Wisconsin, pp. 43-52, 1998. https://dl.acm.org/doi/10.5555/2074094.2074100
[12] urke R., “Hybrid Recommender Systems: Survey and Experiments,” User Modelling and User-Adapted Interaction, vol. 12, no. 4, pp. 331- 370, 2002. https://link.springer.com/article/10.1023/A:10212 40730564
[13] Chen R., Hua Q., Chang Y., Wang B., Zhang L., and Kong X., “A Survey of Collaborative Filtering-Based Recommender Systems: From Traditional Methods to Hybrid Methods Based on Social Networks,” IEEE Access, vol. 6, pp. 64301- 64320, 2018. Improved Collaborative Filtering Recommender System Based on Hybrid Similarity Measures 113 https://ieeexplore.ieee.org/document/8506344
[14] Chen Z., Gan W., Wu J., Hu K., and Lin H., “Data Scarcity in Recommendation Systems: A Survey,” ACM Transactions on Recommender Systems, vol. 1, no. 1, pp. 1-32, 2024. https://doi.org/10.1145/3639063
[15] Donchev D., 37 Mind Blowing YouTube Facts, Figures and Statistics, Fortunelords, https://fortunelords.com/youtube-statistics/, Last Visited, 2024.
[16] Fayyaz Z., Ebrahimian M., Nawara D., Ibrahim A., and Kashef R., “Recommendation Systems: Algorithms, Challenges, Metrics, and Business Opportunities,” Applied Sciences, vol. 10, no. 21, pp. 1-20, 2020. DOI:10.3390/app10217748
[17] Feng J., Feng X., Zhang N., and Peng J., “An Improved Collaborative Filtering Method Based on Similarity,” PLoS One, vol. 13, no. 9, pp. 1-18, 2018. DOI:10.1371/journal.pone.0204003
[18] Fkih F., “Similarity Measures for Collaborative Filtering-based Recommender Systems: Review and Experimental Comparison,” Journal of King Saud University-Computer and Information Sciences, vol. 34, no. 9, pp. 7645-7669, 2022. https://doi.org/10.1016/j.jksuci.2021.09.014
[19] Guo G., Zhang J., and Yorke-Smith N., “A Novel Evidence-Based Bayesian Similarity Measure for Recommender Systems,” ACM Transactions on the Web, vol. 10, no. 2, pp. 1-30, 2016. https://doi.org/10.1145/2856037
[20] Guo G., Zhang J., Thalmann D., and Yorke-Smith N., “ETAF: An Extended Trust Antecedents Framework for Trust Prediction,” in Proceedings of the IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, Beijing, pp. 540-547, 2014. DOI:10.1109/ASONAM.2014.6921639
[21] Harper F. and Konstan J., “The MovieLens Datasets: History and Context,” ACM Transactions on Interactive Intelligent Systems, vol. 5, no. 4, pp. 1-19, 2016. https://doi.org/10.1145/2827872
[22] Hug N., “Surprise: A Python Library for Recommender Systems,” Journal of Open-Source Software, vol. 5, no. 52, pp. 2174, 2020. DOI:10.21105/joss.02174
[23] Isinkaye F., Folajimi Y., and Ojokoh B., “Recommendation Systems: Principles, Methods and Evaluation,” Egyptian Informatics Journal, vol. 16, no. 3, pp. 261-273, 2015. https://doi.org/10.1016/j.eij.2015.06.005
[24] Jaccard P., “The Distribution of the Flora in the Alpine Zone,” New Phytologist, vol. 11, no. 2, pp. 37-50, 1912. https://nph.onlinelibrary.wiley.com/doi/abs/10.11 11/j.1469-8137.1912.tb05611.x
[25] Jain G., Mahara T., and Tripathi K., Soft Computing: Theories and Applications, Springer, 2020. https://link.springer.com/chapter/10.1007/978- 981-15-0751-9_32
[26] Jain G., Mahara T., Sharma S., and Sangaiah A., “A Cognitive Similarity-Based Measure to Enhance the Performance of Collaborative Filtering-based Recommendation System,” IEEE Transactions on Computational Social Systems, vol. 9, no. 6, pp. 1785-1793, 2022. https://ieeexplore.ieee.org/document/9829416
[27] Jannach D. and Zanker M., Collaborative Recommendations, World Scientific, 2018. https://doi.org/10.1142/9789813275355_0001
[28] Jannach D., Pu P., Ricci F., and Zanker M., “Recommender Systems: Past, Present, Future,” Association for the Advancement of Artificial Intelligence Magazine, vol. 42, no. 3, pp. 1-6, 2021. https://doi.org/10.1609/aimag.v42i3.18139
[29] Kaminskas M. and ridge D., “Diversity, Serendipity, Novelty, and Coverage,” ACM Transactions on Interactive Intelligent Systems, vol. 7, no. 1, pp. 1-42, 2017. DOI:10.1145/2926720
[30] Kamta S. and Verma V., Evolutionary Computing and Mobile Sustainable Networks, Springer, 2021. https://doi.org/10.1007/978-981-15-5258-8_80
[31] Kapoor K., Tamilmani K., Rana N., Patil P., Dwivedi Y., and Nerur S., “Advances in Social Media Research: Past, Present and Future,” Information Systems Frontiers, vol. 20, no. 3, pp. 531-558, 2018. https://link.springer.com/article/10.1007/s10796- 017-9810-y
[32] Knudsen E., “Modeling News Recommender Systems’ Conditional Effects on Selective Exposure: Evidence from two Online Experiments,” Journal of Communication, vol. 73, no. 2, pp. 138-149, 2023. https://doi.org/10.1093/joc/jqac047
[33] Koutrika G., Bercovitz B., and Garcia-Molina H., “FlexRecs: Expressing and Combining Flexible Recommendations,” in Proceedings of the International Conference on Management of Data and 28th Symposium on Principles of Database Systems, Rhode Island, pp. 745-757, 2009. https://doi.org/10.1145/1559845.1559923
[34] Liu H., Hu Z., Mian A., Tian H., and Zhu X., “A New User Similarity Model to Improve the Accuracy of Collaborative Filtering,” Knowledge- Based Systems, vol. 56, pp. 156-166, 2014. https://doi.org/10.1016/j.knosys.2013.11.006
[35] Manochandar S. and Punniyamoorthy M., “A New User Similarity Measure in a New Prediction Model for Collaborative Filtering,” Applied Intelligence, vol. 51, no. 1, pp. 586-615, 2021. https://doi.org/10.1007/s10489-020-01811-3
[36] Manouselis N., Drachsler H., Verbert K., and Santos O., “Workshop on Recommender Systems 114 The International Arab Journal of Information Technology, Vol. 22, No. 1, January 2025 for Technology Enhanced Learning,” in Proceedings of the 4th ACM Conference on Recommender Systems, Barcelona, pp. 377-378, 2010. https://dl.acm.org/doi/10.1145/1864708.1864797
[37] Marsh E., Perez Vallejos E., and Spence A., “Overloaded by Information or Worried about Missing out on it: A Quantitative Study of Stress, Burnout, and Mental Health Implications in the Digital Workplace,” Sage Open, vol. 14, no. 3, pp. 1-17, 2024. https://doi.org/10.1177/21582440241268830
[38] Meissa M., Benharzallah S., Kahloul L., and Kazar O., “A Personalized Recommendation for Web API Discovery in Social Web of Things,” The International Arab Journal of Information Technology, vol. 18, no. 3A, pp. 438-445, 2021. https://doi.org/10.34028/iajit/18/3A/7
[39] Nijor S., Rallis G., Lad N., and Gokcen E., “Patient Safety Issues from Information Overload in Electronic Medical Records Background and Objective: Electronic Health Records (EHRs) Have,” Journal of Patient Safety, vol. 18, no. 6, pp. 999-1003, 2022. https://pmc.ncbi.nlm.nih.gov/articles/PMC942276 5/
[40] Pearson K., “Note on Regression and Inheritance in the Case of two Parents,” Proceedings of the Royal Society of London, vol. 58, no. 347-352, pp. 240-242, 1895. https://royalsocietypublishing.org/doi/abs/10.109 8/rspl.1895.0041
[41] Resnick P., Iacovou N., Suchak M., Bergstrom P., and Riedl J., “GroupLens: An Open Architecture for Collaborative Filtering of Netnews,” in Proceedings of the ACM Conference on Computer Supported Cooperative Work, Carolina, pp. 175- 186, 1994. https://doi.org/10.1145/192844.192905
[42] Roetzel P., “Information Overload in the Information Age: A Review of the Literature from Business Administration, Business Psychology, and Related Disciplines with a Bibliometric Approach and Framework Development,” Business Research, vol. 12, no. 2, pp. 479-522, 2019, https://link.springer.com/article/10.1007/s40685- 018-0069-z
[43] Roy D. and Dutta M., “A Systematic Review and Research Perspective on Recommender Systems,” Journal of Big Data, vol. 9, no. 1, pp. 1-36, 2022. https://doi.org/10.1186/s40537-022-00592-5
[44] Sarwar B., Karypis G., Konstan J., and Riedl J., “Analysis of Recommendation Algorithms for e- Commerce,” in Proceedings of the 2nd ACM Conference on Electronic Commerce, Minnesota, pp. 158-167, 2000. https://dl.acm.org/doi/10.1145/352871.352887
[45] Shi Y., Larson M., and Hanjalic A., “Collaborative Filtering beyond the User-Item Matrix: A Survey of the State of the Art and Future Challenges,” ACM Computing Surveys, vol. 47, no. 1, pp. 1-45, 2014. https://dl.acm.org/doi/10.1145/2556270
[46] Silveira T., Zhang M., Lin X., Liu Y., and Ma S., “How Good your Recommender System is? A Survey on Evaluations in Recommendation,” International Journal of Machine Learning and Cybernetics, vol. 10, no. 5, pp. 813-831, 2019. https://link.springer.com/article/10.1007/s13042- 017-0762-9
[47] Sun S., Zhang Z., Dong X., Zhang H., Li T., Zhang L., and Min F., “Integrating Triangle and Jaccard Similarities for Recommendation,” PLoS One, vol. 12, no. 8, pp. 1-16, 2017. https://doi.org/10.1371/journal.pone.0183570
[48] Suryakant and Mahara T., “A New Similarity Measure Based on Mean Measure of Divergence for Collaborative Filtering in Sparse Environment,” Procedia Computer Science, vol. 89, pp. 450-456, 2016. https://doi.org/10.1016/j.procs.2016.06.099
[49] Yuan J., Shalaby W., Korayem M., Lin D., Aljadda K., and Luo J., “Solving Cold-Start Problem in Large-Scale Recommendation Engines: A Deep Learning Approach,” in Proceedings of the IEEE International Conference on Big Data, Big Data, Washington (DC), pp. 1901-1910, 2016. DOI:10.1109/BIGDATA.2016.7840810
[50] Zhang . and Yuan ., “Improved Collaborative Filtering Recommendation Algorithm of Similarity Measure,” AIP Conference Proceedings, vol. 1839, no. 1, 2017. https://doi.org/10.1063/1.4982532
[51] Zhang Z., Patra B., Yaseen A., Zhu J., Sabharwal R., Roberts K., Cao T., and Wu H., “Scholarly Recommendation Systems: A Literature Survey,” Knowledge and Information Systems, vol. 65, no. 11, pp. 4433-4478, 2023. DOI:10.1007/s10115- 023-01901-x
[52] Zolaktaf Z., AlOmeir O., and Pottinger R., “ ridging the Gap etween User-Centric and Offline Evaluation of Personalized Recommendation Systems,” in Proceedings of the Adjunct Publication of the 26th Conference on User Modeling, Adaptation and Personalization, Singapore, pp. 183-186, 2018. DOI:10.1145/3213586.3226216