The Strategy of Discriminating False Comments on the Internet by Fusing Probabilistic Topic and Word Vector Models

Author Fei Long,

Keywords #Probabilistic topics #word vectors #online reviews #false detection #CNN

Abstract

With the acceleration of the social process of “Internet+”, e-commerce has entered an era of rapid development. In response to the subjective judgments in current false information detection methods and the problem of low classification accuracy caused by the failure to extract semantic information hidden in text, this study proposes Latent Dirichlet Allocation- Word vector-Edge Convolutional Neural Network (LW-ECNN) Model. First, the probabilistic topic model is organically combined with the word vector model, and then the edge path structure is proposed to optimize the Convolutional Neural Network (CNN) discriminative classification model. Finally, the data-side optimization module is combined with the edge CNN classifier to form a web-based false comment discrimination model based on the probabilistic topic and word vector models. The results show that the probabilistic topic and word vector-based online false comment discrimination model has good results in detecting false comments, and can provide data reference for false comment detection by false detection-related departments, which is of great practical significance to assist the network environment cleaning work.

References

[1] Aljadani E., Assiri F., and Alshutayri A., “Detecting Spam Reviews in Arabic by Deep Learning,” The International Arab Journal of Information Technology, vol. 21, no. 3, pp. 495- 505, 2024. https://doi.org/10.34028/iajit/21/3/12

[2] Alsharif N., “Fake Opinion Detection in an e- Commerce Business Based on a Long-Short Memory Algorithm,” Soft Computing, vol. 26, no. 16, pp. 7847-7854, 2022. https://doi.org/10.1007/s00500-022-06806-5

[3] Alsubari S., Deshmukh S., Alqarni A., Alsharif N., and Aldhyani T., “Data Analytics for The Identification of Fake Reviews Using Supervised Learning,” Computers, Materials and Continua, vol. 70, no. 2, pp. 3189-3204, 2022. https://doi.org/10.32604/cmc.2022.019625

[4] Beer D. and Matthee M., “Approaches to Identify Fake News: A Systematic Literature Review,” in Proceedings of the Integrated Science in Digital Age, Batumi, pp. 13-22, 2020. https://doi.org/10.1007/978-3-030-49264-9_2

[5] Budhi G., Chiong R., Wang Z., and Dhakal S., “Using a Hybrid Content-Based and Behaviour- Based Featuring Approach in a Parallel Environment to Detect Fake Reviews,” Electronic Commerce Research and Applications, vol. 47, no. 1, pp. 101048, 2021. https://doi.org/10.1016/j.elerap.2021.101048

[6] Cao C., Li S., Yu S., and Chen Z., “Fake Reviewer Group Detection in Online Review Systems,” in Proceedings of the International Conference on Data Mining Workshops, pp. 935-942, 2021. DOI: 10.1109/ICDMW53433.2021.00122

[7] Chen L. and Zhu H., “Behavior Prediction Based on a Commodity Utility-Behavior Sequence Model,” Machine Learning with Applications, vol. 9, pp. 100314, 2022. https://doi.org/10.1016/j.mlwa.2022.100314

[8] Fang Y., Wang H., Zhao L., Yu F., and Wang C., “Dynamic Knowledge Graph Based Fake Review Detection,” Applied Intelligence, vol. 50, pp. 4281-4295, 2020. https://doi.org/10.1007/s10489- 020-01761-w

[9] Fowler A. and Montagnes B., “Distinguishing between False Positives and Genuine Results: The Case of Irrelevant Events and Elections,” The Journal of Politics, vol. 85, no. 1, pp. 304-309, 2023. https://doi.org/10.1086/719636

[10] Hameleers M., Meer T., and Vliegenthart R., “Civilized Truths, Hateful Lies? Incivility and Hate Speech in False Information-Evidence from Fact-Checked Statements in the US. Information,” Communication and Society, vol. 25, no. 11, pp. 1596-1613, 2022. https://doi.org/10.1080/1369118X.2021.1874038

[11] He S., Hollenbeck B., and Proserpio D., “The Market for Fake Reviews,” Marketing Science, vol. 41, no. 5, pp. 896-921, 2022. https://doi.org/10.1287/mksc.2022.1353

[12] Himangshu P. and Nikolaev A., “Fake Review Detection on Online E-commerce Platforms: A Systematic Literature Review,” Data Mining and Knowledge Discovery, vol. 35, no. 5, pp. 1830- 1881, 2021. https://doi.org/10.1007/s10618-021- 00772-6

[13] Hou J. and Zhu A., “Fake Online Review Recognition Algorithm and Optimisation Research Based on Deep Learning,” Applied Mathematics and Nonlinear Sciences, vol. 7, no. 2, pp. 861-874, 2021. https://doi.org/10.2478/amns.2021.2.00170

[14] Hui L., “A Review of Research on Identification of False Reviews in E-Commerce,” Journal of Management and Humanity Resources, vol. 3, pp. 9-15, 2020. http://dx.doi.org/10.22457/jmhr.v03a02102

[15] Jain P., Pamula R., and Srivastava G., “A Systematic Literature Review on Machine Learning Applications for Consumer Sentiment Analysis Using Online Reviews,” Computer Science Review, vol. 41, pp. 100413, 2021. https://doi.org/10.1016/j.cosrev.2021.100413 The Strategy of Discriminating False Comments on the Internet by Fusing... 529

[16] Kauffmann E., Peral J., Gil D., Ferrández A., and Sellers R., “A Framework for Big Data Analytics in Commercial Social Networks: a Case Study on Sentiment Analysis and Fake Review Detection for Marketing Decision-Making,” Industrial Marketing Management, vol. 90, pp. 523-537, 2020. https://doi.org/10.1016/j.indmarman.2019.08.003

[17] Kolhar M., “E-Commerce Review System to Detect False Reviews,” Science and Engineering Ethics, vol. 24, pp. 1577-1588, 2018. https://doi.org/10.1007/s11948-017-9959-2

[18] Kumar N. and Kumar R., “The Application of Artificial Intelligence in Electronic Commerce,” Turkish Journal of Computer and Mathematics Education, vol. 12, no. 12, pp. 1679-1682, 2021.

[19] Li L., Fan L., Atreja S., and Hemphill L., ““HOT” ChatGPT: The Promise of ChatGPT in Detecting and Discriminating Hateful, Offensive, and Toxic Comments on Social Media,” ACM Transactions, vol. 18, no. 2, pp. 1-36, 2024. https://doi.org/10.1145/3643829

[20] Mohawesh R., Tran S., Ollington R., and Xu S., “Analysis of Concept Drift in Fake Reviews Detection,” Expert Systems with Applications, vol. 169, pp. 114318, 2021. https://doi.org/10.1016/j.eswa.2020.114318

[21] Mohawesh R., Xu S., Springer M., Al-Hawawreh M., and Maqsood S., “Fake or Genuine? Contextualised Text Representation for Fake Review Detection,” arXiv preprint, arXiv: 2112.14343, 2021. https://doi.org/10.48550/arXiv.2112.14343

[22] Mohawesh R., Xu S., Tran S., Ollington R., and Springer M, “Fake Reviews Detection: A Survey,” IEEE Access, vol. 9, pp. 65771-65802, 2021. https://doi.org/10.1109/ACCESS.2021.3075573

[23] Moon S., Kim M., and Iacobucci D., “Content Analysis of Fake Consumer Reviews by Survey- Based Text Categorization,” International Journal of Research in Marketing, vol. 38, no. 2, pp. 343- 364. 2021. https://doi.org/10.1016/j.ijresmar.2020.08.001

[24] Plotkina D., Munzel A., and Pallud J., “Illusions of Truth-Experimental Insights into Human and Algorithmic Detections of Fake Online Reviews,” Journal of Business Research, vol. 109, pp. 511- 523, 2020. https://doi.org/10.1016/j.jbusres.2018.12.009

[25] Ruan N., Deng R., and Su C., “GADM: Manual Fake Review Detection for O2O Commercial Platforms,” Computers and Security, vol. 88, pp. 101657, 2020. https://doi.org/10.1016/j.cose.2019.101657

[26] Wang X., Zhou T., Wang X., and Fang Y., “Harshness-Aware Sentiment Mining Framework for Product Review,” Expert Systems with Applications, vol. 187, pp. 115887, 2022. https://doi.org/10.1016/j.eswa.2021.115887

[27] Wu Y., Ngai E., Wu P., and Wu C., “Fake Online Reviews: Literature Review, Synthesis, and Directions for Future Research,” Decision Support Systems, vol. 132, pp. 113280, 2020. https://doi.org/10.1016/j.dss.2020.113280

[28] Zhang W., Du Y., Yoshida T., and Wang Q., “DRI- RCNN: An Approach to Deceptive Review Identification Using Recurrent Convolutional Neural Network,” Information Processing and Management, vol. 54, no. 4, pp. 576-592, 2018. https://doi.org/10.1016/j.ipm.2018.03.007

[29] Zhou X. and Zafarani R., “A Survey of Fake News: Fundamental Theories, Detection Methods, and Opportunities,” ACM Computing Surveys, vol. 53, no. 5, pp. 3395046, 2020. https://doi.org/10.1145/3395046