
Automatic Plagiarism Detection Using Similarity Analysis
Plagiarism involves reproducing existing information either in a modified form or, in some cases, verbatim from the
original document. It is quite common among students, researchers, and academicians. It has strongly affected the research
community and raised awareness among academics of the need to prevent such malpractice. Although some commercial
tools exist to detect plagiarism, detection remains a tricky and quite challenging task because of the abundant information
available online. Existing commercial software adopts methods based on paraphrasing, sentence matching, or keyword matching.
Such techniques are not very effective at identifying plagiarized content. This paper therefore focuses on
identifying some key parameters that would help to detect plagiarism more effectively. The results seem promising
and leave scope for further work on plagiarism detection.
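
The abstract does not say which similarity measure or parameters the paper uses. As an illustration only of the keyword-matching style of comparison it mentions, the following minimal sketch assumes cosine similarity over keyword counts after stop-word removal. The stop-word list and the function names `keywords` and `cosine_similarity` are hypothetical and are not taken from the paper.

```python
import math
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word list (assumption, for illustration only).
STOP_WORDS = {"a", "an", "and", "is", "in", "of", "or", "the", "to"}


def keywords(text):
    """Lowercase the text, extract alphabetic words, and count the non-stop words."""
    words = re.findall(r"[a-z]+", text.lower())
    return Counter(w for w in words if w not in STOP_WORDS)


def cosine_similarity(text_a, text_b):
    """Cosine of the angle between the keyword-count vectors of two texts.

    Returns a value between 0.0 (no shared keywords) and about 1.0
    (identical keyword distribution). Assumed measure, not the paper's.
    """
    a, b = keywords(text_a), keywords(text_b)
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # at least one text has no keywords left to compare
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    source = "Plagiarism detection is a challenging task."
    suspect = "Detection of plagiarism is a challenging and tricky task."
    print(round(cosine_similarity(source, suspect), 3))
```

A score near 1 only indicates heavy keyword overlap. Such a comparison ignores word order and synonyms, which is one reason keyword matching alone can miss reworded passages.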
Shanmugasundaram Hariharan received his BE degree in computer science and engineering from Madurai Kamaraj University, Madurai, India in 2002, and his ME degree in computer science and engineering from Anna University, Chennai, India in 2004. He holds a PhD degree in the area of information retrieval from Anna University, Chennai, India. He is a member of IAENG, IACSIT, ISTE, and CSTA and has 8 years of teaching experience. He is currently working as an associate professor in the Department of Computer Science and Engineering, TRP Engineering College, India. His research interests include information retrieval, data mining, opinion mining, and web mining. He has to his credit several papers in refereed journals and conferences. He also serves as an editorial board member and as a program committee member for several international conferences and journals.