The International Arab Journal of Information Technology (IAJIT)

..............................
..............................
..............................


Comparative Analysis of PSO and ACO Based Feature Selection Techniques for Medical Data

Preservation,
Sensitive medical dataset consist of large number of disease attributes or features, not all these features are used for diagnosis. In order to preserve the medical dataset it is not essential to perturb all the features before it is shared for mining purpose. To reduce the computational cost and to increase the efficiency, in this work tried to use Ant Colony Optimization (ACO) for feature subset selection which is used to reduce the dimension and also compared with feature subset selection using Particle Swarm Optimization (PSO) which is also used to reduce the dimension. Both the techniques are explored to reduce the dimension before applying preservation technique. By using randomization method a known distribution is added to the reduced sensitive data before the data is sent to the miner. The approach is analyzed using standard UCI medical datasets. The result is analyzed based on classification accuracy using machine learning algorithms (Naïve Bayes, Decision Tree) build on the randomized dataset. The experimental results show that the accuracy is maintained in the reduced perturbed datasets. The results also show that ACO search based feature selection has more accuracy than PSO search based selection.


[1] Abd-Alsabour N. and Randall M., “Feature Selection for Classification Using an Ant Colony System,” in Proceedings of 6th IEEE International Conference on E-Science Workshops, Brisbane, pp. 86-91, 2010.

[2] Chouchoulas A. and Shen Q., “Rough Set-Aided Keyword Reduction for Text Categorization,” Applied Artificial Intelligence, vol. 15, no. 9, pp. 843-873, 2001.

[3] Dhanalaksmi S., Abdul Samath J., and Prashanth R ., “Fuzzy Rough Set and PSO Search Based Feature Selection for Data Preservation,” International Journal of Applied Engineering Research, vol. 10, no. 20, pp. 17767-17770, 2015

[4] Dorigo M., Ant Colony Optimization, Scholarpedia, 2007.

[5] Dorigo M. and Thomas S., “The Ant Colony Optimization Metaheuristic: Algorithms, Applications, and Advances,” Handbook of Metaheuristics, Boston, pp. 250-285, 2003.

[6] Dorigo M., Marco A., Montes O., and Engelbrecht A., Particle Swarm Optimization, Scholarpedia, 2008.

[7] Eberhart R. and Kennedy J., “A New Optimizer Using Particle Swarm Theory,” in Proceedings of the 6th International Symposium on Micro Machine and Human Science, Nagoya, pp. 39-43, 1995.

[8] Han J. and Kamber M., Data Mining Concepts and Techniques, Morgan Kaufmann, 2006.

[9] HU X., “Particle Swarm Optimization: Tutorial

[online]. 2006

[cit. 2011-05-09].” PSO Tutorial.WWW:
[10] Jaganathan P., Thangavel K., Pethalashmi A., and Karnan M., “Classification Rule Discovery with Ant Colony Optimization and Improved Quick Reduct Algorithm,” International Journal of Computer Science, vol. 33, no. 1, pp. 50-55, 2007.

[11] Jensen R. and Shen Q., “Fuzzy-Rough Attribute Reduction with Application to Web Categorization,” Fuzzy Sets and Systems, vol. 141, no. 3, pp. 469-485, 2004.

[12] Jensen R. and Shen Q., “New Approaches to Fuzzy-Rough Feature Selection,” IEEE Transactions on Fuzzy Systems, vol. 17, no. 4, pp. 824-838, 2009.

[13] Jensen R. and Shen Q., Computational Intelligence and Feature Selection: Rough and Fuzzy Approaches, John Wiley and Sons, 2008.

[14] Jensen R. and Shen Q., “Semantics-Preserving Dimensionality Reduction: Rough and Fuzzy- Rough-Based Approaches,” IEEE Transactions on Knowledge and Data Engineering, vol. 16, no. 12, pp. 1457-1471, 2004.

[15] Kennedy J. and Eberhart R., “Particle Swarm Optimization,” in Proceedings of IEEE International Conference on Neural Networks, Perth, pp. 1942-1948, 1995.

[16] Lai C., Wu C., and Tsai M., “Feature Selection Using Particle Swarm Optimization with Application in Spam Filtering,” International Journal of Innovative Computing, Information and Control, vol. 5, no. 2, pp. 423-432, 2009.

[17] Li G. and Wang Y., “A Privacy-Preserving Classification Method Based on Singular Value Decomposition,” The International Arab Journal of Information Technology, vol. 9, no. 6, pp. 529- 534, 2012.

[18] Machine Learning Repository, archive.ics.uci.edu/ml/datasets.html, Last Visited, 2014.

[19] Martens D., Backer M., Haesen R., Member S., Vanthienen J., Snoeck M., and Baesens B., 736 The International Arab Journal of Information Technology, Vol. 16, No. 4, July 2019 “Classification with Ant Colony Optimization,” IEEE Transactions on Evolutionary Computation, vol. 11, no. 5, pp. 651-665, 2007.

[20] Merkle D. and Middendorf M., “Swarm intelligence,” in Proceedings of Search Methodologies, Boston, pp. 401-435, 2005.

[21] Pawlak Z., Rough Sets: Theoretical Aspects of Reasoning About Data, Springer, 1991.

[22] Shi Y. and Eberhart R., “A Modified Particle Swarm Optimizer,” in Proceedings of IEEE International Conference on Evolutionary Computation Proceedings. IEEE World Congress on Computational Intelligence, Anchorage, pp. 69-73, 1998.

[23] Wang X., Yang J., Teng X., Xia W., and Jensen R., “Feature Selection Based on Rough Sets and Particle Swarm Optimization,” Pattern Recognition Letters, vol. 28, no. 4, pp. 459-471, 2007.

[24] Wang X., Yang T., Jensen R., and Liu X., “Rough Set Feature Selection and Rule Induction for Prediction of Malignancy Degree in Brain Glioma,” Computer Methods Programs Biomedicine, vol. 83, no. 2, pp. 147-156, 2006.

[25] Xue B., Zhang M., and Browne W., “Multi- Objective Particle Swarm Optimisation (PSO) For Feature Selection,” in Proceedings of the 14th Annual Conference on Genetic and Evolutionary Computation, Philadelphia, pp. 81-88, 2012. Dhanalakshmi Selvarajan had received her Ph.D Degree in computer applications from Anna University. Her research interest includes privacy preserving data mining, security issues in databases. Abdul Samath Abdul Jabar had received his Ph.D Degree from Gandhigram Rural University. He is a member in several professional bodies likes CSI, ISCA, LIA, INNS, IAENG and ISTE. His research interest includes neural networks, simulation, image processing, data mining. Irfan Ahamed had received his Ph.D Degree from Alagappa University. He is also a member in several professional bodies. His research interests include networks, image processing and data mining.