The International Arab Journal of Information Technology (IAJIT)

..............................
..............................
..............................


Synthesizing Conjunctive and Disjunctive Linear Invariants by K-means++ and SVM

The problem of synthesizing adequate inductive invariants lies at the heart of automated software verification. The state-of-the-art machine learning algorithms for synthesizing invariants have gradually shown its excellent performance. However, synthesizing disjunctive invariants is a difficult task. In this paper, we propose a method k++ Support Vector Machine (SVM) integrating k-means++ and SVM to synthesize conjunctive and disjunctive invariants. At first, given a program, we start with executing the program to collect program states. Next, k++SVM adopts k-means++ to cluster the positive samples and then applies SVM to distinguish each positive sample cluster from all negative samples to synthesize the candidate invariants. Finally, a set of theories founded on Hoare logic are adopted to check whether the candidate invariants are true invariants. If the candidate invariants fail the check, we should sample more states and repeat our algorithm. The experimental results show that k++SVM is compatible with the algorithms for Intersection Of Half-space (IOH) and more efficient than the tool of Interproc. Furthermore, it is shown that our method can synthesize conjunctive and disjunctive invariants automatically.


[1] Arthur D. and Vassilvitskii S., “K-Means++: The Advantages of Careful Seeding,” in Proceedings of the 18th Annual ACM-SIAM Symposium on Discrete Algorithms, New Orleans, pp. 1027-1035, 2007.

[2] Cortes C. and Vapnik V., “Support-Vector Networks,” Machine Learning, vol. 20, no. 3, pp. 273-297, 1995.

[3] Cousot P. and Cousot R., “Abstract Interpretation: A Unified Lattice Model for Static Analysis of Programs by Construction or Approximation of Fixpoints,” in Proceedings of the ACM Sigact-Sigplan Symposium on Principles of Programming Languages, New York, pp. 238-252, 1977.

[4] D'Silva V., Kroening D., and Weissenbacher G., “A Survey of Automated Techniques for Formal Software Verification,” IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol. 27, no. 7, pp. 1165-1178, 2008.

[5] Ding Z., Wang R., Hu J., and Liu Y., “Detecting Bugs of Concurrent Programs with Program Invariants,” in Proceedings of the IEEE International Conference on Software Quality, Reliability and Security Companion, Vienna, pp. 1-15, 2016.

[6] Garg P., Löding C., Madhusudan P., and Neider D., “ICE: A Robust Framework for Learning Invariants,” in Proceedings of the International Conference on Computer Aided Verification, Vienna, pp. 69-87, 2014.

[7] Garg P., “Learning-Based Inductive Invariant Synthesis,” Thesis, Univerisity of Illinois Urbana-Champaign, 2015.

[8] Garg P., Neider D., Madhusudan P., and Roth D., “Learning Invariants Using Decision Trees and Implication Counterexamples,” Acm Sigplan Notices, vol. 51, no. 1, pp. 499-512, 2015.

[9] Gulavani B., Chakraborty S., Nori A., and Rajamani S., “Automatically Refining Abstract Interpreta-tions,” in Proceedings of the Theory and Practice of Software, International Conference on TOOLS and Algorithms for the Construction and Analysis of Systems, Budapest, pp. 443-458, 2008.

[10] Gulavani B., Henzinger T., Kannan Y., and Nori A., “SYNERGY: A New Algorithm for Property Checking,” in Proceedings of the ACM Sigsoft Inter-national Symposium on Foundations of Software Engineering, Portland, pp. 117-127, 2006. Synthesizing Conjunctive and Disjunctive Linear Invariants by K-means++ and SVM 855

[11] Gulwani S., Srivastava S., and Venkatesan R., “Program Analysis as Constraint Solving,” in Proceedings of the ACM Sigplan Conference on Programming Language Design and Implementation, Tucson, pp. 281-292, 2008.

[12] Gupta A. and Rybalchenko A., “InvGen: An Efficient Invariant Generator,” in Proceedings of the International Conference on Computer Aided Verification, Grenoble, pp. 634-640, 2009.

[13] Hoare C., “An Axiomatic Basis for Computer Programming,” Communications of the ACM, vol. 12, no. 1, pp. 53-56, 1969.

[14] Jeannet B., “Interproc Analyzer for Recursive Programs with Numerical Variables,” http://popart.inrialpes.fr/interproc/interprocweb.cg i, pp. 6-11, Last Visited, 2010.

[15] Krishna S., Puhrsch C., and Wies T., “Learning Invariants using Decision Trees,” Computer Science, vol. 21, no. 7, pp. 44-59, 2015.

[16] Lin S., Sun J., Xiao H., Liu Y., Sanán D., and Hansen H., “FiB: Squeezing Loop Invariants by Interpolation between Forward/Backward Bredicate Transformers,” in Proceedings of the IEEE/ACM International Conference on Auto- mated Software Engineering, Urbana, pp. 793- 803, 2017.

[17] Li J., Sun J., Li L., and Le Q., “Automatic Loop- Invariant Generation and Refinement through Selective Sampling,” in Proceedings of the Ieee/Acm International Conference on Auto- mated Software Engineering, Buenos Aires, pp. 782-792, 2017.

[18] Jun S., Pham L., Thi L., Wang J., and Peng X., “Learning Likely Invariants to Explain Why a Program Fails,” in Proceedings of the International Conference on Engineering of Complex Computer Systems, Melbourne, pp. 70- 79, 2018.

[19] Mcdonald J., Trigg T., Roberts C., and Darden B., “Security in Agile Development: Pedagogic Lessons from an Undergraduate Software Engineering Case Study,” in Proceedings of Cyber Security Symposium, Coeur d'Alene, pp. 127-141, 2015.

[20] Mcmillan K., “Interpolation and SAT-Based Model Checking,” in Proceedings of the Computer Aided Verification, Berlin Heidelberg, Germany, pp. 1-13, 2003.

[21] Moura L. and Bjørner N., “Z3: An Efficient SMT Solver,” in Proceedings of International Conference on Tools and Algorithms for the Construction and Analysis of Systems, Budapest, pp. 337-340, 2008.

[22] Sharma R., Gupta S., Hariharan B., Aiken A., Liang P., and Nori A., “A Data Driven Approach for Algebraic Loop Invariants,” in Proceedings of European Conference on Programming Languages and Systems, Rome, pp. 574-592, 2013.

[23] Sharma R., “From Invariant Checking to Invariant Inference Using Randomized Search,” in Proceedings of Computer Aided Verification, Vienna, pp. 88-105, 2014.

[24] Sharma R., “Interpolants as Classifiers,” in Proceedings of International Conference on Computer Aided Verification, Berlin, pp. 71-87, 2012.

[25] Sharma R. and Aiken A., “Verification as Learning Geometric Concepts,” in Proceedings of International Static Analysis Symposium, Seattle, pp. 388-411, 2013.

[26] Stavnycha M., Yin H., and Römer T., “A Large- Scale Survey on the Effects of Selected Development Practices on Software Correctness,” in Proceedings of International Conference on Software and System Process, Korea, pp. 117-121, 2015.

[27] Uqaili I. and Ahsan S., “Machine Learning Based Prediction of Complex Bugs in Source Code,” The International Arab Journal of Information Technology, vol. 17, no. 1, pp. 26- 37, 2020.

[28] Vizel Y., Gurfinkel A., Shoham S., and Malik S., “IC3 - Flipping the E in ICE,” in Proceedings of International Conference on Verification, Model Checking and Abstract Interpretation, Paris, pp. 521-538, 2017. 856 The International Arab Journal of Information Technology, Vol. 17, No. 6, November 2020 Shengbing Ren was born on August 4, 1969 at Yueyang City, Hunan Province, China. He received his Bachelor of Science degree in Computer Software from Department of Mathematics, Huazhong Normal University, Hubei Province, China in 1992. He received his Master’s degree in Computer Application Technology in 1995 from Department of Computer, Central South University of Technology, Hunan Province, China. He received his Doctor’s degree in Control Theory and Control Engineering in 2007 from School of Information Science and Engineering, Central South University, Hunan Province, China. His research interests include: software engineering, embedded system, image processing, pattern recognition, dependable software. He is a professor in School of Computer Science and Engineering, Central South University, China. He is a Senior Member of China Computer Federation. He accomplished 11 research projects including 2 the National Natural Science Foundation of China as a key member, and over 60 papers was published. Now, he is dedicated to the research concentrated mostly on dependable software and pattern recognition. Xiang Zhang was born on April19, 1993 at Anqing City, Anhui Province, China. He received his Bachelor of Science degree in communication engineering, Central South University of forestry science and technology University, Hunan Province, China in 2015. He received his Master’s degree in software engineering, Central South University of Technology, Hunan Province, Chinain 2019.His research interests include: software engineering, machine learning. He is a student in School of Computer Science and Engineering, Central South University, China.