Survey on General Classification Techniques for Effective Bug Triage

Nitu Bhardwaj, A.S Bhattacharya

Citation :

Nitu Bhardwaj, A.S Bhattacharya, "Survey on General Classification Techniques for Effective Bug Triage," International Journal of Computer Science and Engineering , vol. 2, no. 11, pp. 6-10, 2015. Crossref, https://doi.org/10.14445/23488387/IJCSE-V2I11P102

Abstract

Data mining is the process of extraction of hidden and useful information from huge data. It is also called knowledge discovery process from data. Bug tracking systems are made to manage bug reports, which are collected from various sources. These bug reports are needed to be labeled as security bug reports or non-security bug reports. Data mining uses to apply mining algorithm to extract information which is stored in bug tracking systems. Classification is a task of data mining. Data mining can be applied to any kind of data as long as the data are meaningful for a target application. The most basic forms of data for mining applications are database data, data warehouse data and transactional data. This paper presents a survey on several classification techniques for effective bug triage which are generally used for data mining such as naïve bayes, decision tree, K- nearest neighbor, Rule based, neural network etc.

Keywords

Bug report, classification, naïve bayes, decisiontree, K-nearest neighbor, Rule based, neural network.

References

[1] Trevor Hastie and SaharonRosset," The Entire Regularization Pathfor the Support Vector Machine", Journal of Machine LearningResearch, pp 1391-1415, 2004
[2] J. Han and M. Kamber, Data mining concepts and techniques, MorganKaufmann, San Francisco 2006.
[3] A. Darwiche, Modeling and Reasoning with Bayesian Networks,Cambridge University Press, 2009.
[4] JiangtaoRen and Sau Dan Lee, “Naïve Bayes Classification ofUncertain Data”, IEEE, pp 944-949, 2009
[5] N. Suguna and Dr. K. Thanushkodi, “An Improved k- NearestNeighbor Classification Using Genetic Algorithm”, IJCSI, pp 18-21,2010
[6] I.H. Witten, E. Frank and M.A. Hall, Data mining practical machinelearning tools and techniques, Morgan Kaufmann publisher,Burlington 2011.
[7] KanhaiyaLal, N.C.Mahanti, "Role of soft computing as a tool indata mining”, IJCSIT, pp 526- 537, 2011.
[8] B V Chowdary& Annapurna Gummadi, “Decision Tree InductionApproach for Data Classification Using Peano Count Tree”,IJARCSSE, pp 475-479, 2012.
[9] A. S. Galathiya& A. P. Ganatra, “Improved Decision Tree InductionAlgorithm with Feature Selection, Cross Validation, ModelComplexity and Reduced Error Pruning ”, IJCSIT, pp 3427-3431,2012.
[10] SainaniArpitha and P.Raja Prakash Rao,"Clustering Algorithm forText Classification Using Fuzzy Logic”, IJARCSSE, pp 258-262,2012.
[11] NanditaSengupta and Jaya Sil," Evaluation of Rough Set TheoryBased Network Traffic Data Classifier Using DifferentDiscretization Method", IJIEE, pp 338-341, 2012.
[12] M. Thangaraj&C.R.Vijayalakshmi, “Performance Study on RulebasedClassification Techniques across Multiple DatabaseRelations”, IJAIS, pp 1-7, 2013.
[13] B.Madasamy&Dr.J.JebamalarTamilselvi, “ImprovingClassification Accuracy of Neural Network through ClusteringAlgorithms”, IJCTT, pp 3242-3246, 2013.
[14] Revathi N and Anjana Pete," Web Text Classification Using GeneticAlgorithm and a Dynamic Neural Network Model", IJARCET, pp436-442, 2013
[15] Ming Yao,” Research on Learning Evidence Improvement for kNNBased Classification Algorithm”, IJDTA, pp 103- 110, 2014.
[16] S. Sendhilkumar and K. Selvakumar," Application of Fuzzy Logicfor User Classification in Personalized Web Search", IJCI, pp 23- 49, 2014.
[17] Ming Yao,” Research on Learning Evidence Improvement for kNNBased Classification Algorithm”, IJDTA, pp 103- 110, 2014.