An Extensive Study of Data Analysis Tools (Rapid Miner, Weka, R Tool, Knime, Orange)

International Journal of Computer Science and Engineering
© 2018 by SSRG - IJCSE Journal
Volume 5 Issue 9
Year of Publication : 2018
Authors : Venkateswarlu Pynam, R Roje Spanadna, Kolli Srikanth

pdf
How to Cite?

Venkateswarlu Pynam, R Roje Spanadna, Kolli Srikanth, "An Extensive Study of Data Analysis Tools (Rapid Miner, Weka, R Tool, Knime, Orange)," SSRG International Journal of Computer Science and Engineering , vol. 5,  no. 9, pp.  4-11, 2018. Crossref, https://doi.org/10.14445/23488387/IJCSE-V5I9P102

Abstract:

In today’s data has been increasing in the concept of 3 v’s (volume, velocity and variety) technology. Due to the large and Complex Collection of Datasets is difficult to process on traditional data processing applications. So that leads to arrive a new technology called Data Analytics. It is a science of exploring raw data and elicitation the useful information and hidden pattern. The main aim of data analysis is to use advance analytics techniques for huge and different datasets. The size of the dataset may vary from terabytes to zetta bytes and that can be structured or unstructured. The paper gives the comprehensive and theoretical analysis of five open source data analytics tools which are RapidMiner, Weka, R tool, KNIME and Orange. By employing the study the choice and selection of tools and be made easy, these tools are evaluated on basis of various parameters like amount of data used, response time, ease of use, price tag, analysis of algorithm and handling.

Keywords:

Data Analytics, Big Data, Data analytical tools, Visualization tools, Data mining

References:

[1] Lekha R. Nair , Sujala D. Shetty. “Research in Big Data and Analytics: An Overview” presented at International Journal of Computer Applications, Volume 108 – No 14, 2014. 
[2] Mike Barlow. Real-Time Big Data Analytics: Emerging Architecture. Sebastopol, CA: O‚ÄüReilly Media, 2013, pp. 3. 
[3] Sanjay Rathee. “Big Data and Hadoop with components like Flume, Pig, Hive and Jaql,” presented at International Conference on Cloud, Big Data and Trust 2013, RGPV, 2015. 
[4] Swasti Singhal, Monika Jena. “A Study on WEKA Tool for Data Preprocessing, Classification and Clustering” presented at International Journal of Innovative Technology and Exploring Engineering (IJITEE), Volume-2, Issue-6,2013 
[5] Kalpana Rangra, Dr. K. L. Bansal. “Comparative Study of Data Mining Tools”, presented at International Journal of Advanced Research in Computer Science and Software Engineering, Volume 4, Issue 6, 2014. 
[6] Michael R. Berthold, Nicolas Cebron, Fabian Dill, Thomas R. Gabriel, Tobias K¨otter, Thorsten Meinl, Peter Ohl, Kilian Thiel and Bernd Wiswedel. “KNIME – The Konstanz Information Miner” presented at University of Konstanz Nycomed Chair for Bioinformatics and Information Mining, Germany. 
[7] http://bigdata-madesimple.com/top-30-big-data-tools-data-analysis/ 
[8] http://opensourceforu.com/2017/03/top-10-open-source-data-mining-tools/ 
[9] https://rapidminer.com/wp-content/uploads/2014/10/RapidMiner-5-Operator-Reference.pdf 
[10] http://pingax.com/understanding-data-analytics-project-life-cycle/