PERFORMING PREDICTIVE DATA ANALYTICS IN DATA MINING USING VARIOUS TOOLS
Abstract
Predictive Data Analytics is a branch of Data Mining. Performing Predictive Data Analytics on huge data sets will help us in quick Decision Making forecasting on the results obtained on live or sample data. Data mining, the extraction of hidden predictive information from large databases, is a powerful new technology with great potential to help companies or individuals to focus on the most important information in their data warehouses. This paper helps us in specifying how to do Predictive Data Analytics in data mining using various tools. There are various Open Source Tools which help us in performing Predictive Analytics such as R Studio, Weka, KNIME etc. This paper also lists various predictive analytic tools and specify there features and usage. A comparison also can be made or decision can be taken by the reader to use a specific tool based on the requirement. The main scope is to enhance the study of predictive data analysis and provide the necessary help in quick decision making in any of the important area. Predictive data analytics can be performed in various areas such as medical, agriculture, behavior prediction of kids, behavior of a customer in a particular business etc. In this aspect the paper elaborates on the tools available to do perform predictive data analytics and also introduce the importance of data mining. Predictive data analysis is done using variables as attributes known as predictors. This paper also includes information about Knowledge Discovery Process which is a part of Data Mining.
Keywords
References
Philip K. Chan, Florida Institute of Technology Wei Fan, Andreas L. Prodromidis, and Salvatore J. Stolfo, Columbia University “Distributed Data Mining in Credit Card Fraud Detection” IEEE intelligent systems.
P. Chan and S. Stolfo, “Metalearning for Multistrategy and Parallel Learning,” Proc. Second Int’l Workshop Multistrategy Learning, Center for Artificial Intelligence, George Mason Univ., Fairfax,Va., 1993, pp. 150–165
P. Chan and S. Stolfo, “Toward Scalable Learning with Nonuniform Class and Cost Distributions: A Case Study in Credit Card Fraud Detection,” Proc. Fourth Int’l Conf. Knowledge Discovery and Data Mining, AAAI Press, Menlo Park, Calif., 1998, pp. 164–168
http://www.cid.harvard.edu/ciddata/ciddata.html
http://thenewstack.io/six-of-the-best-open-source-data-mining-tools/
S. Stolfo et al., “JAM: Java Agents for Metalearning over Distributed Databases,” Proc. Third Int’l Conf. Knowledge Discovery and Data Mining, AAAI Press, Menlo Park, Calif., 1997, pp. 74–81.
http://www.techgyd.com/10-best-data-mining-software-better-analysis/14362/
Paško Konjevoda and Nikola Štambuk, “Open-Source Tools for Data Mining in Social Science ,” Theoretical and Methodological Approaches to Social Sciences and Knowledge Management, pp.163-176
Refbacks
- There are currently no refbacks.
Copyright © 2012 - 2023, All rights reserved.| ijitr.com
International Journal of Innovative Technology and Research is licensed under a Creative Commons Attribution 3.0 Unported License.Based on a work at IJITR , Permissions beyond the scope of this license may be available at http://creativecommons.org/licenses/by/3.0/deed.en_GB.