RapidMiner
EXPERIMENT NO. 10
Mini-project on RAPIDMINER.
Aim: To study the RapidMiner data mining tool.
Requirements: Microsoft Word, RapidMiner.
Introduction: RapidMiner is a data science software platform, developed by the company of
the same name, that provides an integrated environment for data preparation, machine learning,
deep learning, text mining, and predictive analytics. It is used for business and commercial
applications as well as for research, education, training, rapid prototyping, and application
development, and it supports all steps of the machine learning process, including data preparation,
results visualization, model validation, and optimization. RapidMiner is developed on an open
core model. The RapidMiner Studio Free Edition, which is limited to 1 logical processor and
10,000 data rows, is available under the AGPL license, although it depends on various
non-open-source components. Commercial pricing starts at $5,000 and is available from the developer.
Advantages:
Flow-based programming allows visualization of pipelines.
Contains modules for statistical analysis, machine learning, ETL, etc.
No coding required.
Easy to set up.
Multiple deployment options based on your preference.
Strong visualization.
Accurate preprocessing.
Multiple interfaces.
Java API available that can be used in programs.
Disadvantages:
Application Areas:
1. Business:
Optimize energy cost
Streamline customer service
Create a data-driven product portfolio
Predict delays to improve profitability
Increase yields
2. Education:
RapidMiner provides a rich set of machine learning algorithms for data mining tasks, along with
a comprehensive set of operators (functions) for data pre-processing. Its repository contains
hundreds of machine learning algorithms and functions. RapidMiner is easy to use because it is
a user-friendly visual workflow designer. Visualizing the process helps users with data
preparation and modelling. It also makes teaching machine learning and predictive analytics
easier, because the instructor can show students the role of each operator and which ones are
vital to getting the right model. Students can directly see and understand the effect of using
specific algorithms and functions after a few clicks, drags, and drops. RapidMiner is quick and
easy to master.
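As a rough illustration of the operator idea described above, the following Python sketch chains small functions the way a visual workflow chains operators. It does not use RapidMiner or its API; the names filter_rows, normalize, and run_process, and the sample rows, are hypothetical and chosen only for illustration.

```python
from typing import Callable, Dict, List

Row = Dict[str, float]
Operator = Callable[[List[Row]], List[Row]]


def filter_rows(column: str, minimum: float) -> Operator:
    """Return an operator that keeps rows whose `column` is at least `minimum`."""
    def apply(rows: List[Row]) -> List[Row]:
        return [r for r in rows if r[column] >= minimum]
    return apply


def normalize(column: str) -> Operator:
    """Return an operator that rescales `column` to the range [0, 1]."""
    def apply(rows: List[Row]) -> List[Row]:
        if not rows:
            return []
        values = [r[column] for r in rows]
        lo, hi = min(values), max(values)
        span = (hi - lo) or 1.0  # avoid dividing by zero when all values match
        return [{**r, column: (r[column] - lo) / span} for r in rows]
    return apply


def run_process(rows: List[Row], operators: List[Operator]) -> List[Row]:
    """Pass the rows through each operator in order, like boxes joined in a flow."""
    for operator in operators:
        rows = operator(rows)
    return rows


# Hypothetical toy data: one numeric column named "x".
sample = [{"x": 1.0}, {"x": 2.0}, {"x": 3.0}, {"x": 5.0}]
result = run_process(sample, [filter_rows("x", 2.0), normalize("x")])
print(result)
```

Reordering or removing an entry in the operator list changes the result, which is the kind of effect students can observe when they rearrange operators in a visual process.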
DEPARTMENT OF INFORMATION TECHNOLOGY
OUTPUT:
Dataset of iris:
Operations Performed:
Scatter Plot:
Histogram:
Bell Curve:
Scatter 3D:
Heat Map:
Comparison:
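The plots listed under Operations Performed rest on a few simple statistics. The sketch below, written in plain Python rather than RapidMiner, assumes that the histogram uses equal-width bins, that the bell curve is a normal density fitted from the sample mean and standard deviation, and that the heat map shows Pearson correlations between attributes; the report does not state these settings. The nine rows are the first three records of each class in the standard iris data, not the full table used in the experiment, and the function names are illustrative.

```python
import math
from statistics import NormalDist, mean, stdev

# (sepal length, sepal width, petal length, petal width) in cm.
# First three rows each of setosa, versicolor, and virginica.
IRIS_SAMPLE = [
    (5.1, 3.5, 1.4, 0.2), (4.9, 3.0, 1.4, 0.2), (4.7, 3.2, 1.3, 0.2),
    (7.0, 3.2, 4.7, 1.4), (6.4, 3.2, 4.5, 1.5), (6.9, 3.1, 4.9, 1.5),
    (6.3, 3.3, 6.0, 2.5), (5.8, 2.7, 5.1, 1.9), (7.1, 3.0, 5.9, 2.1),
]


def column(index):
    """Extract one attribute from the sample as a list."""
    return [row[index] for row in IRIS_SAMPLE]


def histogram(values, bins):
    """Count values in `bins` equal-width intervals between min and max."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / bins or 1.0
    counts = [0] * bins
    for v in values:
        # The maximum value is placed in the last bin.
        counts[min(int((v - lo) / width), bins - 1)] += 1
    return counts


def bell_curve(values, xs):
    """Evaluate a normal density fitted to `values` at each point in `xs`."""
    fitted = NormalDist(mean(values), stdev(values))
    return [fitted.pdf(x) for x in xs]


def pearson(xs, ys):
    """Pearson correlation coefficient, the value shown in one heat map cell."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


sepal_length, petal_length, petal_width = column(0), column(2), column(3)
print("histogram counts:", histogram(sepal_length, 3))
print("density at the mean:", bell_curve(sepal_length, [mean(sepal_length)])[0])
print("petal length vs petal width:", pearson(petal_length, petal_width))
```

With only nine rows the numbers are not representative of the full dataset; the sketch is meant to show what each chart summarizes, not to reproduce the screenshots.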
Outcome:
RapidMiner Studio is a "downloadable GUI for machine learning, data mining, text mining,
predictive analytics and business analytics". It can also be used (for most purposes) in batch mode
(command line mode).