Big Data Analytics Lab
Big Data Analytics Lab
Prerequisites: NIL
Course Objectives:
This course enables the students to learn and understand Big data, data analytics, R
language, developing map reduce programs.
List of Programs
1. Basics of R programming, Data Analysis using R; Using Hadoop for applying
Map-reduce technique: Overview of R, R data types and objects, reading and
writing data, Control structures, functions, scoping rules, dates and times, Loop
functions, debugging tools, Simulation, code profiling
2. Demonstrate how variables are processed and data is displayed using R.
3. Using a suitable example show that R is case-sensitive.
4. Show how scalars and vectors lists are processed in R.
5. Show how matrices can be added and multiplied in R using an appropriate
example.
6. Demonstrate using different colors how graphics is processed and displayed using
R.
7. Using appropriate example show how graphs can be displayed using R.
8. Using appropriate example demonstrate statistical analysis using R (For example,
calculation of mean, variance etc).
9. Consider any real dataset and demonstrate Linear Regression, Random Forest and
CART (Classification and Regression Trees) using R.
10. Consider any real dataset and demonstrate Support Vector Machines (SVM) and
Neural Networks using R.
11. Use Apache Hadoop (or Apache Mahout) to demonstrate the Map-reduce model
for Naïve Bayes Classification.
12. Use Apache Hadoop (or Apache Mahout) to demonstrate the Map-reduce model
for K-means Clustering.
REFERENCES:
1. Frank J Ohlhorst, “Big Data Analytics: Turning Big Data into Big Money”, Wiley
and SAS Business Series, 2012.
2. Colleen Mccue, “Data Mining and Predictive Analysis: Intelligence Gathering and
Crime Analysis”, Elsevier, 2007
3. Machine Learning – Tom M. Mitchell, - MGH
4. Machine Learning: An Algorithmic Perspective, Stephen Marsland, Taylor &
Francis (CRC)
5. https://www.r-project.org/
6. https://cran.r-project.org/manuals.html
7. https://mahout.apache.org/
Course Outcomes:
199
At the end of the course, students will be able to:
1. Apply data analytics for a given problem using R Language.
2. Identify the need for big data analytics for a domain.
3. Use Hadoop, Map Reduce Framework
CO1 1 1 1 1 1 1 2
CO2 1 1 2 1 1 1 1 1 1 1
CO3 1 1 2 2 2 1 1 1 1 3 1 3 1
200