0% found this document useful (0 votes)
125 views3 pages

Big Data: NADC Says: Every Day, We Create 2.5 Quintillion Bytes of Data - So Much That 90% of The Data in The

This document provides an overview of a big data course. It begins by defining big data as very large data sets that can be analyzed computationally to reveal patterns. It notes that big data comes from many sources, like sensors, social media, pictures, purchases and cell phones. The course content includes introductions to big data and Apache Hadoop architecture. It will cover Hadoop distributions and components like HDFS, MapReduce, Hive and Pig. Students will learn to set up and install Hadoop, do HDFS programming, debug Hadoop programs and access Hadoop data using Hive. The workshop is two days and assumes a moderate knowledge of Java and databases. Certificates will be provided to participants, organizers and winners.

Uploaded by

Arihant Jain
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
125 views3 pages

Big Data: NADC Says: Every Day, We Create 2.5 Quintillion Bytes of Data - So Much That 90% of The Data in The

This document provides an overview of a big data course. It begins by defining big data as very large data sets that can be analyzed computationally to reveal patterns. It notes that big data comes from many sources, like sensors, social media, pictures, purchases and cell phones. The course content includes introductions to big data and Apache Hadoop architecture. It will cover Hadoop distributions and components like HDFS, MapReduce, Hive and Pig. Students will learn to set up and install Hadoop, do HDFS programming, debug Hadoop programs and access Hadoop data using Hive. The workshop is two days and assumes a moderate knowledge of Java and databases. Certificates will be provided to participants, organizers and winners.

Uploaded by

Arihant Jain
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 3

BIG DATA

Introduction
Big data is a terminology being given to very large data sets which can be analyzed computationally
to show us patterns or trends in the random data. Today whole IT Industry is re-structuring the way
they used to maintain their database. This data could be anything right from email IDs, numbers of
employees, clients or blood groups of patients, database collection of driving license numbers of
whole world.
Big Data in simple words is a technique to manage the important and scattered database and
analyze its behavior. This technology is the latest technology on which whole world is moving onto.
Enormous Jobs and Opportunities to start own business will be created in the field.
NADC Says: Every day, we create 2.5 quintillion bytes of data so much that 90% of the data in the
world today has been created in the last two years alone. This data comes from everywhere: sensors
used to gather climate information, posts to social media sites, digital pictures and videos, purchase
transaction records, and cell phone GPS signals to name a few. This data is Big Data.
Course Content
1.

Introduction to Big Data

Traditional Data Processing Technologies

Apache Hadoop Architecture

Hadoop Architecture

Hadoop and RDBMS

Hadoop Distributions

HDFS Architecture

Hadoop Ecosystem MapReduce, Hadoop Streaming , Hive, Pig, Hbase

Where Hadoop fits in the Enterprise

Hadoop Setup and Installation

HDFS Programming Basics

Hadoop Streaming

Performance Tuning

Debugging Hadoop Programs

MapReduce Architecture

MapReduce Programming Basics

MapReduce Programming Using Big Insights


Accessing Hadoop Data Using Hive

Hive Architecture.

Downloading, Installing and Configuring Hive.

Understand what Apache Hive is and Hive use cases.

Make basic configuration changes in a Hive installation.

Use DDL to create new Hive databases and tables.


Pre-Requirement
The Workshop content consists of an approximately equal mixture of lecture and hands-on lab. This
will be a Two days workshop. All students have at least moderate knowledge in Java and Database.
Recommendation: It is strongly recommended to bring your own LAPTOP during the training on
which you can install and run programs if you would like to do the optional, hands-on
experiments/exercises after the trainings/ workshops.
Certification
1. "Certificate of Appreciation" for Organizing Person from ARK Technosolutions & NADC
India,AMALGAM-IIT MADRAS.
2. "Certificate of Association" for Organizing College from ARK Technosolutions & NADC
India,AMALGAM-IIT MADRAS.
3. "Certificate of Participation" to every participant from ARK Technosolutions & NADC
India,AMALGAM-IIT MADRAS.
4. "Certificate of Merit" to the Zonal Winners from ARK Technosolutions & NADC India,AMALGAMIIT MADRAS.
5. "Certificate of Coordination" to the Coordinators from ARK Technosolutions
& NADC India,AMALGAM-IIT MADRAS.

Regards

RUTUJ KARANDIKAR
HEAD MANAGER , INDIA
AMALGAM , IIT MADRAS
08425858196,9769172667

You might also like