0% found this document useful (0 votes)
24 views10 pages

Bigdata Intro-Unit1

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
24 views10 pages

Bigdata Intro-Unit1

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 10

UNIT1

INTRODUCTION TO BIGDATA
DATA
• Data is the collection of raw facts and figures
• Examples of Data: • Student data on admission form-. •
Student’s examination data - obtained marks of different
subjects for all students
• • Census Report, Data of citizens- During census, data of
all citizens like number of persons living in a home,
literate or illiterate, number of children, cast, religion etc.
• • Survey Data –opinion of people about their product
like / unlike. They also collect data about their
competitor companies in a particular area.
INFORMATION
• Information: Processed data is called
information
• raw facts and figures are processed and
arranged in some proper order then they
become information
• Information has proper meanings
• nformation is useful in decision-making
INFORMATION
EXAMPLES
• student’s address labels- Stored data of students
can be used to print address labels of students
• Student’s examination, obtained marks in each
subject is processed to get total obtained marks
of a student.
• Census Report, Total Population- Census data is
used to get report/information about total
population of a country and literacy rate etc.
UNITS OF DATA
WHAT IS BIG DATA
• a collection of data sets that are large and complex,
• difficult to store and process using available database
management tools or traditional data processing
applications
• The definition of Big Data, given by Gartner is,
“Big data is high-volume, and high-velocity and/or high-
variety information assets that demand cost-effective,
innovative forms of information processing that
enable enhanced insight, decision making, and
process automation”.
Sources of Big data
• social media sites,
• sensor networks,
• digital images/videos,
• cell phones,
• purchase transaction records,
• web logs,
• medical records,
• archives,
• military surveillance,
• ecommerce,
• complex scientific research
Examples
• The New York Stock Exchange generates about one
terabyte of new trade data per day.
• Facebook stores, accesses, and analyzes 30+ Petabytes of
user generated data.
• Amazon handles 15 million customer click stream user data
per day to recommend products.
• Walmart handles more than 1 million customer
transactions every hour.
• 230+ millions of tweets are created every day.
• 294 billion emails are sent every day. Services analyses this
data to find the spams.

You might also like