Unit 2 Notes: Data Analytics
Big data refers to large and complex datasets that cannot be processed or analyzed using
traditional data processing techniques. It includes structured, semi-structured, and unstructured
data that is generated from a variety of sources, such as social media, mobile devices, sensors,
and machine logs. Big data is important for several reasons:
Insights and decision making: Big data can help organizations gain insights into
customer behavior, market trends, and operational efficiencies. This information can be
used to make data-driven decisions that can improve business outcomes.
Innovation: Big data can fuel innovation by providing new opportunities to create new
products and services or improve existing ones.
Cost savings: Big data technologies can help organizations reduce costs by optimizing
processes, identifying inefficiencies, and improving resource utilization.
Competitive advantage: Organizations that leverage big data effectively can gain a
competitive advantage by making better decisions faster, identifying new revenue
streams, and improving customer experiences.
Healthcare: Big data is important in healthcare as it can help in improving patient
outcomes, identifying disease patterns, and optimizing resource allocation.
Social good: Big data can be used for social good by providing insights into social issues
such as poverty, hunger, and disease outbreaks, which can help policymakers make
informed decisions and allocate resources effectively.
In conclusion, big data has become an important aspect of modern business and society. Its
importance lies in its ability to provide insights, fuel innovation, save costs, gain a competitive
advantage, improve healthcare, and address social issues.
5 V’s of Big Data:
Volume: This refers to the vast amount of data that is generated from various sources such as
social media, sensors, and logs. Big Data is characterized by its sheer volume and the need for
tools and technologies that can handle and process large datasets.
Velocity: This refers to the speed at which data is generated, collected, and processed. With the
advent of IoT (Internet of Things) and other real-time data sources, data is being generated at a
faster rate than ever before. Big Data requires fast and real-time processing of data to extract
valuable insights.
Variety: This refers to the different types and formats of data that are generated. Big Data can
include structured, unstructured, and semi-structured data, such as text, audio, video, and social
media data. Analyzing these various data types requires advanced tools and technologies.
Veracity: This refers to the accuracy and reliability of the data. Big Data often includes
incomplete or inaccurate data that can lead to unreliable insights. It is essential to ensure that the
data being analyzed is trustworthy and of high quality.
Value: This refers to the importance and usefulness of the insights derived from the data. The
ultimate goal of analyzing Big Data is to extract valuable insights that can drive business
decisions, improve processes, and enhance customer experiences. The insights derived from Big
Data should be actionable and valuable to the organization.
Big data analytics refers to the process of analyzing and interpreting large and complex datasets
to uncover valuable insights that can drive business decisions, optimize processes, and improve
customer experiences. It involves using advanced analytical techniques, such as data mining,
machine learning, and predictive modeling, to extract insights from vast amounts of data.
The growth of Big Data has been fueled by the proliferation of new data sources, such as social
media, mobile devices, and the Internet of Things (IoT). These sources generate vast amounts of
data, which can be used to gain insights into customer behavior, market trends, and operational
efficiencies. Key uses of big data analytics include:
Predictive modeling: Big Data analytics can be used to build predictive models that can
forecast customer behavior, market trends, and operational outcomes.
Customer insights: Big Data analytics can help organizations gain insights into
customer behavior and preferences, which can be used to improve customer experiences
and drive revenue growth.
Operational efficiency: Big Data analytics can help organizations identify inefficiencies
in their processes and operations, which can lead to cost savings and improved resource
utilization.
Risk management: Big Data analytics can be used to identify potential risks and
vulnerabilities in an organization's operations, supply chain, and customer interactions.
Product development: Big Data analytics can provide insights into customer needs and
preferences, which can inform the development of new products and services.
To effectively leverage Big Data analytics, organizations need to invest in the right tools and
technologies, as well as skilled data analysts and data scientists. It is essential to have a clear
understanding of the organization's goals and objectives and to develop a robust data strategy to
guide the analytics process.
Big data analytics has a wide range of applications across various industries. Here are some
examples of how big data analytics is being used in different domains:
Healthcare: Big data analytics is being used to improve patient outcomes by identifying
disease patterns, predicting patient risks, and optimizing resource allocation. For
example, analytics can be used to predict which patients are at high risk of developing a
specific disease and intervene early to prevent the disease from progressing.
Finance: Big data analytics is being used to detect fraudulent transactions, assess credit
risk, and identify investment opportunities. For example, analytics can be used to identify
patterns of fraudulent activity in credit card transactions and alert the bank to take
immediate action.
Retail: Big data analytics is being used to optimize supply chain management,
personalize customer experiences, and forecast demand. For example, analytics can be
used to analyze customer data and provide personalized recommendations for products
and services.
Manufacturing: Big data analytics is being used to improve quality control, optimize
production processes, and reduce downtime. For example, analytics can be used to
monitor machine performance in real time and predict potential equipment failures before
they occur.
Transportation: Big data analytics is being used to optimize logistics and improve
safety. For example, analytics can be used to optimize delivery routes to reduce fuel
consumption and improve on-time delivery.
Education: Big data analytics is being used to personalize learning experiences, identify
at-risk students, and measure student performance. For example, analytics can be used to
track student progress and provide personalized recommendations for coursework.
Sports: Big data analytics is being used to analyze player performance, optimize team
strategies, and enhance the fan experience. For example, analytics can be used to track
player movements on the field and provide insights into their performance and behavior.
In conclusion, big data analytics has a wide range of applications across different industries and
domains. By leveraging advanced analytics techniques, organizations can gain valuable insights
into customer behavior, market trends, and operational efficiencies, which can drive business
decisions and improve outcomes.
Hadoop is one of the most popular big data technologies used for processing and managing large
and complex datasets. It is an open-source framework that provides a distributed file system
(HDFS) and a framework for processing large-scale data (MapReduce). The Hadoop ecosystem
includes several other tools and technologies that work together to provide a complete big data
solution.
One of the key features of Hadoop is its ability to process data in a parallel and distributed
manner, which enables it to handle large datasets with ease. Hadoop uses a distributed computing
model, where data is divided into smaller chunks and processed in parallel across multiple nodes
in a cluster.
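To make the MapReduce idea concrete, here is a toy, in-process Python sketch of the classic word-count pattern. The sample input lines are made up for illustration; real Hadoop runs the same map and reduce steps in parallel across HDFS blocks on many nodes rather than inside one script.

from collections import defaultdict

# Illustrative input records; in Hadoop these would be lines from a file stored in HDFS.
lines = ["big data needs big tools", "data drives decisions"]

# Map phase: emit a (word, 1) pair for every word in every input line.
mapped = [(word, 1) for line in lines for word in line.split()]

# Shuffle phase: group values by key (Hadoop performs this step across the cluster).
groups = defaultdict(list)
for word, count in mapped:
    groups[word].append(count)

# Reduce phase: aggregate the grouped values for each key.
word_counts = {word: sum(counts) for word, counts in groups.items()}
print(word_counts)  # e.g. {'big': 2, 'data': 2, 'needs': 1, 'tools': 1, 'drives': 1, 'decisions': 1}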
Some of the other technologies that work alongside Hadoop in the big data ecosystem include:
Spark: Apache Spark is an open-source data processing framework that is designed to work with
Hadoop. It provides an alternative to MapReduce and is known for its speed and ease of use.
Spark provides an in-memory processing model, which allows it to process data much faster than
traditional batch processing systems.
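As a small illustration of Spark's in-memory model, the PySpark sketch below counts words from a text file. It assumes the pyspark package is installed and that a local file named logs.txt exists; both the file name and the application name are illustrative.

from pyspark.sql import SparkSession

# Start (or reuse) a local Spark session.
spark = SparkSession.builder.appName("WordCount").getOrCreate()

# Read the file, split each line into words, and count occurrences in memory.
lines = spark.read.text("logs.txt")          # DataFrame with a single "value" column
words = lines.rdd.flatMap(lambda row: row.value.split())
counts = words.map(lambda w: (w, 1)).reduceByKey(lambda a, b: a + b)

for word, count in counts.take(10):
    print(word, count)

spark.stop()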
NoSQL databases: NoSQL databases, such as MongoDB and Cassandra, are designed to handle
large volumes of unstructured data. These databases are highly scalable and provide high
performance for read and write operations. They are often used in conjunction with Hadoop to
store and process large amounts of data.
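The sketch below shows, at a high level, how semi-structured records might be stored and queried in a NoSQL document store from Python. It assumes a MongoDB server running on localhost and the pymongo driver; the database, collection, and field names are illustrative.

from pymongo import MongoClient

client = MongoClient("mongodb://localhost:27017")
events = client["analytics"]["events"]   # database and collection chosen for the example

# Insert a semi-structured record as a JSON-like document (no fixed schema required).
events.insert_one({"user": "u42", "action": "click", "tags": ["promo", "mobile"]})

# Query documents that contain a particular tag and count them.
for doc in events.find({"tags": "promo"}):
    print(doc["user"], doc["action"])
print("promo events:", events.count_documents({"tags": "promo"}))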
Hive: Apache Hive is a data warehousing and SQL-like query language that is used to process
data stored in Hadoop. Hive provides a familiar interface for data analysts and allows them to
perform complex queries on large datasets.
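As a rough sketch, a HiveQL query can be issued from Python through the PyHive package, assuming a HiveServer2 instance is reachable; the host, port, and the sales table used here are assumptions made for the example.

from pyhive import hive

# Connect to HiveServer2 (default port 10000 in this sketch).
conn = hive.Connection(host="localhost", port=10000)
cursor = conn.cursor()

# HiveQL looks like SQL but is executed over data stored in Hadoop.
cursor.execute("SELECT region, SUM(amount) FROM sales GROUP BY region")
for region, total in cursor.fetchall():
    print(region, total)

cursor.close()
conn.close()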
Pig: Apache Pig is a platform for creating complex data processing pipelines. Pig provides a
high-level language (Pig Latin) for expressing data transformations and can be used to perform
complex data manipulations on large datasets.
In conclusion, Hadoop is a key technology in the big data world and provides a framework for
processing and managing large datasets in a parallel and distributed manner. Other technologies,
such as Spark, NoSQL databases, Hive, and Pig, work alongside Hadoop to provide a
complete big data solution. Together, these technologies enable organizations to extract valuable
insights from large and complex datasets, which can drive business decisions and improve
outcomes.
There are several open-source technologies available for big data analytics. Some of the most
popular, discussed above, are Hadoop, Spark, Hive, Pig, and NoSQL databases such as MongoDB
and Cassandra. These are just a few examples of the many open-source technologies available;
the choice of technology depends on the specific requirements and use case of the organization.
Cloud computing and big data are closely related, as the cloud provides a scalable and cost-
effective platform for storing and processing large data sets.
Here are some of the ways in which cloud computing can be used for big data:
Storage: Cloud storage services such as Amazon S3, Google Cloud Storage, and
Microsoft Azure Blob Storage provide a cost-effective way to store large amounts of
data. These services are highly scalable and can be used to store both structured and
unstructured data.
Processing: Cloud computing platforms such as Amazon Web Services (AWS), Google
Cloud Platform (GCP), and Microsoft Azure provide tools and services for processing
big data. These include services such as Amazon EMR, Google Cloud Dataproc, and
Microsoft Azure HDInsight, which provide managed Hadoop and Spark clusters for
processing large data sets.
Analytics: Cloud-based analytics platforms such as Google BigQuery, AWS Redshift,
and Microsoft Azure Synapse Analytics provide a powerful and scalable way to perform
data analysis on large data sets. These services can be used for a wide range of analytics
tasks, including data warehousing, business intelligence, and machine learning.
Real-time processing: Cloud-based stream processing services such as AWS Kinesis,
Google Cloud Dataflow, and Azure Stream Analytics provide a scalable and cost-
effective way to process real-time data streams.
Overall, cloud computing provides a powerful and flexible platform for big data processing and
analysis, allowing organizations to store and analyze large data sets without the need for costly
on-premises infrastructure.
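As one concrete illustration of cloud object storage for big data, the sketch below uploads a raw data file to Amazon S3 using the boto3 SDK and then lists what is stored under a prefix. The bucket name, file name, and configured AWS credentials are assumptions made for the example.

import boto3

s3 = boto3.client("s3")

# Upload a local file into the bucket under a "raw/" prefix.
s3.upload_file("sensor_readings.csv", "my-analytics-bucket", "raw/sensor_readings.csv")

# List the objects stored under that prefix.
response = s3.list_objects_v2(Bucket="my-analytics-bucket", Prefix="raw/")
for obj in response.get("Contents", []):
    print(obj["Key"], obj["Size"])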
Predictive Analytics
Predictive analytics is the use of statistical algorithms and machine learning techniques to
analyze historical data and make predictions about future events or trends. It involves the use of
advanced analytics techniques to identify patterns and relationships in data, which can then be
used to make predictions about future outcomes.
Predictive analytics can be applied in a wide range of industries and use cases, including:
Sales and marketing: Predictive analytics can be used to identify potential customers,
forecast sales volumes, and optimize marketing campaigns.
Financial services: Predictive analytics can be used to detect fraud, manage risk, and
optimize investment portfolios.
Healthcare: Predictive analytics can be used to identify high-risk patients, forecast
disease outbreaks, and optimize healthcare delivery.
Manufacturing: Predictive analytics can be used to forecast demand, optimize
production schedules, and prevent equipment failures.
Transportation: Predictive analytics can be used to optimize routes, predict maintenance
needs, and improve safety.
To perform predictive analytics, organizations need to gather and analyze large amounts of data,
including historical data and real-time data. This data is then used to build predictive models,
which can be used to make predictions about future events or outcomes. The accuracy of
predictive analytics depends on the quality of the data and the complexity of the predictive
models used.
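A minimal predictive-modeling sketch using scikit-learn is shown below: a classifier is trained on historical records and evaluated on held-out data to estimate how well it will predict future outcomes. The CSV file and its "churned" label column are illustrative assumptions.

import pandas as pd
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

# Historical data: feature columns plus a known outcome column ("churned").
data = pd.read_csv("customer_history.csv")
X = data.drop(columns=["churned"])
y = data["churned"]

# Hold out 20% of the data to test how well the model generalizes.
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

model = LogisticRegression(max_iter=1000)
model.fit(X_train, y_train)

print("held-out accuracy:", accuracy_score(y_test, model.predict(X_test)))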
Hadoop is an open-source framework for distributed storage and processing of large data sets. It
provides a scalable and fault-tolerant platform for storing and processing big data.
Predictive analytics can be performed on data stored in Hadoop using a variety of open-source
tools and frameworks, such as Apache Mahout, Apache Spark MLlib, and H2O.ai. These tools
provide a wide range of machine learning algorithms for building predictive models, including
regression, classification, clustering, and recommendation.
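As an illustrative sketch of that workflow, the PySpark MLlib snippet below fits a logistic regression model on data read from HDFS. The HDFS path and the column names (including the label column) are assumptions made for the example.

from pyspark.ml.classification import LogisticRegression
from pyspark.ml.feature import VectorAssembler
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("ChurnModel").getOrCreate()

# Load historical records from HDFS and assemble feature columns into one vector.
df = spark.read.csv("hdfs:///data/customers.csv", header=True, inferSchema=True)
assembler = VectorAssembler(inputCols=["age", "tenure", "monthly_spend"], outputCol="features")
train = assembler.transform(df).select("features", "label")

# Fit the model; Spark distributes the computation across the cluster.
model = LogisticRegression(labelCol="label", featuresCol="features").fit(train)
model.transform(train).select("features", "prediction").show(5)

spark.stop()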
One of the key advantages of using Hadoop for predictive analytics is its ability to process large
data sets in parallel. Hadoop uses a distributed processing model, where data is stored across
multiple nodes in a cluster and processed in parallel. This allows organizations to analyze large
amounts of data quickly and efficiently, which is essential for predictive analytics.
Another advantage of using Hadoop for predictive analytics is its ability to handle a variety of
data types, including structured and unstructured data. This is particularly useful for applications
such as text mining and sentiment analysis, where unstructured data such as social media posts
and customer reviews are analyzed to make predictions about customer behavior.
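For instance, a simple sentiment-analysis pass over unstructured text could look like the sketch below, which uses NLTK's VADER analyzer. The review strings are made up, and the vader_lexicon resource must be downloaded once before use.

import nltk
from nltk.sentiment.vader import SentimentIntensityAnalyzer

nltk.download("vader_lexicon", quiet=True)   # one-time download of the sentiment lexicon
sia = SentimentIntensityAnalyzer()

reviews = [
    "The delivery was fast and the product works great!",
    "Terrible support, I will not order again.",
]

# Each score dictionary contains neg/neu/pos components and an overall compound score.
for review in reviews:
    scores = sia.polarity_scores(review)
    print(scores["compound"], review)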
Overall, Hadoop provides a powerful platform for performing predictive analytics on big data.
By using open-source tools and frameworks, organizations can build predictive models that are
accurate and scalable, allowing them to make better decisions and gain a competitive advantage.
Mobile Business Intelligence (BI) refers to the delivery of business intelligence solutions to
mobile devices such as smartphones and tablets. It allows business users to access and analyze
data on the go, providing them with real-time insights into key business metrics.
Big Data plays a critical role in Mobile BI, as it provides the large and complex data sets that
businesses need to analyze to make informed decisions. Big Data technologies such as Hadoop
and Spark provide a scalable and cost-effective platform for storing and processing the vast
amounts of data required for Mobile BI.
Some of the benefits of Mobile BI and Big Data include:
Real-time insights: Mobile BI allows business users to access and analyze data in real
time, enabling them to make informed decisions on the go.
Increased productivity: Mobile BI eliminates the need for users to be tied to their
desktops or laptops, allowing them to access data and insights from anywhere, at any
time.
Better decision-making: Big Data provides the large and complex data sets required for
accurate analysis, enabling businesses to make better-informed decisions.
Improved customer experience: Mobile BI allows businesses to analyze customer data
in real time, enabling them to provide a more personalized customer experience.
Cost savings: Big Data technologies provide a cost-effective platform for storing and
processing large data sets, reducing the need for expensive on-premises infrastructure.
Overall, Mobile BI and Big Data provide a powerful combination for businesses looking to gain
insights into their data and make better-informed decisions. By leveraging the scalability and
flexibility of Big Data technologies, businesses can analyze vast amounts of data in real time,
providing them with a competitive advantage in today's fast-paced business environment.
Crowdsourcing analytics is a process of using a group of people to collect and analyze data,
typically through an online platform. This approach allows organizations to tap into the
collective knowledge and expertise of a diverse group of individuals to solve complex problems,
analyze large datasets, or make predictions.
Crowdsourcing analytics can be used in a variety of industries and use cases.
One of the key advantages of crowdsourcing analytics is its ability to collect and analyze large
amounts of data quickly and efficiently. By leveraging the collective knowledge and expertise of
a large group of individuals, organizations can analyze data more accurately and efficiently than
if they were relying solely on internal resources.
Overall, crowdsourcing analytics provides a powerful tool for organizations looking to collect
and analyze large amounts of data. By tapping into the collective knowledge and expertise of a
diverse group of individuals, organizations can make better-informed decisions and gain a
competitive advantage.
Inter and trans firewall analytics refer to the analysis of network traffic that passes through or
between firewalls. Firewalls are network security devices that monitor and control incoming and
outgoing network traffic based on pre-defined security rules.
Inter firewall analytics involves analyzing traffic that passes through a single firewall. This
includes analyzing incoming and outgoing traffic to identify potential threats such as malware,
viruses, or unauthorized access attempts. This analysis can be used to detect and prevent network
attacks, protect sensitive data, and ensure compliance with security policies.
Trans firewall analytics, on the other hand, involves analyzing traffic that passes through
multiple firewalls. This is typically done in large and complex networks that have multiple layers
of security. Trans firewall analytics can help organizations identify potential vulnerabilities in
their network architecture, optimize network performance, and ensure compliance with
regulatory requirements.
Both inter and trans firewall analytics require the use of advanced analytics techniques such as
machine learning, anomaly detection, and behavioral analysis. These techniques allow
organizations to identify patterns and anomalies in network traffic, detect potential threats, and
respond quickly to security incidents. Some of the benefits of inter and trans firewall analytics include:
Improved threat detection: Analytics can help identify and mitigate potential security
threats before they can cause harm to the network.
Enhanced compliance: Analytics can help ensure compliance with regulatory
requirements, such as HIPAA or PCI-DSS.
Improved network performance: Analytics can help optimize network traffic, reducing
latency and improving overall network performance.
Better visibility: Analytics can provide organizations with a better understanding of their
network traffic, including where it is coming from and where it is going.
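As a small illustration of the anomaly-detection techniques mentioned above, the sketch below flags unusual firewall traffic with scikit-learn's IsolationForest; the traffic features and values are synthetic and chosen only for the example.

import numpy as np
from sklearn.ensemble import IsolationForest

# Each row: [bytes transferred, packets per second, distinct destination ports].
rng = np.random.RandomState(0)
normal_traffic = rng.normal(loc=[500, 50, 3], scale=[100, 10, 1], size=(200, 3))
suspicious = np.array([[50000, 900, 120]])   # an unusually heavy connection

# Train on (assumed) normal traffic; the model learns what "typical" looks like.
model = IsolationForest(contamination=0.01, random_state=0).fit(normal_traffic)

# predict() returns 1 for inliers (normal traffic) and -1 for anomalies.
print(model.predict(suspicious))         # expected: [-1]
print(model.predict(normal_traffic[:5]))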
In summary, inter and trans firewall analytics are important tools for organizations looking to
improve their network security and performance. By using advanced analytics techniques to
analyze network traffic, organizations can detect potential threats, optimize network
performance, and ensure compliance with regulatory requirements.
Information Management
Information management refers to the processes and practices an organization uses to capture,
store, retrieve, and analyze its data and information assets. Its key components include:
Data governance: This involves the establishment of policies, procedures, and standards
to ensure the quality, accuracy, and security of data.
Data architecture: This involves the design and implementation of a data infrastructure
that supports the organization's information needs, including data storage, retrieval, and
analysis.
Data modeling: This involves the creation of data models that represent the organization's
data in a standardized and structured format, allowing for easier analysis and reporting.
Data integration: This involves the process of combining data from different sources and
formats to create a unified view of the organization's information assets.
Data security: This involves the protection of sensitive data from unauthorized access,
use, disclosure, or destruction.
Data analytics: This involves the use of data to gain insights and make informed
decisions, including the use of tools such as business intelligence, data mining, and
predictive analytics.
Overall, information management plays a critical role in enabling organizations to effectively
capture, store, retrieve, and analyze their data and information assets. By establishing effective
information management processes, organizations can gain insights that drive business growth
and competitive advantage.