0% found this document useful (0 votes)
46 views4 pages

Big Data Assignment 1 1

Uploaded by

moghariyarohit
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
46 views4 pages

Big Data Assignment 1 1

Uploaded by

moghariyarohit
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

MASTER OF COMPUTER APPLICATIONS

Big Data Analytics (MCA1306)


ASSIGNMENT - 1
Questions
1. What are the differences between structured, semi-structured, and unstructured data in
the context of Big Data.
2. What is the role of data storage in Big Data analytics, and what technologies are
commonly used for storing large datasets?
3. What are the responsibilities of Hadoop and Yarn in big data analytics?
4. What importance does data quality hold in big data analytics?
5. What risk related to ethical and privacy concern be associated with big data collection
and analysis?
6. What are the Characteristics of Big Data?
7. How can machine learning and artificial intelligence integrate with big data to provide
insights?
8. Describe some common challenges faced by conventional data systems when dealing
with big data collection and analysis.
9. Describe the emerging trends related to big data analytics that should be known to
business.
10. Define "Intelligent Data Analysis" and its role in Big Data.

11. What is MapReduce, and how does it work?

12. How can organizations leverage Big Data for predictive analytics? Provide examples
of industries that benefit from this capability.
13. Compare and contrast the traditional business approach to data analysis with the Big
Data business approach.
14. Explain the concept of data replication in HDFS.

15. How does Hadoop achieve scalability and fault tolerance?

16. What is the role of the "Shuffle and Sort" phase in a MapReduce job?
Multiple-Choice Questions (MCQs)
1. Which of the following is NOT considered one of the "3 Vs" of Big Data?
A) Variety B) Velocity
C) Value D) Volume
2. What is the primary challenge of conventional systems when dealing with Big Data?
A) Lack of data storage
B) Slow data processing speeds
C) Inability to manage unstructured data
D) High cost of cloud storage
3. Which type of data is characterized by being organized in a structured format, such as
rows and columns?
A) Unstructured data B) Semi-structured data
C) Structured data D) Raw data
4. What type of analysis is commonly used to make predictions based on Big Data?
A) Descriptive analysis
B) Predictive analysis
C) Prescriptive analysis
D) Diagnostic analysis
5. Which of the following tools is widely used for Big Data storage and processing?
A) Excel B) Hadoop
C) SQL Server D) PowerPoint
6. In Big Data, what does the term "data lake" refer to?
A) A type of database
B) A storage repository for raw data
C) A backup solution
D) A visualization tool
7. Which industry is most likely to benefit from Big Data analytics for customer
behaviour insights?
A) Agriculture B) Retail
C) Manufacturing D) Construction
8. What is a significant ethical concern regarding Big Data?
A) Increased data storage costs
B) High processing speed
C) Privacy violations
D) Data visualization challenges
9. What is the main advantage of using machine learning in Big Data?
A) Reduced data storage needs
B) Automation of data collection
C) Enhanced ability to uncover patterns in large datasets
D) Simplified data entry processes
10. Which of the following is an example of unstructured data?
A) Excel spreadsheets
B) Social media posts
C) SQL databases
D) CSV files
11. What is the primary focus of prescriptive analytics?
A) Understanding past trends
B) Predicting future outcomes
C) Recommending actions to achieve desired outcomes
D) Analyzing current performance
12. Which Big Data technology is known for real-time data processing?
A) Hadoop B) Apache Spark
C) MySQL D) MongoDB
13. Which of the following best describes Big Data analytics?
A) Analyzing small datasets for trends
B) Collecting and analyzing large volumes of diverse data
C) Using data solely for reporting purposes
D) Storing data in traditional databases
14. What is a primary benefit of using cloud computing for Big Data analytics?
A) Increased hardware costs
B) Scalability and flexibility
C) Limited access to data
D) Decreased processing speed
15. Which of the following is a common use case for Big Data in healthcare?
A) Inventory management
B) Patient outcome prediction
C) Supply chain optimization
D) Employee training programs

You might also like