Big Data Assignment 1 1
Big Data Assignment 1 1
12. How can organizations leverage Big Data for predictive analytics? Provide examples
of industries that benefit from this capability.
13. Compare and contrast the traditional business approach to data analysis with the Big
Data business approach.
14. Explain the concept of data replication in HDFS.
16. What is the role of the "Shuffle and Sort" phase in a MapReduce job?
Multiple-Choice Questions (MCQs)
1. Which of the following is NOT considered one of the "3 Vs" of Big Data?
A) Variety B) Velocity
C) Value D) Volume
2. What is the primary challenge of conventional systems when dealing with Big Data?
A) Lack of data storage
B) Slow data processing speeds
C) Inability to manage unstructured data
D) High cost of cloud storage
3. Which type of data is characterized by being organized in a structured format, such as
rows and columns?
A) Unstructured data B) Semi-structured data
C) Structured data D) Raw data
4. What type of analysis is commonly used to make predictions based on Big Data?
A) Descriptive analysis
B) Predictive analysis
C) Prescriptive analysis
D) Diagnostic analysis
5. Which of the following tools is widely used for Big Data storage and processing?
A) Excel B) Hadoop
C) SQL Server D) PowerPoint
6. In Big Data, what does the term "data lake" refer to?
A) A type of database
B) A storage repository for raw data
C) A backup solution
D) A visualization tool
7. Which industry is most likely to benefit from Big Data analytics for customer
behaviour insights?
A) Agriculture B) Retail
C) Manufacturing D) Construction
8. What is a significant ethical concern regarding Big Data?
A) Increased data storage costs
B) High processing speed
C) Privacy violations
D) Data visualization challenges
9. What is the main advantage of using machine learning in Big Data?
A) Reduced data storage needs
B) Automation of data collection
C) Enhanced ability to uncover patterns in large datasets
D) Simplified data entry processes
10. Which of the following is an example of unstructured data?
A) Excel spreadsheets
B) Social media posts
C) SQL databases
D) CSV files
11. What is the primary focus of prescriptive analytics?
A) Understanding past trends
B) Predicting future outcomes
C) Recommending actions to achieve desired outcomes
D) Analyzing current performance
12. Which Big Data technology is known for real-time data processing?
A) Hadoop B) Apache Spark
C) MySQL D) MongoDB
13. Which of the following best describes Big Data analytics?
A) Analyzing small datasets for trends
B) Collecting and analyzing large volumes of diverse data
C) Using data solely for reporting purposes
D) Storing data in traditional databases
14. What is a primary benefit of using cloud computing for Big Data analytics?
A) Increased hardware costs
B) Scalability and flexibility
C) Limited access to data
D) Decreased processing speed
15. Which of the following is a common use case for Big Data in healthcare?
A) Inventory management
B) Patient outcome prediction
C) Supply chain optimization
D) Employee training programs