0% found this document useful (0 votes)
10 views

Data Literacy II

Uploaded by

anaghacomputer
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
10 views

Data Literacy II

Uploaded by

anaghacomputer
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 7

DATA LITERACY - II

Acquiring Data, Processing, and Interpreting Data

Q1. What are the different types of data?


● Textual data
● Numeric data

Q2. What are the differences between Textual and Numeric data?

Q3. Which are the types of Numeric Data?


● Continuous data is numeric data that is continuous. E.g., height, weight, temperature,
voltage
● Discrete data is numeric data that contains only whole numbers and cannot be fractional
E.g. the number of students in the class – it can only be a whole number, not in decimals

Q4. What is Data Acquisition/Acquiring Data? What are the steps in acquiring Data?
Data Acquisition involves searching for datasets suitable for training AI models. The process
comprises three key steps:
● Data discovery
● Data Augmentation
● Data Generation

Sample Data Discovery


The process of finding and identifying data from different sources. It’s about exploring and
understanding what data is available, where it comes from, and how it can be used.
Eg: Searching and downloading data from internet

Sample Data Augmentation


It is increasing the amount of data by making small changes to existing data.
Eg: in photos, this could mean flipping, rotating, or changing the color slightly to create more
examples for training.
Sample Data Generation
The act of creating or recording new data, often using sensors, simulations or algorithms. This
is done when you need more data than you have or when you want to create specific kinds of
data for models. Recorded/created data is stored in a computer in a suitable form.
Eg: Recording temperature readings of a building, creating synthetic images for training a
facial recognition system

Q5. What are different sources for acquiring data?


● Primary Data Sources — These are data collected directly from the original source
for a specific purpose.
Eg: surveys, interviews, experiments, etc.
● Secondary Data Sources—Secondary data collection obtains information from
external sources, rather than generating it personally.
Eg: Kaggle, data.gov, UCI machine learning repository, Google Dataset search etc.

Q6. What are the factors that make data good or bad?
Good data is accurate, complete, consistent, up-to-date, relevant, and well-organized.
Bad data has errors, missing information, inconsistencies, outdated details, or is irrelevant and
poorly structured.

Q7. How do you acquire data from websites?


Web scraping is a way to automatically gather data from websites. You use a tool to visit the
website, find the information you need, and then the tool extracts and saves that data for you.
This process helps you collect large amounts of data quickly without having to do it manually.

Q8. What are the Ethical concerns in data acquisition?


Bias: Ensuring that data is collected and used fairly, without favoring or discriminating against
any group.
Consent: Obtaining permission from individuals before collecting or using their personal data.
Transparency: Clearly disclosing how data is collected, used, and stored to build trust and
ensure openness.
Anonymity: Protecting individuals' identities by removing or disguising personal information
in the data.
Accountability: Being responsible for how data is handled and ensuring compliance with
ethical and legal standards.
Q9. What are the primary factors determining the usability of data?
There are three primary factors determining the usability of data:
1. Structure: Data structure refers to how data is organized and formatted, making it
easier to access and analyze.
2. Cleanliness: Data cleanliness means that the data is free from errors, duplicates, or
inconsistencies, ensuring reliable results.
3. Accuracy: Data accuracy ensures that the information is correct and reflects the
real-world values or facts it represents.

Q10. What are Features of Data/ Data features?


Data features are the characteristics or properties of the data. They are used to describe and
analyze the data for better understanding and modeling.
Eg: In a table of student records, features could include the student's name, age, or grade.
In a photo dataset, features might be the colors present in each image.

Q11.Which are the two types of data features?


● Independent features are the input to the model—they're the information we
provide to make predictions.
● Dependent features are the outputs or results of the model—they're what we're
trying to predict.

Q12. What is the difference between Data Processing and Data Interpretation?
Data Processing - involves organizing and transforming raw data into a usable format, like
cleaning and formatting it for analysis.
Data interpretation - is about analyzing the processed data to understand its meaning and
draw conclusions or insights.

Q13. Define the following keywords related to Data:


Acquire Data- to collect data from various data sources.
Data Processing- After raw data is collected, data is processed to derive meaningful
information from it.
Data Analysis – Data analysis is to examine each component of the data in order to draw
conclusions.
Data Interpretation – It is to be able to explain what these findings/conclusions mean in a
given context.
Data Presentation- In this step, you select, organize, and group ideas and evidence in a
logical way.
Q14. Which are the types of Data Interpretation? What are the methods used?
Based on the two types of data, there are two ways to interpret data-

1. Quantitative Data Interpretation


This involves analyzing numerical data to identify patterns or trends.
Eg: (how many) numbers of likes on the Instagram post

Data Collection Methods -Quantitative


● Interviews
● Polls
● Observations
● Longitudinal Studies
● Survey

4 Steps to Quantitative Data Analysis


1. Relate measurement scales with variables
2. Connect descriptive statistics with data
3. Decide a measurement scale
4. Represent data in an appropriate format
2. Qualitative Data Interpretation
This involves analyzing non-numerical data, like text or images, to understand themes or
patterns.
Eg: Reviews by customers

Data Collection Methods – Qualitative


● Record keeping
● Observation
● Case Studies
● Focus groups
● Longitudinal Studies
● One-to-One Interviews

5 Steps to Qualitative Data Analysis


1. Collect Data
2. Organize
3. Set a code to the Data Collected
4. Analyze your data
5. Reporting
Q15. Differentiate between Qualitative & Quantitative Data Interpretation

Q16. Explain the different ways in which Data can be interpreted.


1. Textual Data Interpretation: Data is represented in written form, such as in
paragraphs or reports. You read and analyze the text to extract key information and
understand the content. Eg: Customer reviews

2. Tabular Data Interpretation: Data is organized in tables with rows and columns.
Each cell in the table contains specific pieces of information, making it easier to
compare and analyze values across different categories.
Eg: A table that lists monthly sales figures for different products.

3. Graphical Data Interpretation: Data is shown using visual elements like charts and
graphs.
Eg: A pie chart that shows the market share of different smartphone brands.

Q17. Explain different graphical representations of data.


●Bar Graphs - Uses bars of different heights to show and compare quantities.
Eg: a bar graph might display the number of books sold each month, with each bar
representing a different month.
●Pie Chart: Shows data as slices of a circle, where each slice represents a portion of the
whole.
Eg: a pie chart could show how different expenses (like rent, food, and entertainment)
make up a person’s total monthly budget.
●Line Graph: Uses lines to connect data points over time, showing trends and changes.
Eg: a line graph might track the temperature changes throughout the year, with points
connected to show how the temperature rises and falls.
Q18. What is the importance of Data Interpretation?
● Informed Decision Making
● Reduced Cost
● Identifying Needs

Q19. What is Tableu?


Tableau is a tool that helps you visualize and explore your data. It allows you to create
graphs, charts, and dashboards from your data to make it easier to understand and analyze.

Q20. Cultivating Data Literacy means:


a) Utilize vocabulary and analytical skills
b) Acquire, develop, and improve data literacy skills
c) Develop skills in statistical methodologies
d) Develop skills in Math

Q21. Data Privacy and Data Security are often used interchangeably but they are
different from each other.
a) True b) False

Q22. The_____________________ provides guidance on using data efficiently and with


all levels of awareness.
a) data security framework b) data literacy framework
c) data privacy framework d) data acquisition framework

Q23. _____________ allows us to understand why things are happening in a particular


way.
a) data b) information c) knowledge d) wisdom

Q24.__________ is the practice of protecting digital information from unauthorized


access, corruption, or theft throughout its entire lifecycle.
a) data security b) data literacy
c) data privacy d) data acquisition

Q25. What are the basic building blocks of qualitative data?


a. Individuals
b. Units
c. Categories
d. Measurements
Q26. Which among these is not a type of data interpretation?
a. Textual b. Tabular c. Graphical d. Raw data

Q27. Quantitative data is numerical in nature.


a. True b. False

Q28. A Bar Graph is an example of?


a. Textual b. Tabular c. Graphical d. None of the above

Q29. _____________ relates to the manipulation of data to produce meaningful


insights.
a. Data Processing b. Data Interpretation
c. Data Analysis d. Data Presentation

Q30. At which stage of the AI project cycle does Tableau software prove useful?
Data Exploration

Q31. Name any five graphs that can be made using Tableau software.
Bar Graph, Line Graph, Pie Chart, Scatter Plot, Heat Map

Q32. What is the importance of data visualization?


Data visualization helps to present complex data in a clear and understandable way, making
it easier to identify patterns, trends, and insights for better decision-making.

You might also like