0% found this document useful (0 votes)
5 views3 pages

Data Science_notes_X

The document is a comprehensive overview of data science, including multiple choice questions and detailed answers about its concepts, applications, and methodologies. It covers topics such as data collection methods, data visualization, system maps, and the importance of reliable data sources. Additionally, it discusses Python modules and packages used in data science projects, along with common errors encountered in data collection.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
5 views3 pages

Data Science_notes_X

The document is a comprehensive overview of data science, including multiple choice questions and detailed answers about its concepts, applications, and methodologies. It covers topics such as data collection methods, data visualization, system maps, and the importance of reliable data sources. Additionally, it discusses Python modules and packages used in data science projects, along with common errors encountered in data collection.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

Data Science

Data Science

I. Multiple Choice Questions :


1. Which of the following language is used in Data science?
a) Java b) Ruby c) Python d) None of the mentioned
2. Name any 2 methods of collecting data.
a) Surveys and Interviews b) Rumours and Myths
c) AI models and applications d) Imagination and thoughts
3. Data science is the process of diverse set of data through ...... ?
a) organizing data b) processing data
c) analysing data d) All of the above
4. Which of the following are the Data Sources in data science?
a) Structured b) Un Structured c) Both A and B d) None of these
5. Which of the following is not a application for data science?
a) Recommendation Systems b) Genetics & Genomics
c) Online Price Comparison d) Privacy Checker
6. Which of the following step is performed by data scientist after acquiring the data?
a) Data Cleaning b) Data Integration
c) Data visualization d) All of the above
7. ........ employs techniques and theories drawn from many fields within the context of
Mathematics, Statistics, Computer Science, and Information Science.
a) Data Science b) Computer vision
c) Data interpreter d) Augmented Reality
8. For which of the following is Data visualization used?
a) Data modelling b) Data exploration
c) data formation d) Data identification
9. Which one of the following is NOT a part of problem scoping?
a) Goal identification b) Model selection
c) Stakeholders identification d) Ethical concerns identification

II. Answer the following :

1. What is data science?


Ans : Data science is a domain of AI related to data systems and processes, in which
the system collects numerous data, maintains data sets and derives meaning/sense
out of them. The information extracted through data science can be used to make a
decision about it.
2. Name any four applications of data science.
Ans : Some of the applications of data science are Fraud and Risk Detection,
Genetics & Genomics, Internet Search, Targeted Advertising, Website
Recommendations, Airline Route Planning.
3. Describe System Maps.
Ans : System Maps are the list of components which can be represented in the
graphical manner. A System Map comprises a) all the discrete elements within a
system b) Interconnections showing the relationships between different elements
4. What are the benefits of System Maps?
Ans : System Maps help us to find the relationships between different elements of the
problem which we have scoped. It helps us in strategizing the solution for achieving

Page 1 of 3
Data Science

the goal of our project. Its main use is to help structure a system and communicate
the result to others.
5. How is the relationship between the elements, shown in a system map?
Ans : In a System Map, the cause & effect relationship of elements with each other
are shown with the help of arrows. The arrowhead depicts the direction of the effect
and the sign (+ or –) shows their relationship. If the arrow goes from X to Y with a +
sign, it means that both are directly related to each other. That is, If X increases, Y
also increases and vice versa. On the other hand, If the arrow goes from X to Y with
a – sign, it means that both the elements are inversely related to each other which
means if X increases, Y would decrease and vice versa.
6. What are the sources of data?
Ans : Data may be collected offline or online. The offline sources of data are surveys,
interviews, observations, records, sensors etc. The online sources of data are open
sourced government portals, reliable web sites, open sourced statistical web sites
etc.
7. List out the points to be considered while collecting data from any data sources.
Ans : Following points should be kept in mind, while accessing data from any of the
data sources:
1. Data which is available for public usage only should be taken up.
2. Personal datasets should only be used with the consent of the owner.
3. One should never breach someone’s privacy to collect data.
4. Data should only be taken from reliable sources as the data collected from random
sources can be wrong or unusable.
5. Reliable sources of data ensure the authenticity of data which helps in proper
training of the AI model.
8. List out the commonly used formats to store tabular data.
Ans : CSV(Comma Separated Values), Spreadsheets & SQL are some commonly
used formats to store tabular data.
9. What is module and package in Python?
Ans : The module is a simple Python file that contains collections of functions and
global variables and with a .py extension file. It is an executable file.
A python package is a collection of modules. Modules that are related to each other
are mainly put in the same package
10. List out the packages used in data science projects.
Ans : NumPy, Pandas, MatPlotLib, NLTK, are the packages used in data science
projects.
11. List out any four types of graphs that can be plotted using Matplotlib.
Ans : Scatter plot, Bar chart, Histogram, Box plot
12. List out the kind of errors that may come with data while collecting it.
Ans : While collecting data, it is possible that the data might come with some errors.
They are, Erroneous Data – a) Incorrect values that do not resemble the kind of data
expected in that position b) Invalid or Null values which comprises corrupted data
values c) Missing Data - The values of these cells are missing and hence the cells
remain empty. d) Outliers - Data which does not fall in the range of a certain element
are referred to as outliers.
13. Why is data visualization important?
Ans : While collecting data, it is possible that the data might come with some errors.
Analysing the data collected can be difficult as it is all about tables and numbers.

Page 2 of 3
Data Science

While machines work efficiently on numbers, humans need visual aid to understand
and comprehend the information passed. Hence, data visualisation is important to
interpret the data collected and identify patterns and trends out of it.
14. Explain the following applications of Data science.
a) Targeted Advertisement b) Internet search
c) Recommender Systems d) Price Comparison
Ans : a) Targeted Advertisement : Targeted advertising is a form of online advertising
that focuses on the specific traits, interests, and preferences of a consumer.
Targeted advertising allows brands to send different messaging to different
consumers based on what the brand knows about the customer.
b) Internet search : Internet search is the process of exploring the Internet for
information with the use of a search engines like Google or Microsoft Bing. AI search
engines work by first crawling and indexing web pages across the internet, extracting
useful data like text, images and links.
c) Recommender Systems : A recommendation system is an AI algorithm, that
uses Big Data to suggest or recommend additional products to consumers. These can
be based on various criteria, including past purchases, search history, demographic
information, and other factors. Recommender systems are highly useful as they help
users discover products and services they might otherwise have not found on their
own.
d) Price Comparison : These are websites that compares the price of a particular
product or service in different stores or from different companies. Price comparison
site helps to find the best price, latest products, and online shopping deals.

Page 3 of 3

You might also like