Reading 3 Machine Learning
Reading 3 Machine Learning
The unsupervised machine learning algorithm that reduces highly correlated features into
fewer uncorrelated composite variables by transforming the feature covariance matrix best
describes:
The technique in which a machine learns to model a set of output data from a given set of
inputs is best described as:
A) unsupervised learning.
B) supervised learning.
C) deep learning.
Over the past several years, Kowalski has become aware that investment firms are increasingly
using technology to improve their investment decision making. Kowalski has become
particularly interested in machine learning techniques and how they might be applied to
investment management applications.
Kowalski has read a number of articles about machine learning in various journals for financial
analysts. However, she has only a minimal knowledge of how she might source appropriate
model inputs, interpret model outputs, and translate those outputs into investment actions.
Kowalski and Nowak meet to discuss plans for incorporating machine learning into their
investment model. Kowalski asks Nowak to research machine learning and report back on the
types of investment problems that machine learning can address, how the algorithms work,
and what the various terminology means.
After spending a few hours researching the topic, Nowak makes a number of statements to
Kowalski on the topics of:
Kowalski is left to work out which of Nowak's statements are fully accurate and which are not.
Which of the following statements Nowak makes about hierarchical clustering is most accurate?
A) In divisive hierarchical clustering, the algorithm seeks out the two closest clusters.
Hierarchical clustering is a supervised iterative algorithm that is used to build a
B)
hierarchy of clusters.
C) Bottom-up hierarchical clustering begins with each observation being its own cluster.
Which of the following statements Nowak makes about neural networks is most accurate?
Neural networks:
A) are effective in tasks with non-linearities and complex interactions among variables.
have four types of layers: an input layer, agglomerative layers, regularization layers,
B)
and an output layer.
have an input layer node that consists of a summation operator and an activation
C)
function.
Which supervised learning model is most appropriate (1) when the Y-variable is continuous and
(2) when the Y-variable is categorical
C) Regression Classification
A) reinforcement learning.
B) neural networks.
C) deep learning nets.
The degree to which a machine learning model retains its explanatory power when predicting
out-of-sample is most commonly described as:
A) hegemony.
B) generalization.
C) predominance.
What is the appropriate remedy in the presence of excessive number of features in a data set?
A) Unsupervised learning.
B) Big data analysis.
C) Dimension reduction.
A) supervised learning.
B) unsupervised learning.
C) clustering.
Considering the various supervised machine learning algorithms, a linear classifier that seeks
the optimal hyperplane and is typically used for classification, best describes:
Considering the various supervised machine learning algorithms, a penalized regression where
the penalty term is the sum of the absolute values of the regression coefficients best describes:
Joyce Tan manages a medium-sized investment fund at Marina Bay Advisors that specializes in
international large cap equities. Over the four years that she has been portfolio manager, Tan
has been invested in approximately 40 stocks at a time.
Tan has used a number of methodologies to select investment opportunities from the universe
of investable stocks. In some cases, Tan uses quantitative measures such as accounting ratios
to find the most promising investment candidates. In other cases, her team of analysts suggest
investments based on qualitative factors and various investment hypotheses.
Tan begins to wonder if her team could leverage financial technology to make better decisions.
Specifically, she has read about various machine learning techniques to extract useful
information from large financial datasets, in order to uncover new sources of alpha.
Tan is interested in using a supervised learning algorithm to analyze stocks. This task is least
likely to be a classification problem if the target variable is:
A) categorical.
B) ordinal.
C) continuous.
At first Tan bases her stock picks on the results of a single machine-learning model, but then
begins to wonder if she should instead be using the predictions of a group of models.
Compared to a single machine-learning model, an ensemble machine learning algorithm is
most likely to produce predictions that are:
Tan is interested in applying neural networks, deep learning nets, and reinforcement learning
to her investment process. Regarding these techniques, which of the following statements is
most accurate?
Neural networks with one or more hidden layers would be considered deep learning
A)
nets (DLNs).
Reinforcement learning algorithms achieve maximum performance when they stay as
B)
far away from their constraints as possible.
Neural networks work well in the presence of non-linearities and complex interactions
C)
among variables.
Question #23 of 23 Question ID: 1472211