Nasdaq Data Link Data Fabric
For the first time ever, we’re empowering our clients with
access to the same technology and team that power
Nasdaq Data Link, enabling them to ingest and deploy data
within their organizations with greater speed and efficiency
and allowing them to focus on the core value-drivers of their firm.
Data Fabric by Nasdaq Data Link
Given how challenging the path is from wanting data to deploying data,
this is an unsurprising state of affairs. From ingestion to cleansing
to productizing and finally to deployment, the journey of data onboarding
is perilous.
[Figure: the data onboarding pipeline. Stages: Ingest, Onboard, Productize, Deploy.
Steps: source data; connect to source & transfer; parse & reformat data; clean & QA;
normalize to std. format/symbology; document & catalogue; load into data warehouse.
Deployment targets: portfolio managers, analysis reports, trading systems, applications.]
With Data Fabric by Nasdaq Data Link, our goal is to make the middle
component invisible to our clients, eliminating the burden of the intervening
steps between data selection and deployment. Like so:
[Figure: select data source, then Data Fabric, then deploy to portfolio managers,
analysis reports, trading systems and applications.]
This document breaks down the three broad categories that define data
deployment (Ingest, Onboard and Productize) and shows how the Data Fabric
technology and team work within your organization to make the entire
process painless.
Ingest
Data Fabric’s data ingestion team will unify the collection of tables that make
up the dataset. They will then set up pipelines to blobs, SQL queries and
automated scripts to collect the data into a staging environment. Proprietary
Data Fabric monitoring technology ensures that data is captured accurately
as it flows in from its respective sources.
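The flow described above, collecting source tables into a staging environment and checking that nothing was lost on the way, can be sketched roughly as follows. This is an illustration only, not Data Fabric's proprietary monitoring: the table name `prices`, the functions `stage_table` and `check_capture`, and the sample rows are all hypothetical, and in-memory SQLite databases stand in for a real source and staging area.

```python
import sqlite3

def stage_table(source, staging, table):
    """Copy one source table into staging; return (source_rows, staged_rows)."""
    rows = source.execute(
        f"SELECT symbol, trade_date, close FROM {table}"
    ).fetchall()
    staging.execute(
        f"CREATE TABLE IF NOT EXISTS {table} "
        "(symbol TEXT, trade_date TEXT, close REAL)"
    )
    staging.executemany(f"INSERT INTO {table} VALUES (?, ?, ?)", rows)
    staging.commit()
    staged = staging.execute(f"SELECT COUNT(*) FROM {table}").fetchone()[0]
    return len(rows), staged

def check_capture(source_rows, staged_rows):
    """Simple monitoring check: every source row must have reached staging."""
    return source_rows == staged_rows

# Hypothetical source standing in for a vendor feed (values are made up).
source = sqlite3.connect(":memory:")
source.execute("CREATE TABLE prices (symbol TEXT, trade_date TEXT, close REAL)")
source.executemany(
    "INSERT INTO prices VALUES (?, ?, ?)",
    [("AAPL", "2021-11-01", 150.0), ("MSFT", "2021-11-01", 330.0)],
)
source.commit()

staging = sqlite3.connect(":memory:")
source_rows, staged_rows = stage_table(source, staging, "prices")
capture_ok = check_capture(source_rows, staged_rows)
```

A row-count comparison is the simplest possible check; a production pipeline would likely also compare checksums, schemas and arrival times.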
The Data Fabric team understands that financial data science depends on
comparable and consistent data. In addition to checking data for missing
values, our team will standardize schema mapping, entity identifiers and
time scales. The mess of sources becomes a time series file ready for
further processing.
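As a rough illustration of what that standardization involves, the sketch below maps differing column names onto one schema, maps vendor identifiers onto one symbology, converts dates to a single format and flags missing values. Every mapping, field name and sample record here is a hypothetical stand-in, not Data Fabric's actual rules, and date formatting is used as a simplified proxy for aligning time scales.

```python
from datetime import datetime

# Hypothetical mappings: source column names -> standard schema,
# vendor identifiers -> standard tickers.
COLUMN_MAP = {"ticker": "symbol", "sym": "symbol", "dt": "date", "px_close": "close"}
SYMBOL_MAP = {"AAPL.O": "AAPL", "AAPL US": "AAPL"}
DATE_FORMATS = ("%Y-%m-%d", "%m/%d/%Y", "%Y%m%d")

def parse_date(text):
    """Try each known source format and return an ISO 8601 date string."""
    for fmt in DATE_FORMATS:
        try:
            return datetime.strptime(text, fmt).date().isoformat()
        except ValueError:
            continue
    raise ValueError(f"unrecognized date: {text!r}")

def normalize(record):
    """Apply schema, identifier and date standardization to one record."""
    row = {COLUMN_MAP.get(key, key): value for key, value in record.items()}
    row["symbol"] = SYMBOL_MAP.get(row["symbol"], row["symbol"])
    row["date"] = parse_date(row["date"])
    return row

def missing_fields(row, required=("symbol", "date", "close")):
    """List the required fields that are absent or empty."""
    return [name for name in required if row.get(name) in (None, "")]

# Two made-up records from differently shaped sources.
raw = [
    {"ticker": "AAPL.O", "dt": "11/01/2021", "px_close": 150.0},
    {"sym": "AAPL US", "date": "20211102", "close": None},
]
clean = sorted((normalize(r) for r in raw), key=lambda r: (r["symbol"], r["date"]))
gaps = [(r["symbol"], r["date"], missing_fields(r)) for r in clean if missing_fields(r)]
```

After this step both records share one schema and one identifier, sorted as a time series, with the missing close value reported rather than silently passed along.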
Onboard
Nasdaq Data Link has deployed thousands of datasets since its inception.
That kind of publishing volume is only possible with the use of machine
intelligence that automates much of the process that would otherwise
require human intervention.
Our team of data scientists will deploy machine learning models to tag
relevant information in the dataset you wish to onboard. This can be content
to drive analysis, or metadata to help understand data lineage. Combined,
these create searchable and, most importantly, understandable data at the
end of the pipeline.
[Figure: example tags applied to a dataset: "AAPL" tagged as Company, "$160" as
Price (close price), "Hold" as NLP text.]
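To make the idea of tagging concrete, here is a deliberately simple stand-in. It uses hand-written rules rather than the machine learning models described above, so it shows only the shape of the output (a value paired with a tag), not how Data Fabric produces it. The tag names, the patterns and the sample sentence are assumptions made for illustration.

```python
import re

# Hand-written rules standing in for trained models (illustrative only).
TICKER = re.compile(r"[A-Z]{2,5}")
PRICE = re.compile(r"\$\d+(?:\.\d+)?")
RATINGS = {"buy", "hold", "sell"}

def tag(text):
    """Return (value, tag) pairs for tokens that match one of the rules."""
    tags = []
    for token in text.split():
        word = token.strip(".,;:")
        if PRICE.fullmatch(word):
            tags.append((word, "price"))
        elif word.lower() in RATINGS:
            tags.append((word, "nlp_text"))
        elif TICKER.fullmatch(word):
            tags.append((word, "company"))
    return tags

tags = tag("AAPL closed at $160. Rating: Hold")
# tags == [("AAPL", "company"), ("$160", "price"), ("Hold", "nlp_text")]
```

Tags like these are what make a dataset searchable by content further down the pipeline.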
A factored catalogue allows users to understand what data exists and how they
can use it.
[Figure: catalogue fields and their access levels: public field, public field,
open field; all users.]
Productize
The traditional analysis pipeline has a number of components, each of which
requires key decisions. When multiplied by the tens of thousands of
packages available, even small teams can end up with a set of fractured
services that don’t support each other.
With Data Fabric, the IDE serves as an input, so you have the flexibility to use
whichever IDE has the best core functionality for you, and we’ll provide the
relevant financial analysis packages and libraries.
With Data Fabric, we’ve set up efficient compute services for dataframe
analysis, machine learning with TensorFlow and other financial workloads.
Your workflows scale efficiently thanks to Nasdaq’s partnership with Databricks.
The process starts with data discovery within the organization, after the
dataset has already been deployed. Making data easy to find in a virtual
catalogue (with an interface not unlike Nasdaq Data Link) makes for an ideal
user experience. For a new investment manager or researcher joining an
organization, knowing where to go to see all of its data saves countless
hours otherwise spent simply finding out what’s available. The alternative is
merely another system requiring maintenance.
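A catalogue of this kind can be pictured as a searchable list of dataset descriptions. The sketch below is a minimal in-memory version, assuming hypothetical dataset codes, names and tags; it is not the Data Fabric or Nasdaq Data Link catalogue interface.

```python
from dataclasses import dataclass, field

@dataclass
class CatalogueEntry:
    code: str
    name: str
    description: str
    tags: set = field(default_factory=set)

def search(catalogue, query):
    """Return codes of entries whose text contains every term in the query."""
    terms = query.lower().split()
    hits = []
    for entry in catalogue:
        haystack = " ".join(
            [entry.code, entry.name, entry.description, *entry.tags]
        ).lower()
        if all(term in haystack for term in terms):
            hits.append(entry.code)
    return sorted(hits)

# Made-up entries for illustration.
catalogue = [
    CatalogueEntry("EQ_PRICES", "Equity end-of-day prices",
                   "Daily close prices by ticker", {"equities", "prices"}),
    CatalogueEntry("FX_RATES", "FX spot rates",
                   "Daily spot rates by currency pair", {"fx"}),
]
results = search(catalogue, "daily prices")
```

With a single place to search, a newcomer can find out what the organization holds by typing a few keywords instead of asking around.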