The Rise of Computer Vision: Mechanics, Use Cases, Real World Successes
In the spotlight
What is computer vision?
Computer vision is an incredibly sophisticated tool. The best way to appreciate this complexity is to consider how human sight works. Human eyes have some six to seven million cone cells, each of which contains one of three color-sensitive proteins, called opsins. When light photons make contact with these opsins, a reaction occurs that generates electrical signals. This message is transmitted to the brain where it’s interpreted.

Building an application capable of replicating that process is phenomenally intricate. Such a system would have to be capable of processing and interpreting visual information so that it can be used for pattern and object recognition, and for adapting 2D images from our 3D world into 3D information. Yet in just a decade, the accuracy of object identification rates has increased from 50% to an incredible 99%.[3] This makes computer vision more accurate than humans at reacting rapidly to visual data.

AT A GLANCE
In essence, computer vision is a subset of artificial intelligence that enables computers to see and make sense of the world. It is the AI application that allows a computer to learn to analyze information from photos to video to thermal and infrared data, amongst other sources, and then make decisions or come to a clear understanding of the environment or situation based on that information.

NEXT, WE’LL REVIEW EXACTLY HOW COMPUTER VISION WORKS
3 https://www.sas.com/en_us/insights/analytics/computer-vision.html
Under the bonnet
How does computer vision work?

Computer vision is based on deep learning, a form of machine learning that trains computers to perform human-like tasks. Currently, these broadly include understanding natural language, identifying and classifying objects in images, and making predictions, but other uses are evolving. Rather than running data through a series of predefined equations, deep learning establishes the structure of the model through some basic parameters concerning the data, then trains the computer to learn on its own by recognizing patterns using many layers of processing.

Any training data will need to include sample images of luggage containing prohibited items, such as aerosols, weapons, and liquid containers, as well as those containing permitted items. Each image will be tagged with metadata indicating the correct answer – in this case, permitted or prohibited. A neural network will process the visual data, using pattern recognition to identify the many different components of an image. Its outputs, or ‘answers’ as to whether an item is allowed or not, are fed back into the system, allowing it to learn and improve in accuracy. So, instead of a human attributing certain characteristics to items, the machine learns from the images it receives.
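To make the idea of learning from tagged images concrete, here is a minimal sketch in plain Python. It is an illustration under stated assumptions, not a production approach: the 4x4 "scans" are synthetic, the bright blob standing in for a prohibited item is invented for the example, and a single artificial neuron (logistic regression) takes the place of a deep, many-layered network. The function names are hypothetical.

```python
import math
import random

def make_image(prohibited, rng, size=4):
    """Synthetic grayscale scan: dim noise, plus a bright 2x2 blob if prohibited."""
    img = [[rng.uniform(0.0, 0.2) for _ in range(size)] for _ in range(size)]
    if prohibited:
        r, c = rng.randrange(size - 1), rng.randrange(size - 1)
        for dr in (0, 1):
            for dc in (0, 1):
                img[r + dr][c + dc] = rng.uniform(0.8, 1.0)
    return [p for row in img for p in row]  # flatten to a feature vector

def predict(weights, bias, pixels):
    """Probability that the image is 'prohibited' (label 1)."""
    z = bias + sum(w * p for w, p in zip(weights, pixels))
    return 1.0 / (1.0 + math.exp(-z))

def train(samples, epochs=50, lr=0.5):
    """Stochastic gradient descent: each answer is compared with the tag,
    and the difference is fed back to adjust the weights."""
    weights, bias = [0.0] * len(samples[0][0]), 0.0
    for _ in range(epochs):
        for pixels, label in samples:
            error = predict(weights, bias, pixels) - label  # feedback signal
            for i, p in enumerate(pixels):
                weights[i] -= lr * error * p
            bias -= lr * error
    return weights, bias

def accuracy(weights, bias, samples):
    hits = sum((predict(weights, bias, px) >= 0.5) == bool(y) for px, y in samples)
    return hits / len(samples)

rng = random.Random(0)  # fixed seed so the run is repeatable
# Each sample is (pixels, tag), where tag 1 = prohibited and 0 = permitted
train_set = [(make_image(i % 2 == 1, rng), i % 2) for i in range(200)]
test_set = [(make_image(i % 2 == 1, rng), i % 2) for i in range(100)]

weights, bias = train(train_set)
print(f"held-out accuracy: {accuracy(weights, bias, test_set):.2f}")
```

No rule about what a prohibited item looks like is ever written down: the weights are shaped entirely by the tagged examples, and accuracy is measured on images the model has not seen before.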
Beyond theory. Into reality.

Computer vision showcase
Computer vision has more real-world applications than you might think – and the list is growing. The question many CIOs and analytics leaders ask us is: will the investment deliver groundbreaking change? By that they mean faster and more cost-effective ways to serve customers and citizens, and innovative services that differentiate them from competitors. Our answer is yes, but then we’ve been working in the field of AI, specifically machine learning, for more than 40 years.

This collection of world vision projects and their results highlights precisely how transformational the technology can be.

• AI can improve manufacturing detection rates by up to 90% – Computer vision makes it possible to spot defects not easily visible to the human eye.
• 200,000 players analyzed for finding the next football star with AI – Computer vision makes it possible to analyze every player, much to the enjoyment of the fans.
• 1,735,350 estimated new cases of cancer diagnosed in the US in 2018 – Computer vision helps identify areas of concern in the livers and brains of cancer patients.
• Airport security misses 70% of unauthorized items – Computer vision enables safer airline travel by identifying unauthorized objects.
• $2 billion counterfeit bills in circulation in the USA alone – Computer vision makes it possible to spot counterfeit money and prevent fraud.
• 4 billion loss in US orange market due to crop disease – Computer vision makes it possible to detect early signs of plant disease to optimize crop yield.
• 93% decline in cheetah population over the past century – Computer vision makes non-invasive monitoring of endangered species possible.
• 5-10 minutes is the maximum acceptable time customers are prepared to wait in line – Computer vision makes automated checkout possible for a better customer experience.
• Proper oxygenation of the placenta is essential for a successful birth – Computer vision makes it possible to monitor oxygenation through the umbilical cord.
• $460 billion counterfeit goods bought and sold around the world annually – Computer vision helps detect fake goods and protects consumers.
Life-saving and

Exponential possibilities
Any time a human could benefit from interpreting lots of activities or a complex or rapidly changing scenario, computer vision can help. As the showcase illustrates, computer vision is especially powerful when human eyesight may be fatigued or unable to perceive the details of an image, especially where a large volume of visual data is being analyzed at speed. Insurance, policing, utilities, and manufacturing are all able to ‘see things more clearly’ and make faster, more accurate decisions because of computer vision. It offers us the potential to save lives and to change the world, as exemplified by these extraordinary stories. Use them as food for thought for your board, your stakeholders, your teams.

Please reach out to us to discuss how we can help you build your computer vision projects or develop your broader AI strategy.

REQUEST A CALL AT SAS.COM/COMPUTERVISION

Radiographers have a huge and growing workload, typically reviewing hundreds if not thousands of images every day. Under that kind of time pressure and strain, it’s possible that suspicious structures or risk indicators could be missed. Computer vision is being deployed not to replace the experience of radiographers but to augment their efforts in order to accelerate review times and improve patient outcomes.

Let’s take a look at the process we developed for Amsterdam University Medical Centers (AUMC), whose physicians wanted a better, faster way to diagnose patients with liver and brain cancer. This project demonstrates the transformational outcomes that can be achieved, especially when computer vision is combined with other AI and analytical capabilities such as NLP and recommendation engines.

Phase 1. Triage
With thousands of images to review, we built a model to triage cases. The first step was to process images and analyze constructs such as the boundaries of a tumor or lesions using a convolutional neural network (CNN). An in-depth scan has a lot of data and […] diagnose and expedite treatment recommendations.

Phase 2. Augment
Image data was combined with patient data such as their age, medical treatment, family history, and risk factors such as diet and lifestyle. Our model reviews this visual and contextual information to determine the probability that the scan shows cancer.

Phase 3. Recommend
We then built a recommendation engine that suggests treatments and interventions, plotting them against risk and likely outcomes. For example, an elderly patient with multiple existing morbidities might be served better with a monitoring and palliative intervention than invasive surgery.

Phase 4. Learn
In order to ensure the system continues to perform at outstanding levels, the hospital then tracks the outcomes of the recommendation engine, creating a feedback loop that helps it to learn and further improve outputs at both the triage and recommendation phases.
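The convolution at the heart of a CNN, as used in the triage phase to pick out boundaries, can be sketched in a few lines of plain Python. This is an illustration only and not the AUMC model: the 7x7 "scan" is invented, and the kernel is a fixed, hand-picked Laplacian, whereas a real CNN learns its kernels from training data. The name `convolve2d` is hypothetical.

```python
def convolve2d(image, kernel):
    """Slide the kernel over the image (no padding) and sum the products.
    This sliding weighted sum is the core operation of a CNN layer."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [
        [
            sum(image[r + i][c + j] * kernel[i][j]
                for i in range(kh) for j in range(kw))
            for c in range(out_w)
        ]
        for r in range(out_h)
    ]

# Hypothetical 7x7 scan: a 3x3 bright region (1) on a dark background (0)
scan = [[1 if 2 <= r <= 4 and 2 <= c <= 4 else 0 for c in range(7)]
        for r in range(7)]

# Laplacian kernel: responds where a pixel differs from its four neighbors
LAPLACIAN = [[0, 1, 0],
             [1, -4, 1],
             [0, 1, 0]]

response = convolve2d(scan, LAPLACIAN)
for row in response:
    print(row)
```

The response is zero in flat areas, including the middle of the bright region, and non-zero along its edge, which is what makes this kind of filter useful for outlining a structure.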
Wild about conservation
Seeing patterns, predicting behavior, protecting species

Over the past 100 years, the wild cheetah population in Africa has declined by 93 percent. Just 7,100 remain, with none now found in 76% of their former habitats. Competition with humans for land has resulted in many being killed, and cubs are smuggled internationally – a journey that only one in six survives.

Conservationists at WildTrack knew that invasive monitoring of black rhino populations in the 1990s had negatively impacted female fertility and they wanted to avoid impacting cheetahs in the same way. They looked to computer vision as a way to remotely monitor cheetahs rather than performing the task manually, ‘on the ground’, as they had done previously. Specifically, the team wanted to identify individuals, their sex, age class, and their movements in order to answer fundamental questions about what was happening to their numbers, why, and what could be done to prevent further population decline.
Challenges and misperceptions
Honesty is always the best policy

Computer vision is clearly a powerful technology that delivers value rapidly. However, as with any advanced analytics program, there are misperceptions and challenges that will require thought, planning, and investment from IT and data science leaders. Here is a quick overview based on our experience with clients:

• Computer vision is not a crystal ball.

• It is not going to take over decision-making within your organization.

• It can’t create or rectify models on its own – because it doesn’t think on its own. This means you cannot simply throw data into an AI system and have it build a model.

• Computer vision requires huge compute power to analyze visual data – what will this mean for your data infrastructure investments in future? At SAS, we collaborate with Intel and NVIDIA to support the core elements of our AI offerings – partnerships that help to deliver faster, more accurate insights.

• Computer vision does not come ‘out-of-the-box’ – models have to be trained before they can deliver business value. Do you have the in-house skills or expert partnerships to bring computer vision use cases to life? If not, know which expert vendors can help you develop skills and roll out your projects.

• There’s no such thing as perfect model prediction – AI models are subject to decay like any other analytical model. Ongoing training will be required to ensure the efficacy of this technology.

• Data preparation is crucial – many people believe that data can be loaded into a model and answers will be generated. Incorrect. High-quality data preparation is critical, nowhere more so than in training sets. Data must be rigorously cleansed and put into standardized formats – specialist activities that cannot be achieved automatically. It’s crucial that you recruit data scientists with an appetite and real interest in data management, and that your data management strategy is governed by cohesive processes that everyone can adhere to.

Sounds sensible enough, doesn’t it? In reality, however, many IT teams stumble when putting computer vision into production because they have not taken full account of the infrastructure investment required – from storage to compute and networking – to run these kinds of advanced analytical functions.

From the data science perspective, data preparation and management alongside model management can often be overlooked or undertaken by different people using a variety of tools. Not to mention the governance of the entire analytics lifecycle. Without proper governance and compliance processes in place, projects can quickly become highly complex, difficult to follow and impossible to audit. This not only has legal implications; it can also become very difficult for data science and analytics teams to answer important questions, such as: Where is data coming from? Who is accessing it? What programming changes have been made and why?

There is a solution that will give you the freedom to be creative with this form of AI, while helping you to maintain control of projects while in development and production.
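To make the point about model decay concrete, here is a minimal sketch of how a team might watch for it in production. The class name, window size and threshold are all hypothetical choices made for illustration. The sketch assumes that a confirmed answer eventually becomes available for each prediction, which is not always the case in practice.

```python
from collections import deque

class AccuracyMonitor:
    """Tracks accuracy over the most recent predictions and flags decay."""

    def __init__(self, window=100, threshold=0.9):
        self.results = deque(maxlen=window)  # oldest results drop off automatically
        self.threshold = threshold

    def record(self, predicted, actual):
        """Store whether a prediction matched the confirmed outcome."""
        self.results.append(predicted == actual)

    def accuracy(self):
        """Share of correct predictions in the window, or None if it is empty."""
        if not self.results:
            return None
        return sum(self.results) / len(self.results)

    def needs_retraining(self):
        """Flag decay only once the window is full, so that a handful of
        early mistakes does not trigger a false alarm."""
        window_full = len(self.results) == self.results.maxlen
        return window_full and self.accuracy() < self.threshold


monitor = AccuracyMonitor(window=10, threshold=0.8)
for _ in range(10):
    monitor.record("permitted", "permitted")   # model agrees with reality
healthy = monitor.needs_retraining()

for _ in range(3):
    monitor.record("permitted", "prohibited")  # model starts to miss items
decayed = monitor.needs_retraining()
print(healthy, decayed)
```

A rolling window is used rather than an all-time average so that recent mistakes are not diluted by a long history of correct answers.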
See computer vision clearly
One platform for the entire analytics lifecycle

At SAS we are putting the automation and learning of AI into the analytics lifecycle, where we begin with data (acquisition and preparation), move to discovery (experimenting with and building models), and finally deployment (where computer vision will be put into your production environment).

This streamlined, governed capability comes in the form of SAS Viya®, our unifying analytics platform that gives you the perfect balance of choice and control. Importantly, it offers:

• Support to load and process images or biomedical image data

• Popular image filtration and color conversion algorithms directly from SAS Viya
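For readers unfamiliar with what image filtration and color conversion involve, the sketch below shows one generic example of each in plain Python. It is not SAS Viya code and does not use any SAS API: the function names are hypothetical, and the grayscale weights are the standard ITU-R BT.601 luma coefficients.

```python
def to_grayscale(rgb_image):
    """Color conversion: rows of (R, G, B) tuples become single brightness values,
    weighted by the ITU-R BT.601 luma coefficients."""
    return [[0.299 * r + 0.587 * g + 0.114 * b for (r, g, b) in row]
            for row in rgb_image]

def mean_filter(gray, radius=1):
    """Filtration: replace each pixel with the average of its neighborhood.
    Pixels at the border average over the neighbors that exist."""
    height, width = len(gray), len(gray[0])
    smoothed = []
    for y in range(height):
        row = []
        for x in range(width):
            values = [
                gray[j][i]
                for j in range(max(0, y - radius), min(height, y + radius + 1))
                for i in range(max(0, x - radius), min(width, x + radius + 1))
            ]
            row.append(sum(values) / len(values))
        smoothed.append(row)
    return smoothed

# A tiny invented 3x3 color image with one bright white pixel in the middle
black, white = (0, 0, 0), (255, 255, 255)
image = [[black, black, black],
         [black, white, black],
         [black, black, black]]

gray = to_grayscale(image)
smooth = mean_filter(gray)
```

Steps like these are typically applied before training, so that every image reaches the model in the same color space and with less noise.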
We make all this possible with the SAS Platform

Run SAS Visual Data Mining and Machine Learning (VDMML) on Viya and your organization will have a high-performance solution that allows multiple users to explore vast sets of structured and unstructured data concurrently, via a simple visual interface. Data engineers quickly build and run transformations, augment data and join data within the integrated visual activity pipeline. All actions are performed in memory to maintain data structure consistency.

You’ll access a broad suite of advanced statistical, machine learning, deep learning and text analytics algorithms, as well as analytical capabilities including clustering, regressions, random forests, gradient boosting models, support vector machines, natural language processing, topic detection and others. For expediency, you can test different modeling approaches together, compare results and put the analytics into production across distributed or traditional environments with auto-generated SAS code.

Finally, for maximum collaboration and flexibility, your people can access these capabilities from their preferred coding language.

Here’s what our platform offers, mapped against your computer vision needs.

You need / SAS Platform (Viya) delivers:

• A full range of analytics, e.g. convolutional neural nets to autoencoder neural networks
• To explore data, move from simple models to highly sophisticated ones
• Governed processes, particularly if open source is used within your organization
• To embrace open technologies and cloud with the ability to move quickly to different environments
• A streaming engine, for real-time applications of AI, that can be integrated into your analytics solution so they can talk to each other
• To unify all technologies so they are resilient and secure

Want to explore the possibilities of computer vision yourself?
REQUEST A CALL AT SAS.COM/COMPUTERVISION
SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc. in the USA and other countries.
® indicates USA registration. Other brand and product names are trademarks of their respective companies. Copyright © 2019, SAS Institute Inc. All rights reserved.