
Neural Network and Deep Learning

Ahmed Elhelbawy
Naxcen Quantum Society Research Community
December 2024

Neural Network
• Mimics the functionality of the brain.
• A neural network is a graph of neurons (nodes, units, etc.) connected by weighted links.
Neural Network: Neuron

Neural Network: Perceptron

• A network with only a single layer.
• No hidden layers.
Neural Network: Perceptron

Exercise: choose weights and a threshold so that a single perceptron (output a = 1 when the weighted sum exceeds t) computes each gate:
• AND gate: inputs x1, x2 with weights w1 = ?, w2 = ?, threshold t = ?
• OR gate: inputs x1, x2 with weights w1 = ?, w2 = ?, threshold t = ?
• NOT gate: input x1 with weight w1 = ?, threshold t = ?

Neural Network: Perceptron

Solution (a runnable sketch follows below):
• AND gate: w1 = 1, w2 = 1, t = 1.5
• OR gate: w1 = 1, w2 = 1, t = 0.5
• NOT gate: w1 = -1, t = -0.5
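Below is a minimal Python sketch (not from the slides) checking that these weights and thresholds realize the gates; the step activation (output 1 when the weighted sum exceeds t) is the assumption:

# Step-activation perceptron: fire (1) when the weighted sum exceeds threshold t.
def perceptron(inputs, weights, t):
    s = sum(x * w for x, w in zip(inputs, weights))
    return 1 if s > t else 0

# Truth tables for the three gates with the weights from the solution above.
for x1 in (0, 1):
    for x2 in (0, 1):
        print(x1, x2,
              "AND:", perceptron((x1, x2), (1, 1), 1.5),
              "OR:",  perceptron((x1, x2), (1, 1), 0.5))
print("NOT 0:", perceptron((0,), (-1,), -0.5), "NOT 1:", perceptron((1,), (-1,), -0.5))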
Neural Network: Multi-Layer Perceptron (MLP) or Feed-Forward Network (FNN)
• A network with n + 1 layers: one output layer and n hidden layers.

Training: Backpropagation algorithm

• Gradient descent algorithm


Training: Backpropagation algorithm (a numpy sketch follows the steps)
1. Initialize the network with random weights.
2. For all training cases (called examples):
   a. Present the training inputs to the network and calculate the output.
   b. For all layers (starting with the output layer, back to the input layer):
      i.  Compare the network output with the correct output (error function).
      ii. Adapt the weights in the current layer.
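As an illustration of these steps, here is a toy numpy sketch (not from the slides): a 2-2-1 network trained on XOR with sigmoid units. The learning rate, epoch count, and seed are assumptions, and convergence can require a different seed or more epochs:

import numpy as np

rng = np.random.default_rng(0)
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([[0], [1], [1], [0]], dtype=float)

# Step 1: initialize the network with random weights.
W1 = rng.normal(size=(2, 2)); b1 = np.zeros(2)
W2 = rng.normal(size=(2, 1)); b2 = np.zeros(1)
sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))

for epoch in range(5000):
    # Step 2a: present the inputs and calculate the output.
    h = sigmoid(X @ W1 + b1)
    out = sigmoid(h @ W2 + b2)
    # Step 2b-i: compare with the correct output (squared-error derivative).
    d_out = (out - y) * out * (1 - out)
    # Step 2b-ii: adapt weights layer by layer, from the output back to the input.
    d_h = (d_out @ W2.T) * h * (1 - h)
    W2 -= 0.5 * h.T @ d_out;  b2 -= 0.5 * d_out.sum(axis=0)
    W1 -= 0.5 * X.T @ d_h;    b1 -= 0.5 * d_h.sum(axis=0)

print(np.round(out, 2))  # should approach [0, 1, 1, 0]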

Deep Learning
What is Deep Learning?
• A family of methods that use deep architectures to learn high-level feature representations.

Example 1: [figure illustrating learned high-level features; label "MAN"]

Example 2: [figure]

Why are Deep Architectures hard to train?

• Vanishing/exploding gradient problem in backpropagation.
Layer-wise Pre-training
• First, train one layer at a time, optimizing the data-likelihood objective P(x).

Layer-wise Pre-training
• Then, train the second layer, optimizing the data-likelihood objective P(h).

Layer-wise Pre-training
• Finally, fine-tune on the labelled objective P(y|x) by backpropagation.

Deep Belief Nets

• Use Restricted Boltzmann Machines (RBMs).
• Hinton et al. (2006), "A fast learning algorithm for deep belief nets".
Restricted Boltzmann Machine (RBM)

• RBM is a simple energy-based model:

  P(x, h) = exp(-E(x, h)) / Z,  where  E(x, h) = -h^T W x - b^T x - d^T h
  and Z is the partition function that normalizes the distribution.

Example:
• Let the weights w(h1, x1) and w(h1, x3) be positive, all others zero, and b = d = 0.
• Calculate p(x, h).
• Answer: the highest-probability configuration is (x1 = 1, x2 = 0, x3 = 1, h1 = 1, h2 = 0, h3 = 0).

Restricted Boltzmann Machine (RBM)

• P(x, h) = P(h|x) P(x)
• P(h|x): easy to compute.
• P(x): hard to compute, because the partition function Z sums over exponentially many configurations.

Contrastive Divergence: an approximate training procedure that avoids computing Z (a sketch follows below).
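Below is a minimal numpy sketch of one CD-1 update for a binary RBM; the function name, sizes, and learning rate are illustrative assumptions, not from the slides:

import numpy as np

rng = np.random.default_rng(0)
sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))

def cd1_step(x, W, b, d, lr=0.1):
    # Positive phase: sample hidden units from p(h|x), which is easy to compute.
    ph = sigmoid(x @ W + d)
    h = (rng.random(ph.shape) < ph).astype(float)
    # Negative phase: one Gibbs step back to a reconstruction of x.
    px = sigmoid(h @ W.T + b)
    x_neg = (rng.random(px.shape) < px).astype(float)
    ph_neg = sigmoid(x_neg @ W + d)
    # Update: data statistics minus reconstruction statistics (no Z needed).
    W += lr * (np.outer(x, ph) - np.outer(x_neg, ph_neg))
    b += lr * (x - x_neg)
    d += lr * (ph - ph_neg)

# Toy usage with 3 visible and 3 hidden units, as in the example above.
W = rng.normal(scale=0.1, size=(3, 3)); b = np.zeros(3); d = np.zeros(3)
cd1_step(np.array([1.0, 0.0, 1.0]), W, b, d)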
Deep Belief Nets (DBN) = Stacked RBM

Auto-Encoders: A Simpler Alternative to RBMs
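A minimal numpy auto-encoder sketch (the sizes, learning rate, and linear decoder are assumptions for illustration): encode to a smaller code, decode, and reduce the reconstruction error by gradient descent.

import numpy as np

rng = np.random.default_rng(0)
X = rng.random((32, 4))                      # toy dataset: 32 samples, 4 features
We = rng.normal(scale=0.1, size=(4, 2))      # encoder weights: 4-D -> 2-D code
Wd = rng.normal(scale=0.1, size=(2, 4))      # decoder weights: 2-D code -> 4-D

for _ in range(2000):
    code = np.tanh(X @ We)                   # encoder
    recon = code @ Wd                        # decoder (reconstruction)
    err = recon - X                          # reconstruction error
    Wd -= 0.01 * code.T @ err                # gradient step on squared error
    We -= 0.01 * X.T @ ((err @ Wd.T) * (1 - code**2))

print(np.mean(err**2))                       # mean squared reconstruction error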
Deep Learning - Architectures
• Recurrent Neural Network (RNN)
• Convolutional Neural Network (CNN)

Recurrent Neural Network (RNN)
• Enables networks to do temporal processing and learn sequences.

• Character-level language model. Vocabulary: [h, e, l, o]
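A minimal untrained vanilla-RNN forward pass in numpy over this vocabulary; the hidden size is an assumption, and the weight names U (input-to-hidden), W (recurrent), V (hidden-to-output) match the BPTT figure below. With random weights the predictions are meaningless until trained:

import numpy as np

vocab = ['h', 'e', 'l', 'o']
rng = np.random.default_rng(0)
H = 8                                    # hidden size (illustrative)
U = rng.normal(scale=0.1, size=(H, 4))   # input -> hidden
W = rng.normal(scale=0.1, size=(H, H))   # hidden -> hidden (recurrence)
V = rng.normal(scale=0.1, size=(4, H))   # hidden -> output scores

h = np.zeros(H)
for ch in "hell":                        # feed "hell"; each step predicts the next char
    x = np.eye(4)[vocab.index(ch)]       # one-hot encoding of the input character
    h = np.tanh(U @ x + W @ h)           # recurrent state update
    scores = V @ h
    probs = np.exp(scores) / np.exp(scores).sum()   # softmax over the vocabulary
    print(ch, '->', vocab[int(np.argmax(probs))])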

Training of RNN: BPTT (Backpropagation Through Time)

[figure: RNN unrolled through time, with input-to-hidden weights U, recurrent weights W, and hidden-to-output weights V; predicted outputs are compared against actual outputs at each step]

Training of RNN: BPTT

• One to many: sequence output (e.g. image captioning takes an image and outputs a sentence of words).
• Many to one: sequence input (e.g. sentiment analysis, where a given sentence is classified as expressing positive or negative sentiment).
• Many to many: sequence input and sequence output (e.g. machine translation: an RNN reads a sentence in English and then outputs a sentence in French).
• Many to many (synced): synced sequence input and output (e.g. language modelling, where we wish to predict the next word).

RNN Extensions
• Bidirectional RNNs
• Deep (Bidirectional) RNNs

RNN (Cont.)
• "the clouds are in the sky": the relevant context ("the clouds are in the") sits right next to the word to predict ("sky"), so a simple RNN can learn it.

RNN (Cont.)
• "India is my home country. I can speak fluent Hindi.": predicting "Hindi" requires remembering "India" from much earlier in the sequence.

It is very hard for an RNN to learn such "long-term dependencies".


LSTM (Long Short-Term Memory)
• Capable of learning long-term dependencies.

[figures: simple RNN cell vs. LSTM cell]

LSTM
• LSTMs remove or add information to the cell state, carefully regulated by structures called gates.

• Cell state: the conveyor belt of the cell.


LSTM
• Gates (a step-by-step sketch follows after this list):
  – Forget gate
  – Input gate
  – Output gate
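A minimal numpy sketch of one LSTM step using the standard gate equations (the names and sizes are illustrative assumptions):

import numpy as np

sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))

def lstm_step(x, h, c, W, b):
    # W maps the concatenated [h, x] to the four gate pre-activations.
    z = W @ np.concatenate([h, x]) + b
    f, i, o, g = np.split(z, 4)
    f, i, o = sigmoid(f), sigmoid(i), sigmoid(o)   # forget, input, output gates
    g = np.tanh(g)                                 # candidate cell update
    c = f * c + i * g       # cell state ("conveyor belt"): forget old, add new
    h = o * np.tanh(c)      # output gate decides what the cell reveals
    return h, c

# Toy usage: 4-D input, 8-D hidden/cell state.
rng = np.random.default_rng(0)
W = rng.normal(scale=0.1, size=(4 * 8, 8 + 4)); b = np.zeros(4 * 8)
h, c = np.zeros(8), np.zeros(8)
h, c = lstm_step(rng.random(4), h, c, W, b)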
LSTM Variants

Convolutional Neural Network (CNN)

• A special kind of multi-layer neural network.
• Implicitly extracts relevant features.
• A fully-connected architecture does not take the spatial structure into account.
• In contrast, a CNN tries to take advantage of the spatial structure.

Convolutional Neural Network (CNN)

1. Convolutional layer
2. Pooling layer
3. Fully connected layer
Convolutional Neural Network (CNN)

1. Convolutional layer (a worked computation follows below)

Convolution filter:
1 0 1
0 1 0
1 0 1

Image:
1 1 1 0 0
0 1 1 1 0
0 0 1 1 1
0 0 1 1 0
0 1 1 0 0
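A short numpy sketch that slides the 3x3 filter over the 5x5 image above (stride 1, no padding) and prints the resulting feature map:

import numpy as np

image = np.array([[1, 1, 1, 0, 0],
                  [0, 1, 1, 1, 0],
                  [0, 0, 1, 1, 1],
                  [0, 0, 1, 1, 0],
                  [0, 1, 1, 0, 0]])
kernel = np.array([[1, 0, 1],
                   [0, 1, 0],
                   [1, 0, 1]])

# Multiply each 3x3 patch elementwise by the filter and sum the products.
out = np.zeros((3, 3), dtype=int)
for i in range(3):
    for j in range(3):
        out[i, j] = np.sum(image[i:i+3, j:j+3] * kernel)
print(out)   # [[4 3 4]
             #  [2 4 3]
             #  [2 3 4]]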

Convolutional Neural Network (CNN)

1. Convolutional layer
• Local receptive fields
• Shared weights

Convolutional Neural Network (CNN)

2. Pooling layer
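A minimal max-pooling sketch (2x2 windows, stride 2; the feature-map values are made up for illustration):

import numpy as np

fmap = np.array([[1, 3, 2, 1],
                 [4, 6, 5, 2],
                 [3, 1, 1, 0],
                 [1, 2, 2, 4]])
# Take the maximum of each non-overlapping 2x2 block.
pooled = fmap.reshape(2, 2, 2, 2).max(axis=(1, 3))
print(pooled)   # [[6 5]
                #  [3 4]]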
Convolutional Neural Network (CNN)

3. Fully connected layer


Convolutional Neural Network (CNN)

Putting it all together (a Keras sketch follows below):

Input matrix → 3 convolution filters → convolution features → pooling → pooled features → flatten → fully-connected layers → labels
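One way to express this pipeline is sketched below with tf.keras (assuming TensorFlow is installed; the input shape and layer sizes are illustrative assumptions, not from the slides):

import tensorflow as tf

# Input matrix -> 3 convolution filters -> pooling -> flatten -> fully-connected -> labels.
model = tf.keras.Sequential([
    tf.keras.Input(shape=(28, 28, 1)),                     # input matrix
    tf.keras.layers.Conv2D(3, (3, 3), activation='relu'),  # 3 convolution filters
    tf.keras.layers.MaxPooling2D((2, 2)),                  # pooling
    tf.keras.layers.Flatten(),                             # flatten
    tf.keras.layers.Dense(10, activation='softmax'),       # fully-connected -> labels
])
model.summary()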


Example 1: CNN for Images

Example 2: CNN for Text
