
DENSELY CONNECTED CONVOLUTIONAL NETWORKS

Presentation by:

Maria Waheed (l1f18bscs0460)

Farrukh Alam Virk (l1f18bscs0424)


WHAT IS COVERED IN THIS PRESENTATION

• Dense Block
• DenseNet Architecture
• Advantages of DenseNet
• CIFAR & SVHN Small-Scale Dataset Results
• ImageNet Large-Scale Dataset Results
• Further Analysis on Feature Reuse
STANDARD CONNECTIVITY

Dense Block:
A dense block is a module used in convolutional neural networks that connects all layers (with matching feature-map sizes) directly with each other. To preserve the feed-forward nature, each layer obtains additional inputs from all preceding layers and passes on its own feature-maps to all subsequent layers.

In a standard ConvNet, the input image goes through multiple convolutions to obtain high-level features.
RESNET CONNECTIVITY

Identity mappings promote gradient propagation.

+ : Element-wise addition

In ResNet, identity mapping is proposed to promote gradient propagation, and element-wise addition is used. It can be viewed as an algorithm with a state passed from one ResNet module to the next.
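In equation form, consistent with the notation used later in these slides, ResNet's fifth layer would compute x5 = h5(x4) + x4: the new features are added element-wise to the previous layer's output.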
DENSE ARCHITECTURE

DENSE CONNECTIVITY

C : Channel-wise concatenation

In DenseNet, each layer obtains additional inputs from all preceding layers and passes on its own feature-maps to all subsequent layers. Concatenation is used, so each layer receives "collective knowledge" from all preceding layers.
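In the same notation, a DenseNet layer receives the channel-wise concatenation of all preceding feature maps, e.g. x5 = h5([x0, x1, x2, x3, x4]).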
DENSE AND SLIM

k : Growth rate

Since each layer receives feature maps from all preceding layers, the network can be thinner and more compact, i.e. the number of channels per layer can be smaller. The growth rate k is the number of additional channels each layer contributes.
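A quick worked example (the numbers are illustrative): if the block input has k0 channels and the growth rate is k, the l-th layer inside a dense block receives k0 + k×(l−1) input channels and contributes only k new ones, so with k0 = 16 and k = 12 the fifth layer sees 16 + 12×4 = 64 channels.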
FORWARD PROPAGATION

[Figure: feature maps x0 … x4 being concatenated as they pass through layers h1 … h4.]

So DenseNet has higher computational efficiency and memory efficiency. The figure illustrates how feature maps are concatenated during forward propagation.
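A minimal PyTorch-style sketch of this concatenation (the shapes, growth rate, and the plain Conv2d stand-ins for the composition layers are illustrative assumptions, not from the slides):

import torch
import torch.nn as nn

k = 12                                            # growth rate (illustrative)
x0 = torch.randn(1, k, 32, 32)                    # block input with k channels
layers = [nn.Conv2d(k * (i + 1), k, kernel_size=3, padding=1) for i in range(4)]

features = [x0]
for h in layers:
    new_features = h(torch.cat(features, dim=1))  # each layer sees all preceding feature maps
    features.append(new_features)                 # and passes its own output to later layers
out = torch.cat(features, dim=1)                  # 5 * k channels leave the dense block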
DenseNet Architecture:

Basic DenseNet Composition Layer:
In each composition layer, pre-activation Batch Norm (BN) and ReLU are applied, followed by a 3×3 convolution, producing an output feature map of k channels; for example, transforming [x0, …, x4] into x5. This idea comes from Pre-Activation ResNet.

x5 = h5([x0, …, x4])
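A minimal PyTorch sketch of one composition layer (the function and parameter names are illustrative; the slides specify only the BN → ReLU → 3×3 Conv ordering and the k output channels):

import torch.nn as nn

def composition_layer(in_channels: int, growth_rate: int) -> nn.Sequential:
    # Pre-activation ordering: BN and ReLU come before the 3x3 convolution.
    return nn.Sequential(
        nn.BatchNorm2d(in_channels),
        nn.ReLU(inplace=True),
        nn.Conv2d(in_channels, growth_rate, kernel_size=3, padding=1, bias=False),
    )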
DenseNet-B (Bottleneck Layers):
To reduce model complexity and size, BN-ReLU-1×1 Conv is applied before BN-ReLU-3×3 Conv. The 1×1 convolution maps the l×k input channels to 4×k channels, and the 3×3 convolution then produces k output channels, giving higher parameter and computational efficiency.
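A corresponding sketch of a bottleneck (DenseNet-B) layer; the 4×k intermediate width follows the channel counts above, while the rest is an illustrative assumption:

import torch.nn as nn

def bottleneck_layer(in_channels: int, growth_rate: int) -> nn.Sequential:
    inter_channels = 4 * growth_rate  # the 1x1 conv first maps l*k input channels to 4*k
    return nn.Sequential(
        nn.BatchNorm2d(in_channels),
        nn.ReLU(inplace=True),
        nn.Conv2d(in_channels, inter_channels, kernel_size=1, bias=False),
        nn.BatchNorm2d(inter_channels),
        nn.ReLU(inplace=True),
        nn.Conv2d(inter_channels, growth_rate, kernel_size=3, padding=1, bias=False),
    )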
MULTIPLE DENSE BLOCKS WITH TRANSITION LAYERS:

A 1×1 convolution followed by 2×2 average pooling is used as the transition layer between two contiguous dense blocks.

Feature map sizes are the same within a dense block so that the feature maps can be concatenated easily.

At the end of the last dense block, global average pooling is performed and then a softmax classifier is attached.

[Figure: Input → Convolution → Dense Block 1 → Convolution + Pooling → Dense Block 2 → Convolution + Pooling → Dense Block 3 → Pooling → Linear → Output. Pooling reduces feature map sizes between blocks; feature map sizes match within each block.]
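A minimal sketch of such a transition layer (the slides specify only the 1×1 Conv and 2×2 average pooling; the leading BatchNorm follows the paper and is an assumption here):

import torch.nn as nn

def transition_layer(in_channels: int, out_channels: int) -> nn.Sequential:
    return nn.Sequential(
        nn.BatchNorm2d(in_channels),                        # from the paper; not shown on the slide
        nn.Conv2d(in_channels, out_channels, kernel_size=1, bias=False),
        nn.AvgPool2d(kernel_size=2, stride=2),              # halves the feature map size
    )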
DENSENETS-B

DenseNets-B are regular DenseNets that use a 1×1 convolution to reduce the number of input feature maps before the 3×3 convolution and improve computational efficiency. The "B" comes from the bottleneck layer, already familiar from the work on ResNets.
DenseNet-BC (Further Compression):
• If a dense block contains m feature-maps, the following transition layer generates θm output feature-maps, where 0 < θ ≤ 1 is referred to as the compression factor.
• When θ = 1, the number of feature-maps across transition layers remains unchanged. DenseNet with θ < 1 is referred to as DenseNet-C; θ = 0.5 is used in the experiments.
• When both bottleneck layers and transition layers with θ < 1 are used, the model is referred to as DenseNet-BC.
• Finally, DenseNets with and without B/C, with different depths L and growth rates k, are trained.

• DenseNets-C are another small incremental step beyond DenseNets-B, for cases where we would like to reduce the number of output feature maps. The compression factor θ determines this reduction: instead of m feature maps at a transition layer, we keep θ×m. Of course, θ is in the range (0, 1], so DenseNets remain unchanged when θ = 1 and become DenseNets-C otherwise.
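For example, with θ = 0.5 and m = 256 feature maps entering a transition layer, the transition layer outputs 0.5 × 256 = 128 feature maps.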
ADVANTAGES OF DENSENET

ADVANTAGE 1: STRONG GRADIENT FLOW

[Figure: the error signal flowing directly from the loss back to earlier layers.]

The error signal can be propagated to earlier layers more directly. This is a kind of implicit deep supervision, as earlier layers can get direct supervision from the final classification layer.
ADVANTAGE 2: PARAMETER & COMPUTATIONAL EFFICIENCY

For each layer, the number of parameters in ResNet is directly proportional to C×C, while the number of parameters in DenseNet is directly proportional to l×k×k.

ResNet connectivity: C correlated input features → hl → C output features; #parameters: O(C×C).

DenseNet connectivity: l×k diversified input features → hl → k output features, with k << C (k: growth rate); #parameters: O(l×k×k).
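A rough worked comparison (illustrative numbers, not from the slides): a ResNet layer with C = 256 channels and a 3×3 kernel needs about 256 × 256 × 9 ≈ 590K weights, whereas a DenseNet layer with growth rate k = 12 and l = 10 preceding layers needs about (10 × 12) × 12 × 9 ≈ 13K weights, roughly 45× fewer.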
ADVANTAGE 3: MAINTAINS LOW-COMPLEXITY FEATURES

Standard Connectivity:

The classifier uses only the most complex (high-level) features:

y = w4 h4(x)

x → h1(x) → h2(x) → h3(x) → h4(x) → classifier (increasingly complex features)
ADVANTAGE 3: MAINTAINS LOW-COMPLEXITY FEATURES

Dense Connectivity:

The classifier uses features of all complexity levels:

y = w0 x + w1 h1(x) + w2 h2(x) + w3 h3(x) + w4 h4(x)

x → h1(x) → h2(x) → h3(x) → h4(x) → classifier (increasingly complex features)

In DenseNet, the classifier uses features of all complexity levels. This tends to give smoother decision boundaries, and it also explains why DenseNet performs well when training data is insufficient.
RESULTS
RESULTS ON CIFAR-10

[Bar charts: test error (%) with and without data augmentation for ResNet (110 layers, 1.7M), ResNet (1001 layers, 10.2M), DenseNet (100 layers, 0.8M) and DenseNet (250 layers, 15.3M), compared with the previous SOTA.]
With data augmentation (C10+), test error:
• Small ResNet-110: 6.41%
• Large ResNet-1001 (10.2M parameters): 4.62%
• Previous state-of-the-art (SOTA): 4.2%
• Small DenseNet-BC (L=100, k=12, only 0.8M parameters): 4.5%
• Large DenseNet (L=250, k=24): 3.6%

Without data augmentation (C10), test error:
• Small ResNet-110: 11.26%
• Large ResNet-1001 (10.2M parameters): 10.56%
• Previous state-of-the-art (SOTA): 7.3%
• Small DenseNet-BC (L=100, k=12, only 0.8M parameters): 5.9%
• Large DenseNet (L=250, k=24): 4.2%
RESULTS ON CIFAR-100

[Bar charts: test error (%) with and without data augmentation for ResNet (110 layers, 1.7M), ResNet (1001 layers, 10.2M), DenseNet (100 layers, 0.8M) and DenseNet (250 layers, 15.3M), compared with the previous SOTA; the large DenseNet achieves the lowest error in both settings (17.6% with data augmentation).]
DETAILED RESULTS:

SVHN is the Street View House Numbers dataset. In the results table, blue marks the best result. DenseNet-BC does not achieve a better result than the basic DenseNet on SVHN; the authors argue that SVHN is a relatively easy task and extremely deep models may overfit the training set.
RESULTS ON IMAGENET

[Plots: ImageNet top-1 error (%) versus number of parameters (M) and versus GFLOPs for DenseNet-121/169/201/264 (including the k=48 variant) and ResNet-34/50/101/152; DenseNets reach lower error at comparable parameter counts and FLOPs.]

Top-1 error: 20.27%
Top-5 error: 5.17%
MULTI-SCALE DENSENET (Preview)

[Figure: intermediate classifiers 1–4 attached along the network; an example exits at the first classifier whose confidence exceeds the threshold (e.g. cat: 0.2, 0.4, 0.6; only 0.6 > threshold).]

Inference speed:
~ 2.6x faster than ResNets
~ 1.3x faster than DenseNets

[Figure: "easy" examples exit at early classifiers, while "hard" examples continue to later classifiers.]
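A minimal sketch of this early-exit idea (the stage and classifier modules, batch size of 1, and the threshold value are illustrative assumptions, not the actual Multi-Scale DenseNet implementation):

import torch

def early_exit_predict(x, stages, classifiers, threshold=0.5):
    # stages: per-scale feature extractors; classifiers: one classifier per stage.
    prediction = None
    for stage, clf in zip(stages, classifiers):
        x = stage(x)                           # compute features only up to this stage
        probs = torch.softmax(clf(x), dim=1)
        confidence, prediction = probs.max(dim=1)
        if confidence.item() > threshold:      # "easy" example: exit early (batch size 1 assumed)
            return prediction
    return prediction                          # "hard" example: fall through to the last classifier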
CONVOLUTIONAL NETWORKS

LeNet → AlexNet → VGG → Inception → ResNet
