Deep Learning: a theoretical introduction – Episode 2
Deep Learning and TensorFlow – The Quest for Deeper Networks
Università degli Studi di Pavia
Feed-Forward Neural Network
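As a running reference for the rest of the episode, a minimal NumPy sketch of the forward pass of a feed-forward network, alternating affine maps and a nonlinearity. The layer widths and the sigmoid choice are illustrative assumptions, not taken from the slides.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def forward(x, weights, biases):
    """Forward pass: at each layer, a <- g(W a + b)."""
    a = x
    for W, b in zip(weights, biases):
        a = sigmoid(W @ a + b)
    return a

rng = np.random.default_rng(0)
sizes = [4, 8, 3]                                   # illustrative widths
weights = [rng.normal(size=(m, n)) for n, m in zip(sizes[:-1], sizes[1:])]
biases = [np.zeros(m) for m in sizes[1:]]
y = forward(rng.normal(size=sizes[0]), weights, biases)   # output, shape (3,)
```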
Training Feed-Forward Neural Networks
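A compact sketch of training by backpropagation plus gradient descent, here on XOR with a 2-4-1 sigmoid network and squared error. All sizes, the seed, and the learning rate are illustrative; convergence may need a different seed or more steps.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# XOR: the classic small dataset needing a hidden layer (illustrative)
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
Y = np.array([[0], [1], [1], [0]], dtype=float)

rng = np.random.default_rng(1)
W1, b1 = rng.normal(size=(2, 4)), np.zeros(4)       # 2 -> 4 hidden units
W2, b2 = rng.normal(size=(4, 1)), np.zeros(1)       # 4 -> 1 output
lr = 1.0                                            # illustrative step size

for step in range(5000):
    h = sigmoid(X @ W1 + b1)                        # forward pass
    y = sigmoid(h @ W2 + b2)
    dy = (y - Y) * y * (1 - y)                      # backprop: dL/dz_out
    dh = (dy @ W2.T) * h * (1 - h)                  # dL/dz_hidden (chain rule)
    W2 -= lr * h.T @ dy;  b2 -= lr * dy.sum(axis=0) # gradient-descent update
    W1 -= lr * X.T @ dh;  b1 -= lr * dh.sum(axis=0)
```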
The Quest for Deeper Networks
Shallow vs. Deep Feed-Forward Neural Networks
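One way to make the comparison concrete in the course's own framework: two tf.keras models, one wide and shallow, one narrow and deep. The widths are arbitrary placeholders; the point developed in the following sections is that depth tends to buy expressive power more cheaply than width.

```python
import tensorflow as tf

def mlp(widths, input_dim=32):
    """A plain feed-forward net with the given hidden widths."""
    return tf.keras.Sequential(
        [tf.keras.Input(shape=(input_dim,))]
        + [tf.keras.layers.Dense(w, activation="relu") for w in widths]
        + [tf.keras.layers.Dense(1)]
    )

shallow = mlp([512])            # one wide hidden layer
deep    = mlp([64, 64, 64])     # several narrow hidden layers
print(shallow.count_params())   # ~17k parameters
print(deep.count_params())      # ~10k parameters
```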
Parity Circuits
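Parity is the classic depth example: a balanced tree of XOR gates computes n-bit parity with n − 1 gates and depth O(log n), whereas any depth-2 (DNF) circuit needs one AND term per odd-weight input, i.e. exponentially many. A sketch of the deep circuit (the function name is mine, not from the slides):

```python
def parity(bits):
    """n-bit parity as a balanced tree of XOR gates:
    n - 1 gates, depth O(log n) - the 'deep' circuit."""
    layer = list(bits)
    while len(layer) > 1:
        if len(layer) % 2:                 # pad an odd layer
            layer.append(0)
        layer = [a ^ b for a, b in zip(layer[::2], layer[1::2])]
    return layer[0]

assert parity([1, 0, 1, 1]) == 1           # three ones -> odd parity
# By contrast, a depth-2 (DNF) circuit for n-bit parity needs 2**(n-1)
# AND terms, one per odd-weight input - exponential in n.
```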
Depth and piecewise linear functions
[lost formula: the slides bound the maximal number of linear pieces p_max of a piecewise-linear network in terms of its width k, depth h, and input dimension d, stated for k > 2; the exact expression did not survive extraction]
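The slide's formula is lost, but the phenomenon it bounds can be reproduced directly. A minimal NumPy sketch (the tent map and the depth k = 5 are illustrative choices, not from the slides): composing a two-ReLU "tent" map with itself h times yields a sawtooth with 2^h linear pieces, so the number of pieces grows exponentially with depth while the number of units grows only linearly.

```python
import numpy as np

def tent(x):
    """Tent map on [0, 1], realizable with two ReLUs:
    2*relu(x) - 4*relu(x - 0.5)."""
    return 2 * np.maximum(x, 0) - 4 * np.maximum(x - 0.5, 0)

k = 5                                      # illustrative depth
x = np.linspace(0.0, 1.0, 2**12 + 1)       # dyadic grid -> exact breakpoints
y = x.copy()
for _ in range(k):                         # compose the tent map k times
    y = tent(y)

# the sawtooth alternates up/down, so slope sign changes count its pieces
slopes = np.diff(y) / np.diff(x)
pieces = 1 + np.count_nonzero(np.diff(np.sign(slopes)))
print(pieces)                              # 2**k = 32 linear pieces
```

Matching this with a single hidden layer would need on the order of one unit per piece, which is the depth/width asymmetry the lost bound expresses.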
[19]Deep Learning: a theoretical introduction – Episode 2
About why they did not useDeep Networks
from the beginning
Problem: vanishing or exploding Gradients
[lost bullets: the slides expand the gradient at depth k as a product of terms involving the activation derivative g′ and the weight matrices W(i); only the symbols g, W(i) and k survived extraction]
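A small NumPy experiment makes the product structure visible (width, depth, and the weight scale are illustrative assumptions): with sigmoid activations each layer multiplies the Jacobian by diag(g′(z)) W, and since g′ ≤ 1/4 the norm collapses geometrically; with much larger weights it explodes instead.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(0)
width, depth = 50, 30                       # illustrative sizes
x = rng.normal(size=width)
J = np.eye(width)                           # Jacobian accumulated so far

for i in range(depth):
    W = rng.normal(scale=1.0 / np.sqrt(width), size=(width, width))
    x = sigmoid(W @ x)
    # chain rule: each layer contributes a factor diag(g'(z)) W
    J = np.diag(x * (1 - x)) @ W @ J
    if i % 5 == 4:
        print(i + 1, np.linalg.norm(J))     # norm shrinks geometrically
```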
Problem: initial values of the parameters
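A standard remedy the title points at is a variance-preserving initial scale. A sketch of Glorot/Xavier initialization; the fan-based limit is the published formula, while its use here as the slide's specific choice is an assumption.

```python
import numpy as np

def glorot_uniform(fan_in, fan_out, rng):
    """Glorot/Xavier initialization: sampling in [-limit, limit] with
    limit = sqrt(6 / (fan_in + fan_out)) keeps activation and gradient
    variances roughly constant from layer to layer."""
    limit = np.sqrt(6.0 / (fan_in + fan_out))
    return rng.uniform(-limit, limit, size=(fan_out, fan_in))

W1 = glorot_uniform(784, 256, np.random.default_rng(0))
```

tf.keras applies this scheme by default: glorot_uniform is the standard kernel initializer of Dense layers.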
A bag of wonderful tricks
Why ReLU is better (sometimes)
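One concrete sense in which ReLU is better: its derivative is exactly 1 wherever the unit is active, so deep products of layer Jacobians do not shrink the way saturating sigmoids force them to. A small numeric illustration (the sample points are arbitrary):

```python
import numpy as np

z = np.linspace(-6, 6, 7)
s = 1 / (1 + np.exp(-z))
d_sigmoid = s * (1 - s)            # never above 0.25, ~0 once |z| is large
d_relu = (z > 0).astype(float)     # exactly 1 on the whole active half-line
print(d_sigmoid.round(4))   # [0.0025 0.0177 0.105 0.25 0.105 0.0177 0.0025]
print(d_relu)               # [0. 0. 0. 0. 1. 1. 1.]
```

The "sometimes" in the title is real: where z < 0 the ReLU derivative is exactly 0, so units can die and stop learning.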
Overfitting
Dropout
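A minimal sketch of inverted dropout, the variant used by modern libraries (the function name and the rate are illustrative):

```python
import numpy as np

def dropout(a, rate, rng, training=True):
    """Inverted dropout: during training, zero each unit with probability
    `rate` and rescale the survivors by 1/(1-rate), so the expected
    activation is unchanged and test time needs no correction."""
    if not training:
        return a
    mask = rng.random(a.shape) >= rate
    return a * mask / (1.0 - rate)

rng = np.random.default_rng(0)
h = np.ones(10)
print(dropout(h, 0.5, rng))        # ~half zeros, survivors scaled to 2.0
```

tf.keras.layers.Dropout(rate) implements the same inverted scheme and is active only while training.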
Counteracting Overfitting
Improving on MBGD (Mini-Batch Gradient Descent)
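A first refinement of plain mini-batch gradient descent is momentum: accumulate an exponentially weighted average of past gradients so that consistent directions speed up and oscillations damp out. A one-step sketch (hyperparameter values are illustrative):

```python
import numpy as np

def momentum_step(w, grad, v, lr=0.01, beta=0.9):
    """SGD with momentum: v is an exponentially weighted average of past
    (negative) gradients; consistent directions accumulate, oscillating
    ones cancel."""
    v = beta * v - lr * grad
    return w + v, v
```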
AdaGrad
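The equations on these slides are lost, but the published AdaGrad update is standard: keep a per-coordinate sum of squared gradients and divide the step by its square root, so frequently-updated coordinates get smaller steps and rare ones larger steps. A sketch:

```python
import numpy as np

def adagrad_step(w, grad, accum, lr=0.01, eps=1e-8):
    """AdaGrad: accumulate squared gradients per coordinate and divide the
    step by the root of the accumulator, giving rarely-updated coordinates
    larger effective learning rates."""
    accum = accum + grad ** 2
    w = w - lr * grad / (np.sqrt(accum) + eps)
    return w, accum
```

Because the accumulator only ever grows, the effective step size decays towards zero; that is the defect AdaDelta addresses next.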
AdaDelta
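AdaDelta (Zeiler, 2012) replaces AdaGrad's ever-growing sum with decaying averages of squared gradients and squared updates, removing the global learning rate entirely. A sketch of the published update:

```python
import numpy as np

def adadelta_step(w, grad, eg2, ed2, rho=0.95, eps=1e-6):
    """AdaDelta (Zeiler, 2012): decaying averages of squared gradients (eg2)
    and squared updates (ed2) replace AdaGrad's growing sum; the ratio of
    their RMS values sets the step, so no global learning rate appears."""
    eg2 = rho * eg2 + (1 - rho) * grad ** 2
    delta = -np.sqrt(ed2 + eps) / np.sqrt(eg2 + eps) * grad
    ed2 = rho * ed2 + (1 - rho) * delta ** 2
    return w + delta, eg2, ed2
```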
Improving on MBGD
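All of the update rules above ship with tf.keras, so in the course's own framework the choice reduces to one line (the learning-rate values are placeholders):

```python
import tensorflow as tf

# the update rules discussed above, as shipped with tf.keras
optimizers = {
    "mbgd":     tf.keras.optimizers.SGD(learning_rate=0.01),
    "momentum": tf.keras.optimizers.SGD(learning_rate=0.01, momentum=0.9),
    "adagrad":  tf.keras.optimizers.Adagrad(learning_rate=0.01),
    "adadelta": tf.keras.optimizers.Adadelta(rho=0.95),
}
# model.compile(optimizer=optimizers["adadelta"], loss="mse")
```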
An aside: function approximation vs. classification
Classification: Softmax
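The derivation on these slides is lost; the standard construction turns the output layer's scores z into a probability distribution and pairs it with the cross-entropy loss, whose gradient simplifies to p − onehot. A numerically stable NumPy sketch (the example scores are arbitrary):

```python
import numpy as np

def softmax(z):
    """Stable softmax: subtracting max(z) cancels in the ratio but
    prevents overflow in exp."""
    e = np.exp(z - z.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

z = np.array([2.0, 1.0, 0.1])          # arbitrary example scores
p = softmax(z)                         # [0.659 0.242 0.099], sums to 1
loss = -np.log(p[0])                   # cross-entropy for true class 0
grad = p - np.eye(3)[0]                # dL/dz = p - onehot(true class)
```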
Another aside: autoencoders
Auto-encoders
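A minimal tf.keras auto-encoder sketch: the network is trained to reproduce its own input through a narrow bottleneck, so the code layer learns a compressed representation. The 784/32 dimensions are illustrative (e.g. MNIST-sized images), not from the slides.

```python
import tensorflow as tf

# illustrative sizes: 784-dimensional input (e.g. MNIST), 32-unit code
inputs  = tf.keras.Input(shape=(784,))
code    = tf.keras.layers.Dense(32, activation="relu")(inputs)       # encoder
outputs = tf.keras.layers.Dense(784, activation="sigmoid")(code)     # decoder
autoencoder = tf.keras.Model(inputs, outputs)
autoencoder.compile(optimizer="sgd", loss="binary_crossentropy")
# autoencoder.fit(x, x, ...)   # the target is the input itself
```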