
CSC411 Tutorial #5 Neural Networks

February 26, 2016

Boris Ivanovic*

[email protected]

*Based on the lectures given by Professor Sanja Fidler and the prev. tutorial by Yujia Li.

Outline for Today

• Neural Networks

• Questions

Neural Networks

High-Level Overview

• A Neural Network is (generally) comprised of:

– Neurons, which pass input values through functions and output the result

– Weights, which carry values between neurons

• We group neurons into layers. There are 3 main types of layers (a minimal sketch in code follows this list):

– Input Layer

– Hidden Layer(s)

– Output Layer
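As a rough sketch of this structure in NumPy (the layer sizes and values below are arbitrary examples, not taken from the slides): each layer is a vector of neuron values, and the weights connecting two layers form a matrix.

```python
import numpy as np

# Hypothetical layer sizes for illustration: 3 inputs, 4 hidden neurons, 2 outputs.
n_input, n_hidden, n_output = 3, 4, 2

# Weights "carry values between neurons": one matrix (and bias vector)
# per pair of adjacent layers.
W1 = np.random.randn(n_hidden, n_input) * 0.1   # input  -> hidden
b1 = np.zeros(n_hidden)
W2 = np.random.randn(n_output, n_hidden) * 0.1  # hidden -> output
b2 = np.zeros(n_output)

x = np.array([1.0, 0.5, -0.2])   # input layer: just the raw input values
h = np.tanh(W1 @ x + b1)         # hidden layer: neurons apply a function
y = W2 @ h + b2                  # output layer
print(h, y)
```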

High-Level Overview

Neuron Breakdown

[Figure: diagram of a single neuron, with weighted inputs summed together with a bias b and passed through an activation function]
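A minimal sketch of what a single neuron computes (the weight and input values below are made up for illustration): a weighted sum of its inputs plus the bias b, passed through an activation function.

```python
import numpy as np

def neuron(x, w, b, activation=np.tanh):
    """One neuron: weighted sum of inputs plus bias, passed through an activation."""
    return activation(np.dot(w, x) + b)

x = np.array([0.5, -1.0, 2.0])   # example inputs
w = np.array([0.1, 0.4, -0.3])   # one weight per input
b = 0.2                          # the bias term "b" from the diagram
print(neuron(x, w, b))
```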

Activation Functions

[Figure: plots of common activation functions; one is annotated "Most popular recently for deep learning"]
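Which functions were plotted on the slide is not in the extracted text; a sketch of the standard choices is below, with ReLU being the one usually singled out as the recent favourite for deep learning.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def tanh(z):
    return np.tanh(z)

def relu(z):
    # Commonly the one described as "most popular recently for deep learning"
    return np.maximum(0.0, z)

z = np.linspace(-3, 3, 7)
print(sigmoid(z), tanh(z), relu(z), sep="\n")
```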

Representation Power

What does this mean?

• Neural Networks are POWERFUL; that is exactly why, with recent increases in computing power, there has been renewed interest in them.

BUT

• “With great power comes great overfitting.”

– Boris Ivanovic, 2016

• The “20 hidden neurons” fit on the last slide is an example of this.

How to mitigate this?

• Stay Tuned!

• First, how do we even use or train neural networks?

Training Neural Networks (Key Idea)

• Minimize a loss function over the training data: typically squared error (Regression) or cross-entropy (Classification).
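The exact formulas shown on the slide are not in the extracted text; a sketch of the two usual losses, assuming those standard forms, is below.

```python
import numpy as np

def squared_error(y, t):
    """Regression loss: 0.5 * sum of squared differences between predictions and targets."""
    return 0.5 * np.sum((y - t) ** 2)

def cross_entropy(y, t, eps=1e-12):
    """Binary classification loss; y are predicted probabilities, t are 0/1 targets."""
    y = np.clip(y, eps, 1.0 - eps)
    return -np.sum(t * np.log(y) + (1.0 - t) * np.log(1.0 - y))

y = np.array([0.9, 0.2, 0.7])   # example predictions
t = np.array([1.0, 0.0, 1.0])   # example targets
print(squared_error(y, t), cross_entropy(y, t))
```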

Training Compared to Other Models

• Training Neural Networks is a NON-CONVEX OPTIMIZATION PROBLEM.

• This means we can run into many local optima during training.

[Figure: error plotted against a single weight, showing multiple local optima]

Training Neural Networks (Implementation)

• We need to first perform a forward pass

• Then, we update weights with a backward pass

Forward Pass (AKA “Inference”)
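A minimal forward-pass sketch for a one-hidden-layer network (the architecture, sigmoid units, and values are assumptions for illustration, not the ones from the slide):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def forward(x, W1, b1, W2, b2):
    """Propagate an input through the network; return hidden and output activations."""
    h = sigmoid(W1 @ x + b1)   # hidden layer
    y = sigmoid(W2 @ h + b2)   # output layer
    return h, y

rng = np.random.default_rng(0)
W1, b1 = rng.normal(size=(4, 3)) * 0.1, np.zeros(4)
W2, b2 = rng.normal(size=(1, 4)) * 0.1, np.zeros(1)
h, y = forward(np.array([1.0, 0.5, -0.2]), W1, b1, W2, b2)
print(y)
```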

Backward Pass (AKA “Backprop.”)

Learning Weights during Backprop

• Do exactly what we’ve been doing!

• Take the derivative of the error/cost/loss function w.r.t. the weights and minimize via gradient descent!
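A sketch of one gradient-descent step via backpropagation for the same kind of one-hidden-layer network (sigmoid units and squared-error loss are assumptions made here for illustration):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_step(x, t, W1, b1, W2, b2, lr=0.1):
    # Forward pass
    h = sigmoid(W1 @ x + b1)
    y = sigmoid(W2 @ h + b2)

    # Backward pass: derivative of 0.5*(y - t)^2 w.r.t. each weight,
    # using sigmoid'(z) = sigmoid(z) * (1 - sigmoid(z)).
    delta_out = (y - t) * y * (1.0 - y)             # error signal at the output
    delta_hid = (W2.T @ delta_out) * h * (1.0 - h)  # error signal at the hidden layer

    # Gradient-descent update
    W2 -= lr * np.outer(delta_out, h)
    b2 -= lr * delta_out
    W1 -= lr * np.outer(delta_hid, x)
    b1 -= lr * delta_hid
    return W1, b1, W2, b2

rng = np.random.default_rng(0)
W1, b1 = rng.normal(size=(4, 3)) * 0.1, np.zeros(4)
W2, b2 = rng.normal(size=(1, 4)) * 0.1, np.zeros(1)
x, t = np.array([1.0, 0.5, -0.2]), np.array([1.0])
W1, b1, W2, b2 = train_step(x, t, W1, b1, W2, b2)
```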

Useful Derivatives
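Which derivatives the slide tabulated is not in the extracted text; the standard activation derivatives used in the backward pass above are sketched here.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def dsigmoid(z):
    s = sigmoid(z)
    return s * (1.0 - s)          # sigma'(z) = sigma(z) * (1 - sigma(z))

def dtanh(z):
    return 1.0 - np.tanh(z) ** 2  # tanh'(z) = 1 - tanh(z)^2

def drelu(z):
    return (z > 0).astype(float)  # ReLU'(z) = 1 if z > 0, else 0
```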

On Board Training Example

Similar to what you’ve seen in lecture

Preventing Overfitting

Limiting the Size of the Weights

The Effects of Weight-Decay
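A sketch of what limiting the size of the weights usually looks like in practice: an L2 (weight-decay) penalty added to the loss, which appears as an extra shrinkage term in the gradient update. The penalty strength below is an arbitrary example value.

```python
import numpy as np

def weight_decay_update(W, grad_W, lr=0.1, decay=1e-3):
    """Gradient step for a loss of the form E + 0.5 * decay * sum(W**2):
    the penalty's gradient is decay * W, which pulls every weight toward zero."""
    return W - lr * (grad_W + decay * W)

W = np.array([[2.0, -3.0], [0.5, 1.5]])
grad_W = np.zeros_like(W)              # even with zero loss gradient...
print(weight_decay_update(W, grad_W))  # ...the weights shrink slightly toward 0
```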

Early Stopping

Why Early Stopping Works
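A sketch of the early-stopping idea (the loop structure and patience value are illustrative, not from the slides): keep training while the validation error keeps improving, and stop once it starts getting worse.

```python
def train_with_early_stopping(train_step, validation_error, max_epochs=1000, patience=5):
    """Stop when the validation error has not improved for `patience` epochs.
    `train_step` runs one epoch of training; `validation_error` evaluates held-out data."""
    best_error, epochs_without_improvement = float("inf"), 0
    for epoch in range(max_epochs):
        train_step()                # one pass of gradient descent over the training set
        err = validation_error()    # error on held-out data, NOT the training set
        if err < best_error:
            best_error, epochs_without_improvement = err, 0
        else:
            epochs_without_improvement += 1
            if epochs_without_improvement >= patience:
                break               # validation error stopped improving: overfitting begins
    return best_error
```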

Let’s make our own Neural Network

Design a Neural Network that takes in two inputs and performs the binary OR operation on them.
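One possible answer, sketched below (the specific weights and threshold are just one choice among many): a single neuron with weight 1 on each input, a bias of -0.5, and a step activation outputs 1 exactly when at least one input is 1.

```python
import numpy as np

def or_network(x1, x2):
    """A single-neuron 'network' computing binary OR."""
    w = np.array([1.0, 1.0])   # one weight per input
    b = -0.5                   # the weighted sum must exceed 0.5 to fire
    z = w @ np.array([x1, x2]) + b
    return 1 if z > 0 else 0   # step activation

for x1 in (0, 1):
    for x2 in (0, 1):
        print(x1, x2, "->", or_network(x1, x2))
```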

Superb Convolutional Neural Network Visualization

• http://scs.ryerson.ca/~aharley/vis/conv/

Questions