foundations & core in computer vision: a system perspective ce liu microsoft research new...

7
Foundations & Core in Computer Vision: A System Perspective Ce Liu Microsoft Research New England

Upload: damian-tyler

Post on 17-Dec-2015

230 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Foundations & Core in Computer Vision: A System Perspective Ce Liu Microsoft Research New England

Foundations & Core in Computer Vision: A System Perspective

Ce Liu

Microsoft Research New England

Page 2: Foundations & Core in Computer Vision: A System Perspective Ce Liu Microsoft Research New England

Vision vs. Learning

• Computer vision: visual application of machine learning?

• Data features algorithms data

• ML: design algorithms given input and output data

• CV: find the best input and output data given available algorithms

Page 3: Foundations & Core in Computer Vision: A System Perspective Ce Liu Microsoft Research New England

Theoretical vs. Experimental

• Theoretical analysis of a visual system– Best & worst cases

– Average performance

• Theoretical analysis is challenging as many visual distributions are hard to model (signal processing: 2nd order processes, machine learning: exponential families)

• Experimental approach: full spectrum of system performance as a function of the amount of data, annotation, number of categories, noise, and other conditions

Page 4: Foundations & Core in Computer Vision: A System Perspective Ce Liu Microsoft Research New England

Quality vs. Speed

• HD videos, billions of images to index• Real time & 90% vs. one hour per frame & 95%?• Mechanism to balance quality and speed in modeling

Page 5: Foundations & Core in Computer Vision: A System Perspective Ce Liu Microsoft Research New England

Automatic vs. semi-automatic

• Common review feedback: parameters are hand-tuned; not clear how to set the parameters

• Vision system user feedback: I don’t know how to tweak parameters!

• Computer-oriented vs. human-oriented representations

• Human-in-the-loop (collaborative) vision– How to optimally use humans (what, which and how

accurate) beyond traditional active learning

– Model design by crowd-sourcing

– Learning by subtraction

Page 6: Foundations & Core in Computer Vision: A System Perspective Ce Liu Microsoft Research New England

Algorithms vs. Sensors

• Two approaches to solving a vision problem– Look at images, design algorithms, experiment, improve…

– Look at cameras, design new/better sensors, …

• Cameras for full-spectrum, high res, low noise, depth, motion, occluding boundary, object, …

• What’s the optimal sensor/device for solving a vision problem?

• What’s the limit of sensors?

Page 7: Foundations & Core in Computer Vision: A System Perspective Ce Liu Microsoft Research New England

Thank you!

Ce Liu

Microsoft Research New England