introductioncse252a, fall 2009 computer vision i illumination variability an annoyance “the...

14
1 CSE252A, Fall 2009 Computer Vision I Introduction Computer Vision I CSE 252A Lecture 1 CSE252A, Fall 2009 Computer Vision I Some slides regarding UC and the teach-in What’s so special about UC? Dating from the time of the Master Plan, UC is a one‐of‐a‐kind offering of affordable, quality education at multiple world‐ranking research campuses. The top 50 universities listed in U.S. World and News Report have an average tuition cost of $28,321 per year. Of the 13 universities in the top 50 with tuition less than $10,000 per year, 6 are University of California campuses . UCB, UCSD, and UCLA are top in Washington Monthly’s ranking, based mainly on Social Mobility (recruiting and graduating low‐income students), and Research (cutting‐edge PhDs). Students of California, many from low‐income backgrounds, have been getting an amazing educational opportunity that is only available to the rich elsewhere The Public Good … Year 2007‐2007 California is shifting its priorities By the 2012‐2013 fiscal year, $15.4 billion will be spent on incarcerating Californians, as compared with $15.3 billion spent on higher education Read more: http://www.sfgate.com/cgi‐bin/article.cgi? f=/c/a/2007/05/29/EDGGTP3F291.DTL#ixzz0RsVxXqGm

Upload: others

Post on 17-Jul-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: IntroductionCSE252A, Fall 2009 Computer Vision I Illumination Variability An annoyance “The variations between the images of the same face due to illumination and viewing direction

1

CSE252A, Fall 2009 Computer Vision I

Introduction

Computer Vision I CSE 252A Lecture 1

CSE252A, Fall 2009 Computer Vision I

•  Some slides regarding UC and the teach-in

What’ssospecialaboutUC?DatingfromthetimeoftheMasterPlan,UCisaone‐of‐a‐kindofferingofaffordable,qualityeducationatmultipleworld‐rankingresearchcampuses.

Thetop50universitieslistedinU.S.WorldandNewsReporthaveanaveragetuitioncostof$28,321peryear.

Ofthe13universitiesinthetop50withtuitionlessthan$10,000peryear,6areUniversityofCaliforniacampuses.

UCB,UCSD,andUCLAaretopinWashingtonMonthly’sranking,basedmainlyonSocialMobility(recruitingandgraduatinglow‐incomestudents),andResearch(cutting‐edgePhDs).

StudentsofCalifornia,manyfromlow‐incomebackgrounds,havebeengettinganamazingeducationalopportunitythatisonlyavailabletotherichelsewhere

ThePublicGood…

Year2007‐2007

CaliforniaisshiftingitsprioritiesBythe2012‐2013fiscalyear,$15.4billionwillbespentonincarceratingCalifornians,ascomparedwith$15.3billionspentonhighereducation

Readmore:http://www.sfgate.com/cgi‐bin/article.cgi?f=/c/a/2007/05/29/EDGGTP3F291.DTL#ixzz0RsVxXqGm

Page 2: IntroductionCSE252A, Fall 2009 Computer Vision I Illumination Variability An annoyance “The variations between the images of the same face due to illumination and viewing direction

2

7

Historicalperspective•  In1970,UCreceivedabout7%ofthestate’sgeneralfundbudget.Today,it

hasfallentoroughly3%.

•  Stateper‐studentfundingforeducatingUCstudentshasfallenfrom$14,210in2000‐01to$10,370today(inflation‐adjusted).

•  From1984to2004,California'spopulationincreased35%,whilestatefundingforhighereducationdecreasedby9%.HighereducationistheonlymajorpartofCalifornia’sbudgetthatgrewmoreslowlythanpopulation.

•  Today,lessthan20%ofUC’sbudgetcomesfromtheStateofCalifornia.MorethanhalfofUC’sresearchexpenditurescomefromfederalsources,andfederalfundsrepresentnearly20%ofgrantaidreceivedbyUCstudents.

http://www.ucthewayforward.org/budgetfactsheet.pdf

AbouttheUCBudgetCrisis2009

AwomanisarrestedbyUCpoliceafterprotestingattheMissionBaycampusinSanFrancisco.Faculty,staffandstudentsareurgingasystemwidewalkoutSept.24,thefirstdayofclassesforthefallquarteratmanyUCcampuses.(PaulChinn/AssociatedPress/September16)

9

Whatcrisis?

•  2010‐11UCBudgetGap$632.6M

•  Equivalentto:–  EliminatingStatesupportfor2medium‐sized Campuses

•  Reducingenrollmentby57,500students•  ClosingUClibrariesandpublicserviceprograms•  Terminating8,300employees•  Eliminatingallcore‐fundedstudentfinancialaid

http://www.ucop.edu/budget/pres/2010‐11/F1‐BudgetUpdate‐sep09.pdf 10

Mid‐yearfeehike

Consequences

11

Studentfeeswillincreasedramaticallyby44%by2010(makingUChavefeesinrankofsemi‐privateschoolslikeUofM)

Consequences

StaffandfacultyatallUCcampusesgivenpaycutsfrom4to10%

Top‐flightresearchfacultywillbepoached

Youropportunitiestogainfirst‐handexperience(fromresearchtoperformanceart)withtopfacultywillbegreatlydiminished

Consequences

Page 3: IntroductionCSE252A, Fall 2009 Computer Vision I Illumination Variability An annoyance “The variations between the images of the same face due to illumination and viewing direction

3

13

Whathappenswhenfee’sgoup?

•  WhentheUniversityofMichiganmovedtoasemi‐privatizedmodel,itrelaxedadmissionsstandardstorecruitmoreoutofstatestudentwhopaymuchmoreintuition.

•  >50%ofMichigan’s2003freshmanclasscamefromfamilieswithsix‐figureincomesinastatewhereonly13%offamiliesearnthatmuch.

•  TheresulthasbeensignificantlydiminishedaccessfortheresidentsofMichigan,especiallythemostdisadvantaged,andareductioninthequalityoftheUniversityasseeninitsdropinrankingsbyU.S.NewsandWorldReport.

http://keepcaliforniaspromise.org/wp‐content/uploads/2009/09/Understanding‐the‐Crisis.pdf 14

Bottomlineforyou

•  UCisbecomingmoreprivateandlesspublic,whichisbadforACCESS(itwillbecomeaschoolfortherichandout‐of‐staters)

•  Youreducationalopportunitieswillbeimpoverished,andthiswillhaveanimmeasurablyprofoundimpactonyourfuture

15

Whatyoucando

•  Beinformed:http://keepcaliforniaspromise.org/

•  Stickupforyoureducationbyspreadingtheword

•  Votewhenthetimecomes!Lotsofstudentsdon’tvote,andyourvotedefinitelycounts…

•  Writeyourlegislator:http://www.leginfo.ca.gov/

CSE252A, Fall 2009 Computer Vision I

What is Computer Vision? •  Trucco and Verri: Computing properties of the 3-

D world from one or more digital images

•  Sockman and Shapiro: To make useful decisions about real physical objects and scenes based on sensed images

•  Ballard and Brown: The construction of explicit, meaningful description of physical objects from images.

•  Forsyth and Ponce (Text): Extracting descriptions of the world from pictures or sequences of pictures”

CSE252A, Fall 2009 Computer Vision I

Why is this hard?

What is in this image? 1.  A hand holding a man? 2.  A hand holding a mirrored sphere? 3.  An Escher drawing?

•  Interpretations are ambiguous •  The forward problem (graphics) is well-posed •  The “inverse problem” (vision) is not

CSE252A, Fall 2009 Computer Vision I

We all make mistakes “640K ought to be enough for anybody.” –

Bill Gates, 1981

“…” – Marvin Minsky

Page 4: IntroductionCSE252A, Fall 2009 Computer Vision I Illumination Variability An annoyance “The variations between the images of the same face due to illumination and viewing direction

4

CSE252A, Fall 2009 Computer Vision I

What do you see?

CSE252A, Fall 2009 Computer Vision I

What was happening

CSE252A, Fall 2009 Computer Vision I

Should Computer Vision follow from our understanding of Human Vision?

Yes & No

1.  Who would ever be crazy enough to even try creating machine vision? 2.  Human vision “works”, and copying is easier than creating. 3.  Secondary benefit – in trying to mimic human vision, we learn about it.

1.  Why limit oneself to human vision when there is even greater diversity in biological vision

2.  Why limit oneself to biological vision when there may be greater diversity in sensing mechanism?

3.  Biological vision systems evolved to provide functions for “specific” tasks and “specific” environments. These may differ for machine systems

4.  Implementation – hardware is different, and synthetic vision systems may use different techniques/methodologies that are more appropriate to computational mechanisms

CSE252A, Fall 2009 Computer Vision I

The Near Future: Ubiquitous Vision •  Five years from now, digital

cameras will cost 1 cent (sensor cost).

•  Digital video will be a widely available commodity component embedded in cell phones, PDA’s, doorbells, bridges, security systems, cars, etc.

•  99.9% of digitized video won’t be seen by a person.

•  That doesn’t mean that only 0.1% is important!

[ Written two years ago, seemed a bit far fetched ]

CSE252A, Fall 2009 Computer Vision I

Applications: touching your life •  Football •  Movies •  Surveillance •  HCI – hand gestures •  Aids to the blind •  Face recognition &

biometrics •  Road monitoring •  Industrial inspection •  Homeland security •  Virtual Earth; street view

•  Robotic control •  Autonomous driving •  Space: planetary

exploration, docking •  Medicine – pathology,

surgery, diagnosis •  Microscopy •  Military •  Remote Sensing •  Digital photography •  Video games

CSE252A, Fall 2009 Computer Vision I

Some Vision Problems •  Segmentation

–  Breaking images and video into meaningful pieces

•  Reconstructing the 3D world –  from multiple views –  from shading –  from structural models

•  Recognition –  What are the objects in a scene? –  What is happening in a video?

•  Video –  Understand movement and change in image sequence. –  Tracking objects

Page 5: IntroductionCSE252A, Fall 2009 Computer Vision I Illumination Variability An annoyance “The variations between the images of the same face due to illumination and viewing direction

5

CSE252A, Fall 2009 Computer Vision I

Related Fields •  Image Processing •  Computer Graphics •  Pattern Recognition •  Perception •  Robotics •  AI

CSE252A, Fall 2009 Computer Vision I

Image Interpretation - Cues •  Variation in appearance in multiple views

–  stereo –  motion

•  Shading & highlights •  Shadows •  Contours •  Texture •  Blur •  Geometric constraints •  Prior knowledge

CSE252A, Fall 2009 Computer Vision I

Computer Vision: Fiction or Fact

Biometrics segment

CSE252A, Fall 2009 Computer Vision I

Shading and lighting Shading as a result of differences in lighting is

1.  A source of information 2.  An annoyance

CSE252A, Fall 2009 Computer Vision I

Illumination Variability An annoyance

“The variations between the images of the same face due to illumination and viewing direction are almost always larger than image variations due to change in face identity.” -- Moses, Adini, Ullman, ECCV ‘94

CSE252A, Fall 2009 Computer Vision I

How do we understand shading (An idealization of “engineering” research)

1.  Construct a model of the domain (usually mathematical, based on physics).

2.  Prove properties of that model to better understand the model and opportunities of using it.

3.  Develop algorithms to solve a problem that is correct under the model.

4.  Implement & evaluate it. 5.  Question assumptions of the model & start

all over again.

Page 6: IntroductionCSE252A, Fall 2009 Computer Vision I Illumination Variability An annoyance “The variations between the images of the same face due to illumination and viewing direction

6

CSE252A, Fall 2009 Computer Vision I

1. Image Formation

At image location (x,y) the intensity of a pixel I(x,y) is

I(x,y) = a(x,y) n(x,y) s where •  a(x,y) is the albedo of the surface projecting to (x,y). •  n(x,y) is the unit surface normal. •  s is the direction and strength of the light source.

n s

.

a

I(x,y)

CSE252A, Fall 2009 Computer Vision I

x 1

x 2

2. A property: 3-D Linear subspace

The set of images of a Lambertian surface with no shadowing is a subset of 3-D linear subspace.

[Moses 93], [Nayar, Murase 96], [Shashua 97]

x n

L = {x | x = Bs, ∀s ∈R3 }

where B is a n by 3 matrix whose rows are product of the surface normal and Lambertian albedo

L L0

CSE252A, Fall 2009 Computer Vision I

3,4 : An implemented algorithm: Relighting

Single Light Source CSE252A, Fall 2009 Computer Vision I

3,4: An implemented algorithm Photometric Stereo

CSE252A, Fall 2009 Computer Vision I

5. Question Assumpions •  Many objects are not Lambertian (specular,

complex reflectance functions).

CSE252A, Fall 2009 Computer Vision I

The course •  Part 1: The Physics of Imaging •  Part 2: Early Vision •  Part 3: Reconstruction •  Part 4: Recognition

Page 7: IntroductionCSE252A, Fall 2009 Computer Vision I Illumination Variability An annoyance “The variations between the images of the same face due to illumination and viewing direction

7

CSE252A, Fall 2009 Computer Vision I

Part I of Course: The Physics of Imaging

•  How images are formed – Cameras

•  What a camera does •  How to tell where the camera was located

– Light •  How to measure light •  What happens to light at surfaces •  How the brightness values we see in images are

determined

– Color •  The underlying mechanisms of color •  How to describe it and measure it CSE252A, Fall 2009 Computer Vision I

Cameras, lenses, and sensors

From Computer Vision, Forsyth and Ponce, Prentice-Hall, 2002.

•  Pinhole cameras •  Lenses •  Projection models •  Geometric camera parameters

CSE252A, Fall 2009 Computer Vision I

A real camera … and its model

CSE252A, Fall 2009 Computer Vision I

Lighting & Photometry •  How does measurement relate to light

energy?

•  Sensor response •  Light sources •  Reflectance

CSE252A, Fall 2009 Computer Vision I

Color

CSE252A, Fall 2009 Computer Vision I

Part II: Early Vision in One Image •  Representing small patches of image

– For three reasons •  We wish to establish correspondence between (say)

points in different images, so we need to describe the neighborhood of the points

•  Sharp changes are important in practice --- known as “edges”

•  Representing texture by giving some statistics of the different kinds of small patch present in the texture.

–  Tigers have lots of bars, few spots –  Leopards are the other way

Page 8: IntroductionCSE252A, Fall 2009 Computer Vision I Illumination Variability An annoyance “The variations between the images of the same face due to illumination and viewing direction

8

CSE252A, Fall 2009 Computer Vision I

Segmentation •  Which image components “belong together”? •  Belong together ≅ lie on the same object •  Cues

–  similar color –  similar texture –  not separated by contour –  form a suggestive shape when assembled

CSE252A, Fall 2009 Computer Vision I

Boundary Detection

http://www.robots.ox.ac.uk/~vdg/dynamics.html

CSE252A, Fall 2009 Computer Vision I

Boundary Detection: Local cues

CSE252A, Fall 2009 Computer Vision I

Gradients

CSE252A, Fall 2009 Computer Vision I

Boundary Detection

Finding the Corpus Callosum

(G. Hamarneh, T. McInerney, D. Terzopoulos) CSE252A, Fall 2009 Computer Vision I

(Sharon, Balun, Brandt, Basri)

Page 9: IntroductionCSE252A, Fall 2009 Computer Vision I Illumination Variability An annoyance “The variations between the images of the same face due to illumination and viewing direction

9

CSE252A, Fall 2009 Computer Vision I CSE252A, Fall 2009 Computer Vision I

Part 3: Reconstruction from Multiple Images

•  Photometric Stereo – What we know about the world from lighting

changes. •  The geometry of multiple views •  Stereopsis

– What we know about the world from having two eyes

•  Structure from motion – What we know about the world from having

many eyes •  or, more commonly, our eyes moving.

CSE252A, Fall 2009 Computer Vision I

Mars Rover Spirit

From Viking

CSE252A, Fall 2009 Computer Vision I

Façade (Debevec, Taylor and Malik, 1996) Reconstruction from multiple views, constraints, rendering

Architectural modeling: •  photogrammetry; •  view-dependent texture mapping; •  model-based stereopsis.

Reprinted from “Modeling and Rendering Architecture from Photographs: A Hybrid Geometry- and Image-Based Approach,” By P. Debevec, C.J. Taylor, and J. Malik, Proc. SIGGRAPH (1996). © 1996 ACM, Inc. Included here by permission.

CSE252A, Fall 2009 Computer Vision I

Images with marked features

CSE252A, Fall 2009 Computer Vision I

Resulting model & Camera Positions

Page 10: IntroductionCSE252A, Fall 2009 Computer Vision I Illumination Variability An annoyance “The variations between the images of the same face due to illumination and viewing direction

10

CSE252A, Fall 2009 Computer Vision I

Recovered

Recovered model edges reprojected through recovered camera positions into the three original images

CSE252A, Fall 2009 Computer Vision I

UNI High Movie

CSE252A, Fall 2009 Computer Vision I

Video-Motion Analysis •  Where “things” are

moving in image –segmentation.

•  Determining observer motion (egomotion)

•  Determining scene structure

•  Tracking objects •  Understanding

activities & actions

CSE252A, Fall 2009 Computer Vision I

Forward Translation & Focus of Expansion [Gibson, 1950]

CSE252A, Fall 2009 Computer Vision I

Visual Tracking

Main Challenges 1.  3-D Pose Variation 2.  Occlusion of the target 3.  Illumination variation 4.  Camera jitter 5.  Expression variation

etc.

[ Ho, Lee, Kriegman ] CSE252A, Fall 2009 Computer Vision I

Visual Tracking •  State: usually a finite number of parameters (a

vector) that characterizes the “state” (e.g., location, size, pose, deformation of thing being tracked.

•  Dynamics: How does the state change over time? How is that changed constrained?

•  Representation: How do you represent the thing being tracked

•  Prediction: Given the state at time t-1, what is an estimate of the state at time t?

•  Correction: Given the predicted state at time t, and a measurement at time t, update the state.

Page 11: IntroductionCSE252A, Fall 2009 Computer Vision I Illumination Variability An annoyance “The variations between the images of the same face due to illumination and viewing direction

11

CSE252A, Fall 2009 Computer Vision I

Tracking

(www.brickstream.com) CSE252A, Fall 2009 Computer Vision I

Tracking

CSE252A, Fall 2009 Computer Vision I

Tracking

CSE252A, Fall 2009 Computer Vision I

Tracking

CSE252A, Fall 2009 Computer Vision I

Tracking

CSE252A, Fall 2009 Computer Vision I

Part 4: Recognition

Given a database of objects and an image determine what, if any of the objects are present in the image.

Page 12: IntroductionCSE252A, Fall 2009 Computer Vision I Illumination Variability An annoyance “The variations between the images of the same face due to illumination and viewing direction

12

CSE252A, Fall 2009 Computer Vision I

Recognition Challenges •  Within-class variability

–  Different objects within the class have different shapes or different material characteristics

–  Deformable –  Articulated –  Compositional

•  Pose variability: –  2-D Image transformation (translation, rotation, scale) –  3-D Pose Variability (perspective, orthographic projection)

•  Lighting –  Direction (multiple sources & type) –  Color –  Shadows

•  Occlusion – partial •  Clutter in background -> false positives

CSE252A, Fall 2009 Computer Vision I

Object Recognition Issues: •  How general is the problem?

–  2D vs. 3D –  range of viewing conditions –  available context –  segmentation cues

•  What sort of data is best suited to the problem? –  Whole images –  Local 2D features (color, texture, –  3D (range) data

•  What information do we have in the database? –  Collection of images? –  3-D models? –  Learned representation? –  Learned classifiers?

•  How many objects are involved? –  small: brute force search –  large: ??

CSE252A, Fall 2009 Computer Vision I

Recognition Example: Face Detection: Classify face vs. non-face

CSE252A, Fall 2009 Computer Vision I

Why is Face Recognition Hard? Many faces of Madona

CSE252A, Fall 2009 Computer Vision I

Face Recognition: 2-D and 3-D

2-D

Face Database

Time (video)

2-D

Recognition Data

3-D 3-D

Recognition Comparison

CSE252A, Fall 2009 Computer Vision I

Yale Face Database B

64 Lighting Conditions 9 Poses => 576 Images per Person

Page 13: IntroductionCSE252A, Fall 2009 Computer Vision I Illumination Variability An annoyance “The variations between the images of the same face due to illumination and viewing direction

13

CSE252A, Fall 2009 Computer Vision I

Real vs. Synthetic

Real

Synthetic

CSE252A, Fall 2009 Computer Vision I http://www.ri.cmu.edu/projects/project_271.html

CSE252A, Fall 2009 Computer Vision I

Model-Based Vision

•  Given 3-D models of each object •  Detect image features (often edges, line segments, conic sections) •  Establish correspondence between model &image features •  Estimate pose •  Consistency of projected model with image.

CSE252A, Fall 2009 Computer Vision I

Object Classes: Chairs

(Funkhauser, Min, Kazhdan, Chen, Halderman, Dobkin, Jacobs)

CSE252A, Fall 2009 Computer Vision I

Scene Interpretation

“The Swing” Fragonard, 1766

CSE252A, Fall 2009 Computer Vision I

The Syllabus

Page 14: IntroductionCSE252A, Fall 2009 Computer Vision I Illumination Variability An annoyance “The variations between the images of the same face due to illumination and viewing direction

14

CSE252A, Fall 2009 Computer Vision I

cse252a: early vision and recognition - Cameras - Human Vision - Photometry (radiance, irradiance, BRDF) - Illumination cones - Shape from Shading, Photometric Stereo - Curves & Surfaces - Color - Filtering - Edges & Features - Stereo Matching - Optical Flow and Motion - Tracking - Statistical pattern recognition (Bayes, SVM, Kernel methods) - Object Recognition - Behavior Recognition (HMM's)

CSE252A, Fall 2009 Computer Vision I

cse252b: Multiview Geometry & Segmentation

- Multiview Geometry - Affine Structure from Motion - Projective Structure from Motion - Robust F-matrix estimation - Image Segmentation - Texture: Synthesis, Recognition, Shape-from - Motion Segmentation - Object Detection - Image Registration - Image Based Rendering

CSE252A, Fall 2009 Computer Vision I

Announcements •  Class Web Page is up:

–  http://www.cs.ucsd.edu/classes/fa09/cse252a/ •  Assignment 0: “Getting Started with

Matlab” to be posted to web page. •  Read Chapters 1 & 2 of Forsyth & Ponce