fourier analysis of video signals & frequency response of...

34
Fourier Analysis of Video Signals & Frequency Response of the HVS & Frequency Response of the HVS Yao Wang Polytechnic Institute of NYU Brooklyn NY11201 Polytechnic Institute of NYU, Brooklyn, NY11201

Upload: others

Post on 25-Aug-2020

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Fourier Analysis of Video Signals & Frequency Response of ...eeweb.poly.edu/~yao/EL6123old/FT_sampling_noCSFT.pdf · (degree) 180 2arctan( /2 )(radian) ... Figure 2.5 The temporal

Fourier Analysis of Video Signals& Frequency Response of the HVS& Frequency Response of the HVS

Yao WangPolytechnic Institute of NYU Brooklyn NY11201Polytechnic Institute of NYU, Brooklyn, NY11201

Page 2: Fourier Analysis of Video Signals & Frequency Response of ...eeweb.poly.edu/~yao/EL6123old/FT_sampling_noCSFT.pdf · (degree) 180 2arctan( /2 )(radian) ... Figure 2.5 The temporal

Outline

• Fourier transform over multidimensional space (review on yourFourier transform over multidimensional space (review on your own)

• Frequency domain characterization of video signalsFrequency domain characterization of video signals

• Frequency response of the HVS

• Video sampling – a brief discussion

Frequency Domain Analysis

Page 3: Fourier Analysis of Video Signals & Frequency Response of ...eeweb.poly.edu/~yao/EL6123old/FT_sampling_noCSFT.pdf · (degree) 180 2arctan( /2 )(radian) ... Figure 2.5 The temporal

Frequency domain characterization of video signalsvideo signals

• Spatial frequency• Spatial frequency• Temporal frequency• Temporal frequency caused by motionTemporal frequency caused by motion

© Yao Wang, 2003 Frequency Domain Analysis

Page 4: Fourier Analysis of Video Signals & Frequency Response of ...eeweb.poly.edu/~yao/EL6123old/FT_sampling_noCSFT.pdf · (degree) 180 2arctan( /2 )(radian) ... Figure 2.5 The temporal

Spatial Frequencyp q y

• Spatial frequency measures how fast the image• Spatial frequency measures how fast the image intensity changes in the image plane

• Spatial frequency can be completely characterized by p q y p y ythe variation frequencies in two orthogonal directions(e.g horizontal and vertical)

f : cycles/horizontal unit distance– fx: cycles/horizontal unit distance– fy : cycles/vertical unit distance

• It can also be specified by magnitude and angle of change

)/arctan(22 fffff

© Yao Wang, 2003 Frequency Domain Analysis 4

)/arctan(, xyyxs fffff

Page 5: Fourier Analysis of Video Signals & Frequency Response of ...eeweb.poly.edu/~yao/EL6123old/FT_sampling_noCSFT.pdf · (degree) 180 2arctan( /2 )(radian) ... Figure 2.5 The temporal

Illustration of Spatial Frequencyp q y

)2arctan(,125 sf

© Yao Wang, 2003 Frequency Domain Analysis 5

Page 6: Fourier Analysis of Video Signals & Frequency Response of ...eeweb.poly.edu/~yao/EL6123old/FT_sampling_noCSFT.pdf · (degree) 180 2arctan( /2 )(radian) ... Figure 2.5 The temporal

Angular Frequencyg q y

Previously defined spatial frequency (cycles per pixel) depends on viewing distance.y p q y ( y p p ) p gAngular frequency is what matters to the eye!

(degree)180n)h/2d(radia2(radian))2/arctan(2dhdh

© Yao Wang, 2003 Frequency Domain Analysis 6

ee)cycle/degr(f180

ff ss

hd

Page 7: Fourier Analysis of Video Signals & Frequency Response of ...eeweb.poly.edu/~yao/EL6123old/FT_sampling_noCSFT.pdf · (degree) 180 2arctan( /2 )(radian) ... Figure 2.5 The temporal

Temporal Frequencyp q y

• Temporal frequency measures temporal variation• Temporal frequency measures temporal variation (cycles/s)

• In a video, the temporal frequency is actually, p q y y2-dimensional; each point in space has its own temporal frequencyNon zero temporal frequency can be caused by• Non-zero temporal frequency can be caused by camera or object motion

• Start simple: single object with constant velocity

From © Yao Wang, 2003 Frequency Domain Analysis 7

Page 8: Fourier Analysis of Video Signals & Frequency Response of ...eeweb.poly.edu/~yao/EL6123old/FT_sampling_noCSFT.pdf · (degree) 180 2arctan( /2 )(radian) ... Figure 2.5 The temporal

Temporal Frequency caused by Linear Motioncaused by Linear Motion

© Yao Wang, 2003 Frequency Domain Analysis 8

Page 9: Fourier Analysis of Video Signals & Frequency Response of ...eeweb.poly.edu/~yao/EL6123old/FT_sampling_noCSFT.pdf · (degree) 180 2arctan( /2 )(radian) ... Figure 2.5 The temporal

Relation between Motion, Spatial and Temporal FrequencyTemporal Frequency

0atpatternimagetheAssume)(speedwithmovingobjectanConsider tvv

),(),,( 0 yx tvytvxtyx

is at timepattern image the),,( is

0at pattern imagetheAssume ).,(speedwith movingobject an Consider

0 tyx

tvv yx

)(),(),,( 0 yyxxtyxtyx

y

fvfvffffff

)(:frequency temporaland spatial, motion,between Relation

yyxxt fvfvf

The temporal frequency of the image of a moving object depends on motion as well as the spatial frequency of the object.

© Yao Wang, 2003 Frequency Domain Analysis 9

Example: A plane with vertical bar pattern, moving vertically, causes no temporal change; But moving horizontally, it causes fastest temporal change

Page 10: Fourier Analysis of Video Signals & Frequency Response of ...eeweb.poly.edu/~yao/EL6123old/FT_sampling_noCSFT.pdf · (degree) 180 2arctan( /2 )(radian) ... Figure 2.5 The temporal

Illustration of the Relation

© Yao Wang, 2003 Frequency Domain Analysis 10

Page 11: Fourier Analysis of Video Signals & Frequency Response of ...eeweb.poly.edu/~yao/EL6123old/FT_sampling_noCSFT.pdf · (degree) 180 2arctan( /2 )(radian) ... Figure 2.5 The temporal

Video in Frequency Domainq y

• Spatio temporal frequency domain is pretty empty• Spatio-temporal frequency domain is pretty empty• Individual objects are either still (ie, defined only for

zero temporal frequency) or moving at (nearly) p q y) g ( y)constant velocity (ie, defined only on a plane in the 3d frequency-domain)

© Yao Wang, 2003 Frequency Domain Analysis 11

Page 12: Fourier Analysis of Video Signals & Frequency Response of ...eeweb.poly.edu/~yao/EL6123old/FT_sampling_noCSFT.pdf · (degree) 180 2arctan( /2 )(radian) ... Figure 2.5 The temporal

Frequency response of the HVSq y p

• Temporal frequency response and flicker• Temporal frequency response and flicker ftmBt 2cos1

• Spatial frequency response xfmBtyx 2cos1,,

• Spatio-temporal response

fy ,,

ff

S th it t

tfxfmBtyx tx 2cos2cos1,,

© Yao Wang, 2003 Frequency Domain Analysis

• Smooth pursuit eye movement

Page 13: Fourier Analysis of Video Signals & Frequency Response of ...eeweb.poly.edu/~yao/EL6123old/FT_sampling_noCSFT.pdf · (degree) 180 2arctan( /2 )(radian) ... Figure 2.5 The temporal

Contrast Sensitivity Functiony

fxmBt 2cos1

• B brightness, f frequency, m modulation level• What is minimum modulation level at which

fxmBt 2cos1

What is minimum modulation level at which sinusoidal grating is visible?

• 1/mmin at a given frequency is the sensitivity• Contrast sensitivity function also known as the

Modulation Transfer Function of the human eye

• Humans less sensitive to variations in chrominance

© Yao Wang, 2003 Frequency Domain Analysis 13

Page 14: Fourier Analysis of Video Signals & Frequency Response of ...eeweb.poly.edu/~yao/EL6123old/FT_sampling_noCSFT.pdf · (degree) 180 2arctan( /2 )(radian) ... Figure 2.5 The temporal

Temporal Responsep p

200

100

50

Critical flicker frequency: The lowest frame rate at which the eye does not perceive flicker.

20

10

Con

tras

t sen

sitiv

ity Provides guideline for determining the frame rate when designing a video system.

5

2

0.06 trolands

850 trolands9300 trolands

7.1 trolands77 trolands

0.65 trolands

Critical flicker frequency depends on the mean brightness of the display:

12 5 10 20 50

Frequency (Hz)

0.06 trolands

Figure 2.5 The temporal frequency response of the HVS obtained by a visualexperiment. Different curves represent the responses obtained with different meanbrightness levels, B, measured in trolands. The horizontal axis represents the flickerfrequency f , measured in Hz. Reprinted from D. H. Kelly, Visual responses to

60 Hz is typically sufficient for watching TV.

Watching a movie needs lower

© Yao Wang, 2003 Frequency Domain Analysis 14

frequency f , measured in Hz. Reprinted from D. H. Kelly, Visual responses totime-dependent stimuli. I. Amplitude sensitivity measurements, J. Opt. Soc. Am.(1961) 51:422–29, by permission of the Optical Society of America.

gframe rate than TV

Page 15: Fourier Analysis of Video Signals & Frequency Response of ...eeweb.poly.edu/~yao/EL6123old/FT_sampling_noCSFT.pdf · (degree) 180 2arctan( /2 )(radian) ... Figure 2.5 The temporal

Spatial Responsep p

100

50

tras

t sen

sitiv

ity

20

10

Figure 2.6 The spatial frequencyresponse of the HVS, obtained by a visualexperiment. The three curves result fromdifferent stabilization settings used toremove the effect of saccadic eye

Con

t

5

2

ymovements. Filled circles were obtainedunder normal, unstablized conditions;open squares, with optimal gain settingfor stabilization; open circles, with thegain changed about 5 percent. Reprintedfrom D H Kelly Motion and vision I2

0.21

0.5 1Spatial frequency (cpd)

2 5 10

from D. H. Kelly, Motion and vision. I.Stabilized images of stationary gratings,J. Opt. Soc. Am. (1979), 69:1266–74, bypermission of the Optical Society ofAmerica.

© Yao Wang, 2003 Frequency Domain Analysis 15

Page 16: Fourier Analysis of Video Signals & Frequency Response of ...eeweb.poly.edu/~yao/EL6123old/FT_sampling_noCSFT.pdf · (degree) 180 2arctan( /2 )(radian) ... Figure 2.5 The temporal

Spatial sensitivityp y xfmBtyx 2cos1,,

© Yao Wang, 2003 Frequency Domain Analysis 16

Image provided by Amy Reibman

Page 17: Fourier Analysis of Video Signals & Frequency Response of ...eeweb.poly.edu/~yao/EL6123old/FT_sampling_noCSFT.pdf · (degree) 180 2arctan( /2 )(radian) ... Figure 2.5 The temporal

Spatiotemporal Responsep p p

300 300 The reciprocal relation

100

300

itivi

ty

100

300

itivi

ty

The reciprocal relation between spatial and temporal sensitivity was used in TV system design:

Interlaced scan provides

10

30

Con

tras

t sen

si

10

30

Con

tras

t sen

si

ptradeoff between spatial and temporal resolution.

0.3

3

1 3 10Spatial frequency (cpd)

30 0.3

3

1 3 10Temporal frequency (Hz)

30Spatial frequency (cpd)

(a)

Temporal frequency (Hz)

(b)

Figure 2.7 Spatiotemporal frequency response of the HVS. (a) Spatial frequency responsesfor different temporal frequencies of 1 Hz (open circles), 6 Hz (filled circles), 16 Hz (opentriangles), and 22 Hz (filled triangles). (b) Temporal frequency responses for different spatialf i f 0 5 d ( i l ) 4 d (fill d i l ) 16 d ( i l ) d 22 d

© Yao Wang, 2003 Frequency Domain Analysis 17

frequencies of 0.5 cpd (open circles), 4 cpd (filled circles), 16 cpd (open triangles), and 22 cpd(filled triangles). Reprinted from J. G. Robson, Spatial and temporal contrast sensitivityfunctions of the visual systems, J. Opt. Soc. Am. (1966), 56:1141–42, by permission of theOptical Society of America.

Page 18: Fourier Analysis of Video Signals & Frequency Response of ...eeweb.poly.edu/~yao/EL6123old/FT_sampling_noCSFT.pdf · (degree) 180 2arctan( /2 )(radian) ... Figure 2.5 The temporal

Smooth Pursuit Eye Movementy

• Smooth Pursuit: the eye tracks moving objects• Smooth Pursuit: the eye tracks moving objects• Net effect: reduce the velocity of moving objects on the retinal plane,

so that the eye can perceive much higher raw temporal frequencies than indicated by the temporal frequency responsethan indicated by the temporal frequency response.

yyxxt

yx

vv

fvfvf

vv

:)~~(atmovingiseyewhen theretinaat thefrequencytemporalObserved

)(

:),(at moving isobject n themotion wheobject by causedfrequency Temporal

yyxxt

yyxxtt

yx

vvvvf

fvfvff

vv

~,~ if 0~

)~~(~:),(at movingiseye when theretinaat thefrequency temporalObserved

© Yao Wang, 2003 Frequency Domain Analysis 18

yy

Page 19: Fourier Analysis of Video Signals & Frequency Response of ...eeweb.poly.edu/~yao/EL6123old/FT_sampling_noCSFT.pdf · (degree) 180 2arctan( /2 )(radian) ... Figure 2.5 The temporal

ivity

0.15No SPEM

S eque

ncy

(Hz)

Con

tras

t sen

siti

0 20

30

5

0.1

0

0.05

Spatial frequency (cpd) Tem

pora

l fre

q

0

101015

20

(a)

0.15

0.1

0 05

0.15

0.1

0 05SPEM SPEM

Spatial frequency (cpd)

0

0.05

0

010

20

30

510

1520

Tem

pora

l fre

quen

cy (H

z)Spatial frequency (cpd)

0

0.05

0

010

20

30

510

1520

Tem

pora

l fre

quen

cy (H

z)SPEM2 deg/s

SPEM10 deg/s

(b)

pd) Te

(c)

pd) Te

Figure 2.8 Spatiotemporal response of the HVS under smooth pursuit eye movements:(a) without smooth pursuit eye movement; (b) with eye velocity of 2 deg/s; (c) with eyevelocity of 10 deg/s. Reprinted from Girod, B. “Motion compensation: visual aspects, accuracy,and fundamental limits.” In Sezan, M. I., and R. L. Lagendijk, eds., Motion Analysis and ImageSequence Processing, Boston: Kluwer Academic Publishers, 1993, 126–52, by permission ofKluwer Academic Publishers.

Page 20: Fourier Analysis of Video Signals & Frequency Response of ...eeweb.poly.edu/~yao/EL6123old/FT_sampling_noCSFT.pdf · (degree) 180 2arctan( /2 )(radian) ... Figure 2.5 The temporal

Video Sampling – A Brief Discussionp g

• Review of Nyquist sampling theorem in 1 D• Review of Nyquist sampling theorem in 1-D• Extension to multi-dimensions• Prefilter in video camerasPrefilter in video cameras• Interpolation filter in video displays

© Yao Wang, 2003 Frequency Domain Analysis

Page 21: Fourier Analysis of Video Signals & Frequency Response of ...eeweb.poly.edu/~yao/EL6123old/FT_sampling_noCSFT.pdf · (degree) 180 2arctan( /2 )(radian) ... Figure 2.5 The temporal

Nyquist Sampling Theorem in 1-Dyq p g

• Given a band limited signal with maximum frequency f it can• Given a band-limited signal with maximum frequency fmax, it can be sampled with a sampling rate fs>=2 fmax. The original continuous signal can be reconstructed (interpolated) from the samples exactly, by using an ideal low pass filter with cut-offsamples exactly, by using an ideal low pass filter with cut off frequency at fs /2.

• Practical interpolation filters: replication (sample-and-hold, 0th

order), linear interpolation (1st order), cubic-spline (2nd order)order), linear interpolation (1 order), cubic spline (2 order)• Given the maximally feasible sampling rate fs, the original signal

should be bandlimited to fs /2, to avoid aliasing. The desired prefilter is an ideal low-pass filter with cut-off frequency at f /2.prefilter is an ideal low pass filter with cut off frequency at fs /2.

• Prefilter design: Trade-off between aliasing and loss of high frequency

© Yao Wang, 2003 Frequency Domain Analysis 21

Page 22: Fourier Analysis of Video Signals & Frequency Response of ...eeweb.poly.edu/~yao/EL6123old/FT_sampling_noCSFT.pdf · (degree) 180 2arctan( /2 )(radian) ... Figure 2.5 The temporal

Extension to Multi-dimensions

• If the sampling grid is aligned in each dimension (rectangular in• If the sampling grid is aligned in each dimension (rectangular in 2-D) and one performs sampling in each dimension separately, the extension is straightforward:– Requirement: f >= 2 fRequirement: f s,i > 2 f max,i

– Interpolation/pre-filter: ideal low-pass in each dimension• If the sampling grid is an arbitrary lattice, the support region of

the signal spectrum must be limited within the Voronoi region ofthe signal spectrum must be limited within the Voronoi region of the reciprocal of the sampling lattice– See Chapters 3 and 4 for sampling and sampling-rate conversion

for K-D signals and for video in particular.for K D signals and for video in particular.– Interlaced scan uses a non-rectangular lattice in the vertical-

temporal plane.

© Yao Wang, 2003 Frequency Domain Analysis 22

Page 23: Fourier Analysis of Video Signals & Frequency Response of ...eeweb.poly.edu/~yao/EL6123old/FT_sampling_noCSFT.pdf · (degree) 180 2arctan( /2 )(radian) ... Figure 2.5 The temporal

Aliasing in 2Dg

Example provided by Amy Reibman

Page 24: Fourier Analysis of Video Signals & Frequency Response of ...eeweb.poly.edu/~yao/EL6123old/FT_sampling_noCSFT.pdf · (degree) 180 2arctan( /2 )(radian) ... Figure 2.5 The temporal

Pictures off the web

Example provided by Amy Reibman

Page 25: Fourier Analysis of Video Signals & Frequency Response of ...eeweb.poly.edu/~yao/EL6123old/FT_sampling_noCSFT.pdf · (degree) 180 2arctan( /2 )(radian) ... Figure 2.5 The temporal

Fourier examples of aliasing: jaggiesp g j gg

ARReibman 2009Example provided by Amy Reibman

Page 26: Fourier Analysis of Video Signals & Frequency Response of ...eeweb.poly.edu/~yao/EL6123old/FT_sampling_noCSFT.pdf · (degree) 180 2arctan( /2 )(radian) ... Figure 2.5 The temporal

Fourier examples of aliasing: jaggiesp g j gg

ARReibman 2009Example provided by Amy Reibman

Page 27: Fourier Analysis of Video Signals & Frequency Response of ...eeweb.poly.edu/~yao/EL6123old/FT_sampling_noCSFT.pdf · (degree) 180 2arctan( /2 )(radian) ... Figure 2.5 The temporal

Fourier examples of aliasing: jaggiesp g j gg

ARReibman 2009Example provided by Amy Reibman

Page 28: Fourier Analysis of Video Signals & Frequency Response of ...eeweb.poly.edu/~yao/EL6123old/FT_sampling_noCSFT.pdf · (degree) 180 2arctan( /2 )(radian) ... Figure 2.5 The temporal

Fourier examples of aliasing: jaggiesp g j gg

ARReibman 2009Example provided by Amy Reibman

Page 29: Fourier Analysis of Video Signals & Frequency Response of ...eeweb.poly.edu/~yao/EL6123old/FT_sampling_noCSFT.pdf · (degree) 180 2arctan( /2 )(radian) ... Figure 2.5 The temporal

Fourier examples of aliasing: jaggiesp g j gg

ARReibman 2009Example provided by Amy Reibman

Page 30: Fourier Analysis of Video Signals & Frequency Response of ...eeweb.poly.edu/~yao/EL6123old/FT_sampling_noCSFT.pdf · (degree) 180 2arctan( /2 )(radian) ... Figure 2.5 The temporal
Page 31: Fourier Analysis of Video Signals & Frequency Response of ...eeweb.poly.edu/~yao/EL6123old/FT_sampling_noCSFT.pdf · (degree) 180 2arctan( /2 )(radian) ... Figure 2.5 The temporal

Video Cameras

• Sampling mechanismSampling mechanism– All perform sampling in time– Film cameras capture continuous frames on film– Analog video cameras sample in vertical but not horizonal g p

direction, arrange the resulting horizontal scan lines in a 1-D continuous signal

– Digital cameras sample in both horizontal and vertical direction, yielding pixels with discrete 3-D coordinatesyielding pixels with discrete 3-D coordinates

• Sampling frequency (frame rate and line rate)– Depending on the maximum frequency in the underlying signal, the

human visual thresholds, as well as technical feasibility and costy• Prefilter

– Controlled by temporal exposure, scanning beam, etc.– Digital cameras may capture at higher sampling rates and then

i l li i fil i b f i l l i

© Yao Wang, 2003 Frequency Domain Analysis 31

implement explicit filtering before converting to lower resolution

Page 32: Fourier Analysis of Video Signals & Frequency Response of ...eeweb.poly.edu/~yao/EL6123old/FT_sampling_noCSFT.pdf · (degree) 180 2arctan( /2 )(radian) ... Figure 2.5 The temporal

Typical Camera Responseyp p

• Temporal prefilter: the value read out at any frame is• Temporal prefilter: the value read out at any frame is the average of the sensed signal over the exposure time

• Spatial prefilter: the value read out at any pixel is a weighted integration of the signal in a small windowweighted integration of the signal in a small window surrounding it, called the aperture, can be approximated by a box average or a 2-D Gaussian function

© Yao Wang, 2003 Frequency Domain Analysis 32

Page 33: Fourier Analysis of Video Signals & Frequency Response of ...eeweb.poly.edu/~yao/EL6123old/FT_sampling_noCSFT.pdf · (degree) 180 2arctan( /2 )(radian) ... Figure 2.5 The temporal

Video Displayp y

• The display device presents the analog or digital video on the• The display device presents the analog or digital video on the screen to create the sensation of continuously varying signal in both time and space.

• With CRT, three electronic beams strike red, green, and blue phosphors with the desired intensity at each pixel location. No explicit interpolation filters are used Spatial filtering determinedexplicit interpolation filters are used. Spatial filtering determined by the size of the scanning beam, temporal filtering determined by the decaying time of the phosphors.

• The eye performs the interpolation task: fuses discrete frames and pixels as continuously varying, if the temporal and spatial sampling rates are sufficiently high

© Yao Wang, 2003 Frequency Domain Analysis 33

sampling rates are sufficiently high.

Page 34: Fourier Analysis of Video Signals & Frequency Response of ...eeweb.poly.edu/~yao/EL6123old/FT_sampling_noCSFT.pdf · (degree) 180 2arctan( /2 )(radian) ... Figure 2.5 The temporal

Homework 2

• Reading assignment:• Reading assignment:– Chapter 2– Section 3.3, Section 3.4

• Written assignment– Prob. 2.1,2.3,2.5,2.6,2.7

© Yao Wang, 2003 Frequency Domain Analysis 34