ct20a6100 machine vision and digital image analysis

CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS MPEG Pauli Jutila Cristina Petre


Post on 20-Jan-2016



TRANSCRIPT

Page 1: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

MPEG

Pauli Jutila

Cristina Petre

Page 2: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

Contents

1. Introduction to MPEG

2. MPEG Family

3. Intra Frame Coding Techniques

4. Non-Intra Frame Coding Techniques

5. Summary

Page 3: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

Why do we need video compression?

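Example video compression

- 1920x1080 pixels, 30 frames/sec, 8 bits per color component (24 bits/pixel) => about 1.5 Gb/sec uncompressed

- 6 MHz channel bandwidth => 19.2 Mb/sec, of which roughly 18 Mb/sec remains for video after audio and transport data

=> required compression of about 83:1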
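The arithmetic behind these figures can be checked with a short sketch. The 18 Mb/sec video share is an assumption introduced here to reproduce the 83:1 ratio; dividing by the full 19.2 Mb/sec channel gives a somewhat lower ratio.

```python
def raw_bitrate(width, height, fps, bits_per_pixel):
    """Uncompressed video bit rate in bits per second."""
    return width * height * fps * bits_per_pixel

# 8 bits for each of the three color components = 24 bits per pixel
raw = raw_bitrate(1920, 1080, 30, 3 * 8)

channel = 19.2e6      # channel capacity quoted on the slide
video_share = 18e6    # ASSUMPTION: part of the channel left for video

print(f"raw rate: {raw / 1e9:.2f} Gb/sec")
print(f"ratio against full channel: {raw / channel:.1f}:1")
print(f"ratio against video share:  {raw / video_share:.1f}:1")
```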

Page 4: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

The MPEG Family

Page 5: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

MPEG-1

● Basic form using motion compensated block-based transform coding techniques

● Optimized to work for 352x240 at 30 frames/sec or 352x288 at 25 frames/sec; possible to go up to 4095x4095

● Defined for progressive frames only

● No direct provision for interlaced video applications

Page 6: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

MPEG-2

● Designed for digital television broadcasting

● Potentially high quality video (4-9Mb/sec)

● Profiles and levels:

profile - bitstream scalability, colorspace resolution

level - image resolution, maximum bit-rate per profile

● MP@ML (Main Profile at Main Level): 720x480 at 30 frames/sec

● MP@HL (Main Profile at High Level): 1920x1080 at 30 frames/sec

Page 7: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

MPEG-4

● Interactive content

● Object-oriented composite files

● Image blocks are arbitrarily shaped

● Multiplexes and synchronizes data

● Divided into a number of parts, which raises compatibility issues

Page 8: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

Coding Techniques

Page 9: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

Intra Frame Coding

Video Filter → DCT → Quantizer → Run-Length VLC → Bit-Stream Buffer

Bit-Rate Control (feedback from the bit-stream buffer to the quantizer)

Page 10: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

Video Filter

• The human eye is less sensitive to chrominance than luminance.

• RGB → YCbCr

• Choice of sub-sampling ratios:

• 4:4:4 (Professional, Post-Production)

• 4:2:2 (Professional, Post-Production)

• 4:1:1 (DV Camcorder)

• 4:2:0 (Web, DVD)
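The color conversion and 4:2:0 sub-sampling can be sketched as follows. The coefficients are the full-range BT.601 (JFIF) variant, chosen here as an assumption for simplicity; broadcast MPEG material typically uses a limited-range variant. Averaging each 2x2 block is also just one possible sub-sampling filter.

```python
def rgb_to_ycbcr(r, g, b):
    """Convert one 8-bit RGB pixel to YCbCr (full-range BT.601, assumed)."""
    y = 0.299 * r + 0.587 * g + 0.114 * b
    cb = 128 - 0.168736 * r - 0.331264 * g + 0.5 * b
    cr = 128 + 0.5 * r - 0.418688 * g - 0.081312 * b
    return y, cb, cr

def subsample_420(plane):
    """Halve a chroma plane in both directions by averaging 2x2 blocks.

    `plane` is a list of rows with even width and height.
    """
    return [
        [(plane[y][x] + plane[y][x + 1]
          + plane[y + 1][x] + plane[y + 1][x + 1]) / 4
         for x in range(0, len(plane[0]), 2)]
        for y in range(0, len(plane), 2)
    ]

white = rgb_to_ycbcr(255, 255, 255)   # luma at maximum, chroma neutral
small = subsample_420([[1, 3], [5, 7]])
print(white, small)
```

With 4:2:0, four pixels carry four luma samples plus one Cb and one Cr sample, so six samples instead of the twelve needed by 4:4:4.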

Page 11: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

Discrete Cosine Transform

• Nearly optimal in energy concentration and decorrelation

• The human eye is less sensitive to changes in higher frequencies

• Calculated in blocks, usually 8x8
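A direct, unoptimized 8x8 DCT illustrates the energy concentration: for a perfectly flat block all energy ends up in the single DC coefficient. The orthonormal scaling used here is an assumption; real encoders use fast factorizations rather than this four-level loop.

```python
import math

N = 8

def dct_2d(block):
    """Direct 2-D DCT-II of an N x N block with orthonormal scaling."""
    def c(k):
        return math.sqrt(1 / N) if k == 0 else math.sqrt(2 / N)

    out = [[0.0] * N for _ in range(N)]
    for u in range(N):
        for v in range(N):
            s = 0.0
            for x in range(N):
                for y in range(N):
                    s += (block[x][y]
                          * math.cos((2 * x + 1) * u * math.pi / (2 * N))
                          * math.cos((2 * y + 1) * v * math.pi / (2 * N)))
            out[u][v] = c(u) * c(v) * s
    return out

flat = [[100] * N for _ in range(N)]   # block with no spatial detail
coeffs = dct_2d(flat)
print(round(coeffs[0][0], 3))          # DC term carries all the energy
```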

Page 12: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

DCT Coefficient Quantization

• Each 12-bit coefficient is divided by the corresponding value from the intra quantization matrix

• Each value in the quantization matrix is pre-scaled by multiplying by a single value, known as the quantizer scale code (1-112)

• The goal is to force as many of the DCT coefficients as possible to zero or near zero
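The effect of the quantizer scale can be sketched with a simplified divide-and-round step. `QUANT_MATRIX` below is a hypothetical matrix whose step size grows with frequency, not the default matrix of the standard, and the exact integer arithmetic of MPEG is not reproduced.

```python
N = 8

# HYPOTHETICAL matrix: coarser steps for higher frequencies
QUANT_MATRIX = [[8 + 4 * (u + v) for v in range(N)] for u in range(N)]

def quantize(coeffs, matrix, scale):
    """Divide each coefficient by its scaled matrix entry, round to int."""
    return [[int(round(coeffs[u][v] / (matrix[u][v] * scale)))
             for v in range(N)]
            for u in range(N)]

# Illustrative coefficients: strong DC, weak low- and high-frequency terms
coeffs_in = [[0.0] * N for _ in range(N)]
coeffs_in[0][0] = 800.0
coeffs_in[0][1] = 40.0
coeffs_in[7][7] = 30.0

fine = quantize(coeffs_in, QUANT_MATRIX, scale=1)
coarse = quantize(coeffs_in, QUANT_MATRIX, scale=4)
print(fine[0][:2], fine[7][7])
print(coarse[0][:2], coarse[7][7])
```

A larger scale shrinks every level, and the high-frequency coefficient is already rounded to zero at the finest setting.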

Page 13: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

Variable Length Coding

• Zigzag scanning produces long runs of consecutive zeros

• Alternate scan pattern in MPEG-2 (for interlaced video)

• Huffman coding
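The zigzag scan and the run-length step that precedes Huffman coding can be sketched as below. The sketch treats every coefficient the same way and ends with an end-of-block marker; the separate handling of the DC coefficient and the actual Huffman tables are left out, and the function names are illustrative.

```python
def zigzag_order(n=8):
    """(row, col) positions of an n x n block in zigzag scan order."""
    return sorted(
        ((r, c) for r in range(n) for c in range(n)),
        # Walk anti-diagonals; even ones run up-right, odd ones down-left.
        key=lambda rc: (rc[0] + rc[1],
                        rc[1] if (rc[0] + rc[1]) % 2 == 0 else rc[0]),
    )

def run_length(levels):
    """Encode a coefficient sequence as (zero_run, level) pairs plus EOB."""
    pairs, run = [], 0
    for v in levels:
        if v == 0:
            run += 1
        else:
            pairs.append((run, v))
            run = 0
    pairs.append("EOB")   # trailing zeros are implied by end-of-block
    return pairs

block = [[0] * 8 for _ in range(8)]
block[0][0], block[0][1], block[2][0] = 25, 3, -2

scanned = [block[r][c] for r, c in zigzag_order()]
encoded = run_length(scanned)
print(encoded)
```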

Page 14: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

Coding Techniques

Page 15: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

I,P,B Frames

I-picture: intra-coded, no motion-compensated prediction, used as reference picture

P-picture: inter-coded, forward predicted (from I or P picture)

B-picture: inter-coded, bi-directional prediction, high coding efficiency
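Because a B-picture is predicted from both a past and a future reference, the future I- or P-picture has to be coded before the B-pictures that depend on it. A minimal sketch of this reordering, with an illustrative frame pattern and function name:

```python
def coding_order(display):
    """Reorder picture types from display order to coding order.

    Each B-picture is held back until the next I- or P-picture
    (its future reference) has been emitted.
    """
    out, pending_b = [], []
    for idx, kind in enumerate(display):
        if kind == "B":
            pending_b.append((idx, kind))
        else:
            out.append((idx, kind))
            out.extend(pending_b)
            pending_b = []
    out.extend(pending_b)   # trailing B-pictures without a future reference
    return out

order = [f"{kind}{idx}" for idx, kind in coding_order("IBBPBBP")]
print(order)
```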

Page 16: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

I,P,B Frames

Page 17: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

Motion Estimation

Page 18: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

Matching
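The slide carries only a title. One common matching approach, assumed here rather than taken from the slide, is an exhaustive search that minimizes the sum of absolute differences (SAD) between a block in the current frame and candidate blocks in the reference frame. All names are illustrative.

```python
def sad(cur, ref, bx, by, dx, dy, n):
    """Sum of absolute differences for an n x n block displaced by (dx, dy)."""
    return sum(abs(cur[by + j][bx + i] - ref[by + dy + j][bx + dx + i])
               for j in range(n) for i in range(n))

def best_vector(cur, ref, bx, by, n, search):
    """Full search over +/- `search` pixels; returns (cost, dx, dy)."""
    h, w = len(ref), len(ref[0])
    best = None
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            # Skip candidates that fall outside the reference frame
            if by + dy < 0 or by + dy + n > h or bx + dx < 0 or bx + dx + n > w:
                continue
            cost = sad(cur, ref, bx, by, dx, dy, n)
            if best is None or cost < best[0]:
                best = (cost, dx, dy)
    return best

# Synthetic frames: the current frame is the reference moved right by one pixel
ref_frame = [[x * x + 10 * y * y for x in range(8)] for y in range(8)]
cur_frame = [[row[max(x - 1, 0)] for x in range(8)] for row in ref_frame]

match = best_vector(cur_frame, ref_frame, bx=2, by=2, n=4, search=2)
print(match)
```

The vector points from the block in the current frame back to its best match in the reference frame, so content that moved right by one pixel yields dx = -1.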

Page 19: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

Prediction Error Compensation

Page 20: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

Non-Intra Decoding

Page 21: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

Summary

● Video compression is needed!

● MPEG enables a wide range of applications

● Different standards but same principles

● Spatial and temporal processing

Page 22: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS

References

● Barry G. Haskell, Atul Puri, Arun N. Netravali, Digital Video: An Introduction to MPEG-2, Chapman and Hall, 1997.

● Joan L. Mitchell, William B. Pennebaker, Chad E. Fogg, Didier J. LeGall, MPEG Video Compression Standard, Chapman and Hall, 1997.

● John Wiseman, An Introduction to MPEG Video Compression, 1998

● Marius Preda, Les standards MPEG dans l'espace multimedia, 2005

● E. G. Richardson, Video Codec Design, Wiley, 2002

● Wikipedia