audio video coding standard of (avs) china submitted by, swaminathan sridhar ee 5359 multimedia...

44
Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

Upload: norah-rose

Post on 11-Jan-2016

228 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

Audio Video coding Standard of (AVS) China

Submitted by,Swaminathan Sridhar

EE 5359 Multimedia Processing Project

Page 2: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

Video coding standards [5]

Page 3: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

Video coding standards [4], [5]MPEG-2 (DVD, MPEG-2 (DVD, SDTV, HDTV)• More than 10 years old• Compression efficiency• 4.7GB DVD – 2 hours movie (5.3Mbps)• 18GB – 2 hours high definition movie (20Mbps)MPEG-4 AVC/H.264 (Multimedia applications)• Advanced coding techniques• Multiple-reference frame prediction• Context-based adaptive binary arithmetic coding• High compression efficiency• 1.5~2Mbps for SD, 6~8Mbps for HD• Save storage space, channel bandwidth, and frequency spectrum

Page 4: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

Development stages of AVS [3]December 2003• In the 7th AVS meeting, AVS-video (part-2) and AVS-system (part-1)

was finalized.December 2004• In the 11th AVS meeting, AVS-M (part-7) was finalized March 2005• Authentication of ‘AVS101’ – high definition decoding chipMay 2005• AVS Industry Alliance was set up.June 2005• Joint AVS/ISMA workshop on IPTV standard and industry forumFebruary 2006• AVS part-2 was announced as a national standard.

Page 5: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

Parts of AVS [3]

Page 6: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

Applications of the commonly used parts of AVS China [3]

AVS Part-2: HD/SD video• Jizhun Profile & Zengqiang Profile• HD broadcasting• High density storage media• Video surveillances• Video on demandAVS Part-7: Mobility video• Jiben Profile• Record and local playback on mobile devices• Multimedia Message Service (MMS)• Streaming and broadcasting• Real-time video conversation

Page 7: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

Major and Minor coding tools used in AVS part 2 [1]

Major tools• Interlace handling: Picture-level adaptive frame/field coding (PAFF)• Macroblock-level adaptive frame/field coding (MBAFF)• Intra prediction: 5 modes for luma and 4 modes for chroma• Motion compensation: 16x16/16x8/8x16/8x8 block size• Resolution of MV: 1/4-pel, 4-tap interpolation filter• Transform: 16bit-implemented 8x8 integer cosine transform• Quantization and scaling: scaling only in encoder• Entropy coding: 2D-VLC and Arithmetic Coding• In-loop deblocking filterMinor tools• Motion vector prediction• Adaptive scan

Page 8: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

AVS encoder structure [1]

Page 9: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

Different picture types [2]

Three types of picture are defined by AVS namely • Intra pictures (I) •Predicted pictures (P)- At most two reference frames (P or I) •Interpolated pictures (B)- two reference frames (I or P or both)

Page 10: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

MB level Adaptive frame coding [2]MB-level adaptive frame/field coding (MBAFF)• The frame/field encoding decision is made independently for each vertical pair of

macro blocks in a frame.• A frame consisting of both moving and non-moving regions is coded more

efficiently by:• frame mode for the non-moving regions• field mode for the moving regions• MBAFF is much more complicated than PAFF– zig-zag scanning– motion vector prediction– intra prediction– deblocking– context modeling in entropy coding• The advantage compared with the MBAFF in H.264– A field-coded MB belonging to the bottom field CAN use the top field ofthe same frame as a reference for motion prediction

Page 11: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

Intra Prediction [2]• Five different modes for luma is as shown below

Page 12: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

Luma Intra Prediction difference between AVS and H.264 [6]

AVS• Block size: 8x8• 5 modes• Reference pixels low pass filtered• Advantages: low complexity with less modesH.264• Block size: 4x4 or 16x16• 9 modes for 4x4 and 4 modes for 16x16• Advantage: better prediction• Disadvantage: more complex

Page 13: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

Intra prediction modes for Chroma [2]• 4 Prediction modes for Chroma

Page 14: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

Inter Prediction and Motion Compensation [1]

• At most 2 frames can be stored as reference for motion prediction.• Block size of motion prediction and compensation– 16x16, 16x8, 8x16 and 8x8• In each MB, the number of MV pairs can be 1, 2 or 4, depending on the block size

of MC.• MVD, the difference between the predicted MV and the real MV, is coded.• Resolution of MV– 1/4-pixel for luma– 1/8-pixel for chroma• Motion prediction modes• – Forward• – Backward (only applicable for B frame)• – Bi-directional (only applicable for B frame)• Skip• Direct• Symmetric

Page 15: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

Reference Frame [1]

• At most 2 reference frames are used. No matter• PAFF or MBAFF is used,– if the current MB is frame-coded, 2 frames can be

used as reference for motion prediction.– if the current MB is field-coded, 4 fields can be

used.• Reference index should be coded with every MC

block to indicate which reference picture is used

Page 16: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

Reference Index [1]

Page 17: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

Motion Vector Prediction [3]

Page 18: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

Motion Vector Prediction [3]• Use A, B, C, D’s MV (MVA, MVB, MVC and MVD) to predict E’s MV

(PredMVE)• Reason: reduce the bits for coding MV• Method:• Geometrical median of MVA, MVB, MVC• VAB = Dist(MVA, MVB)• VBC = Dist(MVB, MVC)• VCA = Dist(MVC, MVA)• FMV = Median(VAB, VBC, VCA) where Dist(MV1, MV2)=|x1-x2|+|y1-y2|.• Determine PredMVE• If FMV equals VAB, PredMVE=MVC.• If FMV equals VBC, PredMVE=MVA.• If FMV equals VCA, PredMVE=MVB.

Page 19: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

Interpolation for Luma [3]

• Resolution– Quarter-pixel• Filter– Half-pixel• Blue: [-1, 5, 5, -1]– Quarter-pixel• White: [1, 7, 7, 1]• Red: bilinear

Page 20: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

Interpolation for Luma [3]

Page 21: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

Interpolation for Chroma [3]

Page 22: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

Interpolation for Chroma [3]

• BilinearpredMatrix[x,y] = [(8–dx)×(8–dy)×A +

dx×(8dy)×B + (8–dx)×dy×C + dx×dy×D]/64

Page 23: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

Forward and Backward Prediction [1]

• Forward prediction• MV pointing only to the previous frame• Get reference block only from the previous

frame• Backward prediction• MV pointing only to the backward frame• Get reference block only from the backward

frame

Page 24: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

Forward and Backward Prediction [1]

Page 25: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

Bi-directional Prediction [1]• Skip mode• Block size of MC: 16x16• No transform coefficient is coded, since they all equal zeros.• No MV is coded, since they can be calculated.• Direct mode• Block size of MC: 16x16 or 8x8• Transform coefficients are not all zeros, so they had to be coded.• No MV is coded, since they can be calculated the same way for skip mode.• Symmetric mode• Block size of MC: 16x16, 16x8, 8x16, 8x8.• Transform coefficients are not all zeros, so they had to be coded.• Only forward MV is coded, and the backward MV can be calculated by

using the forward one.

Page 26: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

MV Derivation for Skip and Direct Mode [1]

Page 27: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

MV Derivation for Symmetric Mode [1]

Page 28: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

Pre-scale Transform [3]

Page 29: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

AVS 8×8 ICT Matrix [3]

Page 30: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

Context-based Adaptive 2D VariableLength Coding (CA-2D-VLC) [1]

(level, run) pair mapping to CodeNum using VLC tables

• level>0: CodeNum is the number in VLC tables directly

• level<0: CodeNum is number+1 in VLC tables .• • Example• level= 2, run=1, CodeNum=11;• level= −2, run=1, CodeNum=12CodeNum mapping to bit• string using Exp-Colomb coding

Page 31: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

Context-based Adaptive 2D VariableLength Coding (CA-2D-VLC) [1]

Page 32: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

Deblocking Filter [3]8x8 block• Three steps• Choose boundary strength• (BS), according to• Prediction modes• MV• Decide whether to filter• according to• Quantization Parameter (QP)• BS• – Apply filter to the boundary

Page 33: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

Deblocking Filter [3]

Page 34: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

AVS Part-2 vs H.264/AVC [4], [6]

Page 35: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

AVS Part-2 vs H.264/AVC [4], [6]

Page 36: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

AVS Part-2 performance [1], [A]

Page 37: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

Container.qcif sequence [A]

Page 38: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

Decoded frame [A]

Page 39: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

Claire.qcif sequence [A]

Page 40: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

Decoded frame [A]

Page 41: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

News.qcif sequence [A]

Page 42: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

Decoded frame [A]

Page 43: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

References1] L. Yu et al. “An Overview of AVS-Video: tools, performance and complexity”, Visual Communications and Image Processing 2005, Proc. of SPIE, vol. 5960, pp.596021, July 31, 2006.2] L. Yu et al. “An area-efficient VLSI architecture for AVS intra frame encoder” Visual Communications and Image Processing 2007, Proc. of SPIE-IS & T Electronic Imaging, SPIE vol. 6508, pp. 650822, Jan. 29, 2007. 3] W. Gao et al. “AVS - The Chinese Next-Generation Video Coding Standard” NAB, Las Vegas, 2004. 4] T. Wiegand et al. “Overview of the H.264/AVC Coding Standard” IEEE Trans. Circuits Syst. Video Technol., vol.13, pp.560-576, July 2003. 5] J. Wang et al. “An AVS-to-MPEG2 Transcoding System” China Proceedings of 2004 International Symposium on Intelligent Multimedia, Video and Speech Processing , Hong Kong, pp. 302-305, October 20-22, 2004.6] X. Wang et.al “Performance comparison of AVS and H.264/AVC video coding standards” J. Comput. Sci. & Technol., Vol.21, No.3, pp.310-314 J, May 2006.7] Bo Tang et al. “ AVS Encoder Performance and Complexity Analysis Based on Mobile Video Communication”, WRI International conference on Communications and Mobile Computing, CMC ‘09, volume 3, pp. 102-107, 6-8 Jan. 2009.

Page 44: Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project

Web References:AVS China softwareA] ftp://159.226.42.57/public/avs_doc/avs_software