1 - ipsj/itscj web view · 2008-09-0310061 study of iso/iec 14496-12/pdam.1 misc....

244
INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC 1/SC 29/WG 11 CODING OF MOVING PICTURES AND AUDIO ISO/IEC JTC 1/SC 29/WG 11 N9963 Hannover, DE – July 2008 Sourc e: Leonardo Chiariglione Title : Report of 85 th meeting Statu s Report of 85 th meeting........................................1 Annex A – Attendance list...................................17 Annex B – Agenda............................................22 Annex C – Input contributions...............................26 Annex D – Output documents..................................37 Annex E – Requirements report...............................43 Annex F – Systems report....................................48 Annex G – Video report......................................82 Annex H – JVT report........................................94 Annex I – Audio report.....................................133 Annex J – 3DG report.......................................158 Report of 85 th meeting 1 Opening The 85 th MPEG meeting was held at Hannover on 2008/07/21T09:00- 27T19:30. 2 Roll call of participants Annex 1 gives the attendance list 1

Upload: vohanh

Post on 15-Mar-2018

220 views

Category:

Documents


3 download

TRANSCRIPT

Page 1: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

INTERNATIONAL ORGANISATION FOR STANDARDISATIONORGANISATION INTERNATIONALE DE NORMALISATION

ISO/IEC JTC 1/SC 29/WG 11CODING OF MOVING PICTURES AND AUDIO

ISO/IEC JTC 1/SC 29/WG 11 N9963Hannover, DE – July 2008

Source: Leonardo Chiariglione Title: Report of 85th meetingStatus

Report of 85th meeting.................................................................................................................1Annex A – Attendance list........................................................................................................17Annex B – Agenda....................................................................................................................22Annex C – Input contributions..................................................................................................26Annex D – Output documents...................................................................................................37Annex E – Requirements report................................................................................................43Annex F – Systems report.........................................................................................................48Annex G – Video report............................................................................................................82Annex H – JVT report...............................................................................................................94Annex I – Audio report...........................................................................................................133Annex J – 3DG report.............................................................................................................158

Report of 85th meeting

1 Opening The 85th MPEG meeting was held at Hannover on 2008/07/21T09:00-27T19:30.

2 Roll call of participants Annex 1 gives the attendance list

3 Approval of agenda The agenda adopted is provided in Annex 2

4 Allocation of contributions The list of input documents is provided in Annex 3

5 Communications from Convenor There was no specific communication

1

Page 2: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

6 Report of previous meeting This was approved

7 Processing of NB Position PapersNational Body position papers were presented and responses drafted where appropriate

8 Work plan management

8.1 Media coding

8.1.1 AAC-ELDThe following document was approved

10032 Report on the Verification Test of MPEG-4 AAC-ELD

8.1.2 New Profiles for ALS and SLSThe following document was approved

10033 WD on New Profiles for ALS and SLS

8.1.3 Multiview Video Coding (MVC)The following documents were approved

9977 Disposition of Comments on ISO/IEC 14496-10:200X/FPDAM 1 9978 Text of ISO/IEC 14496-10:200X/FDAM 1 Multiview Video Coding

8.1.4 AFX 3rd editionThe following document was approved

10021 WD4.0 of AFX 3rd Edition

8.1.5 Frame-based Animated Mesh CompressionThe following document was approved

10017 Text of ISO/IEC 14496-16:2006/FDAM 2 (Frame based Animation Mesh Compression)

8.1.6 Multiresolution profile

8.1.7 Scalable-complexity 3D mesh compressionThe following documents were approved

10018 Text of WD2.0 of ISO/IEC 14496-16:2006/AMD 4 (Scalable Complexity 3D Mesh Compression)

10022 CE on Scalable Complexity 3D Mesh Coding

2

Page 3: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

8.1.8 Open Font Format extensionsThe following documents were approved

10067 DoC on ISO/IEC CD 14496-22 2nd Edition10068 Text of ISO/IEC FCD 14496-22 2nd Edition

8.1.9 Codec Configuration RepresentationThe following document was approved

9984 Study Text of ISO/IEC FCD 23001-4 Codec Configuration Representation9988 Description of Core Experiments in RVC9989 RVC Work Plan and FU Development Status9990 RVC Vision

8.1.10 Video Tool LibraryThe following document was approved

9985 Study Text of ISO/IEC FCD 23002-4 Video Tool Library9987 WD 2 of ISO/IEC 23002-4/Amd.2 (Tools for MPEG-4 ASP, AVC HP and SVC)9989 RVC Work Plan and FU Development Status

8.1.11 Spatial Audio Object CodingThe following documents were approved

10045 Study on ISO/IEC CD 23003-2:200x, Spatial Audio Object Coding10046 Status and Workplan on SAOC Core Experiments

8.1.12 3D Video CodingThe following documents were approved

9991 Description of Exploration Experiments in 3D Video Coding9992 Results of 3D Experts Viewing

8.1.13 Unified Speech and Audio CodingThe following document was approved

10047 Report on Unified Speech and Audio Coding Call for Proposals

8.1.14 Media Value Chain OntologyThe following document was approved

10070 WD1.0 of ISO/IEC 21000-19 Media Value Chain Ontology

3

Page 4: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

8.1.15 Representation of Sensory ExperienceThe following documents were approved

10081 WD1.0 of ROSE10082 Core Experiments on ROSE

8.1.16 New Video CodingThe following documents were approved

9993 New Challenges in Video Coding Standardization 9994 Workshop on New Challenges in Video Coding Standardization – Call for

Contributions9995 Workshop on New Challenges in Video Coding Standardization –Program

8.1.17 New Audio Coding

8.2 Composition coding

8.2.1 LASeR Amd.2The following documents were approved

10064 Request for ISO/IEC 14496-20 LASeR Amd.210065 Text of ISO/EC 14496-20 LASeR/PDAM2

8.2.2 Presentation of Structured InformationThe following document was approved

10066 WD1.0 of ISO/EC 14496-20 LASeR/Amd.3 Presentation of Structured Information

8.2.3 Rich Media UI Framework The following document was approved

10085 Context and Objectives of Rich Media UI Framework

8.2.4 Interactive Digital RadioThe following document was approved

10086 Requirements on technologies to support Interactive Digital Radio

4

Page 5: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

8.3 Description coding

8.3.1 Image Signature ToolsThe following document was approved

9979 Study Text of ISO/IEC 15938-3:2001/FPDAM 3 Image Signature Tools

8.3.2 Video Signature ToolsThe following documents were approved

9980 Use Cases of MPEG-7 Visual Signature Tools9981 Call for Proposals on Video Signature Tools9982 MPEG-7 Visual XM Document version 34.0

8.4 Transport and File formats

8.4.1 Carriage of SVC in MPEG-2 Systems

The following documents were approved

10058 DoC on ISO/IEC 13818-1:2007/FPDAM3 Carriage of SVC in MPEG-2 Systems10059 Text of ISO/IEC 13818-1:2007/FDAM3 Carriage of SVC in MPEG-2 Systems

8.4.2 Miscellaneous additions to File FormatThe following document was approved

10061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF

8.4.3 AVC File Format extensions for MVCThe following document was approved

10062 ISO/IEC 14496-15 WD2.0 MVC File Format

8.5 Multimedia architecture

8.5.1 Interfaces between virtual and real worldsThe following documents were approved

10104 Draft Requirements for MPEG-V Version 210105 MPEG-V Extended Call for Requirements

8.5.2 MPEG eXtensible Middleware The following documents were approved

5

Page 6: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

10083 WD1.0 of MXM Architecture and Technologies10084 WD1.0 of MXM APIs10106 Requirements for additional MXM technologies10107 CfP for additional MXM technologies

8.6 Application formats

8.6.1 GeneralThe following documents were approved

10108 MAF Overview10111 MAF Overview presentation document

8.6.2 Photo Player Application FormatThe following document was approved

9983 Working Draft of ISO/IEC 23000-3/Amd.2 Conformance Testing for Photo Player MAF

8.6.3 Video Surveillance Application FormatThe following documents were approved

10075 Draft DoC on ISO/IEC FCD 23000-10 (Video Surveillance Application Format)10076 Study Text of ISO/IEC FCD 23000-10 (Video Surveillance Application Format)

8.6.4 Stereoscopic Video Application FormatThe following document was approved

10079 Study Text of ISO/IEC FCD 23000-11 (Stereoscopic Video Application Format)

8.6.5 Interactive Music Application FormatThe following document was approved

10109 Draft Call for Proposals on Interactive Music Application Format

8.7 Reference implementation

8.7.1 MPEG-1 and -2 on MPEG-4 Reference SoftwareThe following documents were approved

10043 DoC on ISO/IEC 14496-5:2001/FPDAM 20, MPEG-1 and -2 on MPEG-410044 ISO/IEC 14496-5:2001/FDAM 20, MPEG-1 and -2 on MPEG-4

6

Page 7: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

8.7.2 SVC Reference SoftwareThe following document was approved

9975 Study Text of ISO/IEC 14496-5:2001/FPDAM 19 Reference Software for Scalable Video Coding

8.7.3 MVC Reference SoftwareThe following documents were approved

9973 Request for ISO/IEC 14496-5:2001/Amd.159974 Text of ISO/IEC 14496-5:2001/PDAM 15 Reference Software for Multiview Video

Coding

8.7.4 3D Graphics Compression Model Reference SoftwareThe following document was approved

10015 Text of ISO/IEC 14496-5:2001/FPDAM 22 (3DG Compression Model Reference Software)

8.7.5 Scene Partitioning Reference SoftwareThe following document was approved

10016 Request for ISO/IEC 14496-5:2001/AMD25 (Scene Partitioning Reference Software)

8.7.6 Professional Archival Application Format Reference SoftwareThe following document was approved

10072 WD1.0 of ISO/IEC 23000-6/Amd1 PA-AF Conformance and Reference Software

8.7.7 Digital Multimedia Broadcasting Application Format Reference SoftwareThe following documents were approved

10073 Request for ISO/IEC 23000-9/Amd.1 DMB AF Conf. And Ref. Soft.10074 Text of ISO/IEC 23000-9/PDAM1 DMB AF Conf. And Ref. Soft.

8.7.8 Video Surveillance Application Format Reference SoftwareThe following documents were approved

10077 Request for ISO/IEC 23000-10/Amd.1 VS AF Conf. And Ref. Soft.10078 Text of ISO/IEC 23000-10/PDAM1 VS AF Conf. And Ref. Soft.

7

Page 8: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

8.7.9 Stereoscopic Video Application Format Reference SoftwareThe following document was approved

10080 Workplan for Stereoscopic Video Application Format Ref. Soft and Conf.

8.7.10 Video Tool Library Reference SoftwareThe following document was approved

9986 WD 2 of ISO/IEC 23002-4/Amd.1 (Conformance and Reference Software)

8.8 Conformance

8.8.1 New Profiles for Professional Applications ConformanceThe following documents were approved

9968 Disposition of Comments on ISO/IEC 14496-4:2004/FPDAM 309969 Text of ISO/IEC 14496-4:2004/FDAM 30 Conformance Testing for New Profiles for

Professional Applications

8.8.2 Scalable Video Coding ConformanceThe following documents were approved

9970 Disposition of Comments on ISO/IEC 14496-4:2004/FPDAM 319971 Text of ISO/IEC 14496-4:2004/FDAM 31 Conformance Testing for Scalable Video

Coding

8.8.3 Multiview Video Coding ConformanceThe following document was approved

9972 Working Draft 1 of ISO/IEC 14496-4:2004/Amd.38 Conformance Testing for Multiview Video Coding

8.8.4 MPEG-4 Audio ConformanceThe following document was approved

10041 Proposed ISO/IEC 14496-26, Audio Conformance

8.8.5 AAC-ELD, OAFI and additional AAC ConformanceThe following document was approved

10040 Study on ISO/IEC 14496-4:2004/PDAM 36, AAC-ELD, OAFI and additional AAC Conformance

8

Page 9: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

8.8.6 MultiResolution Profile ConformanceThe following document was approved

10029 Text of ISO/IEC 14496-4:2004/FPDAM 33 (MultiResolution Profile Conformance)

8.8.7 3D Graphics Compression Model ConformanceThe following document was approved

10019 DOCR on ISO/IEC 14496-4:2004/PDAM 3410013 Text of ISO/IEC 14496-4:2004/FPDAM 34 (3DG Compression Model

Conformance)

8.8.8 Scene Partitioning ConformanceThe following document was approved

10014 Request for ISO/IEC 14496-4:2004/AMD39 (Scene Partitioning Conformance)

8.8.9 Professional Archival Application Format Conformance

10072 WD1.0 of ISO/IEC 23000-6/Amd1 PA-AF Conformance and Reference Software

8.8.10 Digital Multimedia Broadcasting Application Format Conformance

10073 Request for ISO/IEC 23000-9/Amd.1 DMB AF Conf. And Ref. Soft.10074 Text of ISO/IEC 23000-9/PDAM1 DMB AF Conf. And Ref. Soft.

8.8.11 Video Surveillance Application Format Conformance The following documents were approved

10077 Request for ISO/IEC 23000-10/Amd.1 VS AF Conf. And Ref. Soft.10078 Text of ISO/IEC 23000-10/PDAM1 VS AF Conf. And Ref. Soft.

8.8.12 Stereoscopic Video Application Format Conformance The following document was approved

10080 Workplan for Stereoscopic Video Application Format Ref. Soft and Conf.

8.8.13 Video Tool Library ConformanceThe following document was approved

9986 WD 2 of ISO/IEC 23002-4/Amd.1 (Conformance and Reference Software)

9

Page 10: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

8.9 Maintenance

8.9.1 Systems coding standards

10063 ISO/IEC 14496-15/DCOR3

8.9.2 Video coding standards

9976 Study Text of ISO/IEC 14496-10:200X/DCOR 1

8.9.3 Audio coding standardsThe following documents were approved

10030 ISO/IEC 13818-4:2004/Amd 2:2005/DCOR 2, AAC Conformance10031 ISO/IEC 13818-7:2006/DCOR 1, AAC10034 ISO/IEC 14496-3:2005/DCOR 6, AAC10035 ISO/IEC 14496-3:2005/AMD 2:2006/DCOR 4, HE-AAC V2 Profile10036 ISO/IEC 14496-3:2005/AMD 3:2006/DCOR 2, SLS10037 ISO/IEC 14496-3:2005/AMD 9:2008/DCOR 1, AAC-ELD10039 ISO/IEC 14496-4:2004/DCOR 6, AAC-LD10042 ISO/IEC 14496-4:2004/AMD13:200x/DCOR 2, AAC-LD bitstreams

8.9.4 3DG coding standards

10023 DOCR on ISO/IEC 14496-5:2001/PDAM 2210020 Study Text of ISO/IEC 14496-16:2006/AMD1:2007/COR2

8.10 Time lineThe following documents were approved

10001 MPEG Standards10002 Table of unpublished FDISs10003 Work plan and time line

9 Organisation of this meeting

9.1 Tasks for subgroupsThe following tasks were assigned

Requirements Std Pt Amd20 Use of Laser and BIFS for UI

4 X Support of interactive digital radioA 12 Interactive music AFV CfR for Interfaces with virtual worlds

10

Page 11: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Transfer Media Value Chain Ontology results to SystemsTransfer Representation of Sensory Experience Information results to SystemsUser Interface frameworkTransfer MPEG eXtensible Middleware results to SystemsDescription of work items for 10003Contribution to press release Carriage of SVC on MP2 TS MVC CfP for video signature AAC ELD VT USAC kick off New Audio lossless profiles FAMCNew standard areas Video

Open Call: 1st August Paper submission: 5 September Final program: 10 September

Audio Gaming

Systems Std Pt Amd2 1 3 Carriage of SVC

Cor Carriage of AVC4 1 RA

4 26 Open Font Format Conformance37 FF conformance

5 14 Open Font Format Reference Software23 Synthetised texture RS

AVC FF RSSVC FF RS

11 6 Scene partitioning12 1 Miscellanea

Cor.1 Flute15 3 MVC File Format20 3 Laser-BIFS integration20 Presentation of Structured Information22 2nd Ed Open Font Format

7 5 MPEG query format RS6 MPEG query format Conformance

21 7 Cor1 Amendment on query fomat capability

8 1 Minor enhancement19 MVCO

A 4 1 Musical Slide Show MAF conformance & RS2 Protected Musical Slide Show MAF conformance & RS

5 2nd Ed Media Streaming MAF6 Professional Archival AF7 1 Open Access AF Refence Software and Conformance

11

Page 12: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

8 Portable Video Player AF RS & C9 Cor 1 DMB MAF9 1 DMB MAF Reference Software and Conformance

10 Video Surveillance MAF1 Video Surveillance MAF Reference Software and

Conformance11 Stereoscopic video AF 12 Interactive music AF

B 2 1 Fragment Request Unit RS and ConformanceE 8 M3W Reference Software and Conformance

MPEG eXtensible MiddlewareInformation exchange with virtual worldsRepresentation of sensory effects informationUpdate MPEG technology web page MAFsContribution to press releaseDescription of work itemsOpen Font Format, MediaValue Chain Ontology, Representation of Sensory Experience, Joint use of Laser and BIFS, Presentation of Structured Information, AVC File Format extensions for MVC, MPEG eXtensible Middleware

Video2 2 3 1080P/60 level

4 3 1080P/60 level conformance4 2 5 Simple Studio Profile level 5 and 6

Cor 3 Simple Studio Profile level 44 35 Simple Studio Profile level 5 and 6 conformance

7 3 3 Image Signature Tools3 4 Video Signature Tools

4 9 3 Reference Hardware DescriptionA 3 1 Photo Player Reference Software

2 Photo Player ConformanceB 4 Codec Configuration DescriptionC 4 Video Tool Library

4 1 Video Tool Library Conformance & RS3DV/FTVUpdate MPEG technology web pageContribution to press releaseDescription of work items

JVT 4 4 30 New AVC Profiles for Professional Applications Conformance

31 Scalable Video Coding Conformance38 MVC C

5 19 Scalable Video Coding Reference SW15 MVC RS

12

Page 13: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

10 1 Multi-View Video CodingCor 1 Miscellanea2 AVC Constrained Baseline Profile

Update MPEG technology web pageContribution to press releaseDescription of work items

Audio 4 3 ALS profileSLS profile

3 AAC-ELD VT4 36 AAC-ELD conformance5 24 AAC-ELD Reference Software

D 2 Spatial Audio Object Coding3 USAC

Update MPEG technology web pageContribution to press releaseDescription of work itemsNew audio issues

3DG4 4 32 FAMC (Frame based Animated Mesh Compress.)

Conformance33 Multiresolution profile conformance34 3D Graphics Compression model Conformance

5 21 FAMC (Frame based Animated Mesh Compress.) Reference software

22 3D Graphics Compression model Reference Software11 6 Space partitioning16 3rd Ed

2 Frame-based animated mesh compression3 Multiresolution profile4 Scalable complexity 3DMC

25 3D Graphics Compression modelInformation exchange with virtual worldsUpdate MPEG technology web pageContribution to press releaseDescription of work items

9.2 Joint meetings The following joint meetings were held

Groups What Day Where TimeS, R RoSE, MVCO, Tue Sys 09:00-10:00S, R, 3 MXM, MPEG-V Tue Sys 10:00-11:00S, R UI framework reqs Tue Sys 11:00-12:00R, V Future video stds Tue Video 14:00-16:00R, S, V, A, 3 Gaming Wed Plenary 11:00-12:00A, S, R Interactive music AF Wed Audio 16:00-17:00

13

Page 14: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

S, R New BIFS profile for DMB Wed Sys 17:00-17:30A, R New directions in audio Wed Aud 17:00-18:00S, R LASeR reqs for per2per comm. Wed Sys 17:30-18:00R, V Future video stds Thu Video 09:00-10:00A, R New audio profiles Thu Req 11:00-11:30

10 WG management

10.1 Terms of reference The terms of reference were approved

10000 Terms of reference

10.2 Editors The following documents were approved

10404 Editors of MPEG standards10012 Project Editors for ISO/IEC Certificates of Appreciation

10.3 Liaisons The following liaison statements were issued

10088 Liaison statement to SC34/WG2 on Open Font Format10089 Liaison statement to WG 1 on PA-AF10090 Liaison statement to ITU-T SG16 on IPTV10092 Liaison statement to Open IPTV Forum on IPTV10093 Liaison statement to ATSC M/H on Rich Media10094 Liaison statement to DVB MIS10095 Liaison statement to TC 100 on Audio Book10096 Liaison statement to TC 100 on CDV 60728-1-210097 Liaison statement to TC 100 on NP: Multimedia Systems and Equipment10055 Liaison statement to ATSC on Carriage of SVC10056 Liaison statement to DVB on Carriage of SVC10057 Liaison statement to SCTE on Carriage of SVC9996 Liaison Statement to SMPTE re RVC 9997 Liaison Statement to ITU-T SG 9 re FTV 9998 Liaison Statement to ATSC9999 Liaison Statement from JVT to various Organizations re MVC10049 Liaison response to ITU-T SG 16 (m15261, m15516)10050 Liaison response to IEC TC 100 (m15555)

10.4 Ad hoc groups

10100 Ad Hoc Group on Application Format10099 Ad Hoc Group on MPEG File Formats

14

Page 15: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

10098 Ad Hoc Group on Scene Representation10027 AHG on 3D Video Coding10060 AHG on 3DGC documents, software maintenance and core experiments10053 AHG on Audio Standards Maintenance10103 AHG on Font Format10110 AHG on Information Exchange with Virtual Worlds10024 AHG on Maintenance of MPEG-4 Visual related Documents, Reference Software

and Conformance10026 AHG on MPEG-7 Visual10101 AHG on MVCO WD Editing10028 AHG on New Challenges in Video Coding Standardization10025 AHG on Reconfigurable Video Coding10054 AHG on SAOC, USAC, and Support for new Loudspeaker Configurations10102 AHG on the RoSE Framework

10.5 Asset management The following documents were approved

10005 Schema assets updates10006 Software assets10007 Conformance assets10008 Content assets10009 URI assets

10.6 IPR management The following document was approved

10010 Standards under development for which a call for patent statements is issued

10.7 Work plan

11 Administrative matters

11.1 Responses to National Bodies The following documents were approved

10051 Response to FNB (m15551)10052 Response to KNB (m15720)

11.2 Schedule of future MPEG meetings The following meeting schedule was approved

# City Country yy mm dd-dd85 Hannover DE 08 07 21-2586 Busan KR 08 10 13-17

15

Page 16: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

87 Lausanne CH 09 02 02-0688 Maui, HI US 09 04 20-2489 London UK 09 06-07 29-0390 Xian CN 09 10 26-3091 Kyoto JP 10 01 18-2292 ? AU? 10 04 12-1693 ? IT 10 07 19-2394 ? ? 10 10

11.3 Promotional activities

The following documents were approved

10069 MPEG-4 White Paper – Font Format10071 MPEG-A White Paper – Professional Archival Application Format9965 Hannover press release

12 Resolutions of this meeting These were approved

13 A.O.B There was no other business

14 ClosingThe meeting closed at 2008/07/25T19:30

16

Page 17: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Annex A – Attendance list

First Name Last Name Affiliation CountryDr. Christian Timmerer Klagenfurt University AustriaMr. Kenneth Vermeirsch UGent - MMLab BelgiumMr. T. Wiegand Fraunhofer HHI BerlinDr. Philippe Gournay VoiceAge Corporation CanadaMr. Roch Lefebvre Université de Sherbrooke CanadaMr. Redwan Salami VoiceAge Corporation CanadaMrs. Dandan Ding Zhejiang University ChinaDr. Wen Gao Peking University ChinaPr. Yun He Tsinghua University ChinaDr. Junyan Huo Xidian University ChinaMr. Sixin Lin Huawei Tech. Ltd Co. ChinaMr. Siwei MA Peking University ChinaMr. Honggang Qi Chinese Academy of Sciences ChinaMr. Lianhuan Xiong Huawei Technologies Co. Ltd. ChinaPr. Lu Yu Zhejiang University China

Dr.Gwo Giun (Chris) Lee National Cheng Kung University China

Mr. Ying Chen Tampere University of Technology FinlandMr. Miska Hannuksela Nokia Research Center FinlandDr. Juha Vartiainen SFS FinlandDr. Ye-Kui Wang Nokia FinlandMr. Berthelot Bertrand France Telecom - Orange Labs FranceMr. Vincent Bottreau Thomson FranceMr. Marc Brelot RTL FranceMr. Cyril Concolato Telecom Paristech FranceDr. Patrick Gioia Orange Labs FranceMr. Marc Guez Vucher SCPP FranceMr. Owen Lagadec iKlax Media FranceMr. Jean Le Feuvre Telecom Paristech FranceMr. Patrick Lopez Thomson FranceMr. AVARO OLIVIER STREAMEZZO FranceMr. Stephane Pateux France Telecom FranceMr. Pierrick Philippe Orange Labs FranceMr. Marius Preda Telecom FranceMr. Laurent Primaux iKlax Media FranceMr. Micka‘l Raulet INSA de Rennes FranceMr. David Virette France Telecom FranceMr. Schreiner Stephan Fraunhofer Institut GermanyMr. Bacigalupo Virgilio Fraunhofer Institut GermanyMr. Gero Baese Siemens GermanyMr. Johannes Boehm Thomson GermanyPr. Karlheinz Brandenburg Fraunhofer IDMT GermanyMr. Heribert Brust Fraunhofer HHI GermanyMr. Gero BŠse Siemens GermanyMr. Stefan Doehlae Fraunhofer IIS GermanyDr. Ralf Geiger Fraunhofer IIS GermanyMr. Sebastian Gerke Fraunhofer HHI GermanyDr. Bernhard Grill Fraunhofer IIS GermanyMr. Karsten Grueneberg Fraunhofer HHI GermanyMr. Oliver Hellmuth Fraunhofer IIS GermanyDr. Florian Keiler Thomson Germany

17

Page 18: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Dr. Peter Kuhn European Patent Office GermanyDr. Terentiev Leonid Fraunhofer Institut GermanyMr. Tilman Liebchen LG Electronics GermanyMr. Marcus Mueller Heinrich-Hertz-Institut GermanyDr. Karsten Mueller Fraunhofer HHI GermanyMr. Markus Multrus Fraunhofer IIS GermanyMr. Frederik Nagel fraunhofer Institut GermanyMr. Max Neuendorf fraunhofer Institut GermanyPr. Jens-Rainer Ohm RWTH Aachen University GermanyMr. Heiko Purnhagen Dolby Germany GmbH GermanyMr. Fengyan Qi Huawei Technologies GermanyMr. Thomas Schierl Fraunhofer HHI GermanyMr. Andreas Schneider Dolby Germany GmbH GermanyMr. Markus Schnell Fraunhofer IIS GermanyDr. Gerald Schuller TU Ilmenau GermanyDr. Heiko Schwarz Fraunhofer HHI GermanyDr. Aljoscha Smolic Fraunhofer HHI GermanyMr. Ralph Sperschneider Fraunhofer IIS GermanyMr. HervŽ Taddei Huaewei Technologies GermanyMr. Herbert Thoma Fraunhofer IIS GermanyMr. Steffen Wittmann Panasonic GermanyDr. Oliver Wuebbolt Thomson Germany

Mr.Frederik Leonhard Zilly Fraunhofer HHI Germany

Dr. Yui-Lam Chan Hong Kong Polytechnic University Hong KongMs. Carmen Cheng Hong Kong ASTRI Hong KongMr. Roni Even Polycom IsraelMrs. sabina brufani sisvel ItalyMr. Leonardo Chiariglione Cedeo.net ItalyMr. Filippo Chiariglione Cedeo.net ItalyMr. Giovanni Cordara Telecom Italia ItalyMr. Diego Gibellino Telecom Italia ItalyMr. Kohtaro Asai Mitsubishi Electric Corporation JapanMr. Takeshi Chujoh Toshiba Corporation JapanMr. Shigeru Fukushima JVC JapanMr. Noboru Harada NTT JapanMr. Shinya Hasegawa Sharp Corporation JapanMr. Takashi Ito Fujitsu Laboratories Ltd. JapanMr. Satoshi Ito Toshiba Corporation JapanMr. Kota Iwamoto NEC Corporation JapanMr. Itaru Kaneko Tokyo Polytechnic JapanMr. Kimihiko Kazui Fujitsu Laboratories Ltd. JapanMr. Kei Kikuiri NTT DoCoMo JapanMr. Hideaki Kimata NTT JapanMr. Masatsugu Kitamura For more convenient AV life JapanMr. Takuyo Kogure Matsushita Electric Ind. Co , Ltd. JapanMr. Jun Matsumoto Sony JapanMr. Takehiro Moriya NTT JapanDr. Tokumichi Murakami Mitsubishi Electric Corporation JapanDr. Katsuyuki Nakamura Hitachi Ltd. JapanMr. Yasushige Nakayama NHK JapanMr. Takahiro Nishi Panasonic JapanMr. Toshiyuki Nomura NEC JapanMr. Takeshi Norimatsu Panasonic JapanMr. Ryoma Oami NEC Corporation JapanMs. Yukiko Ogura IPSJ Japan

18

Page 19: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Ms Yukiko Ogura IPSJ JapanMr. Shohei Saito Hitachi Ltd. JapanMr. Satoru Sakazume Victor Company of Japan, Ltd. JapanMr. Shun-ichi Sekiguchi Mitsubishi Electric Corporation JapanDr. Takanori Senoh NICT JapanMr. Osamu Shimada NEC Corporation JapanMr. Shinya Shimizu NTT JapanMr. Kazuo Sugimoto Mitsubishi Electric Corporation JapanDr. Ken Sugiyama NEC Corporation JapanMr. Teruhiko Suzuki Sony Corp. JapanDr. Kazuyoshi Suzuki Nagoya University JapanMr. Thiow Keng Tan NTT DoCoMo, Inc. JapanPr. Masayuki Tanimoto Nagoya University JapanMr. Akiyuki Tanizawa Toshiba JapanMr. Yasuaki Tokumo Sharp Corporation JapanDr. Akio Yamada NEC JapanMr. Tomoo Yamakage Toshiba Corporation JapanMr. Tomoyuki Yamamoto Sharp JapanMr. Takahiro Yamasaki Oki Electric Industry Co., Ltd. JapanDr. Yoshiyuki Yashima NTT Japan

Il Hong Shin ETRI KoreaMr. Dong Gyu Sim Kwangwoon University KoreaMr. Jeong-Hwan Ahn Samsung Electronics KoreaDr. Elena Alshina Samsung Electronics KoreaMr. Gun Bang ETRI KoreaDr. Jihun Cha ETRI KoreaDr. Maeng-Sub Cho ETRI KoreaMrs. Ayoung Cho Inha University KoreaPr. Yoonsik Choe Yonsei University KoreaDr. Haechul Choi ETRI KoreaDr. Miran Choi ETRI KoreaMr. Byeongho Choi KETI KoreaMr. Bumsuk Choi ETRI KoreaMr. Kevin S.C Ham Audizen KoreaMr. Yo-Sung Ho GIST KoreaMr. Jin Woo Hong ETRI KoreaMr. Seo§Young Hwang Samsung Electronics KoreaMrs. Inseon Jang ETRI KoreaPr. Euee Seon Jang Hanyang University KoreaMr. Yong-Joon Jeon KoreaMr. Byeong Moon Jeon LG Electronics KoreaMr. Byeungwoo Jeon Sungkyunkwan University KoreaMr. Dong-Seok Jeong Inha University Korea Mr. Ju Kyong Jin Inha University KoreaMr. Sanghyun Joo ETRI KoreaDr. Yang-Won Jung LG Electronics KoreaMr. Kyeong Ok Kang ETRI KoreaMr. Dong Soo Kim LG Electronics KoreaDr. Hui Yong Kim ETRI KoreaMr. Kyuheon Kim KoreaMr. Byungchul Kim KoreaMr. Hyungyu Kim Hanyang University KoreaMr. Daiyong Kim Hanyang University KoreaDr. Yongtae Kim Samsung KoreaMr. Yongje Kim KoreaPr. Sang-Kyun Kim Myongji Univeristy Korea

19

Page 20: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Mr. Seon Tae Kim ETRI KoreaJung Hoe Kim Samsung Electronics KoreaJongyoun Kim Netntv Korea

Mr. Jung Wan Ko Samsung Electronics Co.,Ltd KoreaMr. Seo Kyong-Sok Samsung KoreaMr. Sinwook Lee Hanyang Univ KoreaDr. Sangyoun Lee Yonsei University KoreaMr. Cheon Lee GIST KoreaMr. Seung Wook Lee ETRI KoreaMr. Taejin Lee ETRI KoreaMr. Kyohyuk Lee KoreaMrs. Tammy Lee Samsung Electronics Co. Ltd KoreaMr. Yung Lyul Lee Jejong University KoreaMr. Taebeom Lim KETI KoreaDr. Jung Eun Lim LG Electronics KoreaMr. Young«-Kwon Lim Net&Tv Inc. KoreaMr. Kim Myoung-Jin Soongsil University KoreaDr. Weon geun O ETRI KoreaProf. Seoung-Jun Oh Kwangwoon University KoreaDr. Weon geun Oh ETRI KoreaMr. Hee-Suk Pang Sejong University KoreaMr. Changseob Park Korean Broadcasting System KoreaMr. HeungSik Park KoreaProf. Hochong Park Kwangwoon University KoreaMr. JeongHoon Park Samsung Electronics KoreaPr. Young-cheol Park Yonsei University Korea

Kyungmo Park Samsung Electronics KoreaMr. Jeongil Seo ETRI KoreaMr. Hwaseon Shin KETI KoreaMr. KyoungSoo Son Hanyang University Korea

Jaeyeon Song Samsung Electronics KoreaMr. Jung Suk Suh Samsung Electronics Co., Ltd KoreaMr. Jaewon Sung LG Electronics KoreaMr. Hendry Tan Korea ICU KoreaDr. Gi-Mun Um ETRI KoreaMr. Kwanghyun Won Sungkyunkwan University KoreaMr. Won-Keun Yang ETRI KoreaMr. Alex Chungku Yie Humax Co., Ltd. KoreaDr. Jeong-Ju Yoo ETRI KoreaMr. Sungyong Yoon LG Electornics KoreaMr. Kugjin Yun ETRI KoreaMr. Kyoung-ill Kim ETRI KoreaDr. Henney Oh LG Electronics KoreaDr. Eunmi Oh Samsung Electronics KoreaDr. Hyungsik suh LG electronics koreaMr. Wiebe de Haan Philips NederlandsMr. Fons Bruls Philips Research Netherlands

Sampaio Lobo Lincoln Motorola NetherlandsMr. Werner Oomen Philips NetherlandsMr. Yann Bodo Joost Technologies NetherlandsMr. Sebastian Brangoulo Joost Technologies NetherlandsMr. Jean H.A. Gelissen Philips Research Netherlands

Marek Domanski Poznan Uni Tech PolandMr. Krzysztof Klimaszewski Poznan University of Technology, KTMiM PolandMr. Olgierd Stankiewicz Poznan University of Technology, KTMiM PolandMr. Ti Eu Chan Institute For Infocomm Research (A*STAR) Singapore

20

Page 21: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Dr. Tuan Kiang Chiew Institute for Infocomm Research A*STAR SingaporeDr. Haibin Huang Institute for Infocomm Research SingaporeDr. Haiyan Shu Institute for Infocomm Research SingaporeMr. Tai Hou Tng Institute for Infocomm Research SingaporeMrs. Wei Yao Institute for Infocomm Research SingaporeMrs. Huan Zhou Panasonic Singapore Laboratories Ltd. SingaporeDr. Yongwei Zhu Institute for Infocomm Research SingaporeMr. Pablo Carballeira López Universidad Politécnica de Madrid SpainPr. Jaime Delgado UPC SpainDr. Oscar Divorra Escoda Telefonica SpainMr. Marc Gauvin sDae SpainPr. Francisco Mor‡n Burgos Universidad PolitŽcnica de Madrid SpainMr. Victor Rodriguez-Doncel Universitat Polit?nica de Catalunya SpainDr. Per Froejdh Ericsson SwedenMr. Kristofer Kjšrling Dolby SwedenMrs. Ivana Radulovic Ericsson SwedenDr. Anisse Taleb Ericsson SwedenPr. Touradj Ebrahimi EPFL SwitzerlandMr. Christophe Lucarz EPFL SwitzerlandMr. James Annesley Kingston University UKDr. Miroslaw Bober Mitsubishi Electric ITE-VIL UKDr. Paul Brasnett Mitsubishi Electric ITE UKDr. Yue Feng Universtiy of Glasgow UKDr. Panos Kudumakis Queen Mary University of London UKDr. Stavros Paschalakis Mitsubishi Electric ITE B.V. VIL UKDr. Leszek Cieplinski Mitsubishi Electric ITE-VIL UKMr. Motta Giovanni QUALWMM USADr. Lazar Bivolarski Logitech, Inc. USAMr. Peisong Chen Qualcomm USADr. Yi-Jen Chiu Intel Corporation USAMr. Alberto Duenas W&W Communications, Inc. USAMr. Alex Eleftheriadis Vidyo Inc. USAMr. Louis Fielder Dolby Laboratories Inc. USAMr. Munsi Alaul Haque Intel Corporation USADr. Barry G. Haskell Apple USADr. Shih-Ta Hsiang Motorola, Inc. USADr. Walter Husak USA

Marta Karczewicz Quakomm USAMr. Vladimir Levantovsky Monotype USAMr. Jian Lu Vobile, Inc. USAMr. Ajay Luthra Motorola USADr. Sam Narasimhan Motorola USAMr. Purvin Pandit Thomson USAMr. Schuyler Quackenbush Audio Research Labs USADr. Yuriy A. Reznik Qualcomm Inc. USAMr. Suman Sharma Intel Corporation USAMrs. Xiaojin Shi Apple USAMr. David W Singer Apple USADr. Gary J Sullivan Microsoft USADr. Huifang Sun Mitsubishi Elec Res Labs USADr. Dong Tian Thomson Inc. USADr. Alexandros Tourapis Dolby Laboratories USADr. Anthony Vetro Mitsubishi Elec Res Labs USAMr. Xin Wang ContentGuard, Inc. USADr. Sehoon Yea Mitsubishi Elec Res Labs USADr. Haoping Yu Huawei Technologies USA

21

Page 22: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Annex B– Agenda

1    Opening 2    Roll call of participants 3    Approval of agenda 4    Allocation of contributions 5    Communications from Convenor 6    Report of previous meeting 7    Processing of NB Position Papers 8    Work plan management   1  Media coding    1MPEG-2 Main Profile Level for 1080@50/60p    2MPEG-4 Visual Simple Studio Profile Levels 5 and 6    3AAC-ELD    4Multiview Video Coding (MVC)    5Frame-based Animated Mesh Compression    6Multiresolution profile    7Low-complexity 3D mesh compression    8Open Font Format extensions    9Codec Configuration Representation   10Video Tool Library   11Spatial Audio Object Coding   123D Video Coding   13Unified Speech and Audio Coding   14Media Value Chain Ontology   15Representation of Sensory Experience   16New Video Coding   17New Audio Coding  2  Composition coding    1Joint use of Laser and BIFS    2Scene partitioning    3Presentation of Structured Information    4User Interface framework  3  Description coding    1Image Signature Tools    2Video Signature Tools    3MPEG-7 Query Format

22

Page 23: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

  4  Systems support  5  IPMP  6  Digital Item    1Schema files for MPEG-21 standards    2Security in Event Reporting  7  Transport and File formats    1Carriage of SVC in MPEG-2 Systems    2AVC File Format extensions for MVC    3Digital Item File Format  8  Multimedia architecture    13D Graphics Compression Model    2Interfaces between virtual and real worlds    3Extensible Multimedia Platform  9  Application formats    1Protected Musical Slide Show Application Format    2Professional Archival Application Format    3Open Access Application Format    4Portable Video Player Application Format    5Video Surveillance Application Format    6Video Stereoscopic Application Format    7Interactive music Application Format 10  Reference implementation    1MPEG-1 and -2 on MPEG-4 Reference Software    2BSAC Extensions Reference Software    3AAC-ELD Reference Software    4New Profiles for Professional Applications Reference Software    5SVC Reference Software    6File Format Reference Software    7Geometry and Shadow Reference Software    8Frame-based Animated Mesh Compression Reference Software    93D Graphics Compression Model Reference Software   10Open Font Format Reference Software   11MPEG-7 Systems Reference Software   12MPEG-21 REL Reference Software   13Photo Player MAF Reference Software   14Musical Slide Show MAF Reference Software   15Binary MPEG format for XML Reference Software   16MPEG Surround Reference Software

23

Page 24: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

   17M3W Reference Software 11  Conformance    1MPEG-2 Main Profile Level for 1080@50/60p Conformance    2MPEG-4 Visual Simple Profile Level 6 Conformance    3New Profiles for Professional Applications Conformance    4Scalable Video Coding Conformance    5AAC-ELD, OAFI and additional AAC Conformance    6MPEG-1 and -2 Audio in MPEG-4 Conformance    7BSAC Conformance    8File Format Conformance    9Geometry & Shadow Conformance   10Frame-based Animated Mesh Compression Conformance   11MultiResolution Profile Conformance   123D Graphics Compression Model Conformance   13Synthesized Texture Conformance   14Laser Conformance   15Open Font Format Conformance   16Improvements to Geographic Descriptor Conformance   17MPEG Surround Conformance   18M3W Conformance   19Video Tool Library Conformance 12  Maintenance    1Systems coding standards    2Video coding standards    3Audio coding standards    43DG coding standards    5Visual description coding standards    6Audio description coding standards    7MPEG-21 standards    8MPEG-A standards 13  Time line

9    Organisation of this meeting     1 Tasks for subgroups    2 Joint meetings

10    WG management   1 1 Terms of reference   2 2 Officers   3 3 Editors

24

Page 25: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

  4 4 Liaisons   5 5 Work item assignment   6 6 Ad hoc groups   7 7 Asset management      1Reference software      2Conformance      3Test material      4URI     8 IPR management     9 Work plan

11    Administrative matters   1  Responses to National Bodies   2  Schedule of future MPEG meetings   3  Promotional activities

12    Resolutions of this meeting 13    A.O.B 14    Closing

25

Page 26: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Annex C – Input contributions

No. Source Title

m15492 Webmaster Hannover document register

m15493Yi-Shin Tung, Teruhiko Suzuki Ad Hoc Group on Maintenance of MPEG-4

Visual related Documents, Reference Software and Conformance

m15494 Euee S. Jang, Marco Mattavelli, Yoshihisa Yamada

Ad Hoc Group on Reconfigurable Video Coding

m15495 Miroslaw Bober, Ryoma Oami Ad Hoc Group on MPEG-7 Visual

m15496 Hideaki Kimata, Karsten Müller Ad Hoc Group on 3D Video and Free Viewpoint Television Coding

m15497 Lazar Bivolarski, Jens-Rainer Ohm, Jörn Ostermann

Ad Hoc Group on New Challenges in Video Coding Standardization

m15498 Patrick Gioia, Francisco Morán Burgos

Ad Hoc Group on 3DGC documents and software maintenance

m15499 R. Sperschneider Ad Hoc Group on Audio Standards Maintenance

m15500 S. Quackenbush, Eunmi Oh Ad Hoc Group on Unified Speech and Audio Coding, SAOC and AAC-ELD

m15501 Sanghyun Joo, Jean Gelissen, Christian Timmerer Ad Hoc Group on the RoSE Framework

m15502 Xin Wang Ad Hoc Group on Call for Proposals for a Media Value Chain Ontology

m15503 Jean Gelissen, Marius Preda Ad Hoc Group on Information Exchange with Virtual Worlds

m15504Young-Kwon Lim , Jean-Claude Dufourd, Jaeyeon Song, Cyril Concolato

Ad Hoc Group on Scene Representation

m15505 David Singer Ad Hoc Group on MPEG File Formats

m15506 Kyuheon Kim, Hui Yong Kim, Jean Cha, Noboru Harada, Hendry Ad Hoc Group on Application Format

m15507 Kyoungro Yoon, Mario Doeller Ad Hoc Group on MPEG Query Format

m15508 Olivier Avaro Ad Hoc Group on MxM

m15509 ITTF via SC29 Secretariat Table of Replies on ISO/IEC 21000-18:2007/FDAM 1 [SC 29 N 9414]

26

Page 27: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

m15510 ITTF via SC29 Secretariat Table of Replies on ISO/IEC FDIS 23002-2 [SC 29 N 9415]

m15511 ITTF via SC29 Secretariat Table of Replies on ISO/IEC FDIS 29116-1 [SC 29 N 9416]

m15512 ITTF via SC29 Secretariat Table of Replies on ISO/IEC 23002-1:2006/FDAM 1 [SC 29 N 9457]

m15513 SC29 Secretariat Summary of Voting on ISO/IEC 14496-4:2004/FPDAM 30 [SC 29 N 9481]

m15514 ITU-T SG 9 via SC29 Secretariat Liaison Statement from ITU-T SG 9 [SC 29 N 9482]

m15515 ITU-T SG 9 via SC29 Secretariat Liaison Statement from ITU-T SG 9 [SC 29 N 9483]

m15516 ITU-T SG 16 via SC29 Secretariat Liaison Statement from ITU-T SG 16 [SC 29 N 9498]

m15517 ITTF via SC29 Secretariat Table of Replies on ISO/IEC 14496-3:2005/FDAM 9 [SC 29 N 9505]

m15518 SC29 Secretariat Summary of Voting on ISO/IEC 14496-16:2006/FPDAM 2 [SC 29 N 9506]

m15519 ITU-T SG 16 via SC29 Secretariat Liaison Statement from ITU-T SG 16 on IPTV [SC 29 N 9516]

m15520 Mark Callow Guide to the MPEG Subversion Repository

m15521 Itaru Kaneko Architecture for the MPEG User Interface Framework

m15522 Richard Clark Summary of Voting on ISO/IEC 13818-1:2007/FPDAM 3.2 [SC 29 N 9517]

m15523 Marius Preda Summary of Voting on ISO/IEC FCD 23000-10 [SC 29 N 9518]

m15524 Francisco Morán Table of Replies on ISO/IEC 14496-4:2004/FDAM 29 [SC 29 N 9527]

m15525 Cyril Concolato Table of Replies on ISO/IEC 14496-15:2004/FDAM 2 [SC 29 N 9528]

m15526 Jean Le Feuvre Table of Replies on ISO/IEC 21000-5:2004/FDAM 3 [SC 29 N 9529]

m15527 KyungMo Park Table of Replies on ISO/IEC 14496-5:2001/FDAM 16 [SC 29 N 9530]

m15528 SC29 Secretariat Table of Replies on ISO/IEC 14496-5:2001/FDAM 17 [SC 29 N 9531]

m15529 SC29 Secretariat Table of Replies on ISO/IEC FDIS 23000-5

27

Page 28: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

[SC 29 N 9532]

m15530 ITTF via SC29 Secretariat Table of Replies on ISO/IEC FDIS 23000-7 [SC 29 N 9533]

m15531 ITTF via SC 29 Secretariat Table of Replies on ISO/IEC FDIS 23000-9 [SC 29 N 9534]

m15532 ITTF via SC 29 Secretariat Summary of Voting on ISO/IEC 14496-10:200X/FPDAM 1

m15533 ITTF via SC 29 Secretariat Summary of Voting on ISO/IEC CD 14496-22 [2nd Edition]

m15534 ITTF via SC 29 Secretariat Summary of Voting on ISO/IEC 14496-4:2004/FPDAM 31

m15535 ITTF via SC 29 Secretariat Table of Replies on ISO/IEC 23003-1:2007/FDAM 1

m15536 ITTF via SC 29 Secretariat Table of Replies on ISO/IEC 23003-1:2007/FDAM 2

m15537 ITTF via SC 29 Secretariat Table of Replies on ISO/IEC 14496-4:2004/FDAM 20

m15538 SC29 Secretariat Liaison Statement from DVB TM-AVC [SC 29 N 9536]

m15539SC29 Secretariat IEC NP: Multimedia systems and equipment

-- Multimedia home network configuration -- Basic reference model

m15540 SC29 Secretariat Depth Map Estimation Software version 3

m15541 ITTF via SC 29 Secretariat Reference software of OctreeImage and PointTexture

m15542 ITTF via SC 29 Secretariat Liaison Statement from TTA [SC 29 N 9559]

m15543 ITTF via SC29 Secretariat IEC CDV 60728-1-2

m15544 DVB via SC 29 Secretariat Liaison Statement from ITU-T SG 16 on MXW [SC 29 N 9564]

m15545 IEC TC 100 via SC29 Secretariat Speech and Audio listening test lab report from Dolby

m15546 Olgierd Stankiewicz Analysis of speech and audio listening test data

m15547 Krzysztof Wegner Technical description of the Dolby Philips proposal for the speech and audio work- item

m15548 Jeong-Hwan Ahn Speech and Audio listening test lab report from Philips

28

Page 29: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

m15549 Shinjun Lee Proposed network of functional units for AVC High Profile

m15550 GyeongJa Jang New BIFS Profile for Interactive Digital Radio

m15551 TTA via SC 29 Secretariat FRNB comment on the Unified Speech and Audio Coding Exploration Activity

m15552IEC TC 100 via SC29 Secretariat Information on an interim technical standard

for next-generation satellite digital broadcasting in Japan

m15553 ITU-T SG 16 via SC 29 Secretariat Results of Exploration Experiment in 3D Video Coding for Dog Data Set

m15554 Kristofer Kjörling Liaison Statement from 3GPP

m15555 Werner Oomen IEC CD 61937-10

m15556 Kristofer Kjörling IEC CD 62571

m15557 Heiko Purnhagen Liaison Statement from DVB

m15558 Kristofer Kjörling JNB Comment on Text of ISO/IEC 14496-10:200X/DCOR1 (N9574)

m15559 Werner Oomen Proposed update of FCD ISO/IEC 23002-4: Video Tool Library (VTL)

m15560 Jonas Samuelsson Sensory Effects Metadata for RoSE Framework

m15561 Lars Villemoes User Sensory Preferences Metadata for RoSE Framework

m15562 Barbara Resch Sensory Device Capabilities Metadata for RoSE Framework

m15563 Erik Schuijers Sensory Device Commands Metadata for RoSE Framework

m15564 Pontus Carlsson Response to CfP on unified speech and audio coding

m15565 Werner Oomen Speech & Audio - Listening Test, Report & Results - Thomson

m15566 Jeroen Koppens Speech & Audio - Description of Technology of the Thomson proposal

m15567 Gwo Giun Lee Listening test results for Unified Speech and Audio Coding from Samsung

m15568 He-Yuan Lin Technical description of the ETRI proposal

29

Page 30: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

for the unified speech and audio coding

m15569Yuan-Long Cheng Proposal on standardization of a filter set for

converting between 4:2:2/4:2:0 chrominance sampling

m15570 Ming-Jiun Wang Unified speech and audio coding listening test report from ETRI

m15571 Benoît Pellan Liaison Statement from WG 1

m15572Young-Kwon Lim Listening Test Report for CE on separating

real-environment signals into multiple objects from ETRI

m15573 Cyril Concolato 3D Video Exploration Experiment on Baseline Length Dependency

m15574Pierrick Philippe on behalf of the FRNB

Listening test results of Test 1 for SAOC CE on separating real-environment signals into multiple objects

m15575Yasushige Nakayama Listening test results of Test 2 for SAOC CE

on separating real-environment signals into multiple objects

m15576 Aljoscha Smolic A Proposal on preset functionality for teleconferencing application

m15577 Yongzhe Wang Requirements for next-generation video coding standard

m15578 Karsten Mueller On motion block size and partitions for 4:4:4 coding

m15579 3GPP via SC 29 Secretariat Analysis for Ad Hoc Group on New Challenges in Video Coding Standardization

m15580IEC TC 100 via SC 29 Secretariat Technical considerations for Ad Hoc Group

on New Challenges in Video Coding Standardization

m15581 IEC TC 100 via SC 29 Secretariat Report on the USAC listening test at LGE

m15582 DVB via SC 29 Secretariat LGE submission to Unified Speech & Audio Coding

m15583teruhikos_at_jp.sony.com Results of Depth Estimation and View

Synthesis on Doorflowers Test Data Set for EEs in 3D Video Coding

m15584 Kazuo Sugimoto Experimental Results on Depth Estimation and View Synthesis with subpixel-precision

m15585 Kenji Otoi RoSE description schemes for realistic TV commercials

30

Page 31: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

m15586 Yoshihisa Yamada Listening test reports for CE on separating real-environment signals

m15587 Kohtaro Asai Proposed syntax revision for redundancy reduction in SAOC

m15588 B.S Choi Comments on SAOC operation modes and module

m15589 Sanghyun Joo Exploration Experiments on seq 'Arrive book' & 'Alt-moabit'

m15590 B.S Choi Creation of LDV streams out of MV sequences

m15591 Sanghyun Joo Proposals for MVC File Format

m15592 Sanghyun Joo Proposal for support of the additional downmix by MPEG SAOC

m15593B.S Choi Proposed Text for White Paper of ISO/IEC

23000-6 Professional Archival Application Format

m15594 Sanghyun Joo Enhancement of Temporal Consistency for Multi-view Depth Map Estimation

m15595 B.S Choi Results of Exploration Experiment on View Synthesis

m15596 Eunmi Oh Proposed Text for Working Draft of ISO/IEC 23000-6 PA-AF Reference Software

m15597 JungHoe Kim Boundary Filtering on Synthesized Images for 3D Video

m15598 Miyoung Kim Proposal of work plan for a 3D multichannel audio configuration

m15599 KiHyun Choo Experimental Results of EE on LeavingLaptop sequence for MPEG-FTV

m15600 Hosang Sung On MVC File Format

m15601 Johannes Boehm Additional Use Cases for UI Framework

m15602 Oliver Wuebbolt Analysis of MPEG-21 UED for Personalization

m15603 Florian Keiler First feedback/comments on the DCCI programmatic API for personalization

m15604 Florian Keiler Addition information on the MPEG UI Framework

m15605 Oliver Wuebbolt List of Adjacent Clip Pairs in the VCE-7 Dataset

31

Page 32: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

m15606 Johannes Boehm Removal of Partially Identical Clips from the VCE-7 Dataset

m15607 Eunmi Oh Camera Capturing of Video Clips for VCE-7 Robustness Test

m15608 Miyoung Kim Report on USAC subjective tests at VoiceAge test site

m15609 JungHoe Kim Technical Description of the VoiceAge Candidate for USAC

m15610 Taejin Lee Analysis of Combined Listening Test Results for USAC

m15611 Seungkwon Beack File format support for reception of SRTP and MIKEY messages

m15612 Minje Kim Response to the call for technologies for RoSE

m15613 Kyeongok Kang Analysis of unified speech and audio coding listening test results

m15614 Hochong Park CD proposal of LASeR in IPTV

m15615 Youngcheol Park Response to the extended call for requirements for MPEG-V

m15616 Akira Nakagawa A Proposal on Conformance Testing for Photo Player Application Format

m15617 Kimihiko Kazui Response to the extended call for requirements for MPEG-V

m15618 Shunsuke Kobayashi Speech and Audio listening test lab report (Ericsson)

m15619 Taejin Lee Report on USAC Subjective Tests at Fraunhofer IIS Test Site

m15620Seungkwon Beack Exploration Experiment results on

'Champagne_tower' and 'Pantomime' sequences

m15621 Minje Kim Technical Decsription of the Fraunhofer IIS Submission for the CfP on USAC

m15622 Kyeongok Kang Proposal for the Presentation of Structured Information

m15623WG 1 via SC 29 Secretariat Proposal on implementation Capturing on

camera and Interlaced/Progressive conversion for VCE-7 modification

m15624 Jeongil Seo Suggestion to solve the problem of unstable trimming point in modification for VCE-7

32

Page 33: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

m15625 Seungkwon Beack Analysis of USAC Listening Test Data

m15626 Kyeongok Kang A proposal for technical specification of Interactive Music AF

m15627 Taka Senoh Cross verification results of Image Signature

m15628 Kenji Yamamoto Proposed new profile for MPEG-4 SLS

m15629 Ryutaro Oi Proposed Corrigendum on MPEG-4 SLS

m15630 Tomoyuki Mishina New version of the BSDL2CALML parser in RVC

m15631 Makoto Okui Ontology for IP Media Networks (IPM-Net): A response to the MVCO CfP

m15632Osamu Shimada Listening test report for CE on separating

real-environment signals into multiple objects for the MPEG SAOC system

m15633Toshiyuki Nomura Additional information on the mono

downmix mode for the enhanced Karaoke / Solo processing for the MPEG SAOC system

m15634Akihiko Sugiyama Proposal for adoption of additional

downmix / upmix scenarios for the MPEG SAOC system

m15635 Osamu Hoshuyama Report on corrections for the MPEG SAOC CD text

m15636Osamu Shimada Proposal for the file-based storage of

rendering information for the MPEG SAOC system

m15637 Toshiyuki Nomura Proposed ELD Verification Test Report

m15638 Akihiko Sugiyama proposed correction to MPEG-2 SBR conformance

m15639 Osamu Hoshuyama proposed clarification for MPEG-2 and MPEG-4 AAC

m15640 Osamu Shimada proposed correction of the MPEG-4 HE AAC v2 profile

m15641Keisuke Masatoki further considerations regarding the 960

transform length in the AAC profile, the HE AAC profile and the HE AAC v2 profile

m15642 Toshiyuki Nomura Update on Conformance for AAC-ELD

m15643 Akihiko Sugiyama 84th MPEG Audio Report

m15644 Osamu Hoshuyama Draft Report on Unified Speech and Audio Coding Call for Proposals Submissions

33

Page 34: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

m15645Hiroshi Saruwatari AAC decoder complexity consideration

relative to the support of different frame lengths

m15646 Shun-ichi Sekiguchi Improved Matching for Global Image Signature

m15647 Yoshihisa Yamada Proposed Improvements for Local Image Signature

m15648 Yoshiaki Kato EE results on Leaving laptop

m15649 Kohtaro Asai Timed Representation of Metadata for the Presentation of Structured Information (PSI)

m15650 Tokumichi Murakami Possible Instantiations of the RoSE Framework

m15651 Shun-ichi Sekiguchi Response to Call for Proposals on RoSE: Sensory Effect Metadata

m15652 Shuichi Yamagishi Response to Call for Proposals on RoSE: Sensory Device Capabilities

m15653 Yusuke Itani TFAN software description

m15654 Kohtaro Asai Update of the API for 3D compression benchmark with MyMultimediaWorld.com

m15655 Tokumichi Murakami France Telecom listening test results for the CfP on Unified Speech and Audio Coding

m15656Kyohyuk Lee France Telecom contribution to the analysis

of the listening test results following the CfP on Unified Speech and Audio Coding

m15657 Elena Alshina Meeting Report of the 27th JVT Meeting (24-29 April 2008, Geneva, Switzerland)

m15658 Jeonghoon Park Image Signature for Complex Conditions

m15659 Woojin Han Error in the ?FRext? extension to the AVC file format

m15660 Junghye Min Proposal for MPEG-4 ALS profile/levels

m15661 Kyohyuk Lee EE results on lovebird2

m15662 Elena Alshina Liaison response on N9787

m15663 Jeonghoon Park CAVLC Validation Procedure and Implementation for AVS in RVC Framework

m15664 Woojin Han Updated text and reference SW for ISO/IEC 23000-9 (DMB-AF)

m15665 Junghye Min Conformance file contribution for ISO/IEC

34

Page 35: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

23000-9 (DMB-AF)

m15666 Dong Soo Kim Validation Report for VCE-5

m15667 Sungyong Yoon CE Report on the SC3DMC

m15668 Jaehyun Lim Editorial and Technical Comments on Working Draft of SC3DMC

m15669 Hyun-Kook Lee A Proposal on Quantization of 3D Mesh Coding

m15670 Henney Oh Video content contribution for VCE-7 from KBS

m15671 Yang-Won Jung Video content contribution for VCE-7 from Inha University

m15672 Dong Soo Kim View Synthesis Software and assessment of its performance

m15673 Sungyong Yoon Comments on the preliminary call for proposal for video signature

m15674 Jaehyun Lim EE results on lovebird1 sequence

m15675 Hyun-Kook Lee MPEG User Interface Framework and W3C WebApps Widgets

m15676 Shinya Shimizu Requirements for the MPEG UI Framework

m15677 Hideaki Kimata LASeR Requirements for Person to Person Communication Services Signaling

m15678Shinya Shimizu Report on the subjective testing of Unified

Speech and Audio Coding proposals at Huawei

m15679 Hideaki Kimata DDL of Intra MPEG AVC decoder

m15680 Maeng-Sub Cho DDL Graph Editor

m15681 Jin-Seo Kim Proposal for RoSE Metadata

m15682 Bon-Ki Koo Industry Trends around Video Coding

m15683 Yong Soo Joo Encoder implementation for FAMC

m15684 Sang-Kyun Kim Study text of ISO/IEC 23000-11 FCD Stereoscopic Video Application Format

m15685 Yang-Won Jung Workplan for ISO/IEC 23000-11 Conformance and Reference Software

m15686 Henney Oh Response to MXM Call for Proposals

m15687 Dong Soo Kim Collection of MXM Use Cases

35

Page 36: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

m15688 Sungyong Yoon Proposal of MXM Technologies

m15689 Yang-Won Jung Proposal of MXM API

m15690 Henney Oh Response to the extended call for requirements for MPEG-V

m15691 Yang-Won Jung Source code for WaveletSubdivisionSurfaces and Footprint-Based Coding

m15692 Henney Oh Migration of the Reference Software on the SVN

m15693fons.bruls_at_philips.com Proposal for adding support of platform-

independent language tag strings in the 'name' table.

m15694 lincoln.lobo_at_philips.com Proposal for adding support for left-to-right and right-to-left glyph alternatives.

m15695 fons.bruls_at_philips.com Preliminary comments on the 14496-12 PDAM

m15696 lincoln.lobo_at_philips.com Simple View Synthesis

m15697 rene.klein.gunnewiek_at_philips.com Experimental Results for Exploration Experiment in 3D Video

m15698 Miska M. Hannuksela Proposing an Architecture for MPEG Media Value Chain Ontology

m15699 Ye-Kui Wang Modified FUID definition for multiple tool libraries

m15700 Ying Chen Delete

m15701 Jeongil Seo The Single Non-hierarchical Image Signature for All Modifications

m15702 Seungkwon Beack Cross verification result for ETRI VCE-6 proposal

m15703 Kyeongok Kang Introduction of Token description in VTL

m15704 Kwangki Kim Consideration on an approach for developing a new video codec

m15705 hendry_at_icu.ac.kr Update of ISO/IEC 23000-10/Amd1 WD2.0 Conformance and Reference Software

m15706 mkim_at_icu.ac.kr Deleted

m15707 harada.noboru_at_lab.ntt.co.jp Comments on ISO/IEC 23002-4 FCD

m15708 moriya.takehiro_at_lab.ntt.co.jp Comments on ISO/IEC 23001-4 FCD

36

Page 37: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

m15709 sblee_at_gist.ac.kr (Sang-Beom Lee) Liaison Statement from SC 27/WG 2

m15710 hoyo_at_gist.ac.kr (Yo-Sung Ho) iKlax Requirements for MPEG Interactive Music

m15711 leecheon_at_gist.ac.kr (Cheon Lee) ATSC Liaison

m15712hoyo_at_gist.ac.kr (Yo-Sung Ho) Java software based on Mpeg7 schema

including amendment ISO/IEC 15938-5:2003/Amd.3:2008

m15713 hendry_at_icu.ac.kr USAC listenig test analysis from LGE

m15714 houari_at_icu.ac.kr Report of AHG on MXM

m15715 mkim_at_icu.ac.kr Comment on proposal for MPEG-4 ALS profiles/levels

m15716 harada.noboru_at_lab.ntt.co.jp Encoder implementation for WaveletSubdivisionSurface

m15717 moriya.takehiro_at_lab.ntt.co.jp MPEG RoSE Framework: A New Work Item?

m15718 leecheon_at_gist.ac.kr (Cheon Lee) Requirements for additional MXM technologies

m15719 hoyo_at_gist.ac.kr (Yo-Sung Ho) Encoder implementation for Footprint-based Coding

m15720 Yasushige Nakayama KNB comment on the Unified Speech and Audio Coding Exploration Activity

m15721 Gi-Mun Um Delete

37

Page 38: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Annex D– Output documents

No. Source Title

09960 Convener List of Documents from the 85th Meeting in Hannover, Germany

09961 Convener Resolutions of the 85th Meeting in Hannover, Germany

09962 Convener List of AHGs Established at 85th Meeting in Hannover, Germany

09963 Convener Report of the 85th Meeting in Hannover, Germany

09964 Convener Guidelines for Electronic Distribution of MPEG M and N Documents

09965 Convener Press Release of the 85th Meeting in Hannover, Germany

09966 Convener Meeting Notice of the 86th Meeting in Busan, Rep. of Korea

09967 Convener Guide for WG 11 Meeting Hosts

09968 Video Disposition of Comments on ISO/IEC 14496-4:2004/FPDAM 30

09969 Video Text of ISO/IEC 14496-4:2004/FDAM 30 Conformance Testing for New Profiles for Professional Applications

09970 Video Disposition of Comments on ISO/IEC 14496-4:2004/FPDAM 31

09971 Video Text of ISO/IEC 14496-4:2004/FDAM 31 Conformance Testing for Scalable Video Coding

09972 Video Working Draft 1 of ISO/IEC 14496-4:2004/Amd.38 Conformance Testing for Multiview Video Coding

09973 Video Request for ISO/IEC 14496-5:2001/Amd.15

09974 Video Text of ISO/IEC 14496-5:2001/PDAM 15 Reference Software for Multiview Video Coding

09975 Video Study Text of ISO/IEC 14496-5:2001/FPDAM 19 Reference Software for Scalable Video Coding

09976 Video Study Text of ISO/IEC 14496-10:200X/DCOR 1

09977 Video Disposition of Comments on ISO/IEC 14496-10:200X/FPDAM 1

09978 Video Text of ISO/IEC 14496-10:200X/FDAM 1 Multiview Video Coding

09979 Video Study Text of ISO/IEC 15938-3:2001/FPDAM 3 Image Signature Tools

09980 Video Use Cases of MPEG-7 Visual Signature Tools

09981 Video Call for Proposals on Video Signature Tools

09982 Video MPEG-7 Visual XM Document version 34.0

09983 Video Working Draft of ISO/IEC 23000-3/Amd.2 Conformance Testing for

38

Page 39: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Photo Player MAF

09984 Video Study Text of ISO/IEC FCD 23001-4 Codec Configuration Representation

09985 Video Study Text of ISO/IEC FCD 23002-4 Video Tool Library

09986 Video WD 2 of ISO/IEC 23002-4/Amd.1 (Conformance and Reference Software)

09987 Video WD 2 of ISO/IEC 23002-4/Amd.2 (Tools for MPEG-4 ASP, AVC HP and SVC)

09988 Video Description of Core Experiments in RVC

09989 Video RVC Work Plan and FU Development Status

09990 Video RVC Vision

09991 Video Description of Exploration Experiments in 3D Video Coding

09992 Video Results of 3D Experts Viewing

09993 Video New Challenges in Video Coding Standardization

09994 Video Workshop on New Challenges in Video Coding Standardization - Call for Contributions

09995 Video Workshop on New Challenges in Video Coding Standardization - Programme

09996 Convener Liaison Statement to SMPTE re RVC

09997 Convener Liaison Statement to ITU-T SG 9 re FTV

09998 Convener Liaison Statement from JVT to ATSC

09999 Convener Liaison Statement from JVT to various Organizations re MVC

10000 Convener Terms of reference

10001 Convener MPEG Standards

10002 Convener Table of unpublished FDISs

10003 Convener Work plan and time line

10004 Convener Editors of MPEG standards

10005 Convener Schema assets updates

10006 Convener Software assets

10007 Convener Conformance assets

10008 Convener Content assets

10009 Convener URI assets

10010 Convener Standards under development for which a call for patent statements is

39

Page 40: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

issued

10011 Convener List of organisation with which MPEG is in liaison

10012 Convener Project Editors for ISO/IEC Certificate of Appreciation

10013 3DGC Text of ISO/IEC 14496-4:2004 FPDAM 34 (3DG Compression Model Conformance)

10014 3DGC Request for ISO/IEC 14496-4:2004/AMD39 (Scene Partitioning Conformance)

10015 3DGC Text of ISO/IEC 14496-4:2004 FPDAM 22 (3DG Compression Model Reference Software)

10016 3DGC Request for ISO/IEC 14496-4:2004/AMD25 (Scene Partitioning Reference Software)

10017 3DGC Text of ISO/IEC 14496-16:2006 FDAM 2 (Frame based Animation Mesh Compression)

10018 3DGC Text of WD2.0 of ISO/IEC 14496-16:2006 AMD 4 (Scalable Complexity 3D Mesh Compression)

10019 3DGC DOC on ISO/IEC 14496-16:2006 AMD1:2007 DCOR2 (for 3DMC compression)

10020 3DGC Study Text of ISO/IEC 14496-16:2006 AMD1:2007 COR2

10021 3DGC WD4.0 of AFX 3rd Edition

10022 3DGC CE on Scalable Complexity 3D Mesh Coding

10023 3DGC DOCR on ISO/IEC 14496-5:2001/PDAM 22

10024 Convener AHG on Maintenance of MPEG-4 Visual related Documents, Reference Software and Conformance

10025 Convener AHG on Reconfigurable Video Coding

10026 Convener AHG on MPEG-7 Visual

10027 Convener AHG on 3D Video Coding

10028 Convener AHG on New Challenges in Video Coding Standardization

10029 3DGC Text of ISO/IEC 14496-4:2004 FPDAM 33 (MultiResolution Profile Conformance)

10030 Audio ISO/IEC 13818-4:2004/Amd 2:2005/DCOR 2, AAC Conformance

10031 Audio ISO/IEC 13818-7:2006/DCOR 1, AAC

10032 Audio Report on the Verification Test of MPEG-4 AAC-ELD

10033 Audio WD on New Profiles for ALS and SLS

10034 Audio ISO/IEC 14496-3:2005/DCOR 6, AAC

40

Page 41: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

10035 Audio ISO/IEC 14496-3:2005/AMD 2:2006/DCOR 4, HE-AAC V2 Profile

10036 Audio ISO/IEC 14496-3:2005/AMD 3:2006/DCOR 2, SLS

10037 Audio ISO/IEC 14496-3:2005/AMD 9:2008/DCOR 1, AAC-ELD

10038 Audio Considerations regarding the 960 frame length in the MPEG-4 AAC family of profiles

10039 Audio ISO/IEC 14496-4:2004/DCOR 6, AAC-LD

10040 Audio Study on ISO/IEC 14496-4:2004/PDAM 36, AAC-ELD, OAFI and additional AAC Conformance

10041 Audio Proposed ISO/IEC 14496-26, Audio Conformance

10042 Audio ISO/IEC 14496-4:2004/AMD13:200x/DCOR 2, AAC-LD bitstreams

10043 Audio DoC on ISO/IEC 14496-5:2001/FPDAM 20, MPEG-1 and -2 on MPEG-4

10044 Audio ISO/IEC 14496-5:2001/FDAM 20, MPEG-1 and -2 on MPEG-4

10045 Audio Study on ISO/IEC CD 23003-2:200x, Spatial Audio Object Coding

10046 Audio Status and Workplan on SAOC Core Experiments

10047 Audio Report on Unified Speech and Audio Coding Call for Proposals

10049 Convener Liaison response to ITU-T SG 16 (m15261, m15516)

10050 Convener Liaison response to IEC TC 100 (m15555)

10051 Convener Response to FNB (m15720)

10052 Convener Response to KNB (m15551)

10053 Convener AHG on Audio Standards Maintenance

10054 Convener AHG on SAOC, USAC, and Support for new Loudspeaker Configurations

10055 Convener Liaison statement to ATSC on Carriage of SVC

10056 Convener Liaison statement to DVB on Carriage of SVC

10057 Convener Liaison statement to SCTE on Carriage of SVC

10058 Systems DoC on ISO/IEC 13818-1:2007/FPDAM3.2 Carriage of SVC in MPEG-2 Systems

10059 Systems Text of ISO/IEC 13818-1:2007/FDAM3.2 Carriage of SVC in MPEG-2 Systems

10060 Convener AHG on 3DGC documents, software maintenance and core experiments

10061 Systems Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF

41

Page 42: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

10062 Systems ISO/IEC 14496-15 WD2.0 MVC File Format

10063 Systems ISO/IEC 14496-15/DCOR3

10064 Systems Request for ISO/IEC 14496-20 LASeR Amd.2

10065 Systems Text of ISO/EC 14496-20 LASeR/PDAM2

10066 Systems WD1.0 of ISO/EC 14496-20 LASeR/Amd.3 Presentation of Structured Information

10067 Systems DoC on ISO/IEC CD 14496-22/ 2nd Edition

10068 Systems Text of ISO/IEC FCD 14496-22 2nd Edition

10069 Systems MPEG-4 White Paper - Font Format

10070 Systems WD1.0 of ISO/IEC 21000-19 Media Value Chain Ontology

10071 Systems MPEG-A White Paper - Professional Archival Application Format

10072 Systems WD1.0 of ISO/IEC 23000-6/Amd1 PA-AF Conformance and Reference Software

10073 Systems Request for ISO/IEC 23000-9/Amd.1 DMB AF Conf. And Ref. Soft.

10074 Systems Text of ISO/IEC 23000-9/PDAM1 DMB AF Conf. And Ref. Soft.

10075 Systems Draft DoC on ISO/IEC FCD 23000-10 (Video Surveillance Application Format)

10076 Systems Study Text of ISO/IEC FCD 23000-10 (Video Surveillance Application Format)

10077 Systems Request for ISO/IEC 23000-10/Amd.1 VS AF Conf. And Ref. Soft.

10078 Systems Text of ISO/IEC 23000-10/PDAM1 VS AF Conf. And Ref. Soft.

10079 Systems Study Text of ISO/IEC FCD 23000-11 (Stereoscopic Video Application Format)

10080 Systems Workplan for Stereoscopic Video Application Format Ref. Soft and Conf.

10081 Systems WD1.0 of ROSE

10082 Systems Core Experiments on ROSE

10083 Systems WD1.0 of MXM Architecture and Technologies

10084 Systems WD1.0 of MXM APIs

10085 Systems Context and Objectives of Rich Media UI Framework

10086 Requirements Requirements on technologies to support Interactive Digital Radio

10087 Webmaster VOID - Document number not used

10088 Convener Liaison statement to SC34/WG2 on Open Font Format

42

Page 43: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

10089 Convener Liaison statement to WG 1 on PA-AF

10090 Convener Liaison statement to ITU-T SG16 on IPTV

10091 Convener Guide to MPEG Subversion Repository

10092 Convener Liaison statement to Open IPTV Forum on IPTV

10093 Convener Liaison statement to ATSC M/H on Rich Media

10094 Convener Liaison statement to DVB MIS

10095 Convener Liaison statement to TC 100 on Audio Book

10096 Convener Liaison statement to TC 100 on CDV 60728-1-2

10097 Convener Liaison statement to TC 100 on NP: Multimedia Systems and Equipment

10098 Convener Ad Hoc Group on Scene Representation

10099 Convener Ad Hoc Group on MPEG File Formats

10100 Convener Ad Hoc Group on Application Format

10101 Convener AHG on MVCO WD Editing

10102 Convener AHG on the RoSE Framework

10103 Convener AHG on Font Format

10104 Requirements Draft Requirements for MPEG-V Version 2

10105 Requirements MPEG-V Extended Call for Requirements

10106 Requirements Requirements for additional MxM Technologies

10107 Requirements CfP for additional MXM technologies

10108 Requirements MAF Overview

10109 Requirements Draft Call for Proposals on Interactive Music Application Format

10110 Convener AHG on Information Exchange with Virtual Worlds

10111 Requirements MAF Overview presentation document

43

Page 44: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Annex E – Requirements report

Source: Jörn Ostermann (Leibniz Universität Hannover)

1. Requirements documents approved at this meeting

No. TitleN10086

Requirements on technologies to support Interactive Digital Radio

N10104

Draft Requirements for MPEG-V Version 2

N10105

MPEG-V Extended Call for Requirements

N10106

Requirements for additional MXM technologies

N10107

CfP for additional MXM technologies

N10108

MAF Overview

N10109

Draft Call for Proposals on Interactive Music Application Format

N10111

MAF Overview presentation document

2. MPEG-V: Information exchange with virtual worlds

The following responses to the Extended Call for requirements have been received as contributions to the 85th Hannover MPEG meeting:

M15615: Response to the extended call for requirements for MPEG-VM15617: Response to the extended call for requirements for MPEG-V M15690: Response to the extended call for requirements for MPEG-V

These contributions focus mainly on MPEG-V Part 3 ‘Interfaces between virtual worlds and physical world’. They were integrated into the existing MPEG-V draft requirements document (N9902). The architecture for MPEG-V has been further developed (Figure 1). It appears that MPEG-V as well as Rose can fit into one architecture (Figure 2).

Several companies indicated that there are additional requirements that will be contributed at the next meeting. In order to accommodate these requirements, a new requirements document N10104 (Draft Requirements for MPEG-V Version 2) and another call for requirements N10105 (MPEG-V Extended Call for Requirements) were issued. The work plan agreed upon in the 83rd meeting has been delayed by one meeting again since the requirements for this work item are not yet in a sufficiently mature state. However, another delay in issuing the call for proposals beyond the 86th meeting might endanger the MPEG-V project.

44

Page 45: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Figure 1 Architecture for MPEG-V

Figure 2 Mapping of Rose onto the MPEG-V Architecture

ConsumerDevice(s)

SDC

SE

SDCUSP

RoSE Engine

RoSE

RoSE Engine

Sensory Effect

Metadata

Provider Consumer Device(s)

User Sensory Preferences

Sensory Device

Capabilities

Sensory Device

CommandsSensory Effect Delivery Format

User Preferences Delivery Format

Sensory Device Capability Delivery Format

1

1

2

2

3

4

3

4 Sensory Device Command Delivery Format

Scope of standarization(Normative)

Not in the scope of standarization(Informative)

Digital Content Provider (Virtual World,(serious) game,simulator, DVD, …)

45

Page 46: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

3. RoseThe following documents have been submitted as a response to the call for proposals (N9899) on three work items (Sensory Effects Metadata, User Sensory Preferences Metadata, and Sensory Device Capabilities/Commands Metadata):M15560: Sensory Effects Metadata for RoSE FrameworkM15561: User Sensory Preferences Metadata for RoSE FrameworkM15562: Sensory Device Capabilities Metadata for RoSE FrameworkM15563: Sensory Device Commands Metadata for RoSE FrameworkM15585: RoSE description schemes for realistic TV commercialsM15612: Response to the call for technologies for RoSEM15650: Possible Instantiations of the RoSE FrameworkM15651: Response to Call for Proposals on RoSE: Sensory Effect MetadataM15652: Response to Call for Proposals on RoSE: Sensory Device CapabilitiesM15681: Proposal for RoSE Metadata

The evaluation of the response to the call extracted sufficient technology to start the technical work for the standard. Therefore, Rose was transferred to the Systems Group. There was no agreement yet on which part of MPEG should be the home for Rose. At this point, the options are a separate standard for Rose, Rose as part of MPEG-4 and Rose as part of MPEG-V. The latter option appears to be the most appealing.

4. Media Value Chain OntologyAs a response to its Call for Proposals (N9907), MPEG received two contributions by three organisations:M15631 Ontology for IP Media Networks (IPM-Net)M15698 Proposing an Architecture of MPEG Media Value Chain OntologyThe evaluation of the response to the call extracted sufficient technology to start the technical work for the standard. Therefore, MVCO was transferred to the Systems Group. It will become part of MPEG 21.

5. Interactive Music Application Format MAFA MAF supporting playback of multiple audio streams was requested at the 84th meeting. Compared to the music player MAF, additional streams have to play synchronously. An additional requirement was presented at this meeting:M15710: iKlax Requirements for MPEG Interactive Music In order to provide some control over the playback to the music author, constraints on how different streams are played back have to be definable. It appears that no new coding technology has to be developed in order to enable the available requirements.

A Draft Call for Proposals on Interactive Music Application Format (N10109) was issued. We expect to finalize the call at the next meeting. Since the companies presenting requirements are already selling products in the market, a timely development of a MAF appears to be important.

6. MPEG eXtensible Middleware (MXM)In response to the CfP (N9894) for MXM and associated Requirements (N9893), one proposal supported by several companies has been received.M15686: Letter accompanying the response

46

Page 47: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

M15687: Collection of use casesM15688: Specification of the technologies that are required to match the use casesM15689: Proposed specificationSince the proposed solution provides sufficient technical substance for technical work, this work item was transferred to the Systems Group. In their proposal, the contributors identify additional requirements for MXM mainly in the area of protocols for content identification, authorisation and storage. Since these requirements were found to be valid, new requirements and a new call were issued: N10106 Requirements for additional MXM technologiesN10107 CfP for additional MXM technologies

7. LASeR for Person to Person CommunicationsLASeR is currently used on many mobile platforms. However, it lacks efficient capabilities to communicate with other terminals:M15677 LASeR Requirements for Person to Person Communication Services Signallingdefines these.

8. Lossless Audio CodingSeveral requests for standardizing a profile for lossless audio coding were made by different industries. M15628 Proposed new profile for MPEG-4 SLSM15660 Proposal for MPEG-4 ALS profile/levelsM15715 Comment on proposal for MPEG-4 ALS profiles/levelsThe first two profiles for coding lossless audio are now defined in WD on New Profiles for ALS and SLS (N10033):

SLS, which is supported by several companies, provides an enhancement to AAC and it will be called HD-AAC. The selected tools provide a good balance between coding efficiency and complexity.

ALS, which is requested by ARIB (Association of Radio Industries and Business) in Japan, is seen as a low complexity alternative to PCM coding of multi-channel audio signals. In order to stabilize this profile, firm support from the ARIB consortium will have to be provided.

9. Explorations

9.1. Requirements for Interactive RadioRequirements for Interactive Radio over DMB for French radio stations were brought to MPEG:M15550 New BIFS Profile for Interactive Digital Radio

While the proposers identify BIFS as a technology to satisfy the requirements, it is not clear whether that solution is preferable to a LASeR-based solution. A requirements documentN10086 Requirements on technologies to support Interactive Digital Radiohas been issued. This document is technology agnostic.

9.2. Exploration on MPEG User Interface FrameworkAs user interfaces (UI) become more divers and more plentiful, MPEG would like to explore currently available UI technology and investigate what is still missing in order to make MPEG technology more usable in modern user interfaces considering requirements like description language for UI specification, personalization, UI exchange between devices and

47

Page 48: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

UI aggregation. This exploration was described in N9895 at the last meeting. Additional requirements were received at this meeting:M15601 Additional Use Cases for UI FrameworkM15676 Requirements for the MPEG UI FrameworkThe exploration document was not updated. Work on the architecture for MPEG UI progressed.

9.3. Presentation of Structured Information (PSI)MPEG wants to develop a standard that enables declarative representation of structured information such as MPEG-21 Digital Item to maintain interoperable presentation of it across different implementations by conforming such standard, so-called PSI (Presentation of Structured Information). MPEG received two responses to the call for proposals (N9905) and associated requirements (N9904):M15622 Proposal for the Presentation of Structured InformationM15649 Timed Representation of Metadata for the Presentation of Structured Information

(PSI)Since most of the requirements are at least partially fulfilled, the technical work started in the Systems Group.

9.4. Future Work ItemsAudio, 3DGC, and Video subgroups continued to brainstorm on directions subject to further MPEG standards. Gaming was identified as an application area that could make use of MPEG technology beyond the currently used image, video and texture coding technology. Especially the use of geometry coding appears to be beneficial to the industry.

A meeting on the future of video coding with several company contributions outlining the roadmap for display, storage device and the requirements on network bandwidth was attended by more than 200 participants. M15577 Requirements for next-generation video coding standardM15704 Consideration on an approach for developing a new video codecM15579 Analysis for Ad Hoc Group on New Challenges in Video Coding Standardization M15580 Technical considerations for Ad Hoc Group on New Challenges in Video Coding

StandardizationThere appears to be a trend towards consumer displays with a resolution of 4k or 8k pixels/line at 60 Hz progressive scan. Storage capacity will be available. The request for transmitting video to the consumer at a rate below 50Mb/s was made. 2012 is envisioned for the start of using 4k displays in consumer applications. One demonstration showed HD video coding rate reductions of 25%. However, it is unclear whether current MPEG test sequences were shot with cameras of appropriate optical and electrical resolution.Internal documents capturing the state of discussion were improved and appropriate adhoc groups established. At the 85th meeting, a one-day workshop on New Challenges in Video Coding Standardization will be held.

48

Page 49: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Annex F – Systems report

Source: Systems Chair and Break-out group ChairsTitle: Systems Meeting ReportStatus: DraftEditor: Olivier Avaro (Streamezzo)

Contributors: David Singer (Apple), Young-Kwon Lim (Net&TV), Gero Baese (Siemens), Jaieme Delgado (UPC), Christian Timmerer (University of Klagenfurt)

1 OverviewThe main outputs of the meeting from the Systems Sub-group perspective are:No. Title TBP AvailableX 13818-1 MPEG-2 Systems10058 DoC on ISO/IEC 13818-1:2007/FPDAM3.2 Carriage of SVC in MPEG-2

SystemsNo 08/07/25

10059 Text of ISO/IEC 13818-1:2007/FDAM3.2 Carriage of SVC in MPEG-2 Systems No 08/08/15X 14496-12 ISO File Format10061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF No 08/07/25X 14496-15 AVC File Format10062 ISO/IEC 14496-15 WD2.0 MVC File Format No 08/07/2510063 ISO/IEC 14496-15/DCOR3 No 08/07/25X 14496-20 LASeR10064 Request for ISO/IEC 14496-20 LASeR Amd.2 No 08/07/2510065 Text of ISO/EC 14496-20 LASeR/PDAM2 No 08/07/2510066 WD1.0 of ISO/EC 14496-20 LASeR/Amd.3 No 08/07/25X 14496-22 Open Font Format10067 DoC on ISO/IEC CD 14496-22 2nd Edition No 08/07/2510068 Text of ISO/IEC FCD 14496-22 2nd Edition Yes 08/09/1210069 MPEG-4 White Paper – Font Format Yes 08/07/25X 21000-19 Media Value Chain Ontology10070 WD1.0 of ISO/IEC 21000-19 Media Value Chain Ontology No 08/07/25X 23000-6 Professional Archival Application Format10071 MPEG-A White Paper – Professional Archival Application

FormatYes 08/08/15

10072 WD1.0 of ISO/IEC 23000-6 PA-AF/Amd.1 Conformance and Reference Software

No 08/07/25

X 23000-9 Digital Multimedia Broadcasting Application Format10073 Request for ISO/IEC 23000-9/Amd.1 DMB AF Conf. And Ref. Soft. No 08/07/2510074 Text of ISO/IEC 23000-9/PDAM1 DMB AF Conf. And Ref. Soft. No 08/07/25X 23000-10 Video Surveillance Application Format10075 Draft DoC on ISO/IEC FCD 23000-10 (Video Surveillance Application Format) No 08/07/2510076 Study Text of ISO/IEC FCD 23000-10 (Video Surveillance Application Format) No 08/08/1510077 Request for ISO/IEC 23000-10/Amd.1 VS AF Conf. And Ref. Soft. No 08/07/2510078 Text of ISO/IEC 23000-10/PDAM1 VS AF Conf. And Ref. Soft. No 08/08/15X 23000-11 Stereoscopic Video Application Format10079 Study Text of ISO/IEC FCD 23000-11 (Stereoscopic Video Application Format) No 08/07/2510080 Workplan for Stereoscopic Video Application Format Ref. Soft and Conf. No 08/07/25X Project Started – Representation of Sensory Experience10081 WD1.0 of ROSE No 08/07/2510082 Core Experiments on ROSE No 08/07/25X Project Started – MPEG eXtensible Middleware10083 WD1.0 of MXM Architecture and Technologies No 08/07/25

49

Page 50: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

10084 WD1.0 of MXM APIs No 08/07/25X Exploration10085 Context and Objectives of Rich Media UI Framework No 08/07/25X Assets and Standing Documents10005 MPEG Schema Assets Updates No 08/07/2510009 MPEG URIs No 08/07/25X Liaison10088 Liaison statement to SC34/WG2 on Open Font Format No 08/07/2510089 Liaison statement to WG 1 on PA-AF No 08/07/2510090 Liaison statement to ITU-T SG16 on IPTV No 08/07/2510091 Liaison statement to ATIS IPTV FG on IPTV No 08/07/2510092 Liaison statement to Open IPTV Forum on IPTV No 08/07/2510093 Liaison statement to ATSC M/H on Rich Media No 08/07/2510094 Liaison statement to DVB MIS No 08/07/2510095 Liaison statement to TC 100 on Audio Book No 08/07/2510096 Liaison statement to TC 100 on CDV 60728-1-2 No 08/07/2510097 Liaison statement to TC 100 on NP: Multimedia Systems and

EquipmentNo 08/07/25

10055 Liaison statement to ATSC on Carriage of SVC No 08/07/2510056 Liaison statement to DVB on Carriage of SVC No 08/07/2510057 Liaison statement to SCTE on Carriage of SVC No 08/07/25

50

Page 51: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

2 General issues

2.1 GeneralThe meeting report from Antalya has been approved.The following demonstrations have been made:

None.

2.2 List of standards under developmentPr Pt Edit. Project Description CfP WD CD FCD FDIS

2 1 2006 Cor.2 Transport of AVC 08/04 08/104 1 200x Amd.4 Registration

Authority07/10 08/10 09/01

4 4 2008 Amd.37 File Format Conf. 08/04 08/10 09/02 09/074 5 2007 Amd.14 Open Font Format

Ref. Soft07/10 07/10 08/04 09/01

4 5 2007 Amd.xx AVC File Format Ref. Soft

TBS

4 5 2007 Amd.xx SVC File Format Ref. Soft

TBS

4 5 2007 Amd.23 Synth. Texture Ref. Soft

08/04 08/10 09/04

4 11 2005 Amd.6 Scene Partitioning 08/01 08/04 08/104 12 200x Amd.1 Misc. Addition to FF 08/04 08/10 09/044 12 200x Cor.1 Cor. On Flute 08/04 08/104 15 200x Amd.3 MVC File Format 08/04 08/10 09/02 09/074 20 2008 Amd.2 Scene Adaptation 08/07 08/10 09/074 20 2008 Amd.3 PSI 08/07 08/10 09/02 09/074 22 2008 2nd Ed. Open Font Format 08/01 08/07 09/0121 7 200x Cor.1 DIA Cor. 08/04 08/1021 8 200x Amd.1 Minor Enhance. Ref.

Soft07/10 08/04 08/10

21 19 200x 1st Ed. Media Value Chain Onto.

08/07 08/10 09/02 09/07

A 4 200x Amd.1 MSS AF Conf. and Soft

07/07 07/10 08/04 08/10

A 4 200x Amd.2 Prot. MSSAF Conf. & Soft

08/04 08/10 09/04

A 5 200x 2nd Ed. MS AF 08/01 08/10 09/02 09/07A 6 200x 1st Ed. Professional

Archival AF07/10 08/01 08/04 08/10

A 7 200x Amd.1 OA AF Ref. Soft and Conf.

08/01 08/04 08/10

A 8 200x Amd.1 PVP AF Soft. And Conf.

A 9 200x Amd.1 DMB AF Soft. And Conf.

08/07 08/10 09/04

A 10 200x 1st Ed. Video Surveillance 07/04 07/07 08/01 08/10

51

Page 52: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

AFA 10 200x Amd.1 Video Surveillance

AFA 11 200x 1st Ed. Stereoscopic Video

AF07/04 08/01 08/04 08/10

A 11 200x 1st Ed. SVAF Ref. Soft. And Conf.

08/07 08/10 09/04 09/07

B 2 200x Amd.1 FRU Ref. Soft. And Conf.

E 8 200x 1st Ed. Ref. Soft. and Conformance

07/01 07/07 08/04 08/10

X - - - Rep. of Sens. Exp. 08/07 08/10 08/02 09/07X - - - MxM 08/07 08/10 08/02 09/07

52

Page 53: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

2.3 Standing Documents

Pr Pt Documents No. Meeting1 1 MPEG-1 White Paper – Multiplex Format N7675 05/07 Nice1 1 MPEG-1 White Paper – Terminal Architecture N7676 05/07 Nice1 1 MPEG-1 White Paper – Multiplexing and

SynchronizationN7677 05/07 Nice

2 1 MPEG-2 White Paper – Multiplex Format N7678 05/07 Nice2 1 MPEG-2 White Paper – Terminal Architecture N7679 05/07 Nice2 1 MPEG-2 White Paper – Multiplexing and

SynchronizationN7680 05/07 Nice

2 11 MPEG-2 White Paper – MPEG-2 IPMP N7503 05/07 Poznan4 1 MPEG-4 White Paper – MPEG-4 Systems N7504 05/07 Poznan4 1 MPEG-4 White Paper – Terminal Architecture N7610 05/10 Nice4 1 MPEG-4 White Paper – M4MuX N7921 06/01 Bangkok4 1 MPEG-4 White Paper – OCI N8148 06/04 Montreux4 6 MPEG-4 White Paper – DMIF N8149 06/04 Montreux4 11 MPEG-4 White Paper – BIFS N7608 05/10 Nice4 12 MPEG-4 White Paper – ISO File Format N8150 06/04 Montreux4 14 MPEG-4 White Paper – MP4 File Format N7923 06/01 Bangkok4 15 MPEG-4 White Paper – AVC FF N7924 06/01 Bangkok4 13 White Paper on MPEG-4 IPMP N7505 05/07 Poznan4 13 MPEG IPMP Extensions Overview N6338 04/03 München4 17 White Paper on Streaming Text N7515 05/07 Poznan4 18 White Paper on Font Compression and Streaming N7508 05/07 Poznan4 20 Presentation Material on LASER N6969 05/01 Hong-

Kong4 20 White Paper on LASeR N7507 05/07 Poznan4 22 White Paper on Open Font Format N7519 05/07 Poznan7 1 MPEG-7 White Paper - MPEG-7 Systems N7509 05/07 Poznan7 1 MPEG-7 White Paper – Terminal Architecture N8151 06/04 Montreux21 9 MPEG-21 White Paper – MPEG-21 File Format N7925 06/01 BangkokA X MPEG Application Format Overview N9421 07/10 ShenzhenA X MAF Overview Document N9840 08/04

ArchampsA X MAF Overview Presentation N9841 08/04

ArchampsB X MPEG-B White Paper – BinXML N7922 06/01 BangkokE X MPEG Multimedia Middleware Context and

ObjectivesN6335 04/03 München

E X 1rst M3W White paper N7510 05/07 PoznanE X 2nd M3W White Paper : Architecture N8152 06/04 MontreuxE X Tutorial on M3W N8153 06/04 MonreuxE X M3W White Paper : Multimedia Middleware

ArchitectureN8687 06/10 Hanzhou

E X M3W White Paper : Multimedia API N8688 06/10 HanzhouE X M3W White Paper : Component Model N8689 06/10 HanzhouE X M3W White Paper : Resource and Quality

ManagementN8690 06/10 Hanzhou

53

Page 54: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

E X M3W White Paper : Component Download N8691 06/10 HanzhouE X M3W White Paper : Fault Management N8692 06/10 HanzhouE X M3W White Paper : System Integrity

ManagementN8693 06/10 Hanzhou

54

Page 55: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

2.4 Mailing Lists Reminder

Topic Information Kindly Managed by

General Systems

List

Liste Reflector : [email protected]:

http://lists.uni-klu.ac.at/mailman/listinfo/gen-sysmailto:[email protected]?subject=subscribe

List-Archive: http://lists.uni-klu.ac.at/pipermail/gen-sysList-Help: mailto:[email protected]?subject=help

University of Klagenfurt

BiM

Liste Reflector : [email protected]:

http://lists.uni-klu.ac.at/mailman/listinfo/mpeg7-sysmailto:[email protected]?subject=subscribe

List-Archive: http://lists.uni-klu.ac.at/pipermail/mpeg7-sysList-Help: mailto:[email protected]?subject=help

University of Klagenfurt

File Format

Liste Reflector : [email protected]:

http://lists.uni-klu.ac.at/mailman/listinfo/mp4-sysmailto:[email protected]?subject=subscribe

List-Archive: http://lists.uni-klu.ac.at/pipermail/mp4-sysList-Help: mailto:[email protected]?subject=help

University of Klagenfurt

LASeR

Liste Reflector : [email protected]:

http://lists.uni-klu.ac.at/mailman/listinfo/mpeg-lasermailto:[email protected]?subject=subscribe

List-Archive: http://lists.uni-klu.ac.at/pipermail/mpeg-laserList-Help: mailto:[email protected]?subject=help

University of Klagenfurt

MAF

Liste Reflector : [email protected]:http://lists.uni-klu.ac.at/mailman/listinfo/maf-sysmailto:[email protected]?subject=subscribeList-Archive: http://lists.uni-klu.ac.at/pipermail/maf-sysList-Help: mailto:[email protected]?subject=help

University of Klagenfurt

MPEG-2 on File Format

#1: Please subscribe via http://lists.uni-klu.ac.at/mailman/listinfo/isoff-transport. Please use only that email address for posting messages with which you're subscribed. Otherwise the email won't be delivered. #2: The email address for posting messages is: [email protected]

University of Klagenfurt

55

Page 56: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

 #3: The archive is accessible via http://lists.uni-klu.ac.at/mailman/private/isoff-transport/ for list members only.

2.5 FAQThe FAQ were updated as needed.

2.6 AOBNone.

3 MPEG-2 Systems (13818-1)

3.1 General

3.2 13818-1:2005 Amd.3 Carriage of SVC

3.2.1 Topics1. Transport of Scalable Video Coding

3.2.2 ContributionsM15522: Summary of Voting on ISO/IEC 13818-1:2007/FPDAM 3.2 [SC 29 N 9517].See DoC. Taken into account to produce FDAM text.

Work Completed

3.3 13818-1:2005 DCOR.2

3.3.1 Topics1. Corrigendum on the carriage of AVC

3.3.2 ContributionsNone.

Technical Work in Progress.

4 MPEG-4 Conformance (14496-4)

4.1 14496-4 Amd.37 File Format Conformance

4.1.1 Topics1. File Format Conformance

56

Page 57: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Since we said we were waiting for critical mass, and nothing arrived here, probably no action now. Editors to review whether (a) we have all the information for these files and (b) what features still need conformance action.

[After meeting: the spreadsheet looks pretty good. Only a few features are not covered. If no comments and no new files in Busan, maybe we should issue PDAM there.]

Technical Work in Progress.

5 MPEG-4 Reference Software (14496-5)

5.1 14496-5 Amd.14

5.1.1 Topics1. Open Font Format Reference Software

5.1.2 ContributionsNone.

Technical Work In Progress.

5.2 14496-5 Amd.23

5.2.1 Topics1. Synthesized Texture Reference Software

5.2.2 ContributionsNone.

Technical Work In Progress.

6 MPEG-4 Scene Presentation (14496-11)

6.1 14496-11/Amd.6

6.1.1 Topics1. Scene Partitionning

6.1.2 ContributionsNone.

Technical Work In Progress.

57

Page 58: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

7 MPEG-4 ISO Base File Format (14496-12)

7.1 14496-12/Cor1

7.1.1 Topics1. Corrigendum Items

7.1.2 ContributionsNone.

Technical Work In Progress.

7.2 14496-12:2008/Amd1

7.2.1 Topics1. New Edition & Amd.1

7.2.2 15611 File format support for reception of SRTP and MIKEY messagesExcellent, thank you. Editors to make a study text and a resolution asking NBs to consider the study. The ROC storage remains somewhat an open question (sample groups? sample entry? sample?). Comments, please. Also, we suggest boxing the key indicator length as this is syntactically compatible with OMA. However, the DVB liaison has a different design (5.4.2.3.2) and it’s hard to see how we can be compatible with both. Comments, please.

7.2.3 15695 Preliminary comments on the 14496-12 PDAMNoted, thank you. Some of these questions were explained at this meeting and this may help you revise the questions.

We are slightly concerned that we need to do a lot, once we have received comments, and that we might not have all the experts we need in Busan. Would an ad-hoc meeting help? If so, neither date nor location are obvious.

Technical Work in Progress.

8 MPEG-4 AVC Base File Format (14496-15)

8.1 14496-15:2004/Amd.3? MVC FF

8.1.1 Topics1. MVC File Format Extensions

8.1.2 15600 On MVC File FormatThank you. We agree a file format sample is one or more view components (of one access unit). The track references seem right; maybe we can allow applications to decide whether the AVC base must be in its own track or not?Some of this addresses the same questions as the next contribution and the minutes there reflect consideration of both.

58

Page 59: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

8.1.3 15591 Proposals for MVC File FormatThank you. Lots of detailed work here. One overall question for the two contributions is to what extent we can use the same code points (e.g. track reference types) as SVC even if the 4 characters aren’t quite ‘right’ for MVC. Another is whether some use of extractors would enable building complete ‘cookbook’ tracks for a given (set of) view(s). (Tracks which were then essentially ‘complete’ could then be so marked).Maybe in the case that there are both (a) multiple views per track and (b) multiple tracks, then the groups are mandatory (or else merging two tracks would require decoding NAL unit headers and perhaps slice headers, and SPSs). (Indeed, this is in the revised WD).

The use of alternate groups seems complex. Are the members of the groupdifferent views from the same set of multiple views?different multiple view groups?alternative ‘simple video’ versions of particular views of a multiple view set?

Looks like the proposal covers this, with the major difference from SVC that (a) are not a group. We can use sample groups, track references, and track selection info however. We probably need a track selection entry that says “I differ by multi-view or view coding” (e.g. the same set of views with a different base).

Some concern about movie-level structures as they are fragile with respect to track addition or deletion. There is also some overlap here with track selection, but of course this is handling sets of tracks etc. But it’s not at all clear that this can be re-written as a ‘distributed program’ across the multiple tracks. One possible idea is to have tracks with the groups, meta-data and track references but no samples? Is there a place for all the data? What boxes in the sample table would be empty or missing? This needs thought. Indeed, such a track could have extractors to say how to make it (optionally) and indeed we could have a bit next to complete_rep that says whether the track knows how to build all the data for this view set. But that adds a ‘mode’ to the reader. More thought needed. This might mesh well with using track selection information, on first look.

We’re currently not using extractors, but it may be that there are some uses; perhaps for indicating ‘cookbook’ all the data for a set of views (including dependency). Indeed, in that case, where more than one view is wanted, the order of extractors and NAL units would be define at least partially the needed output order (but not fully)?

The maximum disparity might vary over time. Should this be in a temporal structure (e.g. sample groups)? The trouble is that it is a piece of information about the relationship between two views, which in turn might be in multiple tracks. Clearly at file level this can be the overall max_disparity, of course.

Do we need to say more about where parameter sets are stored (in the track closest to the base, following dependencies, of the views that need this parameter set?).

Do we go to PDAM? If it doesn’t make a difference to the completion date, no. The FF group also leans towards no, but the editor to check the calendar and systems plenary to decide.

[After the meeting: There isn’t time for a ballot now before Busan, so any FPDAM would be in Lausanne either way, so we hold off and do PDAM at Busan.]

59

Page 60: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

8.1.4 15659 Error in the ‘FRext’ extension to the AVC file format Three choices:change the if to simply if (more_data), and use the sps_ext array always; question: what

about sps_ext in SVC enhancement layers?move the sps_ext to the sps array always; but what about existing implementations using the

indicated profiles?hybrid; leave it where it is for these profiles, and put them in the sps array otherwise.

We go with (c).On the reserved NAL units, it should be that both the AVC spec and the file format spec reserve the value. Then ignore the NA.

Technical Work in Progress.

9 MPEG-7

9.1 15938-5 Reference Software

M15712: Java software based on Mpeg7 schema including amendment ISO/IEC 15938-5:2003/Amd.3:2008. Document not uploaded.

10 LASeR (14496-20)

10.1 14496-20/Amd.xxx

10.1.1 Topics1. LASeR Extensions2. Presentation of Structured Information

10.1.2 ContributionsM15614: CD proposal of LASeR in IPTV. This contribution analysis the open issues listed in the current WD. Dispositions to proposed items are as follows:

attribute vs. element : Accepted to define new attributes. There was understanding that adding new attribute might violate the extension rule defined by W3C. However, this rule doesn’t seem to be generally obeyed. Therefore, it would be better to take technically preferable solution.

attributes to describe adaptation criteria : accepted to define only minimum values for screen size and available memory. It was concerned that it’s not easy to measure and apply CPU power and remaining battery in a meaningful way.  

display size event for dynamic adaptation : accepted to include. EnhancedSceneIndicator in the LASeR header : accepted priority: Accepted in principal. Detailed syntax and semantics should clearly mention

the selection criteria for the case of combination of this priority and other attributes. alternative : rejected because this is duplication of features already supported by

switch

60

Page 61: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

resource (change) events : accepted to include DisplaySizeChange and MemoryStatus events

M15622: Proposal for the Presentation of Structured Information. Was reviewed in detail in the AHG (see AHG report). Taken as a basis to produce related specification.

M15649: Timed Representation of Metadata for the Presentation of Structured Information (PSI). Was reviewed in detail in the AHG (see AHG report). Already a standard of MPEG. Can be used in a more general framework where server side standardization is usefull.

Technical Work in Progress.

11 LASeR (14496-22) Open Font Format

11.1 14496-22/Amd.1

11.1.1 Topics

1. Open Font Format Extension

11.1.2 ContributionsM15533: Summary of Voting on ISO/IEC CD 14496-22 [2nd Edition]. See Doc. Taken into account to produce FCD text.

M15693: Proposal for adding support of platform-independent language tag strings in the 'name' table. M15694: Proposal for adding support for left-to-right and right-to-left glyph alternatives. Support for the comments in the DoC. Accepted.

Technical Work in Progress.

12 21000 MPEG-21

12.1 21000-7 DIA

12.1.1 Topics1. Corrigendum

12.1.2 ContributionsNone.

Technical Work In Progress.

12.2 21000-8 Reference Software

12.2.1 Topics1. MPEG-21 Reference Software

61

Page 62: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

12.2.2 ContributionsNone.

Technical Work In Progress.

12.3 21000-19 Media Value Chain Ontology

12.3.1 Topics1. Media Value Chain Ontology

12.3.2 ContributionsM15631, M15698 : Ontology for IP Media Networks (IPM-Net): A response to the MVCO CfP and Proposing an Architecture for MPEG Media Value Chain Ontology. Taken as a basis to produce working draft of MVCO.

Technical Work In Progress.

13 MPEG-A MAF (23000)

13.1 23000-5 Media Streaming MA

13.1.1 Topics1. Media Streaming MAF

13.1.2 ContributionsNone.

Technical Work in Progress.

13.2 23000-6 Professional Archival AF

13.2.1 Topics1. Professional AF

13.2.2 ContributionsM15593: Proposed Text for White Paper of ISO/IEC 23000-6 Professional Archival Application Format. Taken as the basis to output the white paper on PA-AF.

M15596: Proposed Text for Working Draft of ISO/IEC 23000-6 PA-AF Reference Software. Taken as basis to out the WD1.0 of ref. soft.

Technical Work in Progress.

13.3 23000-7 Open Access Application Format

13.3.1 Topics1. Open Access Application Format

13.3.2 ContributionsNone.

Technical Work in Progress.

62

Page 63: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

13.4 23000-9 DMB Application Format

13.4.1 Topics1. DMB Application Format

13.4.2 ContributionsM15665: Conformance file contribution for ISO/IEC 23000-9 (DMB-AF). Used as the basis to produce PDAM text.M15664: Updated text and reference SW for ISO/IEC 23000-9 (DMB-AF). Used as the basis to produce PDAM text.

Technical Work in Progress.

13.5 23000-10 Video Surveillance MAF

13.5.1 Topics1. Video Surveillance MAF 1st Edition

13.5.2 ContributionsM15523: Summary of Voting on ISO/IEC FCD 23000-10 [SC 29 N 9518]. See DoC.M15705: Update of ISO/IEC 23000-10/Amd1 WD2.0 Conformance and Reference Software.

Technical Work in Progress.

13.6 23000-11 Stereoscopic Video AF

13.6.1 Topics1. Stereoscopic Video AF

13.6.2 ContributionsM15684: Study text of ISO/IEC 23000-11 FCD Stereoscopic Video Application Format. Taken as basis to produce study text.M15685: Workplan for ISO/IEC 23000-11 Conformance and Reference Software. Taken as basis to produce workplan for Conf. and Ref. Soft.

Technical Work in Progress.

13.7 23000-12 Interactive Music AF

13.7.1 Topics1. Interactive Music AF

13.7.2 ContributionsM15710: iKlax Requirements for MPEG Interactive Music. Requirements accepted and integrated in the MAF Overview document.M15626: A proposal for technical specification of Interactive Music AF. Contribution reviewed and noted.

Cf. requirements report for next steps on this activity.

63

Page 64: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

14 MPEG-E Multimedia Middleware (23004)

14.1 Multimedia Middleware

14.1.1 Topics1. MPEG Multimedia Middleware

14.1.2 ContributionsNone.

Technical Work in Progress.

15 Project Started

15.1 Representation of Sensory Effects

15.1.1 Topics1. Representation of Sensory Effects

15.1.2 Contributions

Input Contribution Document No.Sensory Effects Metadata for RoSE Framework m 15560 Summary:

— Technologies used: XML Schema, MPEG-7 DS, SNVT from LonWorks— General Structure: Organization of SEM description into SEM, Desc, Seg, frag— Effect List for the mapping process in the RoSE engine, i.e., the mapping of sensory

effects metadata to sensory device commands according to the user preferences and sensory device capabilities Metadata

— Effect Variables for synchronized control of SEM— General information: MPEG-7 description metadata— Attributes: Priority, isMandatory, isAdaptable, DependentEffectID,

AlternativeEffectID— Various sensory effects (see m 15560 for details)— Simple Types (LuxType, FreqType, etc.) adopted from LonWorks— Converted several SNVT binary types to their XML equivalents— Which kind of technology is needed to implement this mapping process? (complexity,

efficiency, etc.)

Action: Go to WD and conduct CE monitored by AhG

Input Contribution Document No.User Sensory Preferences Metadata for RoSE Framework m 15561 Summary:

— Technologies used: XML Schema, MPEG-7 DS— General Structure: Person – Descriptions – Preferences— PreferenceDescription: priority, disableAll + allows for describing detailed effect

preferences

64

Page 65: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

— Various preferences (see m 15561 for details) aligned with m 15560 — Harmonize/integrate with MPEG-21 DIA UED User Characteristics and the various

preference types defined there?

Action: Go to WD and conduct CE monitored by AhG

Input Contribution Document No.Sensory Device Capabilities Metadata for RoSE Framework m 15562 Summary:

— Technologies used: XML Schema, LonWorks— Capabilities divided into common capabilities and device-specific capabilities— Various capabilities (see m 15562 for details)— Harmonize/integrate with MPEG-21 DIA UED Terminal Capabilities?

Action: Go to WD and conduct CE monitored by AhG

Input Contribution Document No.Sensory Device Commands Metadata for RoSE Framework m 15563 Summary:

— Technologies used: XML Schema, LonWorks— Also divided into common commands and device-specific commands— Various commands (see m 15563 for details) and aligned with m 15562

Action: Go to WD and conduct CE monitored by AhG

Input Contribution Document No.RoSE description schemes for realistic TV commercials m15 585 Summary:

— Technologies used: XML Schema, MPEG-7 DS/D, MPEG-21 DIA tools, LonWorks— CommercialSensoryEffect: various sensory effects (see m15 585 for details);

overlaps exits with other proposals— CommercialSensoryPreference: various user preferences (see m15 585 for details);

overlaps exits with other proposals— CommercialSensoryDeviceCapability: various device capabilities (see m15 585 for

details); overlaps exist with other proposals— CommercialSensoryDeviceCommand: the commands (see m15 585 for details);

overlaps exist with other proposals— Complete proposal (similar to m15560-63) and it may need to be harmonized with

other proposals, i.e., identify overlaps, etc.

Action: Go to WD and conduct CE monitored by AhG

Input Contribution Document No.Response to the call for technologies for RoSE m15 612 Summary:

— Technologies used: no detailed information available (see below)— Overview of amBX which stands for “ambient experience”— amBX script = sensory effect metadata— Experience description: high-level abstraction (sunny, cool, explosion) and low-level

abstraction (turn on fan at speed x)

65

Page 66: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

— Timing: start, end, duration— Conditional and priority mechanisms— If the set of devices changes, the engines takes into account, e.g., if device is removed

the engine will try to transfer experience parts to remaining devices— Some minor remarks w.r.t. the RoSE requirements— No xml schema’s provided at this meeting, will be provided after discussions of the

terms and conditions of the scope of the activity — Need to revise the requirements based on the remarks of this input contribution— Need to discuss the details of the technologies when provided

Action: Go to WD and conduct CE monitored by AhG

Input Contribution Document No.Possible Instantiations of the RoSE Framework m15 650 Summary:

— Informative contribution presenting possible instantiations of the RoSE framework: unicast, multicast/broadcast, download & play

Action: as it is informative not action required.

Input Contribution Document No.Response to Call for Proposals on RoSE: Sensory Effect Metadata m15 651 Summary:

— Technologies used: XML Schema, MPEG-21 DIA— General structure of sensory effect metadata which is referred to as SEM schema

tools— Usage of patterns for describing sensory effects— Fragmentation of sensory effect metadata (i.e., for piece-wise, streamed processing

thereof) and associating time information to these fragments— It may need to be harmonized with other proposals, i.e., identify overlaps, etc.

Action: Go to WD and conduct CE monitored by AhG

Input Contribution Document No.Response to Call for Proposals on RoSE: Sensory Device Capabilities

m15 652

Summary:— Technologies used: XML Schema, MPEG-21 DIA— General structure of sensory device capabilities: extension of the terminal

capabilities as defined in MPEG-21 Digital Item Adaptation (note: this approach might by also applicable for the user sensory preferences).

— Actual sensory device capabilities based on patterns— It may need to be harmonized with other proposals, i.e., identify overlaps, etc.

Action: Go to WD and conduct CE monitored by AhG

Input Contribution Document No.Proposal for RoSE Metadata m1 5681 Summary:

— Technologies used: XML Schema, MPEG-7 DS

66

Page 67: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

— Provider Preference (Sensory Effect Metadata): expresses preferences of the provider in terms of the permissible range for the control parameter of user sensory device

— Device Group Control (Sensory Effect Metadata): ChannelTypeID for the configuration of devices and ControlData for the parameters of sensory effects

— User Effect Adaptation (User Sensory Preferences): very detailed user preferences w.r.t. who, when, where, what, and how.

— Looks very complex but also innovative; it may need to be harmonized with other proposals, i.e., identify overlaps, etc.

Action: Go to WD and conduct CE monitored by AhG

Technical Work in Progress.

67

Page 68: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

15.2 MPEG eXtensible Middleware

15.2.1 Topics1. MPEG eXtensible Middleware Architecture and Technology2. MXM APIs

15.2.2 ContributionsOne very detailed and complete response was received in answer to the call for proposal. The response was organized in four different documents summarized below:

M15686: Letter accompanying the responseM15687: Collection of use casesM15688: Specification of the technologies that are required to match the use casesM15689: Proposed specification

The tables below summarizes how the responses match the requirements of the CfP:

Table 1 : Identification of requirements satisfied1. Offer access to selected individual MPEG and related technologies through standard APIs Y2. Offer APIs to

a. Access i. Media encoders/decoders

1. Audio Y2. Video Y3. 3DG Y4. Binary XML N

ii. Audio/Video/3DG players Yiii. Presentation Engine

1. BIFS N2. LASeR Y3. HTML N

iv. IPMP functionality1. IPMP components Y

b. Create/edit/parsei. Digital Items Y

ii. Metadata1. MPEG-7 Y2. MPEG-21 Y3. TVA N

iii. Licences Yiv. File Formats Y

c. Stream/consume contenti. Audio Y

ii. Video Yiii. Scene Yiv. Metadata Yv. Digital Items Y

d. Store/consume content in distributed environmentsi. Peer-to-peer (P2P) infrastructure based on e.g. Distributed Hash Tables (DHT) Y

ii. Search for content Ye. Communicate between devices

68

Page 69: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

i. Player with Licence server etc. Y3. Support interfacing with security elements in devices such as those provided by ISO/IEC

11889 Trusted Platform ModuleY

4. Be documented in at least one Programming Language (APIs) Ya. A Programming Language independent version of the APIs may be supported as well

by MXMY

5. Be modular to enable easy customisation for specific needs and to form various media value-chain easily

Y

6. Be designed to be re-usable in complete environments such as the M3W Framework Y

Table 2: Identification of requirements not satisfied and suggested ways by which MPEG could obtain the corresponding technology/ies

Requirements not satisfied

Reasons/How requirements could be satisfied

Binary XML The proponents could not devote resources to satisfy this requirement. As MPEG has identified several application for Binary XML, it could easily extend the current proposal to support this requirement using it internal expertise

BIFS and HTML Presentation

The proponents could not devote resources to satisfy these requirements. However, the BIFS and HTML specificities can be easily accommodated with the material contained in the submission

Visual description The proponents have provided support to Visual descriptors used in the Photo Player Application Format. More Visual descriptors can be supported using the same procedure highlighted in the proposal

Audio description The proponents could not devote resources to satisfy this requirement. As MPEG has identified several application for audio description, it could easily extend the current proposal to support this requirement using it internal expertise

TVA metadata The proponents could not devote resources to satisfy this requirement. However, there exists enough openly accessible significant material that can be used to develop the TVA Metadata part of the MXM API

In addition, new requirements were satisfied by the proposal (even they were not called for): - Content identification (APIs and protocol)- Content upload (APIs and protocol)- License upload (APIs and protocol)- Access to Video output /Audio output / Event APIs (APIs)

They are detailed in the table below and new CfP was issued (Cf. Req. report):

Table 3: Identification of additional requirements satisfiedNew requirement How it is handled

API to remote store a content item (Digital Item + resources)

Proposal of API and of specific technology provided. If accepted this could be added to ISO/IEC 29116

API to remote store a licence

Supported by the proposed solution.

API to identify a Digital Item or elements thereof

Supported by the proposed solution.

API to authenticate a Digital Item or

Supported by the proposed solution.

69

Page 70: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

elements thereofAPI to verify a role Document m15689 proposes a specific API to verify a role, but there is no

Media Value Chain Ontology (MVCO) technology proposal in m15688. We believe that the proposed API is MVCO-technology agnostic. If MPEG will develop its own MVCO technology there may be a need to make fine adjustments as the MVCO standardisation progresses

Resource rendering Proposal of API providedDigital Item Presentation

Document m15689 proposes a specific API to present digital items, but there is no Presentation of Structured Information (PSI) technology proposal in m15688. We believe that the proposed API is PSI-technology agnostic. If MPEG will develop its own PSI technology there may be a need to make fine adjustments as the PSI standardisation progresses

A process for developing the standard was discussed and approved by MPEG:- MXM use cases definition- MXM API requirements definition- MXM API specification- MXM technologies specification

The inputs contribution was taken as the basis to produce the 2 output WD:- Architecture and Technologies- APIs

Technical Work in Progress.

16 Requirements and Exploration

16.1 Requirements & ProfilesM15550: New BIFS Profile for Interactive Digital Radio. See Req.M15677: LASeR Requirements for Person to Person Communication Services Signaling.

16.2 Standing DocumentsNone.

16.3 New Proposals

16.3.1 Rich Media UI FrameworkM15521: Discussed in Paris. See AHG report.

M15601: Additional Use Cases for UI Framework. Use cases to be added as annex in the Rich Media UI Framework Context and Objectives.

M15604: Addition information on the MPEG UI Framework. Used as input for the Rich Media UI Framework Context and Objectives.

M15676: Requirements for the MPEG UI Framework. Used as input for the Rich Media UI Framework Context and Objectives.

70

Page 71: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

M15602: Analysis of MPEG-21 UED for Personalization. Proposal to use MPEG-21 UED for describing Users Preferences and Terminal capability. Noted.

M15603: First feedback/comments on the DCCI programmatic API for personalization. For information. Noted.

Decision on next steps on dynamic Rich Media Scene adaptation:- Definition of an ontology for Rich Media Scene Adaptation- Specification of APIs to access this ontology from the scene in IDL

languagei. With Ecmascript bindings.

ii. Other mappings ? - Walthrough for using this framework (TO BE INVESTIGATED):

i. With PSIii. With Script

iii. Others

M15675: MPEG User Interface Framework and W3C WebApps Widgets. For information. Analysis part integrated in the Contexte and Objective document.

16.3.2 Proposal for Standardization of Interfaces with Virtual Worlds M15615, M15617, M15690: See Req. report.

17 LiaisonCf. Liaison output.

71

Page 72: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

18 Latest References and Publication StatusReference on the ISO Web Site : http://www.itscj.ipsj.or.jp/sc29/open/29view/29n9270c.htm

Pr Pt Standard No. Issue Status Doc. With Purpose ISO Award

2 1 ISO/IEC 13818-1/Amd.7 Published 2000/12 Done2 1 ISO/IEC 13818-1:2000 (MPEG-2 Systems 2nd Edition) 00/12 Published 2000/12 Proposed2 1 ISO/IEC 13818-1:2000/COR1 (FlexMux Descr.) N3844 01/01 Pisa Published 2002/03 N/A2 1 ISO/IEC 13818-1:2000/COR2 (FlexMuxTiming_ descriptor) N4404 01/12 Pattaya Published 2002/12 N/A2 1 ISO/IEC 13818-1:2000/Amd.1 (Metadata on 2) & COR1 on Amd.1 N5867 03/07

TrondheimPublished 2003/12 Proposed

2 1 ISO/IEC 13818-1:2000/Amd.2 (Support for IPMP on 2) N5604 03/03 Pattaya Published 2004/03 N/A2 1 ISO/IEC 13818-1:2000/Amd.3 (AVC Carriage on MPEG-2) N5771 03/07

TrondheimPublished XXXX Proposed

2 1 ISO/IEC 13818-1:2000/Amd.4 (Metadata Application CP) N6847 04/10 Palma FDAM ITTF to be published N/A2 1 ISO/IEC 13818-1:2000/Amd.5 (New Audio P&L Sig.) N6585 04/07

RedmondFDAM ITTF to be published N/A

2 1 ISO/IEC 13818-1:2000/COR3 (Correction for Field Picture) N6845 04/10 Palma COR ITTF to be published N/A2 1 ISO/IEC 13818-1:2000/COR4 (M4MUX Code Point) N7469 05/07 Poznan COR ITTF to be published N/A2 1 ISO/IEC 13818-1:2000/COR5 (Corrections related to 3rd Ed.) N7895 06/01

BangkokCOR ITTF to be published N/A

2 1 ISO/IEC 13818-1:2006 (MPEG-2 Systems 3rd Edition) 06/xx Published ITTF TBP2 1 ISO/IEC 13818-1:2006/Amd.1 (Transport of Streaming text) N8369 06/07

KlagenfurtFDAM ITTF to be published TBP

2 1 ISO/IEC 13818-1:2006/Amd.2 (Carriage of Auxialiry Video Data) N8798 07/01 Marrakech

FDAM ITTF to be published TBP

2 1 ISO/IEC 13818-1:2006/Cor.1.2 (Reference to AVC Specification) N9365 07/10 Shenzhen

FDAM ITTF to be published TBP

2 1 ISO/IEC 13818-1:2006/Amd.3 (SVC in MPEG-2 Systems) N1005 08/07 FDAM ITTF to be published TBP

72

Page 73: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

8 Hannover2 11 ISO/IEC 13818-1:2003 (IPMP on 2) N5607 03/03 Pattaya Published 2003/12 Proposed4 1 ISO/IEC 14496-1 (MPEG-4 Systems 1st Ed.) N2501 98/10 Atl. City Published 1999/12 Done4 1 ISO/IEC 14496-1/Amd.1 (MP4, MPEG-J) N3054 99/12 Hawaii Published 2001/11 Done4 1 ISO/IEC 14496-1/Cor.1 N3278 00/03

Noordwijk.Published 2001/11 N/A

4 1 ISO/IEC 14496-1:2001 (MPEG-4 Systems 2nd Ed.) N3850 01/01 Pisa Published 2001/11 N/A4 1 ISO/IEC 14496-1:2001/Amd.1 (Flextime) Published 2002/10 Done4 1 ISO/IEC 14496-1:2001/Cor.1 N4264 01/07 Sydney COR ITTF N/A4 1 ISO/IEC 14496-1:2001/Cor.2 N5275 02/10 Shangai COR ITTF N/A4 1 ISO/IEC 14496-1:2001/Cor.3 N6587 04/07

RedmondCOR ITTF N/A

4 1 ISO/IEC 14496-1:2001/Amd.2 (Textual Format) N4698 02/03 Jeju Island

AMD ITTF N/A

4 1 ISO/IEC 14496-1:2001/Amd.3 (IPMP Extensions) N5282 02/10 Shanghai

Published 2004-05 N/A

4 1 ISO/IEC 14496-1:2001/Amd.4 (SL Extension) N5471 02/12 Awaji Published 2003/12 N/A4 1 ISO/IEC 14496-1:2001/Amd.7 (AVC on 4) N5976 03/10

BrisbannePublished 2004-08 N/A

4 1 ISO/IEC 14496-1:2001/Amd.8 (ObjectType Code Points) N6202 03/12 Hawaii AMD ITTF to be published N/A4 1 ISO/IEC 14496-1:200x/Amd.1 (Text Profile Descriptors) N7229 05/04 Busan PDAM ITTF Final Text

EditingN/A

4 1 ISO/IEC 14496-1:200x/Cor4 (Node Coding Table) N7473 05/07 Poznan PDAM ITTF to be published N/A4 1 ISO/IEC 14496-1 (MPEG-4 Systems 3rd Ed.) N5277 02/10

ShanghaiIS ITTF to be published Proposed

4 1 ISO/IEC 14496-1:200x/Amd.1 (Text Profile Descriptors) N7229 05/04 Busan PDAM ITTF Final Text Editing

N/A

4 1 ISO/IEC 14496-1:200x/Cor.1 (Clarif. On audio codec behavior) N8117 06/04 Montreux

COR ITTF Final Text Editing

N/A

4 1 ISO/IEC 14496-1:200x/Amd.2 (3D Profile Descriptor Extensions) N8372 06/07 Klagenfurt

PDAM ITTF to be published N/A

4 1 ISO/IEC 14496-1:200x/Cor.2 (OD Dependencies) N8646 06/10 COR ITTF to be published N/A

73

Page 74: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Hangzhou4 1 ISO/IEC 14496-1:200x/Amd.3 (JPEG 2000 support in Systems) N8860 07/01

MarrakechPDAM ITTF to be published N/A

4 4 ISO/IEC 14496-1:200x/Amd.17 (ATG Conformance) N8861 07/01 Marrakech

PDAM ITTF to be published N/A

4 4 ISO/IEC 14496-1:200x/Amd.22 (AudioBIFS v3 conformance) N9295 07/07 Lausanne

PDAM ITTF to be published N/A

4 4 ISO/IEC 14496-1:200x/Amd.23 (Synthesized Texture conformance) N9369 07/10 Shenzhen

PDAM ITTF to be published N/A

4 4 ISO/IEC 14496-1:200x/Amd.24 (File Format Conformance) N9370 07/10 Shenzhen

PDAM ITTF to be published N/A

4 4 ISO/IEC 14496-1:200x/Amd.25 (LASeR V1 Conformance) N9372 07/10 Shenzhen

PDAM ITTF to be published N/A

4 4 ISO/IEC 14496-1:200x/Amd.26 (Open Font Format Conf.) N9815 08/04 Archamps

PDAM ITTF to be published N/A

4 4 ISO/IEC 14496-1:200x/Amd.27 (LASeR Amd.1 Conformance) N9816 08/04 Archamps

PDAM ITTF to be published N/A

4 5 ISO/IEC 14496-1:200x/Amd.12 (File Format) N9020 07/04 San Jose PDAM ITTF to be published N/A4 5 ISO/IEC 14496-1:200x/Amd.16 (SMR Ref. Soft) N9672 08/01 Antalya PDAM ITTF to be published N/A4 5 ISO/IEC 14496-1:200x/Amd.17 (LASeR Ref. Soft) N9674 08/01 Antalya PDAM ITTF to be published N/A4 6 ISO/IEC 14496-6:2000 Published 2000/12 N/A4 8 ISO/IEC 14496-8 (MPEG-4 on IP Framework) N4712 02/03 Jeju Published 2004-05 Proposed4 11 ISO/IEC 14496-11 (MPEG-4 Scene Description 3rd

Edition) N6960 05/01

HongKongFDIS SC29 Final Text

EditingProposed

4 11 ISO/IEC 14496-11/Amd.1 (AFX) N5480 02/12 Awaji FDAM ITTF Integration in 1st

Ed.N/A

4 11 ISO/IEC 14496-11/Amd.2 (Advanced Text and Graphics) N6205 03/12 Hawaii FDAM ITTF Integration in 1st

Ed.N/A

4 11 ISO/IEC 14496-11/Cor.1 N6203 03/12 Hawaii COR SC29 N/A4 11 ISO/IEC 14496-11/Cor.3 Valuator/AFX related correction N6594 04/07

RedmondCOR ITTF Integration in 1st

Ed.N/A

4 11 ISO/IEC 14496-11/Amd.3 Audio BIFS Extensions N6591 04/07 FDAM ITTF Integration in 1st Proposed

74

Page 75: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Redmond Ed.4 11 ISO/IEC 14496-11/Amd.4 XMT and MPEG-J Extensions N6959 05/01

HongKongFDAM ITTF Integration in 1st

Ed.N/A

4 11 ISO/IEC 14496-11/Cor.3 (Audio BIFS Integrated in 3rd Edition) N7230 05/04 Busan COR ITTF Final Text Editing

N/A

4 11 ISO/IEC 14496-11/Cor.5 (Misc Corrigendum) N8383 06/07 Klagenfurt

COR SC29 N/A

4 11 ISO/IEC 14496-11/Amd.5 Symbolic Music Representation

N8657 06/10 Hangzhou

FDAM ITTF TBP

4 11 ISO/IEC 14496-11/Cor.6 (AudioFx Correction) N9021 07/04 San Jose COR SC29 N/A4 12 ISO/IEC 14496-12 (ISO Base Media File Format) N5295 02/10

ShanghaiPublished 2004-02 Proposed

4 12 ISO/IEC 14496-12/Amd.1 ISO FF Extension N6596 04/07 Redmond

FDAM ITTF FDAM 04/11/30 N/A

4 12 ISO/IEC 14496-12/Cor.1 (Correction on File Type Box)

N7232 05/04 Busan COR ITTF Final Text Editing

N/A

4 12 ISO/IEC 14496-12/Cor.2 (Miscellanea) N7901 06/01 Bangkok

COR ITTF Final Text Editing

N/A

4 12 ISO/IEC 14496-12/Amd.1 (Description of timed metadata)

N8659 06/10 Hangzhou

FDAM ITTF N/A

4 12 ISO/IEC 14496-12/Cor.3 (Miscellanea) N9024 07/04 San Jose COR ITTF Final Text Editing

N/A

4 12 ISO/IEC 14496-12/Amd.2 (Flute Hint Track) N9023 07/04 San Jose FDAM ITTF N/A4 13 ISO/IEC 14496-13 (IPMP-X) N5284 02/10

ShanghaiIS ITTF to be published Proposed

4 14 ISO/IEC 14496-14 (MP4 File Format) N5298 02/10 Shanghai

Published 2003-11 Proposed

4 14 ISO/IEC 14496-14/Cor.1 (Audio P&L Indication) N7903 06/01 Bangkok

COR ITTF Final Text Editing

N/A

4 15 ISO/IEC 14496-15 (AVC File Format) N5780 03/07 Trondheim

Published 2004-04 Proposed

4 15 ISO/IEC 14496-15/Amd.1 (Support for FREXT) N7585 05/10 Nice FDAM ITTF Final Text N/A

75

Page 76: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Editing4 15 ISO/IEC 14496-15/Cor.1 N7575 05/10 Nice COR ITTF N/A4 15 ISO/IEC 14496-15/Cor.2 (NAL Unit Restriction) N8387 06/07

KlagenfurtCOR ITTF N/A

4 15 ISO/IEC 14496-15/Amd.2 (SVC File Format Extension)

N9682 08/01 Antalya FDAM ITTF N/A

4 17 ISO/IEC 14496-17 (Streaming Text) N7479 05/07 Poznan FDAM ITTF TBP4 18 ISO/IEC 14496-18 (Font Compression and Streaming) N6215 03/12 Hawaii Published 2004-07 Proposed4 18 ISO/IEC 14496-18/Cor.1 (Misc. corrigenda and

clarification)N8664 06/10

HangzhouCOR ITTF N/A

4 19 ISO/IEC 14496-19 (Synthesized Texture Stream) N6217 03/12 Hawaii Published 2004-07 Proposed4 20 ISO/IEC 14496-20 (LASeR) N7588 05/10 Nice FDAM Editor TBP4 20 ISO/IEC 14496-20/Cor.1 (Misc. corrigenda and

clarification)N8666 06/10

HangzhouCOR ITTF N/A

4 20 ISO/IEC 14496-20/Amd.1 (LASeR Extension) N9029 07/04 San Jose FDAM ITTF N/A4 20 ISO/IEC 14496-20/Cor.2 (Profile Removal) N9381 07/10

ShenzhenFDAM ITTF N/A

4 20 ISO/IEC 14496-20/Amd.2 (SVGT1.2 Support) N9384 07/10 Shenzhen

FDAM ITTF N/A

4 22 ISO/IEC 14496-22 (Open Font Format) N8395 06/07 Klagenfurt

FDAM Editor Final Text Editing

TBP

7 1 ISO/IEC 15938-1 (MPEG-7 Systems) N4285 01/07 Sydney Published 2002/07 Done7 1 ISO/IEC 15938-1/Amd.1 (MPEG-7 Systems Extensions) N6326 04/03 Munich FDAM ITTF FDAM 04/11/28 N/A7 1 ISO/IEC 15938-1/Cor.1 (MPEG-7 Systems Corrigendum) N6328 04/03 Munich COR Editor N/A7 1 ISO/IEC 15938-1/Cor.2 (MPEG-7 Systems Corrigendum) N7490 05/07 Poznan COR ITTF N/A7 1 ISO/IEC 15938-1/Amd.2 (BiM extension) N7532 05/10 Nice FDAM ITTF N/A7 2 ISO/IEC 15938-2 (MPEG-7 DDL) N4288 01/07 Sydney Published 2002/02 Done7 7 ISO/IEC 15938-7/Amd.2 (Fast Access Ext. Conformance) N8672 06/10

HangzhouFDAM ITTF N/A

7 12 ISO/IEC 15938-12 MPEG Query Format N9830 08/04 Archamps

FDAM ITTF N/A

76

Page 77: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

21 9 ISO/IEC 21000-9 (MPEG-21 File Format) N6975 05/01 HongKong

FDIS ITTF FDIS 05/01/21 Done

21 9 ISO/IEC 21000-9/Amd.1 (MPEG-21 Mime Type) N9837 08/04 Archamps

FDAM ITTF Done

21 15 ISO/IEC 21000-15 (Security in Event Reporting) N9839 08/04 Archamps

FDIS ITTF TBP

21 16 ISO/IEC 21000-16 (MPEG-21 Binary Format) N7247 05/04 Busan FDIS ITTF FDIS 05/04/22 TBP21 5 ISO/IEC 21000-5 (Open Release Content Profile) N9687 08/01 Antalya FDAM ITTF TBPA 4 ISO/IEC 23000-4 (Musical Slide Show MAF) N9037 07/04 San Jose FDIS ITTF TBPA 4 ISO/IEC 23000-4 (Musical Slide Show MAF 2nd Ed.) N9843 08/04

ArchampsFDIS ITTF TBP

A 7 ISO/IEC 23000-7 (Open Access MAF) N9698 08/01 Antalya FDIS ITTF TBPA 8 ISO/IEC 23000-8 (Portabe Video AF) N9853 08/04

ArchampsFDIS ITTF TBP

A 9 ISO/IEC 23000-9 (Digital Multi. Broadcasting MAF) N9397 07/10 Shenzhen

FDIS ITTF TBP

A 9 ISO/IEC 23000-9/Cor.1 (Digital Multi. Broadcasting MAF)

N9854 08/04 Archamps

COR ITTF TBP

B 1 ISO/IEC 23001-1 (XML Binary Format) N7597 05/10 Nice FDIS ITTF TBPB 1 ISO/IEC 23001-1/Cor.1 (Misc. Editorial and technical

clar.)N8680 06/10

HangzhouCOR ITTF N/A

B 1 ISO/IEC 23001-1/Cor.2 (Misc. Editorial and technical clar.)

N9049 07/04 San Jose COR ITTF N/A

B 1 ISO/IEC 23001-1/Amd.1 (Reference Soft. & Conf.) N8886 07/01 Marrakech

FDAM ITTF N/A

B 1 ISO/IEC 23001-1/Amd.1 (Exten. On encoding of wild cards)

N9296 07/07 Lausanne

PDAM ITTF to be published N/A

B 2 ISO/IEC 23001-2 (Fragment Request Unit) N9051 07/04 San Jose FDIS ITTF TBPB 3 ISO/IEC 23001-3 (IPMP XML Messages) N9416 07/04 San Jose FDIS ITTF TBPE 1 ISO/IEC 23008-1 Architecture N8892 07/01

MarrakechFDAM ITTF N/A

E 2 ISO/IEC 23008-2 Multimedia API N8893 07/01 FDAM ITTF N/A

77

Page 78: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

MarrakechE 3 ISO/IEC 23008-3 Component Model N8894 07/01

MarrakechFDAM ITTF N/A

E 4 ISO/IEC 23008-4 Ressource & Quality Management N8895 07/01 Marrakech

FDAM ITTF N/A

E 5 ISO/IEC 23008-5 Component Download N9053 07/04 San Jose FDAM ITTF N/AE 6 ISO/IEC 23008-6 Fault Management N9054 07/04 San Jose FDAM ITTF N/AE 7 ISO/IEC 23008-7 System Integrity Management N9055 07/04 San Jose FDAM ITTF N/A

29116 1 ISO/IEC 29116 Media Streaming MAF Protocols N9420 07/10 Shenzhen

FDAM ITTF N/A

78

Page 79: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

19 Resolutions of Systems

Cf. WG11 resolution.

20 Contributions Reviewed by the Systems Subgroup

N° Title Authors15519 (IPTV) Liaison Statement from ITU-T SG 16 on IPTV [SC

29 N 9516] ITU-T SG 16 via SC29 Secretariat

15521 Architecture for the MPEG User Interface Framework Cyril ConcolatoJean Le FeuvreKyungMo Park

15522 Summary of Voting on ISO/IEC 13818-1:2007/FPDAM 3.2 [SC 29 N 9517]

SC29 Secretariat

15523 Summary of Voting on ISO/IEC FCD 23000-10 [SC 29 N 9518]

SC29 Secretariat

15533 Summary of Voting on ISO/IEC CD 14496-22 [2nd Edition]

SC29 Secretariat

15539 (Thank you) IEC NP: Multimedia systems and equipment -- Multimedia home network configuration -- Basic reference model

IEC TC 100 via SC29 Secretariat

15542 (BIFS Profile) Liaison Statement from TTA [SC 29 N 9559]

TTA via SC 29 Secretariat

15543 (Thank you) IEC CDV 60728-1-2 IEC TC 100 via SC29 Secretariat

15544 (IPTV) Liaison Statement from ITU-T SG 16 on MXW [SC 29 N 9564]

ITU-T SG 16 via SC 29 Secretariat

15550 New BIFS Profile for Interactive Digital Radio Benoît PellanYoung-Kwon LimCyril Concolato

15556 (Audio Book) IEC CD 62571 IEC TC 100 via SC 29 Secretariat

15560 Sensory Effects Metadata for RoSE Framework B.S ChoiSanghyun Joo

15561 User Sensory Preferences Metadata for RoSE Framework B.S ChoiSanghyun Joo

15562 Sensory Device Capabilities Metadata for RoSE Framework

Sanghyun JooB.S Choi

15563 Sensory Device Commands Metadata for RoSE Framework

Sanghyun JooB.S Choi

15571 (PA MAF) Liaison Statement from WG 1 WG 1 via SC 29 Secretariat15585 RoSE description schemes for realistic TV commercials Maeng-Sub Cho

Jin-Seo KimBon-Ki KooYong Soo Joo

79

Page 80: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

N° Title AuthorsSang-Kyun Kim

15591 Proposals for MVC File Format Miska M. HannukselaYe-Kui WangYing Chen

15593 Proposed Text for White Paper of ISO/IEC 23000-6 Professional Archival Application Format

mailto:Hendrymailto:Munchurl%20Kimmailto:Noboru%20Haradamailto:Takehiro%20Moriya

15596 Proposed Text for Working Draft of ISO/IEC 23000-6 PA-AF Reference Software

mailto:Hendrymailto:Houari%20Sabirinmailto:Munchurl%20Kimmailto:Noboru%20Haradamailto:Takehiro%20Moriya

15600 On MVC File Format Karsten GrünebergThomas Schierl

15601 Additional Use Cases for UI Framework KyungMo ParkCyril ConcolatoJean Le Feuvre

15602 Analysis of MPEG-21 UED for Personalization Cyril ConcolatoJean Le FeuvreKyungMo Park

15603 First feedback/comments on the DCCI programmatic API for personalization

Cyril ConcolatoJean Le FeuvreKyungMo Park

15604 Addition information on the MPEG UI Framework Cyril ConcolatoJean Le FeuvreKyungMo Park

15611 File format support for reception of SRTP and MIKEY messages

Daniel CatreinPer Fröjdh

15612 Response to the call for technologies for RoSE Mark VerberktJean H.A. Gelissen (ed.)

15614 CD proposal of LASeR in IPTV Jaeyeon SongSeo-Young HwangYoung-Kwon Lim

15615 Response to the extended call for requirements for MPEG-V

Dirk BrokkenJean H.A. Gelissen

15617 Response to the extended call for requirements for MPEG-V

Jun HuJean H.A. Gelissen (ed.)

15622 Proposal for the Presentation of Structured Information Jihun ChaInjae LeeYoung-kwon LimHan-kyu LeeJinwoo Hong

15626 A proposal for technical specification of Interactive Music AF

Inseon JangHui Yong KimJeongil SeoKyeongok Kang

15631 Ontology for IP Media Networks (IPM-Net): A response to the MVCO CfP

Marc Gauvin et al.

15649 Timed Representation of Metadata for the Presentation of Christian Timmerer

80

Page 81: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

N° Title AuthorsStructured Information (PSI) [email protected]

klu.ac.at. Michael Ransburg15650 Possible Instantiations of the RoSE Framework Markus Waltl

Christian Timmerer15651 Response to Call for Proposals on RoSE: Sensory Effect

MetadataMarkus WaltlChristian Timmerer

15652 Response to Call for Proposals on RoSE: Sensory Device Capabilities

Markus WaltlChristian Timmerer

15659 Error in the ?FRext? extension to the AVC file format David Singer15664 Updated text and reference SW for ISO/IEC 23000-9

(DMB-AF)Hui Yong KimMyungSeok KiHanKyu LeeYong Han KimHouari SabirinMunchurl Kim

15665 Conformance file contribution for ISO/IEC 23000-9 (DMB-AF)

Hui Yong KimSeung Jun YangHanKyu LeeHyunsoon KimDaebok Kwon

15675 MPEG User Interface Framework and W3C WebApps Widgets

Diego GibellinoGiovanni Cordara

15676 Requirements for the MPEG UI Framework Diego GibellinoGiovanni Cordara

15677 LASeR Requirements for Person to Person Communication Services Signaling

Diego GibellinoGiovanni Cordara

15681 Proposal for RoSE Metadata Yasuaki TokumoTakuya IwanamiYoshiaki OgisawaShuichi WatanabeNorio Ito

15684 Study text of ISO/IEC 23000-11 FCD Stereoscopic Video Application Format

Next generation Broadcasting Forum (Korea)

15685 Workplan for ISO/IEC 23000-11 Conformance and Reference Software

Next generation Broadcasting Forum (Korea)

15686 Response to MXM Call for Proposals Walter Allasia et al.15687 Collection of MXM Use Cases Walter Allasia et al.15688 Proposal of MXM Technologies Walter Allasia et al.15689 Proposal of MXM API Walter Allasia et al.15690 Response to the extended call for requirements for MPEG-

V Esko DijkJean H. A. Gelissen

15693 Proposal for adding support of platform-independent language tag strings in the 'name' table.

Simon DanielsMichelle HillVladimir Levantovsky

15694 Proposal for adding support for left-to-right and right-to-left glyph alternatives.

Sairus Patel

15695 Preliminary comments on the 14496-12 PDAM David Singer

81

Page 82: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

N° Title Authors15698 Proposing an Architecture for MPEG Media Value Chain

OntologyMiran ChoiMyung Gil JangHyunki Kim

15705 Update of ISO/IEC 23000-10/Amd1 WD2.0 Conformance and Reference Software

James AnnesleyHouari Sabirin

15710 iKlax Requirements for MPEG Interactive Music Owen LagadecFabien GallotLaurent Primaux

15712 Java software based on Mpeg7 schema including amendment ISO/IEC 15938-5:2003/Amd.3:2008

Tanya Beech

82

Page 83: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Annex G – Video report

Source: Jens Ohm and Gary Sullivan, Chairs

1 Development of AVC

The video subgroup jointly approved the ISO standard related output documents that were produced during the 28th JVT meeting which was held in parallel. Important work items in this context were

– Final work on conformance testing for New Professional Profiles and SVC (both FDAM)

– Improvement of reference software for SVC (Study of FPDAM)– Final work on MVC (FDAM)– Work on MVC software (PDAM) and conformance (WD)– Further work on the Cor.1 (Study of DCOR).

For two issues related to the corrigendum, clarification is expected during the October 2008 meeting: - Definition of level limits as discussed in M15558,- Designation of “constrained baseline” bitstreams and decoders in clause A.2.1.1 (which does this as a kind of naming convention, without explicitly defining conformance point(s)).

Documents reviewed:m15513 Summary of Voting on ISO/IEC 14496-4:2004/FPDAM 30 [SC 29 N

9481]SC29 Secretariat

m15532 Summary of Voting on ISO/IEC 14496-10:200X/FPDAM 1 SC29 Secretariatm15534 Summary of Voting on ISO/IEC 14496-4:2004/FPDAM 31 SC29 Secretariatm15558 JNB Comment on Text of ISO/IEC 14496-10:200X/DCOR1 (N9574) Teruhiko Suzuki

Documents approved:No. Title TBP Available

14496-4 Conformance testing9968 Disposition of Comments on ISO/IEC 14496-4:2004/FPDAM 30 N 08/07/259969 Text of ISO/IEC 14496-4:2004/FDAM 30 Conformance Testing

for New Profiles for Professional ApplicationsN 08/07/25

9970 Disposition of Comments on ISO/IEC 14496-4:2004/FPDAM 31 N 08/07/259971 Text of ISO/IEC 14496-4:2004/FDAM 31 Conformance Testing

for Scalable Video CodingN 08/10/01

9972 Working Draft 1 of ISO/IEC 14496-4:2004/Amd.38 Conformance Testing for Multiview Video Coding

N 08/08/31

14496-5 Reference Software 9973 Request for ISO/IEC 14496-5:2001/Amd.15 N 08/07/259974 Text of ISO/IEC 14496-5:2001/PDAM 15 Reference Software for

Multiview Video CodingN 08/10/01

9975 Study Text of ISO/IEC 14496-5:2001/FPDAM 19 Reference Software for Scalable Video Coding

N 08/08/01

14496-10 Advanced Video Coding9976 Study Text of ISO/IEC 14496-10:200X/DCOR 1 N 08/08/01

83

Page 84: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

9977 Disposition of Comments on ISO/IEC 14496-10:200X/FPDAM 1 N 08/07/259978 Text of ISO/IEC 14496-10:200X/FDAM 1 Multiview Video

CodingN 08/10/01

2 MPEG-7 Visual and Photo Player MAF

2.1 MPEG-7 Visual related work in ArchampsThe MPEG-7 breakout group was active during the whole week. Input documents related to the Visual part in 15938-3 are listed in the table below.

m15605 List of Adjacent Clip Pairs in the VCE-7 Dataset Kota IwamotoRyoma Oami

m15606 Removal of Partially Identical Clips from the VCE-7 Dataset Kota IwamotoRyoma Oami

m15607 Camera Capturing of Video Clips for VCE-7 Robustness Test Kota IwamotoRyoma Oami

m15616 A Proposal on Conformance Testing for Photo Player Application Format

Ryoma Oami

m15623 Proposal on implementation Capturing on camera and Interlaced/Progressive conversion for VCE-7 modification

Weon-Geun OhJu-Kyong JinJun-Woo LeeDong-Seok Jeong

m15627 Cross verification results of Image Signature Karol Wnukowiczm15646 Improved Matching for Global Image Signature

Factor 7 speedup by using simplified Hamming-distance matching (0.3% loss in precision), no change to descriptor and extraction process

Paul BrasnettMiroslaw Bober

m15647 Proposed Improvements for Local Image SignatureBased on speeded-up matching as described for the global descriptor in 15646, it is possible to achieve more precision in a 4-stage hierarchical matching process, up to 80 localized features. Now, average precision including medium and heavy conditions is approx. 98%. Still most problems in case of heavy cropping.Decision to adopt the localized descriptor for Study of FPDAM (text to be provided by end of week). Further discussion whether more than 80 local features should be used, e.g. to allow even more heavy cropping in cases of high-resolution images.

Paul BrasnettMiroslaw Bober

m15658 Image Signature for Complex Conditions Weon-Geun OhAyoung ChoWon-Keun YangJun-Woo LeeDong-Seok Jeong

m15666 Validation Report for VCE-5VCE-5 did not exist. Therefore, this document which obviously contains outdated content was not considered.

Sangki KimHyobin LeeSangyoun Lee

m15670 Video content contribution for VCE-7 from KBS Geun-Soo ParkJu-Kyong JinWeon-Geun OhDong-Seok Jeong

m15671 Video content contribution for VCE-7 from Inha University Ju-Kyong JinWeon-Geun OhDong-Seok Jeong

m15673 Comments on the preliminary call for proposal for video signature Weon-Genu OhDaeil YoonHae-Kwang Kim

m15701 Image Signature for all modifications3D histogram signature (MGST feature, AP feature, Hue feature). In Addition, 2 versions of concentric circle image signature. Those are operated in 3-stage matching. Matching precision 96.4 % under all modifications.

Weon-Geun OhAyoung ChoWon-Keun YangJun-Woo LeeDong-Seok Jeong

m15702 Cross verification result for ETRI VCE-6 proposal Min-Jeong LeeHeung-Kyu Lee

84

Page 85: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

A Study Text of ISO/IEC 15938-3:2001/FPDAM 3 Image Signature Tools (N9979) was issued. This contains the additional possibility to extract signatures from local image features, where up to 80 local Trace Transform features can be defined for prominent feature points as detected by scale-adaptive Harris corner and Laplacian edge detectors (see M15647). Fast matching is implemented as a hierarchical 4-stage process, using simplified Hamming distance. The performance (with same false alarm rate) was reported to be approx. 98% precision (average) including "medium" and "heavy" conditions.For the previous global signature descriptor, another speedup of the (non-normative) matching was achieved as well, which now enables to search 70 million images per second with only marginal loss in detection rate (under "light" conditions which are suitable for the global signature type, 99.3% precision are still achieved under the same negligible false alarm rate as before). The MPEG-7 Visual XM Document version 34.0 (N9982) contains these new non-normative fast matching procedures.A document “Use Cases of Visual Signature Tools” (N9980) was issued, listing various scenarios such as

• Metadata association (services annotating unknown content)• Signature-based search for duplicate content• Identification of lost or stolen content, e.g. museum catalogues• Sharing content in multi-user environment• Unknown object identification, e.g. buildings, animals, …

With regard to the subsequent Amd.4 of MPEG-7 Visual, the Call for Proposals on Video Signature Tools (N9981) was issued. Currently, over 100 hours of content are available from donations and download of freely available materials. The National Film and Television School (UK) has offered up to 200 hours of additional content (selection of shots that are usable for the given purpose is still pending / to be done). Therefore, it appears sure that sufficient amount of content (200 hours in total required) is now available or should be available in near future. The timeline is as follows:

• Pre-registration open since end of Hannover meeting, distribution of (currently< available) datasets starts immediately from different sites in Europe, Asia, US

• Update of CfP and final definition of dataset 08/10• Responses due 09/02, possibly WD• PDAM 09/04• FPDAM 09/10• FDAM 10/04

ISO/IEC 23000-3 / FPDAM 1 Photo Player MAF Reference Software (N9222 of July 2007 meeting) is now delivered and under ballot. The Working Draft of ISO/IEC 23000-3/Amd.2 Conformance Testing for Photo Player MAF (N9983) was started with following timeline:

– PDAM 08/10– FPDAM 09/02– FDAM 09/07

2.2 Output documents related to MPEG-7 Visual

No. Title TBP Available15938-3 Visual

9979 Study Text of ISO/IEC 15938-3:2001/FPDAM 3 Image Signature Tools

N 08/08/08

85

Page 86: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

9980 Use Cases of MPEG-7 Visual Signature Tools Y 08/07/259981 Call for Proposals on Video Signature Tools Y 08/07/259982 MPEG-7 Visual XM Document version 34.0 N 08/08/08

23000-3 Photo Player Application Format9983 Working Draft of ISO/IEC 23000-3/Amd.2 Conformance

Testing for Photo Player MAFN 08/07/25

3 23002 MPEG-C Video Technologies

3.1 23001-4 and 23002-4 Reconfigurable Video Coding (RVC)

3.1.1 General status of work

The two parts related to RVC (ISO/IEC 23001-4 Codec Configuration Representation in MPEG-B and ISO/IEC FCD 23002-4 Video Tool Library in MPEG-C) were progressed into FCD status in April with editorial period, and delivered in time for the ballots, which will be closing before the October meeting (where it is planned to promote both parts for FDIS). The main issue of work during the week in the RVC breakout group was related to editorial improvements on both texts (see items below). Study documents were drafted. Main issues still to be resolved are the relationship of RVC-BSDL as defined in 23001-4 with BSDL (should it restrict the usage of BSDL elements that are not currently needed for RVC context?), and the proper definition of tools in 23002-4 (usually only input-output behaviour of FUs needs to be normative, but some FUs are described in a fashion of sequential algorithms internally; furthermore, careful checking is necessary about whether the FU descriptions are precisely equivalent to the elements of the MPEG standards from which they were drawn).

While the FDIS as planned for October will give full support only for MPEG-2 main profile, MPEG-4 simple profile and MPEG-4 AVC baseline profile, further tools for MPEG-4 advanced simple profile, AVC high profile and SVC will be supported in Amd.2, for which the second version of WD was issued. Likewise, WD 2 of software and conformance amendment (Amd.1) was also produced.

To clarify the relationships of MPEG and non-MPEG FUs / toolboxes that could both be used in the RVC context, a document entitled RVC vision (N9990) was issued which clearly expresses that only the elements defined by MPEG (in 23001-4 and 23002-4) are handled as MPEG compliant, while usage of non-MPEG tools is under sole responsibility of their respective users.

3.1.2 Assignment of editorsDocuments Editors

Study of FCD of 23001-4 (MPEG-B CCR) Gwo Giun, TK, Hyungyu, ChristopheStudy of FCD of 23002-4 (MPEG-C VTL) Yishin, TK, Hwa SeonRVC CE description Kazuo, Hyungyu, DandanWorkplan Christophe, Yishin, Conformance & RSM WD Gwo Giun, ChristopheExtension to VTL MikaelLiaison response & RVC Vision Euee, Hwa Seon, Dandan

86

Page 87: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

3.1.3 Allocation of input contributions Study of FCDDoc. No. Titlem15559 Proposed update of FCD ISO/IEC 23002-4: Video Tool Library

(VTL)- Reviewed during the AhG meeting on Sunday

m15707 Comments on ISO/IEC 23002-4 FCD- Reviewed during the AhG meeting on Sunday

m15708 Comments on ISO/IEC 23001-4 FCD- Reviewed during the AhG meeting on Sunday

MPEG-B RelatedDoc. No. Titlem15549 Proposed network of functional units for AVC High Profile

- Some FUs are missing.- [Recommendation] Need to have a ‘top-level’ connections of FUs for

supported decoding configurations.M15630 New version of the BSDL2CALML parser in RVC

- Example to convert BSDL Schema representation to CAL(ML) parser.- [Discussion] BSDL vs. RVC BSDL

Some definitions of BSDL can be replaced with other definitions in BSDL.

Still there is a need for further experiment. CE till the next meeting.

- [Recommendation] to continue the work to complete the work till the next meeting as CE to review the first draft of CE on Thursday morning

m15663 CAVLC Validation Procedure and Implementation for AVS in RVC Framework- CA-2D VLD of AVS is given as example.- [Recommentation] continue as CE

M15679 DDL of Intra MPEG AVC decoder- FU network example of AVC decoder provided.- [Recommendation]

to include some ‘top-level’ decoder network in MPEG-B FCD Study io include DDL examples in RSM implementation

M15680 DDL Graph Editor- Can be used to provide a conformance testing of connection of FUs in the

future.- [Recommendation] to include this tool in RSM

MPEG-C related Doc. No. TitleM15662 Liaison response on N9787

- Reviewed during the AhG meeting on Sundaym15699 Modified FUID definition for multiple tool libraries

- [Recommendation] To put guidelines on registration of toolboxes in MPEG-B To modify FU naming convention according to the proposal in MPEG-C

87

Page 88: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

m15703 Introduction of Token description in VTL- [Recommendation]

Token precedence will be studied further till the next meeting. Put token list description in VTL study. Further investigation on FU and Token description is needed till the next

meeting.

Study of FCD- Points identified for clarification:

1. Bitstream syntax and parsing description (BSPD)- Thorough comparison between BSDL-schema and RVC-BSDL schema is

needed. Core Experiment- What is added on top of BSDL-schema (i.e., port connections to FUs) needs

a precise description. Core Experiment- The current concept of RVC does not allow the on-the-fly generation of new

FUs with an exception of parser FU. Study of FCD (in introduction)- Example BSDL schema of one example decoder configuration (e.g., MPEG-

4 SP) in Annex (informative) pointer to MPEG-C for other examples.2. FU network description (FUND)

- (XML) Description of top-level decoder network of FUs for one example decoder configuration in Annex (informative): pointer to MPEG-C for other examples.

3. FUs in VTL- Separation of normative and nonnormative FUs in VTL- Diagram of top-level decoder network of FUs for decoder configuration for

the currently supported profiles and levels- Clear description of interfaces of FU- (XML) Descriptions of top-level decoder network of FUs for the supported

decoder configurations as example in Annex (informative)- Example BSDL schema of example decoder configuration of the currently

supported profiles and levels in Annex (informative)

Output Documents:

No. Title TBP Available23001-4 Codec Configuration Representation

9984 Study Text of ISO/IEC FCD 23001-4 Codec Configuration Representation

N 08/09/30

23002-4 Video Tool Library9985 Study Text of ISO/IEC FCD 23002-4 Video Tool Library N 08/09/309986 WD 2 of ISO/IEC 23002-4/Amd.1 (Conformance and Reference

Software)N 08/07/25

9987 WD 2 of ISO/IEC 23002-4/Amd.2 (Tools for MPEG-4 ASP, AVC HP and SVC)

N 08/07/25

9988 Description of Core Experiments in RVC N 08/08/089989 RVC Work Plan and FU Development Status N 08/07/259990 RVC Vision Y 08/07/25

88

Page 89: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

4 Explorations – 3D Video

The goal of 3D video, as a first step towards a broader range of free-viewpoint (FTV) applications, is to generate interpolated views from available videos of multiview camera configurations. The target application is mostly seen for upcoming generations of (auto-) stereoscopic displays, for which only a low number (1) of video sequences shall be transmitted, but rendering of additional views shall be enabled by associated depth information. To make a next step towards development of such a system, a set of exploration experiments had been set up in Archamps, during which the available test sequences were used with the two available depth estimators and associated view interpolation methods. Various cases of sparseness (baseline distances between cameras to be taken from the dense set) were investigated. Experts viewing using stereoscopic displays was performed in Hannover. Three different display types (two stereoscopic, one autostereoscopic) were available (see N9992 for more details). A first glimpse at the results clearly indicated that only for the “light” case of small baseline distance, results without heavy artifacts could be obtained; therefore, the more intense experts viewing, that involved larger groups of people was only performed to this case. A high-level summary is given here:

a) Results on original views:Most sequences have been rated “good” for 3D viewing, except “Alt Moabit”, which is an outdoor scene with mostly far-away content. Some comments were made that could be helpful in case of future production of test material:

- Black background is OK when the foreground object has sufficient depth structure (e..g champagne tower)

- In general, sequences with more motion / less static background might be desirable (may have implication on encoding and rates to be defined per sequence)

- Depth structure was too low in one sequence (dog) – can be resolved by using different camera baseline

- Framing problem: Conflict of depth perception when the disparity at screen boundary is large

Conclusion: In general, the sequences are suitable for 3D viewing. As a general experience, it is necessary to get familiar with 3D data viewing.

b) Results on synthesized views:Currently, synthesized views (even for the “light” case) are not acceptable. Main problems are as follows:

- Flickering- Ringing- Ghosting- Temporal Instability- Boundary distortions

Conclusion: For the software frame work that is available (Depth Estimation /DE + View Synthesis /VS) the quality is not yet acceptable. It is expected that this situation can be highly improved by relatively simple measures until the next meeting. Before progressing with experiments on AVC/MVC-encoded sequences, further improvements are required for both DE and VS.

c) Judgement of displays:- Usually larger displays are considered better, but smaller displays had better depth

impression sometimes.

89

Page 90: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

- It is necessary to clearly define viewing distance, depending on display size, only few people possible simultaneously, controlled seating positions

- Stereoscopic vs. autostereoscopic: Stereoscopic has less problems with individual adjustment. This should be the default display, autostereoscopic should also be checked if available, because this is the most likely target of the application (note: the interpolation artifacts are likewise visible on both stereoscopic and autostereoscopic diplays)

The next round of experts viewing is planned for the Busan meeting.

The most important step until then is to proceed with improving depth estimation and view synthesis. Those key points will be investigated in a collaborative approach within the next round of Exploration Experiments:

• Improve results of depth estimation (subpel, temporal consistency) – EE 1• Improve results of view synthesis (line projection from rectified images, filling of

holes, boundary filtering) – EE 2• Investigate alternative view synthesis from Layered Depth Video (LDV), using same

depth maps as from EE 1 – EE 3

The EEs will run separately (improving depth estimation and view synthesis) until 2 weeks before next meeting, then the participants will exchange results (software bin or src, depth maps etc.). Combinations of other, newly developed DE (bring depth maps) and VS algorithms can also be computed on-site at next meeting, after AHG reviews the separate results. The Common software basis will be maintained via the MPEG SVN repository.

Once the view synthesis results for the uncoded case provide sufficient quality, encoding experiments (using AVC/MVC both for videos and depth maps) will be performed with the purpose to select appropriate bit rates. This could at earliest start after the Busan meeting. Therefore, the shortest possible tentative timeline could be as follows:

• Until 08/10: Generate improved depth maps and interpolated view sequences from the test sets

• 08/10: Perform experts viewing with different stereoscopic displays, again decide about maturity of anchors

• 08/10: Draft CfP (internal)• Until 09/02: Perform experiments to decide about bit rates for the MVC-based video-plus-

depth anchors, selection of sequences• 09/02: Experts viewing again, preliminary CfP• Until 09/04: Refinement of test conditions• 09/04: Final CfP • 09/07: Responses due

Documents reviewed m15540 Depth Map Estimation Software version 3 Olgierd Stankiewicz

Krzysztof Wegnerm15553 Results of Exploration Experiment in 3D Video Coding for Dog

Data SetAljoscha SmolicYongzhe WangKarsten Mueller

m15573 3D Video Exploration Experiment on Baseline Length Dependency

Taka SenohKenji YamamotoRyutaro OiTomoyuki MishinaMakoto Okui

90

Page 91: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

m15583 Results of Depth Estimation and View Synthesis on Doorflowers Test Data Set for EEs in 3D Video Coding

Shinya ShimizuHideaki Kimata

m15584 Experimental Results on Depth Estimation and View Synthesis with subpixel-precision

Shinya ShimizuHideaki Kimata

m15589 Exploration Experiments on seq 'Arrive book' & 'Alt-moabit' [email protected]@philips.com

m15590 Creation of LDV streams out of MV sequences [email protected]@[email protected]

m15594 Enhancement of Temporal Consistency for Multi-view Depth Map Estimation

[email protected] (Sang-Beom Lee)[email protected] (Yo-Sung Ho)

m15595 Results of Exploration Experiment on View Synthesis [email protected] (Cheon Lee)[email protected] (Yo-Sung Ho)

m15597 Boundary Filtering on Synthesized Images for 3D Video [email protected] (Cheon Lee)[email protected] (Yo-Sung Ho)

m15599 Experimental Results of EE on LeavingLaptop sequence for MPEG-FTV

Gi-Mun UmGun BangNamho HurJinWoong Kim

m15620 Exploration Experiment results on 'Champagne_tower' and 'Pantomime' sequences

Ivana RadulovicPer Fröjdh

m15648 EE results on Leaving laptop Purvin PanditDong TianCristina Gomila

m15661 EE results on lovebird2 Jaewon SungJung Eun Lim

m15672 View Synthesis Software and assessment of its performance Mateusz GotfrydKrzysztof [email protected]. Marek Doma?ski

m15674 EE results on lovebird1 sequence Patrick LopezGuillaume Boisson

m15696 Simple View Synthesis Dong TianPurvin PanditCristina Gomila

m15697 Experimental Results for Exploration Experiment in 3D Video Sehoon YeaKwan-Jung OhAnthony Vetro

Output documents:No. Title TBP Available9991 Description of Exploration Experiments in 3D Video Coding N 08/07/259992 Results of 3D Experts Viewing N 08/07/25

5 Explorations – New Challenges in Video Coding

A brainstorming session on topics of future video coding standardization was held on Tuesday afternoon (presentation of contributions as listed below), with a follow-up on Thursday morning. The main points of discussion are summarized in N9993. Possible application areas for higher-compression video are mainly seen in• High resolution HD and beyond (4Kx2K, potentially 8Kx4K) for Regular broadcast and

IPTV, Packaged media, Surveillance,P2P communication• Resolution from VGA up to beyond-HD for Mobile Broadcast and downloads

In addition, preliminary investigations were reportted about the performance of AVC and some modifications to it (Macroblock and transform block sizes, adaptive transforms, adaptive interpolation filters) showed possible improvements mainly for the cases of high rates and high resolutions.

First thoughts about requirements were mentioned as follows:

91

Page 92: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

• Time line : Beyond 2012• (open) Should the next-generation video standard still be generic or application specific?• Color sampling: 4:2:0 and 4:4:4 (certainly no interlaced needed any more); Progressive

12, … , 60 fps, high frame rates beyond 60 could also happen; Display resolutions VGA … 4Kx2K, potentially 8Kx4K

• Compression: Two times over AVC HP desirable for the target application(s); at no point of the entire range should it be worse than existing standard(s).

• Complexity increase as compared to AVC shoudl matches what will be possible for implementation by the expected time of usage

• (open) Support for Stereo? Scalability? Lossless/near lossless?

As the next step, to face the challenges that emerging applications impose on the requirements of video coding standardization, it is planned to organize a full-day workshop in Korea prior to or during the meeting. The key intention of the workshop is to acquire solid information about the context in which video coding will be operating in the future, which will enable MPEG to draw conclusions for the needs and chances in video coding standardization during the next years and to start drafting three key documents: Technology context, applications and requirements for a new video coding standard. For this purpose it is planned to invite speakers on key topics and in addition select a variety of proposed contributions. The following topics will be considered:

1) Applications: Which video signal resolutions are expected, which data rates will be manageable, which networks/storage media will be dominant starting from the middle of the next decade for

a. Unicast and broadcast video/movie distribution,b. Wireless and mobile,c. Packaged media.

2) Next-Generation Networks (around the year 2015)a. Which data rates and quality of service characteristics are expected for

distribution over IP, WAN, IP-TV, wireless, mobile; unicast, multicast, broadcast, peer 2 peer, access and core networks?

b. Existence of approaches for transmission, modulation, channel coding etc. that could break existing barriers for terrestrial, satellite and cable broadcast, mobile applications; are there limits in terms of efficient usage of bandwidth and cost?

3) Video equipment: Availability of cameras, displays, projectors etc. (both for professional and consumer sectors) around the year 2015

a. Cameras: Resolution, color space, quality of lenses, noise characteristics for HD and beyond in particular, cost.

b. Displays/projectors: Resolution, color space, size, resolution of typical input signals, cost.

c. Human factors: What size/resolution is useful considering the properties of human visual system? How likely is it that people will put very large displays (beyond HD) into their houses?

4) Compression technologya. What are known limitations or missing functionality of current compression

technology, considering new applications? b. New compression methods that could overcome such limitations.c. Methods to evaluate performance: Visual quality, complexity.

92

Page 93: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Procedure:• Both invited and proposed contributions• Both MPEG and non-MPEG experts• Call for Contributions: 08/08/01• Responses: 08/09/05• Selection, final Program: 08/09/10• Workshop: 08/10/11 or t.b.d. (full day) in Busan or t.b.d.

Documents reviewed (see N9993 for detailed comments)

m15577 Requirements for next-generation video coding standard Shun-ichi SekiguchiYoshihisa YamadaYoshiaki KatoKohtaro AsaiTokumichi Murakami

m15704 Consideration on an approach for developing a new video codec Tomoyuki YamamotoTomoko AonoNorio Itoh

m15682 Industry Trends around Video Coding Jung Suk SuhKyong Sok Seo

m15579 Analysis for Ad Hoc Group on New Challenges in Video Coding Standardization

Kyohyuk LeeElena AlshinaJeonghoon ParkWoojin HanJunghye Min

m15580 Technical considerations for Ad Hoc Group on New Challenges in Video Coding Standardization

Kyohyuk LeeElena AlshinaJeonghoon ParkWoojin HanJunghye Min

m15578 On motion block size and partitions for 4:4:4 coding Shun-ichi SekiguchiShuichi YamagishiYusuke ItaniKohtaro AsaiTokumichi Murakami

Output documents:No. Title TBP Available9993 New Challenges in Video Coding Standardization N 08/07/259994 Workshop on New Challenges in Video Coding Standardization –

Call for ContributionsY 08/08/01

9995 Workshop on New Challenges in Video Coding Standardization –Program

Y 08/09/10

6 Other – Color conversion in case of multiple codecs in series

m15569 Proposal on standardization of a filter set for converting between 4:2:2/4:2:0 chrominance sampling

Distortions occur in case of concatenating several codecs with down/upsampling. New filter set is claimed to be capable avoiding this. Retains same phase position as conventional (“reference”) filter. Filters proposed are 8tap/4tap for down- and upsampling. Results: Improvement by 2.5 dB after 5 iterations (only one sequence presented).

Question: Will we need interlaced in the future? 4:4:4 and 4:2:0 could be sufficient.Answer: Similar approach can be designed for progressive.

Akira NakagawaKimihiko KazuiShunsuke Kobayashi

93

Page 94: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Question:What is meant by “reference” filter set? MPEG standards do not have any normative approach for MPEG down-/upsampling.Answer: Apparently the filters used in MPEG-4 VM.

Understanding of video group: So far, this was never standardized. To do it would require 1. A critical mass of companies requesting and 2. More evidence about the necessity (not just one sequence) 3. More understanding how typical this application case is in practice.

If at all, MPEG-C could be a home for such stuff, which seems to work somewhat across different video standards (similar to IDCT)

94

Page 95: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Annex H– JVT report

Source: Gary Sullivan and Jens Ohm, Chairs1. Documents of the JVT meeting 96

1.1. Input documents 961.1.1 Administrative input contributions.......................................................................961.1.2 Input liaison statements and parent-body inputs...................................................961.1.3 Non-administrative input contributions................................................................961.1.4 Late-registered input contributions, BoG reports, etc...........................................97

1.2. Late document availability 971.3. Withdrawn document registrations 981.4. Major output documents 98

JVT-AB200 Meeting report of the 28th JVT meeting (this document)..................................................98JVT-AB201-M (WG 11 N9969) Draft conformance testing for New Profiles for Professional Applications (T. Suzuki) [2008-07-25]...................................................................................................98JVT-AB202-M (WG 11 N9971) Draft conformance testing for SVC (V. Bottreau) [2008-10-01].......99JVT-AA203-M (WG 11 N9975) Draft reference software for SVC (H. Schwarz) [2008-08-01]..........99JVT-AA204 (WG 11 N9978) Joint draft multi-view video coding (MVC) [2008-10-01].....................99JVT-AA205-M (WG 11 N9976) ITU-T Rec. H.264 | ISO/IEC 14496-10 Advanced video coding defect report (G. Sullivan, H. Schwarz) [2008-08-01]............................................................................99JVT-AB206-M (WG 11 N9972) Draft conformance testing for MVC (A. Vetro et al.) [2008-08-31]. 99JVT-AA207-M (WG 11 N9974) Draft reference software for MVC (A. Vetro et al.) [2008-10-01]....99

1.5. JVT internal output documents 992. JVT administrative and liaison topics99

2.1. IPR policy reminder and update 992.2. Meeting opening and remarks by the chairmen 1012.3. JVT communication practices1022.4. Scheduling and logistics notes 1022.5. Administrative documents 102

JVT-AB000 List of documents of Hannover meeting.........................................................................102JVT-AB001 (AhG) [G. J. Sullivan, J.-R. Ohm, A. Luthra, T. Wiegand] AHG Report: Proj mgmt and errata......................................................................................................................................................103JVT-AB002 (AhG) [T. Wiegand, K. Suehring, A. Tourapis, T. Suzuki, G. J. Sullivan] AHG Report: JM text, ref soft, bitstream, conf...........................................................................................................103JVT-AB003 (AhG) [H. Schwarz, J. Vieron, T. Wiegand, M. Wien, A. Eleftheriadis, V. Bottreau] AHG Report: JSVM text, S/W, conf..............................................................................................................103JVT-AB004 (AhG) [A. Segall, T. Wiegand, Y.-J. Chiu] AHG Report: SVC bit depth, color gamut, and chroma format.......................................................................................................................................105JVT-AB005 (Ahg) [H. Kimata, A. Smolic, P. Pandit, A. Vetro, Y. Chen] AHG Report: MVC JD & JMVM text & software.........................................................................................................................105JVT-AB006 (AhG) [Y. Chen, S. Gao, Y.-J. Jeon] AHG Report: MVC JMVM coding tools..............107JVT-AB007-M (AhG) [G. J. Sullivan, A. Rodriguez, S. Narasimhan] AHG Report: Splicing operation...............................................................................................................................................................108

2.6. Closing session notes 1082.7. JVT liaison communications and parent-body communications 108

JVT-AB014 / M15538 [DVB TM-AVC] Incoming LS to WG 11 on Constrained Baseline...............108JVT-AB015 / M15554 [3GPP TSG SA4] Incoming LS to WG 11 on Constrained Baseline..............109Discussion of "Constrained Baseline" issue..........................................................................................110JVT-AB016 / M15711 [ATSC] Incoming LS to WG 11 on AVC Profiles..........................................110

3. AVC base specification, errata, and related topics 112JVT-AB010 / VCEG-AI07 (Errata) [HHI] ITU-T AAP Last Call comments on H.264(2007)/Cor.1. 112JVT-AB011 / VCEG-AI08 (Errata) [Microsoft] ITU-T AAP Last Call comments on H.264(2007)/Cor.1................................................................................................................................113JVT-AB012 / VCEG-AI09 (Errata) [Sony] ITU-T AAP Last Call comments on H.264(2007)/Cor.1 113

95

Page 96: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

JVT-AB013 / M15558 (Errata) [JNB of WG11] WG 11 NB comments on ISO/IEC 14496-10:200X/DCOR1.......................................................................................................................113JVT-AB021 (Prop 2.2/3.1) [K. Kazui, A. Nakagawa, S. L. Bhavani, S. Kobayashi (Fujitsu)] Modification of POC type 1 specification.............................................................................................113JVT-AB036 (Errata 2.2.1/3.1) [Y.-K. Wang (Nokia)] SVC errata items.............................................114

4. Scalable video coding (SVC) 1154.1. SVC bit depth, color gamut, and chroma format scalability 115

JVT-AB029 (Req 2.2/3.1) [Y.-J. Chiu, H. Jiang, Y.-T. Peng, L. Xu (Intel)] Requirements for scalable video coding: color gamut scalability and bit depth scalability............................................................115

4.2. SVC Conformance 116JVT-AB023 (Ed....................................................................................................................................116Draft) [V. Bottreau (Thomson), A. Eleftheriadis (Vidyo)] SVC conformance testing.........................116

5. Multi-view coding (MVC) 1165.1. Comments on JD 116

JVT-AB020 (Prop 2.2/3.1) [Y.-J. Jeon, B.-M. Jeon (LG)] MVC comment on JD...............................116JVT-AB022 (Prop 2.0) [J. Huo, Y. Chang, M. Li, H. Yang (Xidian Univ.), S. Lin, S. Gao, L. Xiong (Huawei)] Comment on MVC JD 7.0...................................................................................................117JVT-AB024 (Prop 2.2.1/3.1) [Y. -K. Wang, Y. Chen, M. M. Hannuksela (Nokia)] Comments to MVC JD 7.0....................................................................................................................................................118JVT-AB028 (Ed. Draft) [A. Vetro (MERL), P. Pandit (Thomson), H. Kimata (NTT), A. Smolic (HHI), Y. -K. Wang (Nokia)] Editor's Input on MVC......................................................................................119

5.2. MVC profiles 119JVT-AB035 (Prop Prof 2.2/3.1) [H. Kimata (NTT), T. Nomura (Sharp), H. Nakamura (JVC), T. Itoh (Fujitsu)] On Multiview High profile....................................................................................................119JVT-AB037-L (Info) [A. Vetro (MERL)] MVC Profile/Level Definitions for Stereo........................121

5.3. MVC SEI 122JVT-AB025 (Info SEI) [M. Ji, Y. Chen, Y. -K. Wang, M. M. Hannuksela, H. Li (Nokia)] Showcase for non-required view component SEI message...................................................................................122JVT-AB026 (Info SEI) [M. Ji, Y. Chen, Y. -K. Wang, M. M. Hannuksela, H. Li (Nokia)] Showcase for view dependency structure SEI message.........................................................................................122JVT-AB027 (Info SEI) [M. Ji, Y. Chen, Y. -K. Wang, M. M. Hannuksela, H. Li (Nokia)] Showcase for operation point not present SEI message.........................................................................................123

5.4. MVC JMVM 124JVT-AB030 (Prop 2.2/3.1) [Y.-J. Jeon, B.-M. Jeon (LG), S. Gao, L. Xiong (Huawei)] Comments on JMVM 8.0.............................................................................................................................................124

5.5. MVC conformance testing 1245.6. MVC verification testing 125

6. AVC reference encoding methods 125JVT-AB031 (Ref soft 2.0/3.1) [W.-S. Kim, P. Pahalawatta, Z. Li, A. M. Tourapis (Dolby)] New Video Quality Metrics in JM reference software.............................................................................................125JVT-AB032 (Ref soft 2.0/3.1) [P. Pahalawatta, A. M. Tourapis (Dolby)] Reorg of Error Resilient RDO in the ref software..................................................................................................................................125JVT-AB033 (Ref soft 2.0/3.1) [A. Leontaris, A. M. Tourapis (Dolby)] Weighted Prediction Alternative to Adaptive Interpolation mechanisms..................................................................................................126JVT-AB034 (Ref soft 2.0/3.1) [E. Maani, L. Liu, A. M. Tourapis (Dolby)] JM Reference Software Enhancements........................................................................................................................................126

7. H.222.0 | MPEG-2 Systems extension for SVC 1268. JVT internal operating rules 1279. List of AHGs established 128

9.1. JVT project management and errata reporting 1289.2. JM Text, reference software, bitstream exchange and conformance 1299.3. SVC JSVM text, software and conformance 1299.4. SVC bit depth, and chroma format scalability 1299.5. MVC JD and JMVM text, software, and conformance 1299.6. Splicing operation 129

10. Future meeting plans 13011. Attendance 130

96

Page 97: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

97

Page 98: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

1 Documents of the JVT meeting

1.1 Input documents

1.1.1 Administrative input contributions

JVT-AB000 List of documents of Hannover meetingJVT-AB001-M (AhG) [G. J. Sullivan, J.-R. Ohm, A. Luthra, T. Wiegand] AHG Report: Proj mgmt and errataJVT-AB002-M (AhG) [T. Wiegand, K. Suehring, A. Tourapis, T. Suzuki, G. J. Sullivan] AHG Report: JM text, ref soft, bitstream, confJVT-AB003 (AhG) [H. Schwarz, J. Vieron, T. Wiegand, M. Wien, A. Eleftheriadis, V. Bottreau] AHG Report: JSVM text, S/W, confJVT-AB004 (AhG) [A. Segall, T. Wiegand, Y.-J. Chiu] AHG Report: SVC bit depth, color gamut, and chroma formatJVT-AB005 (Ahg) [H. Kimata, A. Smolic, P. Pandit, A. Vetro, Y. Chen] AHG Report: MVC JD & JMVM text & softwareJVT-AB006 (AhG) [Y. Chen, S. Gao, Y.-J. Jeon] AHG Report: MVC JMVM coding toolsJVT-AB007-M (AhG) [G. J. Sullivan, A. Rodriguez, S. Narasimhan] AHG Report: Splicing operation

1.1.2 Input liaison statements and parent-body inputsThe following ITU-T Q.6/16 and WG 11 parent-body input contributions were considered:

JVT-AB010 / VCEG-AI07 [HHI] ITU-T AAP Last Call comments on H.264(2007)/Cor.1JVT-AB011 / VCEG-AI08 [Microsoft] ITU-T AAP Last Call comments on H.264(2007)/Cor.1JVT-AB012 / VCEG-AI09 [Sony] ITU-T AAP Last Call comments on H.264(2007)/Cor.1JVT-AB013 / M15558 [JNB of WG11] WG 11 NB comments on ISO/IEC 14496-10:200X/DCOR1JVT-AB014 / M15538 [DVB TM-AVC] Incoming LS to WG 11 on Constrained BaselineJVT-AB015 / M15554 [3GPP TSG SA4] Incoming LS to WG 11 on Constrained BaselineJVT-AB016 / M15711 [ATSC] Incoming LS to WG 11 on AVC Profiles

1.1.3 Non-administrative input contributions

JVT-AB020 (Prop 2.2/3.1) [Y.-J. Jeon, B.-M. Jeon (LG)] MVC comment on JDJVT-AB021 (Prop 2.2/3.1) [K.Kazui, A.Nakagawa, S.L.Bhavani, S.Kobayashi (Fujitsu)] Modification of POC type 1 specificationJVT-AB022 (Prop 2.0) [J. Huo, Y. Chang, M. Li, H. Yang (Xidian Univ.), S. Lin, S. Gao, L. Xiong (Huawei)] Comment on MVC JD 7.0JVT-AB023 (Ed. Draft) [V. Bottreau (Thomson), A. Eleftheriadis (Vidyo)] SVC conformance testingJVT-AB024 (Prop 2.2.1/3.1) [Y. -K. Wang, Y. Chen, M. M. Hannuksela (Nokia)] Comments to MVC JD 7.0JVT-AB025 (Info SEI) [M. Ji, Y. Chen, Y. -K. Wang, M. M. Hannuksela, H. Li (Nokia)] Showcase for non-required view component SEI messageJVT-AB026 (Info SEI) [M. Ji, Y. Chen, Y. -K. Wang, M. M. Hannuksela, H. Li (Nokia)] Showcase for view dependency structure SEI message

98

Page 99: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

JVT-AB027 (Info SEI) [M. Ji, Y. Chen, Y. -K. Wang, M. M. Hannuksela, H. Li (Nokia)] Showcase for operation point not present SEI messageJVT-AB028 (Ed. Draft) [A. Vetro (MERL), P. Pandit (Thomson), H. Kimata (NTT), A. Smolic (HHI), Y. -K. Wang (Nokia)] Editor's Input on MVCJVT-AB029 (Req 2.2/3.1) [Y.-J. Chiu, H. Jiang, Y.-T. Peng, L. Xu (Intel)] Requirements for scalable video coding: color gamut scalability and bit depth scalabilityJVT-AB030 (Prop 2.2/3.1) [Y.-J. Jeon, B.-M. Jeon (LG)] Comments on JMVM 8.0JVT-AB031 (Ref soft 2.0/3.1) [W.-S. Kim, P. Pahalawatta, Z. Li, A. M. Tourapis (Dolby)] New Video Quality Metrics in JM reference softwareJVT-AB032 (Ref soft 2.0/3.1) [P. Pahalawatta, A. M. Tourapis (Dolby)] Reorg of Error Resilient RDO in the ref softwareJVT-AB033 (Ref soft 2.0/3.1) [A. Leontaris, A. M. Tourapis (Dolby)] Weighted Prediction Alternative to Adaptive Interpolation mechanismsJVT-AB034 (Ref soft 2.0/3.1) [E. Maani, L. Liu, A. M. Tourapis (Dolby)] JM Reference Software EnhancementsJVT-AB035 (Prop Prof 2.0/3.1?) [H. Kimata (NTT), T. Nomura (Sharp), H. Nakamura (JVC), T. Itoh (Fujitsu)] On Multiview High profileJVT-AB036 (Errata 2.2.1/3.1) [Y.-K. Wang (Nokia)] SVC errata items

1.1.4 Late-registered input contributions, BoG reports, etc.

JVT-AB037-L (Info) [A. Vetro (MERL)] MVC Profile/Level Definitions for Stereo

1.2 Late document availabilityNon-administrative documents with document numbers suffixed in this report with "-L", "-Q", or "-M" were classified as late. Such documents will only be considered as information documents only (unless agreed otherwise by the group) if time permits, and consideration of them may be shifted to the end of the meeting as determined appropriate by the group.

For some time now, the JVT has agreed that no late-uploaded non-administrative (non-AHG-report, non-liaison, non-verification, non-errata-report, non-editor's draft) contribution would be presented without having a minimum of 4 JVT participants (from different other than that of the primary contribution author) recorded by name as supporting the allowance of such a presentation, in addition to a consensus of the general JVT membership to allow the presentation. Such support to allow a presentation is to be understood to not necessarily imply support of the adoption of the content of the late contribution, but only as a positive expression that the document should be allowed to be presented. Additionally, the provider of a presented late contribution shall send an email apology to the JVT email reflector. This rule does not apply to material requested by the JVT at the meeting (e.g., reports of JVT-authorized side activities).

JVT decision: Agreed.

Suffixes for contribution numbers in this report are explained below:– "-L" indicates a non-administrative contribution that was somewhat late but was available by

the second meeting day (contribution JVT-AB037 was in this category at this meeting).– "-Q" were more late than that (there were no contribution in this category at this meeting).– "-M" were still missing at the time of preparation of this draft of this report (there are no

contributions in this category for this meeting).– "-B" were break-out group discussion reports and other input requested during the meeting

Further suffixing by “V” indicates a contribution that contains a cross-verification of a proposal.99

Page 100: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

On consideration of JVT-AB037An apology was sent.Supporting presentation were V. Bottreau, Y.-K. Wang, A. Eleftheriadis, and H. Schwarz.

There were no objections to presentations of late documents at this meeting.

It was noted that the situation surrounding the need for on-time availability of contributions has substantially improved since our lateness penalty rules were adopted.

1.3 Withdrawn document registrationsNone.

1.4 Major output documentsMajor output documents submitted to parent-body review included the following. (Dates listed are planned dates of availability.)

Review of missing outputs of last meeting:Prof prof reference software [ISO/IEC FDAM version (now done), JVT version]WD 1 of MVC reference software [JVT-AA212 ISO/IEC w9761]JMVM software [JVT-AA208 ISO/IEC w9763]

(Also, AVC Annex F to ISO/IEC secretariat needs editors' attention.)

Overview of outputs planned for this meetingErrata [Study status for ISO/IEC] (1 Aug)Prof prof conformance testing [FDAM for ISO/IEC, JVT?] (no editing period)Prof prof conformance testing DoCR on FPDAM ballot (no editing period)SVC conformance testing [FDAM for ISO/IEC, JVT] (1 Oct)SVC conformance DoCR on FPDAM ballot (no editing period)SVC reference software [Study status for ISO/IEC] (1 Aug – before ballot close)MVC text [FDAM for ISO/IEC] (1 Oct)MVC text DoCR on FPDAM ballot (no editing period)Draft 2 of MVC reference software [PDAM for ISO/IEC] (1 Oct)Draft 1 of MVC conformance [WD for ISO/IEC] (31 Aug)LS to ATSC (to coordinate with WG 11)LS to various orgs on MVC completion (no editing period)

Note that MPEG Systems plans to finalize SVC on ITU-T H.222.0 | ISO/IEC 13818-1 MPEG-2 Systems.

Notes on meeting resolutionsThank GNB for comments on SVC conf FDPAMThank ETRI, HHI, Orange, Sharp, Thomson, and Vidyo for providing bitstreams and

associated descriptions for inclusion in SVC conf FDAM

1.4.1.1.1 JVT-AB200 Meeting report of the 28th JVT meeting (this document)

1.4.1.1.2 JVT-AB201 -M (WG 11 N9969) Draft conformance testing for New Profiles for Professional Applications (T. Suzuki) [2008-07-25]

100

Page 101: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

(Conveyed to WG 11 as "Text of ISO/IEC 14496-4:2004/FDAM 30 Conformance Testing for New Profiles for Professional Applications".)

1.4.1.1.3 JVT-AB202 -M (WG 11 N9971) Draft conformance testing for SVC (V. Bottreau) [2008-10-01]

(Conveyed to WG 11 as "Text of ISO/IEC 14496-4:2004/FDAM 31 Conformance Testing for Scalable Video Coding".)

1.4.1.1.4 JVT-AB203 -M (WG 11 N9975) Draft reference software for SVC (H. Schwarz) [2008-08-01]

(Conveyed to WG 11 as "Study Text of ISO/IEC 14496-5:2001/FPDAM 19 Reference Software for Scalable Video Coding".)

1.4.1.1.5 JVT-AB204 (WG 11 N9978) Joint draft multi-view video coding (MVC) [2008-10-01]

(Conveyed to WG 11 as "Text of ISO/IEC 14496-10:200X/FDAM 1 Multiview Video Coding".)

1.4.1.1.6 JVT-AB205 -M (WG 11 N9976) ITU-T Rec. H.264 | ISO/IEC 14496-10 Advanced video coding defect report (G. Sullivan, H. Schwarz) [2008-08-01]

(Conveyed to WG 11 as "Study Text of ISO/IEC 14496-10:200X/DCOR 1".)

1.4.1.1.7 JVT-AB206 -M (WG 11 N9972) Draft conformance testing for MVC (A. Vetro et al.) [2008-08-31]

(Conveyed to WG 11 as " Working Draft 1 of ISO/IEC 14496-4:2004/Amd.38 Conformance Testing for Multiview Video Coding".)

1.4.1.1.8 JVT-AB207 -M (WG 11 N9974) Draft reference software for MVC (A. Vetro et al.) [2008-10-01]

(Conveyed to WG 11 as " Text of ISO/IEC 14496-5:2001/PDAM 15 Reference Software for Multiview Video Coding".)

1.5 JVT internal output documentsJVT internal output documents included the following. (Dates listed are planned dates of availability.)

2 JVT administrative and liaison topics

2.1 IPR policy reminder and updateParticipants were reminded of the IPR policy established by the parent organizations of the JVT and were referred to the parent body web sites for further information. The IPR policy was summarized for the participants.

Participants were particularly reminded of the need to supply a completed JVT IPR status reporting form in all technical proposals for normative standardization. Participants were also reminded of the need to formally report patent rights to the top-level parent bodies (using the common reporting form found on the database listed below) and to make verbal and/or document

101

Page 102: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

IPR reports within the JVT as necessary in the event that they are aware of unreported patents that are essential to implementation of a standard or of a draft standard under development.

The JVT chair noted that the top-level parent bodies have agreed upon a common patent policy among ITU-T, ITU-R, ISO, and IEC.

Some relevant links for organizational and IPR policy information are provided below:– http://www.itu.int/ITU-T/ipr/index.html (common patent policy for ITU-T, ITU-R, ISO, IEC

and guidelines and forms for formal reporting to the parent bodies)– http://ftp3.itu.int/av-arch/jvt-site (JVT contribution template for each meeting)– http://www.itu.int/ITU-T/studygroups/com16/jvt/index.html (JVT founding charter)– http://www.itu.int/ITU-T/dbase/patent/index.html (ITU-T IPR database)– http://www.itscj.ipsj.or.jp/sc29/29w7proc.htm (SC29 Procedures)

The JVT chair noted that the ITU TSB director's AHG on IPR had issued a clarification of the IPR reporting process for ITU-T standards, as follows (and as previously sent to the JVT email reflector), per SG 16 TD 327 (GEN/16):

“TSB has reported to the TSB Director’s IPR Ad Hoc Group that they are receiving Patent Statement and Licensing Declaration forms regarding technology submitted in Contributions that may not yet be incorporated in a draft new or revised Recommendation. The IPR Ad Hoc Group observes that, while disclosure of patent information is strongly encouraged as early as possible, the premature submission of Patent Statement and Licensing Declaration forms is not an appropriate tool for such purpose.

In cases where a contributor wishes to disclose patents related to technology in Contributions, this can be done in the Contributions themselves, or informed verbally or otherwise in written form to the technical group (e.g. a Rapporteur’s group), disclosure which should then be duly noted in the meeting report for future reference and record keeping.

It should be noted that the TSB may not be able to meaningfully classify Patent Statement and Licensing Declaration forms for technology in Contributions, since sometimes there are no means to identify the exact work item to which the disclosure applies, or there is no way to ascertain whether the proposal in a Contribution would be adopted into a draft Recommendation.

Therefore, patent holders should submit the Patent Statement and Licensing Declaration form at the time the patent holder believes that the patent is essential to the implementation of a draft or approved Recommendation.”

The JVT chair noted (as also previously remarked on the JVT email reflector) that since we are at the completion of the MVC amendment project, it was suggested that if anyone needs to report IPR on that topic and has not yet done so, now would be a good time to file formal notices to the parent bodies for any patent rights that are believed to be essential to the implementation of the MVC extensions (not to mention any notices not previously filed relating to the new SVC profiles, AVC professional profiles, or other previous projects).

It is suggested that, to enable proper interpretation of such formal notices, the MVC amendment should be clearly identified in such formal notices. For example, as “ITU-T Rec. H.264 and ISO/IEC 14496-10 Advanced video coding (2007 Ed.) Amendment 1 (2008): Multiview video

102

Page 103: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

coding”. Notices pertaining to other efforts should be made with a similar degree of clarity of identification of the specific standardization work item to which the declaration pertains.

The chair invited participants to make any necessary verbal reports of previously-unreported IPR in draft standards under preparation and opened the floor for such reports: No such verbal reports were made.

2.2 Meeting opening and remarks by the chairmen

The meeting was opened at approximately 10:00 a.m. on Sunday 20 July 2008.

The JVT meeting will end by lunchtime on Friday 25 July at latest.Our goal for this meeting is to finish all major JVT business by noon on Thursday 24 July, leaving only perhaps some minor "tidying up" work after that.

Document handling and other working practices were reviewed for the participants.

A document template was attached to various email announcements and could also be found at http://ftp3.itu.int/av-arch/jvt-site/JVT-ABxxx.dot. It contained important instructions and policy information. Participants had been instructed to read it and use it as the basis of their contributions.

Documents were made available for download at http://ftp3.itu.int/av-arch/jvt-site/2008_07_Hannover.

The deadline was Wednesday 16 July 2008 for registrations and uploads.

The top areas of contributions were MVC and reports/suggestions for errata corrections.

At the opening session of the meeting, the JVT chairs reminded participants of the relevant IPR policy as described above, and reviewed the status and plans for the major projects under way in the JVT. The largest area of activity consisted of multi-view video coding (MVC) extensions of the ITU-T Rec. H.264 | ISO/IEC 14496-10 Advanced video coding (AVC) standard. SVC work was categorized as "phase 1" or "phase 2", depending on whether the work related to the recently-designed initial SVC amendment or to a potential future further SVC extension.

Opening remarks:– IPR policy reminder– Professional profiles – follow up work on reference software and conformance– SVC phase II – work areas included investigation of bit depth, color gamut, and chroma

format scalability and fine-granularity scalability– Multiview video coding (MVC) was a major project underway, with the goal of finishing the

amendment text for ISO/IEC as an output of this meeting– Corrigendum work is needed, and was a major priority for this meeting

Further work and additional needs on the development, standardization, and maintenance of the base specification and the recently-completed SVC and professional profiles, and of associated reference software and conformance specifications was noted.

The chair remarked that there were few late document uploads this time, and that the submitted documents seem to be generally adhering to the JVT guidelines in terms of formatting, filenames,

103

Page 104: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

etc., which is a good development, although further improvement (particularly in the formatting conventions) is still needed. The JVT operating rules on that subject have helped.

2.3 JVT communication practicesJVT documents were available at http://ftp3.itu.int/av-arch/jvt-site.

These can also be accessed via ftp with the site name ftp3.itu.int, user ID avguest and password Avguest. Upon login, documents will then be found in the directory "jvt-site". Uploading of contributions is done by upload via ftp protocol to the "jvt-site/dropbox" directory using this account ID and password.

JVT email lists are managed through the site http://mailman.rwth-aachen.de/mailman/options/jvt-xyz, and to send email to one of these reflectors, the email address is "[email protected]", where "xyz" corresponds to– "experts" for general experts group discussions– "bitstream" for bitstream exchange activities– "svc" for SVC work– "mvc" for MVC work

2.4 Scheduling and logistics notesInformation about the meeting, including participation registration information, had been made available at http://mpeg.tnt.uni-hannover.de/index.php.

Participants had been reminded of the need to be properly qualified to attend. Interested parties had been instructed to contact the JVT management team if they sought clarification of what proper qualification entails or if they needed help learning how to obtain such qualification.

JVT document registration and contribution archiving followed ordinary JVT practices.

Requests to register documents were handled by email to Gary Sullivan ([email protected] or [email protected]).

The JVT work was suspended during plenary sessions of the ISO/IEC JTC 1/SC 29/WG 11 (MPEG) parent body.

Some “break-out group” (BoG) side activities and informal study efforts were conducted during the meeting. Documents produced by break-out group activities (if any) are listed in this report with the abbreviation “BoG” and are suffixed with "-B".

Note that the JVT has agreed that no late-uploaded (non-AHG-report, non-liaison, non-verification) contribution will be presented without having a minimum of 4 non-affiliated JVT participants from different organizations recorded by name as supporting the allowance of such a presentation, in addition to a consensus of the general JVT membership to allow the presentation. Additionally, the provider of a presented late contribution must send an email apology to the JVT email reflector. Additional details on this topic are described elsewhere in this report.

2.5 Administrative documents

Administrative & AHG report topics

104

Page 105: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

2.5.1.1.1 JVT-AB000 List of documents of Hannover meeting

2.5.1.1.2 JVT-AB001 (AhG) [G. J. Sullivan, J.-R. Ohm, A. Luthra, T. Wiegand] AHG Report: Proj mgmt and errata

2.5.1.1.3 JVT-AB002 (AhG) [T. Wiegand, K. Suehring, A. Tourapis, T. Suzuki, G. J. Sullivan] AHG Report: JM text, ref soft, bitstream, conf

2.5.1.1.4 JVT-AB003 (AhG) [H. Schwarz, J. Vieron, T. Wiegand, M. Wien, A. Eleftheriadis, V. Bottreau] AHG Report: JSVM text, S/W, conf

This document presented the report of the AhG on JSVM text, JSVM software, and conformance.

The text of the Joint Scalable Video Model was not modified relative to JVT-X202 (JSVM-11).

The JSVM software has been improved relative to JVT-Y211/JVT-Z203. Known bugs have been fixed. Adoptions of the last JVT meeting and some new features (which were missing) have implemented into the software. The latest version of the JSVM software is JSVM 9.13.1 (CVS tag: JSVM_9_13_1).

Changes relative to JVT-Y211/JVT-Z203 (CVS tag: JSVM_9_12_2): Several bug fixes

o derivation of intra prediction modes in AVC rewrite modeo direct mode in enhancement layerso field picture coding (base layer only)

Several (minor) corrections related to decisions of the last JVT meeting in Genevao prefix NAL unit syntaxo removal of filler prefix NAL unitso fix in residual upsamplingo derivation of RestrictedSpatialResolutionChangeFlago I_BL mode in loopfilter of MBAFF frameso border extension before intra upsampling for interlaced

Correction of setting level_idc in encoder (included added support of level 1b) Full support of scaling matrices in encoder and decoder (including spatial & SNR

scalability and including a new encoder option for specifying scaling matrices) Full IPCM support in encoder and decoder (including spatial & SNR scalability) Correction of DPB implementation in decoder in particular regarding MMCO commands

for field pictures (more or less completely rewritten) Modification of coding order of pictures in encoder in particular for low-delay

configurations (supports "real" low-delay coding) Encoder support for (high-delay) hierarchical P pictures Encoder and decoder support for long-term pictures (requires smaller DPB for

hierarchical coding structures) Configuration file parameter for specifying the profile Check of profile constraints added in encoder Check of several level constraints added in encoder

o Maximum number of motion vectorso Minimum block size for bi-predictiono Maximum slice size for Scalable Baseline & Scalable High profileo Support of interlaced coding tools (frame_mbs_only_flag == 0)

105

Page 106: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

o Support of CABAC & 8x8 transform in Scalable Baseline Some corrections in software manual

Required additions/corrections for future version: check/correction of SNR scalability with Mbaff frames (currently this is considered as

spatial scalability in the software) support of field pictures in scalable configurations in encoder (currently only correctly

supported in single-layer coding) addition of remaining profile/level constraints in encoder (add checks in decoder?) decoder support for AVC rewrite mode in connection with incomplete layer

representations (as adopted at the Geneva meeting in April 2008) error detection & concealment (without the restriction to two layers and fixed prediction

structures that was found in the previously implemented version)

Further desired improvements of the software: general improvement of encoder structure, including

o using same functions for P and B pictures ("key" and "non-key" pictures)o using same functions for "slice mode"o combine "motion estimation", "motion compensation", and "residual encoding"

support of more flexible encoder configurations, includingo GOP structures with non-dyadic prediction structureso combination of MGS and CGS codingo higher flexibility of "MGS vector mode"

In order to keep track of the changes in software development and to always provide an up-to-date version of the JSVM software, a CVS server for the JSVM software has been set up at the Rheinisch-Westfälische Technische Hochschule (RWTH) Aachen. The CVS server can be accessed using TortoiseCVS, WinCVS, or any other CVS client. The server is configured to allow read access only using the parameters specified in the table below. Write access to the JSVM software server is restricted to the JSVM software coordinators group.

authentication: pserverhost address: garcon.ient.rwth-aachen.depath: /cvs/jvtuser name: jvtuserpassword: jvt.Amd.2module name: jsvm or jsvm_red

Example 1 shows how the JSVM software can be accessed by using a command line CVS client.

Example 1: Accessing the JSVM software with a command line CVS clientcvs –d :pserver:jvtuser:[email protected]:/cvs/jvt logincvs –d :pserver:[email protected]:/cvs/jvt checkout jsvm

In Example 2, it is shown how a specific JSVM software version – specified by a tag (JSVM_9_8 in Example 2) – can be obtained using a command line CVS client. Note that co represents an abbreviation for the command checkout, which was used in Example 1.

Example 2: Accessing the JSVM software version with the tag JSVM_9_8 with a command line CVS clientcvs –d :pserver:jvtuser:[email protected]:/cvs/jvt logincvs –d :pserver:[email protected]:/cvs/jvt co –r JSVM_9_8 jsvm

It is possible to checkout only a reduced JSVM software package by using the module name jsvm_red instead of jsvm. In this case, the directories JSVM0-config-sample and MVC-Configs are omitted in the checkout, see Example 3.

106

Page 107: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Example 3: Accessing the JSVM software without the JSVM0 and MVC directories.cvs –d :pserver:jvtuser:[email protected]:/cvs/jvt logincvs –d :pserver:[email protected]:/cvs/jvt co jsvm_red

The CVS repository includes a JSVM software manual, which provides further information on the JSVM software.

The text of the conformance document "Draft conformance testing for SVC" was submitted as JVT-AA205. A revised version of the conformance draft was provided as input to this meeting as document JVT-AB023.

2.5.1.1.5 JVT-AB004 (AhG) [A. Segall, T. Wiegand, Y.-J. Chiu] AHG Report: SVC bit depth, color gamut, and chroma format

This contribution reported on activity for SVC bit depth, color gamut, and chroma format. The mandates of the AhG were: – Identify applications– Work out suggestions for detailed needs– Find/create test material– Study bit-depth reduction techniques, e.g., tone-mapping tools– Study color space and/or gamma conversion requirements– Study combined spatial and bit depth scalability– Define experiments and test conditions– Investigate software and text modification needs– Identify complexity issues

There was no activity reported on the reflector.It was noted that there was one relevant input contribution on the topic: JVT-AB029.

The AhG recommended the following: To review related contributions during the meeting To continue the AhG

2.5.1.1.6 JVT-AB005 (Ahg) [H. Kimata, A. Smolic, P. Pandit, A. Vetro, Y. Chen] AHG Report: MVC JD & JMVM text & software

At the Geneva meeting, JVT established the AhG on MVC JD and JMVM text and software with the following mandates:

1. Collect comments on draft, perform necessary editing and delivery.2. Maintain JMVM and JD document and collect comments on the text.3. Coordinate JD/JMVM software integration4. Coordinate bug-fixing process for the JD/JMVM software5. Maintain JD/JMVM software manual

The JMVM8 and JD7 were submitted to JVT as JVT-AA207 and JVT-AA209, respectively.

The JMVM8 text reflected changes described in the following contributions: JVT-AA028 [Huawei] Comment on Motion Skip JVT-AA031 [LG] High level syntax for motion skip mode JVT-AA047: Breakout Group Proposal for JMVM Motion Skip

107

Page 108: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

It was reported that there had been several major changes made to the JD7 text. The changes were a result of input contributions and break-out group discussions during the Geneva meeting. The most significant changes are summarized below. Comments on JD [Ref: JVT-AA020 / JVT-AA021 / JVT-AA033 / JVT-AA046]

o Level limits and decoding resource requirements not be specified in terms of the number of views

o Bug fix to signal idr_pic_id in slice header (revised condition in 7.3.3 slice header syntax)

o Added text for SPS/PPS activation, NAL unit order/associations, detection of first VCL NAL unit

o Bug fix in NAL unit header SVC MVC extension syntax – no need to advance 3 byteso Remove max_num_view_depo Apply decoding process for POC independently for each viewo RPL construction process with discarded view components (text added to H.8.2)o Remove RPL construction for anchor pictures (text removed from H.8.2.1)o Specify sub-bitstream extraction for more than one target viewo For sub-bitstream extraction, added specification to ensure base view in output. o Assign SEI payload types

New SEI Messageso JVT-AA034: Non-required view component SEIo JVT-AA035: View dependency change SEI and operation point not present SEI

Other changes:o Aligned the structure of the syntax and semantics subclauses in Annex H to the base

spec – therefore, the syntax and semantics of NAL unit header extension and subset SPS, and the semantics of slice header, have been moved to Annex H.

o Updated Table 7-1 for description of different NAL unit types as well as VCL and VCL NAL units categarization, made corresponding changes in the base spec and Annex G.

o Added some new definitions, namely anchor access unit, anchor view component, decoded view component, IDR picture, IDR view component, inter-view prediction reference, inter-view reference index, non-anchor access unit, non-anchor picture, non-anchor view component, and reference picture. Removed the definition of V-IDR picture as it turned out to be useless in the new context. Also updated some of the existing definitions. Renamed inter-view only reference picture to inter-view only reference component.

o Updated NAL unit extension syntax (the order of syntax elements and name of the reserved bits syntax elements have been changed) and semantics, including some bug fixes (e.g. the first bit of the NAL unit header exttension shall be 1 for SVC and 0 for MVC, but was the other way round).

o In the subset SPS syntax, one bit equal to 1 was added before the syntax strucutre seq_parameter_set_mvc_extension( ), to make the subset SPS syntax compatible to the SVC subset SPS syntax specified in Annex G.

o Fixed the conditions related to slice_type in the ref_pic_list_reordering( ) syntax table.o Updated semantics for SPS MVC extension syntax elements and specified the

persistence and accuracy of these syntax elements.o Updated the reference picture list reordering semantics with both editorial

improvements and bug fixes – the bugs are related to the use of syntax element num_anchor_refs_lX[ i ].

o Updated the text for the reference picture list construction process, with both editorial improvemens and bug fixes – the bugs are related to the use of syntax elements num_anchor_refs_lX[ i ], anchor_ref_lX[ i ][ j ], num_non_anchor_refs_lX[ i ], and non-anchor_ref_lX[ i ][ j ] in subclause H.8.2.1.

108

Page 109: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

o Updated the specification of the Multivew High profile regarding restriction to the base view bitstream and definition of the conforming decoders.

o Added exentions to semantics of AVC and SVC SEI messages (mainly regarding being not included or included in an MVC scalable nesting SEI message).

o Changed the name of scalable nesting SEI message to MVC scalable nesting SEI message to avoid using the same name as the one for SVC, and updated the semantics.

Some remaining open issues related to the JD text include: Revisit limit of 16 on max decoder frame buffering Review following text

o SPS/PPS activation, NAL unit order/associations, detection of first VCL NAL unito Sub-bitstream extraction processo All level limits

Semantics of profile, level and HRD parameters etc. in SPS and subset SPS and whether to signal additional sets of profile and level in SPS and subset SPS

Various editing notes in the text need to be resolved Consider relevant SVC errata [JVT-AA025]Resolved as reported elsewhere in this document.

The JMVM8 software had been delivered to the group on 6 June 2008 as JVT-AA208. This release contained:

High-level syntax changes to accommodate adoptions related to motion skip tool Software improvements and bug fixes

The JD reference software had also been produced, which reportedly corresponded to the JD text, i.e., no block level coding tools. This software was delivered on 6 June 2008 as JVT-AA212.

Some software issues that still need to be addressed were reported to include:1. Output order of views is not sequential or parallel2. Prepare validation scripts (work in progress)3. Provide support GOPsize=1 (with motion skip & IC)4. Trace file support for arbitrary view_id assignments

The AhG recommended1. To consider the input to this meeting in preparing future versions of the JMVM and JD2. To improve the manual created for the JD/JMVM software3. To follow the same software integration guidelines present in JSVM (repeated below)

2.5.1.1.7 JVT-AB006 (AhG) [Y. Chen, S. Gao, Y.-J. Jeon] AHG Report: MVC JMVM coding tools

The mandates of this AHG reported in the previous meeting are reused here. – Investigate simplification and improvement of current JMVM coding tools (IC and motion

skip)– Investigate techniques for single loop decoding to reduce complexity starting with motion

skip– Investigate approaches for enhancing MVC coding efficiency using spatial downsampling– Investigate low-complexity methods for mobile stereoscopic 3DTV applications– Investigate other potential approaches to achieving enhanced MVC capability– Coordinate software, test material, and experiment conditions for these techniques

109

Page 110: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

– Evaluate performance of enhanced MVC proposals (including CAVLC operation in particular)

No email messages were reported to have been exchanged on this topic on the reflector.The only contribution identified by the AHG as relevant to this AHG in this meeting was JVT-AB030.

The AHG recommended to review this contribution and to discuss the mandates and the need for having such an AHG in the future.

2.5.1.1.8 JVT-AB007 -M (AhG) [G. J. Sullivan, A. Rodriguez, S. Narasimhan] AHG Report: Splicing operation

2.6 Closing session notes

In the closing session there were no requests to reopen discussions of preceding agenda topics and side activities recorded elsewhere in this report.

The JVT thanked its ISO/IEC JTC 1/SC 29/WG 11 (MPEG) parent body and the meeting organizers for hosting the 28th JVT meeting.

The meeting was closed at approximately 5:30 p.m. on Thursday 24 July 2008.

2.7 JVT liaison communications and parent-body communications

The JVT did not directly receive formal liaison communications at this meeting. However some JVT contributions were originated as National Body input liaison contributions to the WG 11 parent body as discussed below.

2.7.1.1.1 JVT-AB014 / M15538 [DVB TM-AVC] Incoming LS to WG 11 on Constrained Baseline

This document was an incoming liaison statement from DVB TM-AVC regarding constraints on AVC Baseline profile.

DVB TM-AVC indicated that they are pleased to learn that ISO/IEC JTC 1/SC 29/WG 11 has studied the references to using H.264/AVC in DVB applications, as specified in ETSI TS 102 005 and ETSI TS 101 154. They reported that these specifications have included the option to use H.264/AVC coding since 2005.

They indicated that they welcomed our provision of the "constraint_set1_flag" syntax element of H.264/AVC, to permit a hierarchy of strict supersets and subsets between the High, Main and Baseline profiles. This strict hierarchy was reported to be important in DVB applications, e.g. to ensure that bitstreams broadcast to mobile receiving devices may also be decoded by fixed SDTV or HDTV receivers. They stated that they are therefore happy to confirm that they have chosen to utilize the "constraint_set1_flag" syntax element, for the reasons explained in Annex A (informative) of TS 102 005. The relevant extracts were reproduced in the contribution, for our convenience.

110

Page 111: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

The document does not directly address the topic of whether the JVT should define a "constrained baseline" capability or profile.

2.7.1.1.2 JVT-AB015 / M15554 [3GPP TSG SA4] Incoming LS to WG 11 on Constrained Baseline

This documents was an incoming liaison statement from 3GPP TSG-SA4 regarding constraints on AVC Baseline profile.

They report that they have indeed adopted AVC into four of their services: Packet-switched Streaming service (PSS), Multi-media Messaging Service (MMS), Multicast-Broadcast Multimedia Service (MBMS), and Multimedia Telephony Service for IMS (MTSI).

They report that they have examined their records and document archive, and there were two major motivations for asking that bitstreams be compatible with both Main and Baseline profiles:– there was a significant desire for broad bitstream interoperability, in particular that bitstreams

generated in or for 3G services should be broadly playable; specific mention was made of the desire to be able to play streams on set-top boxes and similar devices, which they expected would be Main profile;

– the tools that are present in Baseline but not in Main were mostly targeted towards error resilience, and in the environments in which errors do not occur, they add complexity without returning value.

Following on from point (a), they reported that they had some thought that in the future they might want to use Main profile (as terminals become more capable), and having the streams be Main-profile compatible would ease such an introduction.

Taking into consideration the following:1) that the AVC profiles do not ‘nest’, that is, bitstream compatibility with one profile

cannot be taken as implying compatibility with another;2) that there is no ‘compatibility list’ provided to indicate the compatibility of a bitstream

with multiple profiles (other than the constraint_set flags);3) that 3G bitstreams are reportedly probably most naturally considered as Baseline streams;4) that the JVT provided the means to signal compatibility with more than one profile, in the

part of the specification where profiles are signaled;they reportedly decided to ask that bitstreams be marked as Baseline, compatible with Main profile. They reported that they did not see any other way of doing this, and the way chosen used indicators provided in the AVC specification at the level of the signalling of profile and level. (For historical reasons, 3G specifications are reportedly written in terms of the decoder rather than the bitstream, but it was asserted to always be clear that decoders may exceed these requirements in any way.)

They reported that they appreciate the recent definition of a name for these bitstreams that are bi-compatible (“constrained baseline” streams). They indicated that they are not sure how to further improve the situation, as they are concerned that any change of signaling (e.g. to introduce a list) or profile (e.g. to introduce a new profile indication) could lead to incompatibility with existing terminals or bitstreams.

In summary, for our immediate purposes, the document indicates that they are pleased to have a definition of the "constrained baseline" name for a bitstream/capability class. It does not directly address the topic of whether the JVT should specify this configuration to be a profile (other than to say that they do not want a change of signaling – which no one was ever contemplating in any case).

111

Page 112: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

2.7.1.1.3 Discussion of "Constrained Baseline" issueIn JVT discussions of JVT-AB014 and JVT-AB015 and the issues surrounding this topic, there was a desire expressed by the group to define a new Constrained Baseline profile corresponding to the capability/bitstream class currently drafted in the draft corrigendum as "constrained baseline". The envisioned benefits of defining such a profile would be

1) to enable implementations of the subset of capabilities currently-specified in these widely-deployed external standards to claim conformance to a profile of the standard by specification of a conformance point that corresponds to the capability set defined in these standards, and

2) to enable additional external specifications to specify selection of a profile of the standard which corresponds to a maximal compatibility subset of the capabilities of the existing profiles

3) to enable the specification of the video surveillance MAF in WG 11 to specify selection of this profile.

It was clear in this discussion that definition of such a profile would not involve any new value of profile_idc (or any other change of syntax relative to what is found in the DCOR as the definition of "constrained baseline"). This new profile should presumably be specified by the amendment approval process in ISO/IEC and the AAP process in ITU-T. ITU-T Consent could be achieved in October 2008 or February 2009.

Regarding the text of the DCOR, the group suggests leaving it as previously drafted, so that the new amendment would only need to modify the text to add the use of the terms "profile" and "conformance".

Agreed (to be coordinated with WG 11).

2.7.1.1.4 JVT-AB016 / M15711 [ATSC] Incoming LS to WG 11 on AVC Profiles

The ATSC reported that it is developing specifications for a new service to deliver video to mobile and handheld devices, known as ATSC M/H. A tentative decision has reportedly been made to employ AVC coding, Baseline Profile at level 1.3, with one picture format of 416 by 240 luma samples (WQVGA), with frame rates from 12.0 to 30 fps. These choices were made based on input from device manufacturers who indicate they can support early market entry of a large number of receiving devices.

Broadcasters have reportedly expressed interest in transmitting higher resolution (up to wide SDTV) pictures over ATSC M/H. A tentative decision has also reportedly been made that higher resolution may be handled by use of SVC.

However, concerns have reportedly been expressed that by restricting the video tools that may be used (e.g. no 'B' frames), and relying on SVC as a path to higher resolution, some inefficiency may be designed in that could lead to a long term under-utilization of precious data bandwidth. Before decisions are final, broadcasters would reportedly like to better understand the consequences to eventual bit rate possibilities.

ATSC would welcome any information that MPEG could provide to them as to the possible increased efficiency (compared to Baseline Profile, Level 1.3, 416x240 pixel image format constrained for SVC) that could be obtained by choosing a set of AVC tools that would give us more efficient coding (e.g. use of 'B' frames, weighted prediction, directly coding wide SDTV vs using SVC, etc.).

112

Page 113: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

ATSC would like to reference a specific MPEG AVC Profile in the M/H specification. If a set of AVC tools that do not fit within an existing AVC Profile are the best solution for the M/H application, they ask if MPEG would consider defining a new Profile that ATSC could reference in its standards. If such a new profile option would be considered, they ask in what time frame this work could be completed.

Remark: It was suggested that the context of the intent was that picture aspect ratio would always be approximately 16:9, always have approximately square luma samples, and always be "progressive" scan. Further, it was suggested that the desire would be to have standardization completed in WG 11 and ATSC by the end of 2009, and to have devices actually deployed by that time. It was suggested that the total bit rate for the low-res and SD video might be about 0.5 Mbits/s.

Basic aspects of contribution:– Interest in greater coding efficiency than Baseline– Initial focus on 390-macroblock pictures, with longer term interest in SD– Some interest in SVC

Suggestion: Respond saying:

The use of the High profile may be appropriate. Our understanding is that very low cost, low power decoder implementations of the High profile are already available in the marketplace. For example, we have located product announcements for High profile decoding such as:– an implementation for 1920x1080p60 decoding that uses 160 mW,– an implementation quoting 1080p using 120 mW,– a reference to 3.5G and 4G mobile implementations of high bit rate High profile 1080p

decoding in 2009-2011,– an announcement referring to HD decoding using 45mW,Further such information should be discoverable with somewhat more study. Note that support for 1920x1080p60 would include support for a sample processing speed ratio that is higher by a factor of more than 40 relative to the requirements for 416x240x30.

Regarding compression capability, some information that we have received suggests that about a 40-60% bit rate savings for use of High profile rather than Baseline profile might be expected for material such as the 416x240 use that they describe (e.g., per JVT-N014 of Jan 2005, and we note that encoding technology for use of High profile features has advanced substantially further since that time).

If the High profile does not meet the needs of your application, we are prepared to collaborate further with you to study the subject and take appropriate action as necessary. Although we suggest that no new profile specification should be needed, in the event that this initial assessment is incorrect, the ISO/IEC approval process (and typical parallel ITU-T approval processing) for a new profile specification would ordinarily take 1-1.5 years after the decision to create a new profile and the determination of the detailed requirements of the profile design.

Selection of the High profile for lower resolution use would not preclude the use of scalable video coding (SVC) with this lower resolution video as the base layer, as the AVC specification includes a Scalable High profile that is specified to operated in that fashion.

We encourage further study and communication regarding the detailed application requirements, such as bit rates, image sizes, frame rates, etc. for the combined usage scenarios for the SD delivery in conjunction with WQVGA over ATSC M/H.

113

Page 114: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Agreed (to be coordinated with WG 11).

3 AVC base specification, errata, and related topicsThe latest prior errata reporting status was provided in JVT-Z210.

3.1.1.1.1 JVT-AB010 / VCEG-AI07 (Errata) [HHI] ITU-T AAP Last Call comments on H.264(2007)/Cor.1

This contribution contains various errata comments that were submitted as ITU-T AAP Last Call comments. Notes on particular topics in the contribution are provided below.

Regarding MMCO = 3, just delete "short-term" and add "(equal to 3 or 6)". In the following NOTE that contains "shall"s, change the "shall"s to "must"s and pluralize "field" at the end of the NOTE, and rephrase the extra NOTE as reviewed.

SPS activation for SVC – the SPS_id in the slice header can be different than the one in a BP SEI. The issue relates to the MGS feature. The precision of the position calculation depends on the level_idc, which may be different for different MGS layers. Two suggested approaches– use the level_idc of the layer with quality_id = 0 to determine the position calculation

precision (decouple the active SPS for HRD purposes from the active SPS for position calculation purposes)

– make the precision depend on frame size (width or height greater than 2048 or number of macroblocks greater than what's in 1920x1088 = 8160 Mbs) rather than level_idc.

Pick the first of those two options.

Active layer SPS should be activated by BP SEI – not just VCL NAL units. Agreed – seems obvious.

Removal of redundant constraint – yes – editorial.

Restriction for intermediate layers – yes – obvious.

Intra prediction constrained_intra_pred_flag and I_PCM – yes – obvious.

Clarification of overscan – minor reword – agreed.

4:2:2 and 4:4:4 – editorial/hypothetical – agreed.

Number of MVs per 2MB and bipred – 8x8 inference – touchy, but yes – agreed.

Motion constrained slice group SEI needs num_slice_groups_minus1 for parsing – add a NOTE suggesting to store the RBSP until the active PPS is known, then parse – agreed.

Two rewriting issues – no action needed on either item.

End of sequence NAL unit in SVC context – all layers must have IDR after that – agreed.

Persistence of SVC SEI messages – needs further study by editors. E.g., scalability information SEI persists forever – say that it ends if there's IDR at all layers.

114

Page 115: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Discretion given to editors for determining aspects not noted and finalizing according to their best understanding.

3.1.1.1.2 JVT-AB011 / VCEG-AI08 (Errata) [Microsoft] ITU-T AAP Last Call comments on H.264(2007)/Cor.1

This contribution contains various errata comments that were submitted as ITU-T AAP Last Call comments. Notes on particular topics in the contribution are provided below.

Comment 5(c): Seems to only be an issue when using Tools->Options->View-All, so it may not be necessary to fix.

Comment 5(k): Specify that the decoder shall ignore the content of the SEI message in those cases.

Comment 5(m): Re-examine aspects relating to "overscan", including in particular "do neither use".

Comment 5(n).i: We may have too many cases like that to try to fix them all, and in this case you can include the opening parenthesis in the search.

Comment 5(n).iii: May have already been fixed.

Comment 5(o): Maybe that's OK as-is.

Further details to be addressed in editing work.

3.1.1.1.3 JVT-AB012 / VCEG-AI09 (Errata) [Sony] ITU-T AAP Last Call comments on H.264(2007)/Cor.1

See discussion below regarding JVT-AB013.

3.1.1.1.4 JVT-AB013 / M15558 (Errata) [JNB of WG11] WG 11 NB comments on ISO/IEC 14496-10:200X/DCOR1

Discussed together with JVT-AB012.

These contributions advocate not to delete certain problematic level limits that had been specified to apply to (only) the first access unit of the bitstream. A.3.3.a and G.10.2.2.a: Number of slices in access unit 0. A.3.1.c, A.3.3.h, G.2.2.h: Number of bytes for access unit 0.

Remark: The problem of excess load possibility was sort of already there, since timing information could be stretched by an encoder to cause "hostile" bitstream characteristics.

JVT Disposition: These limits will not be entirely removed, however, they will not be made overly tight either – exact details are left to be worked out by the editors and interested parties and editors are to put their best effort outcome into the draft.

3.1.1.1.5 JVT-AB021 (Prop 2.2/3.1) [K. Kazui, A. Nakagawa, S. L. Bhavani, S. Kobayashi (Fujitsu)] Modification of POC type 1 specification

115

Page 116: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

At the previous meeting in Geneva, JVT-AA022 it was asserted that the current specification of AVC does not always guarantee correct recovery of relative POC value when POC type 1 is used and a decoder starts the decoding process from a recovery point rather than an IDR picture. JVT-AA022 proposed three candidate actions to address the issue.

Remark: Because the semantics of the recovery point SEI message include a "promise" made to the decoder that the decoder can recover the correct pictures if it operates in a certain manner, therefore an encoder is not allowed to put the RP SEI message into the bitstream if such a use would create a non-recovery behavior. Agreed.

Since no prescription has been established in the text, the decoder should, in principle, able to initially guess any value of POC within the 32 bit range for the current picture and for any reference to a non-existing picture, and allocate a greater than 32 bit dynamic range to its POC estimate for subsequent delta accumulations and then everything must work properly once the recovery_frame_cnt has gone by. (It is also hypothetically possible to keep the value within a 32 bit range by adjusting the value of POC on every picture in the DPB by an offset whenever a value would otherwise be created that would violate the 32 bit range.)

It is the encoder's responsibility to ensure that no matter what value of POC the decoder initially guesses, once the recovery_frame_cnt has gone by, everything must work.

Add a NOTE to the spec to clarify this understanding.

The encoder must not (i.e. shall not) send a RP SEI message that would put the decoder into an undefined output state that persists beyond recovery_frame_cnt.

The RP SEI establishes a promise. The promise must be fulfilled.

Agreed.

3.1.1.1.6 JVT-AB036 (Errata 2.2.1/3.1) [Y.-K. Wang (Nokia)] SVC errata items

Some issues reported here were covered in the review of JVT-AB010.

Editor asked to study/resolve use of "RBSP" terminology relating to parameter sets. Agreed.

Editor to ensure proper specification of constraint on "active SPS" as well as "active layer SPS". Agreed.

Profile definitions – decoder conformance requirement for decoding ordinary AVC bitstreams should specify decoding of all ordinary compatible AVC bitstreams in a manner equivalent to what applies in Annex A. Agreed.

On restrictions to scalable nesting SEI message clarify that the "less than any of the particular values…" refers to the values of ( sei_dependency_id[i] << 4 ) + sei_quality_id[i], not the value of i. Agreed (insert commas around "for all present i" or otherwise restructure logic to clarify).

116

Page 117: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

4 Scalable video coding (SVC)

4.1 SVC bit depth, color gamut, and chroma format scalability

4.1.1.1.1 JVT-AB029 (Req 2.2/3.1) [Y.-J. Chiu, H. Jiang, Y.-T. Peng, L. Xu (Intel)] Requirements for scalable video coding: color gamut scalability and bit depth scalability

This document considers a new application scenario for video coding in which color gamut scalability of consumer electronics product is asserted to be desired. The architecture of such a system design was presented, which includes modules of color gamut compression and color gamut expansion to achieve the color gamut scalability in the framework of SVC. Suggested requirements for color gamut scalability were described, and preliminary results that aim to demonstrate the benefits of the wide color gamut video at the receiver sides were shown. The document also suggests that higher bit depth formats are required for the video conveyed in the enhancement layer to increase the description of the level of color fidelity contained in the wide color gamut video.

Remark: Part of the asserted motivation is a quality assurance motivation, as opposed to an interoperability enabling motivation. Some systems will do a good job of gamut mapping if they receive out-of-gamut colors (and some will not) – that seems like a question of product quality assurance motivation rather than interoperability-enabling standardization motivation.

Counter-remark: But there is an issue here of trying to deploy new technology into a domain in which there is widespread use of existing technology that was not designed to accommodate the presence of the new signals.

The proponent indicated that there is a waste of bandwidth associated with sending wider color gamut representation to a limited-gamut receiver.

Question: But how much of a waste is that? Most scenes don't contain those extreme colors, and even for those that do, how many bits would be saved by mapping them to a narrower color gamut before encoding? Response from proponent: This is scene dependent – the bit rate savings potential may need study.

Envisioning a greater future emphasis on 4:4:4 coding and increased bit depths, the proponent indicated that the difference may become more important in such scenarios.

Examples were shown of pictures that look different, where the difference was asserted to be due to the limited gamut range of the BT.709 color primaries, coupled with a poor use of mapping (i.e., component-by-component clipping rather than proper color mapping) prior to limited-gamut display.

It was suggested that it would be preferable to design a transmitting system to perform the color mapping gamut limitation prior to encoding as a base layer, and then send the differences that extend the color gamut range as an enhancement layer.

Color gamut scalability had already been included in the scope of the relevant AhG work area.

Remark: Some of the demonstrated differences seem to have to do with 4:2:0 sampling rather than color gamut range.

117

Page 118: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Remark: We could offer a package of scalable features that would provide an enhancement of the sort that could not be provided by only increasing the bit rate/fidelity of the same representation. Consider:– extended gamut– increased bit depth– tone mapping– 4:4:4 enhancement of 4:2:0 or 4:2:2

Remark: Such an enhancement layer could be a separate business model as well as a separate enhancement bitstream.

Remark: What is the application? Responding remark: Potentially, optical disc (e.g. Blu-ray).

Remark: Why would such an application need scalability? Could someone just fit an extra single-layer bitstream on a Blu-ray disc.

JVT Disposition: Further study the requirements and determine the level of industry interest in such a scalable approach to delivery of these features.

4.2 SVC Conformance

4.2.1.1.1 JVT-AB023 (Ed.

4.2.1.1.2 Draft) [V. Bottreau (Thomson), A. Eleftheriadis (Vidyo)] SVC conformance testing

This contains changes proposed by the editors relative to JVT-AA205 (WG 11 N9755, no formal status in ITU-T), which was the output status of the previous meeting.

Section numbering was corrected to start at 30. Some corrections and updates about particular bitstreams were provided.

Latest bitstream collection status: 55 (75%) of the total of 73 planned bitstreams are available; another 7 bitstreams (10%) are expected to be delivered imminently. More verification of bitstream validity is desired (only verified by JSVM)

Preparation of DoCR to be done.

5 Multi-view coding (MVC)

5.1 Comments on JD

5.1.1.1.1 JVT-AB020 (Prop 2.2/3.1) [Y.-J. Jeon, B.-M. Jeon (LG)] MVC comment on JD

118

Page 119: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

This contribution provided a comment on the MVC JD7.0 that was asserted to be editorial. It was asserted that the current specification does not clearly specify the decoding process of all the view components of the access unit. To clarify and specify the decoding process of all view components of the access unit, a comment describing the decoding process according to VOIdx (view order index) is presented.

JVT Disposition: The proponent was asked to work with the editors to clarify the text as necessary (without technical change). Done.

5.1.1.1.2 JVT-AB022 (Prop 2.0) [J. Huo, Y. Chang, M. Li, H. Yang (Xidian Univ.), S. Lin, S. Gao, L. Xiong (Huawei)] Comment on MVC JD 7.0

Two comments on the MVC JD7.0 were provided in this contribution. The first proposed change was related to the view dependency change SEI message, and a modification was proposed that was asserted to relate to compatibility with the sequence parameter set MVC extension.

IPR status clarification (verbal): The IPR declaration (box 2.0) in the contribution applies to both contributing organizations.

The first proposed change was to replace the lower index of a "for" loop – changing it from 0 to 1 (skipping over view 0). View 0 is the base view, which does not have dependencies, so its inclusion in the syntax loop was concluded to be an error.

JVT decision: Adopted (excess "{ }" curly brackets around single statements in syntax table will also be removed).

The second proposed change was related to the non-required view component SEI message, and a derivation method was proposed for the decoder to derive the non-required view component so that the explicit transmission of the SEI message for every related access unit could be avoided.

It was proposed to suggest that encoders not express dependencies in non-required view SEI messages that would be redundant under the assumption that decoders will be able to infer those dependencies (by recognizing conditions establishing which other components are non-required components, so that the transmission of redundant information can be avoided).

Concern: Dependency is made from SEI messages of other access units. Deriving the information at the decoder may not be simple and require dependency information beyond the SEI message itself.

Remarks: Proposal is to encourage not sending some SEI messages by depending on inferred relationships established from SEI messages of other access units. It was commented that some of these relationships that would need to be inferred might not be so easily derived. It was further commented that the current SEI message semantics would already allow such information to be inferred by a decoder – the text just does not include any remark pointing out this fact. Remark: The degree of redundancy expressed by use of the existing SEI message is already under the control of the encoder.

Suggestion: Have off-line study [Miska, Ye-Kui, and proponent] to consider whether providing only some additional non-normative remarks to point out what the decoder may be able to infer when some information is not directly expressed in the syntax. This discussion was held, and the subject was then further discussed by the group as a whole.

119

Page 120: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

After this off-line study, some text suggested by the proponent was discussed.

Remark: This is about encouraging decoders to derive a relationship that they can already derive if they wish – so it may not be necessary to explicitly point out that this is possible.

Question: How much coding efficiency improvement would result from the encoder not sending redundant expressions of dependencies? It seems to be a very small amount of bits. Remark: And it is not easy to describe this extra dependency derivation.

Remark: There may actually be a benefit in terms of loss resilience and simple design functionality by simply having the minor degree of redundancy in the bitstream.

Although the proposal may be valid in the abstract sense, the foreseen benefit seems very minor, the envisioned decoder processing would need to become more sophisticated and possibly less robust, and when considering the need to finish the specification so quickly, it seems most prudent not to take action in this regard.

5.1.1.1.3 JVT-AB024 (Prop 2.2.1/3.1) [Y. -K. Wang, Y. Chen, M. M. Hannuksela (Nokia)] Comments to MVC JD 7.0

This document proposed to 1) remove the active view information SEI message from the MVC specification, and 2) address extraction of SEI messages in the sub-bitstream extraction process, which was not addressed in the latest MVC draft.

On active view information SEI message: It was proposed to remove the active view information SEI message. This SEI message was designed be included in the bitstream to indicate which views are the target output views, such that the target output views can be known from the bitstream. However, if the same MVC bitstream is used at a different operation point, then it was asserted that the content of the SEI message must be changed. It was asserted that this implies that for multicast with receivers using different operation points, some network elements between the sender and the receivers must change or insert active view information SEI messages into the outgoing sub-bitstreams. Due to this reason (among others), it was reportedly decided not to use this means to derive the set of target output views in the HRD specification in the MVC JD.

Suggestion: In the semantics of the SEI message, we can indicate that target view information may be conveyed by other means determined by the application, and that the determination of what to do in response to the content of the SEI message, if present, is determined by the application.

Suggestion: Remove the SEI message, specify that the target output views are determined by the application (by means not specified in this Specification) and that in the absence of any selection of target output views by the application, there shall be one target output view which is the base view. JVT decision: Agreed.

Definition of "target output view" needs editing work to express that well. See below.

Also add remarks as necessary to other new SEI messages, in the spirit of what is stated for, e.g., the film grain characteristics SEI message, that the actual decoder response to the SEI message is not normatively specified. JVT decision: Agreed.

On the sub-bitstream extraction process: The extraction process proposed in the editors' input (JVT-AB028) does not address SEI messages. In our opinion, all the information needed for

120

Page 121: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

possibly further bitstream subsets, e.g. picture timing and buffering period SEI messages, should be retained after extraction and the extraction process should be appended accordingly. A note could be added to state that real applications can do extraction in other ways, e.g. to keep only those SEI messages required for certain operation points that are likely to be used. JVT decision: Agreed.

5.1.1.1.4 JVT-AB028 (Ed. Draft) [A. Vetro (MERL), P. Pandit (Thomson), H. Kimata (NTT), A. Smolic (HHI), Y. -K. Wang (Nokia)] Editor's Input on MVC

This document included editors’ input on the draft MVC specification. An important addition was the signaling of HRD parameters for bitstream subsets, along with related changes to the HRD subclauses. Other revisions include changes to the MVC scalable nesting SEI to enable signaling of HRD parameters. Some topics reported as bugs in the sub-bitstream extraction process had also reportedly been resolved, the semantics of SPS extension had reportedly been clarified, and some definitions had been added.

Regarding HRD, the MVC editors recommended that the approach documented in this contribution be adopted.

It was recommended that the revisions in the attached document be reviewed in detail in preparation for JD8 output.

The contribution was reviewed, various minor issues (typographical formatting, use of "if" versus "when", a space preceding a syntax structure parenthesis, etc.) were noted and recorded by the editors to be addressed.

JVT decision: Adopted (including suggestions described in included Powerpoint presentation deck – with minor modifications as recorded by the editors during the review) with number of views and max_dec_frame_buffering issues dealt with as discussed below in section on JVT-AB035.

5.2 MVC profiles

5.2.1.1.1 JVT-AB035 (Prop Prof 2.2/3.1) [H. Kimata (NTT), T. Nomura (Sharp), H. Nakamura (JVC), T. Itoh (Fujitsu)] On Multiview High profile

A revised version will be uploaded to clarify the IPR statement status. Done.

Target applications and comments on the related requirements of level constraints of Multiview High Profile were presented, relative to the current MVC draft.

Target applications for Multiview High Profile were discussed. Target applications were described as interactive video applications, such as panorama video and FTV / free viewpoint video, as well as multi-view 3D video. In addition, it was discussed how to use MVC in those applications, and parameters such as the number of views were discussed. Based on these parameters, comments were made on the level constraints of Multiview High Profile, relative to what is specified in the current MVC draft.

Particular points raised in the contribution:

121

Page 122: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

– Requirements on the number of views to be decoded (see presentation slide with that title). Addressed by actions noted elsewhere.

– Limitation of 16 on max_dec_frame_buffering. Is this an expression of a limit on a capacity of storing pictures (each of which may contain multiple view components) or is it a limit on a capacity of storing view components? Remark: In the current draft, it is the latter. Let's keep that convention. This becomes part of the discussion of structuring of level definitions. See below.

– The value of num_views_minus1: In H.7.4.2.1.4 (Sequence parameter set MVC extension semantics), it was reported that there is a NOTE 2 for num_views_minus1, as follows: "NOTE 2 - This value should be changed by the sub-bitstream extraction process as specified in subclause H.8.3 to correspond with the exact number of views in a bitstream subset." Because NumViews reportedly affects the level constraints, it was suggested to change this to a normative specification. Remark: this syntax element should be understood to express an upper bound rather than an exact representation of the characteristics of the bitstream subset. JVT decision: Agreed – clarify in text.Question: Should we make this a u(n) rather than a ue(v) to enable changing the value without changing the coded number of bits (with n-1 signaled)? Remark: Wait – a loop counter depends on this value and the specified process will not work correctly if the value is changed.Conclusion: So the quoted NOTE should be removed. JVT decision: Agreed.Question: Change num_views_minus1 to num_views_minus2? No.

– The value of level_idc: Do we allow changing the value of level_idc by the sub-bitstream extraction process? See below.

– Calculations in the a) and b) level constraints: It was reported that there are calculations using mvcScaleFactor in a) and b) level constraints, as follows: Max( NumViews * PicSizeInMbs mvcScaleFactor * MaxMBPS, fR ). Does mvcScaleFactor apply to MaxMBPS? If so, it was suggested to change this to Max( NumViews * PicSizeInMbs (mvcScaleFactor * MaxMBPS), fR ). JVT Decision: Agreed.

– Calculations in the p) and q) level constraints: In the p) level constraint for VCL HRD parameters, NumViews is not multiplied for calculating BitRate[ SchedSelIdx ], however, in the q) level constraint for NAL HRD parameters, NumViews is multiplied for calculating BitRate[ SchedSelIdx ]. Clarification is needed if this is correct. JVT Decision: Agreed that this expression should not depend on NumViews, so condition "q" should be changed.

Suggestion: Any level definition for MVC should include capability of decoding at least two views of the corresponding type found as an ordinary AVC capability for the same level. This way the ordinary understanding of the capability associated with a level will still apply to the multiview case. (We did something like this in the SVC case.) Responsive remark from an editor: That's what the mvcScaleFactor does.

Suggestion: Add another dimension to level definitions to express view quantity capability, e.g., a level 3.2.x decoder can decode up to x views, where x is an integer greater than 1, with a DPB capacity that multiplies by x, max bit rate is R*(1+0.75*(x-1)) and CPB capacity C*(1+0.75*(x-1)), and limits on other things such as max_dec_frame_buffering may depend similarly on x. When x is not provided for a decoder capability description, a value of 2 would be implied for x. No – let's try not to go there if possible. Another amendment could address this later if necessary.

By inspecting the high-level syntax of a bitstream, it should be possible to determine that a decoder that conforms to some specified profile@level is required to be able to consume some well-defined subset of the bitstream and output some well-defined set of target views. Agreed.

122

Page 123: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

In the absence of a selection of a set of target views by external means (not specified in the Specification), the selected set of target views is considered to be the base view. Agreed.

In the current draft, in the "main SPS" part of the "subset SPS", the profile_idc and level_idc indicate a decoder P@L that can consume the entire bitstream and output all views in it. This information will always be there. (Thus there needs to be some profile@level that is capable of consuming and outputting all views of the entire bitstream.) Suggestion: Let's change that. In an application there may never be a need to decode all views. Suggestion: Let's impose P@L constraints only on extracted subsets, and make the "global" P@L indication optional. In the absence of the global P@L indicator, we would specify conformance in terms of conformance of extracted subsets, not complete bitstreams. That seems agreed.

Each view that is present in the bitstream must be decodable (and outputable) by some profile@level decoder that is specified in the bitstream that consumes the subset of the bitstream associated with that view. Agreed.

The profile_idc and level_idc in the SPS extension part of the subset SPS for the view identifies that profile@level. Agreed.

Move operation point level_idc indications from SEI into SPS extension. Editor given discretion to draft syntax. Agreed.

For a coded video sequence, the number of views that are output per access unit is a constant. Agreed.

We are designing the profiles and levels based on the expectation that some cross-view dependencies will be present in the bitstream, i.e., MVC is being used for more than a simulcast multiplex. Agreed.

mvcScaleFactor = 2. Agreed.

NumViews is a number derived from the dependency tree for the operation point. Agreed.

Note: A coded video sequence is from IDR to IDR. There's a flag in the NAL unit header extension called an anchor_pic_flag to indicate an "anchor access unit". An anchor access unit seems to be essentially the same thing as a random access point SEI message with recovery_frame_cnt equal to 1.

Level limit on upper bound on max_dec_frame_buffering (within the larger Min( A, B ) expression as B), is Max(1, Ceil( log2( NumViews ) ) ) * 16. Agreed.

5.2.1.1.2 JVT-AB037 -L (Info) [A. Vetro (MERL)] MVC Profile/Level Definitions for Stereo

This informative contribution highlights reported benefits of MVC for stereo applications relative to other options such as simulcast and stereo interleaving. The suitability of the currently defined Multiview High profile and corresponding level limits is also considered.

There has been increased momentum recently in the production of 3D content for cinema applications; for the most part, this has been limited to stereo content. There are also a variety of display technologies on the market that support 3DTV, each offering a different viewing experience and having different input requirements. More specifically, stereoscopic displays

123

Page 124: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

support stereo content and require glasses, while auto-stereoscopic displays avoid the need for glasses by rendering view-dependent stereo pairs for a multitude of viewing angles.

There are also a wide range of standardization bodies and consortia considering 3D, including Blu-Ray Disc Association, ATSC, SMPTE and 3D@Home, to name a few. It was suggested that MPEG/JVT should consider opening up liaison communications with these bodies.

The current market needs for stereo content could be satisfied with MVC. This would also be an appropriate format for future extensions to support auto-stereoscopic displays, either with the existing Multiview High profile or a backward compatible extension that includes information such as depth as part of the format.

AVC "stereo video SEI" has stereo capability, but requires some system and decoder adaptation to support decoding. MVC bitstreams can be decoded by legacy decoders and can operate in legacy system designs (provided no buffering constraints are violated).

A hypothetical new Level 4.3 was suggested to be considered for MVC purposes (perhaps not taking into account the mvcScaleFactor).

When considering a simulcast approach, note that this doubles the full bit rate and CPB and DPB capacity requirements of the decoders.

Remark: Many devices already include full dual decoder capability for PIP support.

Liaison communication suggestion: Drafted in side activity and later reviewed and approved.

5.3 MVC SEI

5.3.1.1.1 JVT-AB025 (Info SEI) [M. Ji, Y. Chen, Y. -K. Wang, M. M. Hannuksela, H. Li (Nokia)] Showcase for non-required view component SEI message

At the Geneva meeting, a non-required view component SEI message (JVT-AA034) was adopted into Joint Draft 7.0 of MVC. It specifies the view components that are not required for decoding a view. In this contribution, a showcase for the non-required view component SEI message was presented.

Example decoder operation was demonstrated with decoding of substantially fewer view components when the SEI message is recognized.

JVT disposition: The showcase is considered adequate to demonstrate the usefulness of this SEI message.

5.3.1.1.2 JVT-AB026 (Info SEI) [M. Ji, Y. Chen, Y. -K. Wang, M. M. Hannuksela, H. Li (Nokia)] Showcase for view dependency structure SEI message

At the Geneva meeting, a view dependency change SEI message (JVT-AA035) was adopted into Joint Draft 7.0 of MVC. It specifies the changes of the view dependency, and the changed dependency is a subset of the view dependency specified in the active sequence parameter set. In this contribution, a showcase for the view dependency change SEI message was presented.

124

Page 125: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Example decoder operation was demonstrated with decoding of substantially fewer view components when the SEI message is recognized.

JVT disposition: The showcase is considered adequate to demonstrate the usefulness of this SEI message.

5.3.1.1.3 JVT-AB027 (Info SEI) [M. Ji, Y. Chen, Y. -K. Wang, M. M. Hannuksela, H. Li (Nokia)] Showcase for operation point not present SEI message

At the Geneva meeting, an "operation point not present" SEI message (JVT-AA035) was adopted into Joint Draft 7.0 of MVC. It indicates those operation points that are no longer present after a certain point in bitstream order. In this contribution, a showcase for the operation point not present SEI message was presented.

The SEI message indicates that some view is not present in the bitstream.

Question: Is this SEI message to indicate a situation that should not actually intentionally occur (a decoder operating in an application with a selection of target views such that some selected target views are missing from the bitstream)? Responsive remark: Perhaps – or perhaps this message is intended to be a way to try to prevent such a situation from happening or to enable the detection of such a condition as an error condition (e.g. when a system-level indication/negotiation establishes an expectation that is contradicted or indicated as an exception by the SEI message). Remark: Some potential uses seem philosophically questionable, as this could be an SEI message designed to operate in a situation that perhaps should not actually be allowed to occur (i.e., a decoder operating with some selection of a target view that is not actually available in the bitstream).

Remark: A similar situation occurs for an SVC case, and there is a similar SEI message called a "layers not present" SEI message. Responsive remark: When presented with a conforming bitstream, the SVC spec says to decode what is present, not to guess that because the (potentially perfectly conforming) bitstream has characteristics that differ from what the decoder has been designed to expect, the decoder can do something different from what the standard says that it shall do in response to the conforming bitstream.

Remark: The question appears to be, in some potential uses, whether a decoder might be allowed to act as if there is something wrong with the bitstream when the bitstream is actually a conforming bitstream, and whether we should supply information to help decoders in such situations to exhibit better behavior. However, possibly there may be an indication/negotiation at the system level that the bitstream should have certain properties, or perhaps the SEI message can be the way to indicate the actual properties of the bitstream within the application environment.

The topic was further discussed. It was decided that such an SEI message is just a description of the characteristics of the (conforming) bitstream and that this ability to explicitly describe these characteristics appears likely to be useful in creating more robust decoders, by providing decoders with a description of the properties that the bitstream has (or should have if no data has been lost – which would enable a decoder to check whether the received data actually matches the described properties – such that a lack of a match would be an error condition). As long as the semantics of such an SEI message do not specifically refer to enabling an intentional non-conforming bitstream state (i.e. they are only a description of the properties of a conforming bitstream), there should be no problem with defining such an SEI message.

The fact that the decoder should not be placed into a situation (e.g. by external means at the systems level) in which it expects some data to be present in the bitstream means that the

125

Page 126: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

availability of such SEI data that points out an absence of particular data subsets in the bitstream could be valuable in such a system environment as a check to ensure that external negotiations of bitstream properties match the internal properties of the actual delivered bitstream (or as the way established in an application environment to indicate what subset of the maximal set negotiated by external means at the system level is actually not present).

JVT disposition: The showcase is considered adequate to demonstrate the usefulness of this SEI message.

5.4 MVC JMVM

5.4.1.1.1 JVT-AB030 (Prop 2.2/3.1) [Y.-J. Jeon, B.-M. Jeon (LG), S. Gao, L. Xiong (Huawei)] Comments on JMVM 8.0

At the last meeting, a one bit flag, non_anchor_ref_lX_sample_pred_flag, was adopted to allow different view dependencies for inter-view sample prediction (ISP) and motion skip (MS). From this, the view dependencies for interview sample prediction and motion skip can be set differently.

In some old JMVM, the dependencies for ISP and MS were the same. At the last meeting, there were proposals to change that, and a flag was created for the SPS. Flag = 0 indicates not used for ISP but used for MS; Flag = 1 indicates used for ISP and may or may not used for MS.

Flag is called non_anchor_ref_lX_sample_pred_flag.

Remark: There may be an issue with common conditions or software, but not with the draft spec. ISP is allowed for P views for non-anchor pictures, although that capability is not used in the common conditions.

Proponent wants to be able to determine if a particular view is used for MS. Remark: Why? What is important to know, for single-loop decoding purposes, is only whether a view is used for ISP, not whether it is used for MS.

Off-line study was conducted to determine if there is a spec problem or a common conditions problem or a software problem or no problem and the topic was discussed further. It was concluded that no action is needed to address this issue.

5.5 MVC conformance testingRemark: Hypothetically, it might be possible to write a tool to construct multi-view conformance test sequences from ordinary AVC conformance test sequences.

Volunteers to provide bitstreams: Mitsubishi, Nokia, NTT, Thomson

Thoughts on desired functionality coverage:– Dependency structures (simulcast, P views, B views, temporal prediction structures, only

inter-view prediction, only anchor cross-view prediction, independent non-base view)– Use of anchor/non-anchor functionality (with and without exhibiting open GOP prediction

behavior, with different dependency structures for anchors than for other points)– Variation of number of views (e.g. 2, 3, 8)– Different vs. shared subset SPSs– Large value of max_num_ref_frames

126

Page 127: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

– Reference picture list reordering– Bitstreams prior to extraction of operation point data subsets– Various extracted operation point subsets of such bitstreams– Some SEI message examples– MVC VUI– Bitstreams that have characteristics that change at IDR access units (e.g. operation point view

subsets)

5.6 MVC verification testing

This topic has not been discussed much in previous JVT discussions. It was noted that the project began after a rigorous subjective test effort to establish whether a substantial benefit could be demonstrated from the use of MVC. The basic design capability (in terms of picture coding tools and enabled dependencies) has not changed since that test in terms of coding efficiency capability. Thus the JVT considers the prior subjective test to suffice as a verification of the coding efficiency benefit of the design. Agreed.

6 AVC reference encoding methods

6.1.1.1.1 JVT-AB031 (Ref soft 2.0/3.1) [W.-S. Kim, P. Pahalawatta, Z. Li, A. M. Tourapis (Dolby)] New Video Quality Metrics in JM reference software

The H.264/AVC JM reference software currently only provides encoder distortion information using Mean Square Error (MSE) based error metrics. The computation is also constrained by the color format of the source sequence. It is claimed that such metrics do not model the human visual system very accurately. In this contribution the reference software is extended to support additional distortion metrics in the encoder such as the Structural Similarity Index (SSIM), the Multi-scale SSIM (MS-SSIM), and distortion computation in different color spaces such as RGB. The introduced code is claimed to be very flexible and could be further extended in the future with additional distortion metrics and could also consider different color spaces if so desired.

It is implemented such that the user of the JM encoder provides in addition to PSNR an SSIM value as output per picture. Average SSIM and also MS-SSIM values are output for the entire video sequence.ITU-R 601 is used for color conversion in case the measurement is done for RGB.

JVT decision: Adopt.

6.1.1.1.2 JVT-AB032 (Ref soft 2.0/3.1) [P. Pahalawatta, A. M. Tourapis (Dolby)] Reorg of Error Resilient RDO in the ref software

This document describes several major problems in the existing implementation of error resilient rate distortion optimization (ER-RDO) in the H.264/AVC reference software. It is presented that the existing code had to be significantly revamped in order to provide useful results. The document describes the steps taken to remedy some of the issues and improve the usability of the code, and provides simulation results using the corrected implementation. The motivation for this work is that the ER-RDO is a very useful component for improving the error resiliency of compressed AVC content. Fixing the implementation and improving its usability will greatly enhance the ability to further study rate distortion optimization schemes under lossy conditions.

127

Page 128: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Current implementation does not support B slices, and color distortion measurement. Planned to be available in next release.

Results have been presented showing that at 0% packet loss rate, the result is identical to the normal RDO. For 1% packet loss rate, an improvement of 3dB is reported.

JVT decision: Adopt.

Question on ER-RDO for JSVM. Needs checking.

6.1.1.1.3 JVT-AB033 (Ref soft 2.0/3.1) [A. Leontaris, A. M. Tourapis (Dolby)] Weighted Prediction Alternative to Adaptive Interpolation mechanisms

It is claimed that many proposals have been presented suggesting that H.264/AVC could be improved by the use of different interpolation mechanisms from the one currently supported in the reference software. In this contribution evidence is provided that one may consider simple tricks using the weighted prediction process, which is already supported by the standard, to emulate these mechanisms and achieve similar improvements.

Methods for use of the existing weighted prediction syntax are presented. Tried IPPP, IBBPBBP, and hier. B picture structures.

Results are reported in the IPPP setting for the case when the most recent reference picture is duplicated using RPLR syntax and assigned an offset of -1. Average bit-rate reductions are shown to be 4.65 for HD, 0.92% for CIF and 0.79% for QCIF.

Results are reported for the IBBP setting with list 0 unchanged and a similar modification was applied for list 1 as in IPPP. Average bit-rate reductions are shown to be 2.58% for HD, 0.68% for CIF and 0.87% for QCIF.

For hier. B pictures, the method is applied when coding reference P and reference B pictures. Average bit-rate reductions are shown to be 2.67% for HD, 1.06% for CIF and 1.38% for QCIF.

Contribution noted. Further work encouraged.

6.1.1.1.4 JVT-AB034 (Ref soft 2.0/3.1) [E. Maani, L. Liu, A. M. Tourapis (Dolby)] JM Reference Software Enhancements

In this contribution several enhancements that have been introduced into the H.264/AVC JM reference software, are presented. The enhancements include proper support of Adaptive Rounding for the low complexity rate distortion optimized mode decision, and support of bi-predictive motion estimation for partitions smaller than 16x16, among others.

Contribution noted. Further work encouraged.

7 H.222.0 | MPEG-2 Systems extension for SVC

Design features: - T-STD / P-STD model slightly extended: No additional buffer requirement- Transport of AVC in its own PID- Prefix NAL units are part of AVC PID- SVC “part” of AVC stream in one or more PIDs- New stream type 0x1F for SVC sub-bitstreams

128

Page 129: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

- Hierarchy_Descriptor for dependency indications- New SVC_Extension_Descriptor for scalability information- Access unit delimiter (AUD) like NAL unit in SVC sub-bitstreams - Access unit reassembly based on DTS and SVC AUD- For the temporal scalability case, a pointer to DTS of “lower” sub-bitstream is added, if

needed- SVC subset SPS and SVC PPS go into SVC PID - SVC-related SEI messages may go into SVC sub-bitstream, requires in-Access Unit re-

ordering

Status:All open issues and NB comments addressed. Some editing still required. Final draft of text of ISO/IEC 13818-1:2007/FDAM3 with 3 weeks editing period.

The text of the current draft was briefly reviewed. Implementation available? yes, by FT and HHI. Will be presented at 16:00 (Thursday).

The JVT has approved the document.

8 JVT internal operating rulesJVT decision: The following clarifications/adjustments of JVT operating rules have been adopted.

The JVT decided that participants shall to refrain from long (=more than 4 Minutes) presentations of their proposal, if the results of their coding efficiency experiments have provided less than 2% bit-rate on average (or equivalently 0.1 dB gain on average).

Presentations should also not use "cherry picking" of results for summary reporting in abstracts and presentations. Summary reports must be true summaries – not highlights of best results while ignoring worst results.

Regarding late contributions: Due to our difficulties with a large quantity of late-submitted contributions at previous meetings, the JVT has agreed that for its next meeting, no late-uploaded (non-AHG-report, non-liaison, non-verification) contribution will be presented without having a minimum of 4 JVT participants (working for separate organizations other than that of the primary contribution author) recorded by name as supporting the allowance of such a presentation, in addition to a consensus of the general JVT membership to allow the presentation. Such support to allow a presentation is to be understood to not necessarily imply support of the adoption of the content of the late contribution, but only as a positive expression that the document should be allowed to be presented. Additionally, the provider of such a presented late contribution shall send an email apology to the JVT email reflector. This rule does not apply to material requested by the JVT at the meeting (e.g., reports of JVT-authorized "break out group" side activities).

For all contributions that have presentation material that is used to present them to the group (e.g., PowerPoint presentations), the presentation material should be provided along with the written contribution (within the same zip container file). PDF is preferred over PPT for presentations when the PPT filesize is large and there is no need for the slide deck to be editable by others.

All submissions must be made in JVT-ABxxx.zip format with the word docs, excel sheets and other information being in the zip container. The document must contain an abstract and be accompanied with an e-mail notification containing title, authors and abstract (identical to the one

129

Page 130: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

in the doc) which is no longer than 200 words and no shorter than 25 words and is written in 3rd person language in a manner that does not express endorsement of the content of the document.

On filenames inside of .zip containers – use a filename so that if someone takes the files out of the zip container, they would still know what contribution they came from. Every file (or directory) in the .zip container for document JVT-ABxxx should start with JVT-ABxxx. Example: JVT-ABxxx.doc (main document), JVT-ABxxx_presentation.pdf, JVT-ABxxx_results1.xls, etc.

When providing additional or revised files, do not include copies of files that were already included in the prior .zip archive for the same contribution and do not re-use the same filenames without adding revision numbers (r1, r2, etc.) – this saves us needing to worry about whether the files someone obtains with the same filenames are the same or different.

Independent verification (necessary for adoption of a proposal) is provided either througha) independent implementation by 1 or more organization different than that of the

proponent based on the textual description (after adoption, both decoder source code versions must be made publicly available along with one encoder version), or

b) providing source code to all CE participants prior to the meeting (CEs can only be joined at the meeting, when the CE is created. CEs are created at each meeting and last until the next meeting.)

Simply running binary executables provided by a proponent is not ordinarily considered independent verification. Source code should be provided and used, and the verifying party should invest a proper degree of effort to ensure that the “verification” they perform is a meaningful and professional study with significant depth rather than just a perfunctory procedural formality.

For every SEI message and every syntax element that are currently in the SVC/MVC draft, a showcase has to be provided in order to retain it in the JSVM/JMVM/JD. If such a showcase is not provided at the next meeting for an SEI message or parts of it, the SEI message or the respective parts will be removed from the JSVM/JMVM/JD. The source code and executables for the showcase must be made available.

When Core Experiments (CEs) are to be established, a first CE description should be available at the last day of the meeting (or at least within a few days). Changes of the CE description are only allowed until 3 weeks prior to the next meeting. These changes must be of evolutionary characteristic relative to the input documents on which the CE is based and must be agreed by those who contributed the respective input document(s) or be added as an option.

Contributions that are proposals of new technology that was not what was described as being tested in a CE (even if related to the tested technology) should not indicate that they are CE documents in their title and abstract.

9 List of AHGs establishedThe following JVT “ad hoc groups” (AHGs) were established to progress work on identified topics until the next meeting of the JVT.

9.1 JVT project management and errata reportingDiscussion: [email protected]

130

Page 131: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Chair: Gary Sullivan, Jens Rainer Ohm, Ajay Luthra, and Thomas WiegandMandates:– Collect errata reports on standards under management of JVT– Coordinate overall interim JVT progress– Prepare status information for JVT status reporting

9.2 JM Text, reference software, bitstream exchange and conformanceDiscussion: [email protected]: Thomas Wiegand, Karsten Sühring, Alexis Tourapis, Teruhiko Suzuki, Gary SullivanMandates:– Maintain and update JM algorithm description text– Maintain and update JM reference software and its usage manual– Facilitate exchange of test bitstreams to aid interoperability testing– Collect bitstreams for inclusion in (non-SVC, non-MVC) Conformance specifications– Identify and correct problems in Conformance specifications and associated bitstreams

9.3 SVC JSVM text, software and conformanceDiscussion: [email protected]: Heiko Schwarz, Jérome Vieron, Thomas Wiegand, Mathias Wien, Alex Eleftheriadis, Vincent BottreauMandates:– Edit and deliver improved JSVM text– Coordinate JSVM software integration– Coordinate bug-fixing process for the JSVM software– Maintain JSVM software manual– Plan, edit, and collect bitstreams for SVC conformance specification

9.4 SVC bit depth, and chroma format scalabilityDiscussion: [email protected]: Andrew Segall, Thomas WiegandMandates:– Identify applications– Work out suggestions for detailed needs– Find/create test material– Study bit-depth reduction techniques, e.g., tone-mapping tools– Study color space and/or gamma conversion requirements– Define experiments and test conditions– Investigate software and text modification needs– Identify complexity issues

9.5 MVC JD and JMVM text, software, and conformanceDiscussion: [email protected]: Hideaki Kimata, Aljoscha Smolić, Purvin Pandit, Anthony Vetro, Ying ChenMandates:– Collect comments on draft, perform necessary editing and delivery.– Maintain JMVM and JD document and collect comments on the text.– Coordinate JD/JMVM software integration

131

Page 132: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

– Coordinate bug-fixing process for the JD/JMVM software– Maintain JD/JMVM software manual– Plan, edit, and collect bitstreams for MVC conformance specification

9.6 Splicing operationDiscussion: [email protected]: Gary Sullivan, Arturo Rodriguez, Sam NarasimhanMandates:– Study the use of bitstream splicing in applications– Investigate potential needs for SEI data to aid in splicing operations, including consideration

of JVT-Z040, JVT-Z041, and JVT-Z042 and the issues raised in their discussion– Study the implications of ITU-T Rec. J.181 and the draft new ITU-T Rec. J.h-dpi– Gather information about activities of other relevant organizations regarding the

development of specifications relating to bitstream splicing

10 Future meeting plans

The JVT chairmen proposed to hold the 29th JVT meeting during 13-17 October 2008 under WG 11 auspices in Busan, KR; and the 30th JVT meeting during 29 January – 3 February 2009 under ITU-T SG 16 auspices in Geneva, CH.

11 AttendancePersons registered to attend the meeting, as recorded by a sign-in sheet circulated during the meeting, were the following (90 listed participants): 1. Bang, Gun (ETRI)2. Bivolarski, Lazar Malinov (Logitech)3. Bottreau, Vincent (Thomson R&D France)4. Carballeira, Pablo (Univ. Polit. de Madrid)5. Chen, Peisong (Qualcomm)6. Chen, Ying (Tampere Univ. Tech.)7. Cheng, Ka Man Carmen (ASTRI)8. Chiu, Yi-Jen (Intel)9. Choi, Hae-Chul (ETRI)10. Chujoh, Takeshi (Toshiba)11. Cieplinski, Leszek (Mitsubishi Electric)12. de Haan, Wiebe (Philips)13. Ding, Dandan (Zhejiang Univ.)14. Divorra, Òscar (Thomson)15. Domański, Marek (Poznań Univ. Tech.)16. Duenas, Alberto (W&W Communications)17. Eleftheriadis, Alex (Layered Media)18. Even, Roni (Polycom)19. Feng, Yue (Univ. Glasgow)20. Fukushima, Shigeru (JVC)21. Hannuksela, Miska (Nokia)22. Haque, Munsi (Motorola)23. Haskell, Barry (Apple)24. Ho, Yo-Sung (GIST)

132

Page 133: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

25. Hsiang, Shih-Ta (Motorola)26. Huo, Junyan (Xidian Univ.)27. Husak, Walt (Dolby Labs)28. Itoh, Takashi (Fujitsu Labs)29. Jeon, Byeong-Moon (LG Electronics)30. Jeon, Byeungwoo (Sungkyunkwan Univ. - SKKU)31. Jeon, Yongjoon (LG Electronics)32. Karczewicz, Marta (Qualcomm)33. Kazui, Kimihiko (Fujitsu)34. Kiang, Chiew Tuan (Inst. Infocomm. Res.)35. Kim, Myoung-Jin (Soongsil Univ.)36. Kimata, Hideaki (NTT)37. Klimansewski, Krynxtox (Poznań Univ. Tech.)38. Kogure, Tokuyo (Univ. Tokyo)39. Lee, Cheon (GIST)40. Lee, Yung-Lyul (Sejong Univ.)41. Lim, Jung Eun (LG Electronics)42. Lin, Sixin (Huawei)43. Luthra, Ajay (Motorola)44. Ma, Siwei (Peking Univ.)45. Motta, Giovanni (Qualcomm)46. Nakamura, Katsuyuki (Hitachi)47. Nishi, Takahiro (Panasonic)48. Oh, Seoung-Jun (Kwangwoon Univ.)49. Ohm, Jens-Rainer (RWTH Aachen Univ.)50. Pandit, Purvin (Thomson)51. Pateux, Stephane (Orange - France Telecom)52. Qi, Yingyong (Qualcomm)53. Ridge, Justin (Nokia)54. Sakazume, Satoru (JVC)55. Saito, Shohei (Hitachi)56. Schwarz, Heiko (Fraunhofer HHI)57. Segall, Andrew (Sharp Labs USA)58. Sekiguchi, Shun-ichi (Mitsubishi Electric)59. Senoh, Takanori (Univ. Tokyo)60. Shi, Xiaojin (Apple)61. Shimizu, Shinya (NTT)62. Sim, Donggyu (Kwangwoon Univ.)63. Singer, David (Apple)64. Stankiewicz, Olgierd (Poznań Univ. Tech.)65. Sullivan, Gary (Microsoft Corp.)66. Sun, Huifang (Mitsubishi Electric)67. Sung, Jaewon (LG Electronics)68. Suzuki, Teruhiko (Sony)69. Tan, Thiow Keng (NTT DoCoMo)70. Tanizawa, Akiyuki (Toshiba / Mitsubishi Electric)71. Thoma, Herbert (Fraunhofer IIS)72. Tian, Dong (Thomson)73. Tourapis, Alexandros (Dolby Labs)74. Tung, Yi-Shin (MStar Semi.)75. Vermeirsch, Kenneth (Ghent Univ.)76. Vetro, Anthony (Mitsubishi Electric)77. Wang, Ye-Kui (Nokia)

133

Page 134: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

78. Wiegand, Thomas (Fraunhofer HHI)79. Wittmann, Steffen (Panasonic)80. Won, Kwanghyun (SKKU)81. Xiong, Lianhuan (Huawei)82. Yamakage, Tomoo (Toshiba)83. Yamamoto, Tomoyuki (Sharp)84. Yamasaki, Takahiro (Oki Electric Industry)85. Yao, Wei (I2R)86. Yashima, Yoshiyuki (NTT)87. Yea, Sehoon (MERL)88. Yoo, Jeong-Ju (ETRI)89. Yu, Haoping (Huawei)90. Yu, Lu (Zhejiang Univ.)

134

Page 135: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Annex I – Audio report

Source: Schuyler Quackenbush, Chair, Audio Subgroup1 Opening of the meeting.........................................................................................................1342 Administrative matters...........................................................................................................134

2.1 Communications from the Chair 1342.2 Approval of agenda and allocation of contributions 1342.3 Creation of Task Groups 1342.4 Approval of previous meeting report 1342.5 Review of AHG reports 1342.6 Joint meetings 1342.7 Received National Body Comments and Liaison matters 134

3 Record of AhG meetings.......................................................................................................1353.1 AhG Meeting on SAOC, Unified Speech and Audio Sunday 1000-1700 135

3.1.1 Unified Speech and Audio 0900-1400........................................................................1353.1.2 SAOC 1400-1800........................................................................................................139

4 Plenary presentations.............................................................................................................1395 Task group activities..............................................................................................................140

5.1 Joint meetings and documents from other groups 1405.1.1 MPEG standards for Gaming.......................................................................................1405.1.2 Interactive Music AF, M15626....................................................................................1405.1.3 Proposed Audio profiles for ALS, SLS.......................................................................141

5.2 Task Group discussions 1415.2.1 MPEG-2 and MPEG-4 audio, conformance, reference software................................1415.2.2 MPEG-4 AAC Enhanced Low Delay..........................................................................1445.2.3 MPEG-D SAOC..........................................................................................................1455.2.4 MPEG-D Unified Speech and Audio..........................................................................146

6 Meeting deliverables..............................................................................................................1477 Responses to Liaison and NB comments...............................................................................148

7.1 Recommendations for final plenary 1487.2 Establishment of Ad-hoc Groups 1487.3 Approval of output documents 1487.4 Press statement 148

8 Future activities.....................................................................................................................1488.1 Schedule of future meetings 1488.2 Agenda for next meeting 1488.3 All other business 1488.4 Closing of the meeting 148

Annex A Participants...............................................................................................................149Annex B Audio Contributions and Schedule...........................................................................151Annex C Task Groups.............................................................................................................155Annex D Output Documents....................................................................................................156Annex E Agenda for the 85th MPEG Audio Meeting..............................................................157

135

Page 136: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

1 Opening of the meetingThe MPEG Audio Subgroup meeting was held during the 85th meeting of WG11, July 21- 25, Hannover, DE. The list of participants is given in Annex A.

2 Administrative matters2.1 Communications from the ChairThe Chair summarised the issues raised at the Sunday evening Chair’s meeting, proposed task groups for the week, and proposed agenda items for discussion in Audio plenary.

2.2 Approval of agenda and allocation of contributionsThe agenda and schedule for the meeting was discussed, edited and approved. It shows the documents contributed to this meeting and presented to the Audio Subgroup, either in the task groups or in Audio plenary. The Chair brought relevant documents from Requirements, Systems to the attention of the group. It was revised in the course of the week to reflect the progress of the meeting, and the final version is shown in Annex B.

2.3 Creation of Task GroupsTask groups were convened for the duration of the MPEG meeting, as shown in Annex C. Results of task group activities are reported below.

2.4 Approval of previous meeting reportThe 84th Audio Subgroup meeting report was registered as a contribution, and was approved.

2.5 Review of AHG reportsThere were no requests to review any of the AHG reports.

2.6 Joint meetingsThe joint meetings with Audio for the week are shown below:Groups What Where Day TimeAll MPEG standards for Gaming Plenary Wed 1100-

1200Sys., Audio, Req.

Interactive Music AF, M15626 Audio Wed 1600-1700

Audio, Req. Proposed Audio profiles for ALS, SLS

Audio Thu 1100-1130

2.7 Received National Body Comments and Liaison mattersThe NB Comments and Liaison documents for the meeting that require a response are as shown below.No. Title Topic Response bym15261 Liaison Statement from ITU-T SG 16 Enhanced Low

Delay AACSchnell

m15516 Liaison Statement from ITU-T SG 16 full band audio coding

Schnell

m15555 IEC 61937-10 Digital Audio Interface ALS over IEC 61937-10

Quackenbush

m15551 FRNB comment on the Unified Speech and Audio Coding Exploration Activity

USAC Quackenbush

136

Page 137: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

m15720 KNB comment on the Unified Speech and Audio Coding Exploration Activity

USAC Quackenbush

3 Record of AhG meetings3.1 AhG Meeting on SAOC, Unified Speech and Audio Sunday 1000-17003.1.1 Unified Speech and Audio 0900-1400

Test Site ReportsThe first set of presentations reported on the test site listening facilities. These were very straightforward and factual, and in each case there was no discussion to record. Kristofer Kjörling, Dolby, presentedm15545 Speech and Audio listening test lab report from Dolby Kristofer Kjörling

Anisse Taleb, Ericsson, presentedm15618 Speech and Audio listening test lab report (Ericsson) Anisse Taleb

Taejin Lee, ETRI, presentedm15570 Unified speech and audio coding listening test report from

ETRITaejin LeeSeungkwon BeackMinje KimKyeongok Kang

Herve Taddei, Huawei, presentedm15678 Audio Report on the subjective testing of Unified

Speech and Audio Coding proposals at HuaweiLijing XuHerve Taddei

Pierrick Philippe, France Telecom, presentedm15655 France Telecom listening test results for the CfP on

Unified Speech and Audio CodingPierrick PhilippeDavid Virette

Markus Multrus, FhG, presentedm15619 Report on USAC Subjective Tests at Fraunhofer IIS Test

SiteMarkus MultrusRalf Geiger

Dong Soo Kim, LGE, presentedm15581 Report on the USAC listening test at LGE Dong Soo Kim

Sungyong YoonJaehyun LimHyun-Kook LeeHenney OhYang-Won Jung

Werner Oomen, Philips, presentedm15548 Speech and Audio listening test lab report from Philips Werner Oomen

Jeroen Koppens

Eunmi Oh, Samsung, presentedm15567 Listening test results for Unified Speech and Audio

Coding from SamsungEunmi OhMiyoung KimJungHoe Kim

Oliver Wuebbolt, Thomson, presentedm15565 Speech & Audio - Listening Test, Report & Results -

ThomsonJohannes Boehm Oliver Wuebbolt Florian Keiler

Roch Lefebvre, VoiceAge, presented137

Page 138: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

m15608 Report on USAC subjective tests at VoiceAge test site Roch Lefebvre

System DescriptionsThe next set of presentations reported on the proponent system architectures.Kristofer Kjörling, Dolby, presentedm15547 Technical description of the Dolby Philips proposal for the

speech and audio work- itemKristofer KjörlingWerner OomenJonas SamuelssonLars VillemoesBarbara ReschErik SchuijersPontus Carlsson

The Dolby/Philips system builds on HE-AAC V2 architecture, but adding prediction-based time domain processing elements. The tools in HE-AAC V2 have been slightly modified so as to offer additional compression performance for the diverse speech and audio content. The MDCT permits a number of window lengths and block sizes, in which all transform bins have equal bandwidth. All AAC-LC tools were available in the core coder. The codec delay can be as much as 1.2 seconds, which is due primarily to encoder look-ahead and blocking into super-frames.Taejin Lee, ETRI, presentedm15568 Technical description of the ETRI proposal for the unified

speech and audio codingTaejin LeeSeungkwon BeackMinje KimKyeongok KangHochong ParkYoungcheol Park

The ETRI system combined the individual tools from AMR-WB+ and HE-AAC V2 to create a new codec, with modifications to the tools as needed to address the coding of diverse content types. It used two processing models: Steady State using linear prediction coding tools, and Complex State using transform coding tools. There is a very flexible signal state decision module that results in a hard decision to use one of the two processing models for each block. Bernhard Grill, FhG, presentedm15621 Technical Decsription of the Fraunhofer IIS Submission

for the CfP on USACMarkus MultrusRalf GeigerBernhard GrillNikolaus RettelbachMax Neuendorf

The FhG system is a collaboration between Fraunhover IIS and VoiceAge. The system combines elements from MPEG Surround, the SBR tool, AAC and ACELP linear prediction coding. The tools were modified as appropriate for better performance for coding diverse speech and audio content. MPEG Surround is used for stereo processing, although it can easily be extended for processing multi-channel signals. In the submission it only uses a single OTT box. The SBR technology employed the same filterbank as MPEG Surround, but was enhanced to support very low cross-over frequencies. It can be switched off for high-bitrate operation. The core coder is comprised of a Linear Prediction tool and a MDCT Transform coder or Time-Domain coder. In the core coder, the Linear Prediction tools can be activated on a block-by-block basis or can always be active. The LP tool is followed by either a time-domain coding tool (ACELP) or a frequency-domain coding tool (AAC). At high bitrates the system is virtually identical to AAC. The AAC tool uses an improved entropy coder and optionally uses time-warping for improved performance on speech signals.The possible configurations of the core coder are:State Stage 1 Stage 2

1 LPC Time-Domain residual coder2 LPC MDCT residual coder3 LPC

offMDCT Coder

138

Page 139: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Philippe Gournay, VoiceAge, presentedm15609 Technical Description of the VoiceAge Candidate for

USACRoch Lefebvre

The VoiceAge system is a collaboration between Fraunhover IIS and VoiceAge. The decoder is essentially the same system as was presented in the FhG contribution. The time-domain core coding tool is the AMR-WB ACELP coding tool.The LPC coding tool is typically switched on for speech-like signals. In this case, the core coder can switch between time-domain (ACELP) and frequency-domain coding of the residual (TCX) on a block-by-block basis. If the LPC coding tool is switched off, the coder uses frequency-domain coding. At high bitrates the codec is very similar to AAC. The VoiceAge encoder employed a different psychoacoustic model, SBR tool and MDCT coder. The MDCT did not employ time-warping.The complexity of the decoder is no more than 1.8 times that of HE-AAC V2.

Dong Soo Kim, LGE, presentedm15582 LGE submission to Unified Speech & Audio Coding Dong Soo Kim

Sungyong YoonJaehyun LimHyun-Kook Lee

The LG system switched between three coding modules for coding speech-rich signals, music-rich and mixed speech and music. The speech-rich coder employs a modified AMR-WB+ tools, music-rich coder used a modified HE-AAC V2 tools and mixed coder uses a residual coding scheme. The decoder employs a delay compensation process so that the system can switch between coding modes on a block-by-block basis. JungHoe Kim, Samusng, presentedm15564 Response to CfP on unified speech and audio coding Eunmi Oh

JungHoe KimMiyoung KimKiHyun ChooHosang Sung

The presentation noted that coding of music signals is most effective using atransform coder with perceptual threshold for shaping the quantization noise while coding of speech signals is most effective using a linear predictive speech model. The Samsung system uses a non-uniform bandwidth MDCT tool and any of the T/F information can be coded be either the high-temporal resolution tool or HE-AAC like quantization tools. In addition, it uses a TNS tool, a SBR tool and a parametric stereo tool. The high-temporal resolution tool is only used at the lower bitrates. In discussion it was noted that, without considering quantization errors, the variable T/F MDCT tool operates such that time-domain aliasing is always cancelled.Oliver Wuebbolt, Thomson, presentedm15566 Speech & Audio - Description of Technology of the

Thomson proposalFlorian KeilerOliver WuebboltJohannes Boehm

The Thomson system uses a speech/audio switch to select either a time-domain coding tool or a frequency-domain coding tool in the core coder. In addition, it incorporates SBR and Parametric Stereo coding tools. Analysis of Listening Test DataThe final set of presentations reported on analysis of the listening test data.Werner Oomen, Philips, presentedm15546 Analysis of speech and audio listening test data Werner Oomen

Kristofer KjörlingHeiko Purnhagen

This contribution presents the Philips and Dolby experts analysis of the listening test data. The data was analyzed to determine if the requirements as specified in the Evaluation document were met, for both 95% and 99% confidence intervals. The results showed that systems 2, 4, 8, 10 and 11 were promising systems with respect to the Requirements analysis.Taejin Lee, ETRI, presentedm15613 Analysis of unified speech and audio coding listening test Taejin Lee

139

Page 140: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

results Seungkwon BeackMinje KimKyeongok Kang

This contribution presents an analysis of the Figure of Merit statistics. It presents a count of how often each system has the highest mean performance for each test and each signal category. Its conclusion is that there is no dominant best system.Pierrick Philippe, France Telecom R&D, presentedm15656 France Telecom contribution to the analysis of the

listening test results following the CfP on Unified Speech and Audio Coding

Pierrick PhilippeDavid Virette

This contribution presented graphs of system performance as compared to VC. For each signal category it performed single-sided T-tests with respect to the best performing systemCategory System with best mean score

S 4, but not different from performance of 10S+M 2 is best

M 10, but not different from performance of 11The overall conclusions are that systems 2, 4, 8, 10 and 11 all seem promising for this new work.Ralf Geiger, FhG, presentedm15625 Analysis of USAC Listening Test Data Frederik Nagel

Ralf GeigerMax NeuendorfMarkus Multrus

The contribution first presents the calculation of VC for each test and each category.For the Requirement that the New Technology be no worse than VC, only sys10 and sys11 meet the requirements for very test and every category for both the 95% and 99% levels of significance.For the Figure of Merit, only sys10 and sys11 have a non-negative “d” statistic for all tests, at the 95% level of significance. When averaged over all tests, only sys2, sys10 and sys11 have a non-negative “d” statistic, at the 95% level of significance, but sys10 has the best mean “d” statistic, but this mean value is not different from that of sys11 at the 95% level of significance.Roch Lefebvre, VoiceAge, presentedm15610 Analysis of Combined Listening Test Results for USAC Roch Lefebvre

This contribution presents a number of interesting views of the test results based on ranking of the proponent mean scores based on test, signal category and signal. The overall conclusion is that sys10 is the best overall proposal, followed closely by sys11.Eunmi Oh, Samsung, presented a statistical analysis of listening test data, which was provided in contribution m15567. It presented a table showing which systems had the best performance by test and signal category, and also showed the systems whose performance were not different from the best. It concluded that there is no dominant system when performance is viewed by test and signal category. Systems sys2, sys4, sys8, sys10 and sys11 all seem promising. Frederik Nagel, FhG, noted that presenting a table of results based on best mean score might be misleading, in that several systems may have mean scores that are not different from the best at the 95% level of significance.Dong Soo Kim, LG, presented m15713 USAC listening test analysis from LGE Dong Soo Kim

Sungyong YoonThis contribution presents a statistical analysis of listening test data. This was registered as m15713 and was uploaded on Sunday July 20. For the Requirements, it showed that only sys10 and sys11 pass the Requirements for all tests and all signal categories. Furthermore, it noted that sys4 passed the Requirements most often for speech category signals and sys2 passed the Requirements most often for mixed category signals. It presents a Figure of Merit analysis by signal, sys2, sys10 and sys11 all have similarly good performance.Review of AhG ReportThe AhG Chairs presented the AhG report and proposed

140

Page 141: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

To use France Telecom R&D mono/stereo graphs of performance with respect to VC, but removing LP anchors, AMR-WB+ and HE-AAC-V2.

And additionally summarize the test results as sys10 and sys11 are the best or not different from the best for almost every bitrates (mono

and stereo) and content types sys2, sys4 and sys8 are the best or not different from the best for many bitrates (mono and

stereo) and content types

3.1.2 SAOC 1400-1800

CE: Separation of real-environment signalsOsamu Shimada, NEC, made short presentation that reminder the Audio experts the framework for and meaning of test1 and test2 in this core experiment.Jeongil Seo, ETRI, presented the test1 results inm15572 Listening Test Report for CE on separating real-

environment signals into multiple objects from ETRIJeongil SeoSeungkwon BeackKyeongok Kang

Yang-Won Jung, LG, presented the test1 results in m15586 Listening test reports for CE on separating real-

environment signals Yang-Won JungHenney OhDong Soo KimSungyong Yoon

Leonid Terentiev , FhG, presented the test1 results in m15632 Listening test report for CE on separating real-

environment signals into multiple objects for the MPEG SAOC system

Leonid Terentiev Cornelia Falch Oliver Hellmuth Johannes Hilpert

The contribution showed that all listeners in Test1 had at best “no preference” and at worst “did not prefer” the proposed technology. There was considerable discussion on the slide which presented mean score for all listeners in Test 1. Osamu Shimada, NEC, presentedm15574 Listening test results of Test 1 for SAOC CE on

separating real-environment signals into multiple objectsOsamu ShimadaToshiyuki NomuraAkihiko SugiyamaOsamu Hoshuyama

The set of contributions were discussed. The Chair noted that averaging over listener responses may obscure the individual’s intent rather than minimize the noise in the listener responses. It was decided that a break-out group will consider how best to interpret the data we have at this meeting and report back later in the week.

4 Plenary presentationsThe Chair reviewed the Liaison documents and reviewed initial drafts of responsesThe Chair reviewed the current and target location of Audio reference software, as shown in the following table:Code Modules Responsible Party Target LocationMPEG-1MPEG-2MPEG-4

FhG FhG CVS server

ALS LG SVN serverSLS I2R SVN serverMPEG Surround Dolby/FhG/Philips FhG CVS serverSAOC FhG FhG CVS serverMPEG-7 FhG IDMT SVN server

141

Page 142: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

The Chair presented documents m15688 and m15689 on MXM to the Audio Subgroup members.The Chair discussed the issue of ER bistream syntax specification (as in ER AAC-LD) versus reference code implementation. It was decided that a task group of the Audio Chair, Ralph Sperschneider, FhG, and Markus Schnell, FhG, will investigate and report back.The Chair presented N9900, New Directions in Audio Standardization. Karlheinz Brandenburg, FhG, noted that before proceeding with any new standardization effort, MPEG should:

Confirm the need for the standard. For example, that it is truly responsive to the needs of identified customers.

Consider whether MPEG is the correct body for the standardization effort.

5 Task group activities5.1 Joint meetings and documents from other groups5.1.1 MPEG standards for Gaming

MPEG-4 media steams are compressed, which permits easy storage and, more importantly, easy transmission. On-line games offer MPEG an opportunity.The gaming industry does not embrace open standards, but does use defacto standards (e.g. PS3 or X-Box platforms). However, at the rendering level, 99 percent of games today use either OpenGL or DirectX. It was noted that what is needed is contacts to gaming industry, for example to learn to how audio is treated within the game engine and understand what needs the gaming industry might have.Game architecture can be roughly represented by the following flowchart:

Content → Game Engine → Rendering API (e.g. OpenGL, DirectX) → Drivers and presentationIt appears that HE-AAC or USAC streams for audio objects, which are rendered using with SAOC and MPEG Surround (including binaural rendering) could map onto this flowchart. As another example, MP3 technology in Flash might be extended to support SAOC / MPEG Surround. This may be particularly appropriate for simple PC-based network games.Gaming uses middleware to leverage their resources, particularly as a means to be portable across gaming platforms. This potential interface to standardized technology could represent an opportunityIn order to increase awareness of and contacts into the gaming industry audio experts might

Attend gaming conferences (are there such things)? Contact industry associations, e.g. Game Academy (in Berlin)

Finally, it would be a very beneficial exercise to construct a detailed block diagram of a specific networked gaming platform and map onto it possible MPEG audio technology.

5.1.2 Interactive Music AF, M15626

M15710 - RequirementsThe presentation emphasized that interactive music permits a very new user experience. For example, several mixes of a song could be presented. Interactivity options are represented via a graph concept and supports tracks (objects), volume, equalization, specialization. The main contribution is the notion of constraints on the user experience (i.e. possible mixes). There is strong support among artists for the concept of constraints, where such constraints can be

Constraints on tracks selected Constraints on volume of tracks or relative volume of tracks Constraints on equalization of tracks

There are three components of Interactive Music AF: Metadata Presets and Constraints Audio representation

142

Page 143: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Audio experts noted that SAOC could be an excellent coded representation for applications in which the audio information flows across a narrowband transmission channel (e.g. streaming to a mobile phone). Juergen Schmidt, Thomson, noted that MPEG-4 BIFS could be used to implement many of the mixing functionalities. The Systems Chair noted that BIFS does not have extensive support for the concept of saving and retrieving presets.The Audio Chair noted that there should be a requirement that the solution should be able to operate in a way that the audio information is retrieved via a relatively narrow-band communication channel.The Audio Chair noted that the current requirements are captured in N9840, MAF under considerations. These requirements, as augmented by contributions and contributions from this joint meeting, will be reviewed Friday at 11:30 in the Requirements room.M15626 – MAF under consideration. Inseon Jang, ETRI, presented the technical details of the Interactive Music AF. The AF used a specialization of the MPEG-4 FF with two new boxes: Preset Container Box and Preset Box. The AF FF has the brand “im01.” The Preset Box contains the mix information associated with the relevant tracks. The presentation further explained how the metadata is to be stored in the MPEG-4 File.

5.1.3 Proposed Audio profiles for ALS, SLS

Ralf Geiger, FhG, presented the proposed profile for SLS. The consensus of the experts in the joint meeting was to incorporate the profile definition into PDAM to MPEG-4 Audio. Takehiro Moriya, NTT, presented the proposed profile for SLS. The consensus of the experts in the joint meeting was to incorporate the profile definition into a WD of an amendment to MPEG-4 Audio. The Audio and Requirements Chairs requested that at next MPEG meeting there be available

Some positive statement or feedback from ARIB concerning intent to adopt this profile The contact names of business unit persons at companies that intend to adopt the profile Ideas for a profile name

5.2 Task Group discussions5.2.1 MPEG-2 and MPEG-4 audio, conformance, reference software

MPEG-2 and MPEG-4New ProfilesTakehiro Moriya, NTT, presentedm15660 Proposal for MPEG-4 ALS profile/levels Takehiro Moriya

Tilman LiebchenYasushige NakayamaToshiyuki NomuraNoboru HaradaYutaka Kamamoto

This proposes a profile that incorporates a limited set of tools from the ALS specification. A set of levels is defined that addresses scoping of complexity. A figure was presented that motivates the selected set of tools and limitations based on the compression/complexity tradeoff. Complexity figures were based on an ARM9 fixed-point platform and the presenter noted that BGMC coding was division intensive such that it is quite expensive on a fixed-point.There was considerable discussion. Tilman Liebchen, LG, assured that ARIB is seriously moving ahead with this service. The Chair noted that ARIB’s interest would be limited to Level 1 and suggested at this time to limit the proposed profile to include only that level. Yuriy Reznik, Qualcomm suggested that a liaison could be sent to IFPI requesting information on interest in other levels for this profile (e.g. for portable player or consumer music archives).It was the consensus of the group to draft an output document and to review it with the Requirements group. Given a favourable review, this will be issued as a Working Draft of a new amendment to 14496-3.

143

Page 144: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Yuriy Reznik, Qualcomm, presentedm15715 Comment on proposal for MPEG-4 ALS profiles/levels Yuriy Reznik

This contribution noted that the complexity figures in the previous contribution, m15660, appear to be inconsistent with those in N7686, “Verification Report on MPEG-4 ALS.” These differences will be resolved in a task group that will report back to the Audio Subgroup.Ralf Geiger, FhG, presentedm15628 Proposed new profile for MPEG-4 SLS Ralf Geiger

Ralph SperschneiderVirgilio Bacigalupo

This contribution requests the creation of a new profile “HD-AAC” that includes the MPEG-4 AAC-LC and SLS audio object types and supports up to two channels, 96 kHz sampling rate and 24 bit word length. The tools in the SLS audio object type were restricted to exclude CBAC. Eunmi Oh, Samsung, questioned the complexity figures provided by FhG to motivate exclusion of the CBAC tools and indicated that Samsung internal results are different. The Chair requested that FhG make available to the group its complexity data and that, if possible, Samsung make available to the group its complexity data. The Chair proposes that the profile definition, as in the contribution, be reviewed by Requirements. However, a consensus position in the Audio Subgroup on the complexity figures that motivate the tool exclusion must be reached prior to inclusion of the profile into a Working Draft of a new amendment to 14496-3. DiscussionAfter some discussion in closing Audio plenary, it was determined that if both ALS and SLS profiles issue as WD at this meeting, they will progress to FDAM at the 89th meeting and the progressing to PDAM at this meeting does not advance the FDAM date. It is the understanding of the Audio Subgroup experts that additional information on the performance of CBAC will be available at the 86th meeting and the issue of whether or not this tool is included in the proposed SLS profile will be resolved at that meeting.CorrigendumRalf Geiger, FhG, presentedm15629 Proposed Corrigendum on MPEG-4 SLS Ralf Geiger

Ralph SperschneiderVirgilio Bacigalupo

This document requests technical changes: Change coding of lle_ics_length in the text to conform to coding of lle_ics_length as in

the reference software. Change the function of read_bits() in the reference software so that it conforms to what is

stated in the text and change one conformance stream and decoded waveform.

Eunmi Oh, Samsung, noted that the proposed fix for read_bits() may not be actually correct. Samsung experts will review and get back to the group. The Chair suggested that, after further break-out work, the proposed DCOR text be brought forward for further review. Andreas Schneider, Dolby, presentedm15638 proposed correction to MPEG-2 SBR conformance Andreas Schneider

This contribution pointed that certain conformance data for MPEG-4 AAC (in which SBR is used) were corrected, but that the corresponding conformance data in MPEG-2 AAC were never corrected. It was the consensus of the Audio Subgroup to issue at this meeting a DCOR to MPEG-2 Conformance.Andreas Schneider, Dolby, presentedTm15640 proposed correction of the MPEG-4 HE AAC v2 profile Andreas Schneider

his contribution pointed out that mono and stereo mixdown elements are excluded from AAC and HE-AAC profiles but not HE-AAC V2 profile. It is proposed that this restriction be added to the V2 profile. It was the consensus of the Audio Subgroup to issue at this meeting a DCOR to MPEG-4 Audio.

144

Page 145: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Andreas Schneider, Dolby, presentedm15639 proposed clarification for MPEG-2 and MPEG-4 AAC Andreas Schneider

Karsten Linzmeier

The contribution proposes to align the MPEG-2 and MPEG-4 text with the reference software. It was the consensus of the Audio Subgroup to issue at this meeting a DCOR to MPEG-2 Audio and to add the correction for this issue to the DCOR to MPEG-4 Audio associated with the previous contribution.960/1024 block lengthAndreas Schneider, Dolby, presentedm15641 further considerations regarding the 960 transform length

in the AAC profile, the HE AAC profile and the HE AAC v2 profile

Andreas Schneider

The contribution reviewed the external specifications that use one block length or the other the reference source code available (or not available) for guiding implementers risk of leakage of content from block-length “island” to the other

Ralph Sperschneider, FhG, presentedm15645 Overhead consideration for an AAC decoder that supports

both frame lengths compared to an AAC decoder that supports only one frame length

Bernhard GrillMarc GayerMarkus MultrusJohannes HilpertRalph Sperschneider

The contribution gives an analysis of the relative complexity of implementing 1024 vs 960 block length AAC decoders.DiscussionThere are two alternatives:

1. Remove 960 block length from the AAC and HE-AAC profile.2. Keep the current profiles and request that manufacturers implement both block lengths.

This education effort must include enhancing conformance and reference software to support 960 block length.

Dave Singer, Apple, noted that the real goal is to minimize the confusion in the marketplace when DAB 960 block length content becomes available in the “Internet download” marketplace. Experts should investigate whether DAB bitstreams can be saved to an external storage device.After considerable discussion, all agreed that one way forward is:

Minimize the number of 960 block length bitstreams Maximize the number of new dual-block block length decoders

And that another way forward is: Restrict decoders to 1024 block lengths (960 for DAB and similar special applications) Educate 960 block length content producers that they must transocode to 1024 block

length in order to make content available outside their integrated systems.

Yasushige Nakayama, NHK, presentedm15598 Proposal of work plan for a 3D multichannel audio

configuration Yasushige Nakayama

This contribution notes external standards that specify mapping from audio channels to loudspeakers. It further presented relevant points from the draft technical broadcasting standard from ARIB (Japan), which include 22.2 channel broadcast using AAC or ALS coding technology. To support this initiative, NHK requests that MPEG extend AAC-LC to signal a mapping of up to 22.2 audio channels to the associated loudspeakers. In order for this to align with the ARIB schedule, they request FDIS by October 2009.Audio Chair noted that it would be most desirable to have a general audio channel to loudspeaker mapping, and that a BIFS scene might support such a channel mapping.Karlheinz Brandenburg, Fraunhofer Ilmenau, noted that there are many loudspeaker configuration that should be addressed, including:

145

Page 146: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

5.1 7.2 16 channel (US theatres) 22.2 (NHK) Higher-order Ambisonics Wavefield synthesis

Juergen Schmidt, Thomson, made a presentation on how Audio BIFS might be used to address audio channel to output loudspeaker mappings. BIFS is a tree-based organization. The BIFS stream specifies the node tree and the mapping and further manages the connections from compressed stream to audio composition.A rather free-form set of notes from that presentation follows:

AudioChannnelConfig nodeLabels input channelsSee Channel Configuration Table in 14496-3Parametric channel oriented configuration

Generic format with no limitationsBIFS scene decoder can be unbounded in complexity. However profiles and levels can be used to limit complexity to be very low. This could use the following

Sound2D AudioChannelConfig AudioSource

Decoder instancesVery simple BIFS decoderSee Abv3_Aac06a.txt BIFS conformance dataWay forwardDefine new profile and level to meet this application and keep bitrate and complexity low.Streaming format

Use MPEG-4 Systems Use “Carousell” to enable break-in

The Audio Chair noted that it would be appropriate to ask the Systems Chair if Laser can support 22.2 channel audio.It was decided to create an AhG mandate to continue this work, consisting of at least the following interested experts:

Jurgen Schmidt, ThomsonSchuyler Quackenbush, Audio Research LabsKarlheinz Brandenburg, Fraunhofer Ilmenau,Yasushige Nakayama, NHK Tilman Liebchen, LG

5.2.2 MPEG-4 AAC Enhanced Low Delay

Markus Schnell, FhG, presented a high-level overview ofm15637 Proposed ELD Verification Test Report Markus Schnell

The Chair indicated that he plans for a detailed presentation of the report results in closing plenary. The report was made available to all interested experts for a more careful review.Markus Schnell, FhG, presentedm15642 Update on Conformance for AAC-ELD Markus Schnell

This contribution proposes Correction of frame length flag in one conformance data file (frame length is signalled as

512, but is actually 480) New naming convention for existing AAC-LD conformance data

And also proposed to create new conformance data files for the 480 frame lengthIt was the consensus of the Audio Subgroup to

146

Page 147: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Issue the renaming corrections for AAC-LD as a DCOR Issue the remaining propoals as a Study on ISO/IEC 14496-4:2004/FDAM XX, AAC-

ELD, OAFI and additional AAC Conformance

5.2.3 MPEG-D SAOC

Yang-Won Jung, LG, presentedm15588 Comments on SAOC operation modes and module Yang-Won Jung

Henney Oh

This contribution notes that there a number of “missing” modes in SAOC operation, and presents a table summarizing the currently supported modes and the proposed “missing” modes. Discussion was deferred until after the next presentation.Oliver Hellmuth, FhG. presentedm15634 Proposal for adoption of additional downmix / upmix

scenarios for the MPEG SAOC systemhel @ iis.fraunhofer.dehlp @ iis.fraunhofer.demun @ iis.fraunhofer.defch @ iis.fraunhofer.detev @ iis.fraunhofer.deheiko.purnhagen @ dolby.comJENGD @ dolby.comjeroen.koppens @ philips.com

This contribution proposes adding the same modes to SAOC as was proposed in the previous contribution. A mathematical description of the SAOC processing for the missing modes was presented. Furthermore, it proposes to explicitly describe how a SAOC decoder can be connected to a MPEG Surround decoder by means of a stream of unquantized parameters. Listening test data was presented that show that two of the missing modes implemented via an SAOC decoder operate as well as the implementation of SAOC transcoder plus MPEG Surround decoder. All modes except stereo to binaural have software implementation.It was the consensus of the Audio Subgroup that all modes except stereo to binaural processing be incorporated into the CD text. Also, that the text describing how a SAOC decoder can be connected to a MPEG Surround decoder by means of a stream of unquantized parameters be incorporated into the CD text.Leonid Terentiev, FhG, presentedm15636 Proposal for the file-based storage of rendering

information for the MPEG SAOC systemJonas EngdegårdLeonid TerentievCornelia FalchOliver HellmuthJohannes Hilpert

The contribution presents a format for storing and retrieving the rendering information. It was the consensus of the Audio Subgroup to incorporate this text into the SAOC CD text.Osamu Shimada, NEC, presentedm15576 A Proposal on preset functionality for teleconferencing

applicationOsamu ShimadaKeisuke MasatokiToshiyuki NomuraAkihiko SugiyamaOsamu HoshuyamaHiroshi Saruwatari

This contribution proposes to derive lateralization from the “cross-talk” of participants in a multi-microphone teleconferencing setting. It proposes that presets should be dynamic in time and frequency. This permits the decoder to reconstruct the original microphone signals, but still enable manipulation of the actual objects.The Audio Subgroup welcomes additional information at the next meeting.Leonid Terentiev, FhG, presentedm15633 Additional information on the mono downmix mode for the

enhanced Karaoke / Solo processing for the MPEG SAOC system

Leonid TerentievCornelia FalchOliver HellmuthJohannes Hilpert

FhG will be able to provide processed signals for informal listening. The Chair suggested that this topic be brought up again Thursday afternoon of the MPEG week. Heiko Purnhagen, Dolby, presented

147

Page 148: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

m15635 Report on corrections for the MPEG SAOC CD text Leonid TerentievCornelia FalchOliver HellmuthJohannes HilpertHeiko PurnhagenJonas EngdegårdJeroen Koppens

This contribution presents Editorial corrections to the SAOC CD text. Minor technical “bug-fixes” to the CD text. New syntax text that clearly indicates that meta-data strings can be UTF-8 byte strings. Proposed solution for instability in SAOC processing noted at the 84th Meeting.

It was the consensus of the Audio Subgroup to adopt all aspects of the contribution into the SAOC CD text.Yang-Won Jung, LGE, presentedm15587 Proposed syntax revision for redundancy reduction in

SAOCYang-Won JungHenney Oh

The contribution proposes syntax revisions that reduce the number of required bits in the SAOC audioSpecificConfig() header. Heiko Purnhagen, Dolby, noted that the proposed syntax was too restrictive and could not support the flexibility in the SAOC CD text. He agreed to draft alternate syntax text that captures the improvement in coding efficiency. This would include an example of bitrate savings in a typical application. No decision on this contribution is made at this time, but the topic will be revisited when the group reviewed the text from Dolby.After subsequent discussion, it was the consensus of the Audio Subgroup to adopt the technology as it was in the presentation to the Audio Subgroup. Jeongil Seo, ETRI, presentedm15592 Proposal for support of the additional downmix by MPEG

SAOCJeongil SeoSeungkwon BeackKyeongok KangKwangki Kim

The proposal shows how to adapt the artistic downmix gain concept from MPEG Surround to SAOC for the purpose of using a “Mastered Mix” to drive a SAOC decoder. The concept is proposed to be a new core experiment. A break-out will discuss the proposal and report back. Osamu Shimada, NEC, presentedm15575 Listening test results of Test 2 for SAOC CE on

separating real-environment signals into multiple objectsOsamu ShimadaToshiyuki NomuraAkihiko SugiyamaOsamu Hoshuyama

5.2.4 MPEG-D Unified Speech and Audio

The Audio Chair presentedm15644 Draft Report on Unified Speech and Audio Coding Call for

Proposals SubmissionsSchuyler Quackenbush

He envisions that summary statements and figures capturing the consensus view of the outcome of the CfP listening tests be inserted into this framework, that listening test data be included in the zip archive and the document issues as an output. The Audio Chair presented the FNB commentm15551 FRNB comment on the Unified Speech and Audio Coding

Exploration ActivityPierrick Philippe on behalf of the FRNB

The Chair drafted response to this NB comment and this was presented and discussed. There was no consensus that it fully addressed the FNB comment. It was decided to address the issues raised as an AhG activity with the goal of fully addressing the concerns raised at the next MPEG meeting.The Audio Chair noted that the following KNB comment was presented in Wednesday plenary session.m15720 KNB comment on the Unified Speech and

Audio Coding Exploration ActivityJeong Ju Yoo on behalf of the KNB

148

Page 149: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

The Chair drafted response to this NB comment and this was discussed and accepted by representatives of the Korean National Body, based largely on the text in the Report on Unified Speech and Audio Coding Call for Proposals Submissions.The Audio Chair presented a version of the Report on Unified Speech and Audio Coding Call for Proposals Submissions that contained an analysis of the subjective data and an identification of the RM0 technology. As a first action in the RM process, a possible merge of sys2 and sys4 technology into sys10 will be investigated. After considerable discussion on key paragraphs in this document, a final version was achieved that gained the consensus approval of the Audio Subgroup. Discussions in final plenaryThe Audio Chair presented text that described a process that audio experts could use to develop a reference-quality USAC Reference Source Code encoder. Schuyler Quackenbush, Audio Research Labs, expressed the following point of view on this topic:

At the 86th MPEG meeting the following reference source code will be available for the Unified Speech and Audio Coding work:

Normative reference source code for the decoding process. Informative reference source code for the encoding process. Encoder source code

need only produce a bitstream that is consistent with the normative decoder bitstream syntax.

Subsequent to the 86th meeting, audio experts can use the encoder source as a foundation for an open and cooperative software development project in which source code modules from HE-AAC V2, AMR-WB+ and RM0 normative decoder source code are incorporated into the Encoder reference source code.Typically, in current practice, CE modules under test are incorporated into the RM0 proponent encoder via object-modules conforming to an agreed-upon API. The process outlined above for creating USAC encoder reference software can result in the USAC reference software encoder being as good as or better than the RM0 proponent encoder.

Experts from Huawei, Ericsson, Nokia, LG, France Telecom R&D, Qualcomm and Polycom, expressed the following point of view on this topic:

In order to facilitate the integration of relevant technologies in Core experiments, they should be conducted on a single reference source code for both encoder and decoder that is made available for each interested party in the MPEG audio subgroup. This reference code should be of sufficient quality and available before the start of the core experiment process. The encoder code should have the following characteristics: Part of the encoder code should be made available (e,.g. signal path, MDCT, CELP

parts) Parts which are felts sensitive by the proponents of the RM0 could be delivered as

libraries with the appropriate API and kept up to date with the latest Reference Model. The binary modules could be replaced by source code if felt appropriate.

The Audio Chair notes that there are WG11 documents that define the CE process and hence CE work can commence at the 86th meeting.

6 Meeting deliverables

7 Responses to Liaison and NB commentsThe responses to Liaison and NB comments were prepared and approved.

7.1 Recommendations for final plenaryThe Audio recommendations were presented and approved.

149

Page 150: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

7.2 Establishment of Ad-hoc GroupsThe following ad-hoc groups were established by the Audio subgroup:

No. Title Mtg10053 AHG on Audio Standards Maintenance No10054 AHG on Unified Speech and Audio Coding and SAOC Yes

7.3 Approval of output documentsAll output documents, shown in Annex D, were presented in Audio plenary and were approved.

7.4 Press statementThe Audio contribution to the press statement was reviewed and approved via email.

8 Future activities8.1 Schedule of future meetingsAd Hoc group meetings are indicated in Section 7.2. Unless otherwise indicated, Ad Hoc group meetings will be held at the location of the next MPEG meeting on the weekend preceding that meeting.

8.2 Agenda for next meetingThe agenda for the next MPEG meeting is shown in Annex E.

8.3 All other businessThere was none.

8.4 Closing of the meeting The 85th Audio Subgroup meeting was adjourned Friday at 14:00. Run to lunch!

150

Page 151: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Annex A Participants

First Name Last Name Country AffiliationVirgilio Bacigalupo Germany Fraunhofer IISJohannes Boehm DE ThomsonTi Eu Chan SG I2RSascha Disch Germany Leibniz University HanoverBernd Edler DE University of HannoverRoni Even IL PolycomQi Fengyan Germany Huawei TechnologiesLouis Fielder USA Dolby LabsRalf Geiger DE Fraunhofer IISPhilippe Gournay Canada VoiceAge Corp. / Univ. of SherbrookeBernhard Grill DE Fraunhofer IISOliver Hellmuth DE Fraunhofer IISJin Woo Hong KR ETRIHaibin Huang SG I2RYang-Won Jung KR LG ElectronicsKyeong Ok Kang Korea ETRIFlorian Keiler Germany ThomsonKei Kikuiri JP NTT DOCOMODong Soo Kim KR LG ElectronicsJunghoe Kim KR Samsung AITKristofer Kjörling SE DolbyTaejin Lee KR ETRIRoch Lefebvre Canada VoiceAge Corp. / Univ. of SherbrookeTerentiev Leonid DE Fraunhofer IISTe Li SG I2RTilman Liebchen DE LG ElectronicsTakehiro Moriya JP NTTMarkus Multrus DE Fraunhofer IISFrederik Nagel Germany Fraunhofer IISYasushige Nakayama JP NHKMax Neuendorf Germany Fraunhofer IISToshiyuki Nomura JP NECTakeshi Norimatsu JP PanasonicEunmi Oh KR SamsungHenney Oh KR LG ElectronicsWerner Oomen NL Philips Applied TechnologiesHee-Suk Pang KR Sejong UniversityHochong Park KR Kwangwoon UniversityYoung-cheol Park KR Yonsei UniversityPierrick Philippe FR France Telecom R&DHeiko Purnhagen SE DolbySchuyler Quackenbush USA ARLYuriy Reznik USA QualcommRedwan Salami Canada VoiceAge Corporation

151

Page 152: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Andreas Schneider DE DolbyMarkus Schnell DE Fraunhofer IISStephan Schreiner Germany Fraunhofer IISGerald Schuller Germany Fraunhofer IDMTJeongil Seo KR ETRIOsamu Shimada JP NECRalph Sperschneider DE Fraunhofer IISAkihiko Sugiyama JP NECHerve Taddei DE Huawei TechnologiesAnisse Taleb SE Ericsson ABTai Hou Tng Singapore Institute for Infocomm ResearchDavid Virette FR France Telecom R&DOliver Wuebbolt DE ThomsonSungyong Yoon KR LG ElectronicsHuan Zhou SG Panasonic Singapore LaboratoriesYongwei Zhu Singapore Institute for Infcomm Resarch

152

Page 153: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Annex B Audio Contributions and Schedule

Day / Time Task GroupSunday0900-1500 AhG: Unified Speech and Audio Codingm15545 Speech and Audio listening test lab report from Dolby Kristofer Kjörling

m15618 Speech and Audio listening test lab report (Ericsson) Anisse Taleb

m15570 Unified speech and audio coding listening test report from ETRI Taejin Lee

m15678 Audio Report on the subjective testing of Unified Speech and Audio Coding proposals at Huawei

Seungkwon Beack

m15655 France Telecom listening test results for the CfP on Unified Speech and Audio Coding

Minje Kim

m15619 Report on USAC Subjective Tests at Fraunhofer IIS Test Site Kyeongok Kang

m15581 Report on the USAC listening test at LGE Lijing Xu

m15548 Speech and Audio listening test lab report from Philips Herve Taddei

m15567 Listening test results for Unified Speech and Audio Coding from Samsung Pierrick Philippe

m15565 Speech & Audio - Listening Test, Report & Results - Thomson David Virette

m15608 Report on USAC subjective tests at VoiceAge test site Markus Multrus

m15547 Technical description of the Dolby Philips proposal for the speech and audio work- item

Ralf Geiger

m15568 Technical description of the ETRI proposal for the unified speech and audio coding

Dong Soo Kim

m15621 Technical Decsription of the Fraunhofer IIS Submission for the CfP on USAC Sungyong Yoon

m15582 LGE submission to Unified Speech & Audio Coding Jaehyun Lim

m15564 Response to CfP on unified speech and audio coding Hyun-Kook Lee

m15566 Speech & Audio - Description of Technology of the Thomson proposal Henney Oh

m15609 Technical Description of the VoiceAge Candidate for USAC Yang-Won Jung

1200-1300 Lunch Werner Oomen

m15546 Analysis of speech and audio listening test data Jeroen Koppens

m15613 Analysis of unified speech and audio coding listening test results Eunmi Oh

m15656 France Telecom contribution to the analysis of the listening test results following the CfP on Unified Speech and Audio Coding

Miyoung Kim

m15625 Analysis of USAC Listening Test Data JungHoe Kim

m15610 Analysis of Combined Listening Test Results for USAC Johannes Boehm

m15713 Audio USAC listenig test analysis from LGE Oliver Wuebbolt

1500-1700 AhG: SAOC Florian Keiler

m15572 Listening Test Report for CE on separating real-environment signals into multiple objects from ETRI

Roch Lefebvre

m15632 Listening test report for CE on separating real-environment signals into multiple objects for the MPEG SAOC system

Kristofer Kjörling

m15586 Listening test reports for CE on separating real-environment signals Werner Oomen

m15574 Listening test results of Test 1 for SAOC CE on separating real-environment signals into multiple objects

Jonas Samuelsson

Review of AhG Report Lars Villemoes

m15500 Ad Hoc Group on Unified Speech and Audio Coding, SAOC and AAC-ELD Barbara Resch

1800- Chairs Meeting (K17) Erik Schuijers

Pontus Carlsson

153

Page 154: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Monday Taejin Lee

0900-1230 MPEG Plenary Seungkwon Beack

1300-1400 Lunch Minje Kim

1400-1430 Audio Plenary Kyeongok Kang

Welcome Hochong Park

Sunday Chairs meeting Youngcheol Park

Main topics for the week Markus Multrus

General Ralf Geiger

m15643 84th MPEG Audio Report Bernhard Grill

m15499 Ad Hoc Group on Audio Standards Maintenance Nikolaus Rettelbach

m15500 Ad Hoc Group on Unified Speech and Audio Coding, SAOC and AAC-ELD Max Neuendorf

1430-1700 Plenary Activities Dong Soo Kim

USAC Sungyong Yoon

m15644 Draft Report on Unified Speech and Audio Coding Call for Proposals Submissions

Jaehyun Lim

m15551 FRNB comment on the Unified Speech and Audio Coding Exploration Activity Hyun-Kook Lee

Liaison Responses Eunmi Oh

Audio Reference SoftwareALS – FTP on TUBSLS – at I2R, move to SVNMPEG7 – FhG-IDMTMPS – FhGSAOC – FhG

JungHoe Kim

MXM (m15688, m15689) Miyoung Kim

DoC and Text on 14496-5:2001/FDAM 20 KiHyun Choo

ER (e.g. ER AAC-LD) bistream syntax specification vs reference code implementation

Hosang Sung

1630-1700 MPEG-4 ALS Florian Keiler

m15660 Proposal for MPEG-4 ALS profile/levels Oliver Wuebbolt

m15715 Comment on proposal for MPEG-4 ALS profiles/levels Johannes Boehm

1800- HoD Meeting Roch Lefebvre

Tuesday Werner Oomen

0900-1300 MPEG-2 and MPEG-4 Kristofer Kjörling

m15637 Proposed ELD Verification Test Report Heiko Purnhagen

m15628 Proposed new profile for MPEG-4 SLS Taejin Lee

m15629 Proposed Corrigendum on MPEG-4 SLS Seungkwon Beack

m15640 proposed correction of the MPEG-4 HE AAC v2 profile Minje Kim

m15638 proposed correction to MPEG-2 SBR conformance Kyeongok Kang

m15639 proposed clarification for MPEG-2 and MPEG-4 AAC Pierrick Philippe

m15642 Update on Conformance for AAC-ELD David Virette

m15588 Comments on SAOC operation modes and module Frederik Nagel

m15634 Proposal for adoption of additional downmix / upmix scenarios for the MPEG SAOC system

Ralf Geiger

154

Page 155: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

1300-1400 Lunch Max Neuendorf

m15641 further considerations regarding the 960 transform length in the AAC profile, the HE AAC profile and the HE AAC v2 profile

Markus Multrus

m15645 Overhead consideration for an AAC decoder that supports both frame lengths compared to an AAC decoder that supports only one frame length

Roch Lefebvre

SAOC Dong Soo Kim

m15636 Proposal for the file-based storage of rendering information for the MPEG SAOC system

Sungyong Yoon

m15576 A Proposal on preset functionality for teleconferencing application

1900- Chairs Meeting (K17) Jeongil Seo

Seungkwon Beack

Wednesday Kyeongok Kang

0900-1100 MPEG Plenary Leonid Terentiev

1100-1200 Joint Meeting at PlenaryMPEG for Gaming

Cornelia Falch

1200-1300 MPEG-2 Oliver Hellmuth

m15598 Proposal of work plan for a 3D multichannel audio configuration Johannes Hilpert

1300-1400 Lunch Yang-Won Jung

1400 USAC Henney Oh

m15720 KNB comment on the Unified Speech and Audio Coding Exploration Activity

Dong Soo Kim

1400-1600 SAOC Sungyong Yoon

m15633 Additional information on the mono downmix mode for the enhanced Karaoke / Solo processing for the MPEG SAOC system

Osamu Shimada

m15635 Report on corrections for the MPEG SAOC CD text Toshiyuki Nomura

m15587 Proposed syntax revision for redundancy reduction in SAOC Akihiko Sugiyama

m15592 Proposal for support of the additional downmix by MPEG SAOC Osamu Hoshuyama

1600-1700Joint Meeting at AudioInteractive Music AF m15710 (req), m15626 (tech)

1730-1830 Core Experiment framework for USAC S. Quackenbush, Eunmi Oh

1900 - Social (starts 19:30)

Thursday0900-1300 Task Group Activities

USAC1100-1130 Review of proposed ALS and SLS profiles

USAC1200-1300 Open issues in SAOC CEs

1300-1400 Lunch

1700-1730 Review of N9900 New Directions in Audio Standardization Schuyler Quackenbush

1500-1600 USAC output documents R. Sperschneider

Open issues in SAOC CEs S. Quackenbush, Eunmi

155

Page 156: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Ohm15575 Listening test results of Test 2 for SAOC CE on separating real-environment

signals into multiple objects

1800- Chairs Meeting Schuyler Quackenbush

Pierrick Philippe on behalf of the FRNB

Friday0900-1300 Audio plenary

Remarks on Thursday Chairs meetingRecommendations for final plenaryEstablishment of new Ad-hoc groupsAhG MandatesGet document numbers Takehiro Moriya

1000 Approve Responses to NB comments and Liaison Tilman Liebchen

1030 Approval of output documents Yasushige Nakayama

Review of Audio presentation to MPEG plenary Toshiyuki Nomura

Agenda for next meeting Noboru Harada

A.O.B. Yutaka Kamamoto

Closing of the Audio meeting Yuriy Reznik

1300-1400 Lunch1400- MPEG Plenary

156

Page 157: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Annex C Task Groups

1. MPEG-2 and MPEG-4 audio, conformance, reference software2. MPEG-4 AAC-ELD Verification Test Report3. MPEG-D SAOC4. MPEG-D Unified Speech and Audio Coding

157

Page 158: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Annex D Output DocumentsNo. Title TBP Available

13818-4 Conformance testing10030 ISO/IEC 13818-4:2004/Amd 2:2005/DCOR 2, AAC Conformance No 08/07/25No. Title TBP Available

13818-7 Advanced Audio Coding10031 ISO/IEC 13818-7:2006/DCOR 1, AAC No 08/07/25No. Title TBP Available

14496-3 Audio10032 Report on the Verification Test of MPEG-4 AAC-ELD YES 08/07/2510033 WD on New Profiles for ALS and SLS No 08/07/2510034 ISO/IEC 14496-3:2005/DCOR 6, AAC No 08/07/25

10035 ISO/IEC 14496-3:2005/AMD 2:2006/DCOR 4, HE-AAC V2 Profile

No 08/07/25

10036 ISO/IEC 14496-3:2005/AMD 3:2006/DCOR 2, SLS No 08/07/2510037 ISO/IEC 14496-3:2005/AMD 9:2008/DCOR 1, AAC-ELD No 08/07/25

10038 Considerations regarding the 960 frame length in the MPEG-4 AAC family of profiles

No 08/07/25

No. Title TBP Available14496-4 Conformance testing

10039 ISO/IEC 14496-4:2004/DCOR 6, AAC-LD No 08/07/25

10040 Study on ISO/IEC 14496-4:2004/PDAM 36, AAC-ELD, OAFI and additional AAC Conformance

No 08/07/25

10041 Proposed ISO/IEC 14496-26, Audio Conformance No 08/07/25

10042 ISO/IEC 14496-4:2004/AMD13:200x/DCOR 2, AAC-LD bitstreams

No 08/07/25

No. Title TBP Available14496-5 Reference Software

10043 DoC on ISO/IEC 14496-5:2001/FPDAM 20, MPEG-1 and -2 on MPEG-4

No 08/07/25

10044 ISO/IEC 14496-5:2001/FDAM 20, MPEG-1 and -2 on MPEG-4 No 08/09/12No. Title TBP Available

23003-2 SAOC10045 Study on ISO/IEC CD 23003-2:200x, Spatial Audio Object Coding No 08/08/1510046 Status and Workplan on SAOC Core Experiments No 08/07/25No. Title TBP Available

23003-3 Unified Speech and Audio Coding10047 Report on Unified Speech and Audio Coding Call for Proposals No 08/07/25No. Title TBP Available

Liaison Statements10049 Liaison response to ITU-T SG 16 (m15261, m15516) No 08/07/2510050 Liaison response to IEC TC 100 (m15555) No 08/07/25No. Title TBP Available

Responses to National Bodies10051 Response to FNB (m15720) No 08/07/2510052 Response to KNB (m15551) No 08/07/25

158

Page 159: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Annex E Agenda for the 86th MPEG Audio Meeting

Agenda Item1. Opening of the meeting2. Administrative matters

2.1. Communications from the Chair2.2. Approval of agenda and allocation of contributions2.3. Review of task groups and mandates2.4. Approval of previous meeting report2.5. Review of AhG reports 2.6. Joint meetings2.7. Received national body comments and liaison matters

3. Plenary issues4. Task group activities

4.1. Spatial Audio Object Coding4.2. Unified Speech and Audio Coding4.3. MPEG Maintenance, including MPEG-1, MPEG-2, MPEG-4, SMR and MPEG

Surround issues5. Discussion of unallocated contributions6. Meeting deliverables

6.1. Responses to Liaison and NB comments6.2. Recommendations for final plenary6.3. Establishment of new Ad-hoc groups6.4. Approval of output documents6.5. Press statement

7. Future activities8. Agenda for next meeting9. A.O.B10. Closing of the meeting

159

Page 160: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Annex J – 3DG report

Source: Marius Preda (Institut TELECOM)

3DGC meeting reportHannover, July, 2008

1 Opening of the meeting

1.1 Approval of the agendaThe agenda is approved.

1.2 Goals for the weekThe goals of this week are: Review SC-3DMC contributions and issue the associated WD (2.0) and CE Discuss the software status for SC-3DMC Review the votes Discuss FAMC RefSoftware and Conformance Compile and test reference software Check the validity and re-generate when necessary conformance data for 3DGC Investigate future developments of MPEG 3D Graphics Compression

1.3 Standards from 3DGC

Std Pt Edit Project Description CfP WDCD/

PDAMFCD/

FPDAMFDIS/FDAM

4 4 2004 Amd.32 FAMC conformance

07/04 07/10 08/04 08/10

4 4 2004 Amd.33 Multiresolution profile conformance

07/04 07/10 08/07 09/01

4 4 2004 Amd.34 3DGC Model Conformance

08/01 08/07 09/01

4 4 Amd.39 Scene partitioning Conformance

4 5 2001 Amd.21 FAMC reference software

07/04 07/10 08/04 08/10

4 5 2001 Amd.22 3DG Compr. Model RefSof

06/07 08/01 08/07 09/01

4 5 Amd.25 Scene partitioning Reference software

4 11 2005 Amd.6 Scene partitioning 08/01 08/04 08/104 16 2006 Amd.2 FAMC 07/07 08/01 08/07

160

Page 161: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Std Pt Edit Project Description CfP WDCD/

PDAMFCD/

FPDAMFDIS/FDAM

4 16 2006 2nd Ed./Cor.2

Node code table 08/07 09/01

4 16 2006 Amd.4 Scalable Complexity 3D Mesh Compression

08/01 08/10 09/01 09/07

4 16 200x 3rd Ed. AFX 07/10 09/01Delayed to

include the COR

4 25 200x 1st Ed. 3D Graphics Compression model

07/04 07/10 08/04 08/10

1.4 Room allocation3DGC: K17

161

Page 162: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

1.5 Allocation of contributionsN° Title Schedule ActivityD1 Monday D1

MPEG Plenary D1 09:00~11:30

MPEG General

Lunch Break D1 13:00~14:00

3DG Plenary D1 14:00~15:30

3DG General

Roll call, Agenda, Goals, FAQ, etc., Marius PredaStatus of www.mpeg-3dgc.com Patrick Gioia

m15498 Report of AHG on 3DGC documents, experiments and software maintenance

Patrick Gioia, Francisco Moran

Validation session for the RefSoft and test data set

Francisco Moran

Validation session for the conformance data set Marius Preda

Results of voting- ISO/IEC 14496-16:2006/FPDAM 2: (approved).

Marius Preda

Coffee Break 15:30~16:00

3DG Plenary 16:00 – 18:00

m15441 Reference software of OctreeImage and PointTexture

Jeong-Hwan AhnShinjun LeeGyeongJa Jang

m15683 Encoder implementation for FAMC Khaled Mamou

162

Page 163: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

N° Title Schedule ActivityTitus Zaharia Marius Preda Francoise Prêteux

m15691Source code for WaveletSubdivisionSurfaces and Footprint-Based Coding

Patrick GioiaJérôme Royan

MPEG-V and impact on group activities Marius PredaMXM and impact on group activities Marius Preda

D2 Tuesday D2Scalable Complexity 3D Mesh Encoding (SC-3DMC)- review of the contributions

D2 09:00~12:00

m15669 A Proposal on Quantization of 3D Mesh Coding

Seungwook LeeBonki KooByungjun KimEuee S. Jang

09:00-09:30

m15654 Update of the API for 3D compression benchmark with MyMultimediaWorld.com

Benoit le BonhommeMarius PredaFrancoise Prêteux

9:30-10:00

Joint meeting on MXM and MPEG-V (Systems room) 10:00 – 11:30

m15654 Update of the API for 3D compression benchmark with MyMultimediaWorld.com (continuation of discussion)

Benoit le BonhommeMarius PredaFrancoise

11:30-12:00

163

Page 164: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

PrêteuxLunch Break D2 12:00~14:00Scalable Complexity 3D Mesh Encoding (SC-3DMC) D2 14:00~18:00

m15653 TFAN software description

Khaled MamouTitus ZahariaMarius PredaFrancoise Prêteux

14:00-14:30

m15667 CE Report on the SC3DMC

Seungwook LeeBonki KooDaiyong KimEuee S. Jang

14:30-15:30

Coffee Break 15:30~16:00

m15668 Editorial and Technical Comments on Working Draft of SC3DMC

Seungwook LeeBonki KooKyoungsoo SonEuee S. Jang

16:00-18:00

Fixes on the reference software Francisco Moran

D3 Wednesday D3MPEG Plenary D3 09:00~11:00Gaming (joint with all MPEG) 11:00~12:00

Lunch Break D3 13:00~14:00MXM Architecture and API Filippo

Chiariglione 14:00 – 15:40Coffee Break 15:40~16:00AFX Amd.4 16:00 – 16:30

164

Page 165: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

SC-3DMC Editing Seungwook LeeP25 Jeong-Hwan 16:30~17:00P25 ref software status and conformanceAFX Amd.4 16:30~17:30SC-3DMC Editing Seungwook LeeMPEG-V 17:30~18:00Clarification on the interchange format between the virtual worlds Jean Gelissen

D4 Thursday D4AFX Reference Software 09:00~09:30Results on checking reference encoders 3DMC Daiyong Kim

Amd.4 SC-3DMC 09:30~10:30Edit of the CE SeungwookRefSoft and Conformance 10:30~11:30

Test the RefSoft on the test data Francisco Moran

Plan for the conformance data Marius PredaAmd.2 FAMC Marius Preda 11:30 – 12:00Part 16 AMD2 FAMC EditingLunch BreakP11 SP Patrick Gioia 15:00 – 15:10ISO/IEC 14496-11 AMD6Amd.3 MRZ Patrick Gioia 15:10 – 15:20MRZ Conformance P16 Marius Preda3rd Edition4 weeks editing period 15:20 – 15:30

SC-3DMC Editing WD2.0OK editor notes added for what is still to be done Seungwook 15:30 – 16:30

165

Page 166: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

P16 Cor1 Marius Preda 16:30 – 16: 45ISO/IEC 14496-16 2nd Ed. Amd.1 Cor.2 P25 Marius Preda3DGCM Conformance Editing3DGCM RefSoft Editing

D5 Friday D53DG output documents preparation D4 09:00~12:00 3DG General

AhGs and resolutions all

Lunch Break D5 12:00~14:00

MPEG Plenary D5 14:00~ MPEG General

166

Page 167: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

1.6 Attendance listName Country CompanyMarius Preda France Institut TELECOMPatrick Gioia France OrangeLabsFrancisco Morán Burgos Spain UPMKarsten Muller Germany FHG-HHISeung Wook Lee Korea ETRIEuee S. Jang Korea Hanyang Univ.Byoungjun Kim Korea Hanyang Univ.Mingxiao Chen Korea Hanyang Univ.Itaru Kaneko JapanJeong-Hwan Ahn Korea SamsungNikolce Stefanoski Germany University of Hannover

2 General issues

2.1 General discussion

2.1.1 Reference SoftwareIt is recalled that the source code of both decoder AND encoder should be provided as part of the Reference Software for all technologies to be adopted in MPEG standards. Moreover, not providing the complete software for a published technology shall conduct to the removal of the corresponding technical specification from the standard.

2.1.2 Web siteOrangeLabs proposed a new version of the web site, now available at www.mpeg-3dgc.com. The goal of the web site is to disseminate the group activities (documents, software and demonstration), to maintain the FAQ and to be active in providing answers through the use of the Forum. 3DGC contributors are kindly asked to check the web-site and provide comments.

3 AFX (14496-16) related activities

3.1 Scalable Complexity 3D Mesh Compression (14496-16 Amd.4)

Title A Proposal on Quantization of 3D Mesh CodingAuthors Seungwook Lee, Bonki Koo, Byungjun Kim, Euee S. Jang

Summary Quantization algorithm described in 3DMC specification is different from the one implemented in IM1. In addition the inverse quantization equations described in the 3DMC specification is wrong. This contribution recommends using the algorithm described in the textual document, to change IM1 accordingly and to correct the

167

Page 168: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

equations. The contribution also contains the corrected version of the reference software.

Resolution

Resolution: accepted.The corrections for 3DMC specification are integrated in the current Cor.2 of 14496-16:2006/Amd.D1. Update the proposed software on the SVN.

Title Update of the API for 3D compression benchmark with MyMultimediaWorld.com

Authors Benoit le Bonhomme, Marius Preda, Francoise Prêteux

Summary

A new version of the API is proposed in order to easy the use of MMW.com. The encoding parameters can be set in a textual file, that will be provided together with the encoder. The performances issues were discussed and libraries are compiled in Release mode

Resolution

Accepted. The new API will be used in the rest of experiments on SC-3DMC

Title TFAN software descriptionAuthors Khaled Mamou, Titus Zaharia, Marius Preda, Françoise Prêteux

SummaryThe contribution presents the encoder and decoder classes and diagrams for TFAN. It is currently integrated in MMW.com benchmarking platform.

Resolution Accepted.

Title CE Report on the SC3DMCAuthors Seungwook Lee, Bonki Koo, Daiyong Kim, Euee S. Jang

Summary

The contribution presents a set of evaluation results for the following methods: SVA with Binary Arithmetic Coding, SVA with Arithmetic Coding, SVA with Binarization, Quantization Only. The results were obtained on an independent platform.

Resolution

Accepted. Action Point: Integrate the encoder and decoder for the four methods in MMW.com API.

3.1.1 Scene partitioning (14496-11 Amd.6)SP is followed as a joint activity between Systems and 3DGC. The technology is integrated in Part 11. There was no joint meeting with Systems on this topic during this meeting.SP activity on conformance and reference software is initiated.

3.2 Profiles (14496-16 AMD3)

Title MRZ Conformance Authors Patrick Gioia,

Summary No need to update the textual specification; need to update the files (delay due to bugs in the refsoft)

168

Page 169: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Resolution Generate the bitstreams by using the current RefSoft

3.3 Maintenance

3.3.1 Node TemplatesTitle Node Templates V7 and V8

Authors

Summary

Generating IM1 code and BIFS tables from node templates is not solved yet. For this version of the reference software BIFS table was produced by Mikael's scripts but the IM1 code was created manually

Resolution

1. Use GPAC for creating the node tables2. Update GPAC for being able to generate IM1 code

3.3.2 AFX 3rd EditionThe document was updated during the week. However, editing is not finished (an editing period of 4 weeks was accepted).

3.4 Dataset and benchmarking

For Scalable Complexity 3D Mesh Coding, the www.MyMultimediaWorld.com will be used for benchmarking.

3.5 Software

Title Reference software of OctreeImage and PointTextureAuthors Jeong-Hwan Ahn, Shinjun Lee, GyeongJa Jang

Summary Donation of encoders for OctreeImage and PointTexture

Resolution

- compilation of the software is validated- accept the two encoders for becoming Reference Software- commit the two modules on the SVN (mandate of the AhG)

Title Encoder implementation for FAMCAuthors Khaled Mamou, Titus Zaharia, Marius Preda, Francoise Prêteux

Summary Donation of encoders for FAMC

Resolution

- compilation of the software is validated- accept the encoder for becoming Reference Software- commit the software on the SVN (mandate of the AhG)

Title Source code for WaveletSubdivisionSurfaces and Footprint-Based Coding

Authors Patrick Gioia, Jérôme Royan

Summary Donation of encoders for WaveletSubdivisionSurfaces and Footprint-Based Coding

169

Page 170: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

Resolution

- compilation of the software is validated- accept the encodesr for becoming Reference Software- commit the two modules on the SVN (mandate of the AhG)

Title Results on checking reference encoders 3DMCAuthors Daiyong Kim

Summary

3MDC correction was done in encoder and decoderFAMC Encoder was compiled and tested succesfulWSS encoder was compiledFootPrint Encoder was compiled

Resolution

Action Points:3DMC (similar for FAMC, WSS, FootPrint)1. Standalone version of the encoder to be committed in /MPEG-4/standalone/trunk/3D/AFX/3DMC-Encoder2. Standalone version of the decoder to be commited in /MPEG-4/standalone/trunk/3D/AFX/3DMC-Decoder3. Integrated version of the decoder to be commited in /MPEG-4/im1/trunk/IM1Decoders/AFX/3DMC4. Remove the 3DMC encoder from /MPEG-4/im1/trunk/IM1Decoders/AFX/3DMC5. Make the same correction in P25 (Encoder and Decoder)

Title Test the RefSoft on the test dataAuthors Francisco Moran

Summary

- encoders and players (debug and release) done- methods from NodeStub.cpp were commented- test files were regenerated (it lacks BBA, FootPrint, Shadow)

Resolution

Action Points:1. Francisco prepares the RefSoft2. Marius commits it on the SVN3. Commit corrections on the 3DMC (Daiyong)4. Khaled commits FAMC and the test files for FAMC5. Marius commits test files for BBA6. Patrick commits test files for FP

Title Plan for the conformance dataAuthors Marius Preda

Summary

Resolution

Action Points:- Create the Conformance directory on the SVN (Marius)- Create directories for each tool (Marius)- Each directory contains all the uncompressed files necessary to produce the bitstreams as well as the compressed files- Take all the conformance documents for AFX and check the status on each file- When files fail, initial providers are asked to regenerate them - Issue the corrigendum(s) for conformance

170

Page 171: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

3.6 PromotionsTitle Special Session on 3D Graphics Compression in ICIP08

Authors Karsten Muller, Marius Preda

Summary A special session on 3D graphics compression will be organized in ICIP 2008.

Resolution -

3.6.1 Web SiteTitle Status of www.mpeg-3dgc.com

Authors Patrick Gioia

Summary The web site is in beta version but no improvement was done since the last meeting

Resolution

Action Point:Patrick Gioia will ask more actively contributions for demos from individual parties.

3.7 Future

3.7.1 MPEG-V - Information Exchange with Virtual Worlds (formally Metaverse)

Title MPEG-V and impact on group activitiesAuthors Marius Preda

Summary Informal discussion on possible impact of MPEG-V activities on technologies developed by 3DG group

Resolution

Action Point:Actively monitor and participate in setting the current documents for this standard (currently in a Requirements phase)

3.7.2 MXM

Title MXM and impact on group activitiesAuthors Marius Preda

Summary Informal discussion on possible impact of MXM activities on technologies developed by 3DG group

Resolution

Action Point:Actively participate in proposing a complete API for accessing 3D graphics tools.

Title MXM Architecture and APIAuthors Filippo Chiarliglione

Summary - presentation of the Chillout Project- review of the 3D API for MXM

Resolution

Action Point:Investigate on the JAVA wrapper for the MPEG-4 GT (JNA/JNI may be the solution)Update the API for supporting creating 3D graphics content.

171

Page 172: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

3.7.3 Future of 3D Graphics CompressionNo discussion during the meeting in this issue

4 3D Graphics Compression Model (14496-25) activities

4.1 Textual specificationNo modifications.

4.2 Software and conformance

Title P25 ref software status and conformanceAuthors Jeong-Hwan Ahn

Summary FAMC encoder/decoder is included. Conformance document was updated to include FAMC and IC

Resolution

Action Points:Commit the MP25 on MPEG SVNIntegrate IC encode and decoderAdd the files for FAMC and IC in the conformance data set

5 Output documents and Resolutions of 3DGC

5.1 Part 4 Conformance testing

5.1.1 The 3DGC subgroup recommends approval of the following documentsNo. Title TBP Available

14496-4 Conformance testing10029 Text of ISO/IEC 14496-4:2004/FPDAM 33 (MultiResolution

Profile Conformance)No 08/07/25

10019 DOCR on ISO/IEC 14496-4:2004/PDAM 34 No 08/07/2510013 Text of ISO/IEC 14496-4:2004/FPDAM 34 (3DG Compression

Model Conformance)No 08/07/25

10014 Request for ISO/IEC 14496-4:2004/Amd.39 (Scene Partitioning Conformance)

No 08/07/25

5.2 Part 5 Reference Software

5.2.1 The 3DGC subgroup recommends approval of the following documentsNo. Title TBP Available

172

Page 173: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

14496-5 Reference Software 10023 DOCR on ISO/IEC 14496-5:2001/PDAM 22 No 08/07/2510015 Text of ISO/IEC 14496-5:2001/FPDAM 22 (3DG Compression

Model Reference Software)No 08/07/25

10016 Request for ISO/IEC 14496-5:2001/Amd.25 (Scene Partitioning Reference Software)

No 08/07/25

5.3 Part 16 Animation Framework eXtension (AFX)

5.3.1 The 3DGC subgroup recommends approval of the following documentsNo. Title TBP Available

14496-16 Animation Framework eXtension (AFX)10017 Text of ISO/IEC 14496-16:2006/FDAM 2 (Frame based Animation

Mesh Compression)No 08/07/25

10018 Text of WD2.0 of ISO/IEC 14496-16:2006/Amd.4 (Scalable Complexity 3D Mesh Compression)

No 08/07/25

10020 Study Text of ISO/IEC 14496-16:2006/Amd.1:2007/COR2 No 08/07/2510021 WD4.0 of AFX 3rd Edition No 08/08/2510022 CE on Scalable Complexity 3D Mesh Coding No 08/07/25

5.4 Part 25 3D Graphics Compression Model

5.4.1 The 3DGC subgroup recommends approval of the following documentsNo. Title TBP Available

14496-25 3D Graphics Compression Model

5.5 Establishment of 3DGC Ad-Hoc Groups10060 AHG on 3DGC documents, software maintenance and core experimentsMandate: 1. Ensure the correct migration of the Reference Software on the SVN provided

by Elysium2. Coordinate 3DGC related conformance and reference software3. Maintain and edit 3DGC documents 4. Coordinate editing of the www.mpeg-3dgc.com web site 5. Conduct the experiments in Scalable Complexity Mesh Compression

Chairmen: Patrick GioiaFrancisco Morán Burgos

Duration: Until 86th MeetingMeetings Sunday before 86th meetingReflector: mpeg-3dgc AT gti. ssr. upm. esSubscribe: http://www.gti.ssr.upm.es/mailman/listinfo/mpeg-3dgc

6 Closing of the MeetingSee you in Busan.

173

Page 174: 1 - IPSJ/ITSCJ Web view · 2008-09-0310061 Study of ISO/IEC 14496-12/PDAM.1 Misc. Addition to FF ... Images for 3D Video m15598 Miyoung Kim Proposal of work plan for a 3D multichannel

174