signal & image processing 1 thales communications france this document is the property of thales...

18
SIGNAL & IMAGE PROCESSING 1 THALES COMMUNICATIONS FRANCE nt is the property of THALES Communications, its content cannot be reproduced, disclosed or utilized without the Company's written SPEECH & IMAGE PROCESSING (TSI/LMM - Laboratoire MultiMédia) Contacts : Frédéric Chartier Tél : +33 1 46 13 31 05 Gwénaël Guilmin Tél : +33 1 46 13 28 35 Fax : +33 1 46 13 25 55 email : [email protected] up.com [email protected] .com

Upload: fay-neal

Post on 30-Dec-2015

216 views

Category:

Documents


3 download

TRANSCRIPT

Page 1: SIGNAL & IMAGE PROCESSING 1 THALES COMMUNICATIONS FRANCE This document is the property of THALES Communications, its content cannot be reproduced, disclosed

SIGNAL & IMAGE PROCESSING 1

THALES COMMUNICATIONS FRANCE

Thi

s do

cum

ent i

s th

e pr

oper

ty o

f THA

LES

Com

mun

icatio

ns, i

ts c

onte

nt c

anno

t be

repr

oduc

ed, d

isclo

sed

or u

tilize

d wi

thou

t the

Com

pany

's wr

itten

app

rova

l

SPEECH & IMAGE PROCESSING(TSI/LMM - Laboratoire MultiMédia)

SPEECH & IMAGE PROCESSING(TSI/LMM - Laboratoire MultiMédia)

Contacts :

Frédéric Chartier Tél : +33 1 46 13 31 05Gwénaël Guilmin Tél : +33 1 46 13 28 35

Fax : +33 1 46 13 25 55

email : [email protected] [email protected]

Page 2: SIGNAL & IMAGE PROCESSING 1 THALES COMMUNICATIONS FRANCE This document is the property of THALES Communications, its content cannot be reproduced, disclosed

SIGNAL & IMAGE PROCESSING 2

THALES COMMUNICATIONS FRANCE

Thi

s do

cum

ent i

s th

e pr

oper

ty o

f THA

LES

Com

mun

icatio

ns, i

ts c

onte

nt c

anno

t be

repr

oduc

ed, d

isclo

sed

or u

tilize

d wi

thou

t the

Com

pany

's wr

itten

app

rova

l

Propose Technical strategy, research innovation and advanced studies

Perform advanced and feasibility studies, demonstrators and SP modules in Thales Com. products

Maximise Efficiency/synergy within Thales Com.for SP R&D Maintain close links with French administration, SMEs, University

laboratories and European research actors Provide expertise and support for Thales Com. units in its field

Hire and Train young engineers in SP domain Disseminate new technologies and best practices within Thales Com.

Represent Thales Com. within Thales Common Efficiency Teams

Missions

Page 3: SIGNAL & IMAGE PROCESSING 1 THALES COMMUNICATIONS FRANCE This document is the property of THALES Communications, its content cannot be reproduced, disclosed

SIGNAL & IMAGE PROCESSING 3

THALES COMMUNICATIONS FRANCE

Thi

s do

cum

ent i

s th

e pr

oper

ty o

f THA

LES

Com

mun

icatio

ns, i

ts c

onte

nt c

anno

t be

repr

oduc

ed, d

isclo

sed

or u

tilize

d wi

thou

t the

Com

pany

's wr

itten

app

rova

l

Technical and Technological Challenges

High Data Rate Radio modem

SIP framework

Wireless Telecom

Electronic Warfare

Civilian Technologies

Antenna Processing

DSP use

generalised

Multimedia

and Internet

Signal & ImageProcessingEvolutions

Software Radio

Page 4: SIGNAL & IMAGE PROCESSING 1 THALES COMMUNICATIONS FRANCE This document is the property of THALES Communications, its content cannot be reproduced, disclosed

SIGNAL & IMAGE PROCESSING 4

THALES COMMUNICATIONS FRANCE

Thi

s do

cum

ent i

s th

e pr

oper

ty o

f THA

LES

Com

mun

icatio

ns, i

ts c

onte

nt c

anno

t be

repr

oduc

ed, d

isclo

sed

or u

tilize

d wi

thou

t the

Com

pany

's wr

itten

app

rova

l

Team

Multimedia : 11 engineers (4 experts)

Radiocommunications : 16 engineers (5 experts)

Sensor Processing : 26 engineers (5 experts)

Software Development : 16 engineers (2 experts)

2 technicians, 1 secretary, 8 thesis students

80perso

ns

Active participation to CNRS SP working groupmemberships in IEEE, SEE and EURASIP6 patents and 15 publications per year on average

Page 5: SIGNAL & IMAGE PROCESSING 1 THALES COMMUNICATIONS FRANCE This document is the property of THALES Communications, its content cannot be reproduced, disclosed

SIGNAL & IMAGE PROCESSING 5

THALES COMMUNICATIONS FRANCE

Thi

s do

cum

ent i

s th

e pr

oper

ty o

f THA

LES

Com

mun

icatio

ns, i

ts c

onte

nt c

anno

t be

repr

oduc

ed, d

isclo

sed

or u

tilize

d wi

thou

t the

Com

pany

's wr

itten

app

rova

l

Domains of expertiseLow and very low rate speech compressionWatermarkingJPEG 2000 & Video Codec

Antenna diversity & Jammer-interference rejectionHigh resolution direction findingArray optimisation on perturbing platformsSmart antennas and SDMA

Detection, numbering (energy, cyclic, high order stats., ...)Recognition/identification of modulation and coding schemes Blind demodulation and equalisation Localisation

Multimedia

AntennaProcessing

SignalAnalysis

Software radio Digital exciters and receivers, amplifier linearisation

Modem

VLF, HF, VUHF and satellite modemsSingle and multi-carrier modulationsSpread spectrum and CDMASource and channel coding optimisationSpectral efficiency optimisation

Page 6: SIGNAL & IMAGE PROCESSING 1 THALES COMMUNICATIONS FRANCE This document is the property of THALES Communications, its content cannot be reproduced, disclosed

SIGNAL & IMAGE PROCESSING 6

THALES COMMUNICATIONS FRANCE

Thi

s do

cum

ent i

s th

e pr

oper

ty o

f THA

LES

Com

mun

icatio

ns, i

ts c

onte

nt c

anno

t be

repr

oduc

ed, d

isclo

sed

or u

tilize

d wi

thou

t the

Com

pany

's wr

itten

app

rova

l

Speech processing

Compression

Low and very low bit rate compression research and development activity

LPC : 800 and 2400 bit/s

HSX : 1200, 2400 and 3200 bit/s

CELP 4.8 kbit/s and TETRA (4567 bit/s)

VLBR : 200 to 400 bit/s (combining recognition and synthesis)

Wide Band Low Bite rate speech Coder : 3200 bit/s

Knowledge/Implementation of higher bit rate coders, but no research activity

Vocal Activity Detector, echo cancellation.

Noise reduction : passive pre-processing or processing included in vocoder

System optimisation of channel and source coding

Best adaptation to service and system/propagation environment

Page 7: SIGNAL & IMAGE PROCESSING 1 THALES COMMUNICATIONS FRANCE This document is the property of THALES Communications, its content cannot be reproduced, disclosed

SIGNAL & IMAGE PROCESSING 7

THALES COMMUNICATIONS FRANCE

Thi

s do

cum

ent i

s th

e pr

oper

ty o

f THA

LES

Com

mun

icatio

ns, i

ts c

onte

nt c

anno

t be

repr

oduc

ed, d

isclo

sed

or u

tilize

d wi

thou

t the

Com

pany

's wr

itten

app

rova

l

Speech processing

1k 2k 4k 8k 16k 32k 64k

MOS

1

2

3

4

5 G711(72)

G726(88)

ST4209(83)

G 728(92)

FS 1016(90)

G 729(96)

ST 4479(93)

ST 4198(87)

ST 4591(02)

LPC 10(83)

GSM(87)

ST 4591(02)

G 723-1(96)

WBLBR

VLBRHSX

Page 8: SIGNAL & IMAGE PROCESSING 1 THALES COMMUNICATIONS FRANCE This document is the property of THALES Communications, its content cannot be reproduced, disclosed

SIGNAL & IMAGE PROCESSING 8

THALES COMMUNICATIONS FRANCE

Thi

s do

cum

ent i

s th

e pr

oper

ty o

f THA

LES

Com

mun

icatio

ns, i

ts c

onte

nt c

anno

t be

repr

oduc

ed, d

isclo

sed

or u

tilize

d wi

thou

t the

Com

pany

's wr

itten

app

rova

l

Speech processing

1

2

3

4

5

1980 1990 2000

IndicativeQuality

G.711(64 kb/s) G.721

(32 kb/s) G.729(8 kb/s)

G.728(16 kb/s)

LPC 10(2,4 kb/s)

HSX(2,4 kb/s)

Consumer quality

Minimum qual. for high cost application

Minimum qual. For low cost application

1970

G.723(5.3 kb/s)

Page 9: SIGNAL & IMAGE PROCESSING 1 THALES COMMUNICATIONS FRANCE This document is the property of THALES Communications, its content cannot be reproduced, disclosed

SIGNAL & IMAGE PROCESSING 9

THALES COMMUNICATIONS FRANCE

Thi

s do

cum

ent i

s th

e pr

oper

ty o

f THA

LES

Com

mun

icatio

ns, i

ts c

onte

nt c

anno

t be

repr

oduc

ed, d

isclo

sed

or u

tilize

d wi

thou

t the

Com

pany

's wr

itten

app

rova

l

THC Major achievements

Standards

THC coders chosen for STANAG 4479 (800 bit/s) in 1994

ETSI TETRA (4567 bit/s) for PMR (licence to Motorola, Nokia, Philips/Simoco,..)

Present participation at NATO for new low bit rate coder STANAG 4591 (1200 and 2400 bit/s, associated noise reduction)

Products

LPC10e implementation within Spartacus, Syracuse, HF processor

Vocoder ASIC for the PR4G (LPC 800, LPC10e 2400, ACELP 4800)

Vocoders (SW) for the PR4G/VS4 (LPC 800, LPC10e 2400, ACELP 4800)

HSX in Sawari, Synthesis in a consumer pager (Info-realité) and analysis in PC, OKI (Asic), Leo (Singapore).

Tetra coders in base-station for ISR

G723.1 and G726 in ATM switch

Page 10: SIGNAL & IMAGE PROCESSING 1 THALES COMMUNICATIONS FRANCE This document is the property of THALES Communications, its content cannot be reproduced, disclosed

SIGNAL & IMAGE PROCESSING 10

THALES COMMUNICATIONS FRANCE

Thi

s do

cum

ent i

s th

e pr

oper

ty o

f THA

LES

Com

mun

icatio

ns, i

ts c

onte

nt c

anno

t be

repr

oduc

ed, d

isclo

sed

or u

tilize

d wi

thou

t the

Com

pany

's wr

itten

app

rova

l

Existing vocoders at THC

Vocoder

STANAG 4479, 800 b/s

STANAG 4198, 2400 b/s LPC

HSX 2400 b/s

HSX 1200 b/s

ACELP, 4800 b/s

TETRA, 4567 b/s

ITU G723-1, 6.4/5.3 kb/s

ITU G726, 16,24,32,40 kb/s

ITU G728, LD CELP 16kb/s

ITU G729, CS ACELP 8kb/s

GSM

STANAG 4591 (2400/1200 b/s)

Simulation

For/C/FixC

For/C/FixC

C/FixC

C/FixC

For/C/FixC

FixC

C/FixC

C

C/FixC

FixC

C

C/FixC

C25C50

x

x

x

x

ASIC

x

x

x

C30C40

x

Product

PR4G, PHF, Sawari,info Tel

PR4G, PHF,Spartacus, Syr. II

Aztec, Sawari, OKI

InfoTelecom, OKI

PR4G, PHF, Spartacus

Rameau, ISR

ATM switch

ATM switch

ATM switch

C62

x

x

x

TRPC

x

x

x

x

x

x

x

x

x

C54x

x(*)

x(*)

x

xS

x(*)

x

sharc

x

x

Page 11: SIGNAL & IMAGE PROCESSING 1 THALES COMMUNICATIONS FRANCE This document is the property of THALES Communications, its content cannot be reproduced, disclosed

SIGNAL & IMAGE PROCESSING 11

THALES COMMUNICATIONS FRANCE

Thi

s do

cum

ent i

s th

e pr

oper

ty o

f THA

LES

Com

mun

icatio

ns, i

ts c

onte

nt c

anno

t be

repr

oduc

ed, d

isclo

sed

or u

tilize

d wi

thou

t the

Com

pany

's wr

itten

app

rova

l

Cooperations

Sherbrooke University (Canada) ACELP specialists

University of Rennes (noise reduction) hand-free telephone

ENST Paris & ESIEE Very Low Bit Rate Speech Coding (combining recognition and

synthesis). Wide Band Low Bite rate speech Coder.

Fraunhofer institute MPEG II layer 3, MPEG 4 audio coders

Page 12: SIGNAL & IMAGE PROCESSING 1 THALES COMMUNICATIONS FRANCE This document is the property of THALES Communications, its content cannot be reproduced, disclosed

SIGNAL & IMAGE PROCESSING 12

THALES COMMUNICATIONS FRANCE

Thi

s do

cum

ent i

s th

e pr

oper

ty o

f THA

LES

Com

mun

icatio

ns, i

ts c

onte

nt c

anno

t be

repr

oduc

ed, d

isclo

sed

or u

tilize

d wi

thou

t the

Com

pany

's wr

itten

app

rova

l

VLBR speech Codec

Thanks to the developed speech encoding solution, the system will be used on Very-Low-Bit-Rate channel, lower than 400 bits/s.

This technology could be also used to: speech recognition, speaker/language identification,

VLBR speech Codec

Page 13: SIGNAL & IMAGE PROCESSING 1 THALES COMMUNICATIONS FRANCE This document is the property of THALES Communications, its content cannot be reproduced, disclosed

SIGNAL & IMAGE PROCESSING 13

THALES COMMUNICATIONS FRANCE

Thi

s do

cum

ent i

s th

e pr

oper

ty o

f THA

LES

Com

mun

icatio

ns, i

ts c

onte

nt c

anno

t be

repr

oduc

ed, d

isclo

sed

or u

tilize

d wi

thou

t the

Com

pany

's wr

itten

app

rova

l

VLBR speech Codec

Very Low Bit Rate speech coding by indexing natural speech units of variable size

Solution based on a new concept making use of various speech processing technologies Temporal Decomposition (TD) for robust segmentation of

speech HMM modelling for determination of speech units Harmonic/Stochastic modelling for speech re-synthesis by

concatenating identified speech units Jan Cernocky, PhD Thesis (Orsay) 1998

Page 14: SIGNAL & IMAGE PROCESSING 1 THALES COMMUNICATIONS FRANCE This document is the property of THALES Communications, its content cannot be reproduced, disclosed

SIGNAL & IMAGE PROCESSING 14

THALES COMMUNICATIONS FRANCE

Thi

s do

cum

ent i

s th

e pr

oper

ty o

f THA

LES

Com

mun

icatio

ns, i

ts c

onte

nt c

anno

t be

repr

oduc

ed, d

isclo

sed

or u

tilize

d wi

thou

t the

Com

pany

's wr

itten

app

rova

l

VLBR speech CodecVLBR Encoder

Spectral Analysis

Prosody Analysis

HMM-based

Recognition Determination of optimal synthesis

units (DTW)

Codebook HMM models

HMM index Index of synthesis unitPitch and Energy Profiles

Prosody Encoding

Input speech signal

Codebook

synthesis units

CODER

Page 15: SIGNAL & IMAGE PROCESSING 1 THALES COMMUNICATIONS FRANCE This document is the property of THALES Communications, its content cannot be reproduced, disclosed

SIGNAL & IMAGE PROCESSING 15

THALES COMMUNICATIONS FRANCE

Thi

s do

cum

ent i

s th

e pr

oper

ty o

f THA

LES

Com

mun

icatio

ns, i

ts c

onte

nt c

anno

t be

repr

oduc

ed, d

isclo

sed

or u

tilize

d wi

thou

t the

Com

pany

's wr

itten

app

rova

l

VLBR speech Codec

HNM Synthesis

Extraction of synthesis units

HMM index

Index of synthesis unitPitch and Energy Profiles

Prosody Decoding

Codebook

synthesis units

Output synthesised speech signal

DECODER

VLBR Decoder

Page 16: SIGNAL & IMAGE PROCESSING 1 THALES COMMUNICATIONS FRANCE This document is the property of THALES Communications, its content cannot be reproduced, disclosed

SIGNAL & IMAGE PROCESSING 16

THALES COMMUNICATIONS FRANCE

Thi

s do

cum

ent i

s th

e pr

oper

ty o

f THA

LES

Com

mun

icatio

ns, i

ts c

onte

nt c

anno

t be

repr

oduc

ed, d

isclo

sed

or u

tilize

d wi

thou

t the

Com

pany

's wr

itten

app

rova

l

WLBR speech Codec

WLBR speech Codec algorithms

Parametric Wide Band speech coder (from 50Hz to 7000Hz).

Bit-rate: below 4 kbits (3200 bit/s & 3600 bit/s) Wide Band speech pre-processing

– Noise Reduction, spectral compression, temporal speed modification

Voice activity detection.

Page 17: SIGNAL & IMAGE PROCESSING 1 THALES COMMUNICATIONS FRANCE This document is the property of THALES Communications, its content cannot be reproduced, disclosed

SIGNAL & IMAGE PROCESSING 17

THALES COMMUNICATIONS FRANCE

Thi

s do

cum

ent i

s th

e pr

oper

ty o

f THA

LES

Com

mun

icatio

ns, i

ts c

onte

nt c

anno

t be

repr

oduc

ed, d

isclo

sed

or u

tilize

d wi

thou

t the

Com

pany

's wr

itten

app

rova

l

WLBR speech Codec

Intérêt pour applications professionnelles: offrir un plus produit (il n’existe pas encore de codeur de ce

type actuellement), la cible visée étant très intéressée par ce genre d ’amélioration.

Le débit reste compatible des réseaux HF/VUHF « Simple » évolution du codeur HSX (implémentation

maîtrisée, C fixe disponible) Intérêt pour applications civiles:

La seule norme civile existante en WB (AMR WB) offre un débit supérieur à 10 kbit/s. Les utilisateurs vont demander de plus en plus une qualité WB.

Notre offre produit: codeur propriétaire WB à très bas débit, marché potentiel: portail web, enregistreur Numérique, PDA, radio numérique.

Page 18: SIGNAL & IMAGE PROCESSING 1 THALES COMMUNICATIONS FRANCE This document is the property of THALES Communications, its content cannot be reproduced, disclosed

SIGNAL & IMAGE PROCESSING 18

THALES COMMUNICATIONS FRANCE

Thi

s do

cum

ent i

s th

e pr

oper

ty o

f THA

LES

Com

mun

icatio

ns, i

ts c

onte

nt c

anno

t be

repr

oduc

ed, d

isclo

sed

or u

tilize

d wi

thou

t the

Com

pany

's wr

itten

app

rova

l

WLBR speech Codec

Codage large bande (0-7kHz) Amélioration de la qualité perçue Aide à la discrimination des fricatives Rehaussement de l’intelligibilité

Extension pleine bande (full band)

Modèle paramétrique sur toute la bande (AR ordre 16)

Choix algorithmiques Longueur de trame : 360 éch.

Voisement sur 0-4kHz

4 fréquences de coupure

Bande haute non voisée

0 7kHzfc

Ordre 16