1 speech processing. 2 speech processing: review of dsp concepts review of probability and...

16
1 Speech Processing

Upload: sabrina-tucker

Post on 18-Jan-2016

229 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: 1 Speech Processing. 2 Speech Processing:  Review of DSP Concepts  Review of Probability and Stochastic Processes  Anatomy and Physiology of Speech

1

Speech Processing

Page 2: 1 Speech Processing. 2 Speech Processing:  Review of DSP Concepts  Review of Probability and Stochastic Processes  Anatomy and Physiology of Speech

2

Speech Processing: Review of DSP Concepts Review of Probability and Stochastic Processes Anatomy and Physiology of Speech Production

System Phonemics and Phonetics Spectrogram Reading Linear Prediction Analysis Speech Coding and Compression Speech Synthesis (Text to Speech) Speech Quality Assessment (Subjective and

Objective) Speech Recognition (Speech to Text) Speech Enhancement

Page 3: 1 Speech Processing. 2 Speech Processing:  Review of DSP Concepts  Review of Probability and Stochastic Processes  Anatomy and Physiology of Speech

3

Speech Processing: Marking Scheme:

Homeworks: 10%Projects : 15%Quizzes: 20%Midterm: 20%Final Exam: 40%----------------------Total: 105%

Page 4: 1 Speech Processing. 2 Speech Processing:  Review of DSP Concepts  Review of Probability and Stochastic Processes  Anatomy and Physiology of Speech

4

Speech Processing: Text:

Spoken language processing Huang, Acero, Hon, Prentice Hall, 2000

Discrete time processing of speech Signals Deller,Proakis,Hansen,1993

Fundamentals of speech recognition Rabiner,Juang,1993

Advances in speech signal processing Furui,Sondhi,1991

Page 5: 1 Speech Processing. 2 Speech Processing:  Review of DSP Concepts  Review of Probability and Stochastic Processes  Anatomy and Physiology of Speech

:ارسطو. است ناطق حيوان انسان،

5

Page 6: 1 Speech Processing. 2 Speech Processing:  Review of DSP Concepts  Review of Probability and Stochastic Processes  Anatomy and Physiology of Speech

Old Speech Synthesizers

– Speech organ of Wheatstone, based on a system proposed by Wolfgang

von Kempelen in 1791

Page 7: 1 Speech Processing. 2 Speech Processing:  Review of DSP Concepts  Review of Probability and Stochastic Processes  Anatomy and Physiology of Speech

Old Speech Synthesizers (cont’d)

– Speech organ of Joseph Faber (1830-40)

Page 8: 1 Speech Processing. 2 Speech Processing:  Review of DSP Concepts  Review of Probability and Stochastic Processes  Anatomy and Physiology of Speech

Old Speech Synthesizers (cont’d)– Voder demonstrated in 1939

Source: http://www.ling.su.se/staff/hartmut/kemplne.htm

Page 9: 1 Speech Processing. 2 Speech Processing:  Review of DSP Concepts  Review of Probability and Stochastic Processes  Anatomy and Physiology of Speech

More modern labs (ICP lab in Grenoble, France)

– Study of the face movements to be included in speech synthesis (and recognition).

Page 10: 1 Speech Processing. 2 Speech Processing:  Review of DSP Concepts  Review of Probability and Stochastic Processes  Anatomy and Physiology of Speech

Communication via Spoken Language

Page 11: 1 Speech Processing. 2 Speech Processing:  Review of DSP Concepts  Review of Probability and Stochastic Processes  Anatomy and Physiology of Speech

Communication via Spoken Language

Page 12: 1 Speech Processing. 2 Speech Processing:  Review of DSP Concepts  Review of Probability and Stochastic Processes  Anatomy and Physiology of Speech

Virtues of Spoken Language

Natural: Requires no special training

Flexible: Leaves hands and eyes free

Efficient: Has high data rate

Economical: Communicated inexpensively

Expressive: Conveys more than just words

Popular/preferred: Verbal-acoustic problem solving

Much longer evolution, compared to written language

Page 13: 1 Speech Processing. 2 Speech Processing:  Review of DSP Concepts  Review of Probability and Stochastic Processes  Anatomy and Physiology of Speech

Virtues of Spoken Language

Speech interfaces are ideal for information access and management when:

The information space is broad and complex, The users are not allowed (or at ease or capable) to use

their eyes to read text messages, The users are technically naive, or Only telephones are available.

Page 14: 1 Speech Processing. 2 Speech Processing:  Review of DSP Concepts  Review of Probability and Stochastic Processes  Anatomy and Physiology of Speech

Diverse Sources of Constraint forSpoken Language Communication

Acoustic: human vocal tractPhonetic: let us pray

lettuce sprayPhonological: gas shortage

fish sandwichPhonotactic: sprachst (german)Syntactic: I am flying to Chicago tomorrow

tomorrow I flying Chicago am toSemantic: Is the baby crying

Is the bay bee cryingContextual: It is easy to recognize speech

It is easy to wreck a nice beach

Page 15: 1 Speech Processing. 2 Speech Processing:  Review of DSP Concepts  Review of Probability and Stochastic Processes  Anatomy and Physiology of Speech

A Conversational System Architecture

Page 16: 1 Speech Processing. 2 Speech Processing:  Review of DSP Concepts  Review of Probability and Stochastic Processes  Anatomy and Physiology of Speech

Demo: Conversational Interface Jupiter weather information system

Access through telephone 500 cities worldwide Harvest weather information from the Web

several times daily