1 speech processing. 2 speech processing: review of dsp concepts review of probability and...
TRANSCRIPT
1
Speech Processing
2
Speech Processing: Review of DSP Concepts Review of Probability and Stochastic Processes Anatomy and Physiology of Speech Production
System Phonemics and Phonetics Spectrogram Reading Linear Prediction Analysis Speech Coding and Compression Speech Synthesis (Text to Speech) Speech Quality Assessment (Subjective and
Objective) Speech Recognition (Speech to Text) Speech Enhancement
3
Speech Processing: Marking Scheme:
Homeworks: 10%Projects : 15%Quizzes: 20%Midterm: 20%Final Exam: 40%----------------------Total: 105%
4
Speech Processing: Text:
Spoken language processing Huang, Acero, Hon, Prentice Hall, 2000
Discrete time processing of speech Signals Deller,Proakis,Hansen,1993
Fundamentals of speech recognition Rabiner,Juang,1993
Advances in speech signal processing Furui,Sondhi,1991
:ارسطو. است ناطق حيوان انسان،
5
Old Speech Synthesizers
– Speech organ of Wheatstone, based on a system proposed by Wolfgang
von Kempelen in 1791
Old Speech Synthesizers (cont’d)
– Speech organ of Joseph Faber (1830-40)
Old Speech Synthesizers (cont’d)– Voder demonstrated in 1939
Source: http://www.ling.su.se/staff/hartmut/kemplne.htm
More modern labs (ICP lab in Grenoble, France)
– Study of the face movements to be included in speech synthesis (and recognition).
Communication via Spoken Language
Communication via Spoken Language
Virtues of Spoken Language
Natural: Requires no special training
Flexible: Leaves hands and eyes free
Efficient: Has high data rate
Economical: Communicated inexpensively
Expressive: Conveys more than just words
Popular/preferred: Verbal-acoustic problem solving
Much longer evolution, compared to written language
Virtues of Spoken Language
Speech interfaces are ideal for information access and management when:
The information space is broad and complex, The users are not allowed (or at ease or capable) to use
their eyes to read text messages, The users are technically naive, or Only telephones are available.
Diverse Sources of Constraint forSpoken Language Communication
Acoustic: human vocal tractPhonetic: let us pray
lettuce sprayPhonological: gas shortage
fish sandwichPhonotactic: sprachst (german)Syntactic: I am flying to Chicago tomorrow
tomorrow I flying Chicago am toSemantic: Is the baby crying
Is the bay bee cryingContextual: It is easy to recognize speech
It is easy to wreck a nice beach
A Conversational System Architecture
Demo: Conversational Interface Jupiter weather information system
Access through telephone 500 cities worldwide Harvest weather information from the Web
several times daily