guided by t.jayasankar, asst.professor of ece, anna university of tiruchirappalli. presented by...
TRANSCRIPT
![Page 1: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI](https://reader036.vdocuments.mx/reader036/viewer/2022062421/56649f425503460f94c61dce/html5/thumbnails/1.jpg)
TAMIL WORDS SPEECH SYNTHESIS IN COCHLEAR IMPLANT USING ACOUSTIC MODEL
GUIDED BY
T.JAYASANKAR,
ASST.PROFESSOR OF ECE,
ANNA UNIVERSITY OF TIRUCHIRAPPALLI.
PRESENTED BYC.SENTHILKUMAR,
REG.NO:810011992018,M.E(MBCBS),COM SYSTEM,VI MODULE.
![Page 2: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI](https://reader036.vdocuments.mx/reader036/viewer/2022062421/56649f425503460f94c61dce/html5/thumbnails/2.jpg)
OBJECTIVE
A cochlear implant (CI) is a surgically implanted electronic device that provides a sense of sound to a person who is profoundly deaf or severely hard of hearing.
The main objective of this work is to develop the system that reproduces the incoming sound/speech signals as naturally as possible
![Page 3: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI](https://reader036.vdocuments.mx/reader036/viewer/2022062421/56649f425503460f94c61dce/html5/thumbnails/3.jpg)
LITERATURE SURVEY S.NO TITLE AUTHORS YEAR & PUBLICATION CONCEPT
1 Estimation of Vowel Recognition WithCochlear Implant Simulations
Chuping Liu and Qian-Jie Fu IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, VOL. 54, NO. 1, JANUARY 2007
In this paper, Mel-frequency cepstrum coefficients(MFCCs) were used to estimate the acoustic vowel space forvowel stimuli processed by the CI simulations.
2 Improving Speech Intelligibility in Cochlear Implants usingAcoustic Models
P. VIJAYALAKSHMI, T. NAGARAJAN and PREETHI MAHADEVAN
ISSN: 1790-5052 Issue 4, Volume 7, October 2011
In this paper to improve the perceptualquality of the speech generated by a CI model, system specific parameters are analyzed by developinguniform bandwidth filterbank-based acoustic CI models
3 MIMICKING THE HUMAN EAR Philipos C.Loizon IEEE SIGNAL PROCESSING MAGAZINE1053-5888/98/$10.000 1998IEEE
An overview of Signal- Processing Strategies for converting sound into Electrical sgnals in cochlear implants
![Page 4: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI](https://reader036.vdocuments.mx/reader036/viewer/2022062421/56649f425503460f94c61dce/html5/thumbnails/4.jpg)
SYSTEM DESIGN
Speech Data Collection
Tamil words are recorded from a male speaker at a sampling frequency of 16 kHz with a head mounted carbon microphone of frequency range 20 Hz – 20 kHz using s PRAAT tool
![Page 5: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI](https://reader036.vdocuments.mx/reader036/viewer/2022062421/56649f425503460f94c61dce/html5/thumbnails/5.jpg)
CHANNEL VOCODER BASED ACOUSTIC MODEL
General block Diagram
ANALYZER SYNTHESIZER
ACOUSTIC MODEL
INPUTSPEECH
SYNTHETICSPEECH
![Page 6: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI](https://reader036.vdocuments.mx/reader036/viewer/2022062421/56649f425503460f94c61dce/html5/thumbnails/6.jpg)
CHANNEL VOCODER
Uniform bandwidth filter bank method Critical bandwidth filter bank method
![Page 7: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI](https://reader036.vdocuments.mx/reader036/viewer/2022062421/56649f425503460f94c61dce/html5/thumbnails/7.jpg)
CHANNEL VOCODER ANALYZER(UNIFORM BANDWIDTH)
![Page 8: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI](https://reader036.vdocuments.mx/reader036/viewer/2022062421/56649f425503460f94c61dce/html5/thumbnails/8.jpg)
CONT…
Acoustic model parameters Sampling Frequency : 16000Hz Frequency Range : 0-8200Hz Filter Type : IIR – Chebyshev type-
2 No. of Channels : 21 (1 LPF +20 BPF) Bandwidth : 400Hz Order of filter : 5
![Page 9: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI](https://reader036.vdocuments.mx/reader036/viewer/2022062421/56649f425503460f94c61dce/html5/thumbnails/9.jpg)
CONT..
Filter order Mean squared difference
1 0.00332 0.00323 0.00314 0,00315 0.00306 1.4861e+2387 4.3540e+3008 1.6547e+2999 2.7285e+296
10 3.3515e+299
![Page 10: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI](https://reader036.vdocuments.mx/reader036/viewer/2022062421/56649f425503460f94c61dce/html5/thumbnails/10.jpg)
CHANNEL VOCODER SYNTHESIZER(UNIFORM BANDWIDTH)
![Page 11: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI](https://reader036.vdocuments.mx/reader036/viewer/2022062421/56649f425503460f94c61dce/html5/thumbnails/11.jpg)
WAVEFORM OF THE TAMIL WORD /அம்மா�/
![Page 12: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI](https://reader036.vdocuments.mx/reader036/viewer/2022062421/56649f425503460f94c61dce/html5/thumbnails/12.jpg)
FILTERED SIGNAL & ITS ENVELOPE
![Page 13: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI](https://reader036.vdocuments.mx/reader036/viewer/2022062421/56649f425503460f94c61dce/html5/thumbnails/13.jpg)
TRAIN OF IMPULSE
Pitch period =0.0063sec
![Page 14: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI](https://reader036.vdocuments.mx/reader036/viewer/2022062421/56649f425503460f94c61dce/html5/thumbnails/14.jpg)
MODULATED AND SYNTHESIZED FILTER OUTPUT
![Page 15: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI](https://reader036.vdocuments.mx/reader036/viewer/2022062421/56649f425503460f94c61dce/html5/thumbnails/15.jpg)
ORIGINAL & SYNTHESIZED SPEECH SIGNAL
![Page 16: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI](https://reader036.vdocuments.mx/reader036/viewer/2022062421/56649f425503460f94c61dce/html5/thumbnails/16.jpg)
CRITICAL BANDWIDTH FILTER BANK BASED ACOUSTIC CI MODEL
Critical band is the smallest band of frequencies that activate the same part of basilar membrane and human ear can able to discriminate two tones that differ in critical bands.
![Page 17: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI](https://reader036.vdocuments.mx/reader036/viewer/2022062421/56649f425503460f94c61dce/html5/thumbnails/17.jpg)
DESIGN OF CI MODEL BASED ON CRITICAL BANDS
Filter bank is designed based on critical bands of the human auditory system.
The critical band of each auditory band-pass filter is computed using equivalent rectangular bandwidth (ERB).
If the center frequencies (fc) of filters are known, then the corresponding ERBs are calculated using the following formula,
ERB=24.7((0.00437*fc) +1) (1)
![Page 18: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI](https://reader036.vdocuments.mx/reader036/viewer/2022062421/56649f425503460f94c61dce/html5/thumbnails/18.jpg)
INPUT WORD/அம்மா�/
![Page 19: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI](https://reader036.vdocuments.mx/reader036/viewer/2022062421/56649f425503460f94c61dce/html5/thumbnails/19.jpg)
WAVEFORM OF INPUT AND SYNTHESIZED SPEECH FOR THE TAMIL WORD /அம்மா�/
![Page 20: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI](https://reader036.vdocuments.mx/reader036/viewer/2022062421/56649f425503460f94c61dce/html5/thumbnails/20.jpg)
MEAN SQUARE DIFFERENCE BETWEEN UNIFORM BANDWIDTH FILTER-BASED CI MODEL AND AUDITORY
CI MODEL
Uniform Bandwidh Model Critical Bandwidth Model0
0.0005
0.001
0.0015
0.002
0.0025
0.003
0.0035
0.004
0.0045
0.005
Mean square Difference
அம்மா�
அன்பு
மாலர்
மாரம்
முகம்
நி றம்
பணம்
![Page 21: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI](https://reader036.vdocuments.mx/reader036/viewer/2022062421/56649f425503460f94c61dce/html5/thumbnails/21.jpg)
MEAN OPINION SCORE(MOS) FOR UBW & CBW SYSTEM
UBW CBW4.054.1
4.154.2
4.254.3
4.354.4
4.454.5
4.55
MEAN OPINION SCORE
MEAN OPINION SCORE
![Page 22: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI](https://reader036.vdocuments.mx/reader036/viewer/2022062421/56649f425503460f94c61dce/html5/thumbnails/22.jpg)
CONCLUSION
The Critical band CI model is performed well when compared with the Uniform bandwidth filter bank method based on the mean square difference & Mean opinion score.
![Page 23: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI](https://reader036.vdocuments.mx/reader036/viewer/2022062421/56649f425503460f94c61dce/html5/thumbnails/23.jpg)
REFERENCES
P. Vijayalakshmi , T. Nagarajan and Preethi Mahadevan,(2011), “ Improving Speech Intelligibility in Cochlear Implants using Acoustic Models’’, WSEAS TRANSACTIONS on SIGNAL PROCESSING, Issue 4, Volume 7, October 2011, pp. 131 – 144.
Gladston, A.R.; Vijayalakshmi, P.; Thangavelu, N., "Improving speech intelligibility in cochlear implants using vocoder-centric acoustic models," Recent Trends In Information Technology (ICRTIT), 2012 International Conference on , vol., no., pp.66,71, 19-21 April 2012.
D. K. Eddington, W. M. Rabinowitz, and L.Dellzome, “Sound Processing for Cochlear Implants”, in Proceedings of International IEEE EMBC, 2001, pp. 3449- 3452.
B. Gold and N. Morgan, “Speech and audio signal processing - processing and perception of speech and music”. John Wiley and Sons. Inc., 2000.
P. C. Loizou, “Speech processing in vocoder-centric cochlear implants” Cochlear and Brainstem Implants. Adv Otorhinolaryngol. Basel, Karger, vol 64, pp 109–143, 2006.
P.C. Loizou, ”Mimicking the human ear” IEEE Signal Processing magazine, vol. 15, no. 5, Sep. 1998, pp. 101-130
![Page 24: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI](https://reader036.vdocuments.mx/reader036/viewer/2022062421/56649f425503460f94c61dce/html5/thumbnails/24.jpg)
Thank You