+ All Categories
Home > Documents > GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY...

GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY...

Date post: 14-Dec-2015
Category:
Upload: beatrix-morton
View: 221 times
Download: 3 times
Share this document with a friend
24
TAMIL WORDS SPEECH SYNTHESIS IN COCHLEAR IMPLANT USING ACOUSTIC MODEL GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI MODULE.
Transcript
Page 1: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI.

TAMIL WORDS SPEECH SYNTHESIS IN COCHLEAR IMPLANT USING ACOUSTIC MODEL

GUIDED BY

T.JAYASANKAR,

ASST.PROFESSOR OF ECE,

ANNA UNIVERSITY OF TIRUCHIRAPPALLI.

PRESENTED BYC.SENTHILKUMAR,

REG.NO:810011992018,M.E(MBCBS),COM SYSTEM,VI MODULE.

Page 2: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI.

OBJECTIVE

A cochlear implant (CI) is a surgically implanted electronic device that provides a sense of sound to a person who is profoundly deaf or severely hard of hearing.

The main objective of this work is to develop the system that reproduces the incoming sound/speech signals as naturally as possible

Page 3: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI.

LITERATURE SURVEY S.NO TITLE AUTHORS YEAR & PUBLICATION CONCEPT

1 Estimation of Vowel Recognition WithCochlear Implant Simulations

Chuping Liu and Qian-Jie Fu IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, VOL. 54, NO. 1, JANUARY 2007

In this paper, Mel-frequency cepstrum coefficients(MFCCs) were used to estimate the acoustic vowel space forvowel stimuli processed by the CI simulations.

2 Improving Speech Intelligibility in Cochlear Implants usingAcoustic Models

P. VIJAYALAKSHMI, T. NAGARAJAN and PREETHI MAHADEVAN

ISSN: 1790-5052 Issue 4, Volume 7, October 2011

In this paper to improve the perceptualquality of the speech generated by a CI model, system specific parameters are analyzed by developinguniform bandwidth filterbank-based acoustic CI models

3 MIMICKING THE HUMAN EAR Philipos C.Loizon IEEE SIGNAL PROCESSING MAGAZINE1053-5888/98/$10.000 1998IEEE

An overview of Signal- Processing Strategies for converting sound into Electrical sgnals in cochlear implants

Page 4: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI.

SYSTEM DESIGN

Speech Data Collection

Tamil words are recorded from a male speaker at a sampling frequency of 16 kHz with a head mounted carbon microphone of frequency range 20 Hz – 20 kHz using s PRAAT tool

Page 5: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI.

CHANNEL VOCODER BASED ACOUSTIC MODEL

General block Diagram

ANALYZER SYNTHESIZER

ACOUSTIC MODEL

INPUTSPEECH

SYNTHETICSPEECH

Page 6: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI.

CHANNEL VOCODER

Uniform bandwidth filter bank method Critical bandwidth filter bank method

Page 7: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI.

CHANNEL VOCODER ANALYZER(UNIFORM BANDWIDTH)

Page 8: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI.

CONT…

Acoustic model parameters Sampling Frequency : 16000Hz Frequency Range : 0-8200Hz Filter Type : IIR – Chebyshev type-

2 No. of Channels : 21 (1 LPF +20 BPF) Bandwidth : 400Hz Order of filter : 5

Page 9: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI.

CONT..

Filter order Mean squared difference

1 0.00332 0.00323 0.00314 0,00315 0.00306 1.4861e+2387 4.3540e+3008 1.6547e+2999 2.7285e+296

10 3.3515e+299

Page 10: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI.

CHANNEL VOCODER SYNTHESIZER(UNIFORM BANDWIDTH)

Page 11: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI.

WAVEFORM OF THE TAMIL WORD /அம்மா�/

Page 12: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI.

FILTERED SIGNAL & ITS ENVELOPE

Page 13: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI.

TRAIN OF IMPULSE

Pitch period =0.0063sec

Page 14: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI.

MODULATED AND SYNTHESIZED FILTER OUTPUT

Page 15: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI.

ORIGINAL & SYNTHESIZED SPEECH SIGNAL

Page 16: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI.

CRITICAL BANDWIDTH FILTER BANK BASED ACOUSTIC CI MODEL

Critical band is the smallest band of frequencies that activate the same part of basilar membrane and human ear can able to discriminate two tones that differ in critical bands.

Page 17: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI.

DESIGN OF CI MODEL BASED ON CRITICAL BANDS

Filter bank is designed based on critical bands of the human auditory system.

The critical band of each auditory band-pass filter is computed using equivalent rectangular bandwidth (ERB).

If the center frequencies (fc) of filters are known, then the corresponding ERBs are calculated using the following formula,

ERB=24.7((0.00437*fc) +1) (1)

Page 18: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI.

INPUT WORD/அம்மா�/

Page 19: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI.

WAVEFORM OF INPUT AND SYNTHESIZED SPEECH FOR THE TAMIL WORD /அம்மா�/

Page 20: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI.

MEAN SQUARE DIFFERENCE BETWEEN UNIFORM BANDWIDTH FILTER-BASED CI MODEL AND AUDITORY

CI MODEL

Uniform Bandwidh Model Critical Bandwidth Model0

0.0005

0.001

0.0015

0.002

0.0025

0.003

0.0035

0.004

0.0045

0.005

Mean square Difference

அம்மா�

அன்பு

மாலர்

மாரம்

முகம்

நி றம்

பணம்

Page 21: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI.

MEAN OPINION SCORE(MOS) FOR UBW & CBW SYSTEM

UBW CBW4.054.1

4.154.2

4.254.3

4.354.4

4.454.5

4.55

MEAN OPINION SCORE

MEAN OPINION SCORE

Page 22: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI.

CONCLUSION

The Critical band CI model is performed well when compared with the Uniform bandwidth filter bank method based on the mean square difference & Mean opinion score.

Page 23: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI.

REFERENCES

P. Vijayalakshmi , T. Nagarajan and Preethi Mahadevan,(2011), “ Improving Speech Intelligibility in Cochlear Implants using Acoustic Models’’, WSEAS TRANSACTIONS on SIGNAL PROCESSING, Issue 4, Volume 7, October 2011, pp. 131 – 144.

Gladston, A.R.; Vijayalakshmi, P.; Thangavelu, N., "Improving speech intelligibility in cochlear implants using vocoder-centric acoustic models," Recent Trends In Information Technology (ICRTIT), 2012 International Conference on , vol., no., pp.66,71, 19-21 April 2012.

D. K. Eddington, W. M. Rabinowitz, and L.Dellzome, “Sound Processing for Cochlear Implants”, in Proceedings of International IEEE EMBC, 2001, pp. 3449- 3452.

B. Gold and N. Morgan, “Speech and audio signal processing - processing and perception of speech and music”. John Wiley and Sons. Inc., 2000.

P. C. Loizou, “Speech processing in vocoder-centric cochlear implants” Cochlear and Brainstem Implants. Adv Otorhinolaryngol. Basel, Karger, vol 64, pp 109–143, 2006.

P.C. Loizou, ”Mimicking the human ear” IEEE Signal Processing magazine, vol. 15, no. 5, Sep. 1998, pp. 101-130

Page 24: GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO:810011992018, M.E(MBCBS),COM SYSTEM,VI.

Thank You


Recommended