Date post: | 08-Apr-2018 |
Category: |
Documents |
Upload: | vigneshshenoy2 |
View: | 217 times |
Download: | 0 times |
8/7/2019 seminar ppt on ineracting with computers
http://slidepdf.com/reader/full/seminar-ppt-on-ineracting-with-computers 1/21
Krishnananda Prabhu
8/7/2019 seminar ppt on ineracting with computers
http://slidepdf.com/reader/full/seminar-ppt-on-ineracting-with-computers 2/21
Objective:yPractical Application of
Interacting with thecomputers using Voice
8/7/2019 seminar ppt on ineracting with computers
http://slidepdf.com/reader/full/seminar-ppt-on-ineracting-with-computers 3/21
Key terms:yA utomatic Speech Recognition
Technique(A SR):This is the Technique
used to convert Speech to Text
yText To Speech ConversionTechnique(TTS): This is the Techniqueused to convert Text to Speech
8/7/2019 seminar ppt on ineracting with computers
http://slidepdf.com/reader/full/seminar-ppt-on-ineracting-with-computers 4/21
[Contd]Hidden Markov Model:
oModern general-purpose speechrecognition systems are based on
HiddenMarkov Models. These are
statistical models which output asequence of symbols or quantities.
o Used in Real time Applications
8/7/2019 seminar ppt on ineracting with computers
http://slidepdf.com/reader/full/seminar-ppt-on-ineracting-with-computers 5/21
Introduction:y Transfer of information between
human and machine is normally
accomplished via ones senses.
yTo communicate with our
environment, we send out signals orinformation visually, auditorily, andthrough gestures
8/7/2019 seminar ppt on ineracting with computers
http://slidepdf.com/reader/full/seminar-ppt-on-ineracting-with-computers 6/21
[contd]yHumancomputer interactions often use
a mouse and keyboard as machineinput, and a computer screen or printeras output.
y One can read text and understand imagesmuch more quickly on a two- dimensional(2-D) computer screen than when listeningto a [one-dimensional (1-D)] speech signal.
8/7/2019 seminar ppt on ineracting with computers
http://slidepdf.com/reader/full/seminar-ppt-on-ineracting-with-computers 7/21
[contd..]� However, most people can speak more
quickly than they can type, and are much
more comfortable speaking than typing� Henceforth we come across the technique of
Interacting with computers using Voice
8/7/2019 seminar ppt on ineracting with computers
http://slidepdf.com/reader/full/seminar-ppt-on-ineracting-with-computers 8/21
Model for Speech Recognition
8/7/2019 seminar ppt on ineracting with computers
http://slidepdf.com/reader/full/seminar-ppt-on-ineracting-with-computers 9/21
Automatic Speech
Recognition(ASR):Defintion: This is the Technique used to convert
Speech to Text
Automatic speech recognition is among otherthings useful in situations where an operator isinputting data to a computer in parallel with usinghis hands for other tasks.
The recognition strategy used can in short bedescribed as an extraction of a number of speech
parameters from the acoustic speech signal foreach word.
8/7/2019 seminar ppt on ineracting with computers
http://slidepdf.com/reader/full/seminar-ppt-on-ineracting-with-computers 10/21
[contd]y In a training phase the operator will read all
the words of the vocabulary of the current
application. The word patterns are storedand later when a word is to be recognised itspattern is compared to the stored patternsand the word that gives the best
correspondence is selected. This techniqueis generally referred to as PatternRecognition.
8/7/2019 seminar ppt on ineracting with computers
http://slidepdf.com/reader/full/seminar-ppt-on-ineracting-with-computers 11/21
[contd]
8/7/2019 seminar ppt on ineracting with computers
http://slidepdf.com/reader/full/seminar-ppt-on-ineracting-with-computers 12/21
DESCRIPTION OF THE HARDWARE
BOARDSy The hardware of the word recogniser consists of a
general micro computer, and a signal processor for
the acoustic analysis of the speech signal.y The micro computer board consists of theMotorolaMC-68000 micro processor and also hasfacilities for the input and output of data andmemory managing circuits for the memory cards(to store the vocabulary).
y The speech analysis board implements a spectrumanalyser in the form of a 16 channel filter bank
8/7/2019 seminar ppt on ineracting with computers
http://slidepdf.com/reader/full/seminar-ppt-on-ineracting-with-computers 13/21
Text toSpeech Conversion(TTS):
8/7/2019 seminar ppt on ineracting with computers
http://slidepdf.com/reader/full/seminar-ppt-on-ineracting-with-computers 14/21
[contd.]yThe TTS is based on Speech synthesis by
diaphonic concatenation and consists of the
following three modules together with theuser interface module.
y Diaphone Database
y Text Processing moduley Speech Synthesiser.
8/7/2019 seminar ppt on ineracting with computers
http://slidepdf.com/reader/full/seminar-ppt-on-ineracting-with-computers 15/21
y The three main parts of the TTSsystem comprises of theIt consists of three parts
1. Preprocessing module
2.Text analysis module3. Synthesizer module.
8/7/2019 seminar ppt on ineracting with computers
http://slidepdf.com/reader/full/seminar-ppt-on-ineracting-with-computers 16/21
TTS representation using
concatenation:
Concatenation:A process/Technique for producing sound
from a text.It uses a set of basic sound elements for
Recognition.
8/7/2019 seminar ppt on ineracting with computers
http://slidepdf.com/reader/full/seminar-ppt-on-ineracting-with-computers 17/21
Hidden MarkovModel:Definition:. These are statistical models which
output a sequence of symbols or quantities.
� Modern general-purpose speech recognitionsystems are based on Hidden Markov Models.
� HMMs are used in speech recognition because aspeech signal can be viewed as a piecewise
stationary signal or a short-time stationary signal.
� HMMs are popular is because they can be trainedautomatically and are simple and computationally
feasible to use
8/7/2019 seminar ppt on ineracting with computers
http://slidepdf.com/reader/full/seminar-ppt-on-ineracting-with-computers 18/21
[contd.]y In speech recognition, the hidden Markov model
would output a sequence of n-dimensional real-
valued vectors (with n being a small integer, suchas 10), outputting one of these every 10milliseconds.
y The vectors would consist of cepstral coefficients,
which are obtained by taking a Fourier transformof a short time window of speech anddecorrelating the spectrum using a cosinetransform, then taking the first (most significant)
coefficients.
8/7/2019 seminar ppt on ineracting with computers
http://slidepdf.com/reader/full/seminar-ppt-on-ineracting-with-computers 19/21
[contd]y A hidden Markov model for a sequence of words or
phonemes is made by concatenating the individual
trained hiddenM
arkov models for the separate wordsand phonemes.
8/7/2019 seminar ppt on ineracting with computers
http://slidepdf.com/reader/full/seminar-ppt-on-ineracting-with-computers 20/21
Advantages and Disadvantages:Advantages:
1.)Phsyically disabled persons can also interact withthe computers using this technique.
2.)Typing can be done even faster without actual effort.3.)We can ask the computer to make anannouncement.eg:reading an e-mail.
Disadvantages:1.)This Mechanism doesnot help if the given word is out of its
vocabulary
2.)Doesnt take multiple inputs i.e when 2 persons are talkingsimultaneously.
3.)Doesnt work properly when there is cahnge of ascent in the givenword.
8/7/2019 seminar ppt on ineracting with computers
http://slidepdf.com/reader/full/seminar-ppt-on-ineracting-with-computers 21/21
Conclusions:y The Possible ways of Interacting with a
Computer using Voice are discussed in the
Paper.y A utomatic speech recognition technique(A SR)
and Text to speech conversion techniques areused for its implementation .
y The concept of Hidden Markov Model is realtime used for the synthesis of this Real-Timeaplication.