Proceedings

Fourth International Conference on Spoken Language Processing
ICSLP 96

October 3-6, 1996
Wyndham Franklin Plaza Hotel
Philadelphia, PA, USA

Sponsored by
University of Delaware
Alfred I. duPont Institute

in conjunction with
Acoustical Society of America
Acoustical Society of Japan
American Speech-Language-Hearing Association
Australian Speech Science and Technology Association
European Speech Communication Association
IEEE Signal Processing Society
Incorporated Canadian Acoustical Association
International Phonetic Association
Linguistic Society of America

502 Improved Probability Estimation with Neural Network Models
Wei Wei, Etienne Barnard, Mark Fanty

506 A Neural Network Using Acoustic Sub-word Units for Continuous Speech Recognition
Ha-Jin Yu, Yung-Hwan Oh

510 On the Error Criteria in Neural Networks as a Tool for Human Classification Modelling
Louis F. M. ten Bosch, Roel Smits

514 A Non-linear Filtering Approach to Stochastic Training of the Articulatory-acoustic Mapping Using the EM Algorithm
Gordon Ramsay

518 A Tool for Automated Design of Language Models
Y.P. Yang, J.R. Deller Jr.

522 Acoustic-phonetic Decoding Based on Elman Predictive Neural Networks
F. Freitag, E. Monte

526 On Improving Discrimination Capability of an RNN Based Recognizer
Tan Lee, P.C. Ching

530 An Evaluation of Statistical Language Modeling for Speech Recognition using a Mixed Category of Both Words and Parts-of-speech
Yumi Wakita, Jun Kawai, Hitoshi Iida

534 A Dialogue Control Strategy Based on the Reliability of Speech Recognition
Yasuhisa Niimi, Yutaka Kobayashi

538 SpeechWear: A Mobile Speech System
Alexander I. Rudnicky, Stephen Reed, Eric H. Thayer

542 WHEELS: A Conversational System in the Automobile Classifieds Domain
Helen Meng, Senis Busayapongchai, James Glass, David Goddeau, Lee Hetherington, Edward Hurley, Christine Pao, Joseph Polifroni, Stephanie Seneff, Victor Zue

546 Effective Human-computer Cooperative Spoken Dialogue: The AGS Demonstrator
M.D. Sadek, A. Ferrieux, A. Cozannet, P. Bretier, F. Panaget, J. Simonin

550 Dialog in the RAILTEL Telephone-based System
S.K. Bennacef, L. Devillers, S. Rosset, Lori Lamel

554 Dialogue Processing in a Conversational Speech Translation System
Alon Lavie, Lori Levin, Yan Qu, Alex Waibel, Donna Gates, Marsal Gavalda, Laura Mayfield, Maite Taboada

558 Novel Speech Processing Mechanism Derived from Auditory Neocortical Circuit Analysis
Boris Aleksandrovsky, James Whitson, Gretchen Andes, Gary Lynch, Richard Granger

562 Modeling Neurons in the Anteroventral Cochlear Nucleus for Amplitude Modulation (AM) Processing: Application to Speech Sound
Ping Tang, Jean Rouat

566 Noise Suppression and Loudness Normalization in an Auditory Model-based Acoustic Front-end
Halewijn Vereecken, Jean-Pierre Martens

570 A Psychoacoustic Model for the Noise Masking of Voiceless Plosive Bursts
Jim Hant, Brian Strope, Abeer Alwan

574 Training Machine Classifiers to Match the Performance of Human Listeners in a Natural Vowel Classification Task
Martin Hunke, Thomas Holton

578 A Neural Matrix Model for Active Tracking of Frequency-modulated Tones
Kiyoaki Aikawa, Hideki Kawahara, Minoru Tsuzaki

FrA1L1 - Utterance Verification and Word Spotting

582 A User-Configurable System for Voice Label Recognition
Richard C. Rose, Eduardo Lleida, G.W. Erhart, R.V. Grubbe

586 Keyword Spotting Enhancement for Video Soundtrack Indexing
Philippe Gelin, Chris. J. Wellekens

590 New Efficient Fillers for Unlimited Word Recognition and Keyword Spotting
Rachida El Meliani, Douglas O'Shaughnessy

594 Automatic Transcription of General Audio Data: Preliminary Analyses
Michelle S. Spina, Victor Zue

598 Transcribing Radio News
Francis Kubala, Tasos Anastasakos, Hubert Jin, Long Nguyen, Richard Schwartz

602 Correcting Recognition Errors via Discriminative Utterance Verification
Anand R. Setlur, Rafid A. Sukkar, John Jacob

<m/LeanungTxainingX2

606 Does Training in Speech Perception Modify Speech Production?
Reiko Akahane-Yamada, Yoh'ichi Tohkura, Ann R. Bradlow, David B. Pisoni

610 Phrase-Final Lengthening and Stress-Timed Shortening in the Speech of Native Speakers and Japanese Learners of English
Motoko Ueyama

614 Japanese Accentuations by Foreign Students and Japanese Speakers of Non-Tokyo Dialect
Nobuko Yamada

618 Devoicing of Japanese Vowels by Taiwanese Learners of Japanese
J. Kevin Varden, Tsutomu Sato

622 Fluency and Use of Segmental Dialect Features in the Acquisition of a Second Language (French) by English Speakers
Daniele Archambault, Catherine Foucher, Blagovesta Maneva

626 Estimating Child and Adolescent Formant Frequency Values From Adult Data
P. Martland, S.P. Whiteside, Steve W. Beet, L. Baghai-Ravary

Focus, Stress and Accent

630 Acoustic Correlates of Linguistic Stress and Accent in Dutch and American English
Agaath M.C. Sluijter, Vincent J. van Heuven

634 On the Levels of Accentuation in Spoken Japanese
Hiroya Fujisaki, Sumio Ohno, Osamu Tomita

638 Tonal Distinctions Between Emphatic Stress and Pretonic Lengthening in Quebec French
Linda Thibault, Marise Ouellet

642 Distinction Between 'Normal' Focus and 'Contrastive/Emphatic' Focus
Anja (Petzold) Elsner

646 Perception of Tonal Accent by Americans Learning Japanese
Yukihiro Nishinuma, Masako Arai, Takako Ayusawa

650 Modeling Intra-Speaker Pitch Range Variation: Predicting F0 Targets when "Speaking Up"
Elizabeth Shriberg, D. Robert Ladd, Jacques Terken

Spoken Language

654 Predicting Dialogue Acts for a Speech-To-Speech Translation System
Norbert Reithinger, Ralf Engel, Michael Kipp, Martin Klesen

658 Automatic Speech Translation Based on the Semantic Structure
Johannes Müller, Holger Stahl, Manfred Lang

662 A Methodology for Application Development for Spoken Language Systems
Lewis M. Norton, Carl E. Weir, K.W. Scholz, Deborah A. Dahl, Ahmed Bouzid

665 A New Restaurant Guide Conversational System: Issues in Rapid Prototyping for Specialized Domains
Stephanie Seneff, Joseph Polifroni

669 Semantic Interpretation of a Japanese Complex Sentence in an Advisory Dialogue - Focused on the Postpositional Word "KEDO," Which Works as a Conjunction Between Clauses
Tadahiko Kumamoto, Akira Ito

673 A Korean Morphological Analyzer for Speech Translation System
Youngkuk Hong, Myoung-Wan Koo, Gijoo Yang

677 Generic and Domain-specific Aspects of the Waxholm NLP and Dialog Modules
Rolf Carlson, Sheri Hunnicutt

681 A Real-Time System for Summarizing Human-Human Spontaneous Spoken Dialogues
Megumi Kameyama, Goh Kawai, Isao Arima

685 Evaluation of Spoken Language Understanding and Dialogue Systems
Bernd Hildebrandt, Heike Rautenstrauch, Gerhard Sagerer

689 Inter-Speaker Interaction of F0 in Dialogs
Kuniko Kakita

693 A Robust Dialogue System for Making an Appointment
Hans Brandt-Pook, Gernot A. Fink, Bernd Hildebrandt, Franz Kummert, Gerhard Sagerer

697 Segmentation of Spoken Dialogue by Interjections, Disfluent Utterances and Pauses
Kazuyuki Takagi, Shuichi Itahashi

701 A Form-Based Dialogue Manager for Spoken Language Applications
David Goddeau, Helen Meng, Joe Polifroni, Stephanie Seneff, Senis Busayapongchai

705 The Design of Complex Telephony Applications Using Large Vocabulary Speech Technology
S.J. Whittaker, D.J. Attwater

709 Building 10,000 Spoken Dialogue Systems
Stephen Sutton, David G. Novick, Ronald A. Cole, Pieter Vermeulen, Jacques de Villiers, Johan Schalkwyk, Mark Fanty

713 Speaker Intention Modeling for Large Vocabulary Mandarin Spoken Dialogues
Yen-Ju Yang, Lee-Feng Chien, Lin-Shan Lee

717 Hybrid Language Models and Spontaneous Legal Discourse
P.E. Kenne, Mary O'Kane

721 Topic Change and Local Perplexity in Spoken Legal Dialogue
P.E. Kenne, Mary O'Kane

725 Intonational Cues to Discourse Structure in Japanese
Jennifer J. Venditti, Marc Swerts

729 Principles for the Design of Cooperative Spoken Human-Machine Dialogue
Niels Ole Bernsen, Hans Dybkjær, Laila Dybkjær

733 Development and Comparison of Three Syllable Stress Classifiers
Karen L. Jenkin, Michael S. Scordilis

FrA1E2 - Speech Disorders

737 Interaction of Speech Disorders with Speech Coders - Effects on Speech Intelligibility
D.G. Jamieson, Li Deng, M. Price, Vijay Parsa, J. Till

741 Detecting Arytenoid Cartilage Misplacement through Acoustic and Electroglottographic Jitter Analysis
Maurilio N. Vieira, Arnold G. D. Maran, Fergus R. McInnes, Mervyn A. Jack

745 Robust F0 and Jitter Estimation in Pathological Voices
Maurilio N. Vieira, Fergus R. McInnes, Mervyn A. Jack

749 Speech Monitoring of Infective Laryngitis
F. Plante, H. Kessler, B.M.G. Cheetham, J. Earis

753 Searching for Nonlinear Relations in Whitened Jitter Time Series
J. Schoentgen, R. De Guchteneere

757 Vocal Fold Pathology Assessment using AM Autocorrelation Analysis of the Teager Energy Operator
Liliana Gavidia-Ceballos, John H.L. Hansen, James F. Kaiser

761 Continuous Positive Airway Pressure (CPAP) in the Treatment of Hypernasality
David P. Kuehn

764 Enhancement of Alaryngeal Speech by Adaptive Filtering
Carol Y. Espy-Wilson, Venkatesh R. Chari, Caroline B. Huang

768 Simulation of Disordered Speech Using a Frequency-Domain Vocal Tract Model
Li Deng, Xuemin Shen, D.G. Jamieson, J. Till

772 A Stochastic Model of Fundamental Period Perturbation and Its Application to Perception of Pathological Voice Quality
Yasuo Endo, Hideki Kasuya

776 A Screening Test for Speech Pathology Assessment Using Objective Quality Measures
Eric J. Wallen, John H.L. Hansen

780 Recent Advances in Hypernasal Speech Detection using the Nonlinear Teager Energy Operator
Douglas A. Cairns, John H.L. Hansen, James F. Kaiser

Vocal Tract Geometry I

784 Human Palate and Related Structures: Their Articulatory Consequences
Kiyoshi Honda, Shinji Maeda, Michiko Hashi, Jim Dembowski, John R. Westbury

788 A Continuum Mechanics Representation of Tongue Deformation
Edward P. Davis, Andrew Douglas, Maureen Stone

793 From MRI and Acoustic Data to Articulatory Synthesis: A Case Study of the Lateral Approximants in American English
Philbert Bangayan, Abeer Alwan, Shrikanth Narayanan

797 Liquids in Tamil
Shrikanth Narayanan, Abigail Kaun, Dani Byrd, Peter Ladefoged, Abeer Alwan

FrA2L1 - Prosody in ASR and Segmentation

801 Modeling Hyperarticulate Speech during Human-computer Error Resolution
Sharon Oviatt, Gina-Anne Levow, Margaret MacEachern, Karen Kuhn

805 Using Stress to Disambiguate Spoken Thai Sentences Containing Syntactic Ambiguity
Siripong Potisuk, Mary P. Harper, Jackson T. Gandour

809 Use of Prosodic Information to Integrate Acoustic and Linguistic Knowledge in Continuous Mandarin Speech Recognition with Very Large Vocabulary
Hung-yun Hsieh, Ren-yuan Lyu, Lin-shan Lee

813 Word Boundary Detection using Pitch Variations
G. V. Ramana Rao, J. Srichand

817 Detection of Phrase Boundaries in Japanese by Low-Pass Filtering of Fundamental Frequency Contours
Atsuhiro Sakurai, Keikichi Hirose

821 A New Method for Speech Delexicalization, and its Application to the Perception of French Prosody
V. Pagel, N. Carbonell, Yves Laprie

825 Task Adaptation for Dialogues Via Telephone Lines
Udo Bub

829 The Influence of Bigram Constraints on Word Recognition by Humans: Implications for Computer Speech Recognition
Ronald A. Cole, Yonghong Yan, Troy Bailey

833 ALICE: Acquisition of Language In Conversational Environment - An Approach to Weakly Supervised Training of Spoken Language System for Language Porting
Tetsunori Kobayashi

837 Pitch Pattern Clustering of User Utterances in Human-Machine Dialogue
Takashi Yoshimura, Satoru Hayamizu, Hiroshi Ohmura, Kazuyo Tanaka

841 Simplifying Language through Error-correcting Decoding
J.C. Amengual, E. Vidal, J.M. Benedi

845 A Mixed Approach to Speech Understanding
Mauro Cettolo, Anna Corazza, Renato De Mori

FrA2L3 - Dialogue Systems

849 Speech Recognition for an Information Kiosk
J.L. Gauvain, J.J. Gangolf, L. Lamel

853 Localizing an Automatic Inquiry System for Public Transport Information
Helmer Strik, Albert Russel, Henk van den Heuvel, Catia Cucchiarini, Louis Boves

857 Prompt Constrained Natural Language - Evolving the Next Generation of Telephony Services
Stephen M. Marcus, Deborah W. Brown, Randy G. Goldberg, Max S. Schoeffler, William R. Wetzel, Richard R. Rosinski

861 Key-Phrase Detection and Verification for Flexible Speech Understanding
Tatsuya Kawahara, Chin-Hui Lee, Biing-Hwang Juang

865 Interactive Recovery from Speech Recognition Errors in Speech User Interfaces
Bernhard Suhm, Brad Myers, Alex Waibel

869 Estimation of Language Models for New Spoken Language Applications
Sunil Issar

FrA2E1 - Speech Enhancement and Robust-

873 H-infinity Filtering for Speech Enhancement
Xuemin Shen, Li Deng, Anisa Yasmin

877 A Comparative Analysis of Channel-Robust Features and Channel Equalization Methods for Speech Recognition
Saeed V. Vaseghi, Ben Milner

881 Robust Speech Recognition Features Based on Temporal Trajectory Filtering of Frequency Band Spectrum
Jia-lin Shen, Wen-liang Hwang, Lin-shan Lee

885 Durational Modelling for Improved Connected Digit Recognition
Kevin Power

889 Study on the Dereverberation of Speech Based on Temporal Envelope Filtering
Carlos Avendano, Hynek Hermansky

893 Estimating Markov Model Structures
Thorsten Brants

897 A Fertility Channel Model for Post-Correction of Continuous Speech Recognition
Eric K. Ringger, James F. Allen

901 Restoration of Wide Band Signal from Telephone Speech using Linear Prediction Error Processing
Hiroshi Yasukawa

905 Smoothed Spectral Subtraction for a Frequency-Weighted HMM in Noisy Speech Recognition
Hiroshi Matsumoto, Noboru Naitoh

909 A Simple Architecture for using Multiple Cues in Sound Separation
William S. Woods, Martin Hansen, Thomas Wittkop, Birger Kollmeier

913 On the Robust Automatic Segmentation of Spontaneous Speech
Bojan Petek, Ove Andersen, Paul Dalsgaard

917 Bayesian Adaptation of Speech Recognizers to Field Speech Data
C.G. Miglietta, C. Mokbel, D. Jouvet, J. Monne

921 Sub-band Adaptive Filtering Applied to Speech Enhancement
A. J. Darlington, D. J. Campbell

925 Noise Robust Estimate of Speech Dynamics for Speaker Recognition
J. P. Openshaw, John S. Mason

929 Overview of Speech Enhancement Techniques for Automatic Speaker Recognition
Javier Ortega-Garcia, Joaquin González-Rodriguez

933 Dynamic Features for Segmental Speech Recognition
Naomi Harte, Saeed V. Vaseghi, Ben Milner

937 Speech Recognition Based on a Model of Human Auditory System
Takuya Koizumi, Mikio Mori, Shuji Taniguchi

941 APVQ Encoder Applied to Wideband Speech Coding
J.M. Salavedra, E. Masgrau

945 Simple Fast Vector Quantization of the Line Spectral Frequencies
Jin Zhou, Yair Shoham, Ali Akansu

FrA2S1 - Vocal Tract Geometry II

949 Speaker Individualities of Vocal Tract Shapes of Japanese Vowels Measured by Magnetic Resonance Images
Chang-Sheng Yang, Hideki Kasuya

953 Vocal Tract Acoustics Using the Transmission Line Matrix (TLM) Method
S. El-Masri, X. Pelorson, P. Saguet, P. Badin

957 Building Sensori-motor Prototypes from Audiovisual Exemplars
Gérard Bailly

961 Parameterized VT Area Function Inversion
Mats Båvegård, Gunnar Fant

965 An Improved Vocal Tract Model of Vowel Production Implementing Piriform Resonance and Transvelar Nasal Coupling
Jianwu Dang, Kiyoshi Honda

969 Pseudo-articulatory Speech Synthesis for Recognition using Automatic Feature Extraction from X-Ray Data
C. S. Blackburn, S. J. Young

FrP1L1 - Speaker Adaptation and Normalization I

973 N-best-based Instantaneous Speaker Adaptation Method for Speech Recognition
Tomoko Matsui, Sadaoki Furui

977 Mixture Splitting Technic and Temporal Control in a HMM-based Recognition System
C. Montacie, M.-J. Caraty, C. Barras

981 An Unified Spectral Transformation Adaptation Approach for Robust Speech Recognition
Lei Yao, Dong Yu, Taiyi Huang

985 On-line Adaptive Learning of the Correlated Continuous Density Hidden Markov Models for Speech Recognition
Qiang Huo, Chin-Hui Lee

989 Speaker Adaptation by Modeling the Speaker Variation in a Continuous Speech Recognition System
Nikko Ström

993 An Enquiring System of Unknown Words in TV News by Spontaneous Repetition (Application of Speaker Normalization by Speaker Subspace Projection)
Yasuo Ariki, Shigeaki Tagashira

FrP1L2 - Spoken Language and NLP I

997 Language Understanding using Hidden Understanding Models
Richard Schwartz, Scott Miller, David Stallard, John Makhoul

1001 Processing of Semantic Information in Fluently Spoken Language
Allen L. Gorin

1005 Automatic Linguistic Segmentation of Conversational Speech
Andreas Stolcke, Elizabeth Shriberg

1009 Towards Understanding Spontaneous Speech: Word Accuracy vs. Concept Accuracy
M. Boros, W. Eckert, Florian Gallwitz, G. Görz, G. Hanrieder, Heinrich Niemann

1013 A Stochastic Case Frame Approach for Natural Language Understanding
Wolfgang Minker, S.K. Bennacef, J.L. Gauvain

1017 Improving Speech Understanding by Incorporating Database Constraints and Dialogue History
Frank Seide, Bernhard Rüber, Andreas Kellner

FrP1L3 - Spoken Discourse Analysis/Synthesis

1021 A New Discourse Structure Model for Spontaneous Spoken Dialogue
Tetsuro Chino, Hiroyuki Tsuboi

1025 An Architecture for Spoken Dialogue Management
David Duff, Barbara Gates, Susann LuperFoy

1029 Pausing Strategies in Discourse in Dutch
Monique E. van Donzel, Florien J. Koopmans-van Beinum

1033 Filled Pauses as Markers of Discourse Structure
Marc Swerts, Anne Wichmann, Robbert-Jan Beun

1037 The Prosodic Analysis of Korean Dialogue Speech - Through a Comparative Study with Read Speech
Cheol-jae Seong, Minsoo Hahn

1041 Changing the Topic: How Long Does it Take?
Mary O'Kane, P.E. Kenne

FrP1P1 - Acoustic Modeling I

1045 Learning Pronunciation Dictionary from Speech Data
Christian-Michael Westendorf, Jens Jelitto

1049 The Trended HMM with Discriminative Training for Phonetic Classification
C. Rathinavelu, Li Deng

1053 Improving Decision Trees for Acoustic Modeling
Ariane Lazarides, Yves Normandin, Roland Kuhn

1057 An Improved Training Algorithm in HMM-based Speech Recognition
Gongjun Li, Taiyi Huang

1061 Speech Recognition Using a Strong Correlation Assumption for the Instantaneous Spectra
J. Ming, P. O'Boyle, J. McMahon, F. J. Smith

1065 On Parameter Filtering in Continuous Subword-unit-based Speech Recognition
Pau Paches-Leal, Climent Nadeu

1069 Estimation of Statistical Phoneme Center Considering Phonemic Environments
Shigeki Okawa, Katsuhiko Shirai

1073 Integration of Context-dependent Durational Knowledge into HMM-based Speech Recognition
Xue Wang, Louis F. M. ten Bosch, Louis C. W. Pols

1077 Speech Recognition Based on Acoustically Derived Segment Units
T. Fukada, M. Bacchiani, K.K. Paliwal, Yoshinori Sagisaka

1081 Robust Gender-dependent Acoustic-phonetic Modelling in Continuous Speech Recognition Based on a New Automatic Male/Female Classification
Rivarol Vergin, Azarshid Farhat, Douglas O'Shaughnessy

1085 A Codebook Adaptation Algorithm for SCHMM Using Formant Distribution
Tae Young Yang, Won Ho Shin, Weon Goo Kim, Dae Hee Youn

1089 Parameter Tying for Flexible Speech Recognition
J. Simonin, S. Bodin, D. Jouvet, K. Bartkova

1093 Word-spotting Based on Inter-word and Intra-word Diphone Models
Tsuneo Nitta, Shin'ichi Tanaka, Yasuyuki Masai, Hiroshi Matsu'ura

1097 Duration Modeling with Expanded HMM Applied to Speech Recognition
Antonio Bonafonte, Josep Vidal, Albino Nogueiras

1101 Different Strategies for Distribution Clustering using Discrete, Semicontinuous and Continuous HMMs in CSR
Ricardo de Córdoba, Jose M. Pardo

1105 Improved HMM Phone and Triphone Models for Realtime ASR Telephony Applications
Ilija Zeljkovic, Shrikanth Narayanan

1109 Improved Extended HMM Composition by Incorporating Power Variance
Yasuhiro Minami, Sadaoki Furui

1113 Optimal Filtering and Smoothing for Speech Recognition using a Stochastic Target Model
Gordon Ramsay, Li Deng

1117 Speech Recognition Using Syllable-like Units
Zhihong Hu, Johan Schalkwyk, Etienne Barnard, Ronald A. Cole

FrP1S1 - Physics and Simulation of the Vocal Tract I

1121 Search for Unexplored Effects in Speech Production
C.H. Coker, M.H. Krone, B.Y. Reis, R.A. Kubli

* Computational Models for Speech Generation
S. Levinson

1125 Articulatory Synthesis from X-rays and Inversion for an Adaptive Speech Robot
P. Badin, C. Abry

1129 Adaptive Recognition Method Based on Posterior Use of Distribution Pattern of Output Probabilities
Jin-Song Zhang, Beiqian Dai, Changfu Wang, Hingkeung Kwan, Keikichi Hirose

1133 Iterative Unsupervised Adaptation Using Maximum Likelihood Linear Regression
P.C. Woodland, D. Pye, M.J.F. Gales

1137 A Compact Model for Speaker-Adaptive Training
Tasos Anastasakos, John McDonough, Richard Schwartz, John Makhoul

1141 Iterative Unsupervised Speaker Adaptation for Batch Dictation
Shigeru Homma, Jun-ichi Takahashi, Shigeki Sagayama

1145 Rapid Unsupervised Adaptation to Children's Speech on a Connected-Digit Task
Daniel C. Burnett, Mark Fanty

1149 Speaker Adaptation Using Tree Structured Shared-State HMMs
Jun Ishii, Masahiro Tonomura, Shoichi Matsunaga

FrP2L2 - Spoken Language and NLP II

1153 Learning to Parse Spontaneous Speech
Finn Dag Buo, Alex Waibel

1157 Spontaneous Speech and Natural Language Processing ALPES: A Robust Semantic-led Parser
Jean-Yves Antoine

1161 The Natural Language Processing Module for a Voice Assisted Operator at Telefónica I+D
J. Alvarez-Cercadillo, J. Caminero-Gil, C. Crespo-Casas, D. Tapias-Merino

1165 Compound Words in Large-Vocabulary German Speech Recognition Systems
Andre Berton, Pablo Fetter, Peter Regel-Brietzmann

1169 Prosody, Empty Categories and Parsing - A Success Story
Anton Batliner, A. Feldhaus, S. Geissler, T. Kiss, Ralf Kompe, Elmar Nöth

1173 "Almost Parsing" Technique for Language Modeling
B. Srinivas

FrP2L3 - Duration and Rhythm

1177 From Segmental Duration Properties to Rhythmic Structure: A Study of Interactions Between High and Low Level Constraints
Marise Ouellet, Benoit Tardif

1181 Analysis of Context-dependent Segmental Duration for Automatic Speech Recognition
Xue Wang, Louis C. W. Pols, Louis F. M. ten Bosch

1185 The Role of the Rhythmic Groups in the Segmentation of Continuous French Speech
Delphine Dahan

1189 The Implications of Temporal Patterns for the Prosody of Boundary Signaling in Connected Speech
Zita McRobbie-Utasi

1193 Experimental Phonetic Study of the Syllable Duration of Korean with Respect to the Positional Effect
Hyunbok Lee, Cheol-jae Seong

1197 Timing of Pitch Movements and Accentuation of Syllables
Dik J. Hermes

FrP2P1 - Acoustic Analysis

1201 A Probabilistic Approach to AMDF Pitch Detection
Goangshiuan S. Ying, Leah H. Jamieson, Carl D. Mitchell

1205 From Sagittal Cut to Area Function: An RMI Investigation
Alain Soquet, Véronique Lecuit, Thierry Metens, Didier Demolin

1209 Pitch Detection and Voiced/Unvoiced Decision Algorithm Based on Wavelet Transforms
Leonard Janer, Juan Jose Bonet, Eduardo Lleida-Solano

1213 Decomposition of Speech Signals into a Deterministic and a Stochastic Part
Yannis Stylianou

1217 Improved Glottal Closure Instant Detector based on Linear Prediction and Standard Pitch Concept
Cheol-Woo Jo, Ho-Gyun Bang, W.A. Ainsworth

1221 Analysis of Speech Segments using Variable Spectral/Temporal Resolution
Xihong Wang, Stephen A. Zahorian, Stefan Auberg

1225 Time-based Clustering for Phonetic Segmentation
Brian Eberman, William Goldenthal

1229 Formant Analysis Using Mixtures of Gaussians
Parham Zolfaghari, Tony Robinson

1233 Deriving Articulatory Representations from Speech with Various Excitation Modes
Hywel B. Richards, John S. Mason, Melvyn J. Hunt, John S. Bridle

1237 "Blind" Speech Segmentation: Automatic Segmentation of Speech Without Linguistic Knowledge
Manish Sharma, Richard J. Mammone

1241 Speech Synthesis Using a Nonlinear Energy Damping Model for the Vocal Folds Vibration Effect
Hiroshi Ohmura, Kazuyo Tanaka

1245 Neural Networks Learning with L1 Criteria and Its Efficiency in Linear Prediction of Speech Signals
Munehiro Namba, Hiroyuki Kamata, Yoshihisa Ishida

1249 Preprocessing and Neural Classification of English Stop Consonants [b,d,g,p,t,k]
A. Esposito, C. E. Ezin, M. Ceccarelli

1253 A Comparison of Modified k-means (MKM) and NN based Real Time Adaptive Clustering Algorithms for Articulatory Space Codebook Formation
K.S. Ananthakrishnan

1257 A Novel Approach to the Estimation of Voice Source and Vocal Tract Parameters from Speech Signals
Wen Ding, Hideki Kasuya

1261 Syllable Detection in Read and Spontaneous Speech
Hartmut R. Pfitzinger, Susanne Burger, Sebastian Held

1265 Maximum Likelihood Learning of Auditory Feature Maps for Stationary Vowels
Kuansan Wang, Chin-Hui Lee, Biing-Hwang Juang

1269 Explicit Segmentation of Speech using Gaussian Models
Antonio Bonafonte, Albino Nogueiras, Antonio Rodriguez-Garrido

1273 A Comparison of Several Recent Methods of Fundamental Frequency and Voicing Decision Estimation
E. Mousset, W.A. Ainsworth, Jose A. R. Fonollosa

1277 Robust Pitch Estimation with Harmonics Enhancement in Noisy Environments Based on Instantaneous Frequency
Toshihiko Abe, Takao Kobayashi, Satoshi Imai

1281 Integrated Polispectrum on Speech Recognition
Asunción Moreno, Miquel Rutllán

FrP2S1 - Physics and Simulation of the Vocal Tract

1285 Analysis of Acoustic Properties of the Nasal Tract Using 3-D FEM
Hisayoshi Suzuki, Takayoshi Nakai, Hirosi Sakakibara

1289 Experiments with Analysis By Synthesis of Glottal Airflow
Johan Liljencrants

SaA1L1 - Speech Recognition Using

1293 An Incremental Speaker-Adaptation Technique for Hybrid HMM-MLP Recognizer
Joao P. Neto, Ciro A. Martins, Luis B. Almeida

1297 Phoneme Segmentation of Continuous Speech using Multi-layer Perceptron
Youngjoo Sun, Youngjik Lee

1301 Stochastic Perceptual Speech Models with Durational Dependence
Jeff Bilmes, Nelson Morgan, Su-Lin Wu, Herve Bourlard

1305 Boosting the Performance of Connectionist Large Vocabulary Speech Recognition
G.D. Cook, A.J. Robinson

1309 HMMs and OWE Neural Network for Continuous Speech Recognition
Nicolas Pican, Dominique Fohr, Jean-Francois Mari

1313 Smoothed Local Adaptation of Connectionist Systems
Steve Waterhouse, Dan Kershaw, Tony Robinson

SaA1L2 - Adverse Environments and Multiple Microphones

1317 Robust Speech Recognition with Speaker Localization by a Microphone Array
Takeshi Yamada, Satoshi Nakamura, Kiyohiro Shikano

1321 Sound Source Localization in Reverberant Environments using an Outlier Elimination Algorithm
Ea-Ee Jan, James L. Flanagan

1325 The 1995 Abbot LVCSR System for Multiple Unknown Microphones
Dan Kershaw, Tony Robinson, Steve Renals

1329 Experiments of Speech Recognition in a Noisy and Reverberant Environment using a Microphone Array and HMM Adaptation
D. Giuliani, M. Omologo, P. Svaizer

