ICSLP 1992:
Banff, Alberta, Canada
The Second International Conference on Spoken Language Processing, ICSLP 1992, Banff, Alberta, Canada, October 13-16, 1992.
ISCA 1992
Word Spotting in ASR
Speech Coding
Speech Production:
Coarticulation
- Antonio Bonafonte, José B. Mariño, Montse Pardàs:
Efficient integration of coarticulation and lexical information in a finite state grammar.

- Herbert A. Leeper, A. P. Rochet, Ian R. A. MacKay:
Characteristics of nasalance in canadian speakers of English and French.

- Christine H. Shadle, Andre Moulinier, Christian U. Dobelke, Celia Scully:
Ensemble averaging applied to the analysis of fricative consonants.

- Andrew Slater, Sarah Hawkins:
Effects of stress and vowel context on velar stops in british English.

- Emanuela Magno Caldognetto, Kyriaki Vagges, Giancarlo Ferrigno, Maria Grazia Busa:
Lip rounding coarticulation in Italian.

Auditory Models
Recognition of Telephone Speech
- Hynek Hermansky, Nelson Morgan:
Towards handling the acoustic environment in spoken language processing.

- Alberto Ciaramella, Davide Clementino, Roberto Pacifici:
Real-time speaker-independent large-vocabulary CDHMM-based continuous telephonic speech recognizer.

- Matthew Lennig, Douglas Sharp, Patrick Kenny, Vishwa Gupta, Kristin Precoda:
Flexible vocabulary recognition of speech.

- Benjamin Chigier, Hong C. Leung:
The effects of signal representations, phonetic classification techniques, and the telephone network.

Text-to-Speech Synthesis 1, 1
- Leon Gulikers, Rijk Willemse:
A lexicon for a text-to-speech system.

- Rijk Willemse, Leon Gulikers:
Word class assignment in a text-to-speech system.

- Gösta Bruce, Björn Granström, Kjell Gustafson, David House:
Aspects of prosodic phrasing in Swedish.

- Kirk P. H. Sullivan, Robert I. Damper:
Synthesis-by-analogy: a bilingual investigation using German and English.

- Leonard C. Manzara, David R. Hill:
Degas: a system for rule-based diphone speech synthesis.

- Shyam Sunder Agrawal, Kenneth N. Stevens:
Towards synthesis of Hindi consonants using KLSYN88.

- Louis C. W. Pols:
Multi-lingual synthesis evaluation methods.

- Björn Granström, Petur Helgason, Hoskuldur Thrainsson:
The interaction of phonetics, phonology and morphology in an icelandic text-to-speech system.

Voice Source Characteristics
Speech Perception:
Higher-Order Processes 1, 1
- David R. Traum, James F. Allen:
A "speech acts" approach to grounding in conversation.

- Sheila Meltzer:
Antecedent activation by empty pronominals in Spanish.

- Ron Smyth:
Multiple feature matching in pronoun resolution: a new look at parallel function.

- Keh-Yih Su, Jing-Shin Chang, Yi-Chung Lin:
A discriminative approach for ambiguity resolution based on a semantic score function.

- Nobuaki Minematsu, Sumio Ohno, Keikichi Hirose, Hiroya Fujisaki:
The influence of semantic and syntactic information on spoken sentence recognition.

- Lynne C. Nygaard, Mitchell Sommers, David B. Pisoni:
Effects of speaking rate and talker variability on the representation of spoken words in memory.

- Hugo Quené, Yvette Smits:
On the absence of word segmentation at "weak" syllables.

- Mitchell Sommers, Lynne C. Nygaard, David B. Pisoni:
Stimulus variability and the perception of spoken words: effects of variations in speaking rate and overall amplitude.

- James M. McQueen, Anne Cutler:
Words within words: lexical statistics and lexical access.

Speaker-Independent Word Recognition
- Stephan Euler, Joachim Zinke:
Experiments on the use of the generalized probabilistic descent method in speech recognition.

- Ricardo de Córdoba, José Manuel Pardo, José Colás:
Improving and optimizing speaker independent, 1000 words speech recognition in Spanish.

- John F. Pitrelli, David Lubensky, Benjamin Chigier, Hong C. Leung:
Multiple-level evaluation of speech recognition systems.

- Tatsuya Kimura, Mitsuru Endo, Shoji Hiraoka, Katsuyuki Niyada:
Speaker independent word recognition using continuous matching of parameters in time-spectral form based on statistical measure.

- R. Roddeman, H. Drexler, Louis Boves:
Automatic derivation of lexical models for a very large vocabulary speech recognition system.

Human Factors
Continuous Speech Recognition 1, 2
- Patrick Kenny, Rene Hollan, Gilles Boulianne, Harinath Garudadri, Yan Ming Cheng, Matthew Lennig, Douglas D. O'Shaughnessy:
Experiments in continuous speech recognition with a 60, 000 word vocabulary.

- Gilles Boulianne, Patrick Kenny, Matthew Lennig, Douglas D. O'Shaughnessy, Paul Mermelstein:
HMM training on unconstrained speech for large vocabulary, continuous speech recognition.

- David Rainion, Shigeki Sagayama:
Appropriate error criterion selection for continuous speech HMM minimum error training.

- Akito Nagai, Kenji Kita, Toshiyuki Hanazawa, Tadashi Suzuki, Tomohiro Iwasaki, Tsuyoshi Kawabata, Kunio Nakajima, Kiyohiro Shikano, Tsuyoshi Morimoto, Shigeki Sagayama, Akira Kurematsu:
Hardware implementation of realtime 1000-word HMM-LR continuous speech recognition.

- Madeleine Bates, Robert J. Bobrow, Pascale Fung, Robert Ingria, Francis Kubala, John Makhoul, Long Nguyen, Richard M. Schwartz, David Stallard:
Design and performance of HARC, the BBN spoken language understanding system.

- Otoya Shirotsuka, G. Kawai, Michael Cohen, Jared Bernstein:
Performance of speaker-independent Japanese recognizer as a function of training set size and diversity.

- Kouichi Yamaguchi, Shigeki Sagayama, Kenji Kita, Frank K. Soong:
Continuous mixture HMM-LR using the a* algorithm for continuous speech recognition.

- Kenji Kita, Tsuyoshi Morimoto, Kazumi Ohkura, Shigeki Sagayama:
Continuously spoken sentence recognition by HMM-LR.

- Akinori Ito, Shozo Makino:
Word pre-selection using a redundant hash addressing method for continuous speech recognition.

- Andrej Ljolje, Michael D. Riley:
Optimal speech recognition using phone recognition and lexical access.

Natural Language Processing and Speech Understanding 1-3
- Nick Waegner, Steve J. Young:
A trellis-based language model for speech recognition.

- Carla B. Zoltowski, Mary P. Harper, Leah H. Jamieson, Randall A. Helzerman:
PARSEC: a constraint-based framework for spoken language understanding.

- Gareth J. F. Jones, Jeremy H. Wright, E. N. Wrigley:
The HMM interface with hybrid grammar-bigram language models for speech recognition.

- Atsuhiko Kai, Seiichi Nakagawa:
A frame-synchronous continuous speech recognition algorithm using a top-down parsing of context-free grammar.

- Fernando Pereira, David B. Roe:
Empirical properties of finite state approximations for phrase structure grammars.

- Stephanie Seneff, Helen M. Meng, Victor Zue:
Language modelling for recognition and understanding using layered bigrams.

- David Goddeau:
Using probabilistic shift-reduce parsing in speech recognition systems.

- Tim Howells, David Friedman, Mark A. Fanty:
Broca, an integrated parser for spoken language.

- P. V. S. Rao, Nandini Bondale:
Blank slate language processor for speech recognition.

- Eric Jackson:
Integrating two complementary approaches to spoken language understanding.

- Marcello Pelillo, Mario Refice:
Learning compatibility coefficients for word-class disambiguation relaxation processes.

- Kaichiro Hatazaki, Jun Noguchi, Akitoshi Okumura, Kazunaga Yoshida, Takao Watanabe:
INTERTALKER: an experimental automatic interpretation system using conceptual representation.

- Tsuyoshi Morimoto, Toshiyuki Takezawa, Kazumi Ohkura, Masaaki Nagata, Fumihiro Yato, Shigeki Sagayama, Akira Kurematsu:
Enhancement of ATR's spoken language translation system: SL-TRANS2.

- Tsuyoshi Morimoto:
Continuous speech recognition using a combination of syntactic constraints and dependency relationship.

- Roberto Pieraccini, Zakhar Gorelov, Esther Levin, Evelyne Tzoukermann:
Automatic learning in spoken language understanding.

Language Learning and Acquisition 1, 2
Speech Perception:
Units of Processing
Prosody:
The Phrase and Beyond 1, 2
- Agaath M. C. Sluijter, Vincent J. van Heuven, Anneke Neijt:
The influence of focus distribution and lexical stress on the temporal organisation of the syllable.

- Agaath M. C. Sluijter, Jacques M. B. Terken:
The development and perceptive evaluation of a model for paragraph intonation in dutch.

- Nobuyoshi Kaiki, Yoshinori Sagisaka:
Pause characteristics and local phrase-dependency structure in Japanese.

- Bernd Möbius, Matthias Pätzold:
F0 synthesis based on a quantitative model of German intonation.

- Kenneth N. Ross, Mari Ostendorf, Stefanie Shattuck-Hufnagel:
Factors affecting pitch accent placement.

- Marc Swerts, Ronald Geluykens, Jacques M. B. Terken:
Prosodic correlates of discourse units in spontaneous speech.

- Shin'ya Nakajima, James F. Allen:
Prosody as a cue for discourse structure.

- Barbara J. Grosz, Julia Hirschberg:
Some intonational characteristics of discourse structure.

- Hiroya Fujisaki, Keikichi Hirose, Haitao Lei:
Prosody and syntax in spoken sentences of standard Chinese.

- Kathleen Bishop:
Modeling sentential stress in the context of a large vocabulary continuous speech recognizer.

Speaker Adaptation
Pronunciation Training
- Jean-Paul Lefèvre, Mervyn A. Jack, Claudio Maggio, Mario Refice, Fabio Gabrieli, Michelina Savino, Luigi Santangelo:
An interactive system for automated pronunciation improvement.

- Edmund Rooney, Steven M. Hiller, John Laver, Mervyn A. Jack:
Prosodic features for automated pronunciation improvement in the spell system.

- Maria-Gabriella Di Benedetto, Fabrizio Carraro, Steven M. Hiller, Edmund Rooney:
Vowels pronunciation assessment in the spell system.

Self-Organizing Systems in ASR 1, 2
- Franck Poirier:
Self-organizing map with supervision for speech recognition.

- Gregory R. De Haan, Ömer Egecioglu:
Topology preservation for speech recognition.

- Gary Bradshaw, Alan Bell:
Towards the performance limits of connectionist feature detectors.

- Helge B. D. Sørensen:
Context-dependent and -independent self-structuring hidden control models for speech recognition.

- Marie-José Caraty, Claude Montacié, Claude Barras:
Integration of frequential and temporal structurations in a symbolic learning system.

- Enric Monte, José B. Mariño, Eduardo Lleida:
Smoothing hidden Markov models ay means of a self organizing feature map.

- Jyri Mäntysalo, Kari Torkkola, Teuvo Kohonen:
LVQ-based speech recognition with high-dimensional context vectors.

- Mikko Kurimo, Kari Torkkola:
Application of self-organizing maps and LVQ in training continuous density hidden Markov models for phonemes.

- Paul Dalsgaard, Ove Andersen:
Identification of mono- and poly-phonemes using acoustic-phonetic features derived by a self-organising neural network.

- Pekka Utela, Samuel Kaski, Kari Torkkola:
Using phoneme group specific LVQ-codebooks with HMMs.

Speech Synthesis 1-3
- Naoto Iwahashi, Yoshinori Sagisaka:
Speech segment network approach for an optimal synthesis unit set.

- Yoshinori Sagisaka, Nobuyoshi Kaiki, Naoto Iwahashi, Katsuhiko Mimura:
ATR μ-talk speech synthesis system.

- Bert Van Coile, Steven Leys, Luc Mortier:
On the development of a name pronunciation system.

- Inger Karlsson:
Consonants for female speech synthesis.

- Jan P. H. van Santen:
Diagnostic perceptual experiments for text-to-speech system evaluation.

- Marcello Balestri, Enzo Foti, Luciano Nebbia, Mario Oreglia, Pier Luigi Salza, Stefano Sandri:
Comparison of natural and synthetic speech intelligibility for a reverse telephone directory service.

- Richard Sproat, Julia Hirschberg, David Yarowsky:
A corpus-based synthesizer.

- Tomohisa Hirokawa, Kenzo Itoh, Hirokazu Sato:
High quality speech synthesis based on wavelet compilation of phoneme segments.

- David R. Williams, Corine A. Bickley, Kenneth N. Stevens:
Inventory of phonetic contrasts generated by high-level control of a formant synthesizer.

- Mikael Goldstein, Ove Till:
Is % overall error rate a valid measure of speech synthesiser and natural speech performance at the segmental level?

- Willy Jongenburger, Renée van Bezooijen:
Text-to-speech conversion for dutch: comprehensibility and acceptability.

- Masayo Katoh, Shin'ichiro Hashimoto:
The rhythm rules in Japanese based on the centers of energy gravity of vowels.

- Kenzo Itoh, Tomohisa Hirokawa, Hirokazu Sato:
Segmental power control for Japanese speech synthesis.

- Jean Schoentgen:
Glottal waveform synthesis with volterra shapers.

- Ken Ceder, Bertil Lyberg:
Yet another rule compiler for text-to-speech conversion?

- Kazuhiko Iwata, Yukio Mitome:
Prosody generation models constructed by considering speech tempo influence on prosody.

- Alex I. C. Monaghan:
Extracting microprosodic information from diphones - a simple way to model segmental effects on prosody for synthetic speech.

- Arjan van Hessen:
Generation of natural sounding speech stimuli by means of linear cepstral interpolation.

- W. Nick Campbell, Colin W. Wightman:
Prosodic encoding of syntactic structure for speech synthesis.

- Susan R. Hertz, Marie K. Huffman:
A nucleus-based timing model applied to multi-dialect speech synthesis by rule.

- Jill House, Nick J. Youd:
Evaluating the prosody of synthesized utterances within a dialogue system.

- Marcel Tatham, Eric Lewis:
Prosodics in a syllable-based text-to-speech synthesis system.

- Rabia Belrhali, Véronique Aubergé, Louis-Jean Boë:
From lexicon to rules: toward a descriptive method of French text-to-phonetics transcription.

- Marianne Elmlund, Ida Frehr, Niels Reinholt Petersen:
Formant transformation from male to female synthetic voices.

- Panagiotis A. Rentzepopoulos, George K. Kokkinakis:
Multilingual phoneme to grapheme conversion system based on HMM.

- Noriyo Hara, Hisayoshi Tsubaki, Hisashi Wakita:
Fundamental frequency control using linguistic information.

- Andrew P. Breen:
A comparison of statistical and rule based methods of determining segmental durations.

- J. R. Andrews, K. M. Curtis, Volker Kraft:
Generation and extraction of high quality synthesis units.

- T. I. Boogaart, Kim E. A. Silverman:
Evaluating the overall comprehensibility of speech synthesizers.

- Olivier Boëffard, Laurent Miclet, S. White:
Automatic generation of optimized unit dictionaries for text to speech synthesis.

- Hideki Kasuya, Seiki Kasuya:
Relationships between syllable, word and sentence intelligibilities of synthetic speech.

- David R. Hill, Craig-Richard Schock, Leonard C. Manzara:
Unrestricted text-to-speech revisited: rhythm and intonation.

- Anton J. Rozsypal:
Wavelet speech synthesizer in the classroom and speech laboratory.

- Thomas Portele, Birgit Steffan, Rainer Preuß, Walter F. Sendlmeier, Wolfgang Hess:
HADIFIX - a speech synthesis system for German.

- Cristina Delogu, Stella Conte, Andrea Paoloni, Ciro Sementina:
Two different methodologies for evaluating the comprehension of synthetic passages.

- Carlos Gussenhoven, Toni C. M. Rietveld:
A target-interpolation model for the intonation of dutch.

Speech Perception:
Phonetic Processes 1, 2
- Katharine Davis, Patricia K. Kuhl:
Best exemplars of English velar stops: a first report.

- Kenneth N. Stevens, Sharon Y. Manuel, Stefanie Shattuck-Hufnagel, Sharlene Liu:
Implementation of a model for lexical access based on features.

- Dieter Huber:
Perception of aperiodic speech signals.

- Hiroaki Kato, Minoru Tsuzaki, Yoshinori Sagisaka:
Acceptability and discrimination threshold for distortion of segmental duration in Japanese words.

- Anne Bonneau, Sylvie Coste, Linda Djezzar, Yves Laprie:
Two level acoustic cues for consistent stop identification.

- Rolf Carlson, James R. Glass:
Vowel classification based on analysis-by-synthesis.

- Maria-Gabriella Di Benedetto, Jean-Sylvain Liénard:
Extrinsic normalization of vowel formant values based on cardinal vowels mapping.

- Terrance M. Nearey:
Applications of generalized linear modeling to vowel data.

- David B. Pisoni:
Some comments on invariance, variability and perceptual normalization in speech perception.

- Stephen D. Goldinger, Thomas J. Palmeri, David B. Pisoni:
Words and voices: perceptual details are preserved in lexical representations.

Speech Enhancement
Speech Recognition 1, 2
- Yolande Anglade, Dominique Fohr, Jean-Claude Junqua:
Selectively trained neural networks for the discrimination of normal and lombard speech.

- Aaron E. Rosenberg, Joel DeLong, Chin-Hui Lee, Biing-Hwang Juang, Frank K. Soong:
The use of cohort normalized scores for speaker verification.

- Tomoko Matsui, Sadaoki Furui:
Speaker recognition using concatenated phoneme models.

- Younès Bennani:
Speaker identification through a modular connectionist architecture: evaluation on the timit database.

- Claude Montacié, Jean-Luc Le Floch:
AR-vector models for free-text speaker recognition.

- Florian Schiel:
Rapid non-supervised speaker adaptation of semicontinuous hidden Markov models.

- D. Ederveen, Louis Boves:
Rule-based recognition of phoneme classes.

- Jie Yi, Kei Miki:
A new method of speaker-independent speech recognition using multiphone HMM.

- Myoung-Wan Koo, Chong Kwan Un:
A speaker adaptation based on corrective training and learning vector quantization.

- Katsuhiko Shirai, Shigeki Okawa, Tetsunori Kobayashi:
Phoneme recognition in continuous speech based on mutual information considering phonemic duration and connectivity.

- Shinji Koga, Ryosuke Isotani, Satoshi Tsukada, Kazunaga Yoshida, Kaichiro Hatazaki, Takao Watanabe:
A real-time speaker-independent continuous speech recognition system based on demi-syllable units.

- Saeed Vaseghi, Ben P. Milner:
Speech recognition in noisy environments.

- Fergus R. McInnes:
An enhanced interpolation technique for context-specific probability estimation in speech and language modelling.

- Lorenzo Fissore, Pietro Laface, Giorgio Micca, G. Sperto:
Channel adaptation for a continuous speech recognizer.

- S. Cifuentes, José Colás, Mohammed H. Savoji, José Manuel Pardo:
A new algorithm for connected digit recognition.

- Günther Ruske, Bernd Plannerer, Tanja Schultz:
Stochastic modeling of syllable-based units for continuous speech recognition.

- David M. Goblirsch, Toffee A. Albina:
HARK: an experimental speech recognition system.

- Akito Nagai, Jun-ichi Takami, Shigeki Sagayama:
The SSS-LR continuous speech recognition system: integrating SSS-derived allophone models and a phoneme-context-dependent LR parser.

- Shinsuke Sakai, Michael S. Phillips:
J-SUMMIT: a Japanese segment-based speech recognition system.

- Shinobu Mizuta, Kunio Nakajima:
Optimal discriminative training for HMMs to recognize noisy speech.

- Shingo Kuroiwa, Kazuya Takeda, Fumihiro Yato, Seiichi Yamamoto, Kunihiko Owa, Makoto Shozakai, Ryuuji Matsumoto:
Architecture and algorithms of a real-time word recognizer for telephone input.

- Hiroyasu Kuwano, Kazuya Nomura, Atsushi Ookumo, Shoji Hiraoka, Taisuke Watanabe, Katsuyuki Niyada:
Speaker independent speech recognition method using word spotting technique and its application to VCR programming.

- S. Lennon, Eliathamby Ambikairajah:
Transputer implementation of front-end processors for speech recognition systems.

- Yasuhiro Minami, Tatsuo Matsuoka, Kiyohiro Shikano:
Phoneme HMM evaluation algorithm without phoneme labeling.

- Andreas Noll, Henning Bergmann, Hans-Hermann Hamer, Annedore Paeseler, Horst Tomaschewski:
Architecture of a configurable application interface for speech recognition systems.

- Mark A. Fanty, John Pochmara, Ronald A. Cole:
An interactive environment for speech recognition research.

- Yoshiharu Abe, Kunio Nakajima:
An approach to unlimited vocabulary continuous speech recognition based on context-dependent phoneme modeling.

- Chuck Wooters, Nelson Morgan:
Acoustic subword models in the berkeley restaurant project.

- Claus Nedergaard Jacobsen:
SIRtrain, an open standard environment for CHMM recognizer development.

- Yutaka Kobayashi, Yasuhisa Niimi:
Segmented trellis algorithms for the continuous speech recognition.

- Bo Xu, Z. W. Lin, Taiyi Huang, D. X. Xu, Y. Q. Gao:
A. 46 500 word Chinese speech recognition system.

- Dao Wen Chen:
Study of the time extension flat net for speech recognition.

Hidden Markov Models
- Frank Fallside:
A hidden Markov model structure for the acquisition of speech by machine, ASM.

- Yasuyuki Masai, Shin'ichi Tanaka, Tsuneo Nitta:
Speaker-independent keyword recognition based on SMQ/HMM.

- Régis Cardin, Diane Goupil, Roxane Lacouture, Evelyne Millien, Charles Snow, Yves Normandin:
CRIM's spontaneous speech recognition system for the ATIS task.

- Fabio Brugnara, Renato de Mori, Diego Giuliani, Maurizio Omologo:
Improved connected digit recognition using spectral variation functions.

- Andrew Tridgell, Bruce Millar, Kim-Anh Do:
Alternative preprocessing techniques for discrete hidden Markov model phoneme recognition.

Dialogue
Prosody and Phonology 1, 2
- Li-chiung Yang:
A semantic and pragmatic analysis of tone and intonation in Mandarin Chinese.

- Yoshimasa Tsukuma:
On prosodic features in speech - comparative studies between Japanese and standard Chinese.

- W. Nick Campbell:
Prosodic encoding of English speech.

- Gunnar Fant, Anita Kruckenberg, Lennart Nord:
Prediction of syllable duration, speech rate and tempo.

- Rolf Carlson, Björn Granström, Lennart Nord:
Experiments with emotive speech - acted utterances and synthesized replicas.

- J. Caspars, Vincent J. van Heuven:
Phonetic properties of dutch accent lending pitch movements under time pressure.

- Jacques M. B. Terken, Karin van den Hombergh:
Judgments of relative prominence for adjacent and non-adjacent accents.

- Frédéric Beaugendre, Christophe d'Alessandro, Anne Lacheret-Dujour, Jacques M. B. Terken:
A perceptual study of French intonation.

- Mark Liberman, J. Michael Schultz, Soonhyun Hong, Vincent Okeke:
The phonetics of IGBO tone.

- Stefanie Shattuck-Hufnagel:
Stress shift as pitch accent placement: within-word early accent placement in american English.

Speaking Styles 1, 2
ASR in Noise
- Claude Lefebvre, Dariusz A. Zwierzynski, David R. Starks, Gary Birch:
Further optimisation of a robust IMELDA speech recogniser for applications with severely degraded speech.

- Richard M. Stern, Fu-Hua Liu, Yoshiaki Ohshima, Thomas M. Sullivan, Alejandro Acero:
Multiple approaches to robust speech recognition.

- Tadashi Kitamura, Satoshi Ando, Etsuro Hayahara:
Speaker-independent spoken digit recognition in noisy environments using dynamic spectral features and neural networks.

- Douglas A. Cairns, John H. L. Hansen:
ICARUS: an mwave-based real-time speech recognition system in noise and lombard effect.

- Chafic Mokbel, L. Barbier, Y. Kerlou, Gérard Chollet:
Word recognition in the car: adapting recognizers to new environments.

Dialogues and Applications
Sampling of Speech Production 1, 2
Labelling of Speech 1, 2
- Alain Marchal, William J. Hardcastle, K. Nicolaidis, Noël Nguyen, Fiona Gibbon:
Non-linear annotation of multi-channel speech data.

- Shingo Fujiwara, Yasuhiro Komori, Masahide Sugiyama:
A phoneme labelling workbench using HMM and spectrogram reading knowledge.

- Michael S. Phillips, Victor Zue:
Automatic discovery of acoustic measurements for phonetic classification.

- Itou Katunobu, Hayamizu Satoru, Tanaka Hozumi:
Detection of unknown words and automatic estimation of their transcriptions in continuous speech recognition.

- Fabio Brugnara, Daniele Falavigna, Maurizio Omologo:
A HMM-based system for automatic segmentation and labeling of speech.

- Robert W. P. Luk, Robert I. Damper:
A modification of the viterbi algorithm for stochastic phonographic transduction.

- Paul C. Bagshaw, Briony J. Williams:
Criteria for labelling prosodic aspects of English speech.

- Yifan Gong, Jean Paul Haton:
DTW-based phonetic labeling using explicit phoneme duration constraints.

- Kim E. A. Silverman, Mary E. Beckman, John F. Pitrelli, Mari Ostendorf, Colin W. Wightman, Patti Price, Janet B. Pierrehumbert, Julia Hirschberg:
TOBI: a standard for labeling English prosody.

- Barbara Eisen, Hans-Günther Tillmann, Christoph Draxler:
Consistency of judgements in manual labelling of phonetic segments: the distinction between clear and unclear cases.

Speech Production:
Models and Theories 1, 2
- Gunnar Fant:
Vocal tract area functions of Swedish vowels and a new three-parameter model.

- Jean-Claude Junqua:
Acoustic and production pilot studies of speech vowels produced in noise.

- Yves Laprie, Marie-Odile Berger:
Active models for regularizing formant trajectories.

- René Carré, Samir Chennoukh, Mohamad Mrayati:
Vowel-consonant-vowel transitions: analysis, modeling, and synthesis.

- Maureen Stone, Subhash Lele:
Representing the tongue surface with curve fits.

- Katherine S. Harris, Eric Vatikiotis-Bateson, Peter J. Alfonso:
Muscle forces in vowel vocal tract formation.

- Makoto Hirayama, Eric Vatikiotis-Bateson, Mitsuo Kawato, Kiyoshi Honda:
Neural network modeling of speech motor control.

- Eric Vatikiotis-Bateson, Makoto Hirayama, Kiyoshi Honda, Mitsuo Kawato:
The articulatory dynamics of running speech: gestures from phonemes?

Phonetics
Speech Data Bases
Stochastic ASR
- Victor Abrash, Horacio Franco, Michael Cohen, Nelson Morgan, Yochai Konig:
Connectionist gender adaptation in a hybrid neural network / hidden Markov model speech recognition system.

- Michael Cohen, Horacio Franco, Nelson Morgan, David E. Rumelhart, Victor Abrash:
Hybrid neural network/hidden Markov model continuous-speech recognition.

- Gernot A. Fink, Franz Kummert, Gerhard Sagerer, Ernst Günter Schukat-Talamazzini, Heinrich Niemann:
Semantic hidden Markov networks.

Analysis of Disfluencies in Speech 1, 2
Linguistic Phonetics:
Reduction
Hearing/Speech Impaired 1, 2
- Christian Benoît, Tayeb Mohamadi:
The lip benefit: auditory and visual intelligibility of French speech in noise.

- Anne-Marie Öster:
Phonological assessment of deaf children's productive knowledge as a basis for speech-training.

- Hideaki Seki, Akiko Hayashi, Satoshi Imaizumi, Takehiko Harada, Hiroshi Hosoi:
Factors affecting voicing distinction of stops for the hearing impaired.

- Arthur Boothroyd, Robin S. Waldstein, Eddy Yeung:
Investigations into the auditory F0 speechreading enhancement effect using a sinusoidal replica of the F0 contour.

- Francesco Cutugno:
Some considerations on pitch and timing control in deaf children.

- Shari R. Baum:
Rate of speech effects in aphasia: an acoustic analysis of voice onset time.

- Parth M. Bhatt:
Fundamental frequency attributes following unilateral left or right temporal lobe lesion.

- Hiroshi Hosoi, Satoshi Imaizumi, Akiko Hayashi, Takehiko Harada, Hideaki Seki:
Cue extraction and integration in speech perception for the hearing impaired.

- Anna K. Nabelek:
The relationship between spectral details in naturally produced vowels and identification errors in noise and reverberation.

- Donald G. Jamieson, Leonard Cornelisse:
Speech processing effects on intelligibility for hearing-impaired listeners.

Unit-Based ASR
Language and Dialect Characterization
Speech Production, Parception, and Analysis
- B. Boyanov, Gérard Chollet:
Pathological voice analysis using cepstra, bispectra and group delay functions.

- Qianje Fu, Peyu Xia, Ren-Hua Wang:
Lateralization of speech sounds by binaural distributing processing.

- H. H. Rump:
Timing of pitch movements and perceived vowel duration.

- J. P. Liu, Geneviève Baudoin, Gérard Chollet:
Studies of glottal excitation and vocal tract parameters using inverse filtering and a parameterized input model.

- Dennis Norris, Brit van Ooyen, Anne Cutler:
Speeded detection of vowels and steady-state consonants.

- Elzbieta B. Slawinski:
Temporal factors in the perception of consonants for different age and hearing impairment groups.

- Abeer Alwan:
The role of F3 and F4 in identifying place of articulation for stop consonants.

- Thomas R. Sawallis:
A new measure for perceptual weight of acoustic cues: an experiment on voicing in French intervocalic [t, d].

- Alan Wrench, Mervyn A. Jack, John Laver, Mary S. Jackson, David S. Soutar, A. Gerry Robertson, Janet MacKenzie Beck:
Objective speech quality assessment in patients with intra-oral cancers: voiceless fricatives.

- Bruce Connell:
Tongue contact, active articulators, and coarticulation.

- Makio Kashino, Astrid van Wieringen, Louis C. W. Pols:
Cross-languages differences in the identification of intervocalic stop consonants by Japanese and dutch listeners.

- Minoru Tsuzaki:
Effects of typicality and interstimulus interval on the discrimination of speech stimuli: within-subject comparison.

- Ronald A. Cole, Yeshwant K. Muthusamy:
Perceptual studies on vowels excised from continuous speech.

- Raymond S. Weitzman:
The relative perceptual salience of spectral and durational differences.

- Florien J. Koopmans-van Beinum:
Can 'level words' from one speaking style become teaks' when spliced into another speaking style?

- Beverley Gable, Helen Nemeth, Martin Haran:
Speech errors and task demand.

- John H. Esling, B. Craig Dickson, Roy C. Snell:
Analysis of phonation type using laryngographic techniques.

- Sumi Shigeno:
Effect of prototypes of vowels on speech perception in Japanese and English.

- Tomo-o Morohashi, Tetsuya Shimamura, Hiroyuki Yashima, Jouji Suzuki:
Characteristics of voice picked up from outer skin of larynx.

- Igor V. Nabelek:
Coding of voicing in whispered plosives.

- Margaret F. Cheesman, Shelly Lawrence, Allison Appleyard:
Performance on a nonsense syllable test using the articulation index.

- Donald G. Jamieson, Ketan Ramji, Issam Kheirallah, Terrance M. Nearey:
CSRE: a speech research environment.

Linguistic Phonetics
- Kazue Hata, Yoko Hasegawa:
A study of F0 reset in naturally-read utterances in Japanese.

- H. Samuel Wang, Fu-Dong Chiu:
On the nature of tone sandhi rules in taiwanese.

- Geoffrey S. Nathan:
How shallow is phonology: declarative phonologies meet fast speech.

- Junko Hosaka, Toshiyuki Takezawa, Noriyoshi Uratani:
Analyzing postposition drops in spoken Japanese.

- Jialu Zhang, Xinghui Hu:
Fundamental frequency patterns of Chinese in different speech modes.

- Knut Kvale, Arne Kjell Foldvik:
The multifarious r-sound.

- Zita McRobbie-Utasi:
The role of preaspiration duration in the voicing contrast in skolt sami.

- Eiji Yamada:
Parameter setting for abstract stress in tokyo Japanese.

- Georg Ottesen:
A method for studying prosody in texts read aloud.

- Vincent J. van Heuven:
Linguistic versus phonetic explanation of consonant lengthening after short vowels: a contrastive study of dutch and English.

- Kjell Elenius, Mats Blomberg:
Comparing phoneme and feature based speech recognition using artificial neural networks.

- Eva Strangert:
Prosodic cues to the perception of syntactic boundaries.

- Paul Taylor, Stephen Isard:
A new model of intonation for use with speech synthesis and recognition.

- Rudolf Weiss:
Computerized error detection/correction in teaching German sounds: some problems and solutions.

- Ahmed M. Elgendy:
Velum and epiglottis behavior during the production of Arabic pharyngeals and laryngeals: a fiberscopic study.

- Kim E. A. Silverman, Eleonora Blaauw, Judith Spitz, John F. Pitrelli:
A prosodic comparison of spontaneous speech and read speech.

- John J. Ohala, Maria Grazia Busa, Karen Harrison:
Phonological and psychological evidence that listeners normalize the speech signal.

- Elizabeth A. Hinkelman:
Intonation and the request/question distinction.

- Robert F. Port, Fred Cummins:
The English voicing contrast as velocity perturbation.

- Michael S. Ziolkowski, Mayumi Usami, Karen L. Landahl, Brenda K. Tunnock:
How many phonologies are there in one speaker? some experimental evidence.

- Hirokazu Sato:
Decomposition into syllable complexes and the accenting of Japanese loanwords.

- Jianfen Cao:
Temporal structure in bisyllabic word frame: an evidence for relational invariance and variability from standard Chinese.

- Shih-ping Wang:
The integration of phonetics and phonology: a case study of taiwanese "gemination" and syllable structure.

Perception and Production
- James H. Bradford:
Towards a robust speech interface for teleoperation systems.

- Piero Cosi, Paolo Frasconi, Marco Gori, N. Griggio:
Phonetic recognition experiments with recurrent neural networks.

- Mikael Goldstein, Björn Lindström, Ove Till:
Some aspects on context and response range effects when assessing naturalness of Swedish sentences generated by 4 synthesiser systems.

- Marcello Pelillo, Franca Moro, Mario Refice:
Probabilistic prediction of parts-of-speech from word spelling using decision trees.

- Dieter Barschdorff, Ulrich Gärtner:
Single word detection system with a neural classifier for recognizing speech at variable levels of background noise.

- Sharon L. Oviatt, Philip R. Cohen, Martin Fong, Michael Frank:
A rapid semi-automatic simulation technique for investigating interactive speech and handwriting.

- Sang-Hwa Chung, Dan I. Moldovan:
Speech understanding on a massively parallel computer.

- Chan-Do Lee:
Rationale for "performance phonology".

- Takuya Koizumi, Jyoji Urata, Shuji Taniguchi:
The effect of information feedback on the performance of a phoneme recognizer using kohonen map.

- Yasuharu Asano, Keikichi Hirose, Hiroya Fujisaki:
A method of dialogue management for the speech response system.

- Yumi Takizawa, Eiichi Tsuboka:
Syllable duration prediction for speech recognition.

- F. Canavesio, Giuseppe Castagneri, Giuseppe Di Fabbrizio, Francesco Senia:
Comparison between two methodologies of testing isolated word speech recognizers.

- He Jun, Henri Leich:
Extracting fuzzy features from MLP for recognition of speech.

- Keiji Fukuzawa, Yoshinaga Kato, Masahide Sugiyama:
A fuzzy partition model (FPM) neural network architecture for speaker-independent continuous speech recognition.

- A. Ennaji, Jean Rouat:
Conception of speech filters based on a neural network.

- Hong-Kwang Jeff Kuo, Chin-Hui Lee, Aaron E. Rosenberg:
Speaker set identification through speaker group modeling.

- Stephen Springer, Sara Basson, Judith Spitz:
Identification of principal ergonomic requirements for interactive spoken language systems.

- Thomas E. Jacobs, Eric R. Buhrke:
Performance of the united kingdom intelligent network automatic speech recognition system.

- Guy Deville, Pierre Mousel:
Evaluation of parsing strategies in natural language spoken man-machine dialogue.

- Yasuhisa Niimi, Yutaka Kobayashi:
An information retrieval system with a speech interface.

- J. P. Eatock, J. S. D. Mason:
Phoneme performance in speaker recognition.

- Evelyne Tzoukermann, Roberto Pieraccini, Zakhar Gorelov:
Natural language processing in the chronus system.

- Dominique Francois, Dominique Fohr:
Contribution of neural networks for phoneme identification in the APHODEX expert system.

- Douglas B. Paul:
A CSR-NL interface architecture.

- Pierre Lefebvre, Frank Poirier, G. Duncan:
Speech interface for a man-machine dialog with the unix operating system.

- P. Bardaud, François Capman, Chafic Mokbel, Chakib Tadj, Gérard Chollet:
Transformation of databases for the evaluation of speech recognizers.

- Yoichi Yamashita, Riichiro Mizoguchi:
Dialog management for speech output from concept representation.

- Seiichiro Hangai, Shigetoshi Sugiyama, Kazuhiro Miyauchi:
Speaker verification using locations and sizes of multipulses on neural networks.

- Carlos Teixeira, Isabel Trancoso:
Word rejection using multiple sink models.

- Boerge Lindberg:
Verification of language specific performance factors from recogniser testing on EUROM.1 CVC material.

- Alain Cozannet:
Modeling task driven oral dialogue.

- Wei-ying Li, Kechu Yi, Zheng Hu:
Introducing neural predictor to hidden Markov model for speech recognition.

- Feng Liu, Jianxin Jiang, Jun Cheng, Kechu Yi:
A neural network based on subnets - SNN.

Analysis/Systems
- Ute Ziegenhain:
Syntactic anaphora resolution in a speech understanding system.

- Marion Mast, Ralf Kompe, Franz Kummert, Heinrich Niemann, Elmar Nöth:
The dialog module of the speech recognition and dialog system EVAR.

- Yan Ming Cheng, Douglas D. O'Shaughnessy, Paul Mermelstein:
Statistical recovery of wideband speech from narrowband speech.

- Henk van den Heuvel, Toni C. M. Rietveld:
Speaker related variability in cepstral representations of dutch speech segments.

- Per Rosenbeck, Bo Baungaard:
Experiences from a real-world telephone application: teledialogue.

- K. Y. Lee, P. Ha, J. Rheem, S. Ann, I. Song:
Robust estimation of time-varying LP parameters on speech.

- Javier Hernando, Climent Nadeu, Eduardo Lleida:
On the AR modelling of the one-sided autocorrelation sequence for noisy speech recognition.

- Hiroshi Shimodaira, Mitsuru Nakai:
Robust pitch detection by narrow band spectrum analysis.

- S. Eady, B. Craig Dickson, Roy C. Snell, J. Woolsey, P. Ollek, A. Wynrib, J. Clayards:
A microcomputer-based system for real-time analysis and display of laryngograph signals.

- N. M. Veilleux, Mari Ostendorf, Colin W. Wightman:
Parse scoring with prosodic information.

- Ying Cheng, Paul Fortier, Yves Normandin:
Topic identification using a neural network with a keyword-spotting preprocessor.

- Shane Switzer, Tim Anderson, Matthew Kabrisky, Steven K. Rogers, Bruce W. Suter:
Frequency domain speech coding.

- Raymond Descout, Robert Bergeron, Bernard Meriald:
MEDIATEX-TASF: a closed captioning real-time service in French.

- S. A. Wilde, K. M. Curtis:
The wavelet transform for speech analysis.

- Pablo Aibar, Andrés Marzal, Enrique Vidal, Francisco Casacuberta:
Problems and algorithms in optimal linguistic decoding: a unified formulation.

- Jean Rouat, Sylvain Lemieux, Alain Migneault:
A spectro-temporal analysis of speech based on nonlinear operators.

- Miguel A. Berrojo, Javier Corrales, Jesus Macias, Santiago Aguilera:
A PC graphic tool for speech research based on a DSP board.

- Satoru Hayamizu, Katunobu Itou, Masafumi Tamoto, Kazuyo Tanaka:
A spoken language dialogue system for automatic collection of spontaneous speech.

- Shingo Nishioka, Yoichi Yamashita, Riichiro Mizoguchi:
A powerful disambiguating mechanism for speech understanding systems based on ATMs.

- Najib Naja, Jean-Marc Boucher, Samir Saoudi:
A mixed Gaussian-stochastic code book for CELP coder in LSP speech coding.

- Hiroyuki Kamata, Yoshihisa Ishida:
A method to estimate the transfer function of ARMA model of speech wave using prony method and homomorphic analysis.

- Børge Lindberg, Bjarne Andersen, Anders Baekgaard, Tom Brøndsted, Paul Dalsgaard, Jan Kristiansen:
An integrated dialogue design and continuous speech recognition system environment.

- Alain Marchal, Christine Meunier, P. Gavarry:
The PSH/DISPE helium speech cdrom.

Last update Mon May 20 23:49:52 2013
CET by the DBLP Team —
Data released under the ODC-BY 1.0 license — See also our legal information page