Paul Dalsgaard, Børge Lindberg, Henrik Benner, Zheng-Hua Tan (Eds.): EUROSPEECH 2001 Scandinavia, 7th European Conference on Speech Communication and Technology, 2nd INTERSPEECH Event, Aalborg, Denmark, September 3-7, 2001. ISCA 2001
What do Industry and Universities Expect from Each Other? (Special Session)
Ilkka Niiniluoto: Universities and industry: marriage or co-operation between independent partners? 11-12
Yrjö Neuvo: Considerations on what industry expects from universities. 13-14
Gary W. Strong: A perspective on industry/university relationships in the US. 15-16
Khalid Choukri: ELRA contribution to bridge the gap between industry and academia. 17-18
Ming-Yi Tsai, Fu-Chiang Chou, Lin-Shan Lee: Pronunciation variation analysis with respect to various linguistic levels and contextual conditions for Mandarin Chinese. 1445-1448
Bojan Kotnik, Zdravko Kacic, Bogomir Horvat: A multiconditional robust front-end feature extraction with a noise reduction procedure based on improved spectral subtraction algorithm. 197-200
Jon Barker, Martin Cooke, Phil D. Green: Robust ASR based on clean speech models: an evaluation of missing data techniques for connected digit recognition in noise. 213-217
Mirjam Sepesy Maucec, Zdravko Kacic: Topic detection for language model adaptation of highly-inflected languages by using a fuzzy comparison function. 243-246
Martin Jansche: Information extraction via heuristics for a movie showtime query system. 459-462
Speech Perception:
Recognition and Intelligibility
Takashi Otake, Anne Cutler: Recognition of (almost) spoken words: evidence from word play in Japanese. 465-468
Vincent Colotte, Yves Laprie, Anne Bonneau: Perceptual experiments on enhanced and slowed down speech sentences for second language acquisition. 469-473
Steven Greenberg, Takayuki Arai: The relation between speech intelligibility and the complex modulation spectrum. 473-476
Olivier Crouzet, William A. Ainsworth: Envelope information in speech processing: acoustic-phonetic analysis vs. auditory figure-ground segregation. 477-480
Patti Adank, Roeland Van Hout, Roel Smits: A comparison between human vowel normalization strategies and acoustic vowel transformation techniques. 481-484
Olga van Herwijnen, Jacques M. B. Terken: Evaluation of PROS-3 for the assignment of prosodic structure, compared to assignment by human experts. 529-532
Yoichi Yamashita, Tomoyoshi Ishida: Stochastic F0 contour model based on the clustering of F0 shapes of a syntactic unit. 533-536
Xuejing Sun, Ted H. Applebaum: Intonational phrase break prediction using decision tree and n-gram model. 537-540
Peter Jancovic, Ji Ming: A multi-band approach based on the probabilistic union model and frequency-filtering features for robust speech recognition. 579-582
Liang Gu, Kenneth Rose: Split-band perceptual harmonic cepstral coefficients as acoustic features for speech recognition. 583-586
Astrid Hagen, Hervé Bourlard: Error correcting posterior combination for robust multi-band speech recognition. 587-590
Bojana Gajic, Kuldip K. Paliwal: Robust parameters for speech recognition based on subband spectral centroid histograms. 591-594
William H. Edmondson, Li Zhang: Pseudo-articulatory representations and the recognition of syllable patterns in speech. 595-598
Jeff Z. Ma, Li Deng: Efficient decoding strategy for conversational speech recognition using state-space models for vocal-tract-resonance dynamics. 603-606
Eric Zee, Wai-Sum Lee: An acoustical analysis of the vowels in beijing Mandarin. 643-646
Véronique Delvaux, Alain Soquet: Discriminant analysis of nasal vs. oral vowels in French: comparison between different parametric representations. 647-650
Johan Frid: Prediction of intonation patterns of accented words in a corpus of read Swedish news through pitch contour stylization. 915-918
Paavo Alku, Juha Vintturi, Erkki Vilkman: The use of fundamental frequency raising as a strategy for increasing vocal intensity in soft, normal, and loud phonation. 919-922
Tomoyosi Akiba, Katunobu Itou: A structured statistical language model conditioned by arbitrarily abstracted grammatical categories based on GLR parsing. 705-708
Melissa Barkat, Ioana Vasilescu: From perceptual designs to linguistic typology and automatic language identification : overview and perspectives. 1065-1068
Susan Fitt: Morphological approaches for an English pronunciation lexicon. 1069-1072
Wei-Ho Tsai, Wen-Whei Chang, Chao-Shih Huang: Explicit exploitation of stochastic characteristics of test utterance for text-independent speaker identification. 771-774
Hyunsong Chung, Mark Huckvale: Linguistic factors affecting timing in Korean with application to speech synthesis. 815-818
Felix Schaeffler: Measuring rhythmic deviation in second language speech. 819-822
Ian Maddieson: Good timing: place-dependent voice onset time in ejective stops. 823-826
Speech Synthesis:
Concatenation
Hélène François, Olivier Boëffard: Design of an optimal continuous speech database for text-to-speech synthesis considered as a set covering problem. 829-832
Christos Vosnidis, Vassilios Digalakis: Use of clustering information for coarticulation compensation in speech synthesis by word concatenation. 833-836
Nicholas W. D. Evans, John S. D. Mason: Noise estimation without explicit speech, non-speech detection: a comparison of mean, modal and median based approaches. 893-896
Rathi Chengalvarayan: Evaluation of front-end features and noise compensation methods for robust Mandarin speech recognition. 897-900
Ikuyo Masuda-Katsuse: A new method for speech recognition in the presence of non-stationary, unpredictable and high-level noise. 1119-1122
Bojan Kotnik, Zdravko Kacic, Bogomir Horvat: A computational efficient real time noise robust speech recognition based on improved spectral subtraction method. 1123-1126
Damjan Vlaj, Zdravko Kacic, Bogomir Horvat: The use of noisy frame elimination and frequency spectrum magnitude reduction in noise robust speech recognition. 1127-1130
Jen-Tzung Chien: Combined linear regression adaptation and Bayesian predictive classification for robust speech recognition. 1131-1134
Florian Hilger, Hermann Ney: Quantile based histogram equalization for noise robust speech recognition. 1135-1138
Philippe Daubias, Paul Deléglise: Evaluation of an automatically obtained shape and appearance model for automatic audio visual speech recognition. 1031-1034