Volume 17,
Number 1,
January 2009
- Serdar Yildirim, Shrikanth Narayanan:
Automatic Detection of Disfluency Boundaries in Spontaneous Speech of Children Using Audio-Visual Information.
2-12
- Abhinav Sethy, Panayiotis G. Georgiou, Bhuvana Ramabhadran, Shrikanth S. Narayanan:
An Iterative Relative Entropy Minimization-Based Data Selection Approach for n-Gram Model Adaptation.
13-23
- Jiucang Hao, Hagai Attias, Srikantan S. Nagarajan, Te-Won Lee, Terrence J. Sejnowski:
Speech Enhancement, Gain, and Noise Spectrum Adaptation Using Approximate Bayesian Estimation.
24-37
- Simon Doclo, Marc Moonen, Tim Van den Bogaert, Jan Wouters:
Reduced-Bandwidth and Distributed MWF-Based Noise Reduction Algorithms for Binaural Hearing Aids.
38-51
- Y. Nagata, S. Iwasaki, T. Hariyama, T. Fujioka, T. Obara, T. Wakatake, M. Abe:
Binaural Localization Based on Weighted Wiener Gain Improved by Incremental Source Attenuation.
52-65
- J. Yamagishi, T. Kobayashi, Y. Nakano, K. Ogata, J. Isogai:
Analysis of Speaker Adaptation Algorithms for HMM-Based Speech Synthesis and a Constrained SMAPLR Adaptation Algorithm.
66-83
- Shih-Hsiang Lin, Berlin Chen, Yao-Ming Yeh:
Exploring the Use of Speech Features and Their Corresponding Distribution Characteristics for Robust Speech Recognition.
84-94
- Yi-Ting Chen, Berlin Chen, Hsin-Min Wang:
A Probabilistic Generative Framework for Extractive Broadcast News Speech Summarization.
95-106
- Y. J. Wu, T. D. Abhayapala:
Theory and Design of Soundfield Reproduction Using Continuous Loudspeaker Concept.
107-116
- Radoslaw Mazur, Alfred Mertins:
An Approach for Solving the Permutation Problem of Convolutive Blind Source Separation Based on Statistical Signal Models.
117-126
- Chung-Hsien Wu, Chung-Han Lee, Chung-Hau Liang:
Idiolect Extraction and Generation for Personalized Speaking Style Modeling.
127-137
- S. Ananthakrishnan, S. Narayanan:
Unsupervised Adaptation of Categorical Prosody Models for Prosody Labeling and Speech Recognition.
138-149
- R. M. M. Derkx, K. Janse:
Theoretical Analysis of a First-Order Azimuth-Steerable Superdirective Microphone Array.
150-162
- Ebru Arisoy, Murat Saraclar:
Lattice Extension and Vocabulary Adaptation for Turkish LVCSR.
163-173
- Cyril Joder, Slim Essid, Gaël Richard:
Temporal Integration for Audio Classification With Application to Musical Instrument Classification.
174-186
- Man-Hung Siu, Xi Yang, Herbert Gish:
Discriminatively Trained GMMs for Language Classification Using Boosting Methods.
187-197
Volume 17,
Number 2,
February 2009
- Chang-Wen Hsu, Lin-Shan Lee:
Higher Order Cepstral Moment Normalization for Improved Robust Speech Recognition.
205-220
- Rade Kutil:
Optimized Sinusoid Synthesis via Inverse Truncated Fourier Transform.
221-230
- T. Yoshioka, T. Nakatani, M. Miyoshi:
Integrated Speech Enhancement Method Using Noise Suppression and Dereverberation.
231-246
- G. Wersenyi:
Effect of Emulated Head-Tracking for Reducing Localization Errors in Virtual Audio Simulation.
247-252
- P. Krishnamoorthy, S. Prasanna:
Reverberant Speech Enhancement by Temporal and Spectral Processing.
253-266
- Jen-Tzung Chien, Meng-Sung Wu:
Minimum Rank Error Language Modeling.
267-276
- P. C. Pandey, M. S. Shah:
Estimation of Place of Articulation During Stop Closures of Vowel-Consonant-Vowel Utterances.
277-286
- George Almpanidis, Margarita Kotti, Constantine Kotropoulos:
Robust Detection of Phone Boundaries Using Model Selection Criteria With Few Observations.
287-298
- Leandro E. Di Persia, Diego H. Milone, Masuzo Yanagida:
Indeterminacy Free Frequency-Domain Blind Separation of Reverberant Audio Sources.
299-311
- Matthias Wölfel:
Enhanced Speech Features by Single-Channel Joint Compensation of Noise and Reverberation.
312-323
- Marc Delcroix, Tomohiro Nakatani, S. Watanabe:
Static and Dynamic Variance Compensation for Recognition of Reverberant Speech With Dereverberation Preprocessing.
324-334
- François Pachet, Pierre Roy:
Improving Multilabel Analysis of Music Titles: A Large-Scale Validation of the Correction Approach.
335-343
- R. Saeidi, H. R. S. Mohammadi, T. Ganchev, R. D. Rodman:
Particle Swarm Optimization for Sorted Adapted Gaussian Mixture Models.
344-353
- Yasser Hifny, Steve Renals:
Speech Recognition Using Augmented Conditional Random Fields.
354-365
- J. Hansen, V. Varadarajan:
Analysis and Compensation of Lombard Speech Across Noise Type and Levels With Application to In-Set/Out-of-Set Speaker Recognition.
366-378
- S. Subasingha, M. N. Murthi, Søren Vang Andersen:
Gaussian Mixture Kalman Predictive Coding of Line Spectral Frequencies.
379-391
- Jacek Dmochowski, Jacob Benesty, Sofiène Affes:
An Information-Theoretic Viewof ArrayProcessing.
392-401
Volume 17,
Number 3,
March 2009
- A. Katsamanis, George Papandreou, Petros Maragos:
Face Active Appearance Modeling and Speech Acoustic Information to Recover Articulation.
411-422
- George Papandreou, A. Katsamanis, V. Pitsikalis, Petros Maragos:
Adaptive Multimodal Fusion by Uncertainty Compensation With Application to Audiovisual Speech Recognition.
423-435
- E. Sanchez-Soto, A. Potamianos, K. Daoudi:
Unsupervised Stream-Weights Computation in Classification and Recognition Tasks.
436-445
- Jon Barker, Xu Shao:
Energetic and Informational Masking Effects in an Audiovisual Speech Recognition System.
446-458
- Javier Melenchon, Elisa Martinez, Fernando De La Torre, José A. Montero:
Emphatic Visual Speech Synthesis.
459-468
- Jianhua Tao, Le Xin, Panrong Yin:
Realistic Visual Speech Synthesis Based on Hybrid Concatenation Method.
469-477
- Peng Liu, Frank K. Soong:
Graph-Based Partial Hypothesis Fusion for Pen-Aided Speech Input.
478-485
- Pui-Yu Hui, Helen M. Meng:
Cross-Modality Semantic Integration With Hypothesis Rescoring for Robust Interpretation of Multimodal User Interactions.
486-500
- Dinesh Babu Jayagopi, Hayley Hung, Chuohao Yeo, Daniel Gatica-Perez:
Modeling Dominance in Group Conversations Using Nonverbal Activity Cues.
501-513
Volume 17,
Number 4,
May 2009
- A. Homayoun Kamkar-Parsi, Martin Bouchard:
Improved Noise Power Spectrum Density Estimation for Binaural Hearing Aids Operating in a Diffuse Noise Field Environment.
521-533
- Keisuke Kinoshita, Marc Delcroix, Tomohiro Nakatani, Masato Miyoshi:
Suppression of Late Reverberation Effect on Speech Signal Using Long-Term Multiple-step Linear Prediction.
534-545
- Ronen Talmon, Israel Cohen, Sharon Gannot:
Relative Transfer Function Identification Using Convolutive Transfer Function Approximation.
546-555
- S. R. Mahadeva Prasanna, B. V. Sandeep Reddy, P. Krishnamoorthy:
Vowel Onset Point Detection Using Source, Spectral Peaks, and Modulation Spectrum Energies.
556-565
- Liang Wang, Woon-Seng Gan:
Convergence Analysis of Narrowband Active Noise Equalizer System Under Imperfect Secondary Path Estimation.
566-571
- Leonardo Rey Vega, Hernan Rey, Jacob Benesty, Sara Tressens:
A Family of Robust Algorithms Exploiting Sparsity in Adaptive Filters.
572-581
- Carlos Busso, Sungbok Lee, Shrikanth Narayanan:
Analysis of Emotionally Salient Aspects of Fundamental Frequency for Emotion Detection.
582-596
- M. Kuster:
Multichannel Room Impulse Response Generation With Coherence Control.
597-606
- S. Sen, Arye Nehorai:
Performance Analysis of 3-D Direction Estimation Based on Head-Related Transfer Function.
607-613
- B. Yegnanarayana, K. S. R. Murty:
Event-Based Instantaneous Fundamental Frequency Estimation From Speech Signals.
614-624
- Zhaozhang Jin, DeLiang Wang:
A Supervised Learning Approach to Monaural Segregation of Reverberant Speech.
625-638
- Hiroko Kato Solvang, Y. Nagahara, Shoko Araki, Hiroshi Sawada, Shoji Makino:
Frequency-Domain Pearson Distribution Approach for Independent Component Analysis (FD-Pearson-ICA) in Blind Source Separation.
639-649
- Yu Takahashi, Tomoya Takatani, Keiichi Osako, Hiroshi Saruwatari, Kiyohiro Shikano:
Blind Spatial Subtraction Array for Speech Enhancement in Noisy Environment.
650-664
- Huawei Chen, Wee Ser:
Design of Robust Broadband Beamformers With Passband Shaping Characteristics Using Tikhonov Regularization.
665-681
- Jwu-Sheng Hu, Wei-Han Liu:
Location Classification of Nonstationary Sound Sources Using Binaural Room Distribution Patterns.
682-692
- Jesper Højvang Jensen, Mads Græsbøll Christensen, D. P. W. Ellis, Søren Holdt Jensen:
Quantitative Analysis of a Common Audio Similarity Measure.
693-703
- Hong Kook Kim, R. C. Rose:
Cepstrum-Domain Model Combination Based on Decomposition of Speech and Noise Using MMSE-LSA for ASR in Noisy Environments.
704-713
- Kai Yu, M. J. F. Gales, Philip C. Woodland:
Unsupervised Adaptation With Discriminative Mapping Transforms.
714-723
- Teemu Hirsimäki, Janne Pylkkönen, Mikko Kurimo:
Importance of High-Order N-Gram Models in Morph-Based Speech Recognition.
724-732
- Jost Schatzmann, S. Young:
The Hidden Agenda User Simulation Model.
733-747
- C. Longworth, M. J. F. Gales:
Combining Derivative and Parametric Kernels for Speaker Verification.
748-757
- Nicolás Morales, Doroteo T. Toledano, John H. L. Hansen, Javier Garrido:
Feature Compensation Techniques for ASR on Band-Limited Speech.
758-774
- Y. Agiomyrgiannakis, Yannis Stylianou:
Wrapped Gaussian Mixture Models for Modeling and High-Rate Quantization of Phase Data of Speech.
775-786
- Jingdong Chen, Jacob Benesty, Yiteng Huang:
Study of the Noise-Reduction Problem in the Karhunen-LoÈve Expansion Domain.
787-802
- Klaus Macherey, Oliver Bender, Hermann Ney:
Applications of Statistical Machine Translation Approaches to Spoken Language Understanding.
803-818
- Wen Zhang, Rodney A. Kennedy, Thushara D. Abhayapala:
Efficient Continuous HRTF Model Using Data Independent Basis Functions: Experimentally Guided Approach.
819-829
- Malay Gupta, Scott C. Douglas:
A Spatio-Temporal Speech Enhancement Technique Based on Generalized Eigenvalue Decomposition.
830-839
- Pavel Ircing, Josef V. Psutka, Josef Psutka:
Using Morphological Information for Robust Language Modeling in Czech ASR System.
840-847
- V. R. Apsingekar, P. L. De Leon:
Speaker Model Clustering for Efficient Speaker Identification in Large Population Applications.
848-853
Copyright © Tue Nov 10 01:04:29 2009
by Michael Ley (ley@uni-trier.de)