Volume 16,
Number 1,
January 2008
- Julio Vargas, Steve McLaughlin:
Cascade Prediction Filters With Adaptive Zeros to Track the Time-Varying Resonances of the Vocal Tract.
1-7
- J. Tepperman, S. Narayanan:
Using Articulatory Representations to Detect Segmental Errors in Nonnative Pronunciation.
8-22
- Ian Vince McLoughlin:
Subjective Intelligibility Testing of Chinese Speech.
23-33
- N. Malyska, T. F. Quatieri:
Spectral Representations of Nonmodal Phonation.
34-46
- Carlos Toshinori Ishi, K.-I. Sakakibara, Hiroshi Ishiguro, Norihiro Hagita:
A Method for Automatic Detection of Vocal Fry.
47-56
- V. Grancharov, Jan H. Plasberg, J. Samuelsson, W. Bastiaan Kleijn:
Generalized Postfilter for Speech Quality Enhancement.
57-64
- L. A. Ekman, W. Bastiaan Kleijn, M. N. Murthi:
Regularized Linear Prediction of Speech.
65-73
- Jerome R. Bellegarda:
Unit-Centric Feature Mapping for Inventory Pruning in Unit Selection Text-to-Speech Synthesis.
74-82
- Gerard Hotho, Lars F. Villemoes, Jeroen Breebaart:
A Backward-Compatible Multichannel Audio Codec.
83-93
- Te Li, Susanto Rahardja, Soo Ngee Koh:
Frequency Region-Based Prioritized Bit-Plane Coding for Scalable Audio.
94-105
- S. Grofit, Y. Lavner:
Time-Scale Modification of Audio Signals Using Enhanced WSOLA With Management of Transients.
106-115
- Pierre Leveau, E. Vincent, G. Richard, Laurent Daudet:
Instrument-Specific Harmonic Atoms for Mid-Level Music Representation.
116-128
- C. D. Creusere, K. D. Kallakuri, R. Vanam:
An Objective Metric of Human Subjective Audio Quality Optimized for a Wide Range of Audio Fidelities.
129-136
- Wei Chu, B. Champagne:
A Noise-Robust FFT-Based Auditory Spectrum With Application in Audio Classification.
137-150
- Heidi Christensen, Yoshihiko Gotoh, Steve Renals:
A Cascaded Broadcast News Highlighter.
151-161
- Y. Avargel, I. Cohen:
Adaptive System Identification in the Short-Time Fourier Transform Domain Using Cross-Multiplicative Transfer Function Approximation.
162-173
- Cédric Févotte, Bruno Torrésani, Laurent Daudet, Simon J. Godsill:
Sparse Linear Regression With Structured Priors and Application to Denoising of Musical Audio.
174-185
- A. S. Park, J. R. Glass:
Unsupervised Pattern Discovery in Speech.
186-197
- Jen-Tzung Chien, Meng-Sung Wu:
Adaptive Bayesian Latent Semantic Analysis.
198-207
- Imed Zitouni:
Constrained Minimization and Discriminative Training for Natural Language Call Routing.
208-215
- S. Ananthakrishnan, Shrikanth S. Narayanan:
Automatic Prosodic Event Detection Using Acoustic, Lexical, and Syntactic Evidence.
216-228
- Yi Hu, Philipos C. Loizou:
Evaluation of Objective Quality Measures for Speech Enhancement.
229-238
- Jen-Tzung Chien, Chuan-Wei Ting:
Factor Analyzed Subspace Modeling and Selection.
239-248
Volume 16,
Number 2,
February 2008
- Anssi Klapuri:
Multipitch Analysis of Polyphonic Music and Speech Signals Using an Auditory Model.
255-266
- M. R. Every:
Discriminating Between Pitched Sources in Music Audio.
267-277
- Mathieu Lagrange, Luis Gustavo Martins, Jennifer Murdoch, George Tzanetakis:
Normalized Cuts for Predominant Melodic Source Separation.
278-290
- Kyogu Lee, Malcolm Slaney:
Acoustic Chord Transcription and Key Extraction From Audio Using Key-Dependent HMMs Trained on Synthesized Audio.
291-301
- Peter Jan O. Doets, Reginald L. Lagendijk:
Distortion Estimation in Compressed Music Using Only Audio Fingerprints.
302-317
- M. Levy, M. Sandler:
Structural Segmentation of Musical Audio by Constrained Clustering.
318-326
- Shlomo Dubnov:
Unified View of Prediction and Repetition Structure in Audio Signals With Application to Interest Point Detection.
327-337
- Min-Yen Kan, Ye Wang, Denny Iskandar, Tin Lay Nwe, Arun Shenoy:
LyricAlly: Automatic Synchronization of Textual Lyrics to Acoustic Music Signals.
338-349
- Jyh-Shing Roger Jang, Hong-Ru Lee:
A General Framework of Progressive Filtering and Its Application to Query by Singing/Humming.
350-358
- Erdem Unal, Elaine Chew, Panayiotis G. Georgiou, Shrikanth S. Narayanan:
Challenging Uncertainty in Query by Humming Systems: A Fingerprinting Approach.
359-371
- Iman S. H. Suyoto, Alexandra L. Uitdenbogerd, Falk Scholer:
Searching Musical Audio Using Symbolic Queries.
372-381
- F. Kurth, M. Muler:
Efficient Index-Based Audio Matching.
382-395
- Akihiro Kimura, Kunio Kashino, Takayuki Kurozumi, Hiroshi Murase:
A Quick Search Method for Audio Signals Based on a Piecewise Linear Representation of Feature Trajectories.
396-407
- E. Pampalk, P. Herrera, M. Goto:
Computational Models of Similarity for Drum Samples.
408-423
- A. Holzapfel, Y. Stylianou:
Musical Genre Classification Using Nonnegative Matrix Factorization-Based Features.
424-434
- Kazuyoshi Yoshii, Masataka Goto, Kazuhiro Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
An Efficient Hybrid Music Recommender System Using an Incrementally Trainable Probabilistic Generative Model.
435-447
- Yi-Hsuan Yang, Yu-Ching Lin, Ya-Fan Su, Homer H. Chen:
A Regression Approach to Music Emotion Recognition.
448-457
- Luca Mion, Giovanni De Poli:
Score-Independent Audio Features for Description of Music Expression.
458-466
- Douglas Turnbull, Luke Barrington, D. Torres, Gert R. G. Lanckriet:
Semantic Annotation and Retrieval of Music and Sound Effects.
467-476
Volume 16,
Number 3,
March 2008
- Jingdong Chen, Jacob Benesty, Yiteng Huang:
A Minimum Distortion Noise Reduction Algorithm With Multiple Microphones.
481-493
- Yonggang Deng, William J. Byrne:
HMM Word and Phrase Alignment for Statistical Machine Translation.
494-507
- Giulia Garau, Steve Renals:
Combining Spectral Representations for Large-Vocabulary Continuous Speech Recognition.
508-518
- Jian Xue, Yunxin Zhao:
Random Forests of Phonetic Decision Trees for Acoustic Modeling in Conversational Speech Recognition.
519-528
- O. Gillet, G. Richard:
Transcription and Separation of Drum Signals From Polyphonic Music.
529-540
- Richard C. Hendriks, Jesper Jensen, Richard Heusdens:
Noise Tracking Using DFT Domain Subspace Decompositions.
541-553
- Haibin Huang, Pasi Fränti, Dong-Yan Huang, Susanto Rahardja:
Cascaded RLS-LMS Prediction in MPEG-4 Lossless Audio Coding.
554-562
- Jeih-Weih Hung, Wei-Yi Tsai:
Constructing Modulation Frequency Domain-Based Features for Robust Speech Recognition.
563-577
- A. Miguel, Eduardo Lleida, R. Rose, Luis Buera, O. Saz, Alfonso Ortega:
Capturing Local Variability for Speaker Normalization in Speech Recognition.
578-593
- Norman Poh, Josef Kittler:
Incorporating Model-Specific Score Distribution in Speaker Verification Systems.
594-606
- Yun Tang, R. Rose:
Rapid Speaker Adaptation Using Clustered Maximum-Likelihood Linear Basis With Sparse Training Data.
607-616
- Jeremy Morris, Eric Fosler-Lussier:
Conditional Random Fields for Integrating Local Discriminative Classifiers.
617-628
- Oscal T.-C. Chen, Wen-Chih Wu:
Highly Robust, Secure, and Perceptual-Quality Echo Hiding Scheme.
629-638
- Shoichiro Saito, Hirokazu Kameoka, K. Takahashi, Takuya Nishimoto, Shigeki Sagayama:
Specmurt Analysis of Polyphonic Music Signals.
639-650
- S. Shelley, D. T. Murphy:
The Modeling of Diffuse Boundaries in the 2-D Digital Waveguide Mesh.
651-665
- Iain McCowan, Mike Lincoln, Ivan Himawan:
Microphone Array Shape Calibration in Diffuse Noise Fields.
666-670
- Bob L. Sturm, John J. Shynk, Laurent Daudet, C. Roads:
Dark Energy in Sparse Atomic Estimations.
671-676
Volume 16,
Number 4,
May 2008
- Chi-Min Liu, Han-Wen Hsu, Wen-Chieh Lee:
Compression Artifacts in Perceptual Audio Coding.
681-695
- M. Yukawa, Rodrigo C. de Lamare, Raimundo Sampaio Neto:
Efficient Acoustic Echo Cancellation With Reduced-Rank Adaptive Filtering Based on Selective Decimation and Adaptive Interpolation.
696-710
- Gal Reuven, Sharon Gannot, Israel Cohen:
Dual-Source Transfer-Function Generalized Sidelobe Canceller.
711-727
- N. Roman, DeLiang Wang:
Binaural Tracking of Multiple Moving Sources.
728-739
- Boaz Rafaely:
The Spherical-Shell Microphone Array.
740-747
- B. Gunel, H. Hachabiboglu, Ahmet M. Kondoz:
Acoustic Source Separation of Convolutive Mixtures Based on Intensity Vector Statistics.
748-756
- Jacob Benesty, Jingdong Chen, Yiteng Huang:
On the Importance of the Pearson Correlation Coefficient in Noise Reduction.
757-765
- Zhiyao Duan, Yungang Zhang, Changshui Zhang, Zhenwei Shi:
Unsupervised Single-Channel Music Source Separation by Average Harmonic Structure Modeling.
766-778
- S. Yaman, Chin-Hui Lee:
A Flexible Classifier Design Framework Based on Multiobjective Programming.
779-789
- Simon Tucker, Steve Whittaker:
Temporal Compression Of Speech: An Evaluation.
790-796
- Vivek Kumar Rangarajan Sridhar, Srinivas Bangalore, S. S. Narayanan:
Exploiting Acoustic and Syntactic Features for Automatic Prosody Labeling in a Maximum Entropy Framework.
797-811
- F. Antonacci, M. Foco, Augusto Sarti, Stefano Tubaro:
Fast Tracing of Acoustic Beams and Paths Through Visibility Lookup.
812-824
- T. Fingscheidt, S. Suhadi, S. Stan:
Environment-Optimized Speech Enhancement.
825-834
- David Y. Zhao, W. Bastiaan Kleijn, A. Ypma, B. de Vries:
Online Noise Estimation Using Stochastic-Gain HMM for Speech Enhancement.
835-846
- J. Grothendieck, A. Gorin:
Towards Link Characterization From Content: Recovering Distributions From Classifier Output.
847-858
- Chia-Yu Wan, Lin-Shan Lee:
Histogram-Based Quantization for Robust and/or Distributed Speech Recognition.
859-873
Volume 16,
Number 5,
July 2008
- Norman H. Adams, Gregory H. Wakefield:
State-Space Synthesis of Virtual Auditory Space.
881-890
- Jianping Deng, Martin Bouchard, Tet Hin Yeap:
Feature Enhancement for Noisy Speech Recognition With a Time-Variant Linear Predictive HMM Structure.
891-899
- P. Liu, C. Liu, H. Jiang, F. Soong, R.-H. Wang:
A Constrained Line Search Optimization Method for Discriminative Training of HMMs.
900-909
- T. Gerkmann, C. Breithaupt, R. Martin:
Improved A Posteriori Speech Presence Probability Estimation Based on a Likelihood Ratio With Fixed Priors.
910-919
- Margarita Kotti, Emmanouil Benetos, Costas Kotropoulos:
Computationally Efficient and Robust BIC-Based Speaker Segmentation.
920-933
- H. Hacihabiboglu, B. Gunel, Ahmet M. Kondoz:
Time-Domain Simulation of Directive Sources in 3-D Digital Waveguide Mesh-Based Acoustical Models.
934-946
- M. Karjalainen:
Efficient Realization of Wave Digital Components for Physical Modeling and Sound Synthesis.
947-956
- Yiteng Huang, Jacob Benesty, Jingdong Chen:
Analysis and Comparison of Multichannel Noise Reduction Methods in a Common Framework.
957-968
- Srivatsan Kandadai, Charles D. Creusere:
Scalable Audio Compression at Low Bitrates.
969-979
- Patrick Kenny, Pierre Ouellet, N. Dehak, V. Gupta, Pierre Dumouchel:
A Study of Interspeaker Variability in Speaker Verification.
980-988
- J. Paschedag, B. Lohmann:
Error Convergence of the Filtered-X LMS Algorithm for Multiple Harmonic Excitation.
989-999
- Yegui Xiao, Akira Ikuta, Liying Ma, Khashayar Khorasani:
Stochastic Analysis of the FXLMS-Based Narrowband Active Noise Control System.
1000-1014
- Michael Casey, Christophe Rhodes, Malcolm Slaney:
Analysis of Minimum Distances in High-Dimensional Musical Spaces.
1015-1028
- Khe Chai Sim, Haizhou Li:
On Acoustic Diversification Front-End for Spoken Language Identification.
1029-1037
- Rasool Tahmasbi, Sadegh Rezaei:
Change Point Detection in GARCH Models for Voice Activity Detection.
1038-1046
- Valentin Ion, Reinhold Haeb-Umbach:
A Novel Uncertainty Decoding Rule With Applications to Transmission Error Robust Speech Recognition.
1047-1060
- Dong Yu, Li Deng, James Droppo, Jian Wu, Yifan Gong, Alex Acero:
Robust Speech Recognition Using a Cepstral Minimum-Mean-Square-Error-Motivated Noise Suppressor.
1061-1070
Volume 16,
Number 6,
August 2008
- Jia-Li You, Yining Chen, Min Chu, Frank K. Soong, Jin-Lin Wang:
Identifying Language Origin of Named Entity With Multiple Information Sources.
1077-1086
- K. I. Nordstrom, George Tzanetakis, Peter F. Driessen:
Transforming Perceived Vocal Effort and Breathiness Using Adaptive Pre-Emphasis Linear Prediction.
1087-1096
- Marco Grimaldi, Fred Cummins:
Speaker Identification Using Instantaneous Frequencies.
1097-1111
- Jan S. Erkelens, Richard Heusdens:
Tracking of Nonstationary Noise Based on Data-Driven Recursive Noise Power Estimation.
1112-1123
- H. Pulakka, Laura Laaksonen, M. Vainio, Jouni Pohjalainen, Paavo Alku:
Evaluation of an Artificial Speech Bandwidth Extension Method in Three Languages.
1124-1137
- J. Serra, E. Gomez, Perfecto Herrera, Xavier Serra:
Chroma Binary Similarity and Local Alignment Applied to Cover Song Identification.
1138-1151
- Ioannis Karydis, Alexandros Nanopoulos, Apostolos N. Papadopoulos, Dimitrios Katsaros, Yannis Manolopoulos:
Music Retrieval Over Wireless Ad-Hoc Networks.
1152-1162
- K. van den Doel, U. M. Ascher:
Real-Time Numerical Solution of Webster's Equation on A Nonuniform Grid.
1163-1172
- T. S. Brandes:
Feature Vector Selection and Use With Hidden Markov Models to Identify Frequency-Modulated Bioacoustic Signals Amidst Noise.
1173-1180
- U. Manmontri, Patrick A. Naylor:
A Class of Frobenius Norm-Based Algorithms Using Penalty Term and Natural Gradient for Blind Signal Separation.
1181-1193
- Manolis Perakakis, Alexandros Potamianos:
A Study in Efficiency and Modality Usage in Multimodal Form Filling Systems.
1194-1206
- S. Yaman, Li Deng, Dong Yu, Ye-Yi Wang, Alex Acero:
An Integrative and Discriminative Technique for Spoken Utterance Classification.
1207-1214
Volume 16,
Number 7,
September 2008
- Evgeny Matusov, Gregor Leusch, Rafael E. Banchs, Nicola Bertoldi, Daniel Dechelotte, Marcello Federico, M. Kolss, Young-Suk Lee, José B. Mariño, M. Paulik, Salim Roukos, Holger Schwenk, Hermann Ney:
System Combination for Machine Translation of Spoken and Written Language.
1222-1237
- Mike Dowman, Virginia Savova, Thomas L. Griffiths, Konrad P. Körding, Joshua B. Tenenbaum, Matthew Purver:
A Probabilistic Model of Meetings That Combines Words and Discourse Features.
1238-1248
- Srinivas Bangalore, Giuseppe Di Fabbrizio, Amanda Stent:
Learning the Structure of Task-Driven Human-Human Dialogs.
1249-1259
- Hany Hassan, Khalil Sima'an, Andy Way:
Syntactically Lexicalized Phrase-Based SMT.
1260-1273
- Christoph Tillmann, Tong Zhang:
An Online Relevant Set Algorithm for Statistical Machine Translation.
1274-1286
- Minwoo Jeong, Gary Geunbae Lee:
Triangular-Chain Conditional Random Fields.
1287-1302
- Alfred Dielmann, Steve Renals:
Recognition of Dialogue Acts in Multiparty Meetings Using a Switching DBN.
1303-1314
- Min Zhang, Wanxiang Che, Guodong Zhou, AiTi Aw, Chew Lim Tan, Ting Liu, Sheng Li:
Semantic Role Labeling Using a Grammar-Driven Convolution Tree Kernel.
1315-1329
- Ruhi Sarikaya, Mohamed Afify, Yonggang Deng, Hakan Erdogan, Yuqing Gao:
Joint Morphological-Lexical Language Modeling for Processing Morphologically Rich Languages With Application to Dialectal Arabic.
1330-1339
- Francesc Alías, Xavier Sevillano, Joan Claudi Socoró, Xavi Gonzalvo:
Towards High-Quality Next-Generation Text-to-Speech Synthesis: A Multidomain Approach by Automatic Domain Classification.
1340-1354
Volume 16,
Number 8,
November 2008
- E. Ravelli, G. Richard, Laurent Daudet:
Union of MDCT Bases for Audio Coding.
1361-1372
- Olivier Derrien, G. Richard:
A New Model-Based Algorithm for Optimizing the MPEG-AAC in MS-Stereo.
1373-1382
- Alberto Carini, S. Malatini:
Optimal Variable Step-Size NLMS Algorithms With Auxiliary Noise Power Scheduling for Feedforward Active Noise Control.
1383-1395
- Miguel Ferrer, Alberto Gonzalez, Maria de Diego, Gema Pinero:
Fast Affine Projection Algorithms for Filtered-x Multichannel Active Noise Control.
1396-1408
- Ming Wu, Guoyue Chen, Xiaojun Qiu:
An Improved Active Noise Control Algorithm Without Secondary Path Identification Based on the Frequency-Domain Subband Architecture.
1409-1419
- Jian-Wu Xu, José Carlos Príncipe:
A Pitch Detector Based on a Generalized Correlation Function.
1420-1432
- Emanuel A. P. Habets, Sharon Gannot, Israel Cohen, P. Sommen:
Joint Dereverberation and Residual Echo Suppression of Speech Signals in Noisy Environments.
1433-1451
- J. H. Gunther, G. Wilson:
Mean-Squared Error Analysis of Adaptive Subband-Based System Identification.
1452-1465
- Constantin Paleologu, Jacob Benesty, Silviu Ciochina:
A Variable Step-Size Affine Projection Algorithm Designed for Acoustic Echo Cancellation.
1466-1478
- J. Scheuing, Bin Yang:
Disambiguation of TDOA Estimation for Multiple Sources in Reverberant Environments.
1479-1489
- Jacek Dmochowski, Jacob Benesty, Sofiène Affes:
Linearly Constrained Minimum Variance Source Localization and Spectral Estimation.
1490-1502
- Jeroen Breebaart, Erik Schuijers:
Phantom Materialization: A Novel Method to Enhance Stereo Audio Reproduction on Headphones.
1503-1511
- Tomohiro Nakatani, Biing-Hwang Juang, Takuya Yoshioka, Keisuke Kinoshita, Marc Delcroix, Masato Miyoshi:
Speech Dereverberation Based on Maximum-Likelihood Estimation With Time-Varying Gaussian Source Model.
1512-1527
- A. Abramson, I. Cohen:
Single-Sensor Audio Source Separation Using Classification and Estimation Approach and GARCH Modeling.
1528-1540
- Chang-Hsing Lee, Chin-Chuan Han, Ching-Chien Chuang:
Automatic Classification of Bird Species From Their Sounds Using Two-Dimensional Cepstral Coefficients.
1541-1550
- Shahram Khadivi, Hermann Ney:
Integration of Speech Recognition and Machine Translation in Computer-Assisted Translation.
1551-1564
- Juan Manuel Górriz, Javier Ramírez, Elmar Wolfgang Lang, Carlos García Puntonet:
Jointly Gaussian PDF-Based Likelihood Ratio Test for Voice Activity Detection.
1565-1578
- Tiago H. Falk, Wai-Yip Chan:
Hybrid Signal-and-Link-Parametric Speech Quality Measurement for VoIP Communications.
1579-1589
- K. J. Han, S. Kim, S. S. Narayanan:
Strategies to Improve the Robustness of Agglomerative Hierarchical Clustering Under Data Source Variation for Speaker Diarization.
1590-1601
- K. S. R. Murty, B. Yegnanarayana:
Epoch Extraction From Speech Signals.
1602-1613
- E. Plourde, B. Champagne:
Auditory-Based Spectral Amplitude Estimators for Speech Enhancement.
1614-1623
- Benny Sallberg, Nedelko Grbic, Ingvar Claesson:
Complex-Valued Independent Component Analysis for Online Blind Speech Extraction.
1624-1632
- Hai Huyen Dam, Hai Quang Dam, Sven Nordholm:
Noise Statistics Update Adaptive Beamformer With PSD Estimation for Speech Extraction in Noisy Environment.
1633-1641
- Donglai Zhu, Haizhou Li, Bin Ma, Chin-Hui Lee:
Optimizing the Performance of Spoken Language Recognition With Discriminative Training.
1642-1653
- Kevin M. Indrebo, Richard J. Povinelli, Michael T. Johnson:
Minimum Mean-Squared Error Estimation of Mel-Frequency Cepstral Coefficients Using a Novel Distortion Model.
1654-1661
- Xiong Xiao, Chng Eng Siong, Haizhou Li:
Normalization of the Speech Modulation Spectra for Robust Speech Recognition.
1662-1674
- Yi-Hsiang Chao, Wei-Ho Tsai, Hsin-Min Wang, Ruei-Chuan Chang:
Using Kernel Discriminant Analysis to Improve the Characterization of the Alternative Hypothesis for Speaker Verification.
1675-1684
- Ruohua Zhou, Marco Mattavelli, Giorgio Zoia:
Music Onset Detection Based on Resonator Time Frequency Image.
1685-1695
- Nicola Bertoldi, Richard Zens, Marcello Federico, Wade Shen:
Efficient Speech Translation Through Confusion Network Decoding.
1696-1705
- Ming Wu, Xiaojun Qiu, Guoyue Chen:
An Overlap-Save Frequency-Domain Implementation of the Delayless Subband ANC Algorithm.
1706-1710
Copyright © Sat Nov 21 01:36:43 2009
by Michael Ley (ley@uni-trier.de)