Volume 16, Number 1, January 2008
- Julio Vargas, Steve McLaughlin:
Cascade Prediction Filters With Adaptive Zeros to Track the Time-Varying Resonances of the Vocal Tract.
1-7

- Joseph Tepperman, Shrikanth Narayanan:
Using Articulatory Representations to Detect Segmental Errors in Nonnative Pronunciation.
8-22

- Ian Vince McLoughlin:
Subjective Intelligibility Testing of Chinese Speech.
23-33

- Nicolas Malyska, Thomas F. Quatieri:
Spectral Representations of Nonmodal Phonation.
34-46

- Carlos Toshinori Ishi, K.-I. Sakakibara, Hiroshi Ishiguro, Norihiro Hagita:
A Method for Automatic Detection of Vocal Fry.
47-56

- Volodya Grancharov, Jan H. Plasberg, Jonas Samuelsson, W. Bastiaan Kleijn:
Generalized Postfilter for Speech Quality Enhancement.
57-64

- L. Anders Ekman, W. Bastiaan Kleijn, Manohar N. Murthi:
Regularized Linear Prediction of Speech.
65-73

- Jerome R. Bellegarda:
Unit-Centric Feature Mapping for Inventory Pruning in Unit Selection Text-to-Speech Synthesis.
74-82

- Gerard Hotho, Lars F. Villemoes, Jeroen Breebaart:
A Backward-Compatible Multichannel Audio Codec.
83-93

- Te Li, Susanto Rahardja, Soo Ngee Koh:
Frequency Region-Based Prioritized Bit-Plane Coding for Scalable Audio.
94-105

- S. Grofit, Yizhar Lavner:
Time-Scale Modification of Audio Signals Using Enhanced WSOLA With Management of Transients.
106-115

- Pierre Leveau, Emmanuel Vincent, Gaël Richard, Laurent Daudet:
Instrument-Specific Harmonic Atoms for Mid-Level Music Representation.
116-128

- Charles D. Creusere, K. D. Kallakuri, R. Vanam:
An Objective Metric of Human Subjective Audio Quality Optimized for a Wide Range of Audio Fidelities.
129-136

- Wei Chu, Benoît Champagne:
A Noise-Robust FFT-Based Auditory Spectrum With Application in Audio Classification.
137-150

- Heidi Christensen, Yoshihiko Gotoh, Steve Renals:
A Cascaded Broadcast News Highlighter.
151-161

- Yekutiel Avargel, Israel Cohen:
Adaptive System Identification in the Short-Time Fourier Transform Domain Using Cross-Multiplicative Transfer Function Approximation.
162-173

- Cédric Févotte, Bruno Torrésani, Laurent Daudet, Simon J. Godsill:
Sparse Linear Regression With Structured Priors and Application to Denoising of Musical Audio.
174-185

- A. S. Park, James R. Glass:
Unsupervised Pattern Discovery in Speech.
186-197

- Jen-Tzung Chien, Meng-Sung Wu:
Adaptive Bayesian Latent Semantic Analysis.
198-207

- Imed Zitouni:
Constrained Minimization and Discriminative Training for Natural Language Call Routing.
208-215

- Sankaranarayanan Ananthakrishnan, Shrikanth S. Narayanan:
Automatic Prosodic Event Detection Using Acoustic, Lexical, and Syntactic Evidence.
216-228

- Yi Hu, Philipos C. Loizou:
Evaluation of Objective Quality Measures for Speech Enhancement.
229-238

- Jen-Tzung Chien, Chuan-Wei Ting:
Factor Analyzed Subspace Modeling and Selection.
239-248

Volume 16, Number 2, February 2008
- Anssi Klapuri:
Multipitch Analysis of Polyphonic Music and Speech Signals Using an Auditory Model.
255-266

- M. R. Every:
Discriminating Between Pitched Sources in Music Audio.
267-277

- Mathieu Lagrange, Luis Gustavo Martins, Jennifer Murdoch, George Tzanetakis:
Normalized Cuts for Predominant Melodic Source Separation.
278-290

- Kyogu Lee, Malcolm Slaney:
Acoustic Chord Transcription and Key Extraction From Audio Using Key-Dependent HMMs Trained on Synthesized Audio.
291-301

- Peter Jan O. Doets, Reginald L. Lagendijk:
Distortion Estimation in Compressed Music Using Only Audio Fingerprints.
302-317

- Mark Levy, Mark Sandler:
Structural Segmentation of Musical Audio by Constrained Clustering.
318-326

- Shlomo Dubnov:
Unified View of Prediction and Repetition Structure in Audio Signals With Application to Interest Point Detection.
327-337

- Min-Yen Kan, Ye Wang, Denny Iskandar, Tin Lay Nwe, Arun Shenoy:
LyricAlly: Automatic Synchronization of Textual Lyrics to Acoustic Music Signals.
338-349

- Jyh-Shing Roger Jang, Hong-Ru Lee:
A General Framework of Progressive Filtering and Its Application to Query by Singing/Humming.
350-358

- Erdem Unal, Elaine Chew, Panayiotis G. Georgiou, Shrikanth S. Narayanan:
Challenging Uncertainty in Query by Humming Systems: A Fingerprinting Approach.
359-371

- Iman S. H. Suyoto, Alexandra L. Uitdenbogerd, Falk Scholer:
Searching Musical Audio Using Symbolic Queries.
372-381

- Frank Kurth, Meinard Müller:
Efficient Index-Based Audio Matching.
382-395

- Akihiro Kimura, Kunio Kashino, Takayuki Kurozumi, Hiroshi Murase:
A Quick Search Method for Audio Signals Based on a Piecewise Linear Representation of Feature Trajectories.
396-407

- Elias Pampalk, Perfecto Herrera, Masataka Goto:
Computational Models of Similarity for Drum Samples.
408-423

- Andre Holzapfel, Yannis Stylianou:
Musical Genre Classification Using Nonnegative Matrix Factorization-Based Features.
424-434

- Kazuyoshi Yoshii, Masataka Goto, Kazuhiro Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
An Efficient Hybrid Music Recommender System Using an Incrementally Trainable Probabilistic Generative Model.
435-447

- Yi-Hsuan Yang, Yu-Ching Lin, Ya-Fan Su, Homer H. Chen:
A Regression Approach to Music Emotion Recognition.
448-457

- Luca Mion, Giovanni De Poli:
Score-Independent Audio Features for Description of Music Expression.
458-466

- Douglas Turnbull, Luke Barrington, David A. Torres, Gert R. G. Lanckriet:
Semantic Annotation and Retrieval of Music and Sound Effects.
467-476

Volume 16, Number 3, March 2008
- Jingdong Chen, Jacob Benesty, Yiteng Huang:
A Minimum Distortion Noise Reduction Algorithm With Multiple Microphones.
481-493

- Yonggang Deng, William J. Byrne:
HMM Word and Phrase Alignment for Statistical Machine Translation.
494-507

- Giulia Garau, Steve Renals:
Combining Spectral Representations for Large-Vocabulary Continuous Speech Recognition.
508-518

- Jian Xue, Yunxin Zhao:
Random Forests of Phonetic Decision Trees for Acoustic Modeling in Conversational Speech Recognition.
519-528

- Olivier Gillet, Gaël Richard:
Transcription and Separation of Drum Signals From Polyphonic Music.
529-540

- Richard C. Hendriks, Jesper Jensen, Richard Heusdens:
Noise Tracking Using DFT Domain Subspace Decompositions.
541-553

- Haibin Huang, Pasi Fränti, Dong-Yan Huang, Susanto Rahardja:
Cascaded RLS-LMS Prediction in MPEG-4 Lossless Audio Coding.
554-562

- Jeih-Weih Hung, Wei-Yi Tsai:
Constructing Modulation Frequency Domain-Based Features for Robust Speech Recognition.
563-577

- Antonio Miguel, Eduardo Lleida, Richard C. Rose, Luis Buera, Oscar Saz, Alfonso Ortega:
Capturing Local Variability for Speaker Normalization in Speech Recognition.
578-593

- Norman Poh, Josef Kittler:
Incorporating Model-Specific Score Distribution in Speaker Verification Systems.
594-606

- Yun Tang, Richard C. Rose:
Rapid Speaker Adaptation Using Clustered Maximum-Likelihood Linear Basis With Sparse Training Data.
607-616

- Jeremy Morris, Eric Fosler-Lussier:
Conditional Random Fields for Integrating Local Discriminative Classifiers.
617-628

- Oscal T.-C. Chen, Wen-Chih Wu:
Highly Robust, Secure, and Perceptual-Quality Echo Hiding Scheme.
629-638

- Shoichiro Saito, Hirokazu Kameoka, K. Takahashi, Takuya Nishimoto, Shigeki Sagayama:
Specmurt Analysis of Polyphonic Music Signals.
639-650

- S. Shelley, Damian T. Murphy:
The Modeling of Diffuse Boundaries in the 2-D Digital Waveguide Mesh.
651-665

- Iain McCowan, Mike Lincoln, Ivan Himawan:
Microphone Array Shape Calibration in Diffuse Noise Fields.
666-670

- Bob L. Sturm, John J. Shynk, Laurent Daudet, C. Roads:
Dark Energy in Sparse Atomic Estimations.
671-676

Volume 16, Number 4, May 2008
- Chi-Min Liu, Han-Wen Hsu, Wen-Chieh Lee:
Compression Artifacts in Perceptual Audio Coding.
681-695

- Masahiro Yukawa, Rodrigo C. de Lamare, Raimundo Sampaio Neto:
Efficient Acoustic Echo Cancellation With Reduced-Rank Adaptive Filtering Based on Selective Decimation and Adaptive Interpolation.
696-710

- Gal Reuven, Sharon Gannot, Israel Cohen:
Dual-Source Transfer-Function Generalized Sidelobe Canceller.
711-727

- Nicoleta Roman, DeLiang Wang:
Binaural Tracking of Multiple Moving Sources.
728-739

- Boaz Rafaely:
The Spherical-Shell Microphone Array.
740-747

- Banu Gunel, Hüseyin Hacihabiboglu, Ahmet M. Kondoz:
Acoustic Source Separation of Convolutive Mixtures Based on Intensity Vector Statistics.
748-756

- Jacob Benesty, Jingdong Chen, Yiteng Huang:
On the Importance of the Pearson Correlation Coefficient in Noise Reduction.
757-765

- Zhiyao Duan, Yungang Zhang, Changshui Zhang, Zhenwei Shi:
Unsupervised Single-Channel Music Source Separation by Average Harmonic Structure Modeling.
766-778

- Sibel Yaman, Chin-Hui Lee:
A Flexible Classifier Design Framework Based on Multiobjective Programming.
779-789

- Simon Tucker, Steve Whittaker:
Temporal Compression of Speech: An Evaluation.
790-796

- Vivek Kumar Rangarajan Sridhar, Srinivas Bangalore, Shrikanth S. Narayanan:
Exploiting Acoustic and Syntactic Features for Automatic Prosody Labeling in a Maximum Entropy Framework.
797-811

- Fabio Antonacci, M. Foco, Augusto Sarti, Stefano Tubaro:
Fast Tracing of Acoustic Beams and Paths Through Visibility Lookup.
812-824

- Tim Fingscheidt, Suhadi Suhadi, Sorel Stan:
Environment-Optimized Speech Enhancement.
825-834

- David Y. Zhao, W. Bastiaan Kleijn, Alexander Ypma, Bert de Vries:
Online Noise Estimation Using Stochastic-Gain HMM for Speech Enhancement.
835-846

- John Grothendieck, Allen L. Gorin:
Towards Link Characterization From Content: Recovering Distributions From Classifier Output.
847-858

- Chia-Yu Wan, Lin-Shan Lee:
Histogram-Based Quantization for Robust and/or Distributed Speech Recognition.
859-873

Volume 16, Number 5, July 2008
- Norman H. Adams, Gregory H. Wakefield:
State-Space Synthesis of Virtual Auditory Space.
881-890

- Jianping Deng, Martin Bouchard, Tet Hin Yeap:
Feature Enhancement for Noisy Speech Recognition With a Time-Variant Linear Predictive HMM Structure.
891-899

- P. Liu, C. Liu, H. Jiang, F. Soong, R.-H. Wang:
A Constrained Line Search Optimization Method for Discriminative Training of HMMs.
900-909

- Timo Gerkmann, Colin Breithaupt, Rainer Martin:
Improved A Posteriori Speech Presence Probability Estimation Based on a Likelihood Ratio With Fixed Priors.
910-919

- Margarita Kotti, Emmanouil Benetos, Costas Kotropoulos:
Computationally Efficient and Robust BIC-Based Speaker Segmentation.
920-933

- Hüseyin Hacihabiboglu, Banu Gunel, Ahmet M. Kondoz:
Time-Domain Simulation of Directive Sources in 3-D Digital Waveguide Mesh-Based Acoustical Models.
934-946

- Matti Karjalainen:
Efficient Realization of Wave Digital Components for Physical Modeling and Sound Synthesis.
947-956

- Yiteng Huang, Jacob Benesty, Jingdong Chen:
Analysis and Comparison of Multichannel Noise Reduction Methods in a Common Framework.
957-968

- Srivatsan Kandadai, Charles D. Creusere:
Scalable Audio Compression at Low Bitrates.
969-979

- Patrick Kenny, Pierre Ouellet, Najim Dehak, Vishwa Gupta, Pierre Dumouchel:
A Study of Interspeaker Variability in Speaker Verification.
980-988

- J. Paschedag, B. Lohmann:
Error Convergence of the Filtered-X LMS Algorithm for Multiple Harmonic Excitation.
989-999

- Yegui Xiao, Akira Ikuta, Liying Ma, Khashayar Khorasani:
Stochastic Analysis of the FXLMS-Based Narrowband Active Noise Control System.
1000-1014

- Michael Casey, Christophe Rhodes, Malcolm Slaney:
Analysis of Minimum Distances in High-Dimensional Musical Spaces.
1015-1028

- Khe Chai Sim, Haizhou Li:
On Acoustic Diversification Front-End for Spoken Language Identification.
1029-1037

- Rasool Tahmasbi, Sadegh Rezaei:
Change Point Detection in GARCH Models for Voice Activity Detection.
1038-1046

- Valentin Ion, Reinhold Haeb-Umbach:
A Novel Uncertainty Decoding Rule With Applications to Transmission Error Robust Speech Recognition.
1047-1060

- Dong Yu, Li Deng, Jasha Droppo, Jian Wu, Yifan Gong, Alex Acero:
Robust Speech Recognition Using a Cepstral Minimum-Mean-Square-Error-Motivated Noise Suppressor.
1061-1070

Volume 16, Number 6, August 2008
- Jia-Li You, Yining Chen, Min Chu, Frank K. Soong, Jin-Lin Wang:
Identifying Language Origin of Named Entity With Multiple Information Sources.
1077-1086

- K. I. Nordstrom, George Tzanetakis, Peter F. Driessen:
Transforming Perceived Vocal Effort and Breathiness Using Adaptive Pre-Emphasis Linear Prediction.
1087-1096

- Marco Grimaldi, Fred Cummins:
Speaker Identification Using Instantaneous Frequencies.
1097-1111

- Jan S. Erkelens, Richard Heusdens:
Tracking of Nonstationary Noise Based on Data-Driven Recursive Noise Power Estimation.
1112-1123

- Hannu Pulakka, Laura Laaksonen, Martti Vainio, Jouni Pohjalainen, Paavo Alku:
Evaluation of an Artificial Speech Bandwidth Extension Method in Three Languages.
1124-1137

- Joan Serrà, Emilia Gómez, Perfecto Herrera, Xavier Serra:
Chroma Binary Similarity and Local Alignment Applied to Cover Song Identification.
1138-1151

- Ioannis Karydis, Alexandros Nanopoulos, Apostolos N. Papadopoulos, Dimitrios Katsaros, Yannis Manolopoulos:
Music Retrieval Over Wireless Ad-Hoc Networks.
1152-1162

- Kees van den Doel, Uri M. Ascher:
Real-Time Numerical Solution of Webster's Equation on a Nonuniform Grid.
1163-1172

- T. S. Brandes:
Feature Vector Selection and Use With Hidden Markov Models to Identify Frequency-Modulated Bioacoustic Signals Amidst Noise.
1173-1180

- U. Manmontri, Patrick A. Naylor:
A Class of Frobenius Norm-Based Algorithms Using Penalty Term and Natural Gradient for Blind Signal Separation.
1181-1193

- Manolis Perakakis, Alexandros Potamianos:
A Study in Efficiency and Modality Usage in Multimodal Form Filling Systems.
1194-1206

- Sibel Yaman, Li Deng, Dong Yu, Ye-Yi Wang, Alex Acero:
An Integrative and Discriminative Technique for Spoken Utterance Classification.
1207-1214

Volume 16, Number 7, September 2008
- Evgeny Matusov, Gregor Leusch, Rafael E. Banchs, Nicola Bertoldi, Daniel Déchelotte, Marcello Federico, M. Kolss, Young-Suk Lee, José B. Mariño, M. Paulik, Salim Roukos, Holger Schwenk, Hermann Ney:
System Combination for Machine Translation of Spoken and Written Language.
1222-1237

- Mike Dowman, Virginia Savova, Thomas L. Griffiths, Konrad P. Körding, Joshua B. Tenenbaum, Matthew Purver:
A Probabilistic Model of Meetings That Combines Words and Discourse Features.
1238-1248

- Srinivas Bangalore, Giuseppe Di Fabbrizio, Amanda Stent:
Learning the Structure of Task-Driven Human-Human Dialogs.
1249-1259

- Hany Hassan, Khalil Sima'an, Andy Way:
Syntactically Lexicalized Phrase-Based SMT.
1260-1273

- Christoph Tillmann, Tong Zhang:
An Online Relevant Set Algorithm for Statistical Machine Translation.
1274-1286

- Minwoo Jeong, Gary Geunbae Lee:
Triangular-Chain Conditional Random Fields.
1287-1302

- Alfred Dielmann, Steve Renals:
Recognition of Dialogue Acts in Multiparty Meetings Using a Switching DBN.
1303-1314

- Min Zhang, Wanxiang Che, Guodong Zhou, AiTi Aw, Chew Lim Tan, Ting Liu, Sheng Li:
Semantic Role Labeling Using a Grammar-Driven Convolution Tree Kernel.
1315-1329

- Ruhi Sarikaya, Mohamed Afify, Yonggang Deng, Hakan Erdogan, Yuqing Gao:
Joint Morphological-Lexical Language Modeling for Processing Morphologically Rich Languages With Application to Dialectal Arabic.
1330-1339

- Francesc Alías, Xavier Sevillano, Joan Claudi Socoró, Xavi Gonzalvo:
Towards High-Quality Next-Generation Text-to-Speech Synthesis: A Multidomain Approach by Automatic Domain Classification.
1340-1354

Volume 16, Number 8, November 2008
- Emmanuel Ravelli, Gaël Richard, Laurent Daudet:
Union of MDCT Bases for Audio Coding.
1361-1372

- Olivier Derrien, Gaël Richard:
A New Model-Based Algorithm for Optimizing the MPEG-AAC in MS-Stereo.
1373-1382

- Alberto Carini, Silvia Malatini:
Optimal Variable Step-Size NLMS Algorithms With Auxiliary Noise Power Scheduling for Feedforward Active Noise Control.
1383-1395

- Miguel Ferrer, Alberto González, Maria de Diego, Gema Pinero:
Fast Affine Projection Algorithms for Filtered-x Multichannel Active Noise Control.
1396-1408

- Ming Wu, Guoyue Chen, Xiaojun Qiu:
An Improved Active Noise Control Algorithm Without Secondary Path Identification Based on the Frequency-Domain Subband Architecture.
1409-1419

- Jian-Wu Xu, José Carlos Príncipe:
A Pitch Detector Based on a Generalized Correlation Function.
1420-1432

- Emanuel A. P. Habets, Sharon Gannot, Israel Cohen, P. Sommen:
Joint Dereverberation and Residual Echo Suppression of Speech Signals in Noisy Environments.
1433-1451

- J. H. Gunther, G. Wilson:
Mean-Squared Error Analysis of Adaptive Subband-Based System Identification.
1452-1465

- Constantin Paleologu, Jacob Benesty, Silviu Ciochina:
A Variable Step-Size Affine Projection Algorithm Designed for Acoustic Echo Cancellation.
1466-1478

- J. Scheuing, Bin Yang:
Disambiguation of TDOA Estimation for Multiple Sources in Reverberant Environments.
1479-1489

- Jacek Dmochowski, Jacob Benesty, Sofiène Affes:
Linearly Constrained Minimum Variance Source Localization and Spectral Estimation.
1490-1502

- Jeroen Breebaart, Erik Schuijers:
Phantom Materialization: A Novel Method to Enhance Stereo Audio Reproduction on Headphones.
1503-1511

- Tomohiro Nakatani, Biing-Hwang Juang, Takuya Yoshioka, Keisuke Kinoshita, Marc Delcroix, Masato Miyoshi:
Speech Dereverberation Based on Maximum-Likelihood Estimation With Time-Varying Gaussian Source Model.
1512-1527

- Ari Abramson, Israel Cohen:
Single-Sensor Audio Source Separation Using Classification and Estimation Approach and GARCH Modeling.
1528-1540

- Chang-Hsing Lee, Chin-Chuan Han, Ching-Chien Chuang:
Automatic Classification of Bird Species From Their Sounds Using Two-Dimensional Cepstral Coefficients.
1541-1550

- Shahram Khadivi, Hermann Ney:
Integration of Speech Recognition and Machine Translation in Computer-Assisted Translation.
1551-1564

- Juan Manuel Górriz, Javier Ramírez, Elmar Wolfgang Lang, Carlos García Puntonet:
Jointly Gaussian PDF-Based Likelihood Ratio Test for Voice Activity Detection.
1565-1578

- Tiago H. Falk, Wai-Yip Chan:
Hybrid Signal-and-Link-Parametric Speech Quality Measurement for VoIP Communications.
1579-1589

- K. J. Han, S. Kim, S. S. Narayanan:
Strategies to Improve the Robustness of Agglomerative Hierarchical Clustering Under Data Source Variation for Speaker Diarization.
1590-1601

- K. Sri Rama Murty, B. Yegnanarayana:
Epoch Extraction From Speech Signals.
1602-1613

- Eric Plourde, Benoît Champagne:
Auditory-Based Spectral Amplitude Estimators for Speech Enhancement.
1614-1623

- Benny Sallberg, Nedelko Grbic, Ingvar Claesson:
Complex-Valued Independent Component Analysis for Online Blind Speech Extraction.
1624-1632

- Hai Huyen Dam, Hai Quang Dam, Sven Nordholm:
Noise Statistics Update Adaptive Beamformer With PSD Estimation for Speech Extraction in Noisy Environment.
1633-1641

- Donglai Zhu, Haizhou Li, Bin Ma, Chin-Hui Lee:
Optimizing the Performance of Spoken Language Recognition With Discriminative Training.
1642-1653

- Kevin M. Indrebo, Richard J. Povinelli, Michael T. Johnson:
Minimum Mean-Squared Error Estimation of Mel-Frequency Cepstral Coefficients Using a Novel Distortion Model.
1654-1661

- Xiong Xiao, Chng Eng Siong, Haizhou Li:
Normalization of the Speech Modulation Spectra for Robust Speech Recognition.
1662-1674

- Yi-Hsiang Chao, Wei-Ho Tsai, Hsin-Min Wang, Ruei-Chuan Chang:
Using Kernel Discriminant Analysis to Improve the Characterization of the Alternative Hypothesis for Speaker Verification.
1675-1684

- Ruohua Zhou, Marco Mattavelli, Giorgio Zoia:
Music Onset Detection Based on Resonator Time Frequency Image.
1685-1695

- Nicola Bertoldi, Richard Zens, Marcello Federico, Wade Shen:
Efficient Speech Translation Through Confusion Network Decoding.
1696-1705

- Ming Wu, Xiaojun Qiu, Guoyue Chen:
An Overlap-Save Frequency-Domain Implementation of the Delayless Subband ANC Algorithm.
1706-1710

Last update Sat May 18 20:55:11 2013 CET by the DBLP Team.
Data released under the ODC-BY 1.0 license.