| 2012 | ||
|---|---|---|
| j3 | Hao Tang, Stephen M. Chu, Mark Hasegawa-Johnson, Thomas S. Huang: Partially Supervised Speaker Clustering. IEEE Trans. Pattern Anal. Mach. Intell. 34(5): 959-971 (2012) | |
| c30 | Stephen M. Chu, Lidia Mangu: Improving arabic broadcast transcription using automatic topic clustering. ICASSP 2012: 4449-4452 | |
| 2011 | ||
| c29 | Lidia Mangu, Hong-Kwang Kuo, Stephen M. Chu, Brian Kingsbury, George Saon, Hagen Soltau, Fadi Biadsy: The IBM 2011 GALE Arabic speech transcription system. ASRU 2011: 272-277 | |
| c28 | Brian Kingsbury, Hagen Soltau, George Saon, Stephen M. Chu, Hong-Kwang Kuo, Lidia Mangu, Suman V. Ravuri, Nelson Morgan, Adam Janin: The IBM 2009 GALE Arabic speech transcription system. ICASSP 2011: 4672-4675 | |
| 2010 | ||
| c27 | Stephen M. Chu, Daniel Povey: Speaking rate adaptation using continuous frame rate normalization. ICASSP 2010: 4306-4309 | |
| c26 | Stephen M. Chu, Daniel Povey, Hong-Kwang Kuo, Lidia Mangu, Shilei Zhang, Qin Shi, Yong Qin: The 2009 IBM GALE Mandarin broadcast transcription system. ICASSP 2010: 4374-4377 | |
| c25 | George Saon, Hagen Soltau, Upendra V. Chaudhari, Stephen M. Chu, Brian Kingsbury, Hong-Kwang Kuo, Lidia Mangu, Daniel Povey: The IBM 2008 GALE Arabic speech transcription system. ICASSP 2010: 4378-4381 | |
| c24 | ||
| c23 | Qin Shi, Kun Li, Shilei Zhang, Stephen M. Chu, Ji Xiao, ZhiJian Ou: Spoken English assessment system for non-native speakers using acoustic and prosodic features. INTERSPEECH 2010: 1874-1877 | |
| 2009 | ||
| c22 | Stephen M. Chu, Hao Tang, Thomas S. Huang: Fishervoice and semi-supervised speaker clustering. ICASSP 2009: 4089-4092 | |
| c21 | Hao Tang, Stephen M. Chu, Thomas S. Huang: Generative model-based speaker clustering via mixture of von Mises-Fisher distributions. ICASSP 2009: 4101-4104 | |
| c20 | Shilei Zhang, Qin Shi, Stephen M. Chu, Yong Qin: Main vowel domain tone modeling with lexical and prosodic analysis for Mandarin ASR. ICASSP 2009: 4561-4564 | |
| c19 | Hao Tang, Stephen M. Chu, Mark Hasegawa-Johnson, Thomas S. Huang: Emotion recognition from speech VIA boosted Gaussian mixture models. ICME 2009: 294-297 | |
| c18 | Stephen M. Chu, Hao Tang, Thomas S. Huang: Locality preserving speaker clustering. ICME 2009: 494-497 | |
| c17 | Hao Tang, Stephen M. Chu, Thomas S. Huang: Spherical Discriminant Analysis in Semi-supervised Speaker Clustering. HLT-NAACL (Short Papers) 2009: 57-60 | |
| 2008 | ||
| c16 | Qin Shi, Stephen M. Chu, Wen Liu, Hong-Kwang Jeff Kuo, Yi Liu, Yong Qin: Search and classification based language model adaptation. INTERSPEECH 2008: 1578-1581 | |
| 2007 | ||
| j2 | Djamel Mostefa, Nicolas Moreau, Khalid Choukri, Gerasimos Potamianos, Stephen M. Chu, Ambrish Tyagi, Josep R. Casas, Jordi Turmo, Luca Cristoforetti, Francesco Tobia, Aristodemos Pnevmatikakis, Vasileios Mylonakis, Fotios Talantzis, Susanne Burger, Rainer Stiefelhagen, Keni Bernardin, Cedrick Rochet: The CHIL audiovisual corpus for lecture and meeting analysis inside smart rooms. Language Resources and Evaluation 41(3-4): 389-407 (2007) | |
| c15 | Stephen M. Chu, Thomas S. Huang: Audio-Visual Speech Fusion Using Coupled Hidden Markov Models. CVPR 2007 | |
| 2006 | ||
| c14 | ZhenQiu Zhang, Gerasimos Potamianos, Stephen M. Chu, Jilin Tu, Thomas S. Huang: Person Tracking in Smart Rooms using Dynamic Programming and Adaptive Subspace Learning. ICME 2006: 2061-2064 | |
| c13 | Yong Qin, Qin Shi, Yi Y. Liu, Hagai Aronowitz, Stephen M. Chu, Hong-Kwang Kuo, Geoffrey Zweig: Advances in Mandarin Broadcast Speech Transcription at IBM Under the DARPA GALE Program. ISCSLP 2006: 410-421 | |
| 2005 | ||
| c12 | ZhenQiu Zhang, Gerasimos Potamianos, Andrew W. Senior, Stephen M. Chu, Thomas S. Huang: A Joint System for Person Tracking and Face Detection. ICCV-HCI 2005: 47-59 | |
| c11 | Dusan Macho, Jaume Padrell, Alberto Abad, Climent Nadeu, Javier Hernando, John W. McDonough, Matthias Wölfel, Ulrich Klee, Maurizio Omologo, Alessio Brutti, Piergiorgio Svaizer, Gerasimos Potamianos, Stephen M. Chu: Automatic Speech Activity Detection, Source Localization, and Speech Recognition on the Chil Seminar Corpus. ICME 2005: 876-879 | |
| c10 | Stephen M. Chu, Etienne Marcheret, Gerasimos Potamianos: Automatic Speech Recognition and Speech Activity Detection in the CHIL Smart Room. MLMI 2005: 332-343 | |
| 2004 | ||
| c9 | Stephen M. Chu, Vit Libal, Etienne Marcheret, Chalapathy Neti, Gerasimos Potamianos: Multistage information fusion for audio-visual speech recognition. ICME 2004: 1651-1654 | |
| c8 | Etienne Marcheret, Stephen M. Chu, Vaibhava Goel, Gerasimos Potamianos: Efficient likelihood computation in multi-stream HMM based audio-visual speech recognition. INTERSPEECH 2004 | |
| c7 | Patricia Scanlon, Gerasimos Potamianos, Vit Libal, Stephen M. Chu: Mutual information based visual feature selection for lipreading. INTERSPEECH 2004 | |
| 2002 | ||
| c6 | Stephen M. Chu, Thomas S. Huang: Audio-visual speech modeling using coupled hidden Markov models. ICASSP 2002: 2009-2012 | |
| c5 | Stephen M. Chu, Thomas S. Huang: An experimental study of coupled hidden Markov models. ICASSP 2002: 4100-4103 | |
| 2000 | ||
| j1 | Rajeev Sharma, Michael Zeller, Vladimir Pavlovic, Thomas S. Huang, Zion Lo, Stephen M. Chu, Yunxin Zhao, James C. Phillips, Klaus Schulten: Speech/Gesture Interface to a Visual-Computing Environment. IEEE Computer Graphics and Applications 20(2): 29-37 (2000) | |
| c4 | Stephen M. Chu, Thomas S. Huang: Automatic head gesture learning and synthesis from prosodic cues. INTERSPEECH 2000: 637-640 | |
| c3 | Stephen M. Chu, Thomas S. Huang: Bimodal speech recognition using coupled hidden Markov models. INTERSPEECH 2000: 747-750 | |
| 1998 | ||
| c2 | Stephen M. Chu, Yunxin Zhao: Robust speech recognition using discriminative stream weighting and parameter interpolation. ICSLP 1998 | |
| 1997 | ||
| c1 | Michael Zeller, James C. Phillips, A. Dalke, W. Humphrey, Klaus Schulten, Thomas S. Huang, Vladimir Pavlovic, Yunxin Zhao, Zion Lo, Stephen M. Chu, Rajeev Sharma: A Visual Computing Environment for Very Large Scale Biomolecular Modeling. ASAP 1997: 3- | |
Data released under the ODC-BY 1.0 license — See also our legal information page