| 2012 | ||
|---|---|---|
| j4 | Zhen-Hua Ling, Li-Rong Dai: Minimum Kullback-Leibler Divergence Parameter Generation for HMM-Based Speech Synthesis. IEEE Transactions on Audio, Speech & Language Processing 20(5): 1492-1502 (2012) | |
| c50 | Bing Jiang, Yan Song, Wu Guo, Li-Rong Dai: Exemplar-Based Sparse Representation for Language Recognition on I-Vectors. INTERSPEECH 2012 | |
| c49 | Xiang Yin, Zhen-Hua Ling, Ming Lei, Li-Rong Dai: Considering Global Variance of the Log Power Spectrum Derived from Mel-Cepstrum in HMM-based Parametric Speech Synthesis. INTERSPEECH 2012 | |
| c48 | Xin Wang, Zhen-Hua Ling, Li-Rong Dai: Cross-stream dependency modeling using continuous F0 model for HMM-based speech synthesis. ISCSLP 2012: 84-87 | |
| c47 | Yong Xu, Wu Guo, Shan Su, Li-Rong Dai: Spoken term detection for OOV terms based on triphone confusion matrix. ISCSLP 2012: 98-102 | |
| c46 | Xian-Jun Xia, Zhen-Hua Ling, Chen-Yu Yang, Li-Rong Dai: Improved unit selection speech synthesis method utilizing subjective evaluation results on synthetic speech. ISCSLP 2012: 160-164 | |
| c45 | Kui Wu, Yan Song, Wu Guo, Li-Rong Dai: Intra-conversation intra-speaker variability compensation for speaker clustering. ISCSLP 2012: 330-334 | |
| c44 | Yong Xu, Wu Guo, Li-Rong Dai: A hybrid fragment / syllable-based system for improved OOV term detection. ISCSLP 2012: 378-382 | |
| 2011 | ||
| c43 | Yanhua Long, Zhi-Jie Yan, Frank K. Soong, Li-Rong Dai, Wu Guo: Speaker characterization using spectral subband energy ratio based on Harmonic plus Noise Model. ICASSP 2011: 4520-4523 | |
| c42 | Ming Lei, Zhen-Hua Ling, Li-Rong Dai: Preserve ordering property of generated LSPS for minimum generation error training in HMM-based speech synthesis. ICASSP 2011: 4712-4715 | |
| c41 | Eryu Wang, Kong-Aik Lee, Bin Ma, Haizhou Li, Wu Guo, Li-Rong Dai: Factored covariance modeling for text-independent speaker verification. ICASSP 2011: 4856-4859 | |
| c40 | Ling-Hui Chen, Zhen-Hua Ling, Li-Rong Dai: Non-parallel training for voice conversion based on FT-GMM. ICASSP 2011: 5116-5119 | |
| c39 | Heng Lu, Zhen-Hua Ling, Li-Rong Dai, Ren-Hua Wang: Building HMM based unit-selection speech synthesis system using synthetic speech naturalness evaluation score. ICASSP 2011: 5352-5355 | |
| c38 | Yanhua Long, Zhi-Jie Yan, Frank K. Soong, Li-Rong Dai, Wu Guo: Improvements in Speaker Characterization Using Spectral Subband Energy Based on Harmonic plus Noise Model. INTERSPEECH 2011: 373-376 | |
| c37 | Ling-Hui Chen, Yoshihiko Nankaku, Heiga Zen, Keiichi Tokuda, Zhen-Hua Ling, Li-Rong Dai: Estimation of Window Coefficients for Dynamic Feature Extraction for HMM-Based Speech Synthesis. INTERSPEECH 2011: 1801-1804 | |
| c36 | Ming Lei, Junichi Yamagishi, Korin Richmond, Zhen-Hua Ling, Simon King, Li-Rong Dai: Formant-Controlled HMM-Based Speech Synthesis. INTERSPEECH 2011: 2777-2780 | |
| 2010 | ||
| j3 | Heng Lu, Zhen-Hua Ling, Li-Rong Dai, Ren-Hua Wang: Cross-Validation and Minimum Generation Error based Decision Tree Pruning for HMM-based Speech Synthesis. IJCLCLP 15(1) (2010) | |
| c35 | Ming Lei, Zhen-Hua Ling, Li-Rong Dai: Minimum generation error training with weighted Euclidean distance on LSP for HMM-based speech synthesis. ICASSP 2010: 4230-4233 | |
| c34 | Wu Guo, Zhao Zhang, Yanhua Long, Li-Rong Dai: N-gram nearest neighbor algorithm for voice password system. ICASSP 2010: 4438-4441 | |
| c33 | Jun Du, Yu Hu, Li-Rong Dai, Ren-Hua Wang: HMM-based pseudo-clean speech synthesis for splice algorithm. ICASSP 2010: 4570-4573 | |
| c32 | Cong Liu, Yu Hu, Hui Jiang, Li-Rong Dai: A bounded trust region optimization for discriminative training of HMMS in speech recognition. ICASSP 2010: 4914-4917 | |
| c31 | Yan Song, Qi Tian, Mengyue Wang, Heng Liu, Li-Rong Dai: Multiple instance learning using visual phrases for object classification. ICME 2010: 649-654 | |
| c30 | Heng Lu, Zhen-Hua Ling, Si Wei, Li-Rong Dai, Ren-Hua Wang: Automatic error detection for unit selection speech synthesis using log likelihood ratio based SVM classifier. INTERSPEECH 2010: 162-165 | |
| c29 | Zhiwei Shuang, Shiyin Kang, Yong Qin, Li-Rong Dai, Lianhong Cai: HMM based TTS for mixed language text. INTERSPEECH 2010: 618-621 | |
| c28 | Zhen-Hua Ling, Yu Hu, Li-Rong Dai: Global variance modeling on the log power spectrum of LSPs for HMM-based speech synthesis. INTERSPEECH 2010: 825-828 | |
| c27 | Eryu Wang, Kong-Aik Lee, Bin Ma, Haizhou Li, Wu Guo, Li-Rong Dai: The estimation and kernel metric of spectral correlation for text-independent speaker verification. INTERSPEECH 2010: 1065-1068 | |
| c26 | Yanhua Long, Li-Rong Dai, Bin Ma, Wu Guo: Effects of the phonological relevance in speaker verification. INTERSPEECH 2010: 2130-2133 | |
| c25 | Ming Lei, Yi-Jian Wu, Frank K. Soong, Zhen-Hua Ling, Li-Rong Dai: A hierarchical F0 modeling method for HMM-based speech synthesis. INTERSPEECH 2010: 2170-2173 | |
| 2009 | ||
| j2 | Meng Wang, Xian-Sheng Hua, Tao Mei, Richang Hong, Guo-Jun Qi, Yan Song, Li-Rong Dai: Semi-supervised kernel density estimation for video annotation. Computer Vision and Image Understanding 113(3): 384-396 (2009) | |
| c24 | Heng Lu, Yi-Jian Wu, Keiichi Tokuda, Li-Rong Dai, Ren-Hua Wang: Full covariance state duration modeling for HMM-based speech synthesis. ICASSP 2009: 4033-4036 | |
| c23 | Haizhou Li, Bin Ma, Kong-Aik Lee, Hanwu Sun, Donglai Zhu, Khe Chai Sim, Changhuai You, Rong Tong, Ismo Kärkkäinen, Chien-Lin Huang, Vladimir Pervouchine, Wu Guo, Yijie Li, Li-Rong Dai, Mohaddeseh Nosratighods, Tharmarajah Thiruvaran, Julien Epps, Eliathamby Ambikairajah, Chng Eng Siong, Tanja Schultz, Qin Jin: The I4U system in NIST 2008 speaker recognition evaluation. ICASSP 2009: 4201-4204 | |
| c22 | Wu Guo, Yanhua Long, Yijie Li, Lei Pan, Eryu Wang, Li-Rong Dai: iFLY system for the NIST 2008 speaker recognition evaluation. ICASSP 2009: 4209-4212 | |
| c21 | Yanhua Long, Bin Ma, Haizhou Li, Wu Guo, Chng Eng Siong, Li-Rong Dai: Exploiting prosodic information for Speaker Recognition. ICASSP 2009: 4225-4228 | |
| c20 | Yan Song, Li-Rong Dai, Ren-Hua Wang: An automatic language identification method based on subspace analysis. ICME 2009: 598-601 | |
| c19 | Cheng-Cheng Wang, Zhen-Hua Ling, Li-Rong Dai: Asynchronous F0 and spectrum modeling for HMM-based speech synthesis. INTERSPEECH 2009: 404-407 | |
| 2008 | ||
| c18 | Long Qin, Yi-Jian Wu, Zhen-Hua Ling, Ren-Hua Wang, Li-Rong Dai: Minumum generation error linear regression based model adaptation for HMM-based speech synthesis. ICASSP 2008: 3953-3956 | |
| c17 | Long Qin, Yi-Jian Wu, Zhen-Hua Ling, Ren-Hua Wang, Li-Rong Dai: Minimum generation error criterion considering global/local variance for HMM-based speech synthesis. ICASSP 2008: 4621-4624 | |
| 2007 | ||
| j1 | Meng Wang, Xian-Sheng Hua, Tao Mei, Jinhui Tang, Guo-Jun Qi, Yan Song, Li-Rong Dai: Interactive Video Annotation by Multi-Concept Multi-Modality Active Learning. Int. J. Semantic Computing 1(4): 459-477 (2007) | |
| c16 | Wu Guo, Lei Pan, Ren-Hua Wang, Li-Rong Dai: Angle of Models Distance as Test Algorithm in Speaker Verification. FSKD (4) 2007: 231-234 | |
| c15 | Meng Wang, Xian-Sheng Hua, Yan Song, Li-Rong Dai, Ren-Hua Wang: An Interactive Video Annotation Frameowrk with Multiple Modalities. ICASSP (1) 2007: 957-960 | |
| c14 | Meng Wang, Xian-Sheng Hua, Yan Song, Richang Hong, Li-Rong Dai: Lazy Learning Based Efficient Video Annotation. ICME 2007: 607-610 | |
| c13 | Meng Wang, Xian-Sheng Hua, Xun Yuan, Yan Song, Li-Rong Dai: Multi-Graph Semi-Supervised Learning for Video Semantic Feature Extraction. ICME 2007: 1978-1981 | |
| c12 | Meng Wang, Tao Mei, Xun Yuan, Yan Song, Li-Rong Dai: Video annotation by graph-based learning with neighborhood similarity. ACM Multimedia 2007: 325-328 | |
| c11 | Meng Wang, Xian-Sheng Hua, Xun Yuan, Yan Song, Li-Rong Dai: Optimizing multi-graph learning: towards a unified video annotation scheme. ACM Multimedia 2007: 862-871 | |
| c10 | Meng Wang, Xian-Sheng Hua, Yan Song, Wei Lai, Li-Rong Dai, Ren-Hua Wang: An Efficient Automatic Video Shot Size Annotation Scheme. MMM (1) 2007: 649-658 | |
| c9 | Meng Wang, Xian-Sheng Hua, Yan Song, Jinhui Tang, Li-Rong Dai: RMulti-Concept Multi-Modality Active Learning for Interactive Video Annotation. ICSC 2007: 321-328 | |
| 2006 | ||
| c8 | Meng Wang, Xian-Sheng Hua, Yan Song, Li-Rong Dai, HongJiang Zhang: Semi-Supervised Kernel Regression. ICDM 2006: 1130-1135 | |
| c7 | Yan Song, Guo-Jun Qi, Xian-Sheng Hua, Li-Rong Dai, Ren-Hua Wang: Video Annotation by Active Learning and Semi-Supervised Ensembling. ICME 2006: 933-936 | |
| c6 | Meng Wang, Xian-Sheng Hua, Li-Rong Dai, Yan Song: Enhanced Semi-Supervised Learning for Automatic Video Annotation. ICME 2006: 1485-1488 | |
| c5 | Meng Wang, Xian-Sheng Hua, Yan Song, Li-Rong Dai, Shipeng Li: Automatic video annotation based on co-adaptation and label correction. ISCAS 2006 | |
| c4 | Yan Song, Xian-Sheng Hua, Guo-Jun Qi, Li-Rong Dai, Meng Wang, HongJiang Zhang: Efficient semantic annotation method for indexing large personal video database. Multimedia Information Retrieval 2006: 289-296 | |
| 2005 | ||
| c3 | Yan Song, Xian-Sheng Hua, Li-Rong Dai, Meng Wang: Semi-automatic video annotation based on active learning with multiple complementary predictors. Multimedia Information Retrieval 2005: 97-104 | |
| 2004 | ||
| c2 | Wei Lai, Xiaodong Gu, Ren-Hua Wang, Li-Rong Dai, HongJiang Zhang: A region based multiple frame-rate tradeoff of video streaming. ICIP 2004: 2067-2070 | |
| c1 | Wei Lai, Xiaodong Gu, Ren-Hua Wang, Li-Rong Dai, HongJiang Zhang: Perceptual Video Streaming by Adaptive Spatial-temporal Scalability. PCM (2) 2004: 431-438 | |
Data released under the ODC-BY 1.0 license — See also our legal information page