| 2012 | ||
|---|---|---|
| j56 | Ryoichi Miyazaki, Hiroshi Saruwatari, Kiyohiro Shikano: Theoretical Analysis of Amounts of Musical Noise and Speech Distortion in Structure-Generalized Parametric Blind Spatial Subtraction Array. IEICE Transactions 95-A(2): 586-590 (2012) | |
| j55 | Ryo Wakisaka, Hiroshi Saruwatari, Kiyohiro Shikano, Tomoya Takatani: Speech Prior Estimation for Generalized Minimum Mean-Square Error Short-Time Spectral Amplitude Estimator. IEICE Transactions 95-A(2): 591-595 (2012) | |
| j54 | Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Speaking-aid systems using GMM-based voice conversion for electrolaryngeal speech. Speech Communication 54(1): 134-146 (2012) | |
| j53 | Ryoichi Miyazaki, Hiroshi Saruwatari, Takayuki Inoue, Yu Takahashi, Kiyohiro Shikano, Kazunobu Kondo: Musical-Noise-Free Speech Enhancement Based on Optimized Iterative Spectral Subtraction. IEEE Transactions on Audio, Speech & Language Processing 20(7): 2080-2094 (2012) | |
| j52 | Tomoki Toda, Mikihiro Nakagiri, Kiyohiro Shikano: Statistical Voice Conversion Techniques for Body-Conducted Unvoiced Speech Enhancement. IEEE Transactions on Audio, Speech & Language Processing 20(9): 2505-2517 (2012) | |
| c166 | Ryo Wakisaka, Hiroshi Saruwatari, Kiyohiro Shikano, Tomoya Takatani: Speech kurtosis estimation from observed noisy signal based on generalized Gaussian distribution prior and additivity of cumulants. ICASSP 2012: 4049-4052 | |
| c165 | Kenzo Yamamoto, Tomoki Toda, Hironori Doi, Hiroshi Saruwatari, Kiyohiro Shikano: Statistical approach to voice quality control in esophageal speech enhancement. ICASSP 2012: 4497-4500 | |
| c164 | Ryoichi Miyazaki, Hiroshi Saruwatari, Takayuki Inoue, Kiyohiro Shikano, Kazunobu Kondo: Musical-noise-free speech enhancement: Theory and evaluation. ICASSP 2012: 4565-4568 | |
| c163 | Keigo Kubo, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano: Evaluation of Many-to-Many Alignment Algorithm by Automatic Pronunciation Annotation Using Web Text Mining. INTERSPEECH 2012 | |
| c162 | Haruka Majima, Rafael Torres, Yoko Fujita, Hiromichi Kawanami, Tomoko Matsui, Hiroshi Saruwatari, Kiyohiro Shikano: Spoken Inquiry Discrimination Using Bag-of-Words for Speech-Oriented Guidance System. INTERSPEECH 2012 | |
| c161 | Ryoichi Miyazaki, Hiroshi Saruwatari, Kiyohiro Shikano, Kazunobu Kondo: Musical-noise-free blind speech extraction using ICA-based noise estimation and iterative spectral subtraction. ISSPA 2012: 286-291 | |
| c160 | Suzumi Kanehara, Hiroshi Saruwatari, Ryoichi Miyazaki, Kiyohiro Shikano, Kazunobu Kondo: Theoretical Analysis of Musical Noise Generation in Noise Reduction Methods with Decision-Directed a Priori SNR Estimator. IWAENC 2012 | |
| c159 | Ryoichi Miyazaki, Hiroshi Saruwatari, Kiyohiro Shikano, Kazunobu Kondo: Musical-Noise-Free Blind Speech Extraction Using ICA-Based Noise Estimation with Channel Selection. IWAENC 2012 | |
| 2011 | ||
| j51 | Noriyoshi Kamado, Haruhide Hokari, Shoji Shimada, Hiroshi Saruwatari, Kiyohiro Shikano: Sound Field Reproduction by Wavefront Synthesis Using Directly Aligned Multi Point Control. IEICE Transactions 94-A(3): 907-920 (2011) | |
| j50 | Hiroshi Saruwatari, Y. Ishikawa, Yu Takahashi, Takayuki Inoue, Kiyohiro Shikano, Kazunobu Kondo: Musical Noise Controllable Algorithm of Channelwise Spectral Subtraction and Adaptive Beamforming Based on Higher Order Statistics. IEEE Transactions on Audio, Speech & Language Processing 19(6): 1457-1466 (2011) | |
| j49 | Takayuki Inoue, Hiroshi Saruwatari, Yu Takahashi, Kiyohiro Shikano, Kazunobu Kondo: Theoretical Analysis of Musical Noise in Generalized Spectral Subtraction Based on Higher Order Statistics. IEEE Transactions on Audio, Speech & Language Processing 19(6): 1770-1779 (2011) | |
| c158 | Hiroyuki Nawata, Noriyoshi Kamado, Hiroshi Saruwatari, Kiyohiro Shikano: Automatic musical thumbnailing based on audio object localization and its evaluation. ICASSP 2011: 41-44 | |
| c157 | Noriyoshi Kamado, Hiroshi Saruwatari, Kiyohiro Shikano: Robust sound field reproduction integrating multi-point sound field control and wave field synthesis. ICASSP 2011: 441-444 | |
| c156 | Takayuki Inoue, Hiroshi Saruwatari, Kiyohiro Shikano, Kazunobu Kondo: Theoretical analysis of musical noise in Wiener filtering family via higher-order statistics. ICASSP 2011: 5076-5079 | |
| c155 | Hironori Doi, Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: An evaluation of alaryngeal speech enhancement methods based on voice conversion techniques. ICASSP 2011: 5136-5139 | |
| c154 | Denis Babani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Acoustic model training for non-audible murmur recognition using transformed normal speech data. ICASSP 2011: 5224-5227 | |
| c153 | Ryoichi Miyazaki, Hiroshi Saruwatari, Kiyohiro Shikano: Theoretical Analysis of Musical Noise and Speech Distortion in Structure-Generalized Parametric Blind Spatial Subtraction Array. INTERSPEECH 2011: 341-344 | |
| c152 | Ryo Wakisaka, Hiroshi Saruwatari, Kiyohiro Shikano, Tomoya Takatani: Blind Speech Prior Estimation for Generalized Minimum Mean-Square Error Short-Time Spectral Amplitude Estimator. INTERSPEECH 2011: 361-364 | |
| c151 | Nobuhiko Hattori, Tomoki Toda, Hisashi Kawai, Hiroshi Saruwatari, Kiyohiro Shikano: Speaker-Adaptive Speech Synthesis Based on Eigenvoice Conversion and Language-Dependent Prosodic Conversion in Speech-to-Speech Translation. INTERSPEECH 2011: 2769-2772 | |
| c150 | Hiroshi Saruwatari, Nobuhisa Hirata, Toshiyuki Hatta, Ryo Wakisaka, Kiyohiro Shikano, Tomoya Takatani: Semi-blind speech extraction for robot using visual information and noise statistics. ISSPIT 2011: 264-269 | |
| 2010 | ||
| j48 | Yu Takahashi, Hiroshi Saruwatari, Kiyohiro Shikano, Kazunobu Kondo: Musical-Noise Analysis in Methods of Integrating Microphone Array and Spectral Subtraction Based on Higher-Order Statistics. EURASIP J. Adv. Sig. Proc. 2010 (2010) | |
| j47 | Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Adaptive Training for Voice Conversion Based on Eigenvoices. IEICE Transactions 93-D(6): 1589-1598 (2010) | |
| j46 | Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Evaluation of Extremely Small Sound Source Signals Used in Speaking-Aid System with Statistical Voice Conversion. IEICE Transactions 93-D(7): 1909-1917 (2010) | |
| j45 | Hironori Doi, Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Esophageal Speech Enhancement Based on Statistical Voice Conversion with Gaussian Mixture Models. IEICE Transactions 93-D(9): 2472-2482 (2010) | |
| j44 | Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Improvements of the One-to-Many Eigenvoice Conversion System. IEICE Transactions 93-D(9): 2491-2499 (2010) | |
| j43 | Tatsuya Hirahara, Makoto Otani, Shota Shimizu, Tomoki Toda, Keigo Nakamura, Yoshitaka Nakajima, Kiyohiro Shikano: Silent-speech enhancement using body-conducted vocal-tract resonance signals. Speech Communication 52(4): 301-313 (2010) | |
| j42 | Panikos Heracleous, V.-A. Tran, Takayuki Nagai, Kiyohiro Shikano: Analysis and Recognition of NAM Speech Using HMM Distances and Visual Information. IEEE Transactions on Audio, Speech & Language Processing 18(6): 1528-1538 (2010) | |
| c149 | Hiroshi Saruwatari, Ryoi Okamoto, Yu Takahashi, Kiyohiro Shikano: Blind Speech Extraction Combining Generalized MMSE STSA Estimator and ICA-Based Noise and Speech Probability Density Function Estimations. LVA/ICA 2010: 49-56 | |
| c148 | Jani Even, Hiroshi Saruwatari, Kiyohiro Shikano, Tomoya Takatani: Complex Newton algorithm for blind signal extraction of speech in diffuse noise. ICASSP 2010: 213-216 | |
| c147 | Hironori Doi, Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Statistical approach to enhancing esophageal speech based on Gaussian mixture models. ICASSP 2010: 4250-4253 | |
| c146 | Jani Even, Hiroshi Saruwatari, Kiyohiro Shikano, Tomoya Takatani: Speech enhancement in presence of diffuse background noise: Why using blind signal extraction? ICASSP 2010: 4770-4773 | |
| c145 | Ryoi Okamoto, Yu Takahashi, Hiroshi Saruwatari, Kiyohiro Shikano: MMSE STSA estimator with nonstationary noise estimation based on ICA for high-quality speech enhancement. ICASSP 2010: 4778-4781 | |
| c144 | Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Non-parallel training for many-to-many eigenvoice conversion. ICASSP 2010: 4822-4825 | |
| c143 | Rafael Torres, Shota Takeuchi, Hiromichi Kawanami, Tomoko Matsui, Hiroshi Saruwatari, Kiyohiro Shikano: Comparison of methods for topic classification in a speech-oriented guidance system. INTERSPEECH 2010: 1261-1264 | |
| c142 | Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: The use of air-pressure sensor in electrolaryngeal speech enhancement based on statistical voice conversion. INTERSPEECH 2010: 1628-1631 | |
| c141 | Kumi Ohta, Tomoki Toda, Yamato Ohtani, Hiroshi Saruwatari, Kiyohiro Shikano: Adaptive voice-quality control based on one-to-many eigenvoice conversion. INTERSPEECH 2010: 2158-2161 | |
| c140 | Hiroshi Sawada, Jani Even, Hiroshi Saruwatari, Kiyohiro Shikano, Tomoya Takatani: Improvement of speech recognition performance for spoken-oriented robot dialog system using end-fire array. IROS 2010: 970-975 | |
| 2009 | ||
| j41 | Rajkishore Prasad, Hiroshi Saruwatari, Kiyohiro Shikano: Enhancement of speech signals separated from their convolutive mixture by FDICA algorithm. Digital Signal Processing 19(1): 127-133 (2009) | |
| j40 | Randy Gomez, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Techniques in rapid unsupervised speaker adaptation based on HMM-Sufficient Statistics. Speech Communication 51(1): 42-57 (2009) | |
| j39 | Yu Takahashi, Tomoya Takatani, Keiichi Osako, Hiroshi Saruwatari, Kiyohiro Shikano: Blind Spatial Subtraction Array for Speech Enhancement in Noisy Environment. IEEE Transactions on Audio, Speech & Language Processing 17(4): 650-664 (2009) | |
| c139 | Takashi Hiekata, Takashi Morita, Youhei Ikeda, Hiroshi Hashimoto, Ruoyu Zhang, Yu Takahashi, Hiroshi Saruwatari, Kiyohiro Shikano: Multiple ICA-based real-time blind source extraction applied to handy size microphone. ICASSP 2009: 121-124 | |
| c138 | Yu Takahashi, Yoshihisa Uemura, Hiroshi Saruwatari, Kiyohiro Shikano, Kazunobu Kondo: Musical noise analysis based on higher order statistics for microphone array and nonlinear signal processing. ICASSP 2009: 229-232 | |
| c137 | Shigeki Miyabe, Biing-Hwang Juang, Hiroshi Saruwatari, Kiyohiro Shikano: Kernel-based nonlinear independent component analysis for underdetermined blind source separation. ICASSP 2009: 1641-1644 | |
| c136 | Tomoki Toda, Keigo Nakamura, Hidehiko Sekimoto, Kiyohiro Shikano: Voice conversion for various types of body transmitted speech. ICASSP 2009: 3601-3604 | |
| c135 | Yu Takahashi, Hiroshi Saruwatari, Yuki Fujihara, Kentaro Tachibana, Yoshimitsu Mori, Shigeki Miyabe, Kiyohiro Shikano, Akira Tanaka: Source adaptive blind signal extraction using closed-form ICA for hands-free robot spoken dialogue system. ICASSP 2009: 3681-3684 | |
| c134 | Hiroshi Saruwatari, Hiromichi Kawanami, Shota Takeuchi, Yu Takahashi, Tobias Cincarek, Kiyohiro Shikano: Hands-free speech recognition challenge for real-world speech dialogue systems. ICASSP 2009: 3729-3732 | |
| c133 | Daisuke Miyamoto, Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Acoustic compensation methods for body transmitted speech conversion. ICASSP 2009: 3901-3904 | |
| c132 | Yoshihisa Uemura, Yu Takahashi, Hiroshi Saruwatari, Kiyohiro Shikano, Kazunobu Kondo: Musical noise generation analysis for noise reduction methods based on spectral subtraction and MMSE STSA estimation. ICASSP 2009: 4433-4436 | |
| c131 | Jani Even, Hiroshi Saruwatari, Kiyohiro Shikano: Target Speech Enhancement in Presence of Jammer and Diffuse Background Noise. ICA 2009: 565-572 | |
| c130 | Tomoki Toda, Keigo Nakamura, Takayuki Nagai, Tomomi Kaino, Yoshitaka Nakajima, Kiyohiro Shikano: Technologies for processing body-conducted speech detected with non-audible murmur microphone. INTERSPEECH 2009: 632-635 | |
| c129 | Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Electrolaryngeal speech enhancement based on statistical voice conversion. INTERSPEECH 2009: 1431-1434 | |
| c128 | Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Many-to-many eigenvoice conversion with reference voice. INTERSPEECH 2009: 1623-1626 | |
| c127 | Jani Even, Hiroshi Sawada, Hiroshi Saruwatari, Kiyohiro Shikano, Tomoya Takatani: Semi-blind suppression of internal noise for hands-free robot spoken dialog system. IROS 2009: 658-663 | |
| c126 | Shigeki Miyabe, Keisuke Masatoki, Hiroshi Saruwatari, Kiyohiro Shikano, Toshiyuki Nomura: Temporal quantization of spatial information using directional clustering for multichannel audio coding. WASPAA 2009: 261-264 | |
| 2008 | ||
| j38 | Tobias Cincarek, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Cost Reduction of Acoustic Modeling for Real-Environment Applications Using Unsupervised and Selective Training. IEICE Transactions 91-D(3): 499-507 (2008) | |
| j37 | Tobias Cincarek, Hiromichi Kawanami, Ryuichi Nisimura, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano: Development, Long-Term Operation and Portability of a Real-Environment Speech-Oriented Guidance System. IEICE Transactions 91-D(3): 576-587 (2008) | |
| j36 | Goshu Nagino, Makoto Shozakai, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Building an Effective Speech Corpus by Utilizing Statistical Multidimensional Scaling Method. IEICE Transactions 91-D(3): 607-614 (2008) | |
| j35 | Yuki Yai, Shigeki Miyabe, Hiroshi Saruwatari, Kiyohiro Shikano, Yosuke Tatekura: Rapid Compensation of Temperature Fluctuation Effect for Multichannel Sound Field Reproduction System. IEICE Transactions 91-A(6): 1329-1336 (2008) | |
| j34 | Keiichi Osako, Yoshimitsu Mori, Yu Takahashi, Hiroshi Saruwatari, Kiyohiro Shikano: Fast Convergence Blind Source Separation Using Frequency Subband Interpolation by Null Beamforming. IEICE Transactions 91-A(6): 1357-1361 (2008) | |
| c125 | Jani Even, Hiroshi Saruwatari, Kiyohiro Shikano: Frequency domain semi-blind signal separation: application to the rejection of internal noises. ICASSP 2008: 157-160 | |
| c124 | Yuuki Haraguchi, Shigeki Miyabe, Hiroshi Saruwatari, Kiyohiro Shikano, Toshiyuki Nomura: Source-oriented localization control of stereo audio signals based on blind source separation. ICASSP 2008: 177-180 | |
| c123 | Yuuta Yuyama, Shigeki Miyabe, Hiroshi Saruwatari, Kiyohiro Shikano: Hybrid structure of inverse filtering and DOA-parameterized wavefront synthesis. ICASSP 2008: 401-404 | |
| c122 | Randy Gomez, Jani Even, Hiroshi Saruwatari, Kiyohiro Shikano: Distant talking robust speech recognition using late reflection components of room impulse response. ICASSP 2008: 4581-4584 | |
| c121 | Shota Takeuchi, Tobias Cincarek, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano: Question and answer database optimization using speech recognition results. INTERSPEECH 2008: 451-454 | |
| c120 | Hiroshi Saruwatari, Yu Takahashi, Hiroyuki Sakai, Shota Takeuchi, Tobias Cincarek, Hiromichi Kawanami, Kiyohiro Shikano: Development and evaluation of hands-free spoken dialogue system for railway station guidance. INTERSPEECH 2008: 455-458 | |
| c119 | Takashi Muramatsu, Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Low-delay voice conversion based on maximum likelihood estimation of spectral parameter trajectory. INTERSPEECH 2008: 1076-1079 | |
| c118 | Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: An improved one-to-many eigenvoice conversion system. INTERSPEECH 2008: 1080-1083 | |
| c117 | Randy Gomez, Jani Even, Kiyohiro Shikano: Rapid unsupervised speaker adaptation robust in reverberant environment conditions. INTERSPEECH 2008: 1309-1312 | |
| c116 | Hideki Okamoto, Tomoko Matsui, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano: Speaker verification with non-audible murmur segments by combining global alignment kernel and penalized logistic regression machine. INTERSPEECH 2008: 1369-1372 | |
| c115 | Daisuke Tani, Tomoki Toda, Yamato Ohtani, Hiroshi Saruwatari, Kiyohiro Shikano: Maximum a posteriori adaptation for many-to-one eigenvoice conversion. INTERSPEECH 2008: 1461-1463 | |
| c114 | Keigo Nakamura, Tomoki Toda, Yoshitaka Nakajima, Hiroshi Saruwatari, Kiyohiro Shikano: Evaluation of speaking-aid system with voice conversion for laryngectomees toward its use in practical environments. INTERSPEECH 2008: 2209-2212 | |
| c113 | Yu Takahashi, Hiroshi Saruwatari, Kiyohiro Shikano: Real-time implementation of blind spatial subtraction array for hands-free robot spoken dialogue system. IROS 2008: 1687-1692 | |
| c112 | Jani Even, Hiroshi Saruwatari, Kiyohiro Shikano: An improved permutation solver for blind signal separation based front-ends in robot audition. IROS 2008: 2172-2177 | |
| 2007 | ||
| j33 | Panikos Heracleous, Tomomi Kaino, Hiroshi Saruwatari, Kiyohiro Shikano: Unvoiced Speech Recognition Using Tissue-Conductive Acoustic Sensor. EURASIP J. Adv. Sig. Proc. 2007 (2007) | |
| j32 | Shigeki Miyabe, Yoichi Hinamoto, Hiroshi Saruwatari, Kiyohiro Shikano, Yosuke Tatekura: Interface for Barge-in Free Spoken Dialogue System Based on Sound Field Reproduction and Microphone Array. EURASIP J. Adv. Sig. Proc. 2007 (2007) | |
| j31 | Randy Gomez, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Reducing Computation Time of the Rapid Unsupervised Speaker Adaptation Based on HMM-Sufficient Statistics. IEICE Transactions 90-D(2): 554-561 (2007) | |
| c111 | Tobias Cincarek, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano: Development and portability of ASR and Q&A modules for real-environment speech-oriented guidance systems. ASRU 2007: 520-525 | |
| c110 | Kentaro Tachibana, Hiroshi Saruwatari, Yoshimitsu Mori, Shigeki Miyabe, Kiyohiro Shikano, Akira Tanaka: Efficient Blind Source Separation Combining Closed-Form Second-Order ICA and Nonclosed-Form Higher-Order ICA. ICASSP (1) 2007: 45-48 | |
| c109 | Yu Takahashi, Tomoya Takatani, Hiroshi Saruwatari, Kiyohiro Shikano: Permutation-Robust Structure for ICA-Based Blind Source Extraction. ICASSP (1) 2007: 149-152 | |
| c108 | Randy Gomez, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Rapid unsupervised speaker adaptation using single utterance based on MLLR and speaker selection. INTERSPEECH 2007: 262-265 | |
| c107 | Goshu Nagino, Makoto Shozakai, Kiyohiro Shikano: How to judge reusability of existing speech corpora for target task by utilizing statistical multidimensional scaling. INTERSPEECH 2007: 1302-1305 | |
| c106 | Tobias Cincarek, Izumi Shindo, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Development of preschool children subsystem for ASR and Q&A in a real-environment speech-oriented guidance task. INTERSPEECH 2007: 1469-1472 | |
| c105 | Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Speaker adaptive training for one-to-many eigenvoice conversion based on Gaussian mixture model. INTERSPEECH 2007: 1981-1984 | |
| c104 | Hideki Okamoto, Mariko Kojima, Tomoko Matsui, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano: Study on speaker verification with non-audible murmur segments. INTERSPEECH 2007: 2017-2020 | |
| c103 | Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Impact of various small sound source signals on voice conversion accuracy in speech communication aid for laryngectomees. INTERSPEECH 2007: 2517-2520 | |
| c102 | Yoshimitsu Mori, Tomoya Takatani, Hiroshi Saruwatari, Kiyohiro Shikano, Takashi Hiekata, Takashi Morita: Noise-robust hands-free speech recognition using SIMO-model-based blind source separation. ISSPA 2007: 1-4 | |
| c101 | Yu Takahashi, Tomoya Takatani, Hiroshi Saruwatari, Kiyohiro Shikano: Robust spatial subtraction array with independent component analysis for speech enhancement. ISSPA 2007: 1-4 | |
| c100 | Hiroyuki Sakai, Tobias Cincarek, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano, Akinobu Lee: Voice activity detection applied to hands-free spoken dialogue robot based on decoding using acoustic and language model. ROBOCOMM 2007: 16 | |
| 2006 | ||
| j30 | Yoshimitsu Mori, Hiroshi Saruwatari, Tomoya Takatani, Satoshi Ukai, Kiyohiro Shikano, Takashi Hiekata, Youhei Ikeda, Hiroshi Hashimoto, Takashi Morita: Blind Separation of Acoustic Signals Combining SIMO-Model-Based Independent Component Analysis and Binary Masking. EURASIP J. Adv. Sig. Proc. 2006 (2006) | |
| j29 | Yoshitaka Nakajima, Hideki Kashioka, Nick Campbell, Kiyohiro Shikano: Non-Audible Murmur (NAM) Recognition. IEICE Transactions 89-D(1): 1-4 (2006) | |
| j28 | Shigeki Miyabe, Hiroshi Saruwatari, Kiyohiro Shikano, Yosuke Tatekura: Interface for Barge-in Free Spoken Dialogue System Using Nullspace Based Sound Field Control and Beamforming. IEICE Transactions 89-A(3): 716-726 (2006) | |
| j27 | Tobias Cincarek, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Utterance-Based Selective Training for the Automatic Creation of Task-Dependent Acoustic Models. IEICE Transactions 89-D(3): 962-969 (2006) | |
| j26 | Randy Gomez, Akinobu Lee, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Improving Rapid Unsupervised Speaker Adaptation Based on HMM-Sufficient Statistics in Noisy Environments Using Multi-Template Models. IEICE Transactions 89-D(3): 998-1005 (2006) | |
| j25 | Tomoki Toda, Hisashi Kawai, Minoru Tsuzaki, Kiyohiro Shikano: An evaluation of cost functions sensitively capturing local degradation of naturalness for segment selection in concatenative speech synthesis. Speech Communication 48(1): 45-56 (2006) | |
| j24 | Hiroshi Saruwatari, Toshiya Kawamura, Tsuyoki Nishikawa, Akinobu Lee, Kiyohiro Shikano: Blind source separation based on a fast-convergence algorithm combining ICA and beamforming. IEEE Transactions on Audio, Speech & Language Processing 14(2): 666-678 (2006) | |
| c99 | Yoshimitsu Mori, Hiroshi Saruwatari, Tomoya Takatani, Kiyohiro Shikano, Takashi Hiekata, Takashi Morita: ICA and Binary-Mask-Based Blind Source Separation with Small Directional Microphones. ICA 2006: 649-657 | |
| c98 | Tobias Cincarek, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Acoustic modeling for spoken dialogue systems based on unsupervised utterance-based selective training. INTERSPEECH 2006 | |
| c97 | Mariko Kojima, Tomoko Matsui, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano: Speaker verification with non-audible murmur segments. INTERSPEECH 2006 | |
| c96 | Mikihiro Nakagiri, Tomoki Toda, Hideki Kashioka, Kiyohiro Shikano: Improving body transmitted unvoiced speech with statistical voice conversion. INTERSPEECH 2006 | |
| c95 | Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Speaking aid system for total laryngectomees using voice conversion of body transmitted artificial speech. INTERSPEECH 2006 | |
| c94 | Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation. INTERSPEECH 2006 | |
| c93 | Tomoki Toda, Yamato Ohtani, Kiyohiro Shikano: Eigenvoice conversion based on Gaussian mixture model. INTERSPEECH 2006 | |
| 2005 | ||
| j23 | ||
| j22 | Kazuki Adachi, Tomoki Toda, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano: Designing Target Cost Function Based on Prosody of Speech Database. IEICE Transactions 88-D(3): 519-524 (2005) | |
| j21 | Satoshi Ukai, Tomoya Takatani, Hiroshi Saruwatari, Kiyohiro Shikano, Ryo Mukai, Hiroshi Sawada: Multistage SIMO-Model-Based Blind Source Separation Combining Frequency-Domain ICA and Time-Domain ICA. IEICE Transactions 88-A(3): 642-650 (2005) | |
| j20 | Tatsunori Asai, Hiroshi Saruwatari, Kiyohiro Shikano: Interface for Barge-in Free Spoken Dialogue System Combining Adaptive Sound Field Control and Microphone Array. IEICE Transactions 88-A(6): 1613-1618 (2005) | |
| j19 | Tomoya Takatani, Satoshi Ukai, Tsuyoki Nishikawa, Hiroshi Saruwatari, Kiyohiro Shikano: A Self-Generator Method for Initial Filters of SIMO-ICA Applied to Blind Separation of Binaural Sound Mixtures. IEICE Transactions 88-A(7): 1673-1682 (2005) | |
| j18 | Rajkishore Prasad, Hiroshi Saruwatari, Kiyohiro Shikano: Blind Separation of Speech by Fixed-Point ICA with Source Adaptive Negentropy Approximation. IEICE Transactions 88-A(7): 1683-1692 (2005) | |
| j17 | Yosuke Tatekura, Shigefumi Urata, Hiroshi Saruwatari, Kiyohiro Shikano: On-Line Relaxation Algorithm Applicable to Acoustic Fluctuation for Inverse Filter in Multichannel Sound Reproduction System. IEICE Transactions 88-A(7): 1747-1756 (2005) | |
| j16 | Hiroshi Saruwatari, Hiroaki Yamajo, Tomoya Takatani, Tsuyoki Nishikawa, Kiyohiro Shikano: Blind Separation and Deconvolution for Convolutive Mixture of Speech Combining SIMO-Model-Based ICA and Multichannel Inverse Filtering. IEICE Transactions 88-A(9): 2387-2400 (2005) | |
| j15 | Rajkishore Prasad, Hiroshi Saruwatari, Kiyohiro Shikano: Estimation of Shape Parameter of GGD Function by Negentropy Matching. Neural Processing Letters 22(3): 377-389 (2005) | |
| c92 | Hiroshi Saruwatari, Katsuyuki Sawai, Tsuyoki Nishikawa, Akinobu Lee, Kiyohiro Shikano, Atsunobu Kaminuma, Masao Sakata, Daisuke Saitoh: Speech Enhancement Based on Blind Source Separation in Car Environments. ICDE Workshops 2005: 1205 | |
| c91 | Randy Gomez, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano: Rapid unsupervised speaker adaptation based on multi-template HMM sufficient statistics in noisy environments. INTERSPEECH 2005: 293-296 | |
| c90 | Yoshitaka Nakajima, Hideki Kashioka, Kiyohiro Shikano, Nick Campbell: Remodeling of the sensor for non-audible murmur (NAM). INTERSPEECH 2005: 389-392 | |
| c89 | Ryuichi Nisimura, Akinobu Lee, Masashi Yamada, Kiyohiro Shikano: Operating a public spoken guidance system in real environment. INTERSPEECH 2005: 845-848 | |
| c88 | Tomoki Toda, Kiyohiro Shikano: NAM-to-speech conversion with Gaussian mixture models. INTERSPEECH 2005: 1957-1960 | |
| c87 | Panikos Heracleous, Tomomi Kaino, Hiroshi Saruwatari, Kiyohiro Shikano: Investigating the role of the Lombard reflex in non-audible murmur (NAM) recognition. INTERSPEECH 2005: 2649-2652 | |
| c86 | Panikos Heracleous, Tomomi Kaino, Hiroshi Saruwatari, Kiyohiro Shikano: Applications of NAM microphones in speech recognition for privacy in human-machine communication. INTERSPEECH 2005: 3041-3044 | |
| c85 | Tomoya Takatani, Satoshi Ukai, Tsuyoki Nishikawa, Hiroshi Saruwatari, Kiyohiro Shikano: Blind sound scene decomposition for robot audition using SIMO-model-based ICA. IROS 2005: 2247-2252 | |
| c84 | Hiroshi Saruwatari, Yoshimitsu Mori, Tomoya Takatani, Satoshi Ukai, Kiyohiro Shikano, Takashi Hiekata, Takashi Morita: Two-stage blind source separation based on ICA and binary masking for real-time robot audition system. IROS 2005: 2303-2308 | |
| c83 | Yasuaki Ohashi, Tsuyoki Nishikawa, Hiroshi Saruwatari, Akinobu Lee, Kiyohiro Shikano: Noise-robust hands-free speech recognition based on spatial subtraction array and known noise superimposition. IROS 2005: 2328-2332 | |
| 2004 | ||
| j14 | Rajkishore Prasad, Hiroshi Saruwatari, Kiyohiro Shikano: Negentropy based voice-activity detection for noise estimation in very low SNR condition. IEICE Electronics Express 1(16): 495-500 (2004) | |
| j13 | Panikos Heracleous, Satoshi Nakamura, Kiyohiro Shikano: Simultaneous Recognition of Distant-Talking Speech of Multiple Talkers Based on the 3-D N-Best Search Method. VLSI Signal Processing 36(2-3): 105-116 (2004) | |
| c82 | Satoshi Ukai, Hiroshi Saruwatari, Tomoya Takatani, Kiyohiro Shikano, Ryo Mukai, Hiroshi Sawada: Evaluation of Multistage SIMO-Model-Based Blind Source Separation Combining Frequency-Domain ICA and Time-Domain ICA. ICA 2004: 626-633 | |
| c81 | Rajkishore Prasad, Hiroshi Saruwatari, Kiyohiro Shikano: Single Channel Speech Enhancement: MAP Estimation Using GGD Prior Under Blind Setup. ICA 2004: 873-880 | |
| c80 | Tsuyoki Nishikawa, Hiroshi Saruwatari, Kiyohiro Shikano, Atsunobu Kaminuma: Stable and Low-Distortion Algorithm Based on Overdetermined Blind Separation for Convolutive Mixtures of Speech. ICA 2004: 881-888 | |
| c79 | Tatsunori Asai, Shigeki Miyabe, Hiroshi Saruwatari, Kiyohiro Shikano: Interface for barge-in free spoken dialogue system using adaptive sound field control. INTERSPEECH 2004 | |
| c78 | Randy Gomez, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano: Robust speech recognition with spectral subtraction in low SNR. INTERSPEECH 2004 | |
| c77 | Panikos Heracleous, Yoshitaka Nakajima, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano: Non-audible murmur (NAM) speech recognition using a stethoscopic NAM microphone. INTERSPEECH 2004 | |
| c76 | Tatsuya Kawahara, Akinobu Lee, Kazuya Takeda, Katsunobu Itou, Kiyohiro Shikano: Recent progress of open-source LVCSR engine Julius and Japanese model repository. INTERSPEECH 2004 | |
| c75 | Akinobu Lee, Keisuke Nakamura, Ryuichi Nisimura, Hiroshi Saruwatari, Kiyohiro Shikano: Noise robust real world spoken dialogue system using GMM based rejection of unintended inputs. INTERSPEECH 2004 | |
| c74 | Shinichi Yoshizawa, Kiyohiro Shikano: Rapid EM training based on model-integration. INTERSPEECH 2004 | |
| c73 | Kazuki Adachi, Tomoki Toda, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano: Perceptual Evaluation of Quality Deterioration Owing to Prosody Modification. LREC 2004 | |
| 2003 | ||
| j12 | Hiroshi Saruwatari, Satoshi Kurita, Kazuya Takeda, Fumitada Itakura, Tsuyoki Nishikawa, Kiyohiro Shikano: Blind Source Separation Combining Independent Component Analysis and Beamforming. EURASIP J. Adv. Sig. Proc. 2003(11): 1135-1146 (2003) | |
| j11 | Takanobu Nishiura, Ryousuke Nishioka, Takeshi Yamada, Satoshi Nakamura, Kiyohiro Shikano: Multiple beamforming with source localization based on CSP analysis. Systems and Computers in Japan 34(5): 69-80 (2003) | |
| c72 | Panikos Heracleous, Satoshi Nakamura, Kiyohiro Shikano: A semi-blind source separation method for hands-free speech recognition of multiple talkers. INTERSPEECH 2003 | |
| c71 | Hiromichi Kawanami, Yohei Iwami, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: GMM-based voice conversion applied to emotional speech synthesis. INTERSPEECH 2003 | |
| c70 | Yoshitaka Nakajima, Hideki Kashioka, Kiyohiro Shikano, Nick Campbell: Non-audible murmur recognition. INTERSPEECH 2003 | |
| c69 | Takanobu Nishiura, Satoshi Nakamura, Kazuhiro Miki, Kiyohiro Shikano: Environmental sound source identification based on hidden Markov model for robust speech recognition. INTERSPEECH 2003 | |
| c68 | Tatsuya Shiraishi, Tomoki Toda, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano: Simple designing methods of corpus-based visual speech synthesis. INTERSPEECH 2003 | |
| c67 | Shingo Yamade, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano: Unsupervised speaker adaptation based on HMM sufficient statistics in various noisy environments. INTERSPEECH 2003 | |
| c66 | Hiroaki Yamajo, Hiroshi Saruwatari, Tomoya Takatani, Tsuyoki Nishikawa, Kiyohiro Shikano: Blind separation and deconvolution for convolutive mixture of speech using SIMO-model-based ICA and multichannel inverse filtering. INTERSPEECH 2003 | |
| c65 | Shinichi Yoshizawa, Kiyohiro Shikano: Model-integration rapid training based on maximum likelihood for speech recognition. INTERSPEECH 2003 | |
| 2002 | ||
| j10 | Takeshi Yamada, Satoshi Nakamura, Kiyohiro Shikano: Distant-talking speech recognition based on a 3-D Viterbi search using a microphone array. IEEE Transactions on Speech and Audio Processing 10(2): 48-56 (2002) | |
| c64 | Tomoki Toda, Hisashi Kawai, Minoru Tsuzaki, Kiyohiro Shikano: Unit selection algorithm for Japanese speech synthesis based on both phoneme unit and diphone unit. ICASSP 2002: 465-468 | |
| c63 | Takanobu Nishiura, Satoshi Nakamura, Kiyohiro Shikano: Talker localization in a real acoustic environment based on DOA estimation and statistical sound source identification. ICASSP 2002: 893-896 | |
| c62 | Tsuyoki Nishikawa, Hiroshi Saruwatari, Kiyohiro Shikano: Blind source separation based on Multi-Stage ICA combining frequency-domain ICA and time-domain ICA. ICASSP 2002: 917-920 | |
| c61 | Toshio Hirai, Seiichi Tenpaku, Kiyohiro Shikano: Using start/end timings of spectral transitions between phonemes in concatenative speech synthesis. INTERSPEECH 2002 | |
| c60 | Hiromichi Kawanami, Tsuyoshi Masuda, Tomoki Toda, Kiyohiro Shikano: Designing Japanese speech database covering wide range in prosody for hybrid speech synthesizer. INTERSPEECH 2002 | |
| c59 | Akinobu Lee, Yuichiro Mera, Hiroshi Saruwatari, Kiyohiro Shikano: Selective multi-path acoustic model based on database likelihoods. INTERSPEECH 2002 | |
| c58 | Mikiko Mashimo, Tomoki Toda, Hiromichi Kawanami, Hideki Kashioka, Kiyohiro Shikano, Nick Campbell: Evaluation of cross-language voice conversion using bilingual and non-bilingual databases. INTERSPEECH 2002 | |
| c57 | Takanobu Nishiura, Satoshi Nakamura, Yuka Okada, Takeshi Yamada, Kiyohiro Shikano: Suitable design of adaptive beamformer based on average speech spectrum for noisy speech recognition. INTERSPEECH 2002 | |
| c56 | Hiroshi Saruwatari, Katsuyuki Sawai, Akinobu Lee, Kiyohiro Shikano, Atsunobu Kaminuma, Masao Sakata: Speech enhancement in car environment using blind source separation. INTERSPEECH 2002 | |
| c55 | Shingo Yamade, Kanako Matsunami, Akira Baba, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano: Spectral subtraction in noisy environments applied to speaker adaptation based on HMM sufficient statistics. INTERSPEECH 2002 | |
| c54 | Hiromichi Kawanami, Tsuyoshi Masuda, Tomoki Toda, Kiyohiro Shikano: Designing speech database with prosodic variety for expressive TTS system. LREC 2002 | |
| c53 | Akinobu Lee, Tatsuya Kawahara, Kazuya Takeda, Masato Mimura, Atsushi Yamada, Akinori Ito, Katsunobu Itou, Kiyohiro Shikano: Continuous Speech Recognition Consortium - an Open Repository for CSR Tools and Models. LREC 2002 | |
| 2001 | ||
| j9 | Tetsuya Takiguchi, Satoshi Nakamura, Kiyohiro Shikano: HMM-separation-based speech recognition for a distant moving speaker. IEEE Transactions on Speech and Audio Processing 9(2): 127-140 (2001) | |
| c52 | Ken'ichi Kumatani, Satoshi Nakamura, Kiyohiro Shikano: An Adaptive Integration Based On Product HMM For Audio-Visual Speech Recognition. ICME 2001 | |
| c51 | Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: High quality voice conversion based on Gaussian mixture model with dynamic frequency warping. INTERSPEECH 2001: 349-352 | |
| c50 | Mikiko Mashimo, Tomoki Toda, Kiyohiro Shikano, Nick Campbell: Evaluation of cross-language voice conversion based on GMM and STRAIGHT. INTERSPEECH 2001: 361-364 | |
| c49 | Miichi Yamada, Akira Baba, Shinichi Yoshizawa, Yuichiro Mera, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano: Unsupervised noisy environment adaptation algorithm using MLLR and speaker selection. INTERSPEECH 2001: 869-872 | |
| c48 | Shinichi Yoshizawa, Akira Baba, Kanako Matsunami, Yuichiro Mera, Miichi Yamada, Akinobu Lee, Kiyohiro Shikano: Evaluation on unsupervised speaker adaptation based on sufficient HMM statistics of selected speakers. INTERSPEECH 2001: 1219-1222 | |
| c47 | Akira Baba, Shinichi Yoshizawa, Miichi Yamada, Akinobu Lee, Kiyohiro Shikano: Elderly acoustic model for large vocabulary continuous speech recognition. INTERSPEECH 2001: 1657-1660 | |
| c46 | Akinobu Lee, Tatsuya Kawahara, Kiyohiro Shikano: Julius - an open source real-time large vocabulary recognition engine. INTERSPEECH 2001: 1691-1694 | |
| c45 | Ryuichi Nisimura, Kumiko Komatsu, Yuka Kuroda, Kentaro Nagatomo, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano: Automatic n-gram language model creation from web resources. INTERSPEECH 2001: 2127-2130 | |
| c44 | Hiroshi Saruwatari, Toshiya Kawamura, Kiyohiro Shikano: Blind source separation for speech based on fast-convergence algorithm with ICA and beamforming. INTERSPEECH 2001: 2603-2606 | |
| c43 | Takanobu Nishiura, Satoshi Nakamura, Kiyohiro Shikano: Statistical sound source identification in a real acoustic environment for robust speech recognition using a microphone array. INTERSPEECH 2001: 2611-2614 | |
| 2000 | ||
| j8 | Tetsuya Takiguchi, Satoshi Nakamura, Kiyohiro Shikano: Model adaptation by HMM decomposition and composition in noisy reverberant environments. Systems and Computers in Japan 31(5): 77-85 (2000) | |
| c42 | Kiyotsugu Kakihara, Satoshi Nakamura, Kiyohiro Shikano: Speech-to-Face Movement Synthesis based on HMMs. IEEE International Conference on Multimedia and Expo (I) 2000: 427- | |
| c41 | Satoshi Nakamura, Hidetoshi Ito, Kiyohiro Shikano: Stream weight optimization of speech and lip image sequence for audio-visual speech recognition. INTERSPEECH 2000: 20-24 | |
| c40 | Hiroshi Saruwatari, Satoshi Kurita, Kazuya Takeda, Fumitada Itakura, Kiyohiro Shikano: Blind source separation based on subband ICA and beamforming. INTERSPEECH 2000: 94-97 | |
| c39 | Tomoki Toda, Jinlin Lu, Hiroshi Saruwatari, Kiyohiro Shikano: STRAIGHT-based voice conversion algorithm based on Gaussian mixture model. INTERSPEECH 2000: 279-282 | |
| c38 | Toshio Hirai, Seiichi Tenpaku, Kiyohiro Shikano: Manipulating speech pitch periods according to optimal insertion/deletion position in residual signal for intonation control in speech synthesis. INTERSPEECH 2000: 330-333 | |
| c37 | Tatsuya Kawahara, Akinobu Lee, Tetsunori Kobayashi, Kazuya Takeda, Nobuaki Minematsu, Shigeki Sagayama, Katsunobu Itou, Akinori Ito, Mikio Yamamoto, Atsushi Yamada, Takehito Utsuro, Kiyohiro Shikano: Free software toolkit for Japanese large vocabulary continuous speech recognition. INTERSPEECH 2000: 476-479 | |
| c36 | Parham Zolfaghari, Yoshinori Atake, Kiyohiro Shikano, Hideki Kawahara: Investigation of analysis and synthesis parameters of STRAIGHT by subjective evaluation. INTERSPEECH 2000: 498-501 | |
| c35 | Yoshinori Atake, Toshio Irino, Hideki Kawahara, Jinlin Lu, Satoshi Nakamura, Kiyohiro Shikano: Robust fundamental frequency estimation using instantaneous frequencies of harmonic components. INTERSPEECH 2000: 907-910 | |
| c34 | Katsunobu Itou, Kiyohiro Shikano, Tatsuya Kawahara, Kazuya Takeda, Atsushi Yamada, Akinori Ito, Takehito Utsuro, Tetsunori Kobayashi, Nobuaki Minematsu, Mikio Yamamoto, Shigeki Sagayama, Akinobu Lee: IPA Japanese Dictation Free Software Project. LREC 2000 | |
| 1999 | ||
| c33 | Panikos Heracleous, Takeshi Yamada, Satoshi Nakamura, Kiyohiro Shikano: Simultaneous recognition of multiple sound sources based on 3-D N-best search using microphone array. EUROSPEECH 1999 | |
| 1998 | ||
| j7 | Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano: Lip movement synthesis from speech based on Hidden Markov Models. Speech Communication 26(1-2): 105-115 (1998) | |
| c32 | Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano: Lip Movement Synthesis from Speech Based on Hidden Markov Models. FG 1998: 154-159 | |
| c31 | Alexandre Girardi, Kiyohiro Shikano, Satoshi Nakamura: Creating speaker independent HMM models for restricted database using STRAIGHT-TEMPO morphing. ICSLP 1998 | |
| c30 | Katunobu Itou, Mikio Yamamoto, Kazuya Takeda, Toshiyuki Takezawa, Tatsuo Matsuoka, Tetsunori Kobayashi, Kiyohiro Shikano, Shuichi Itahashi: The design of the newspaper-based Japanese large vocabulary continuous speech recognition corpus. ICSLP 1998 | |
| c29 | Tatsuya Kawahara, Tetsunori Kobayashi, Kazuya Takeda, Nobuaki Minematsu, Katsunobu Itou, Mikio Yamamoto, Atsushi Yamada, Takehito Utsuro, Kiyohiro Shikano: Sharable software repository for Japanese large vocabulary continuous speech recognition. ICSLP 1998 | |
| c28 | Tetsuya Takiguchi, Satoshi Nakamura, Kiyohiro Shikano, Masatoshi Morishima, Toshihiro Isobe: Evaluation of model adaptation by HMM decomposition on telephone speech recognition. ICSLP 1998 | |
| c27 | Takeshi Yamada, Satoshi Nakamura, Kiyohiro Shikano: An effect of adaptive beamforming on hands-free speech recognition based on 3-D Viterbi search. ICSLP 1998 | |
| c26 | Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano: Speech-to-lip movement synthesis based on the EM algorithm using audio-visual HMMs. ICSLP 1998 | |
| c25 | Norimichi Yodo, Kiyohiro Shikano, Satoshi Nakamura: Compression algorithm of trigram language models based on maximum likelihood estimation. ICSLP 1998 | |
| 1997 | ||
| c24 | Alexandre Girardi, Harald Singer, Kiyohiro Shikano, Satoshi Nakamura: Maximum likelihood successive state splitting algorithm for tied-mixture HMNET. EUROSPEECH 1997 | |
| c23 | Masaaki Inoue, Satoshi Nakamura, Takeshi Yamada, Kiyohiro Shikano: Microphone array design measures for hands-free speech recognition. EUROSPEECH 1997 | |
| c22 | Satoshi Nakamura, Ron Nagai, Kiyohiro Shikano: Improved bimodal speech recognition using tied-mixture HMMs and 5000 word audio-visual synchronous database. EUROSPEECH 1997 | |
| c21 | Satoshi Nakamura, Kiyohiro Shikano: Room acoustics and reverberation: impact on hands-free recognition. EUROSPEECH 1997 | |
| c20 | Makoto Shozakai, Satoshi Nakamura, Kiyohiro Shikano: A non-iterative model-adaptive e-CMN/PMC approach for speech recognition in car environments. EUROSPEECH 1997 | |
| 1996 | ||
| c19 | Takeshi Yamada, Satoshi Nakamura, Kiyohiro Shikano: Robust speech recognition with speaker localization by a microphone array. ICSLP 1996 | |
| c18 | Tadashi Yonezaki, Kiyohiro Shikano: Entropy coded vector quantization with hidden Markov models. ICSLP 1996 | |
| 1995 | ||
| j6 | Osamu Yoshioka, Yasuhiro Minami, Kiyohiro Shikano: A Speech Dialogue System with Multimodal Interface for Telephone Directory Assistance. IEICE Transactions 78-D(6): 616-621 (1995) | |
| j5 | Satoshi Takahashi, Yasuhiro Minami, Kiyohiro Shikano: An HMM State Duration Control Algorithm Applied to Large-Vocabulary Spontaneous Speech Recognition. IEICE Transactions 78-D(6): 648-653 (1995) | |
| 1994 | ||
| j4 | Kiyohiro Shikano, Tomokazu Yamada, Takeshi Kawabata, Shoichi Matsunaga, Sadaoki Furui, Toshiyuki Hanazawa: Dictation Machine Based on Japanese Character Source Modeling. IJPRAI 8(1): 181-196 (1994) | |
| j3 | Yasuhiro Minami, Kiyohiro Shikano, Satoshi Takahashi, Tomokazu Yamada, Osamu Yoshioka, Sadaoki Furui: Large-vocabulary continuous speech recognition algorithm applied to a multi-modal telephone directory assistance system. Speech Communication 15(3-4): 301-310 (1994) | |
| c17 | Satoshi Takahashi, Yasuhiro Minami, Kiyohiro Shikano: An HMM duration control algorithm with a low computational cost. ICSLP 1994 | |
| c16 | Osamu Yoshioka, Yasuhiro Minami, Kiyohiro Shikano: A multi-modal dialogue system for telephone directory assistance. ICSLP 1994 | |
| c15 | Yasuhiro Minami, Kiyohiro Shikano, Osamu Yoshioka, Satoshi Takahashi, Tomokazu Yamada, Sadaoki Furui: A Large-Vocabulary Continuous Speech Recognition Algorithm and its Application to a Multi-Modal Telephone Directory Assistance System. HLT 1994 | |
| 1993 | ||
| c14 | Franck Martin, Kiyohiro Shikano, Yasuhiro Minami: Recognition of noisy speech by composition of hidden Markov models. EUROSPEECH 1993 | |
| c13 | Shoichi Matsunaga, Tomokazu Yamada, Kiyohiro Shikano: Dictation system using inductively auto-generated syntax. EUROSPEECH 1993 | |
| c12 | Yasuhiro Minami, Kiyohiro Shikano, Tomokazu Yamada, Tatsuo Matsuoka: Very-large-vocabulary continuous speech recognition algorithm for telephone directory assistance. EUROSPEECH 1993 | |
| 1992 | ||
| c11 | Shoichi Matsunaga, Toshiaki Tsuboi, Tomokazu Yamada, Kiyohiro Shikano: Continuous speech recognition for medical diagnoses using a character trigram model. ICSLP 1992 | |
| c10 | Tatsuo Matsuoka, Kiyohiro Shikano: Speaker adaptation by modifying mixture coefficients of speaker-independent mixture Gaussian HMMs. ICSLP 1992 | |
| c9 | Yasuhiro Minami, Tatsuo Matsuoka, Kiyohiro Shikano: Phoneme HMM evaluation algorithm without phoneme labeling. ICSLP 1992 | |
| c8 | Akito Nagai, Kenji Kita, Toshiyuki Hanazawa, Tadashi Suzuki, Tomohiro Iwasaki, Tsuyoshi Kawabata, Kunio Nakajima, Kiyohiro Shikano, Tsuyoshi Morimoto, Shigeki Sagayama, Akira Kurematsu: Hardware implementation of realtime 1000-word HMM-LR continuous speech recognition. ICSLP 1992 | |
| 1991 | ||
| j2 | Akira Kurematsu, Hitoshi Iida, Tsuyoshi Morimoto, Kiyohiro Shikano: Language processing in connection with speech translation at ATR interpreting telephony research laboratories. Speech Communication 10(1): 1-9 (1991) | |
| 1990 | ||
| j1 | Akira Kurematsu, Kazuya Takeda, Yoshinori Sagisaka, Shigeru Katagiri, Hisao Kuwabara, Kiyohiro Shikano: ATR Japanese speech database as a tool of speech recognition and synthesis. Speech Communication 9(4): 357-363 (1990) | |
| c7 | Masami Nakamura, Katsuteru Maruyama, Takeshi Kawabata, Kiyohiro Shikano: Neural Network Approach To Word Category Prediction For English Texts. COLING 1990: 213-218 | |
| c6 | Hiroaki Hattori, Satoshi Nakamura, Kiyohiro Shikano, Shigeki Sagayama: Speaker weighted training of HMM using multiple reference speakers. ICSLP 1990 | |
| c5 | Takeshi Kawabata, Toshiyuki Hanazawa, Katsunobu Itou, Kiyohiro Shikano: Japanese phonetic typewriter using HMM phone units and syllable trigrams. ICSLP 1990 | |
| c4 | Yasuhiro Minami, Toshiyuki Hanazawa, Hitoshi Iwamida, Erik McDermott, Kiyohiro Shikano, Shigeru Katagiri, Masaona Kagawa: On the robustness of HMM and ANN speech recognition algorithms. ICSLP 1990 | |
| c3 | Tsuyoshi Morimoto, Kiyohiro Shikano, Hitoshi Iida, Akira Kurematsu: Integration of speech recognition and language processing in spoken language translation system (SL-TRANS). ICSLP 1990 | |
| 1989 | ||
| c2 | Yasuhiro Komori, Kaichiro Hatazaki, Takaharu Tanaka, Takeshi Kawabata, Kiyohiro Shikano: Phoneme recognition expert system using spectrogram reading knowledge and neural networks. EUROSPEECH 1989: 2549-2552 | |
| c1 | Patrick Haffner, Alex Waibel, H. Sawai, Kiyohiro Shikano: Fast back-propagation learning methods for large phonemic neural networks. EUROSPEECH 1989: 2553-2556 | |