| 2013 | ||
|---|---|---|
| j42 | Frédéric Mustière, Martin Bouchard, Hossein Najaf-Zadeh, Ramin Pichevar, Louis Thibault, Hiroshi Saruwatari: Design of multichannel frequency domain statistical-based enhancement systems preserving spatial cues via spectral distances minimization. Signal Processing 93(1): 321-325 (2013) | |
| 2012 | ||
| j41 | Ryoichi Miyazaki, Hiroshi Saruwatari, Kiyohiro Shikano: Theoretical Analysis of Amounts of Musical Noise and Speech Distortion in Structure-Generalized Parametric Blind Spatial Subtraction Array. IEICE Transactions 95-A(2): 586-590 (2012) | |
| j40 | Ryo Wakisaka, Hiroshi Saruwatari, Kiyohiro Shikano, Tomoya Takatani: Speech Prior Estimation for Generalized Minimum Mean-Square Error Short-Time Spectral Amplitude Estimator. IEICE Transactions 95-A(2): 591-595 (2012) | |
| j39 | Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Speaking-aid systems using GMM-based voice conversion for electrolaryngeal speech. Speech Communication 54(1): 134-146 (2012) | |
| j38 | Ryoichi Miyazaki, Hiroshi Saruwatari, Takayuki Inoue, Yu Takahashi, Kiyohiro Shikano, Kazunobu Kondo: Musical-Noise-Free Speech Enhancement Based on Optimized Iterative Spectral Subtraction. IEEE Transactions on Audio, Speech & Language Processing 20(7): 2080-2094 (2012) | |
| c106 | Ryo Wakisaka, Hiroshi Saruwatari, Kiyohiro Shikano, Tomoya Takatani: Speech kurtosis estimation from observed noisy signal based on generalized Gaussian distribution prior and additivity of cumulants. ICASSP 2012: 4049-4052 | |
| c105 | Kenzo Yamamoto, Tomoki Toda, Hironori Doi, Hiroshi Saruwatari, Kiyohiro Shikano: Statistical approach to voice quality control in esophageal speech enhancement. ICASSP 2012: 4497-4500 | |
| c104 | Ryoichi Miyazaki, Hiroshi Saruwatari, Takayuki Inoue, Kiyohiro Shikano, Kazunobu Kondo: Musical-noise-free speech enhancement: Theory and evaluation. ICASSP 2012: 4565-4568 | |
| c103 | Keigo Kubo, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano: Evaluation of Many-to-Many Alignment Algorithm by Automatic Pronunciation Annotation Using Web Text Mining. INTERSPEECH 2012 | |
| c102 | Haruka Majima, Rafael Torres, Yoko Fujita, Hiromichi Kawanami, Tomoko Matsui, Hiroshi Saruwatari, Kiyohiro Shikano: Spoken Inquiry Discrimination Using Bag-of-Words for Speech-Oriented Guidance System. INTERSPEECH 2012 | |
| c101 | Ryoichi Miyazaki, Hiroshi Saruwatari, Kiyohiro Shikano, Kazunobu Kondo: Musical-noise-free blind speech extraction using ICA-based noise estimation and iterative spectral subtraction. ISSPA 2012: 286-291 | |
| c100 | Suzumi Kanehara, Hiroshi Saruwatari, Ryoichi Miyazaki, Kiyohiro Shikano, Kazunobu Kondo: Theoretical Analysis of Musical Noise Generation in Noise Reduction Methods with Decision-Directed a Priori SNR Estimator. IWAENC 2012 | |
| c99 | Ryoichi Miyazaki, Hiroshi Saruwatari, Kiyohiro Shikano, Kazunobu Kondo: Musical-Noise-Free Blind Speech Extraction Using ICA-Based Noise Estimation with Channel Selection. IWAENC 2012 | |
| 2011 | ||
| j37 | Noriyoshi Kamado, Haruhide Hokari, Shoji Shimada, Hiroshi Saruwatari, Kiyohiro Shikano: Sound Field Reproduction by Wavefront Synthesis Using Directly Aligned Multi Point Control. IEICE Transactions 94-A(3): 907-920 (2011) | |
| j36 | Hiroshi Saruwatari, Y. Ishikawa, Yu Takahashi, Takayuki Inoue, Kiyohiro Shikano, Kazunobu Kondo: Musical Noise Controllable Algorithm of Channelwise Spectral Subtraction and Adaptive Beamforming Based on Higher Order Statistics. IEEE Transactions on Audio, Speech & Language Processing 19(6): 1457-1466 (2011) | |
| j35 | Takayuki Inoue, Hiroshi Saruwatari, Yu Takahashi, Kiyohiro Shikano, Kazunobu Kondo: Theoretical Analysis of Musical Noise in Generalized Spectral Subtraction Based on Higher Order Statistics. IEEE Transactions on Audio, Speech & Language Processing 19(6): 1770-1779 (2011) | |
| c98 | Shunta Ishii, Tomoki Toda, Hiroshi Saruwatari, Sakriani Sakti, Satoshi Nakamura: Blind noise suppression for Non-Audible Murmur recognition with stereo signal processing. ASRU 2011: 494-499 | |
| c97 | Hiroyuki Nawata, Noriyoshi Kamado, Hiroshi Saruwatari, Kiyohiro Shikano: Automatic musical thumbnailing based on audio object localization and its evaluation. ICASSP 2011: 41-44 | |
| c96 | Noriyoshi Kamado, Hiroshi Saruwatari, Kiyohiro Shikano: Robust sound field reproduction integrating multi-point sound field control and wave field synthesis. ICASSP 2011: 441-444 | |
| c95 | Takayuki Inoue, Hiroshi Saruwatari, Kiyohiro Shikano, Kazunobu Kondo: Theoretical analysis of musical noise in Wiener filtering family via higher-order statistics. ICASSP 2011: 5076-5079 | |
| c94 | Hironori Doi, Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: An evaluation of alaryngeal speech enhancement methods based on voice conversion techniques. ICASSP 2011: 5136-5139 | |
| c93 | Denis Babani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Acoustic model training for non-audible murmur recognition using transformed normal speech data. ICASSP 2011: 5224-5227 | |
| c92 | Ryoichi Miyazaki, Hiroshi Saruwatari, Kiyohiro Shikano: Theoretical Analysis of Musical Noise and Speech Distortion in Structure-Generalized Parametric Blind Spatial Subtraction Array. INTERSPEECH 2011: 341-344 | |
| c91 | Ryo Wakisaka, Hiroshi Saruwatari, Kiyohiro Shikano, Tomoya Takatani: Blind Speech Prior Estimation for Generalized Minimum Mean-Square Error Short-Time Spectral Amplitude Estimator. INTERSPEECH 2011: 361-364 | |
| c90 | Nobuhiko Hattori, Tomoki Toda, Hisashi Kawai, Hiroshi Saruwatari, Kiyohiro Shikano: Speaker-Adaptive Speech Synthesis Based on Eigenvoice Conversion and Language-Dependent Prosodic Conversion in Speech-to-Speech Translation. INTERSPEECH 2011: 2769-2772 | |
| c89 | Hiroshi Saruwatari, Nobuhisa Hirata, Toshiyuki Hatta, Ryo Wakisaka, Kiyohiro Shikano, Tomoya Takatani: Semi-blind speech extraction for robot using visual information and noise statistics. ISSPIT 2011: 264-269 | |
| 2010 | ||
| j34 | Yu Takahashi, Hiroshi Saruwatari, Kiyohiro Shikano, Kazunobu Kondo: Musical-Noise Analysis in Methods of Integrating Microphone Array and Spectral Subtraction Based on Higher-Order Statistics. EURASIP J. Adv. Sig. Proc. 2010 (2010) | |
| j33 | Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Adaptive Training for Voice Conversion Based on Eigenvoices. IEICE Transactions 93-D(6): 1589-1598 (2010) | |
| j32 | Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Evaluation of Extremely Small Sound Source Signals Used in Speaking-Aid System with Statistical Voice Conversion. IEICE Transactions 93-D(7): 1909-1917 (2010) | |
| j31 | Hironori Doi, Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Esophageal Speech Enhancement Based on Statistical Voice Conversion with Gaussian Mixture Models. IEICE Transactions 93-D(9): 2472-2482 (2010) | |
| j30 | Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Improvements of the One-to-Many Eigenvoice Conversion System. IEICE Transactions 93-D(9): 2491-2499 (2010) | |
| c88 | Hiroshi Saruwatari, Ryoi Okamoto, Yu Takahashi, Kiyohiro Shikano: Blind Speech Extraction Combining Generalized MMSE STSA Estimator and ICA-Based Noise and Speech Probability Density Function Estimations. LVA/ICA 2010: 49-56 | |
| c87 | Yu Takahashi, Hiroshi Saruwatari, Hiroshi Shikano, Kazunobu Kondo: Theoretical musical-noise analysis and its generalization for methods of integrating beamforming and spectral subtraction based on higher-order statistics. ICASSP 2010: 93-96 | |
| c86 | Jani Even, Hiroshi Saruwatari, Kiyohiro Shikano, Tomoya Takatani: Complex Newton algorithm for blind signal extraction of speech in diffuse noise. ICASSP 2010: 213-216 | |
| c85 | Hironori Doi, Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Statistical approach to enhancing esophageal speech based on Gaussian mixture models. ICASSP 2010: 4250-4253 | |
| c84 | Jani Even, Hiroshi Saruwatari, Kiyohiro Shikano, Tomoya Takatani: Speech enhancement in presence of diffuse background noise: Why using blind signal extraction? ICASSP 2010: 4770-4773 | |
| c83 | Ryoi Okamoto, Yu Takahashi, Hiroshi Saruwatari, Kiyohiro Shikano: MMSE STSA estimator with nonstationary noise estimation based on ICA for high-quality speech enhancement. ICASSP 2010: 4778-4781 | |
| c82 | Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Non-parallel training for many-to-many eigenvoice conversion. ICASSP 2010: 4822-4825 | |
| c81 | Jani Even, Carlos Toshinori Ishi, Hiroshi Saruwatari, Norihiro Hagita: Close speaker cancellation for suppression of non-stationary background noise for hands-free speech interface. INTERSPEECH 2010: 977-980 | |
| c80 | Rafael Torres, Shota Takeuchi, Hiromichi Kawanami, Tomoko Matsui, Hiroshi Saruwatari, Kiyohiro Shikano: Comparison of methods for topic classification in a speech-oriented guidance system. INTERSPEECH 2010: 1261-1264 | |
| c79 | Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: The use of air-pressure sensor in electrolaryngeal speech enhancement based on statistical voice conversion. INTERSPEECH 2010: 1628-1631 | |
| c78 | Kumi Ohta, Tomoki Toda, Yamato Ohtani, Hiroshi Saruwatari, Kiyohiro Shikano: Adaptive voice-quality control based on one-to-many eigenvoice conversion. INTERSPEECH 2010: 2158-2161 | |
| c77 | Hiroshi Sawada, Jani Even, Hiroshi Saruwatari, Kiyohiro Shikano, Tomoya Takatani: Improvement of speech recognition performance for spoken-oriented robot dialog system using end-fire array. IROS 2010: 970-975 | |
| 2009 | ||
| j29 | Rajkishore Prasad, Hiroshi Saruwatari, Kiyohiro Shikano: Enhancement of speech signals separated from their convolutive mixture by FDICA algorithm. Digital Signal Processing 19(1): 127-133 (2009) | |
| j28 | Randy Gomez, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Techniques in rapid unsupervised speaker adaptation based on HMM-Sufficient Statistics. Speech Communication 51(1): 42-57 (2009) | |
| j27 | Yu Takahashi, Tomoya Takatani, Keiichi Osako, Hiroshi Saruwatari, Kiyohiro Shikano: Blind Spatial Subtraction Array for Speech Enhancement in Noisy Environment. IEEE Transactions on Audio, Speech & Language Processing 17(4): 650-664 (2009) | |
| c76 | Takashi Hiekata, Takashi Morita, Youhei Ikeda, Hiroshi Hashimoto, Ruoyu Zhang, Yu Takahashi, Hiroshi Saruwatari, Kiyohiro Shikano: Multiple ICA-based real-time blind source extraction applied to handy size microphone. ICASSP 2009: 121-124 | |
| c75 | Yu Takahashi, Yoshihisa Uemura, Hiroshi Saruwatari, Kiyohiro Shikano, Kazunobu Kondo: Musical noise analysis based on higher order statistics for microphone array and nonlinear signal processing. ICASSP 2009: 229-232 | |
| c74 | Shigeki Miyabe, Biing-Hwang Juang, Hiroshi Saruwatari, Kiyohiro Shikano: Kernel-based nonlinear independent component analysis for underdetermined blind source separation. ICASSP 2009: 1641-1644 | |
| c73 | Yu Takahashi, Hiroshi Saruwatari, Yuki Fujihara, Kentaro Tachibana, Yoshimitsu Mori, Shigeki Miyabe, Kiyohiro Shikano, Akira Tanaka: Source adaptive blind signal extraction using closed-form ICA for hands-free robot spoken dialogue system. ICASSP 2009: 3681-3684 | |
| c72 | Hiroshi Saruwatari, Hiromichi Kawanami, Shota Takeuchi, Yu Takahashi, Tobias Cincarek, Kiyohiro Shikano: Hands-free speech recognition challenge for real-world speech dialogue systems. ICASSP 2009: 3729-3732 | |
| c71 | Daisuke Miyamoto, Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Acoustic compensation methods for body transmitted speech conversion. ICASSP 2009: 3901-3904 | |
| c70 | Yoshihisa Uemura, Yu Takahashi, Hiroshi Saruwatari, Kiyohiro Shikano, Kazunobu Kondo: Musical noise generation analysis for noise reduction methods based on spectral subtraction and MMSE STSA estimation. ICASSP 2009: 4433-4436 | |
| c69 | Jani Even, Hiroshi Saruwatari, Kiyohiro Shikano: Target Speech Enhancement in Presence of Jammer and Diffuse Background Noise. ICA 2009: 565-572 | |
| c68 | Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Electrolaryngeal speech enhancement based on statistical voice conversion. INTERSPEECH 2009: 1431-1434 | |
| c67 | Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Many-to-many eigenvoice conversion with reference voice. INTERSPEECH 2009: 1623-1626 | |
| c66 | Jani Even, Hiroshi Sawada, Hiroshi Saruwatari, Kiyohiro Shikano, Tomoya Takatani: Semi-blind suppression of internal noise for hands-free robot spoken dialog system. IROS 2009: 658-663 | |
| c65 | Shigeki Miyabe, Keisuke Masatoki, Hiroshi Saruwatari, Kiyohiro Shikano, Toshiyuki Nomura: Temporal quantization of spatial information using directional clustering for multichannel audio coding. WASPAA 2009: 261-264 | |
| 2008 | ||
| j26 | Tobias Cincarek, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Cost Reduction of Acoustic Modeling for Real-Environment Applications Using Unsupervised and Selective Training. IEICE Transactions 91-D(3): 499-507 (2008) | |
| j25 | Tobias Cincarek, Hiromichi Kawanami, Ryuichi Nisimura, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano: Development, Long-Term Operation and Portability of a Real-Environment Speech-Oriented Guidance System. IEICE Transactions 91-D(3): 576-587 (2008) | |
| j24 | Goshu Nagino, Makoto Shozakai, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Building an Effective Speech Corpus by Utilizing Statistical Multidimensional Scaling Method. IEICE Transactions 91-D(3): 607-614 (2008) | |
| j23 | Yuki Yai, Shigeki Miyabe, Hiroshi Saruwatari, Kiyohiro Shikano, Yosuke Tatekura: Rapid Compensation of Temperature Fluctuation Effect for Multichannel Sound Field Reproduction System. IEICE Transactions 91-A(6): 1329-1336 (2008) | |
| j22 | Keiichi Osako, Yoshimitsu Mori, Yu Takahashi, Hiroshi Saruwatari, Kiyohiro Shikano: Fast Convergence Blind Source Separation Using Frequency Subband Interpolation by Null Beamforming. IEICE Transactions 91-A(6): 1357-1361 (2008) | |
| c64 | Jani Even, Hiroshi Saruwatari, Kiyohiro Shikano: Frequency domain semi-blind signal separation: application to the rejection of internal noises. ICASSP 2008: 157-160 | |
| c63 | Yuuki Haraguchi, Shigeki Miyabe, Hiroshi Saruwatari, Kiyohiro Shikano, Toshiyuki Nomura: Source-oriented localization control of stereo audio signals based on blind source separation. ICASSP 2008: 177-180 | |
| c62 | Yuuta Yuyama, Shigeki Miyabe, Hiroshi Saruwatari, Kiyohiro Shikano: Hybrid structure of inverse filtering and DOA-parameterized wavefront synthesis. ICASSP 2008: 401-404 | |
| c61 | Randy Gomez, Jani Even, Hiroshi Saruwatari, Kiyohiro Shikano: Distant talking robust speech recognition using late reflection components of room impulse response. ICASSP 2008: 4581-4584 | |
| c60 | Shota Takeuchi, Tobias Cincarek, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano: Question and answer database optimization using speech recognition results. INTERSPEECH 2008: 451-454 | |
| c59 | Hiroshi Saruwatari, Yu Takahashi, Hiroyuki Sakai, Shota Takeuchi, Tobias Cincarek, Hiromichi Kawanami, Kiyohiro Shikano: Development and evaluation of hands-free spoken dialogue system for railway station guidance. INTERSPEECH 2008: 455-458 | |
| c58 | Takashi Muramatsu, Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Low-delay voice conversion based on maximum likelihood estimation of spectral parameter trajectory. INTERSPEECH 2008: 1076-1079 | |
| c57 | Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: An improved one-to-many eigenvoice conversion system. INTERSPEECH 2008: 1080-1083 | |
| c56 | Hideki Okamoto, Tomoko Matsui, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano: Speaker verification with non-audible murmur segments by combining global alignment kernel and penalized logistic regression machine. INTERSPEECH 2008: 1369-1372 | |
| c55 | Daisuke Tani, Tomoki Toda, Yamato Ohtani, Hiroshi Saruwatari, Kiyohiro Shikano: Maximum a posteriori adaptation for many-to-one eigenvoice conversion. INTERSPEECH 2008: 1461-1463 | |
| c54 | Keigo Nakamura, Tomoki Toda, Yoshitaka Nakajima, Hiroshi Saruwatari, Kiyohiro Shikano: Evaluation of speaking-aid system with voice conversion for laryngectomees toward its use in practical environments. INTERSPEECH 2008: 2209-2212 | |
| c53 | Yu Takahashi, Hiroshi Saruwatari, Kiyohiro Shikano: Real-time implementation of blind spatial subtraction array for hands-free robot spoken dialogue system. IROS 2008: 1687-1692 | |
| c52 | Jani Even, Hiroshi Saruwatari, Kiyohiro Shikano: An improved permutation solver for blind signal separation based front-ends in robot audition. IROS 2008: 2172-2177 | |
| 2007 | ||
| j21 | Panikos Heracleous, Tomomi Kaino, Hiroshi Saruwatari, Kiyohiro Shikano: Unvoiced Speech Recognition Using Tissue-Conductive Acoustic Sensor. EURASIP J. Adv. Sig. Proc. 2007 (2007) | |
| j20 | Shigeki Miyabe, Yoichi Hinamoto, Hiroshi Saruwatari, Kiyohiro Shikano, Yosuke Tatekura: Interface for Barge-in Free Spoken Dialogue System Based on Sound Field Reproduction and Microphone Array. EURASIP J. Adv. Sig. Proc. 2007 (2007) | |
| j19 | Randy Gomez, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Reducing Computation Time of the Rapid Unsupervised Speaker Adaptation Based on HMM-Sufficient Statistics. IEICE Transactions 90-D(2): 554-561 (2007) | |
| c51 | Tobias Cincarek, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano: Development and portability of ASR and Q&A modules for real-environment speech-oriented guidance systems. ASRU 2007: 520-525 | |
| c50 | Kentaro Tachibana, Hiroshi Saruwatari, Yoshimitsu Mori, Shigeki Miyabe, Kiyohiro Shikano, Akira Tanaka: Efficient Blind Source Separation Combining Closed-Form Second-Order ICA and Nonclosed-Form Higher-Order ICA. ICASSP (1) 2007: 45-48 | |
| c49 | Yu Takahashi, Tomoya Takatani, Hiroshi Saruwatari, Kiyohiro Shikano: Permutation-Robust Structure for ICA-Based Blind Source Extraction. ICASSP (1) 2007: 149-152 | |
| c48 | Randy Gomez, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Rapid unsupervised speaker adaptation using single utterance based on MLLR and speaker selection. INTERSPEECH 2007: 262-265 | |
| c47 | Tobias Cincarek, Izumi Shindo, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Development of preschool children subsystem for ASR and q&a in a real-environment speech-oriented guidance task. INTERSPEECH 2007: 1469-1472 | |
| c46 | Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Speaker adaptive training for one-to-many eigenvoice conversion based on Gaussian mixture model. INTERSPEECH 2007: 1981-1984 | |
| c45 | Hideki Okamoto, Mariko Kojima, Tomoko Matsui, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano: Study on speaker verification with non-audible murmur segments. INTERSPEECH 2007: 2017-2020 | |
| c44 | Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Impact of various small sound source signals on voice conversion accuracy in speech communication aid for laryngectomees. INTERSPEECH 2007: 2517-2520 | |
| c43 | Yoshimitsu Mori, Tomoya Takatani, Hiroshi Saruwatari, Kiyohiro Shikano, Takashi Hiekata, Takashi Morita: Noise-robust hands-free speech recognition using SIMO-model-based blind source separation. ISSPA 2007: 1-4 | |
| c42 | Yu Takahashi, Tomoya Takatani, Hiroshi Saruwatari, Kiyohiro Shikano: Robust spatial subtraction array with independent component analysis for speech enhancement. ISSPA 2007: 1-4 | |
| c41 | Hiroyuki Sakai, Tobias Cincarek, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano, Akinobu Lee: Voice activity detection applied to hands-free spoken dialogue robot based on decoding using acoustic and language model. ROBOCOMM 2007: 16 | |
| 2006 | ||
| j18 | Yoshimitsu Mori, Hiroshi Saruwatari, Tomoya Takatani, Satoshi Ukai, Kiyohiro Shikano, Takashi Hiekata, Youhei Ikeda, Hiroshi Hashimoto, Takashi Morita: Blind Separation of Acoustic Signals Combining SIMO-Model-Based Independent Component Analysis and Binary Masking. EURASIP J. Adv. Sig. Proc. 2006 (2006) | |
| j17 | Shigeki Miyabe, Hiroshi Saruwatari, Kiyohiro Shikano, Yosuke Tatekura: Interface for Barge-in Free Spoken Dialogue System Using Nullspace Based Sound Field Control and Beamforming. IEICE Transactions 89-A(3): 716-726 (2006) | |
| j16 | Tobias Cincarek, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Utterance-Based Selective Training for the Automatic Creation of Task-Dependent Acoustic Models. IEICE Transactions 89-D(3): 962-969 (2006) | |
| j15 | Randy Gomez, Akinobu Lee, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Improving Rapid Unsupervised Speaker Adaptation Based on HMM-Sufficient Statistics in Noisy Environments Using Multi-Template Models. IEICE Transactions 89-D(3): 998-1005 (2006) | |
| j14 | Hiroshi Saruwatari, Toshiya Kawamura, Tsuyoki Nishikawa, Akinobu Lee, Kiyohiro Shikano: Blind source separation based on a fast-convergence algorithm combining ICA and beamforming. IEEE Transactions on Audio, Speech & Language Processing 14(2): 666-678 (2006) | |
| c40 | Yoshimitsu Mori, Hiroshi Saruwatari, Tomoya Takatani, Kiyohiro Shikano, Takashi Hiekata, Takashi Morita: ICA and Binary-Mask-Based Blind Source Separation with Small Directional Microphones. ICA 2006: 649-657 | |
| c39 | Tobias Cincarek, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Acoustic modeling for spoken dialogue systems based on unsupervised utterance-based selective training. INTERSPEECH 2006 | |
| c38 | Mariko Kojima, Tomoko Matsui, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano: Speaker verification with non-audible murmur segments. INTERSPEECH 2006 | |
| c37 | Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Speaking aid system for total laryngectomees using voice conversion of body transmitted artificial speech. INTERSPEECH 2006 | |
| c36 | Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation. INTERSPEECH 2006 | |
| 2005 | ||
| j13 | Kazuki Adachi, Tomoki Toda, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano: Designing Target Cost Function Based on Prosody of Speech Database. IEICE Transactions 88-D(3): 519-524 (2005) | |
| j12 | Satoshi Ukai, Tomoya Takatani, Hiroshi Saruwatari, Kiyohiro Shikano, Ryo Mukai, Hiroshi Sawada: Multistage SIMO-Model-Based Blind Source Separation Combining Frequency-Domain ICA and Time-Domain ICA. IEICE Transactions 88-A(3): 642-650 (2005) | |
| j11 | Tatsunori Asai, Hiroshi Saruwatari, Kiyohiro Shikano: Interface for Barge-in Free Spoken Dialogue System Combining Adaptive Sound Field Control and Microphone Array. IEICE Transactions 88-A(6): 1613-1618 (2005) | |
| j10 | Tomoya Takatani, Satoshi Ukai, Tsuyoki Nishikawa, Hiroshi Saruwatari, Kiyohiro Shikano: A Self-Generator Method for Initial Filters of SIMO-ICA Applied to Blind Separation of Binaural Sound Mixtures. IEICE Transactions 88-A(7): 1673-1682 (2005) | |
| j9 | Rajkishore Prasad, Hiroshi Saruwatari, Kiyohiro Shikano: Blind Separation of Speech by Fixed-Point ICA with Source Adaptive Negentropy Approximation. IEICE Transactions 88-A(7): 1683-1692 (2005) | |
| j8 | Yosuke Tatekura, Shigefumi Urata, Hiroshi Saruwatari, Kiyohiro Shikano: On-Line Relaxation Algorithm Applicable to Acoustic Fluctuation for Inverse Filter in Multichannel Sound Reproduction System. IEICE Transactions 88-A(7): 1747-1756 (2005) | |
| j7 | Hiroshi Saruwatari, Hiroaki Yamajo, Tomoya Takatani, Tsuyoki Nishikawa, Kiyohiro Shikano: Blind Separation and Deconvolution for Convolutive Mixture of Speech Combining SIMO-Model-Based ICA and Multichannel Inverse Filtering. IEICE Transactions 88-A(9): 2387-2400 (2005) | |
| j6 | Shoko Araki, Shoji Makino, Robert Aichner, Tsuyoki Nishikawa, Hiroshi Saruwatari: Subband-Based Blind Separation for Convolutive Mixtures of Speech. IEICE Transactions 88-A(12): 3593-3603 (2005) | |
| j5 | Rajkishore Prasad, Hiroshi Saruwatari, Kiyohiro Shikano: Estimation of Shape Parameter of GGD Function by Negentropy Matching. Neural Processing Letters 22(3): 377-389 (2005) | |
| c35 | Hiroshi Saruwatari, Katsuyuki Sawai, Tsuyoki Nishikawa, Akinobu Lee, Kiyohiro Shikano, Atsunobu Kaminuma, Masao Sakata, Daisuke Saitoh: Speech Enhancement Based on Blind Source Separation in Car Environments. ICDE Workshops 2005: 1205 | |
| c34 | Randy Gomez, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano: Rapid unsupervised speaker adaptation based on multi-template HMM sufficient statistics in noisy environments. INTERSPEECH 2005: 293-296 | |
| c33 | Daisuke Saitoh, Atsunobu Kaminuma, Hiroshi Saruwatari, Tsuyoki Nishikawa, Akinobu Lee: Speech extraction in a car interior using frequency-domain ICA with rapid filter adaptations. INTERSPEECH 2005: 2301-2304 | |
| c32 | Panikos Heracleous, Tomomi Kaino, Hiroshi Saruwatari, Kiyohiro Shikano: Investigating the role of the Lombard reflex in non-audible murmur (NAM) recognition. INTERSPEECH 2005: 2649-2652 | |
| c31 | Panikos Heracleous, Tomomi Kaino, Hiroshi Saruwatari, Kiyohiro Shikano: Applications of NAM microphones in speech recognition for privacy in human-machine communication. INTERSPEECH 2005: 3041-3044 | |
| c30 | Tomoya Takatani, Satoshi Ukai, Tsuyoki Nishikawa, Hiroshi Saruwatari, Kiyohiro Shikano: Blind sound scene decomposition for robot audition using SIMO-model-based ICA. IROS 2005: 2247-2252 | |
| c29 | Hiroshi Saruwatari, Yoshimitsu Mori, Tomoya Takatani, Satoshi Ukai, Kiyohiro Shikano, Takashi Hiekata, Takashi Morita: Two-stage blind source separation based on ICA and binary masking for real-time robot audition system. IROS 2005: 2303-2308 | |
| c28 | Yasuaki Ohashi, Tsuyoki Nishikawa, Hiroshi Saruwatari, Akinobu Lee, Kiyohiro Shikano: Noise-robust hands-free speech recognition based on spatial subtraction array and known noise superimposition. IROS 2005: 2328-2332 | |
| 2004 | ||
| j4 | Rajkishore Prasad, Hiroshi Saruwatari, Kiyohiro Shikano: Negentropy based voice-activity detection for noise estimation in very low SNR condition. IEICE Electronic Express 1(16): 495-500 (2004) | |
| c27 | Satoshi Ukai, Hiroshi Saruwatari, Tomoya Takatani, Kiyohiro Shikano, Ryo Mukai, Hiroshi Sawada: Evaluation of Multistage SIMO-Model-Based Blind Source Separation Combining Frequency-Domain ICA and Time-Domain ICA. ICA 2004: 626-633 | |
| c26 | Rajkishore Prasad, Hiroshi Saruwatari, Kiyohiro Shikano: Single Channel Speech Enhancement: MAP Estimation Using GGD Prior Under Blind Setup. ICA 2004: 873-880 | |
| c25 | Tsuyoki Nishikawa, Hiroshi Saruwatari, Kiyohiro Shikano, Atsunobu Kaminuma: Stable and Low-Distortion Algorithm Based on Overdetermined Blind Separation for Convolutive Mixtures of Speech. ICA 2004: 881-888 | |
| c24 | Tatsunori Asai, Shigeki Miyabe, Hiroshi Saruwatari, Kiyohiro Shikano: Interface for barge-in free spoken dialogue system using adaptive sound field control. INTERSPEECH 2004 | |
| c23 | Randy Gomez, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano: Robust speech recognition with spectral subtraction in low SNR. INTERSPEECH 2004 | |
| c22 | Panikos Heracleous, Yoshitaka Nakajima, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano: Non-audible murmur (NAM) speech recognition using a stethoscopic NAM microphone. INTERSPEECH 2004 | |
| c21 | Akinobu Lee, Keisuke Nakamura, Ryuichi Nisimura, Hiroshi Saruwatari, Kiyohiro Shikano: Noise robust real world spoken dialogue system using GMM based rejection of unintended inputs. INTERSPEECH 2004 | |
| c20 | Kazuki Adachi, Tomoki Toda, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano: Perceptual Evaluation of Quality Deterioration Owing to Prosody Modification. LREC 2004 | |
| 2003 | ||
| j3 | Hiroshi Saruwatari, Satoshi Kurita, Kazuya Takeda, Fumitada Itakura, Tsuyoki Nishikawa, Kiyohiro Shikano: Blind Source Separation Combining Independent Component Analysis and Beamforming. EURASIP J. Adv. Sig. Proc. 2003(11): 1135-1146 (2003) | |
| j2 | Shoko Araki, Shoji Makino, Yoichi Hinamoto, Ryo Mukai, Tsuyoki Nishikawa, Hiroshi Saruwatari: Equivalence between Frequency-Domain Blind Source Separation and Frequency-Domain Adaptive Beamforming for Convolutive Mixtures. EURASIP J. Adv. Sig. Proc. 2003(11): 1157-1166 (2003) | |
| j1 | Shoko Araki, Ryo Mukai, Shoji Makino, Tsuyoki Nishikawa, Hiroshi Saruwatari: The fundamental limitation of frequency domain blind source separation for convolutive mixtures of speech. IEEE Transactions on Speech and Audio Processing 11(2): 109-116 (2003) | |
| c19 | Hiromichi Kawanami, Yohei Iwami, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: GMM-based voice conversion applied to emotional speech synthesis. INTERSPEECH 2003 | |
| c18 | Tatsuya Shiraishi, Tomoki Toda, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano: Simple designing methods of corpus-based visual speech synthesis. INTERSPEECH 2003 | |
| c17 | Shingo Yamade, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano: Unsupervised speaker adaptation based on HMM sufficient statistics in various noisy environments. INTERSPEECH 2003 | |
| c16 | Hiroaki Yamajo, Hiroshi Saruwatari, Tomoya Takatani, Tsuyoki Nishikawa, Kiyohiro Shikano: Blind separation and deconvolution for convolutive mixture of speech using SIMO-model-based ICA and multichannel inverse filtering. INTERSPEECH 2003 | |
| 2002 | ||
| c15 | Tsuyoki Nishikawa, Hiroshi Saruwatari, Kiyohiro Shikano: Bund source separation based on Multi-Stage ICA combining frequency-domain ICA and time-domain ICA. ICASSP 2002: 917-920 | |
| c14 | Hiroshi Saruwatari, Toshiya Kawamura, Katsuyuki Sawai, Atsunobu Kaminuma, Masao Sakata: Blind source separation based on fast-convergence algorithm using ICA and beamforming for real convolutive mixture. ICASSP 2002: 921-924 | |
| c13 | Shoko Araki, Yoichi Hinamoto, Shoji Makino, Tsuyoki Nishikawa, Ryo Mukai, Hiroshi Saruwatari: Equivalence between frequency domain blind source separation and frequency domain adaptive beamforming. ICASSP 2002: 1785-1788 | |
| c12 | Satoshi Nakamura, Kazuo Hiyane, Futoshi Asano, Yutaka Kaneda, Takeshi Yamada, Takanobu Nishiura, Tetsunori Kobayashi, Shiro Ise, Hiroshi Saruwatari: Design and collection of acoustic sound data for hands-free speech recognition and sound scene understanding. ICME (2) 2002: 161-164 | |
| c11 | Akinobu Lee, Yuichiro Mera, Hiroshi Saruwatari, Kiyohiro Shikano: Selective multi-path acoustic model based on database likelihoods. INTERSPEECH 2002 | |
| c10 | Hiroshi Saruwatari, Katsuyuki Sawai, Akinobu Lee, Kiyohiro Shikano, Atsunobu Kaminuma, Masao Sakata: Speech enhancement in car environment using blind source separation. INTERSPEECH 2002 | |
| c9 | Shingo Yamade, Kanako Matsunami, Akira Baba, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano: Spectral subtraction in noisy environments applied to speaker adaptation based on HMM sufficient statistics. INTERSPEECH 2002 | |
| 2001 | ||
| c8 | Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano: High quality voice conversion based on Gaussian mixture model with dynamic frequency warping. INTERSPEECH 2001: 349-352 | |
| c7 | Miichi Yamada, Akira Baba, Shinichi Yoshizawa, Yuichiro Mera, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano: Unsupervised noisy environment adaptation algorithm using MLLR and speaker selection. INTERSPEECH 2001: 869-872 | |
| c6 | Ryuichi Nisimura, Kumiko Komatsu, Yuka Kuroda, Kentaro Nagatomo, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano: Automatic n-gram language model creation from web resources. INTERSPEECH 2001: 2127-2130 | |
| c5 | Shoko Araki, Shoji Makino, Ryo Mukai, Hiroshi Saruwatari: Equivalence between frequency domain blind source separation and frequency domain adaptive null beamformers. INTERSPEECH 2001: 2595-2598 | |
| c4 | Hiroshi Saruwatari, Toshiya Kawamura, Kiyohiro Shikano: Blind source separation for speech based on fast-convergence algorithm with ICA and beamforming. INTERSPEECH 2001: 2603-2606 | |
| 2000 | ||
| c3 | Hiroshi Saruwatari, Satoshi Kurita, Kazuya Takeda, Fumitada Itakura, Kiyohiro Shikano: Blind source separation based on subband ICA and beamforming. INTERSPEECH 2000: 94-97 | |
| c2 | Tomoki Toda, Jinlin Lu, Hiroshi Saruwatari, Kiyohiro Shikano: Straight-based voice conversion algorithm based on Gaussian mixture model. INTERSPEECH 2000: 279-282 | |
| 1999 | ||
| c1 | Hiroshi Saruwatari, Shoji Kajita, Kazuya Takeda, Fumitada Itakura: Speech enhancement using nonlinear microphone array under nonstationary noise conditions. EUROSPEECH 1999 | |
Colors in the list of coauthors
Last update Fri May 24 04:54:34 2013 CET by the DBLP Team —
Data released under the ODC-BY 1.0 license — See also our legal information page