| 2012 | ||
|---|---|---|
| j15 | Takami Yoshida, Kazuhiro Nakadai: Audio-Visual Voice Activity Detection Based on an Utterance State Transition Model. Advanced Robotics 26(10): 1183-1201 (2012) | |
| j14 | Hiroaki Miura, Takami Yoshida, Keisuke Nakamura, Kazuhiro Nakadai: SLAM-based Online Calibration for Asynchronous Microphone Array. Advanced Robotics 26(17): 1941-1965 (2012) | |
| j13 | Kenta Yonekura, Chyon Hae Kim, Kazuhiro Nakadai, Hiroshi Tsujino, Shigeki Sugano: A role of multi-modal rhythms in physical interaction and cooperation. EURASIP J. Audio, Speech and Music Processing 2012: 12 (2012) | |
| j12 | Ryu Takeda, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Efficient Blind Dereverberation and Echo Cancellation Based on Independent Component Analysis for Actual Acoustic Signals. Neural Computation 24(1): 234-272 (2012) | |
| c106 | Randy Gomez, Tatsuya Kawahara, Keisuke Nakamura, Kazuhiro Nakadai: Multi-party human-robot interaction with distant-talking speech recognition. HRI 2012: 439-446 | |
| c105 | Futoshi Asano, Hideki Asoh, Kazuhiro Nakadai: Sound source localization in spatially colored noise using a hierarchical Bayesian model. ICASSP 2012: 193-196 | |
| c104 | João Lobato Oliveira, Gökhan Ince, Keisuke Nakamura, Kazuhiro Nakadai: Online audio beat tracking for a dancing robot in the presence of ego-motion noise in a real environment. ICRA 2012: 403-408 | |
| c103 | Keisuke Nakamura, Kazuhiro Nakadai, Gökhan Ince: Real-time super-resolution Sound Source Localization for robots. IROS 2012: 694-699 | |
| c102 | João Lobato Oliveira, Gökhan Ince, Keisuke Nakamura, Kazuhiro Nakadai, Hiroshi G. Okuno, Luís Paulo Reis, Fabien Gouyon: Live assessment of beat tracking for robot audition. IROS 2012: 992-997 | |
| c101 | Gökhan Ince, Kazuhiro Nakadai, Keisuke Nakamura: Online learning for template-based multi-channel ego noise estimation. IROS 2012: 3282-3287 | |
| c100 | Keita Okutani, Takami Yoshida, Keisuke Nakamura, Kazuhiro Nakadai: Outdoor auditory scene analysis using a moving microphone array embedded in a quadrocopter. IROS 2012: 3288-3293 | |
| c99 | João Lobato Oliveira, Gökhan Ince, Keisuke Nakamura, Kazuhiro Nakadai, Hiroshi G. Okuno, Luís Paulo Reis, Fabien Gouyon: An active audition framework for auditory-driven HRI: Application to interactive robot dancing. RO-MAN 2012: 1078-1085 | |
| 2011 | ||
| j11 | Gökhan Ince, Kazuhiro Nakadai, Tobias Rodemann, Hiroshi Tsujino, Jun-ichi Imura: Ego noise cancellation of a robot using missing feature masks. Appl. Intell. 34(3): 360-371 (2011) | |
| j10 | Gökhan Ince, Kazuhiro Nakadai, Tobias Rodemann, Hiroshi Tsujino, Jun-ichi Imura: Whole Body Motion Noise Cancellation of a Robot for Improved Automatic Speech Recognition. Advanced Robotics 25(11-12): 1405-1426 (2011) | |
| j9 | Takuma Otsuka, Kazuhiro Nakadai, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno: Real-Time Audio-to-Score Alignment Using Particle Filter for Coplayer Music Robots. EURASIP J. Adv. Sig. Proc. 2011 (2011) | |
| j8 | Mikio Nakano, Yuji Hasegawa, Kotaro Funakoshi, Johane Takeuchi, Toyotaka Torii, Kazuhiro Nakadai, Naoyuki Kanda, Kazunori Komatani, Hiroshi G. Okuno, Hiroshi Tsujino: A multi-expert model for dialogue and behavior control of conversational robots and agents. Knowl.-Based Syst. 24(2): 248-256 (2011) | |
| c98 | Kenta Yonekura, Chyon Hae Kim, Kazuhiro Nakadai, Hiroshi Tsujino, Shigeki Sugano: Rhythmic reference of a human while a rope turning task. HRI 2011: 289-290 | |
| c97 | Keisuke Nakamura, Kazuhiro Nakadai, Hirofumi Nakajima, Gökhan Ince: Correlation matrix interpolation in Sound Source Localization for a robot. ICASSP 2011: 4324-4327 | |
| c96 | Takeshi Mizumoto, Kazuhiro Nakadai, Takami Yoshida, Ryu Takeda, Takuma Otsuka, Toru Takahashi, Hiroshi G. Okuno: Design and implementation of selectable sound separation on the Texai telepresence system using HARK. ICRA 2011: 2130-2137 | |
| c95 | Gökhan Ince, Keisuke Nakamura, Futoshi Asano, Hirofumi Nakajima, Kazuhiro Nakadai: Assessment of general applicability of ego noise estimation. ICRA 2011: 3517-3522 | |
| c94 | Takuma Otsuka, Kazuhiro Nakadai, Tetsuya Ogata, Hiroshi G. Okuno: Bayesian Extension of MUSIC for Sound Source Localization and Tracking. INTERSPEECH 2011: 3109-3112 | |
| c93 | Martin Heckmann, Kazuhiro Nakadai, Hirofumi Nakajima: Robust Intonation Pattern Classification in Human Robot Interaction. INTERSPEECH 2011: 3137-3140 | |
| c92 | Gökhan Ince, Kazuhiro Nakadai, Tobias Rodemann, Jun-ichi Imura, Keisuke Nakamura, Hirofumi Nakajima: Assessment of single-channel ego noise estimation methods. IROS 2011: 106-111 | |
| c91 | Gökhan Ince, Kazuhiro Nakadai, Tobias Rodemann, Jun-ichi Imura, Keisuke Nakamura, Hirofumi Nakajima: Incremental learning for ego noise estimation of a robot. IROS 2011: 131-136 | |
| c90 | Keisuke Nakamura, Kazuhiro Nakadai, Futoshi Asano, Gökhan Ince: Intelligent sound source localization and its application to multimodal human tracking. IROS 2011: 143-148 | |
| c89 | Hiroaki Miura, Takami Yoshida, Keisuke Nakamura, Kazuhiro Nakadai: SLAM-based online calibration of asynchronous microphone array for robot audition. IROS 2011: 524-529 | |
| c88 | Zheng Gong, Kazuhiro Nakadai, Hirofumi Nakajima, Ichiro Hagiwara: HARK based real-time single pane 3D auditory scene visualizer empowered by Speech Arrow. IROS 2011: 530-535 | |
| c87 | Takuma Otsuka, Kazuhiro Nakadai, Tetsuya Ogata, Hiroshi G. Okuno: Incremental Bayesian Audio-to-Score Alignment with Flexible Harmonic Structure Models. ISMIR 2011: 525-530 | |
| 2010 | ||
| j7 | Kazuhiro Nakadai, Toru Takahashi, Hiroshi G. Okuno, Hirofumi Nakajima, Yuji Hasegawa, Hiroshi Tsujino: Design and Implementation of Robot Audition System 'HARK' - Open Source Software for Listening to Three Simultaneous Speakers. Advanced Robotics 24(5-6): 739-761 (2010) | |
| j6 | Hirofumi Nakajima, Kazuhiro Nakadai, Yuji Hasegawa, Hiroshi Tsujino: Blind Source Separation With Parameter-Free Adaptive Step-Size Method for Robot Audition. IEEE Transactions on Audio, Speech & Language Processing 18(6): 1476-1485 (2010) | |
| c86 | Takuma Otsuka, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Design and Implementation of Two-level Synchronization for Interactive Music Robot. AAAI 2010 | |
| c85 | Randy Gomez, Tatsuya Kawahara, Kazuhiro Nakadai: Robust hands-free Automatic Speech Recognition for human-machine interaction. Humanoids 2010: 138-143 | |
| c84 | Toru Takahashi, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Improvement in listening capability for humanoid robot HRP-2. ICRA 2010: 470-475 | |
| c83 | Gökhan Ince, Kazuhiro Nakadai, Tobias Rodemann, Yuji Hasegawa, Hiroshi Tsujino, Jun-ichi Imura: A hybrid framework for ego noise cancellation of a robot. ICRA 2010: 3623-3628 | |
| c82 | Ryu Takeda, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Upper-limit evaluation of robot audition based on ICA-BSS in multi-source, barge-in and highly reverberant conditions. ICRA 2010: 4366-4371 | |
| c81 | Takami Yoshida, Kazuhiro Nakadai, Hiroshi G. Okuno: An Improvement in Audio-Visual Voice Activity Detection for Automatic Speech Recognition. IEA/AIE (1) 2010: 51-61 | |
| c80 | Gökhan Ince, Kazuhiro Nakadai, Tobias Rodemann, Hiroshi Tsujino, Jun-ichi Imura: Robust Ego Noise Suppression of a Robot. IEA/AIE (1) 2010: 62-71 | |
| c79 | Takuma Otsuka, Takeshi Mizumoto, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Music-Ensemble Robot That Is Capable of Playing the Theremin While Listening to the Accompanied Music. IEA/AIE (1) 2010: 102-112 | |
| c78 | Gökhan Ince, Kazuhiro Nakadai, Tobias Rodemann, Hiroshi Tsujino, Jun-ichi Imura: A robust speech recognition system against the ego noise of a robot. INTERSPEECH 2010: 2070-2073 | |
| c77 | Martin Heckmann, Claudius Gläser, Frank Joublin, Kazuhiro Nakadai: Applying geometric source separation for improved pitch extraction in human-robot interaction. INTERSPEECH 2010: 2602-2605 | |
| c76 | Takami Yoshida, Kazuhiro Nakadai: Two-layered audio-visual integration in voice activity detection and automatic speech recognition for robots. INTERSPEECH 2010: 2702-2705 | |
| c75 | Hirofumi Nakajima, Gökhan Ince, Kazuhiro Nakadai, Yuji Hasegawa: An easily-configurable robot audition system using Histogram-based Recursive Level Estimation. IROS 2010: 958-963 | |
| c74 | T. Takahashi, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: An improvement in automatic speech recognition using soft missing feature masks for robot audition. IROS 2010: 964-969 | |
| c73 | Kazuhiro Nakadai, Hirofumi Nakajima, Gökhan Ince, Yuji Hasegawa: Sound source separation and automatic speech recognition for moving sources. IROS 2010: 976-981 | |
| c72 | Gökhan Ince, Kazuhiro Nakadai, Tobias Rodemann, Hiroshi Tsujino, Jun-ichi Imura: Multi-talker speech recognition under ego-motion noise using Missing Feature Theory. IROS 2010: 982-987 | |
| c71 | Takami Yoshida, Kazuhiro Nakadai, Hiroshi G. Okuno: Two-layered audio-visual speech recognition for robots in noisy environments. IROS 2010: 988-993 | |
| c70 | Martin Heckmann, Frank Joublin, Kazuhiro Nakadai: Pitch extraction in Human-Robot interaction. IROS 2010: 1482-1487 | |
| c69 | Ryu Takeda, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Speedup and performance improvement of ICA-based robot audition by parallel and resampling-based block-wise processing. IROS 2010: 1949-1956 | |
| c68 | Takeshi Mizumoto, Takuma Otsuka, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Human-robot ensemble between robot thereminist and human percussionist using coupled oscillator model. IROS 2010: 1957-1963 | |
| c67 | Ryota Fujimura, Kazuhiro Nakadai, Michita Imai, Ren Ohmura: PROT - An embodied agent for intelligible and user-friendly human-robot interaction. IROS 2010: 3860-3867 | |
| 2009 | ||
| c66 | Ryu Takeda, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Automatic estimation of reverberation time with robot speech to improve ICA-based robot audition. Humanoids 2009: 250-255 | |
| c65 | Takuma Otsuka, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Voice quality manipulation for humanoid robots consistent with their head movements. Humanoids 2009: 405-410 | |
| c64 | Takami Yoshida, Kazuhiro Nakadai, Hiroshi G. Okuno: Automatic speech recognition improved by two-layered audio-visual integration for robot audition. Humanoids 2009: 604-609 | |
| c63 | Ryu Takeda, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: ICA-based efficient blind dereverberation and echo cancellation method for barge-in-able robot audition. ICASSP 2009: 3677-3680 | |
| c62 | Kazuhiro Nakadai, Hirofumi Nakajima, Yuji Hasegawa, Hiroshi Tsujino: Sound source separation of moving speakers for robot audition. ICASSP 2009: 3685-3688 | |
| c61 | Gökhan Ince, Kazuhiro Nakadai, Tobias Rodemann, Yuji Hasegawa, Hiroshi Tsujino, Jun-ichi Imura: Ego noise suppression of a robot using template subtraction. IROS 2009: 199-204 | |
| c60 | Keisuke Nakamura, Kazuhiro Nakadai, Futoshi Asano, Yuji Hasegawa, Hiroshi Tsujino: Intelligent sound source localization for dynamic environments. IROS 2009: 664-669 | |
| c59 | Hirofumi Nakajima, Keiko Kikuchi, Touru Daigo, Yutaka Kaneda, Kazuhiro Nakadai, Yuji Hasegawa: Real-time sound source orientation estimation using a 96 channel microphone array. IROS 2009: 676-683 | |
| c58 | Ryu Takeda, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Step-size parameter adaptation of multi-channel semi-blind ICA with piecewise linear model for barge-in-able robot audition. IROS 2009: 2277-2282 | |
| c57 | Takuma Otsuka, Toru Takahashi, Hiroshi G. Okuno, Kazunori Komatani, Tetsuya Ogata, Kazumasa Murata, Kazuhiro Nakadai: Incremental polyphonic audio to score alignment using beat tracking for singer robots. IROS 2009: 2289-2296 | |
| c56 | Toru Takahashi, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Missing-feature-theory-based robust simultaneous speech recognition system with non-clean speech acoustic model. IROS 2009: 2730-2735 | |
| c55 | Hiroshi G. Okuno, Kazuhiro Nakadai, Hyun-Don Kim: Robot Audition: Missing Feature Theory Approach and Active Audition. ISRR 2009: 227-244 | |
| 2008 | ||
| c54 | Kazuhiro Nakadai, Hiroshi G. Okuno, Hirofumi Nakajima, Yuji Hasegawa, Hiroshi Tsujino: An open source software system for robot audition HARK and its evaluation. Humanoids 2008: 561-566 | |
| c53 | Hirofumi Nakajima, Kazuhiro Nakadai, Yuji Hasegawa, Hiroshi Tsujino: Adaptive step-size parameter control for real-world blind source separation. ICASSP 2008: 149-152 | |
| c52 | Kazuhiro Nakadai, Shun'ichi Yamamoto, Hiroshi G. Okuno, Hirofumi Nakajima, Yuji Hasegawa, Hiroshi Tsujino: A robot referee for rock-paper-scissors sound games. ICRA 2008: 3469-3474 | |
| c51 | Toru Takahashi, Shun'ichi Yamamoto, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Soft missing-feature mask generation for simultaneous speech recognition system in robots. INTERSPEECH 2008: 992-995 | |
| c50 | Ryu Takeda, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Barge-in-able robot audition based on ICA and missing feature theory under semi-blind situation. IROS 2008: 1718-1723 | |
| c49 | Hirofumi Nakajima, Kazuhiro Nakadai, Yuji Hasegawa, Hiroshi Tsujino: High performance sound source separation adaptable to environmental changes for robot audition. IROS 2008: 2165-2171 | |
| c48 | Kazumasa Murata, Kazuhiro Nakadai, Kazuyoshi Yoshii, Ryu Takeda, Toyotaka Torii, Hiroshi G. Okuno, Yuji Hasegawa, Hiroshi Tsujino: A robot uses its own microphone to synchronize its steps to musical beats while scatting and singing. IROS 2008: 2459-2464 | |
| c47 | Kazumasa Murata, Kazuhiro Nakadai, Kazuyoshi Yoshii, Ryu Takeda, Toyotaka Torii, Hiroshi G. Okuno, Yuji Hasegawa, Hiroshi Tsujino: A Robot Singer with Music Recognition Based on Real-Time Beat Tracking. ISMIR 2008: 199-204 | |
| 2007 | ||
| j5 | Jean-Marc Valin, Seiichi Yamamoto, Jean Rouat, François Michaud, Kazuhiro Nakadai, Hiroshi G. Okuno: Robust Recognition of Simultaneous Speech by a Mobile Robot. IEEE Transactions on Robotics 23(4): 742-752 (2007) | |
| c46 | Shun'ichi Yamamoto, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Design and implementation of a robot audition system for automatic speech recognition of simultaneous speech. ASRU 2007: 111-116 | |
| c45 | Kentaro Ishii, Yukiko Yamamoto, Michita Imai, Kazuhiro Nakadai: A Navigation System Using Ultrasonic Directional Speaker with Rotating Base. HCI (9) 2007: 526-535 | |
| c44 | Kazuhiro Nakadai, Ryota Sumiya, Mikio Nakano, Koichi Ichige, Yasuo Hirose, Hiroshi Tsujino: The Design of Phoneme Grouping for Coarse Phoneme Recognition. IEA/AIE 2007: 905-914 | |
| c43 | Kazuyoshi Yoshii, Kazuhiro Nakadai, Toyotaka Torii, Yuji Hasegawa, Hiroshi Tsujino, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: A biped robot that keeps steps in time with musical beats while listening to music with its own ears. IROS 2007: 1743-1750 | |
| c42 | Tomoaki Koiwa, Kazuhiro Nakadai, Jun-ichi Imura: Coarse speech recognition by audio-visual integration based on missing feature theory. IROS 2007: 1751-1756 | |
| c41 | Ryu Takeda, Kazuhiro Nakadai, Kazuhiro Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Exploiting known sound source signals to improve ICA-based robot audition in speech separation and recognition. IROS 2007: 1757-1762 | |
| c40 | Hirofumi Nakajima, Kazuhiro Nakadai, Yuji Hasegawa, Hiroshi Tsujino: Moving Sound Source Extraction by Time-Variant Beamforming. JSAI 2007: 47-53 | |
| 2006 | ||
| c39 | Yoshitaka Nishimura, Mitsuru Ishizuka, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino: Speech Recognition for a Humanoid with Motor Noise Utilizing Missing Feature Theory. Humanoids 2006: 26-33 | |
| c38 | Mikio Nakano, Atsushi Hoshino, Johane Takeuchi, Yuji Hasegawa, Toyotaka Torii, Kazuhiro Nakadai, Kazuhiro Kato, Hiroshi Tsujino: A Robot That Can Engage in Both Task-Oriented and Non-Task-Oriented Dialogues. Humanoids 2006: 404-411 | |
| c37 | Shun'ichi Yamamoto, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Ryu Takeda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Genetic Algorithm-Based Improvement of Robot Hearing Capabilities in Separating and Recognizing Simultaneous Speech Signals. IEA/AIE 2006: 207-217 | |
| c36 | Kazuhiro Nakadai, Hirofumi Nakajima, Masamitsu Murase, Hiroshi G. Okuno, Yuji Hasegawa, Hiroshi Tsujino: Real-Time Tracking of Multiple Sound Sources by Integration of In-Room and Robot-Embedded Microphone Arrays. IROS 2006: 852-859 | |
| c35 | Shun'ichi Yamamoto, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Real-Time Robot Audition System That Recognizes Simultaneous Speech in The Real World. IROS 2006: 5333-5338 | |
| c34 | Shun'ichi Yamamoto, Ryu Takeda, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Recognition of Simultaneous Speech by Estimating Reliability of Separated Signals for Robot Audition. PRICAI 2006: 484-494 | |
| 2005 | ||
| c33 | Shun'ichi Yamamoto, Jean-Marc Valin, Kazuhiro Nakadai, Jean Rouat, François Michaud, Tetsuya Ogata, Hiroshi G. Okuno: Enhanced Robot Speech Recognition Based on Microphone Array Source Separation and Missing Feature Theory. ICRA 2005: 1477-1482 | |
| c32 | Kazuhiro Nakadai, Hiroshi Tsujino: Towards New Human-Humanoid Communication: Listening During Speaking by Using Ultrasonic Directional Speaker. ICRA 2005: 1483-1488 | |
| c31 | Masamitsu Murase, Shun'ichi Yamamoto, Jean-Marc Valin, Kazuhiro Nakadai, Kentaro Yamada, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Multiple moving speaker tracking by microphone array on mobile robot. INTERSPEECH 2005: 249-252 | |
| c30 | Kazuhiro Nakadai, Hirofumi Nakajima, Kentaro Yamada, Yuji Hasegawa, Takahiro Nakamura, Hiroshi Tsujino: Sound source tracking with directivity pattern estimation using a 64 ch microphone array. IROS 2005: 1690-1696 | |
| c29 | Shunsuke Kurotaki, Noriaki Suzuki, Kazuhiro Nakadai, Hiroshi G. Okuno, Hideharu Amano: Implementation of active direction-pass filter on dynamically reconfigurable processor. IROS 2005: 3175-3180 | |
| c28 | Mikio Nakano, Yuji Hasegawa, Kazuhiro Nakadai, Takahiro Nakamura, Johane Takeuchi, Toyotaka Torii, Hiroshi Tsujino, Naoyuki Kanda, Hiroshi G. Okuno: A two-layer model for behavior and dialogue planning in conversational service robots. IROS 2005: 3329-3335 | |
| c27 | Shun'ichi Yamamoto, Kazuhiro Nakadai, Jean-Marc Valin, Jean Rouat, François Michaud, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Making a robot recognize three simultaneous sentences in real-time. IROS 2005: 4040-4045 | |
| 2004 | ||
| j4 | Hiroshi G. Okuno, Kazuhiro Nakadai, Tino Lourens, Hiroaki Kitano: Sound and Visual Tracking for Humanoid Robot. Appl. Intell. 20(3): 253-266 (2004) | |
| j3 | Hiroshi G. Okuno, Kazuhiro Nakadai, Hiroaki Kitano: Effects of increasing modalities in recognizing three simultaneous speeches. Speech Communication 43(4): 347-359 (2004) | |
| j2 | Kazuhiro Nakadai, Daisuke Matsuura, Hiroshi G. Okuno, Hiroshi Tsujino: Improvement of recognition of simultaneous speech signals using AV integration and scattering theory for humanoid robots. Speech Communication 44(1-4): 97-112 (2004) | |
| c26 | Shun'ichi Yamamoto, Kazuhiro Nakadai, Hiroshi Tsujino, Toshio Yokoyama, Hiroshi G. Okuno: Improvement of Robot Audition by Interfacing Sound Source Separation and Automatic Speech Recognition with Missing Feature Theory. ICRA 2004: 1517-1523 | |
| c25 | Tokitomo Ariyoshi, Kazuhiro Nakadai, Hiroshi Tsujino: Multimodal expression for humanoid robots by integration of human speech mimicking and facial color. INTERSPEECH 2004 | |
| c24 | Shun'ichi Yamamoto, Kazuhiro Nakadai, Hiroshi Tsujino, Hiroshi G. Okuno: Assessment of general applicability of robot audition system by recognizing three simultaneous speeches. IROS 2004: 2111-2116 | |
| 2003 | ||
| j1 | Hiroshi G. Okuno, Kazuhiro Nakadai, Ken-ichi Hidai, Hiroshi Mizoguchi, Hiroaki Kitano: Human-robot non-verbal interaction empowered by real-time auditory and visual multiple-talker tracking. Advanced Robotics 17(2): 115-130 (2003) | |
| c23 | Hiroshi G. Okuno, Kazuhiro Nakadai, Hiroaki Kitano: Realizing personality in audio-visually triggered non-verbal behaviors. ICRA 2003: 392-397 | |
| c22 | Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: Robot recognizes three simultaneous speech by active audition. ICRA 2003: 398-405 | |
| c21 | Hiroshi G. Okuno, Kazuhiro Nakadai, Hiroaki Kitano: Design and Implementation of Personality of Humanoids in Human Humanoid Non-verbal Interaction. IEA/AIE 2003: 662-673 | |
| c20 | Kazuhiro Nakadai, Daisuke Matsuura, Hiroshi G. Okuno, Hiroshi Tsujino: Three simultaneous speech recognition by integration of active audition and face recognition for humanoid. INTERSPEECH 2003 | |
| c19 | Kazuhiro Nakadai, Daisuke Matsuura, Hiroshi G. Okuno, Hiroaki Kitano: Applying scattering theory to robot audition system: robust sound source localization and extraction. IROS 2003: 1147-1152 | |
| c18 | Hiroshi G. Okuno, Kazuhiro Nakadai: Real-Time Sound Source Localization and Separation Based on Active Audio-Visual Integration. IWANN (1) 2003: 118-125 | |
| 2002 | ||
| c17 | Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: Exploiting Auditory Fovea in Humanoid-Human Interaction. AAAI/IAAI 2002: 431-438 | |
| c16 | Kazuhiro Nakadai, Ken-ichi Hidai, Hiroshi G. Okuno, Hiroaki Kitano: Real-Time Speaker Localization and Speech Separation by Audio-Visual Integration. ICRA 2002: 1043-1049 | |
| c15 | Hiroshi G. Okuno, Kazuhiro Nakadai, Hiroaki Kitano: Social Interaction of Humanoid RobotBased on Audio-Visual Tracking. IEA/AIE 2002: 725-735 | |
| c14 | Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: Real-time sound source localization and separation for robot audition. INTERSPEECH 2002 | |
| c13 | Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: Auditory fovea based speech enhancement and its application to human-robot dialog system. INTERSPEECH 2002 | |
| c12 | Hiroshi G. Okuno, Kazuhiro Nakadai, Hiroaki Kitano: Realizing Audio-Visually Triggered ELIZA-Like Non-verbal Behaviors. PRICAI 2002: 552-562 | |
| 2001 | ||
| c11 | Tino Lourens, Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: A computational model of monkey grating cells for oriented repetitive alternating patterns. ESANN 2001: 315-322 | |
| c10 | Tino Lourens, Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: Graph extraction from color images. ESANN 2001: 329-334 | |
| c9 | Hiroshi G. Okuno, Kazuhiro Nakadai, Tino Lourens, Hiroaki Kitano: Sound and Visual Tracking for Humanoid Robot. IEA/AIE 2001: 640-650 | |
| c8 | Kazuhiro Nakadai, Ken-ichi Hidai, Hiroshi Mizoguchi, Hiroshi G. Okuno, Hiroaki Kitano: Real-Time Auditory and Visual Multiple-Object Tracking for Humanoids. IJCAI 2001: 1425-1436 | |
| c7 | Kazuhiro Nakadai, Ken-ichi Hidai, Hiroshi G. Okuno, Hiroaki Kitano: Real-time multiple speaker tracking by multi-modal integration for mobile robots. INTERSPEECH 2001: 1193-1196 | |
| c6 | Hiroshi G. Okuno, Kazuhiro Nakadai, Tino Lourens, Hiroaki Kitano: Separating three simultaneous speeches with two microphones by integrating auditory and visual processing. INTERSPEECH 2001: 2643-2646 | |
| 2000 | ||
| c5 | Kazuhiro Nakadai, Tino Lourens, Hiroshi G. Okuno, Hiroaki Kitano: Active Audition for Humanoid. AAAI/IAAI 2000: 832-839 | |
| c4 | Hiroaki Kitano, Hiroshi G. Okuno, Kazuhiro Nakadai, Iris Fermin, Theo Sabisch, Yukiko Nakagawa, Tatsuya Matsui: Designing a humanoid head for RoboCup challenge. Agents 2000: 17-18 | |
| c3 | Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: Humanoid Active Audition System Improved by the Cover Acoustics. PRICAI 2000: 544-554 | |
| c2 | Ian Frank, Kumiko Tanaka-Ishii, Hiroshi G. Okuno, Junichi Akita, Yukiko Nakagawa, Kazuaki Maeda, Kazuhiro Nakadai, Hiroaki Kitano: And the Fans Are Going Wild! SIG plus MIKE. RoboCup 2000: 139-148 | |
| 1995 | ||
| c1 | Kunio Kashino, Kazuhiro Nakadai, Tomoyoshi Kinoshita, Hidehiko Tanaka: Organization of Hierarchical Perceptual Sounds: Music Scene Analysis with Autonomous Processing Modules and a Quantitative Information Integration Mechanism. IJCAI 1995: 158-164 | |
Data released under the ODC-BY 1.0 license — See also our legal information page