Please note: This is a beta version of the new dblp website.
You can find the classic dblp view of this page here.
You can find the classic dblp view of this page here.
Hiroshi G. Okuno
2010 – today
- 2013
[j47]Kohei Nagira, Takuma Otsuka, Hiroshi G. Okuno: Nonparametric Bayesian sparse factor analysis for frequency domain blind source separation without permutation ambiguity. EURASIP J. Audio, Speech and Music Processing 2013: 4 (2013)
[c216]Ui-Hyun Kim, Kazuhiro Nakadai, Hiroshi G. Okuno: Improved Sound Source Localization and Front-Back Disambiguation for Humanoid Robots with Two Ears. IEA/AIE 2013: 282-291- 2012
[j46]Angelica Lim, Takeshi Mizumoto, Tetsuya Ogata, Hiroshi G. Okuno: A Musical Robot that Synchronizes with a Coplayer Using Non-Verbal Cues. Advanced Robotics 26(3-4): 363-381 (2012)
[j45]Akira Maezawa, Katsutoshi Itoyama, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Automated Violin Fingering Transcription Through Analysis of an Audio Recording. Computer Music Journal 36(3): 57-72 (2012)
[j44]Angelica Lim, Tetsuya Ogata, Hiroshi G. Okuno: Towards expressive musical robots: a cross-modal framework for emotional gesture, voice and music. EURASIP J. Audio, Speech and Music Processing 2012: 3 (2012)
[j43]Tatsuhiko Itohara, Takuma Otsuka, Takeshi Mizumoto, Angelica Lim, Tetsuya Ogata, Hiroshi G. Okuno: A multimodal tempo and beat-tracking system based on audiovisual information from live guitar performances. EURASIP J. Audio, Speech and Music Processing 2012: 6 (2012)
[j42]Kazunori Komatani, Mikio Nakano, Masaki Katsumaru, Kotaro Funakoshi, Tetsuya Ogata, Hiroshi G. Okuno: Automatic Allocation of Training Data for Speech Understanding Based on Multiple Model Combinations. IEICE Transactions 95-D(9): 2298-2307 (2012)
[j41]Ryu Takeda, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Efficient Blind Dereverberation and Echo Cancellation Based on Independent Component Analysis for Actual Acoustic Signals. Neural Computation 24(1): 234-272 (2012)
[j40]Shun Nishide, Jun Tani, Toru Takahashi, Hiroshi G. Okuno, Tetsuya Ogata: Tool-Body Assimilation of Humanoid Robot Using a Neurodynamical System. IEEE T. Autonomous Mental Development 4(2): 139-149 (2012)
[c215]Takuma Otsuka, Katsuhiko Ishiguro, Hiroshi Sawada, Hiroshi G. Okuno: Bayesian Unification of Sound Source Localization and Separation with Permutation Resolution. AAAI 2012
[c214]Naoki Hirayama, Shinsuke Mori, Hiroshi G. Okuno: Statistical Method of Building Dialect Language Models for ASR Systems. COLING 2012: 1179-1194
[c213]Angelica Lim, Hiroshi G. Okuno: Using Speech Data to Recognize Emotion in Human Gait. HBU 2012: 52-64
[c212]Kohei Nagira, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno: Complex Extension of Infinite Sparse Factor Analysis for Blind Speech Separation. LVA/ICA 2012: 388-396
[c211]Yasuharu Hirasawa, Naoki Yasuraoka, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno: A GMM Sound Source Model for Blind Speech Separation in Under-determined Conditions. LVA/ICA 2012: 446-453
[c210]Daichi Sakaue, Katsutoshi Itoyama, Tetsuya Ogata, Hiroshi G. Okuno: Initialization-robust multipitch estimation based on latent harmonic allocation using overtone corpus. ICASSP 2012: 425-428
[c209]Louis-Kenzo Cahier, Tetsuya Ogata, Hiroshi G. Okuno: Incremental probabilistic geometry estimation for robot scene understanding. ICRA 2012: 3625-3630
[c208]Katsutoshi Itoyama, Tetsuya Ogata, Hiroshi G. Okuno: Automatic Chord Recognition Based on Probabilistic Integration of Acoustic Features, Bass Sounds, and Chord Transition. IEA/AIE 2012: 58-67
[c207]Shun Nishide, Jun Tani, Hiroshi G. Okuno, Tetsuya Ogata: Self-organization of object features representing motion using Multiple Timescales Recurrent Neural Network. IJCNN 2012: 1-8
[c206]Harumitsu Nobuta, Kenta Kawamoto, Kuniaki Noda, Kohtaro Sabe, Shun Nishide, Hiroshi G. Okuno, Tetsuya Ogata: Body area segmentation from visual scene based on predictability of neuro-dynamical system. IJCNN 2012: 1-8
[c205]João Lobato Oliveira, Gökhan Ince, Keisuke Nakamura, Kazuhiro Nakadai, Hiroshi G. Okuno, Luís Paulo Reis, Fabien Gouyon: Live assessment of beat tracking for robot audition. IROS 2012: 992-997
[c204]Takeshi Mizumoto, Tetsuya Ogata, Hiroshi G. Okuno: Who is the leader in a multiperson ensemble? - Multiperson human-robot ensemble model with leaderness -. IROS 2012: 1413-1419
[c203]Yusuke Yamamura, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno: Sound sources selection system by using onomatopoeic querries from multiple sound sources. IROS 2012: 2364-2369
[c202]Takuma Otsuka, Katsuhiko Ishiguro, Hiroshi Sawada, Hiroshi G. Okuno: Unified auditory functions based on Bayesian topic model. IROS 2012: 2370-2376
[c201]Daichi Sakaue, Takuma Otsuka, Katsutoshi Itoyama, Hiroshi G. Okuno: Bayesian Nonnegative Harmonic-Temporal Factorization and Its Application to Multipitch Analysis. ISMIR 2012: 91-96
[c200]João Lobato Oliveira, Gökhan Ince, Keisuke Nakamura, Kazuhiro Nakadai, Hiroshi G. Okuno, Luís Paulo Reis, Fabien Gouyon: An active audition framework for auditory-driven HRI: Application to interactive robot dancing. RO-MAN 2012: 1078-1085
[c199]Kohei Nagira, Takuma Otsuka, Hiroshi G. Okuno: Infinite Sparse Factor Analysis for Blind Source Separation in Reverberant Environments. SSPR/SPR 2012: 638-647- 2011
[j39]Yang Zhang, Tetsuya Ogata, Shun Nishide, Toru Takahashi, Hiroshi G. Okuno: Classification of Known and Unknown Environmental Sounds Based on Self-Organized Space Using a Recurrent Neural Network. Advanced Robotics 25(17): 2127-2141 (2011)
[j38]Shun Nishide, Jun Tani, Hiroshi G. Okuno, Tetsuya Ogata: Towards Written Text Recognition Based on Handwriting Experiences Using a Recurrent Neural Network. Advanced Robotics 25(17): 2173-2187 (2011)
[j37]Takuma Otsuka, Kazuhiro Nakadai, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno: Real-Time Audio-to-Score Alignment Using Particle Filter for Coplayer Music Robots. EURASIP J. Adv. Sig. Proc. 2011 (2011)
[j36]Hiromasa Fujihara, Masataka Goto, Jun Ogata, Hiroshi G. Okuno: LyricSynchronizer: Automatic Synchronization System Between Musical Audio Signals and Lyrics. J. Sel. Topics Signal Processing 5(6): 1252-1261 (2011)
[j35]Mikio Nakano, Yuji Hasegawa, Kotaro Funakoshi, Johane Takeuchi, Toyotaka Torii, Kazuhiro Nakadai, Naoyuki Kanda, Kazunori Komatani, Hiroshi G. Okuno, Hiroshi Tsujino: A multi-expert model for dialogue and behavior control of conversational robots and agents. Knowl.-Based Syst. 24(2): 248-256 (2011)
[j34]Wataru Hinoshita, Hiroaki Arie, Jun Tani, Hiroshi G. Okuno, Tetsuya Ogata: Emergence of hierarchical structure mirroring linguistic composition in a recurrent neural network. Neural Networks 24(4): 311-320 (2011)
[j33]Takuya Yoshioka, Tomohiro Nakatani, Masato Miyoshi, Hiroshi G. Okuno: Blind Separation and Dereverberation of Speech Mixtures by Joint Optimization. IEEE Transactions on Audio, Speech & Language Processing 19(1): 69-84 (2011)
[c198]Angelica Lim, Tetsuya Ogata, Hiroshi G. Okuno: Converting emotional voice to motion for robot telepresence. Humanoids 2011: 472-479
[c197]Yang Zhang, Shun Nishide, Toru Takahashi, Hiroshi G. Okuno, Tetsuya Ogata: Cluster Self-organization of Known and Unknown Environmental Sounds Using Recurrent Neural Network. ICANN (1) 2011: 167-175
[c196]Akira Maezawa, Hiroshi G. Okuno, Tetsuya Ogata, Masataka Goto: Polyphonic audio-to-score alignment based on Bayesian Latent Harmonic Allocation Hidden Markov Model. ICASSP 2011: 185-188
[c195]Naoki Yasuraoka, Hirokazu Kameoka, Takuya Yoshioka, Hiroshi G. Okuno: I-Divergence-based dereverberation method with auxiliary function approach. ICASSP 2011: 369-372
[c194]Katsutoshi Itoyama, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Simultaneous processing of sound source separation and musical instrument identification using Bayesian spectral modeling. ICASSP 2011: 3816-3819
[c193]Hiromitsu Awano, Shun Nishide, Hiroaki Arie, Jun Tani, Toru Takahashi, Hiroshi G. Okuno, Tetsuya Ogata: Use of a Sparse Structure to Improve Learning Performance of Recurrent Neural Networks. ICONIP (3) 2011: 323-331
[c192]Takeshi Mizumoto, Kazuhiro Nakadai, Takami Yoshida, Ryu Takeda, Takuma Otsuka, Toru Takahashi, Hiroshi G. Okuno: Design and implementation of selectable sound separation on the Texai telepresence system using HARK. ICRA 2011: 2130-2137
[c191]Nobuhide Yamakawa, Toru Takahashi, Tetsuro Kitahara, Tetsuya Ogata, Hiroshi G. Okuno: Environmental Sound Recognition for Robot Audition Using Matching-Pursuit. IEA/AIE (2) 2011: 1-10
[c190]Yasuharu Hirasawa, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno: Robot with Two Ears Listens to More than Two Simultaneous Utterances by Exploiting Harmonic Structures. IEA/AIE (1) 2011: 348-358
[c189]Yasuharu Hirasawa, Naoki Yasuraoka, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno: Fast and Simple Iterative Algorithm of Lp-Norm Minimization for Under-Determined Speech Separation. INTERSPEECH 2011: 1745-1748
[c188]Takuma Otsuka, Kazuhiro Nakadai, Tetsuya Ogata, Hiroshi G. Okuno: Bayesian Extension of MUSIC for Sound Source Localization and Tracking. INTERSPEECH 2011: 3109-3112
[c187]Tatsuhiko Itohara, Takuma Otsuka, Takeshi Mizumoto, Tetsuya Ogata, Hiroshi G. Okuno: Particle-filter based audio-visual beat-tracking for music robot ensemble with human guitarist. IROS 2011: 118-124
[c186]Ui-Hyun Kim, Takeshi Mizumoto, Tetsuya Ogata, Hiroshi G. Okuno: Improvement of speaker localization by considering multipath interference of sound wave for binaural robot audition. IROS 2011: 2910-2915
[c185]Takuma Otsuka, Kazuhiro Nakadai, Tetsuya Ogata, Hiroshi G. Okuno: Incremental Bayesian Audio-to-Score Alignment with Flexible Harmonic Structure Models. ISMIR 2011: 525-530
[c184]Naoki Nishikawa, Katsutoshi Itoyama, Hiromasa Fujihara, Masataka Goto, Tetsuya Ogata, Hiroshi G. Okuno: A musical mood trajectory estimation method using lyrics and acoustic features. MIRUM 2011: 51-56
[c183]Mikio Nakano, Shun Sato, Kazunori Komatani, Kyoko Matsuyama, Kotaro Funakoshi, Hiroshi G. Okuno: A Two-Stage Domain Selection Framework for Extensible Multi-Domain Spoken Dialogue Systems. SIGDIAL Conference 2011: 18-29
[c182]Shun Nishide, Hiroshi G. Okuno, Tetsuya Ogata, Jun Tani: Handwriting prediction based character recognition using recurrent neural network. SMC 2011: 2549-2554- 2010
[j32]Kazuhiro Nakadai, Toru Takahashi, Hiroshi G. Okuno, Hirofumi Nakajima, Yuji Hasegawa, Hiroshi Tsujino: Design and Implementation of Robot Audition System 'HARK' - Open Source Software for Listening to Three Simultaneous Speakers. Advanced Robotics 24(5-6): 739-761 (2010)
[j31]Kazunori Komatani, Yuichiro Fukubayashi, Satoshi Ikeda, Tetsuya Ogata, Hiroshi G. Okuno: Selecting Help Messages by Using Robust Grammar Verification for Handling Out-of-Grammar Utterances in Spoken Dialogue Systems. IEICE Transactions 93-D(12): 3359-3367 (2010)
[j30]Tetsuya Ogata, Shun Nishide, Hideki Kozima, Kazunori Komatani, Hiroshi G. Okuno: Inter-modality mapping in robot with recurrent neural network. Pattern Recognition Letters 31(12): 1560-1569 (2010)
[j29]Hiromasa Fujihara, Masataka Goto, Tetsuro Kitahara, Hiroshi G. Okuno: A Modeling of Singing Voice Robust to Accompaniment Sounds and Its Application to Singer Identification and Vocal-Timbre-Similarity-Based Music Information Retrieval. IEEE Transactions on Audio, Speech & Language Processing 18(3): 638-648 (2010)
[c181]Shun Shiramatsu, Tadachika Ozono, Toramatsu Shintani, Hiroshi G. Okuno: A Corpus-Based Analysis of Coreferential Recency Effect in Japanese Discourse for Tracking Dynamic Topic. ACIS-ICIS 2010: 645-650
[c180]Takuma Otsuka, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Design and Implementation of Two-level Synchronization for Interactive Music Robot. AAAI 2010
[c179]Kazunori Komatani, Masaki Katsumaru, Mikio Nakano, Kotaro Funakoshi, Tetsuya Ogata, Hiroshi G. Okuno: Automatic Allocation of Training Data for Rapid Prototyping of Speech Understanding based on Multiple Model Combination. COLING (Posters) 2010: 579-587
[c178]Naoki Yasuraoka, Takuya Yoshioka, Tomohiro Nakatani, Atsushi Nakamura, Hiroshi G. Okuno: Music dereverberation using harmonic structure source model and Wiener filter. ICASSP 2010: 53-56
[c177]Takuya Yoshioka, Tomohiro Nakatani, Hiroshi G. Okuno: Noisy speech enhancement based on prior knowledge about spectral envelope and harmonic structure. ICASSP 2010: 4270-4273
[c176]Toru Takahashi, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Improvement in listening capability for humanoid robot HRP-2. ICRA 2010: 470-475
[c175]Ryu Takeda, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Upper-limit evaluation of robot audition based on ICA-BSS in multi-source, barge-in and highly reverberant conditions. ICRA 2010: 4366-4371
[c174]Wataru Hinoshita, Hiroaki Arie, Jun Tani, Tetsuya Ogata, Hiroshi G. Okuno: Recognition and Generation of Sentences through Self-organizing Linguistic Hierarchy Using MTRNN. IEA/AIE (3) 2010: 42-51
[c173]Takami Yoshida, Kazuhiro Nakadai, Hiroshi G. Okuno: An Improvement in Audio-Visual Voice Activity Detection for Automatic Speech Recognition. IEA/AIE (1) 2010: 51-61
[c172]Takuma Otsuka, Takeshi Mizumoto, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Music-Ensemble Robot That Is Capable of Playing the Theremin While Listening to the Accompanied Music. IEA/AIE (1) 2010: 102-112
[c171]Akira Maezawa, Katsutoshi Itoyama, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Violin Fingering Estimation Based on Violin Pedagogical Fingering Model Constrained by Bowed Sequence Estimation from Audio Input. IEA/AIE (3) 2010: 249-259
[c170]Shun Shiramatsu, Jun Takasaki, Tatiana Zidrasco, Tadachika Ozono, Toramatsu Shintani, Hiroshi G. Okuno: System for Supporting Web-based Public Debate Using Transcripts of Face-to-Face Meeting. IEA/AIE (3) 2010: 311-320
[c169]Kyoko Matsuyama, Kazunori Komatani, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno: Improving Identification Accuracy by Extending Acceptable Utterances in Spoken Dialogue System Using Barge-in Timing. IEA/AIE (2) 2010: 585-594
[c168]Nobuhide Yamakawa, Tetsuro Kitahara, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Effects of modelling within- and between-frame temporal variations in power spectra on non-verbal sound recognition. INTERSPEECH 2010: 2342-2345
[c167]Kyoko Matsuyama, Kazunori Komatani, Ryu Takeda, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno: Analyzing user utterances in barge-in-able spoken dialogue system for improving identification accuracy. INTERSPEECH 2010: 3050-3053
[c166]Yasuharu Hirasawa, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Exploiting harmonic structures to improve separating simultaneous speech in under-determined conditions. IROS 2010: 450-457
[c165]T. Takahashi, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: An improvement in automatic speech recognition using soft missing feature masks for robot audition. IROS 2010: 964-969
[c164]Takami Yoshida, Kazuhiro Nakadai, Hiroshi G. Okuno: Two-layered audio-visual speech recognition for robots in noisy environments. IROS 2010: 988-993
[c163]Ryu Takeda, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Speedup and performance improvement of ICA-based robot audition by parallel and resampling-based block-wise processing. IROS 2010: 1949-1956
[c162]Takeshi Mizumoto, Takuma Otsuka, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Human-robot ensemble between robot thereminist and human percussionist using coupled oscillator model. IROS 2010: 1957-1963
[c161]Angelica Lim, Takeshi Mizumoto, Louis-Kenzo Cahier, Takuma Otsuka, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Robot musical accompaniment: integrating audio and visual cues for real-time synchronization with a human flutist. IROS 2010: 1964-1969
[c160]Shun Nishide, Tetsuya Ogata, Jun Tani, T. Takahashi, Kazunori Komatani, Hiroshi G. Okuno: Motion generation based on reliable predictability using self-organized object features. IROS 2010: 3453-3458
[c159]Akira Maezawa, Masataka Goto, Hiroshi G. Okuno: Query-by-conducting: An Interface to Retrieve Classical-music Interpretations by Real-time Tempo Input. ISMIR 2010: 477-482
[c158]Kazunori Komatani, Hiroshi G. Okuno: Online Error Detection of Barge-In Utterances by Using Individual Users' Utterance Histories in Spoken Dialogue System. SIGDIAL Conference 2010: 289-296
[c157]Hiromitsu Awano, Tetsuya Ogata, Shun Nishide, Toru Takahashi, Kazunori Komatani, Hiroshi G. Okuno: Human-robot cooperation in arrangement of objects using confidence measure of neuro-dynamical system. SMC 2010: 2533-2538
2000 – 2009
- 2009
[j28]Hyun-Don Kim, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Human Tracking System Integrating Sound and Face Localization Using an Expectation-Maximization Algorithm in Real Environments. Advanced Robotics 23(6): 629-653 (2009)
[j27]Shun Nishide, Tetsuya Ogata, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Self-organization of Dynamic Object Features Based on Bidirectional Training. Advanced Robotics 23(15): 2035-2057 (2009)
[j26]Hyun-Don Kim, Jinsung Kim, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Target Speech Detection and Separation for Communication with Humanoid Robots in Noisy Home Environments. Advanced Robotics 23(15): 2093-2111 (2009)
[j25]Katsutoshi Itoyama, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Parameter Estimation for Harmonic and Inharmonic Models by Using Timbre Feature Distributions. JIP 17: 191-201 (2009)
[c156]Shun Shiramatsu, Tadachika Ozono, Toramatsu Shintani, Kazunori Komatani, Tetsuya Ogata, Toru Takahashi, Hiroshi G. Okuno: Development of a Meeting Browser towards Supporting Public Involvement. CSE (4) 2009: 717-722
[c155]Ryu Takeda, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Automatic estimation of reverberation time with robot speech to improve ICA-based robot audition. Humanoids 2009: 250-255
[c154]Takuma Otsuka, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Voice quality manipulation for humanoid robots consistent with their head movements. Humanoids 2009: 405-410
[c153]Takami Yoshida, Kazuhiro Nakadai, Hiroshi G. Okuno: Automatic speech recognition improved by two-layered audio-visual integration for robot audition. Humanoids 2009: 604-609
[c152]Ryu Takeda, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: ICA-based efficient blind dereverberation and echo cancellation method for barge-in-able robot audition. ICASSP 2009: 3677-3680
[c151]Tetsuya Ogata, Ryunosuke Yokoya, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Prediction and imitation of other's motions by reusing own forward-inverse model in robots. ICRA 2009: 4144-4149
[c150]Hisashi Kanda, Tetsuya Ogata, Toru Takahashi, Kazunori Komatani, Hiroshi G. Okuno: Continuous vocal imitation with self-organized vowel spaces in Recurrent Neural Network. ICRA 2009: 4438-4443
[c149]Masaki Katsumaru, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Adjusting Occurrence Probabilities of Automatically-Generated Abbreviated Words in Spoken Dialogue Systems. IEA/AIE 2009: 481-490
[c148]Kyoko Matsuyama, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Enabling a user to specify an item at any time during system enumeration - item identification for barge-in-able conversational dialogue systems. INTERSPEECH 2009: 252-255
[c147]Masaki Katsumaru, Mikio Nakano, Kazunori Komatani, Kotaro Funakoshi, Tetsuya Ogata, Hiroshi G. Okuno: Improving speech understanding accuracy with limited training data using multiple language models and multiple understanding models. INTERSPEECH 2009: 2735-2738
[c146]Ryu Takeda, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Step-size parameter adaptation of multi-channel semi-blind ICA with piecewise linear model for barge-in-able robot audition. IROS 2009: 2277-2282
[c145]Takuma Otsuka, Toru Takahashi, Hiroshi G. Okuno, Kazunori Komatani, Tetsuya Ogata, Kazumasa Murata, Kazuhiro Nakadai: Incremental polyphonic audio to score alignment using beat tracking for singer robots. IROS 2009: 2289-2296
[c144]Takeshi Mizumoto, Hiroshi Tsujino, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno: Thereminist robot: Development of a robot theremin player with feedforward and feedback arm control based on a Theremin's pitch model. IROS 2009: 2297-2302
[c143]Toru Takahashi, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Missing-feature-theory-based robust simultaneous speech recognition system with non-clean speech acoustic model. IROS 2009: 2730-2735
[c142]Wataru Hinoshita, Tetsuya Ogata, Hideki Kozima, Hisashi Kanda, Toru Takahashi, Hiroshi G. Okuno: Emergence of evolutionary interaction with voice and motion between two robots using RNN. IROS 2009: 4186-4192
[c141]Shun Nishide, Tatsuhiro Nakagawa, Tetsuya Ogata, Jun Tani, Toru Takahashi, Hiroshi G. Okuno: Modeling tool-body assimilation using second-order Recurrent Neural Network. IROS 2009: 5376-5381
[c140]Hisashi Kanda, Tetsuya Ogata, Toru Takahashi, Kazunori Komatani, Hiroshi G. Okuno: Phoneme acquisition model based on vowel imitation using Recurrent Neural Network. IROS 2009: 5388-5393
[c139]Akira Maezawa, Katsutoshi Itoyama, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno: Bowed String Sequence Estimation of a Violin Based on Adaptive Audio Signal Classification and Context-Dependent Error Correction. ISM 2009: 9-16
[c138]Hiroshi G. Okuno, Kazuhiro Nakadai, Hyun-Don Kim: Robot Audition: Missing Feature Theory Approach and Active Audition. ISRR 2009: 227-244
[c137]Naoki Yasuraoka, Takehiro Abe, Katsutoshi Itoyama, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno: Changing timbre and phrase in existing musical performances as you like: manipulations of single part using harmonic and inharmonic models. ACM Multimedia 2009: 203-212
[c136]Masaki Katsumaru, Mikio Nakano, Kazunori Komatani, Kotaro Funakoshi, Tetsuya Ogata, Hiroshi G. Okuno: A Speech Understanding Framework that Uses Multiple Language Models and Multiple Understanding Models. HLT-NAACL (Short Papers) 2009: 133-136
[c135]Kazunori Komatani, Satoshi Ikeda, Yuichiro Fukubayashi, Tetsuya Ogata, Hiroshi G. Okuno: Ranking Help Message Candidates Based on Robust Grammar Verification Results and Utterance History in Spoken Dialogue Systems. SIGDIAL Conference 2009: 314-321
[c134]Kazunori Komatani, Tatsuya Kawahara, Hiroshi G. Okuno: A Model of Temporally Changing User Behaviors in a Deployed Spoken Dialogue System. UMAP 2009: 409-414
[c133]Hiromasa Fujihara, Masataka Goto, Hiroshi G. Okuno: A novel framework for recognizing phonemes of singing voice in polyphonic music. WASPAA 2009: 17-20
[c132]Takuya Yoshioka, Hirokazu Kameoka, Tomohiro Nakatani, Hiroshi G. Okuno: Statistical models for speech dereverberation. WASPAA 2009: 145-148- 2008
[j24]Shun Nishide, Tetsuya Ogata, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Predicting Object Dynamics From Visual Images Through Active Sensing Experiences. Advanced Robotics 22(5): 527-546 (2008)
[j23]Jean-Julien Aucouturier, Katsushi Ikeuchi, Hirohisa Hirukawa, Shinichiro Nakaoka, Takaaki Shiratori, Shunsuke Kudoh, Fumio Kanehiro, Tetsuya Ogata, Hideki Kozima, Hiroshi G. Okuno, Marek P. Michalowski, Yuta Ogai, Takashi Ikegami, Kazuhiro Kosuge, Takahiro Takeda, Yasuhisa Hirata: Cheek to Chip: Dancing Robots and AI's Future. IEEE Intelligent Systems 23(2): 74-84 (2008)
[j22]Kazunori Komatani, Satoshi Ikeda, Tetsuya Ogata, Hiroshi G. Okuno: Managing out-of-grammar utterances by topic estimation with domain extensibility in multi-domain spoken dialogue systems. Speech Communication 50(10): 863-870 (2008)
[j21]Kazuyoshi Yoshii, Masataka Goto, Kazuhiro Komatani, Tetsuya Ogata, Hiroshi G. Okuno: An Efficient Hybrid Music Recommender System Using an Incrementally Trainable Probabilistic Generative Model. IEEE Transactions on Audio, Speech & Language Processing 16(2): 435-447 (2008)
[j20]Shun Shiramatsu, Kazunori Komatani, Kôiti Hasida, Tetsuya Ogata, Hiroshi G. Okuno: A game-theoretic model of referential coherence and its empirical verification using large Japanese and English corpora. TSLP 5(3) (2008)
[c131]Kazumasa Murata, Kazumasa Nakadai, Ryu Takeda, Hiroshi G. Okuno, Toyotaka Torii, Yuji Hasegawa, Hiroshi Tsujino: A beat-tracking robot for human-robot interaction and its evaluation. Humanoids 2008: 79-84
[c130]Kazuhiro Nakadai, Hiroshi G. Okuno, Hirofumi Nakajima, Yuji Hasegawa, Hiroshi Tsujino: An open source software system for robot audition HARK and its evaluation. Humanoids 2008: 561-566
[c129]Shun Nishide, Tetsuya Ogata, Ryunosuke Yokoya, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Object dynamics prediction and motion generation based on reliable predictability. ICRA 2008: 1608-1614
[c128]Kazuhiro Nakadai, Shun'ichi Yamamoto, Hiroshi G. Okuno, Hirofumi Nakajima, Yuji Hasegawa, Hiroshi Tsujino: A robot referee for rock-paper-scissors sound games. ICRA 2008: 3469-3474
[c127]Hyun-Don Kim, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Two-channel-based voice activity detection for humanoid robots in noisy home environments. ICRA 2008: 3495-3501
[c126]Satoshi Ikeda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Integrating Topic Estimation and Dialogue History for Domain Selection in Multi-domain Spoken Dialogue Systems. IEA/AIE 2008: 294-304
[c125]Yuichiro Fukubayashi, Kazunori Komatani, Mikio Nakano, Kotaro Funakoshi, Hiroshi Tsujino, Tetsuya Ogata, Hiroshi G. Okuno: Rapid Prototyping of Robust Language Understanding Modules for Spoken Dialogue Systems. IJCNLP 2008: 210-216
[c124]Kazunori Komatani, Tatsuya Kawahara, Hiroshi G. Okuno: Predicting ASR errors by exploiting barge-in rate of individual users for spoken dialogue systems. INTERSPEECH 2008: 183-186
[c123]Masaki Katsumaru, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Expanding vocabulary for recognizing user's abbreviations of proper nouns without increasing ASR error rates in spoken dialogue systems. INTERSPEECH 2008: 187-190
[c122]Satoshi Ikeda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno, Hiroshi G. Okuno: Extensibility verification of robust domain selection against out-of-grammar utterances in multi-domain spoken dialogue system. INTERSPEECH 2008: 487-490
[c121]Satoshi Ikeda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno, Hiroshi G. Okuno: Extensibility verification of robust domain selection against out-of-grammar utterances in multi-domain spoken dialogue system. INTERSPEECH 2008: 487-490
[c120]Toru Takahashi, Shun'ichi Yamamoto, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Soft missing-feature mask generation for simultaneous speech recognition system in robots. INTERSPEECH 2008: 992-995
[c119]Shun Nishide, Tetsuya Ogata, Ryunosuke Yokoya, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Active sensing based dynamical object feature extraction. IROS 2008: 1-7
[c118]Takeshi Mizumoto, Ryu Takeda, Kazuyoshi Yoshii, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: A robot listens to music and counts its beats aloud by separating music from counting voice. IROS 2008: 1538-1543
[c117]Hyun-Don Kim, Jinsung Kim, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Target speech detection and separation for humanoid robots in sparse dialogue with noisy home environments. IROS 2008: 1705-1711
[c116]Hisashi Kanda, Tetsuya Ogata, Kazunori Komatani, Hiroshi G. Okuno: Segmenting acoustic signal with articulatory movement using Recurrent Neural Network for phoneme acquisition. IROS 2008: 1712-1717
[c115]Ryu Takeda, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Barge-in-able robot audition based on ICA and missing feature theory under semi-blind situation. IROS 2008: 1718-1723
[c114]Hyun-Don Kim, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Design and evaluation of two-channel-based sound source localization over entire azimuth range for moving talkers. IROS 2008: 2197-2203
[c113]Kazumasa Murata, Kazuhiro Nakadai, Kazuyoshi Yoshii, Ryu Takeda, Toyotaka Torii, Hiroshi G. Okuno, Yuji Hasegawa, Hiroshi Tsujino: A robot uses its own microphone to synchronize its steps to musical beats while scatting and singing. IROS 2008: 2459-2464
[c112]Yuji Kubota, Masatoshi Yoshida, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Design and Implementation of 3D Auditory Scene Visualizer towards Auditory Awareness with Face Tracking. ISM 2008: 468-476
[c111]Kouhei Sumi, Katsutoshi Itoyama, Kazuyoshi Yoshii, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Automatic Chord Recognition Based on Probabilistic Integration of Chord Transition and Bass Pitch Estimation. ISMIR 2008: 39-44
[c110]Katsutoshi Itoyama, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Instrument Equalizer for Query-by-Example Retrieval: Improving Sound Source Separation Based on Integrated Harmonic and Inharmonic Models. ISMIR 2008: 133-138
[c109]Kazumasa Murata, Kazuhiro Nakadai, Kazuyoshi Yoshii, Ryu Takeda, Toyotaka Torii, Hiroshi G. Okuno, Yuji Hasegawa, Hiroshi Tsujino: A Robot Singer with Music Recognition Based on Real-Time Beat Tracking. ISMIR 2008: 199-204
[c108]Yuji Kubota, Shun Shiramatsu, Masatoshi Yoshida, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: 3D Auditory Scene Visualizer with Face Tracking: Design and Implementation for Auditory Awareness Compensation. ISUC 2008: 42-49
[c107]Shun Shiramatsu, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: SalienceGraph: Visualizing Salience Dynamics of Written Discourse by Using Reference Probability and PLSA. PRICAI 2008: 890-902- 2007
[j19]Ryunosuke Yokoya, Tetsuya Ogata, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Experience-based imitation using RNNPB. Advanced Robotics 21(12): 1351-1367 (2007)
[j18]Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Instrument Identification in Polyphonic Music: Feature Weighting to Minimize Influence of Sound Overlaps. EURASIP J. Adv. Sig. Proc. 2007 (2007)
[j17]Taro Watanabe, Kenji Imamura, Eiichiro Sumita, Hiroshi G. Okuno: Statistical machine translation using hierarchical phrase alignment. Systems and Computers in Japan 38(6): 70-79 (2007)
[j16]Kazuyoshi Yoshii, Masataka Goto, Hiroshi G. Okuno: Drum Sound Recognition for Polyphonic Audio Signals by Adaptation and Matching of Spectrogram Templates With Harmonic Structure Suppression. IEEE Transactions on Audio, Speech & Language Processing 15(1): 333-345 (2007)
[j15]Jean-Marc Valin, Seiichi Yamamoto, Jean Rouat, François Michaud, Kazuhiro Nakadai, Hiroshi G. Okuno: Robust Recognition of Simultaneous Speech by a Mobile Robot. IEEE Transactions on Robotics 23(4): 742-752 (2007)
[c106]Shun'ichi Yamamoto, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Design and implementation of a robot audition system for automatic speech recognition of simultaneous speech. ASRU 2007: 111-116
[c105]Katsutoshi Itoyama, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Integration and Adaptation of Harmonic and Inharmonic Models for Separating Polyphonic Musical Signals. ICASSP (1) 2007: 57-60
[c104]Hisashi Kanda, Tetsuya Ogata, Kazunori Komatani, Hiroshi G. Okuno: Vowel Imitation Using Vocal Tract Model and Recurrent Neural Network. ICONIP (2) 2007: 222-232
[c103]Haruhiko Niwa, Tetsuya Ogata, Kazunori Komatani, Hiroshi G. Okuno: Distance Estimation of Hidden Objects Based on Acoustical Holography by applying Acoustic Diffraction of Audible Sound. ICRA 2007: 423-428
[c102]Tetsuya Ogata, Shohei Matsumoto, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Human-Robot Cooperation using Quasi-symbols Generated by RNNPB Model. ICRA 2007: 2156-2161
[c101]Shun Nishide, Tetsuya Ogata, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Predicting Object Dynamics from Visual Images through Active Sensing Experiences. ICRA 2007: 2501-2506
[c100]Hyun-Don Kim, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Real-Time Auditory and Visual Talker Tracking Through Integrating EM Algorithm and Particle Filter. IEA/AIE 2007: 280-290
[c99]Ryu Takeda, Shun'ichi Yamamoto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Evaluation of Two Simultaneous Continuous Speech Recognition with ICA BSS and MFT-Based ASR. IEA/AIE 2007: 384-394
[c98]Kazunori Komatani, Tatsuya Kawahara, Hiroshi G. Okuno: Analyzing temporal transition of real user's behaviors in a spoken dialogue system. INTERSPEECH 2007: 142-145
[c97]Satoshi Ikeda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Topic estimation with domain extensibility for guiding user's out-of-grammar utterances in multi-domain spoken dialogue systems. INTERSPEECH 2007: 2561-2564
[c96]Ryunosuke Yokoya, Tetsuya Ogata, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Discovery of other individuals by projecting a self-model through imitation. IROS 2007: 1009-1014
[c95]Kazuyoshi Yoshii, Kazuhiro Nakadai, Toyotaka Torii, Yuji Hasegawa, Hiroshi Tsujino, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: A biped robot that keeps steps in time with musical beats while listening to music with its own ears. IROS 2007: 1743-1750
[c94]Ryu Takeda, Kazuhiro Nakadai, Kazuhiro Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Exploiting known sound source signals to improve ICA-based robot audition in speech separation and recognition. IROS 2007: 1757-1762
[c93]Hisashi Kanda, Tetsuya Ogata, Kazunori Komatani, Hiroshi G. Okuno: Vocal imitation using physical vocal tract model. IROS 2007: 1846-1851
[c92]Tetsuya Ogata, Masamitsu Murase, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Two-way translation of compound sentences and arm motions by recurrent neural networks. IROS 2007: 1858-1863
[c91]Hyun-Don Kim, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Auditory and visual integration based localization and tracking of humans in daily-life environments. IROS 2007: 2021-2027
[c90]Kazuyoshi Yoshii, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Improving Efficiency and Scalability of Model-Based Music Recommender System Based on Incremental Training. ISMIR 2007: 89-94
[c89]Kôiti Hasida, Shun Shiramatsu, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Meaning Games. JSAI 2007: 228-241
[e1]Hiroshi G. Okuno, Moonis Ali (Eds.): New Trends in Applied Artificial Intelligence, 20th International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2007, Kyoto, Japan, June 26-29, 2007, Proceedings. Lecture Notes in Computer Science 4570, Springer 2007, ISBN 978-3-540-73322-5- 2006
[j14]Takuya Yoshioka, Takafumi Hikichi, Masato Miyoshi, Hiroshi G. Okuno: Common Acoustical Pole Estimation from Multi-Channel Musical Audio Signals. IEICE Transactions 89-A(1): 240-247 (2006)
[j13]Takamichi Saito, Kentaro Umesawa, Hiroshi G. Okuno: A privacy-enhanced access control. Systems and Computers in Japan 37(5): 77-86 (2006)
[j12]Yasuhiro Akiba, Kenji Imamura, Eiichiro Sumita, Hiromi Nakaiwa, Shun'ichi Yamamoto, Hiroshi G. Okuno: Using multiple edit distances to automatically grade outputs from Machine translation systems. IEEE Transactions on Audio, Speech & Language Processing 14(2): 393-402 (2006)
[c88]Shun'ichi Yamamoto, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Ryu Takeda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Genetic Algorithm-Based Improvement of Robot Hearing Capabilities in Separating and Recognizing Simultaneous Speech Signals. IEA/AIE 2006: 207-217
[c87]Hiromasa Fujihara, Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Speaker identification under noisy environments by using harmonic structure extraction and reliable frame weighting. INTERSPEECH 2006
[c86]Yuichiro Fukubayashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Dynamic help generation by estimating user²s mental model in spoken dialogue systems. INTERSPEECH 2006
[c85]Ryu Takeda, Shun'ichi Yamamoto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Improving speech recognition of two simultaneous speech signals by integrating ICA BSS and automatic missing feature mask generation. INTERSPEECH 2006
[c84]Kazuhiro Nakadai, Hirofumi Nakajima, Masamitsu Murase, Hiroshi G. Okuno, Yuji Hasegawa, Hiroshi Tsujino: Real-Time Tracking of Multiple Sound Sources by Integration of In-Room and Robot-Embedded Microphone Arrays. IROS 2006: 852-859
[c83]Ryu Takeda, Shun'ichi Yamamoto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Missing-Feature based Speech Recognition for Two Simultaneous Speech Signals Separated by ICA with a pair of Humanoid Ears. IROS 2006: 878-885
[c82]Haruhiko Niwa, Tetsuya Ogata, Kazunori Komatani, Hiroshi G. Okuno: Multiple Acoustical Holography Method for Localization of Objects in Broad Range using Audible Sound. IROS 2006: 1145-1150
[c81]Ryunosuke Yokoya, Tetsuya Ogata, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Experience Based Imitation Using RNNPB. IROS 2006: 3669-3674
[c80]Shun'ichi Yamamoto, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Real-Time Robot Audition System That Recognizes Simultaneous Speech in The Real World. IROS 2006: 5333-5338
[c79]Hiromasa Fujihara, Masataka Goto, Jun Ogata, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Automatic Synchronization between Lyrics and Music CD Recordings Based on Viterbi Alignment of Segregated Vocal Signals. ISM 2006: 257-264
[c78]Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Musical Instrument Recognizer "Instrogram" and Its Application to Music Retrieval Based on Instrumentation Similarity. ISM 2006: 265-274
[c77]Katsutoshi Itoyama, Tetsuro Kitahara, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Automatic Feature Weighting in Automatic Transcription of Specified Part in Polyphonic Music. ISMIR 2006: 172-175
[c76]Kazuyoshi Yoshii, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Hybrid Collaborative and Content-based Music Recommendation Using Probabilistic Model with Latent User Preferences. ISMIR 2006: 296-301
[c75]Shun'ichi Yamamoto, Ryu Takeda, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Recognition of Simultaneous Speech by Estimating Reliability of Separated Signals for Robot Audition. PRICAI 2006: 484-494- 2005
[j11]Tetsuro Kitahara, Masataka Goto, Hiroshi G. Okuno: Pitch-Dependent Identification of Musical Instrument Sounds. Appl. Intell. 23(3): 267-275 (2005)
[j10]Tino Lourens, Emilia I. Barakova, Hiroshi G. Okuno, Hiroshi Tsujino: A computational model of monkey cortical grating cells. Biological Cybernetics 92(1): 61-70 (2005)
[j9]Kazunori Komatani, Shinichi Ueno, Tatsuya Kawahara, Hiroshi G. Okuno: User Modeling in Spoken Dialogue Systems to Generate Flexible Guidance. User Model. User-Adapt. Interact. 15(1-2): 169-183 (2005)
[c74]Shun'ichi Yamamoto, Jean-Marc Valin, Kazuhiro Nakadai, Jean Rouat, François Michaud, Tetsuya Ogata, Hiroshi G. Okuno: Enhanced Robot Speech Recognition Based on Microphone Array Source Separation and Missing Feature Theory. ICRA 2005: 1477-1482
[c73]Tsuyoshi Tasaki, Shohei Matsumoto, Hayato Ohba, Mitsuhiko Toda, Kazuhiro Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Distance-Based Dynamic Interaction of Humanoid Robot with Multiple People. IEA/AIE 2005: 111-120
[c72]Masamitsu Murase, Shun'ichi Yamamoto, Jean-Marc Valin, Kazuhiro Nakadai, Kentaro Yamada, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Multiple moving speaker tracking by microphone array on mobile robot. INTERSPEECH 2005: 249-252
[c71]Kazunori Komatani, Naoyuki Kanda, Tetsuya Ogata, Hiroshi G. Okuno: Contextual constraints based on dialogue models in database search task for spoken dialogue systems. INTERSPEECH 2005: 877-880
[c70]Tetsuya Ogata, Hayato Ohba, Jun Tani, Kazunori Komatani, Hiroshi G. Okuno: Extracting multi-modal dynamics of objects using RNNPB. IROS 2005: 966-971
[c69]Tsuyoshi Tasaki, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Spatially mapping of friendliness for human-robot interaction. IROS 2005: 1277-1282
[c68]Shunsuke Kurotaki, Noriaki Suzuki, Kazuhiro Nakadai, Hiroshi G. Okuno, Hideharu Amano: Implementation of active direction-pass filter on dynamically reconfigurable processor. IROS 2005: 3175-3180
[c67]Mikio Nakano, Yuji Hasegawa, Kazuhiro Nakadai, Takahiro Nakamura, Johane Takeuchi, Toyotaka Torii, Hiroshi Tsujino, Naoyuki Kanda, Hiroshi G. Okuno: A two-layer model for behavior and dialogue planning in conversational service robots. IROS 2005: 3329-3335
[c66]Shun'ichi Yamamoto, Kazuhiro Nakadai, Jean-Marc Valin, Jean Rouat, François Michaud, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Making a robot recognize three simultaneous sentences in real-time. IROS 2005: 4040-4045
[c65]Hiromasa Fujihara, Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Singer Identification Based on Accompaniment Sound Reduction and Reliable Frame Selection. ISMIR 2005: 329-336
[c64]Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Instrument Identification in Polyphonic Music: Feature Weighting with Mixed Sounds, Pitch-Dependent Timbre Modeling, and Use of Musical Context. ISMIR 2005: 558-563
[c63]Kenri Kodaka, Tetsuya Ogata, Hiroshi G. Okuno: Walking with body-sense in virtual space using the nonlinear oscillator. SMC 2005: 324-329- 2004
[j8]Hiroshi G. Okuno, Kazuhiro Nakadai, Tino Lourens, Hiroaki Kitano: Sound and Visual Tracking for Humanoid Robot. Appl. Intell. 20(3): 253-266 (2004)
[j7]Hiroshi G. Okuno, Kazuhiro Nakadai, Hiroaki Kitano: Effects of increasing modalities in recognizing three simultaneous speeches. Speech Communication 43(4): 347-359 (2004)
[j6]Kazuhiro Nakadai, Daisuke Matsuura, Hiroshi G. Okuno, Hiroshi Tsujino: Improvement of recognition of simultaneous speech signals using AV integration and scattering theory for humanoid robots. Speech Communication 44(1-4): 97-112 (2004)
[c62]Yasuhiro Akiba, Eiichiro Sumita, Hiromi Nakaiwa, Seiichi Yamamoto, Hiroshi G. Okuno: Using a Mixture of N-Best Lists from Multiple MT Systems in Rank-Sum-Based Confidence Measure for MT Outputs. COLING 2004
[c61]Kazunori Komatani, Teruhisa Misu, Tatsuya Kawahara, Hiroshi G. Okuno: Efficient Confirmation Strategy for Large-scale Text Retrieval Systems with Spoken Dialogue Interface. COLING 2004
[c60]Shun'ichi Yamamoto, Kazuhiro Nakadai, Hiroshi Tsujino, Toshio Yokoyama, Hiroshi G. Okuno: Improvement of Robot Audition by Interfacing Sound Source Separation and Automatic Speech Recognition with Missing Feature Theory. ICRA 2004: 1517-1523
[c59]Kazunori Komatani, Ryosuke Ito, Tatsuya Kawahara, Hiroshi G. Okuno: Recognition of Emotional States in Spoken Dialogue with a Robot. IEA/AIE 2004: 413-423
[c58]Kazushi Ishihara, Yuya Hattori, Tomohiro Nakatani, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Disambiguation in determining phonemes of sound-imitation words for environmental sound recognition. INTERSPEECH 2004
[c57]Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno, Tsuyoshi Tasaki, Takeshi Yamaguchi: Robot motion control using listener's back-channels and head gesture information. INTERSPEECH 2004
[c56]Shun'ichi Yamamoto, Kazuhiro Nakadai, Hiroshi Tsujino, Hiroshi G. Okuno: Assessment of general applicability of robot audition system by recognizing three simultaneous speeches. IROS 2004: 2111-2116
[c55]Kazuyoshi Yoshii, Masataka Goto, Hiroshi G. Okuno: Automatic Drum Sound Description for Real-World Music Using Template Adaptation and Matching Methods. ISMIR 2004
[c54]Takuya Yoshioka, Tetsuro Kitahara, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Automatic Chord Transcription with Concurrent Recognition of Chord Symbols and Boundaries. ISMIR 2004
[c53]Shinichi Ueno, Fumihiro Adachi, Kazunori Komatani, Tatsuya Kawahara, Hiroshi G. Okuno: Bus Information System Based on User Models and Dynamic Generation of VoiceXML Scripts. JSAI Workshops 2004: 46-60
[c52]Yasuhiro Akiba, Eiichiro Sumita, Hiromi Nakaiwa, Seiichi Yamamoto, Hiroshi G. Okuno: Incremental Methods to Select Test Sentences for Evaluating Translation Ability. LREC 2004
[c51]Kazushi Ishihara, Tomohiro Nakatani, Tetsuya Ogata, Hiroshi G. Okuno: Automatic Sound-Imitation Word Recognition from Environmental Sounds Focusing on Ambiguity Problem in Determining Phonemes. PRICAI 2004: 909-918- 2003
[j5]Hiroshi G. Okuno, Kazuhiro Nakadai, Ken-ichi Hidai, Hiroshi Mizoguchi, Hiroaki Kitano: Human-robot non-verbal interaction empowered by real-time auditory and visual multiple-talker tracking. Advanced Robotics 17(2): 115-130 (2003)
[c50]Kazunori Komatani, Shinichi Ueno, Tatsuya Kawahara, Hiroshi G. Okuno: Flexible Guidance Generation Using User Model in Spoken Dialogue Systems. ACL 2003: 256-263
[c49]Taro Watanabe, Eiichiro Sumita, Hiroshi G. Okuno: Chunk-Based Statistical Translation. ACL 2003: 303-310
[c48]Takamichi Saito, Kentaro Umesawa, Toshiyuki Kito, Hiroshi G. Okuno: Privacy-Enhanced SPKI Access Control on PKIX and Its Application to Web Server . AINA 2003: 696-703
[c47]Tetsuro Kitahara, Masataka Goto, Hiroshi G. Okuno: Musical instrument identification based on F0-dependent multivariate normal distribution. ICME 2003: 409-412
[c46]Hiroshi G. Okuno, Kazuhiro Nakadai, Hiroaki Kitano: Realizing personality in audio-visually triggered non-verbal behaviors. ICRA 2003: 392-397
[c45]Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: Robot recognizes three simultaneous speech by active audition. ICRA 2003: 398-405
[c44]Tetsuro Kitahara, Masataka Goto, Hiroshi G. Okuno: Pitch-Dependent Musical Instrument Identification and Its Application to Musical Sound Ontology. IEA/AIE 2003: 112-122
[c43]Hiroshi G. Okuno, Kazuhiro Nakadai, Hiroaki Kitano: Design and Implementation of Personality of Humanoids in Human Humanoid Non-verbal Interaction. IEA/AIE 2003: 662-673
[c42]Kazushi Ishihara, Yasushi Tsubota, Hiroshi G. Okuno: Automatic transformation of environmental sounds into sound-imitation words based on Japanese syllable structure. INTERSPEECH 2003
[c41]Kazunori Komatani, Shinichi Ueno, Tatsuya Kawahara, Hiroshi G. Okuno: User modeling in spoken dialogue systems for flexible guidance generation. INTERSPEECH 2003
[c40]Kazuhiro Nakadai, Daisuke Matsuura, Hiroshi G. Okuno, Hiroshi Tsujino: Three simultaneous speech recognition by integration of active audition and face recognition for humanoid. INTERSPEECH 2003
[c39]Kazuhiro Nakadai, Daisuke Matsuura, Hiroshi G. Okuno, Hiroaki Kitano: Applying scattering theory to robot audition system: robust sound source localization and extraction. IROS 2003: 1147-1152
[c38]Hiroshi G. Okuno, Kazuhiro Nakadai: Real-Time Sound Source Localization and Separation Based on Active Audio-Visual Integration. IWANN (1) 2003: 118-125- 2002
[c37]Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: Exploiting Auditory Fovea in Humanoid-Human Interaction. AAAI/IAAI 2002: 431-438
[c36]Kazunori Komatani, Tatsuya Kawahara, Ryosuke Ito, Hiroshi G. Okuno: Efficient Dialogue Strategy to Find Users' Intended Items from Information Query Results. COLING 2002
[c35]Kazuhiro Nakadai, Ken-ichi Hidai, Hiroshi G. Okuno, Hiroaki Kitano: Real-Time Speaker Localization and Speech Separation by Audio-Visual Integration. ICRA 2002: 1043-1049
[c34]Hiroshi G. Okuno, Kazuhiro Nakadai, Hiroaki Kitano: Social Interaction of Humanoid RobotBased on Audio-Visual Tracking. IEA/AIE 2002: 725-735
[c33]Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: Real-time sound source localization and separation for robot audition. INTERSPEECH 2002
[c32]Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: Auditory fovea based speech enhancement and its application to human-robot dialog system. INTERSPEECH 2002
[c31]Yoko Yamakata, Tatsuya Kawahara, Hiroshi G. Okuno: Belief network based disambiguation of object reference in spoken dialogue system for robot. INTERSPEECH 2002
[c30]Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: Auditory fovea based speech separation and its application to dialog system. IROS 2002: 1320-1325
[c29]Hiroshi G. Okuno, Kazuhiro Nakadai, Hiroaki Kitano: Realizing Audio-Visually Triggered ELIZA-Like Non-verbal Behaviors. PRICAI 2002: 552-562- 2001
[c28]Tino Lourens, Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: A computational model of monkey grating cells for oriented repetitive alternating patterns. ESANN 2001: 315-322
[c27]Tino Lourens, Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: Graph extraction from color images. ESANN 2001: 329-334
[c26]Tino Lourens, Hiroshi G. Okuno, Hiroaki Kitano: Automatic Graph Extraction from Color Images. ICIAP 2001: 302-308
[c25]Hiroshi G. Okuno, Kazuhiro Nakadai, Tino Lourens, Hiroaki Kitano: Sound and Visual Tracking for Humanoid Robot. IEA/AIE 2001: 640-650
[c24]Kazuhiro Nakadai, Ken-ichi Hidai, Hiroshi Mizoguchi, Hiroshi G. Okuno, Hiroaki Kitano: Real-Time Auditory and Visual Multiple-Object Tracking for Humanoids. IJCAI 2001: 1425-1436
[c23]Kazuhiro Nakadai, Ken-ichi Hidai, Hiroshi G. Okuno, Hiroaki Kitano: Real-time multiple speaker tracking by multi-modal integration for mobile robots. INTERSPEECH 2001: 1193-1196
[c22]Hiroshi G. Okuno, Kazuhiro Nakadai, Tino Lourens, Hiroaki Kitano: Separating three simultaneous speeches with two microphones by integrating auditory and visual processing. INTERSPEECH 2001: 2643-2646
[c21]Takamichi Saito, Kentaro Umesawa, Hiroshi G. Okuno: An Access Control with handling Private Information. IPDPS 2001: 172
[c20]Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: Epipolar geometry based sound localization and extraction for humanoid audition. IROS 2001: 1395-1401
[c19]Hiroshi G. Okuno, Kazuhiro Nakadai, Ken-ichi Hidai, Hiroshi Mizoguchi, Hiroaki Kitano: Human-robot interaction through real-time auditory and visual multiple-talker tracking. IROS 2001: 1402-1409
[c18]Tino Lourens, Hiroshi G. Okuno, Hiroaki Kitano: Detection of Oriented Repetitive Alternating Patterns in Color Images (A Computational Model of Monkey Grating Cells). IWANN (1) 2001: 95-107- 2000
[c17]Kazuhiro Nakadai, Tino Lourens, Hiroshi G. Okuno, Hiroaki Kitano: Active Audition for Humanoid. AAAI/IAAI 2000: 832-839
[c16]Hiroaki Kitano, Hiroshi G. Okuno, Kazuhiro Nakadai, Iris Fermin, Theo Sabisch, Yukiko Nakagawa, Tatsuya Matsui: Designing a humanoid head for RoboCup challenge. Agents 2000: 17-18
[c15]Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano: Humanoid Active Audition System Improved by the Cover Acoustics. PRICAI 2000: 544-554
[c14]Ian Frank, Kumiko Tanaka-Ishii, Hiroshi G. Okuno, Junichi Akita, Yukiko Nakagawa, Kazuaki Maeda, Kazuhiro Nakadai, Hiroaki Kitano: And the Fans Are Going Wild! SIG plus MIKE. RoboCup 2000: 139-148
[c13]Yukiko Nakagawa, Hiroshi G. Okuno, Hiroaki Kitano: Bridging Gap between the Simulation and Robotics with a Global Vision System. RoboCup 2000: 209-218
[c12]Takamichi Saito, Kentaro Umesawa, Hiroshi G. Okuno: Privacy-Enhanced Access Control by SPKI and Its Application to Web Server. WETICE 2000: 201-206
1990 – 1999
- 1999
[j4]Tomohiro Nakatani, Hiroshi G. Okuno: Harmonic sound stream segregation using localization and its application to speech stream segregation. Speech Communication 27(3-4): 209-222 (1999)
[j3]Hiroshi G. Okuno, Tomohiro Nakatani, Takeshi Kawabata: Listening to two simultaneous speeches. Speech Communication 27(3-4): 299-310 (1999)
[c11]Yukiko Nakagawa, Hiroshi G. Okuno, Hiroaki Kitano: Using Vision to Improve Sound Source Separation. AAAI/IAAI 1999: 768-775- 1998
[j2]Hiroshi G. Okuno, Shin-ichi Minato, Hideki Isozaki: On the Properties of Combination Set Operations. Inf. Process. Lett. 66(4): 195-199 (1998)
[c10]Tomohiro Nakatani, Hiroshi G. Okuno: Sound Ontology for Computational Auditory Scence Analysis. AAAI/IAAI 1998: 1004-1010- 1997
[c9]Hiroshi G. Okuno, Tomohiro Nakatani, Takeshi Kawabata: Understanding Three Simultaneous Speeches. IJCAI (1) 1997: 30-35- 1996
[c8]Hiroshi G. Okuno, Tomohiro Nakatani, Takeshi Kawabata: Interfacing Sound Stream Segregation to Automatic Speech Recognition - Preliminary Results on Listening to Several Sounds Simultaneously. AAAI/IAAI, Vol. 2 1996: 1082-1089
[c7]Hiroshi G. Okuno, Osamu Shimokuni, Hidehiko Tanaka: Design and Implementation of Multiple-Context Truth Maintenance System with Binary Decision Diagram. IEA/AIE 1996: 47-56
[c6]Hiroshi G. Okuno, Tomohiro Nakatani, Takeshi Kawabata: A new speech enhancement: speech stream segregation. ICSLP 1996- 1995
[c5]Tomohiro Nakatani, Hiroshi G. Okuno, Takeshi Kawabata: Residue-Driven Architecture for Computational Auditory Scene Analysis. IJCAI 1995: 165-174- 1994
[c4]Tomohiro Nakatani, Hiroshi G. Okuno, Takeshi Kawabata: Auditory Stream Segregation in Auditory Scene Analysis with a Multi-Agent System. AAAI 1994: 100-107
[c3]Tomohiro Nakatani, Takeshi Kawabata, Hiroshi G. Okuno: Unified architecture for auditory scene analysis and spoken language processing. ICSLP 1994
1980 – 1989
- 1987
[c2]Hiroshi G. Okuno, Nobuyasu Osato, Ikuo Takeuchi: Firmware approach to fast Lisp interpreter. MICRO 1987: 1-11- 1986
[j1]Ikuo Takeuchi, Hiroshi G. Okuno, Nobuyasu Ohsato: A List Processing Language TAO with Multiple Programming Paradigms. New Generation Comput. 4(4): 401-444 (1986)- 1984
[c1]Hiroshi G. Okuno, Ikuo Takeuchi, Nobuyasu Ohsato, Yasushi Hibino, Kazufumi Watanabe: TAO: Afst Interpreter-Centered Lisp System on Lisp Machine ELIS. LISP and Functional Programming 1984: 140-149
Coauthor Index
[j45] [j42] [j41] [j35] [c194] [c183] [j31] [j30] [c180] [c179] [c176] [c175] [c172] [c171] [c169] [c168] [c167] [c166] [c165] [c163] [c162] [c161] [c160] [c158] [c157] [j28] [j27] [j26] [j25] [c156] [c155] [c154] [c152] [c151] [c150] [c149] [c148] [c147] [c146] [c145] [c143] [c140] [c136] [c135] [c134] [j24] [j22] [j20] [c129] [c127] [c126] [c125] [c124] [c123] [c122] [c121] [c120] [c119] [c118] [c117] [c116] [c115] [c114] [c112] [c111] [c110] [c108] [c107] [j19] [j18] [c106] [c105] [c104] [c103] [c102] [c101] [c100] [c99] [c98] [c97] [c96] [c95] [c93] [c92] [c91] [c90] [c89] [c88] [c87] [c86] [c85] [c83] [c82] [c81] [c80] [c79] [c78] [c77] [c76] [c75] [j9] [c72] [c71] [c70] [c69] [c66] [c65] [c64] [c61] [c59] [c58] [c57] [c54] [c53] [c50] [c41] [c36]
[c216] [j41] [c205] [c200] [j37] [j35] [c192] [c188] [c185] [j32] [c180] [c176] [c175] [c173] [c172] [c165] [c164] [c163] [c162] [c155] [c154] [c153] [c152] [c146] [c145] [c143] [c138] [c130] [c128] [c120] [c115] [c113] [c109] [j15] [c106] [c95] [c94] [c88] [c84] [c80] [c75] [c74] [c72] [c68] [c67] [c66] [j8] [j7] [j6] [c60] [c56] [j5] [c46] [c45] [c43] [c40] [c39] [c38] [c37] [c35] [c34] [c33] [c32] [c30] [c29] [c28] [c27] [c25] [c24] [c23] [c22] [c20] [c19] [c17] [c16] [c15] [c14]
[j46] [j45] [j44] [j43] [j42] [j41] [j40] [c212] [c211] [c210] [c209] [c208] [c207] [c206] [c204] [c203] [j39] [j38] [j37] [j34] [c198] [c197] [c196] [c194] [c193] [c191] [c190] [c189] [c188] [c187] [c186] [c185] [c184] [c182] [j31] [j30] [c180] [c179] [c176] [c175] [c174] [c172] [c171] [c169] [c168] [c167] [c166] [c165] [c163] [c162] [c161] [c160] [c157] [j28] [j27] [j26] [j25] [c156] [c155] [c154] [c152] [c151] [c150] [c149] [c148] [c147] [c146] [c145] [c144] [c143] [c142] [c141] [c140] [c139] [c137] [c136] [c135] [j24] [j23] [j22] [j21] [j20] [c129] [c127] [c126] [c125] [c123] [c122] [c121] [c120] [c119] [c118] [c117] [c116] [c115] [c114] [c112] [c111] [c110] [c108] [c107] [j19] [j18] [c106] [c105] [c104] [c103] [c102] [c101] [c100] [c99] [c97] [c96] [c95] [c94] [c93] [c92] [c91] [c90] [c89] [c88] [c87] [c86] [c85] [c83] [c82] [c81] [c80] [c79] [c78] [c77] [c76] [c75] [c74] [c73] [c72] [c71] [c70] [c69] [c66] [c65] [c64] [c63] [c58] [c57] [c54] [c51]
data released under the ODC-BY 1.0 license. See also our legal information page
last updated on 2013-06-19 21:57 CEST by the dblp team



