1. MLMI 2004:
Martigny, Switzerland
Samy Bengio, Hervé Bourlard (Eds.):
Machine Learning for Multimodal Interaction, First International Workshop,MLMI 2004, Martigny, Switzerland, June 21-23, 2004, Revised Selected Papers.
Lecture Notes in Computer Science 3361 Springer 2004, ISBN 3-540-24509-X
HCI and Applications
Structuring and Interaction
- Iain McCowan, Daniel Gatica-Perez, Samy Bengio, Darren Moore, Hervé Bourlard:
Towards Computer Understanding of Human Interactions.
56-75

- Alfred Dielmann, Steve Renals:
Multistream Dynamic Bayesian Network for Meeting Segmentation.
76-86

- Denis Lalanne, Rolf Ingold, Didier von Rotz, Ardhendu Behera, Dalila Mekhaldi, Andrei Popescu-Belis:
Using Static Documents as Structured and Thematic Interfaces to Multimedia Meeting Archives.
87-100

- Nicolas Moënne-Loccoz, Bruno Janvier, Stéphane Marchand-Maillet, Eric Bruno:
An Integrated Framework for the Management of Video Collection.
101-110

- Jean Carletta, Jonathan Kilgour:
The NITE XML Toolkit Meets the ICSI Meeting Corpus: Import, Annotation, and Browsing.
111-121

Multimodal Processing
- Nuria Oliver, Eric Horvitz:
S-SEER: Selective Perception in a Multimodal Office Activity Recognition System.
122-135

- Tue Lehn-Schiøler, Lars Kai Hansen, Jan Larsen:
Mapping from Speech to Images Using Continuous State Space Models.
136-145

- Ofer Dekel, Joseph Keshet, Yoram Singer:
An Online Algorithm for Hierarchical Phoneme Classification.
146-158

- Norman Poh, Samy Bengio:
Towards Predicting Optimal Fusion Candidates: A Case Study on Biometric Authentication Tasks.
159-172

- Julien Meynet, Vlad Popovici, Jean-Philippe Thiran:
Mixture of SVMs for Face Class Modeling.
173-181

- Guillaume Lathoud, Jean-Marc Odobez, Daniel Gatica-Perez:
AV16.3: An Audio-Visual Corpus for Speaker Localization and Tracking.
182-195

Speech Processing
- Chuck Wooters, Nikki Mirghafori, Andreas Stolcke, Tuomo W. Pirinen, Ivan Bulyko, David Gelbart, Martin Graciarena, Scott Otterson, Barbara Peskin, Mari Ostendorf:
The 2004 ICSI-SRI-UW Meeting Recognition System.
196-208

- Mathew Magimai-Doss, Hervé Bourlard:
On the Adequacy of Baseform Pronunciations and Pronunciation Variants.
209-222

- Qifeng Zhu, Barry Y. Chen, Nelson Morgan, Andreas Stolcke:
Tandem Connectionist Feature Extraction for Conversational Speech Recognition.
223-231

- Barry Y. Chen, Qifeng Zhu, Nelson Morgan:
Long-Term Temporal Features for Conversational Speech Recognition.
232-242

- Hagai Aronowitz, David Burshtein, Amihood Amir:
Speaker Indexing in Audio Archives Using Gaussian Mixture Scoring Simulation.
243-252

- Mikko Kurimo, Ville T. Turunen, Inger Ekman:
Speech Transcription and Spoken Document Retrieval in Finnish.
253-262

- Harald Romsdorfer, Beat Pfister, René Beutler:
A Mixed-Lingual Phonological Component Which Drives the Statistical Prosody Control of a Polyglot TTS Synthesis System.
263-276

Dialogue Management
Vision and Emotion
- Roddy Cowie, Marc Schröder:
Piecing Together the Emotion Jigsaw.
305-317

- T. Balomenos, Amaryllis Raouzaiou, Spiros Ioannou, Athanasios I. Drosopoulos, Kostas Karpouzis, Stefanos D. Kollias:
Emotion Analysis in Man-Machine Interaction Systems.
318-328

- Philipp Zehnder, Esther Koller-Meier, Luc J. Van Gool:
A Hierarchical System for Recognition, Tracking and Pose Estimation.
329-340

- Santiago Venegas-Martinez, Gianluca Antonini, Jean-Philippe Thiran, Michel Bierlaire:
Automatic Pedestrian Tracking Using Discrete Choice Models and Image Correlation Techniques.
341-348

- Mihai Osian, Tinne Tuytelaars, Luc J. Van Gool:
A Shape Based, Viewpoint Invariant Local Descriptor.
349-359

Last update Fri May 24 19:37:59 2013
CET by the DBLP Team —
Data released under the ODC-BY 1.0 license — See also our legal information page