ICME 2003:
Baltimore, MD, USA
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, ICME 2003, 6-9 July 2003, Baltimore, MD, USA.
IEEE 2003/2004, ISBN 0-7803-7965-9
Volume 1
Networked Video I
- Thinh P. Q. Nguyen, Puneet Mehra, Avideh Zakhor:
Path diversity and bandwidth allocation for multimedia streaming.
1-4

- Susie Wee, John G. Apostolopoulos, Wai-tian Tan, Sumit Roy:
Research and design of a mobile streaming media content delivery network.
5-8

- Jacob Chakareski, Eric Setton, Yi J. Liang, Bernd Girod:
Video streaming with diversity.
9-12

- Marco Fumagalli, Phoom Sagetong, Antonio Ortega:
Estimation of erased data in a H.263 coded stream by using unbalanced multiple description coding.
13-16

- Amy R. Reibman, Vinay A. Vaishampayan:
Quality monitoring for compressed video subjected to packet loss.
17-20

Automatic Indexing
Multimodal Interfaces
- Yeow Kee Tan, Nasser Sherkat, Tony Allen:
Eye gaze and speech for data entry: a comparison of different data entry methods.
41-44

- Yasuhito Sawahata, Kiyoharu Aizawa:
Wearable imaging system for summarizing personal experiences.
45-48

- Timothy T. H. Chen, Sidney Fels, Saehee Sarah Min:
FlowField and beyond: applying pressure-sensitive multi-point touchpad interaction.
49-52

- Xin Fan, Xing Xie, Wei-Ying Ma, Hong-Jiang Zhang, He-Qin Zhou:
Visual attention based image browsing on mobile devices.
53-56

- Björn Schuller, Martin Zobl, Gerhard Rigoll, Manfred K. Lang:
A hybrid music retrieval system using belief networks to integrate multimodal queries and contextual knowledge.
57-60

Speech and Audio Processing I
- Hsuan-Huei Shih, Shrikanth S. Narayanan, C.-C. Jay Kuo:
A statistical multidimensional humming transcription using phone level hidden Markov models for query by humming systems.
61-64

- Rongshan Yu, Xiao Lin, Susanto Rahardja, Chi Chung Ko:
A fine granular scalable perceptually lossy and lossless audio coder.
65-68

- Simon Lucey, Tsuhan Chen:
An investigation into subspace rapid speaker adaptation for verification.
69-72

- Manuel J. Reyes Gomez, Daniel P. W. Ellis:
Selection, parameter estimation, and discriminative training of hidden Markov models for general audio modeling.
73-76

- Chih-Kai Yang, Sou-Gee Chen:
New static and dynamic search algorithms for fast MP3 bit allocations.
77-80

Image Processing I
- Yongmin Li, Li-Qun Xu, Geoff Morrison, Charles Nightingale, Jason Morphett:
Robust panorama from MPEG video.
81-84

- Jun-Wei Hsieh:
Fast stitching algorithm for moving object detection and mosaic construction.
85-88

- Zhang John Chen, Jagath Samarabandu:
Planar region depth filling using edge detection with embedded confidence technique and Hough transform.
89-92

- S. H. Srinivasan, Mohan S. Kankanhalli:
Wide baseline spectral matching.
93-96

- Wei-Qi Yan, Mohan S. Kankanhalli:
Colorizing infrared home videos.
97-100

- Hasan F. Ates, Michael T. Orchard:
Image interpolation using wavelet-based contour estimation.
101-104

- Andy Chang, Oscar C. Au, Yick Ming Yeung:
A novel approach to fast multi-block motion estimation for H.264 video coding.
105-108

- Gulcin Caner, A. Murat Tekalp, Wendi B. Heinzelman:
Super resolution recovery for multi-camera surveillance imaging.
109-112

- Yu Hen Hu, Rajas A. Sambhare:
Constrained texture synthesis for image post processing.
113-116

Multimedia Architectures and Implementation
- Nikolaos Bellas, Malcolm Dwyer:
A programmable, high performance vector array unit used for real-time motion estimation.
117-120

- Tay-Jyi Lin, Chin-Chi Chang, Tsung-Hsun Yang, Yu-Ming Chang, Chien-Hung Lin, Chen-Chia Lee, Hung-Yueh Lin, Chein-Wei Jen:
Performance evaluation of ring-structure register file in multimedia applications.
121-124

- Tay-Jyi Lin, Tsung-Hsun Yang, Chein-Wei Jen:
Coefficient optimization for area-effective multiplier-less FIR filters.
125-128

- Satoshi Nishiguchi, Kazuhide Higashi, Yoshinari Kameda, Michihiko Minoh:
A sensor-fusion method for detecting a speaking student.
129-132

- Tsung-Han Tsai, Wen-Cheng Chen, Chun-Nan Liu:
A low power VLSI implementation for variable length decoder in MPEG-1 layer III.
133-136

- Hung-Chi Fang, Tu-Chih Wang, Yu-Wei Chang, Ya-Yun Shih, Liang-Gee Chen:
Novel word-level algorithm of embedded block coding in JPEG 2000.
137-140

- Jongmyon Kim, D. Scott Wills:
Quantized color instruction set for media-on-demand applications.
141-144

- Michelle Yan, James Shaw, Vahid Khamsi, Shih-Ping Liou:
Tracking and presenting user attention for collaborative browsing using heterogeneous devices.
145-148

- Shinsuke Kobayashi, Kentaro Mita, Yoshinori Takeuchi, Masaharu Imai:
Rapid prototyping of JPEG encoder using the ASIP development system: PEAS-III.
149-152

Text, Graphics, Face, Scene, and Song Recognition
- Ioannis Andreou, Nikitas M. Sgouros:
Sketch creation utilizing shape matching techniques.
153-156

- Michael H. Lee, Surya Nepal, Uma Srinivasan:
Edge-based semantic classification of sports video sequences.
157-160

- Gees C. Stein, Jens Rittscher, Anthony Hoogs:
Enabling video annotation using a semantic database extended with visual knowledge.
161-164

- Hidehisa Nagano, Kunio Kashino, Hiroshi Murase:
A fast search algorithm for background music signals based on the search for numerous small signal components.
165-168

- Ahmet Ekin, A. Murat Tekalp:
Generic play-break event detection for summarization and hierarchical sports video analysis.
169-172

- Amit Chakraborty, Peiya Liu, Liang H. Hsu:
Extracting anchorable information units from PDF files.
173-176

- Lijun Yin, Sergey Royt, Matt T. Yourst, Anup Basu:
Recognizing facial expressions using active textures with wrinkles.
177-180

- Francis K. H. Quek, Yingen Xiong:
Oscillatory gestures and discourse.
181-184

Networked Video II
Multimedia Security and Content Protection I
- H. Vicky Zhao, Min Wu, Z. Jane Wang, K. J. Ray Liu:
Performance of detection statistics under collusion attacks on independent multimedia fingerprints.
205-208

- Alexia Giannoula, Anastasios Tefas, Nikos Nikolaidis, Ioannis Pitas:
Improving the detection reliability of correlation-based watermarking techniques.
209-212

- Ming Sun Fu, Oscar C. Au:
A multi-bit robust watermark for halftone images.
213-216

- Nedeljko Cvejic, Djordje Tujkovic, Tapio Seppänen:
Increasing robustness of an audio watermark using turbo codes.
217-220

- Jonathan Foote, John Adcock, Andreas Girgensohn:
Time base modulation: a new approach to watermarking audio.
221-224

Virtual Reality and Imaging I
Authentication and Recognition
Wireless Multimedia Techniques
- Wei Wang, Michael R. Lyu:
Automatic generation of dubbing video slides for mobile wireless environment.
265-268

- Surya Nepal, Uma Srinivasan:
Adaptive video highlights for wired and wireless platforms.
269-272

- Dirk Trossen, Hemant H. Chaskar:
Enabling user-tailored MMS delivery in heterogeneous access scenarios.
273-276

- Shengjie Zhao, Zixiang Xiong, Xiaodong Wang:
Optimal resource allocation for wireless video over CDMA networks.
277-280

- Amol Bhatkar, Rajarathnam Chandramouli, Narayanan Vijaykrishnan, Mary Jane Irwin:
Computation and transmission energy modeling through profiling for MPEG4 video transmission.
281-284

- Wen Xu, Sheila S. Hemami:
Delay-optimized robust transmission of images over multiple channels.
285-288

- Wanghong Yuan, Klara Nahrstedt:
Buffering approach for energy saving in video sensors.
289-292

- Jiancong Chen, S.-H. Gary Chan, Qian Zhang, Wenwu Zhu, Jin Chen:
A distributed power adaptation algorithm for multimedia delivery over ad hoc networks.
293-296

Content-based Retrieval
- Jieh Hsiang, Wen-Jun Liu, Bee-Chung Chen, Hsieh-Chang Tu:
Multidimensional interactive fine-grained image retrieval.
297-300

- Jürgen Assfalg, Alberto Del Bimbo, Pietro Pala:
Curvature maps for 3D CBR.
301-304

- Xiangdong Zhou, Qi Zhang, Lan Lin, Ailin Deng, Gang Wu:
Image retrieval by fuzzy clustering of relevance feedback records.
305-308

- Jun Gao, George Tzanetakis, Peter Steenkiste:
Content-based retrieval of music in scalable peer-to-peer networks.
309-312

- Lei Zhang, Fang Qian, Mingjing Li, Hong-Jiang Zhang:
An efficient memorization scheme for relevance feedback in image retrieval.
313-316

- Yuxin Peng, Chong-Wah Ngo, Qing-Jie Dong, Zong-Ming Guo, Jianguo Xiao:
Video clip retrieval by maximal matching and optimal matching in graph theory.
317-320

- Xin Huang, Shu-Ching Chen, Mei-Ling Shyu:
Incorporating real-valued multiple instance learning into relevance feedback for image retrieval.
321-324

- Ming Hong Pi, Mrinal K. Mandal, Anup Basu:
Image retrieval based on 2-D histogram of fractal parameters.
325-328

- Giridharan Iyengar, Harriet J. Nock, Chalapathy Neti:
Audio-visual synchrony for detection of monologues in video archives.
329-332

- Min Xu, Ling-Yu Duan, Changsheng Xu, Qi Tian:
A fusion scheme of visual and auditory modalities for event detection in sports video.
333-336

Image Processing II
- Ching-Yeh Chen, Shao-Yi Chien, Yi-Hau Chen, Yu-Wen Huang, Liang-Gee Chen:
Unsupervised object-based sprite coding system for tennis sport.
337-340

- Armando J. Pinho, António J. R. Neves:
Block-based histogram packing of color-quantized images.
341-344

- Nejat Kamaci, Yucel Altunbasak:
Performance comparison of the emerging H.264 video coding standard with the existing standards.
345-348

- Xiaodong Gu, Hong-Jiang Zhang:
Implementing dynamic GOP in video encoding.
349-352

- Yung-Gi Wu, Ming-Zhi Huang, Yu-Ling Wen:
Fractal image compression with variance and mean.
353-356

- Martin P. Boliek, Gene K. Wu:
JPEG 2000-like access using the JPM compound document file format.
357-360

- Shou-Yi Tseng:
Efficient motion estimation algorithm using run-time and distortion optimization approach.
361-364

- Liang Zhang:
Statistical model for intensity differences of corresponding points between stereo image pairs.
365-368

- Yuhua Ding, George J. Vachtsevanos, Anthony J. Yezzi Jr., Wayne Daley, Bonnie S. Heck-Ferri:
A real-time curve evolution-based image fusion algorithm for multisensory image segmentation.
369-372

- Bernd Girod, Chuo-Ling Chang, Prashant Ramanathan, Xiaoqing Zhu:
Light field compression using disparity-compensated lifting.
373-376

Speech Coding, Analysis, and Synthesis
- Christian H. Ritz, Ian S. Burnett, Jason Lukasiak:
Low bit rate wideband WI speech coding.
377-380

- Houman Zarrinkoub, Paul Mermelstein:
Joint optimization of short-term and long-term predictors in CELP speech coders.
381-384

- Om Deshmukh, Carol Y. Espy-Wilson:
A measure of aperiodicity and periodicity in speech.
385-388

- K. Sreenivasa Rao, B. Yegnanarayana:
Prosodic manipulation using instants of significant excitation.
389-392

- Arun Kumar, Ashish Verma:
Using phone and diphone based acoustic models for voice conversion: a step towards creating voice fonts.
393-396

- Xiaodong He, Wu Chou:
minimum classification error linear regression for acoustic model adaptation of continuous density HMMS.
397-400

- Björn Schuller, Gerhard Rigoll, Manfred K. Lang:
Hidden Markov model-based speech emotion recognition.
401-404

- Dong Wang, Lie Lu, Hong-Jiang Zhang:
Speech segmentation without speech recognition.
405-408

- Julien Pinquier, Jean-Luc Rouas, Régine André-Obrecht:
A fusion study in speech / music classification.
409-412

Multimedia Technology for Gaming
Multimedia Learning
- Raghavendra Singh, Ravi Kothari:
Relevance feedback algorithm based on learning from labeled and unlabeled data.
433-436

- Milind R. Naphade, Ching-Yung Lin, Apostol Natsev, Belle L. Tseng, John R. Smith:
A framework for moderate vocabulary semantic visual concept detection.
437-440

- Shinsuke Nakajima, Shinichi Kinoshita, Katsumi Tanaka:
Amplifying the differences between your positive samples and neighbors in image retrieval.
441-444

- Apostol Natsev, John R. Smith:
Active selection for multi-example querying by content.
445-448

- Tzvetanka I. Ianeva, Arjen P. de Vries, Hein Röhrig:
Detecting cartoons: a case study in automatic video-genre classification.
449-452

QoS
- Wuttipong Kumwilaisak, Qian Zhang, Wenwu Zhu, C.-C. Jay Kuo, Ya-Qin Zhang:
On the rate constraint of transmitting multiple priority classes with QoS.
453-456

- Bo Shen:
Meta-caching and meta-transcoding for server-side service proxy.
457-460

- Sheau-Ru Tong, Chun-Cheng Chang:
Harmonic DiffServ: provisioning scalable heterogeneous-QoS multicast in DiffServ networks.
461-464

- Rajeev Kumar:
A protocol with transcoding to support QoS over Internet for multimedia traffic.
465-468

- Nam Pham Ngoc, Gauthier Lafruit, Jean-Yves Mignolet, Serge Vernalde, Geert Deconinck, Rudy Lauwereins:
A framework for mapping scalable networked applications on run-time reconfigurable platforms.
469-472

Image/Video Rendering/Synthesis
- Pun-Mo Ho, Tien-Tsin Wong, Kwok-Hung Choy, Chi-Sing Leung:
PCA-based compression for image-based relighting.
473-476

- Amit A. Kale, Amit K. Roy Chowdhury, Rama Chellappa:
Video based rendering of planar dynamic scenes.
477-480

- Sarah John, Mikhail A. Vorontsov:
Multiframe selective information fusion for 'looking through the woods'.
481-484

- Timothy K. Shih, Liang-Chen Lu, Ying-Hong Wang, Rong-Chi Chang:
Multi-resolution image inpainting.
485-488

- Zhanfeng Yue, Liang Zhao, Rama Chellappa:
View synthesis of articulating humans using visual hull.
489-492

Layered, Scalable & Multiple Descriptions Transmission
- Xiao Su, Rod Fatoohi:
Scalable coded image transmissions over peer-to-peer networks.
493-496

- Ji-An Zhao, Bo Li, Ishfaq Ahmad:
Traffic modeling for layered video.
497-500

- Lechang Cheng, Mabo Robert Ito:
Receiver-driven layered multicast using active networks.
501-504

- Chung-Ming Huang, Yuan-Tse Yu, Guo-Shiung Liau:
A statistical flow control mechanism for layered multimedia over the differentiated service network.
505-508

- Eric Setton, Yi J. Liang, Bernd Girod:
Adaptive multiple description video streaming over multiple channels with active probing.
509-512

- Ivan Lee, Ling Guan:
Centralized peer-to-peer streaming with layered video.
513-516

- Ali C. Begen, Yucel Altunbasak, Özlem Ergun:
Fast heuristics for multi-path selection for multiple description encoded video streaming.
517-520

- Bo Xie, Wenjun Zeng:
Source characteristics based fast bitstream switching.
521-524

- Augustin Gavrilescu, Adrian Munteanu, Peter Schelkens, Jan Cornelis:
Embedded multiple description scalar quantizers for progressive image transmission.
525-528

Image Compression
- Mylene C. Q. Farias, Sanjit K. Mitra, John M. Foley:
Perceptual contributions of blocky, blurry and noisy artifacts to overall annoyance.
529-532

- Jingdong Wang, Jianguo Lee, Changshui Zhang:
Kernel GMM and its application to image binarization.
533-536

- Rastislav Lukac, Bogdan Smolka, Konstantinos N. Plataniotis, Anastasios N. Venetsanopoulos:
Generalized adaptive vector sigma filters.
537-540

- Yao Nie, Kenneth E. Barner:
Optimized fuzzy transformation for image deblocking.
541-544

- Ee Ping Ong, Weisi Lin, Zhongkang Lu, Susu Yao, Xiaokang Yang, Lijun Jiang:
No-reference JPEG-2000 image quality metric.
545-548

- Giuseppe Messina, Alfio Castorina, Sebastiano Battiato, Angelo Bosco:
Image quality improvement by adaptive exposure correction techniques.
549-552

- Giovanni Motta, Francesco Rizzo, James A. Storer:
Partitioned vector quantization: application to lossless compression of hyperspectral images.
553-556

- Daewon Kim, Daekyu Shin:
Energy-based adaptive DCT/IDCT for video coding.
557-560

- Lorenzo Granai, Fulvio Moschetti, Pierre Vandergheynst:
Ridgelet transform applied to motion compensated images.
561-564

Coding and Noise Removal
- Phil Spencer Whitehead, David V. Anderson, Mark A. Clements:
Adaptive, acoustic noise suppression for speech enhancement.
565-568

- Ashish Jagmohan, Anshul Sehgal, Narendra Ahuja:
WYZE-PMD based multiple description video codec.
569-572

- Nualsawat Hiransakolwong, Kien A. Hua, Khanh Vu, Piotr S. Windyga:
Segmentation of ultrasound liver images: an automatic approach.
573-576

- Nuwan D. Nanayakkara, Jagath Samarabandu:
Unsupervised model based image segmentation using domain knowledge based fuzzy logic and edge enhancement.
577-580

- Zhengguo Li, Feng Pan, Keng Pang Lim, Genan Feng, Xiao Lin, Susanto Rahardja, Dajun Wu:
Adaptive frame layer rate control for H.264.
581-584

- Bogdan Smolka, Konstantinos N. Plataniotis, Rastislav Lukac, Anastasios N. Venetsanopoulos:
Similarity based impulsive noise removal in color images.
585-588

- Siva Somasundaram, Koduvayur P. Subbalakshmi:
3-D multiple description video coding for packet switched networks.
589-592

- Xu Huang, Allan C. Madoc, Andrew D. Cheetham:
Wavelet-based Bayesian estimator for Poisson noise removal from images.
593-596

- Hideaki Kimata, Masaki Kitahara, Yoshiyuki Yashima:
3D motion vector coding with block base adaptive interpolation filter on H.264.
597-600

- Ligang Lu, Vadim Sheinin:
Real-time MPEG video coding with information look-ahead.
601-604

Watermarking and Fingerprinting
- Micheal Mullarkey, Neil J. Hurley, Guenole C. M. Silvestre, Teddy Furon:
Application of side-informed embedding and polynomial detection to audio watermarking.
605-608

- Ming Sun Fu, Oscar C. Au:
A novel method to embed watermark in different halftone images: data hiding by conjugate error diffusion (DHCED).
609-612

- Hong Zhao, Min Wu, Z. Jane Wang, K. J. Ray Liu:
Nonlinear collusion attacks on independent fingerprints for multimedia.
613-616

- Z. Jane Wang, Min Wu, Hong Zhao, K. J. Ray Liu, Wade Trappe:
Resistance of orthogonal Gaussian fingerprints to collusion attacks.
617-620

- Jeffrey A. Bloom:
Security and rights management in digital cinema.
621-624

- Anandabrata Pal, Kulesh Shanmugasundaram, Nasir D. Memon:
Automated reassembly of fragmented images.
625-628

- Kaliappan Gopalan:
Audio steganography using bit modification.
629-632

- Heather Yu:
Scalable encryption for multimedia content access control.
633-636

- Andreas Kalivas, Anastasios Tefas, Ioannis Pitas:
Watermarking of 3D models using principal component analysis.
637-640

Video Processing for Multi-Camera Surveillance Systems
- Matteo Gandetto, Luca Marchesotti, S. Sciutto, D. Negroni, Carlo S. Regazzoni:
From multi-sensor surveillance towards smart interactive spaces.
641-644

- Ser-Nam Lim, Ahmed M. Elgammal, Larry S. Davis:
Image-based pan-tilt camera control in a multi-camera surveillance environment.
645-648

- Omar Javed, Zeeshan Rasheed, Orkun Alatas, Mubarak Shah:
KNIGHT™: a real time surveillance system for multiple and non-overlapping cameras.
649-652

- Fatih Porikli, Ajay Divakaran:
Multi-camera calibration, object tracking and query generation.
653-656

- Karsten Müller, Aljoscha Smolic, Michael Drose, Patrick Voigt, Thomas Wiegand:
Multi-texture modeling of 3D traffic scenes.
657-660

Wireless Multimedia I
- Jianping Hua, Zixiang Xiong:
Optimal rate allocation in progressive joint source-channel coding for image transmission over CDMA networks.
661-664

- Jie Chen:
Fast hopping OFDM and packet-awareness coder design for wireless multimedia delivery.
665-668

- Xiaofeng Xu, Mihaela van der Schaar, Santhana Krishnamachari, Sunghyun Choi, Yao Wang:
Adaptive error control for fine-granular-scalability video coding over IEEE 802.11 wireless LANs.
669-672

- Shirish S. Karande, Syed A. Khayam, Michael Krappel, Hayder Radha:
Analysis and modeling of errors at the 802.11b link layer.
673-676

- Yong Sun, Zixiang Xiong, Xiaodong Wang:
Iterative decoding of differentially space-time coded multiple descriptions of images.
677-680

Multimedia Hardware and Architectures
- Sebastiano Battiato, Alfio Castorina, Mirko Guarnera, Filippo Vella:
A light viewfinder pipeline for consumer devices application.
681-684

- Minseok Song, Heonshik Shin:
Minimization of buffer requirements using variable-size parity groups for fault-tolerant video servers.
685-688

- Chunjiang J. Duanmu, M. Omair Ahmad, M. N. S. Swamy:
8-bit partial sums of 16 luminance values for fast block motion estimation.
689-692

- Yu-Wen Huang, To-Wei Chen, Bing-Yu Hsieh, Tu-Chih Wang, Te-Hao Chang, Liang-Gee Chen:
Architecture design for deblocking filter in H.264/JVT/AVC.
693-696

- Xinjian Chen, Qionghai Dai:
A novel VLSI architecture for multidimensional discrete wavelet transform.
697-700

Novel Applications
- Juan Carlos Guerri, Carlos E. Palau, Ana Pajares, Angela Belda, Juan José Cermeño, Manuel Esteve:
A multimedia telemedicine system to assess musculoskeletal disorders.
701-704

- Shu-Ching Chen, Keqi Zhang, Min Chen:
A real-time 3D animation environment for storm surge.
705-708

- Panu Hämäläinen, Marko Hännikäinen, Timo D. Hämäläinen, Riku Soininen:
Offline architecture for real-time betting.
709-712

- Chung-Sheng Li, Charu C. Aggarwal, Murray Campbell, Yuan-Chi Chang, Gregory Glass, Vijay S. Iyengar, Mahesh Joshi, Ching-Yung Lin, Milind R. Naphade, John R. Smith, Belle L. Tseng, Min Wang, Kun-Lung Wu, Philip S. Yu:
Epi-SPIRE: a system for environmental and public health activity monitoring.
713-716

- John V. Harrison, Anna Andrusiewicz:
Enhancing digital advertising using dynamically configurable multimedia.
717-720

Speech and Audio Processing II
- Sunil Bharitkar, Philip Hilmes, Chris Kyriakakis:
Sensitivity of multichannel room equalization to listener position.
721-724

- Sascha Spors, Achim Kuntz, Rudolf Rabenstein:
Listening room compensation for wave field synthesis.
725-728

- Kenzo Obata, Kentaro Noguchi, Yoshiaki Todokoro:
A new sound source location algorithm based on formant frequency for sound image localization.
729-732

- Arvindh Krishnaswamy, Julius O. Smith:
Inferring control inputs to an acoustic violin from audio spectra.
733-736

- Yong Rui, Dinei A. F. Florêncio:
New direct approaches to robust sound source localization.
737-740

- Parham Aarabi, Guangji Shi, Omid S. Jahromi:
Robust speech separation using time-frequency masking.
741-744

- Zhe Feng, Yaqian Zhou, Lide Wu, Zongge Li:
Audio classification based on maximum entropy model.
745-748

- Kuntal Sengupta, Prabir Burman:
Non-parametric approach to ICA using kernel density estimation.
749-752

- Jean-Luc Rouas, Jérôme Farinas, François Pellegrino, Régine André-Obrecht:
Modeling prosody for language identification on read and spontaneous speech.
753-756

Multimedia Indexing
- Yimin Wu, Aidong Zhang:
An adaptive classification method for multimedia retrieval.
757-760

- Janghyun Yoon, Nikil Jayant:
Semantics-sensitive image retrieval: an information fusion approach.
761-764

- Anlei Dong, Bir Bhanu:
Concept learning and transplantation for dynamic image databases.
765-768

- Paisarn Muneesawang, Ling Guan:
Image retrieval with embedded sub-class information using Gaussian mixture models.
769-772

- Jeroen Vendrig, Marcel Worring, Arnold W. M. Smeulders:
Components and systems for interactive video indexing.
773-776

- Andrea Kutics, Akihiko Nakagawa, Kiyotaka Tanaka, Minoru Yamada, Yasuo Sambe, Sakuichi Ohtsuka:
Linking images and keywords for semantics-based image retrieval.
777-780

- Alejandro Jaimes, John R. Smith:
Semi-automatic, data-driven construction of multimedia ontologies.
781-784

- Keiji Yanai:
Image collector II: a system for gathering more than one thousand images from the Web for one keyword.
785-788

QoS and Broadcasts
- Corina Scheiter, Rainer Steffen, Markus Zeller, Rudi Knorr, Benno Stabernack, Kai-Immo Wels:
A system for QOS-enabled MPEG-4 video transmission over Bluetooth for mobile applications.
789-792

- Chin-Hei Chien, Wanjiun Liao:
A self-configuring RED gateway for quality of service (QoS) networks.
793-796

- Jia Zhang, Jen-Yao Chung, Zhixing Zhang:
A router model for QoS-based multimedia Web services.
797-800

- Hong Kee Sul, Hyunchul Kim, Kilnam Chon:
A hybrid pagoda broadcasting protocol: fixed-delay pagoda broadcasting protocol with partial preloading.
801-804

- Nera W. C. Liu, Jack Y. B. Lee:
Constrained consonant broadcasting - a generalized periodic broadcasting scheme for large scale video streaming.
805-808

- Yeonjoon Chung, Ahmed H. Tewfik:
An efficient video broadcasting protocol with scalable preloading scheme.
809-812

- Virgilio Rodriguez:
Resource management for scalably encoded information: the case of image transmission over wireless networks.
813-816

- Chow-Sing Lin, Tzong-Yao Chang, Jin-Ru Hsieh:
On utilizing multi-channel to provide scheduled video delivery.
817-820

- Deepak S. Turaga, Mihaela van der Schaar:
Content-adaptive filtering in the UMCTF framework.
821-824

- Zhizhong Zhe, Hong Ren Wu, Zhenghua Yu, Tim Ferguson, Damian M. Tan:
Performance evaluation of a perceptual ringing distortion metric for digital video.
825-828

Signal Processing Theory and Methods I
- Abdessamad Ben Hamza, Hamid Krim, Bilge Karaçali:
Structural risk minimization using nearest neighbor rule.
829-832

- Hamid Reza Abutalebi, Hamid Sheikhzadeh, Robert L. Brennan, George H. Freeman:
Affine projection algorithm for oversampled subband adaptive filters.
833-836

- Mohammad Bilal Malik:
State-space RLS.
837-840

- Behrouz Nowrouzian, Arthur T. G. Fuller, M. N. S. Swamy:
A necessary and sufficient condition for the BIBO stability of general-order bode-type variable-amplitude wave-digital equalizers.
841-844

- Zhong Ji, Shuren Qi:
Detection of EEG basic rhythm feature by using band relative intensity ratio(BRIR).
845-848

- Kamyar Hazaveh, Kaamran Raahemifar:
Optimized local discriminant basis algorithm.
849-852

- Palghat P. Vaidyanathan, Byung-Jun Yoon:
Discrete probability density estimation using multirate DSP models.
853-856

- Andre Tkacenko, Palghat P. Vaidyanathan:
On the least squares signal approximation model for overdecimated rational nonuniform filter banks and applications.
857-860

- J. Michael Peterson, Shubha Kadambe:
A probabilistic approach for blind source separation of underdetermined convolutive mixtures.
861-864

- Jie Liang, Lu Gan, Chengjie Tu, Trac D. Tran, Kai-Kuang Ma:
On efficient implementation of oversampled linear phase perfect reconstruction filter banks.
865-868

Volume 2
Smart Cameras
- Jörn Jachalsky, Martin Wahler, Peter Pirsch, S. Capperon, Winfried Gehrke, W. M. Kruijtzer, Antonio Núñez:
A core for ambient and mobile intelligent imaging applications.
1-4

- Wayne Wolf, Burak Ozer, Tiehan Lv:
Architectures for distributed smart cameras.
5-8

- Kohsia S. Huang, Mohan M. Trivedi:
Distributed video arrays for tracking, human identification, and activity analysis.
9-12

- John W. Fisher III, Trevor Darrell:
Learning cross-modal appearance models with application to tracking.
13-16

- Jacky Mallett, M. Michael Bove Jr.:
Eye Society.
17-20

Multimedia Retrieval
- Feng Jing, Mingjing Li, Hong-Jiang Zhang, Bo Zhang:
Support vector machines for region-based image retrieval.
21-24

- Charles Parker:
Towards intelligent string matching in query-by-humming systems.
25-28

- Wing Ho Leung, Tsuhan Chen:
Hierarchical matching for retrieval of hand-drawn sketches.
29-32

- Joo-Hwee Lim, Philippe Mulhem, Qi Tian:
Event-based home photo retrieval.
33-36

- Bo Feng, Qing Li, Jun Yang, Liu Wenyin, Jian Zhai:
Efficient database facilities for content-based Flash retrieval.
37-40

Network Adaptive Techniques
- Hui Cheng, Xi Min Zhang, Yun-Qing Shi, Anthony Vetro, Huifang Sun:
Rate allocation for FGS coded video using composite R-D analysis.
41-44

- Nicola Franchi, Marco Fumagalli, Rosa Lancini:
Optimised source and channel coding for video transmission over ADSL.
45-48

- Gene Cheung, Connie Chan:
Jointly optimal reference frame & quality of service selection for H.261 video coding over lossy networks.
49-52

- Ashwatha Matthur, Padmavathi Mundur:
Dynamic load balancing across mirrored multimedia servers.
53-56

- Hongliang Li, Guizhong Liu, Yongli Li, Zhongwei Zhang:
An effective burstiness estimation model for VBR video stream.
57-60

Multimedia Software and Architectures
Virtual Reality and Imaging II
- Kostas Karpouzis, Amaryllis Raouzaiou, Paraskevi K. Tzouveli, Spiros Ioannou, Stefanos D. Kollias:
MPEG-4: one multimedia standard to unite all.
81-84

- Takahito Kawanishi, Masaru Tsuchida, Shigeru Takagi, Hiroshi Murase:
Small cylindrical display for anthropomorphic agents.
85-88

- Hitoshi Kanda, Jun Ohya:
Efficient, realistic method for animating dynamic behaviors of 3D botanical trees.
89-92

- Wang Hee Lee, Kuntal Sengupta, Rajeev Sharma:
Augmented reality with occlusion rendering using background-foreground segmentation and trifocal tensors.
93-96

- Lijun Jiang, Shiqian Wu, Dajun Wu, Ee Ping Ong, Susanto Rahardja:
3D shape modeling by color phase stepping light projection.
97-100

- Angus M. K. Siu, Rynson W. H. Lau:
Relief occlusion-adaptive meshes for 3D imaging.
101-104

- Roberta L. Gomes, Guillermo de Jesús Hoyos-Rivera, Jean-Pierre Courtiat:
Collaborative virtual environments: going beyond virtual reality.
105-108

- Irene Cheng:
Efficient 3D object simplification and fragmented texture scaling for online visualization.
109-112

Robustness, Error Concealment and Loss Recovery
- Wenjun Zeng:
Spatial-temporal error concealment with side information for standard video codecs.
113-116

- Hyunjoo Kim, Sooyong Kang, Heonyoung Y. Yeom:
Node selection for a fault-tolerant streaming service on a peer-to-peer network.
117-120

- Thenghong H. Yeo, Wai Choong Wong, Dong-Yan Huang:
Soft decision unequal error protection scheme for MPEG advanced audio coding.
121-124

- Fan Zhai, Randall Berry, Thrasyvoulos N. Pappas, Aggelos K. Katsaggelos:
A rate-distortion optimized error control scheme for scalable video streaming over the Internet.
125-128

- Shirish S. Karande, Hayder Radha:
A new family of channel coding schemes for real-time visual communications.
129-132

- Gaurav Agarwal, Alwin Anbu, Aniruddha Sinha:
A fast algorithm to find the region-of-interest in the compressed MPEG domain.
133-136

- Chui Sian Ong, Klara Nahrstedt, Wanghong Yuan:
Quality of protection for mobile multimedia applications.
137-140

- Timothy K. Shih, Louis H. Lin, Jen-Shiun Chiang:
Progressive image transmission by adaptive interpolation.
141-144

- Wei-Ying Kung, Chang-Su Kim, C. C. Jay Kuo:
A spatial-domain error concealment method with edge recovery and selective directional interpolation.
145-148

- Pascal Bourdon, Bertrand Augereau, Christian Olivier, Christian Chatellier:
A PDE-based method for ringing artifact removal on grayscale and color JPEG2000 images.
149-152

Networked Multimedia
- Zhe Xiang, Qian Zhang, Wenwu Zhu, Zhensheng Zhang:
Replication strategies for peer-to-peer based multimedia distribution service.
153-156

- Amir Asif:
Multimedia learning objects for digital signal processing in communications.
157-160

- David S. Doermann, Arvind Karunanidhi, Niketu Parekh, M. A. Khan, S. Chen, Hasan Timucin Ozdemir, M. Miwa, Kuo Chu Lee:
Issues in the transmission, analysis, storage and retrieval of surveillance video.
161-164

- Tayeb Lemlouma, Nabil Layaïda:
Encoding multimedia presentations for user preferences and limited environments.
165-168

- Keng Pang Lim, Dajun Wu, Si Wu, Susanto Rahardja, Xiao Lin, Lijun Jiang, Rongshan Yu, Feng Pan, Zhengguo Li, Susu Yao, Genan Feng, Chi Chung Ko:
Video streaming on embedded devices through GPRS network.
169-172

- Qiang Ma, Katsumi Tanaka:
WebTelop: dynamic TV-content augmentation by using Web pages.
173-176

- Yasuhiko Watanabe, Kazuya Sono, Kazuya Yokomizo, Yoshihiro Okada:
Translation camera on mobile phone.
177-180

- Mihai M. Lazarescu, Svetha Venkatesh:
Using camera motion to identify types of American football plays.
181-184

Moving from Features to Semantics using Computational Media Aesthetics
Multimedia Security and Content Protection II
- Yan Sun, K. J. Ray Liu:
Multi-layer key management for secure multimedia multicast communications.
205-208

- Qibin Sun, Dajun He, Zhishou Zhang, Qi Tian:
A secure and robust approach to scalable video authentication.
209-212

- Dekun Zou, Chai Wah Wu, Guorong Xuan, Yun-Qing Shi:
A content-based image authentication system with lossless data hiding.
213-216

- Z. Jane Wang, Min Wu, Wade Trappe, K. J. Ray Liu:
Anti-collusion of group-oriented fingerprinting.
217-220

- Ankur Datta, Niels da Vitoria Lobo, John J. Leeson:
Novel feature vector for image authentication.
221-224

Source and Channel Coding
Image Coding and Enhancement
Video Analysis
- Xinguo Yu, Qi Tian, Kongwah Wan:
A novel ball detection framework for real soccer video.
265-268

- Ying Li, Yu-Fei Ma, Hong-Jiang Zhang:
Salient region detection and tracking in video.
269-272

- Xinguo Yu, Changsheng Xu, Qi Tian, Hon Wai Leong:
A ball tracking framework for broadcast soccer video.
273-276

- Rainer Lienhart, Luhong Liang, Alexander Kuranov:
A detector tree of boosted classifiers for real-time object detection and tracking.
277-280

- Min Xu, Namunu C. Maddage, Changsheng Xu, Mohan S. Kankanhalli, Qi Tian:
Creating audio keywords for event detection in soccer video.
281-284

- Shunsuke Kamijo, Masao Sakauchi:
Segmentation of vehicles and pedestrians in traffic scene by spatio-temporal Markov random field model.
285-288

- Alan Hanjalic:
Multimodal approach to measuring excitement in video.
289-292

- Rong Jin, Alexander G. Hauptmann:
Learning to identify video shots with people based on face detection.
293-296

- Yang Ran, Qinfen Zheng:
Multi moving people detection from binocular sequences.
297-300

- Zuzana Cernekova, Constantine Kotropoulos, Ioannis Pitas:
Video shot segmentation using singular value decomposition.
301-304

Multimedia Streaming Architecture
- Hai Jin, Dafu Deng:
HHMSM: a hierarchical hybrid multicast stream merging scheme for large-scale video-on-demand systems.
305-308

- Zhen Li, Guobin Shen, Shipeng Li, Edward J. Delp:
L-TFRC: an end-to-end congestion control mechanism for video streaming over the Internet.
309-312

- Chen-Lung Chan, Shih-Yu Huang, Jia-Shung Wang:
Cooperative cache framework for video streaming applications.
313-316

- Toufik Ahmed, Ahmed Mehaoua, Vincent Lecuire:
Streaming MPEG-4 audio visual objects using TCP-friendly rate control and unequal error protection.
317-320

- Longin Jan Latecki, Kishore Kulkarni, Jaiwant Mulik:
Better audio performance when video stream is monitored by TCP congestion control.
321-324

- Xuxian Jiang, Yu Dong, Dongyan Xu, Bharat K. Bhargava:
GnuStream: a P2P media streaming system prototype.
325-328

- Jun Guo, Peter G. Taylor, Moshe Zukerman, Sammy Chan, Kit-Sang Tang, Eric W. M. Wong:
On the efficient use of video-on-demand storage facility.
329-332

- Michael Harville, Michele Covell, Susie Wee:
An architecture for componentized, network-based media services.
333-336

Image Classification and Detection
- Sungju Youm, Woosaeng Kim:
Dynamic threshold method for scene change detection.
337-340

- Woosaeng Kim, Ji Yoon Kim:
Image classification using spatial relationship matrix based on color spatio-histogram.
341-344

- Xavier Gibert, Huiping Li, David S. Doermann:
Sports video classification using HMMS.
345-348

- Shaohua Kevin Zhou, Rama Chellappa, Baback Moghaddam:
Adaptive visual tracking and recognition using particle filters.
349-352

- Hwajeong Lee, Daehwan Kim, Daijin Kim, Sung Yang Bang:
Real-time automatic vehicle management system using vehicle tracking and car plate number identification.
353-356

- Junqiang Lan, Xinhua Zhuang:
Embedded SLCCA for wavelet image coding.
357-360

- Jian Zhou, Huai-Rong Shao, Chia Shen, Ming-Ting Sun:
FGS enhancement layer truncation with minimized intra-frame quality variation.
361-364

- Aya Aner-Wolf:
Determining a scene's atmosphere by film grammar rules.
365-368

- Mukesh A. Zaveri, Uday B. Desai, S. N. Merchant:
Tracking multiple maneuvering point targets using multiple filter bank in infrared image sequence.
369-372

Indexing, Segmentation, and Retrieval
- Paisarn Muneesawang, Ling Guan:
Automatic relevance feedback for video retrieval.
373-376

- Miki Haseyama, Isao Kondo:
2-D functional AR model for image identification.
377-380

- Chi-Man Pun:
Invariant content-based image retrieval by wavelet energy signatures.
381-384

- Jiqiang Song, Min Cai, Michael R. Lyu:
A robust statistic method for classifying color polarity of video text.
385-388

- Akisato Kimura, Kunio Kashino, Takayuki Kurozumi, Hiroshi Murase:
Dynamic-segmentation-based feature dimension reduction for quick audio/video searching.
389-392

- Miki Haseyama, Atsushi Matsumura:
A trainable retrieval system for cartoon character images.
393-396

- Sangoh Jeong, Chee Sun Won, Robert M. Gray:
Histogram-based image retrieval using Gauss mixture vector quantization.
397-400

- Qixiang Ye, Wen Gao, Wei Zeng:
Color image segmentation using density-based clustering.
401-404

Video Segmentation for Semantic Annotation and Transcoding
- Ba Tu Truong, Svetha Venkatesh, Chitra Dorai:
Identifying film takes for cinematic analysis.
405-408

- Nathalie Peyrard, Patrick Bouthemy:
Motion-based selection of relevant video segments for video summarisation.
409-412

- Winston H. Hsu, Shih-Fu Chang:
A statistical framework for fusing mid-level perceptual features in news story segmentation.
413-416

- Anthony Vetro, Tetsuji Haga, Kazuhiko Sumi, Huifang Sun:
Object-based coding for long-term archive of surveillance video.
417-420

- Marco Bertini, Rita Cucchiara, Alberto Del Bimbo, Andrea Prati:
Object and event detection for semantic annotation and transcoding.
421-424

Wireless Multimedia II
- Syed A. Khayam, Shirish S. Karande, Michael Krappel, Hayder Radha:
Cross-layer protocol design for real-time multimedia applications over 802.11 b networks.
425-428

- Fan Yang, Qian Zhang, Wenwu Zhu, Ya-Qin Zhang:
An end-to-end TCP-friendly streaming protocol for multimedia over wireless Internet.
429-432

- Zhijun Lei, Nicolas D. Georganas:
Rate adaptation transcoding for video streaming over wireless channels.
433-436

- Yong Pei, James W. Modestino:
Interactive video coding and transmission over wired-to-wireless IP networks using an edge proxy.
437-440

- Allen Miu, John G. Apostolopoulos, Wai-tian Tan, Mitchell D. Trott:
Low-latency wireless video over 802.11 networks using path diversity.
441-444

Multimedia Semantics
- John R. Smith, Milind R. Naphade, Apostol Natsev:
Multimedia semantic indexing using model vectors.
445-448

- Dinh Quoc Phung, Svetha Venkatesh, Chitra Dorai:
On the extraction of thematic and dramatic functions of content in educational videos.
449-452

- Brett Adams, Chitra Dorai, Svetha Venkatesh, Hung H. Bui:
Indexing narrative structure and semantics in motion pictures with a probabilistic framework.
453-456

- Jiebo Luo, Amit Singhal, Weiyu Zhu:
Natural object detection in outdoor scenes based on probabilistic spatial context models.
457-460

- Shinichi Takagi, Shinobu Hattori, Kazumasa Yokoyama, Akihisa Kodate, Hideyoshi Tominaga:
Sports video categorizing method using camera motion parameters.
461-464

Face, Body, and Audio-visual Analysis
- Yong Ma, Xiaoqing Ding:
Robust real-time face detection based on cost-sensitive AdaBoost method.
465-468

- Jonathan H. Connell, Norman Haas, Etienne Marcheret, Chalapathy Neti, Gerasimos Potamianos, Senem Velipasalar:
A real-time prototype for small-vocabulary audio-visual ASR.
469-472

- Mingkun Li, Dongge Li, Nevenka Dimitrova, Ishwar K. Sethi:
Audio-visual talking face detection.
473-476

- Chung-Lin Huang, Chia-Ying Chung:
A real-time model-based human motion analysis system.
477-480

- Petar S. Aleksic, Aggelos K. Katsaggelos:
Product HMMs for audio-visual continuous speech recognition using facial animation parameters.
481-484

Multimedia Security and Content Protection III
- Peter Hon-Wah Wong, Yick Ming Yeung, Oscar C. Au:
Capacity for JPEG2000-to-JPEG2000 images watermarking.
485-488

- Chun-Shien Lu:
Dual security-based image steganography.
489-492

- Yongdong Wu, Feng Bao, Changsheng Xu:
The security flaws in some authentication watermarking schemes.
493-496

- Huiping Guo, Nicolas D. Georganas:
Digital image watermarking for joint ownership verification without a trusted dealer.
497-500

- Feilong Liu, Yangsheng Wang:
An improved block dependent fragile image watermark.
501-504

- Gwenaël J. Doërr, Jean-Luc Dugelay:
New intra-video collusion attack using mosaicing.
505-508

- Shaohui Liu, Hongxun Yao, Wen Gao:
Neural network based steganalysis in still images.
509-512

- Serhat Erküçük, Sridhar Krishnan, Mehmet Zeytinoglu:
Robust audio watermarking using a chirp based technique.
513-516

Multimedia Distribution
- Stefano Gnavi, Marco Grangetto, Enrico Magli, Gabriella Olmo:
Comparison of rate allocation strategies for H.264 video transmission over wireless lossy correlated networks.
517-520

- Min-You Wu, Wei Shu:
Video distribution with edge stations and Wi-Fi delivery networks.
521-524

- Si Woong Jang, Yong Woon Park:
A dynamic multicasting policy based on proxy caching.
525-528

- Bahjat Qazzaz, Javier Moreno, Xiaoyuan Yang, Porfidio Hernández, Remo Suppi, Emilio Luque:
Admission control policies for video on demand brokers.
529-532

- Qiang Liu, Jenq-Neng Hwang:
A new congestion control algorithm for layered multicast in heterogeneous multimedia dissemination.
533-536

- Hugh Melvin, Liam Murphy:
An integrated NTP-RTCP solution to audio skew detection and compensation for VoIP applications.
537-540

- Hong Man, Yang Li:
Multi-stream video transport over DiffServ wireless LANS.
541-544

- Syed Irtiza Ali, Hayder Radha:
Hierarchical handoff schemes over wireless LAN/WAN networks for multimedia applications.
545-548

Image Compression and Modeling
- Takahiro Nakayama, Masahiro Konda, Koji Takeuchi, Koji Kotani, Tadahiro Ohmi:
Adaptive resolution vector quantization technique and basic codebook design method for compound image compression.
549-552

- Xingsong Hou, Guizhong Liu:
A wavelet packet image coding algorithm based on quadtree classification and UTCQ.
553-556

- Xiaopeng Fan, Yan Lu, Wen Gao:
A novel coefficient scanning scheme for directional spatial prediction-based image compression.
557-560

- Deepak S. Turaga, Mihaela van der Schaar:
Reduced complexity spatio-temporal scalable motion compensated wavelet video encoding.
561-564

- Yuxin Liu, Zhen Li, Paul Salama, Edward J. Delp:
A discussion of leaky prediction based scalable coding.
565-568

- Jean Cardinal:
Compression of side information.
569-572

- Feng Pan, Zhengguo Li, Keng Pang Lim, Dajun Wu, Rongshan Yu, Genan Feng:
An adaptive rate control algorithm for video coding over personal digital assistants (PDA).
573-576

- Geovanni Martinez:
Maximum-likelihood motion estimation of a human face.
577-580

- Mihaela van der Schaar, Deepak S. Turaga:
Unconstrained motion compensated temporal filtering (UMCTF) framework for wavelet video coding.
581-584

- Shan Suthaharan:
A perceptually significant block-edge impairment metric for digital video coding.
585-588

Signal Processing Theory and Methods II
- Zheng Fang, Yingbo Hua:
Maximum likelihood method for blind identification of multiple autoregressive channels.
589-592

- Khim Sia Tan, Woon-Seng Gan, Jun Yang, Meng Hwa Er:
Constant beamwidth beamformer for difference frequency in parametric array.
593-596

- Omid S. Jahromi, Parham Aarabi:
Time delay estimation and signal reconstruction using multi-rate measurements.
597-600

- Yunnan Wu, Sun-Yuan Kung:
Detection for MIMO systems with imprecise channel knowledge.
601-604

- Xinying Zhang, Sun-Yuan Kung:
Capacity analysis for parallel and sequential MIMO equalizers.
605-608

- Timo Roman, Mihai Enescu, Visa Koivunen:
Time-domain method for tracking dispersive channels in MIMO OFDM systems.
609-612

- Frank Papenfuß, Yuri Artyukh, Eugene Boole, Dirk Timmermann:
Optimal sampling functions in nonuniform sampling driver designs to overcome the Nyquist limit.
613-616

- Pamornpol Jinachitra:
Constrained EM estimates for harmonic source separation.
617-620

- Khaled Amleh, Hongbin Li:
Blind code timing and carrier offset estimation for DS-CDMA systems.
621-624

- Mauricio M. Lara, Aldo G. Orozco-Lugo, Desmond C. McLernon, Hugo J. Muro-Lemus:
Blind recovery of multiple packets in ad hoc mobile networks using polynomial phase modulating sequences.
625-628

Multimedia Authoring and Presentation
Multimedia Streaming
Capturing and Indexing Multimedia Events and Content
- Werner Geyer, Heather Richter, Gregory D. Abowd:
Making multimedia meeting records more meaningful.
669-672

- Jiqiang Song, Michael R. Lyu, Jenq-Neng Hwang, Min Cai:
PVCAIS: a personal videoconference archive indexing system.
673-676

- Yoshinari Kameda, Satoshi Nishiguchi, Michihiko Minoh:
CARMUL: concurrent automatic recording for multimedia lecture.
677-680

- Nikolai Joukov, Tzi-cker Chiueh:
Lectern II: a multimedia lecture capturing and editing system.
681-684

- Avare Stewart, Patrick Wolf, Matthias Hemmje:
Media and metadata management for capture and access systems in electronic lecturing environments.
685-688

Image/Video Indexing and Retrieval
Speech and Audio Processing III
- Manu Mathew, Vasudha Bhat, Shine M. Thomas, Changhoon Yim:
Modified MP3 encoder using complex modified cosine transform.
709-712

- Björn Schuller, Gerhard Rigoll, Manfred K. Lang:
HMM-based music retrieval using stereophonic feature information and framelength adaptation.
713-716

- Aaron S. Master, Yi-Wen Liu:
Robust chirp parameter estimation for Hann windowed signals.
717-720

- Ting-Yao Wu, Lie Lu, Ke Chen, Hong-Jiang Zhang:
UBM-based incremental speaker adaptation.
721-724

- Cheng-Yuan Lin, Jyh-Shing Roger Jang:
New refinement schemes for voice conversion.
725-728

- Dong-Yan Huang, Ruihua Ma:
Integer fast modified cosine transform.
729-732

- Hadi Harb, Liming Chen:
Gender identification using a general audio classifier.
733-736

- Jouni Paulus, Anssi Klapuri:
Conventional and periodic N-grams in the transcription of drum sequences.
737-740

- Steven J. Rennie, Parham Aarabi, Trausti T. Kristjansson, Brendan J. Frey, Kannan Achan:
Robust variational speech separation using fewer microphones than speakers.
741-744

Video Processing for Multimedia Interaction
- Alexandre R. J. François, Eun-Young Elaine Kang:
A handheld mirror simulation.
745-748

- Jamey Graham, Jonathan J. Hull:
A paper-based interface for video browsing and retrieval.
749-752

- Frank M. Shipman III, Andreas Girgensohn, Lynn Wilcox:
Creating navigable multi-level video summaries.
753-756

- Lalitha Agnihotri, Nevenka Dimitrova, John R. Kender, John Zimmerman:
Study on requirement specifications for personalized multimedia summarization.
757-760

- Chun-Chuan Yang, Chih-Wen Tien, Yung-Chi Wang:
Supporting VCR-like operations in SMIL2.0 players.
761-764

- Erkut Erdem, Aykut Erdem, Volkan Atalay, A. Enis Çetin:
Computer vision based unistroke keyboard system and mouse for the handicapped.
765-768

- Ishwar Ramani, Rajiv P. Bharadwaja, P. Venkat Rangan:
Location tracking for media appliances in wireless home networks.
769-772

- Lujun Yuan, Wen Gao, Yan Lu:
Latest arrival time leaky bucket for HRD constrained video coding.
773-776

Motion Estimation
- Charay Lerdsudwichai, Mohamed Abdel-Mottaleb:
Algorithm for multiple faces tracking.
777-780

- Patrick Lanvin, Jean-Charles Noyer, Mohammed Benjelloun:
Non-linear estimation of image motion and tracking.
781-784

- Mireya S. Garcia, Henri Nicolas:
Video object motion applications focusing on non-planar rotation.
785-788

- Yu-Kuang Tu, Jar-Ferr Yang, Yi-Nung Shen, Ming-Ting Sun:
Fast variable-size block motion estimation using merging procedure with an adaptive threshold.
789-792

- Hongbin Wang, Hua Lin:
A spectral clustering approach to motion segmentation based on motion trajectory.
793-796

- Korada Ramkishor, T. S. Raghu, K. Suman, Pallapothu S. S. B. K. Gupta:
Spatial correlation based fast field motion vector estimation algorithm for interlaced video encoding.
797-800

- Ye Lu, Cheng Lu, Ze-Nian Li:
A modified space frequency decomposition algorithm for visual motion.
801-804

- Sumeer Goel, Mohsen Shaaban, Tarek Darwish, Hanan A. Mahmoud, Magdy Bayoumi:
Memory accesses reduction for MIME algorithm.
805-808

- Yu-Wen Huang, Bing-Yu Hsieh, Tu-Chih Wang, Shao-Yi Chen, Shyh-Yih Ma, Chun-Fu Shen, Liang-Gee Chen:
Analysis and reduction of reference frames for motion estimation in MPEG-4 AVC/JVT/H.264.
809-812

- Shunan Lin, Anthony Vetro, Yao Wang:
Rate-distortion analysis of the multiple description motion compensation video coding scheme.
813-816

Design and Implementation of Signal Processing Systems
- Adel Baganne, Imed Bennour, Mehrez Elmarzougui, Eric Martin:
A simulation based approach for incorporating virtual components IP cores into multimedia systems design.
817-820

- Atsushi Hatabu, Takashi Miyazaki, Ichiro Kuroda:
Optimization of decision-timing for early termination of SSDA-based block matching.
821-824

- Xiaojuan Hu, Linda DeBrunner, Victor E. DeBrunner:
Design of space-efficient, wide- and narrow transition-band, FIR filters.
825-828

- Duy Cuong Nguyen, Parham Aarabi, Ali Sheikholeslami:
Real-time sound localization using field-programmable gate arrays.
829-832

- Sang Yoon Park, Nam Ik Cho:
Fixed point error analysis of CORDIC processor based on the variance propagation.
833-836

- Justin J. Song, Jian Li, Yen-Kuang Chen:
Quality-delay-and-computation trade-off analysis of acoustic echo cancellation on general-purpose CPU.
837-840

- Etienne Cornu, Hamid Sheikhzadeh, Robert L. Brennan, Hamid Reza Abutalebi, Edmund C. Y. Tam, Peter Iles, Kar Wai Wong:
ETSI AMR-2 VAD: evaluation and ultra low-resource implementation.
841-844

- Daisuke Takahashi:
A radix-16 FFT algorithm suitable for multiply-add instruction based on Goedecker method.
845-848

- Sung-Won Lee, In-Cheol Park:
Low-power hybrid structure of digital matched filters for direct sequence spread spectrum systems.
849-852

Volume 3
Theoretical Insights and Improvements for Multimodal Biometrics
- Conrad Sanderson, Samy Bengio, Hervé Bourlard, Johnny Mariéthoz, Ronan Collobert, Mohamed Faouzi BenZeghiba, Fabien Cardinaux, Sébastien Marcel:
Speech & face based biometric authentication at IDIAP.
1-4

- Julian Fiérrez-Aguilar, Javier Ortega-Garcia, Joaquin Gonzalez-Rodriguez:
Fusion strategies in multimodal biometric verification.
5-8

- Upendra V. Chaudhari, Ganesh N. Ramaswamy, Gerasimos Potamianos, Chalapathy Neti:
Information fusion and decision cascading for audio-visual speaker recognition based on time-varying stream reliability prediction.
9-12

- Xiaoguang Lu, Yunhong Wang, Anil K. Jain:
Combining classifiers for face recognition.
13-16

- Arslan Brömme:
A classification of biometric signatures.
17-20

Summarization
- Michael G. Christel, Chang Huang:
Enhanced access to digital video through visually rich interfaces.
21-24

- Berna Erol, Dar-Shyang Lee, Jonathan J. Hull:
Multimodal summarization of meeting recordings.
25-28

- Lexing Xie, Shih-Fu Chang, Ajay Divakaran, Huifang Sun:
Unsupervised discovery of multilevel statistical video structures using hierarchical hidden Markov models.
29-32

- Stefano Berretti, Alberto Del Bimbo, Pietro Pala:
Merging results of distributed image libraries.
33-36

- Rui Cai, Lie Lu, Hong-Jiang Zhang, Lian-Hong Cai:
Highlight sound effects detection in audio stream.
37-40

Multistream Audio and Video Processing for Telepresence
- Douglas L. Jones:
Four-dimensional sound source recovery from arbitrary acoustic arrays.
41-44

- Qiong Liu, Don Kimber, Jonathan Foote, Chunyuan Liao:
Multichannel video/audio acquisition for immersive conferencing.
45-48

- Wolfgang Herbordt, Herbert Buchner, Walter Kellermann, Rudolf Rabenstein, Sascha Spors, Heinz Teutsch:
Full-duplex multichannel communication: real-time implementations in a general framework.
49-52

- Parham Aarabi, Bob Mungamuru:
Scene reconstruction using distributed microphone arrays.
53-56

- Ankur Mohan, Ramani Duraiswami, Dmitry N. Zotkin, Daniel DeMenthon, Larry S. Davis:
Using computer vision to generate customized spatial audio.
57-60

Video/Image tracking
- Takashi Yamamoto, Rama Chellappa:
Shape and motion driven particle filtering for human body tracking.
61-64

- Karthik Hariharakrishnan, Dan Schonfeld, Philippe Raffy, Fathy Yassa:
Object tracking using adaptive block matching.
65-68

- Gabriel Tsechpenakis, Kostas Rapanntzikos, Nicolas Tsapatsoulis, Stefanos D. Kollias:
Object tracking in clutter and partial occlusion through rule-driven utilization of Snakes.
69-72

- Ofer Miller, Ety Navon, Amir Averbuch:
Tracking of moving objects based on graph edges similarity.
73-76

- Hao Jiang, Mark S. Drew:
Shadow-resistant tracking in video.
77-80

Multimedia Security and Content Protection IV
- Adnan Abdul-Aziz Gutub, Mohammad K. Ibrahim:
High performance elliptic curve GF(2k) cryptoprocessor architecture for multimedia.
81-84

- Wei-Qi Yan, Mohan S. Kankanhalli:
Scrambling of engineering drawings.
85-88

- Mitsuru Kondo, Daigo Muramatsu, Masahiro Sasaki, Takashi Matsumoto:
Nonlinear separation of signature trajectories for on-line personal authentication.
89-92

- José Gabriel Rodríguez Carneiro Gomes, Mylene Christine Queiroz de Farias, Sanjit K. Mitra, Marco Carli:
An accurate billing mechanism for multimedia communications.
93-96

- Dipti Prasad Mukherjee, Subhamoy Maitra:
Robust buyer authentication scheme for multimedia object.
97-100

- Haiping Lu, Alex C. Kot, Susanto Rahardja:
Binary image watermarking through biased binarization.
101-104

- Suk-Hawn Lee, Tae-Su Kim, Byung-Ju Kim, Seong Geun Kwon, Ki-Ryong Kwon, Kuhn-Il Lee:
3D polygonal meshes watermarking using normal vector distributions.
105-108

- Nut Taesombut, Vineet Kumar, Rishi Dubey, P. Venkat Rangan:
Secure registration protocol for media appliances in wireless home networks.
109-112

Human Movement and Face Analysis
- Naresh P. Cuntoor, Amit A. Kale, Rama Chellappa:
Combining multiple evidences for gait recognition.
113-116

- Richard D. Green, Ling Guan:
Tracking human movement patterns using particle filtering.
117-120

- Jian Li, Shaohua Kevin Zhou, Chandra Shekhar:
A comparison of subspace analysis for face recognition.
121-124

- Jianyu Wang, Wen Gao, Shiguang Shan, XiaoPeng Hu:
Facial feature tracking combining model-based and model-free method.
125-128

- Shaohua Kevin Zhou, Rama Chellappa:
Simultaneous tracking and recognition of human faces from video.
129-132

- Gang Pan, Zhaohui Wu, Yunhe Pan:
Automatic 3D face verification from range data.
133-136

- Heng Liu, Shengye Yan, Xilin Chen, Wen Gao:
Rotated face detection in color images using radial template (RT).
137-140

- Xiujuan Chai, Shiguang Shan, Wen Gao, Bo Cao:
Novel example-based shape learning for fast face alignment.
141-144

- Do-Hyung Kim, Jaeyeon Lee, Jung Soh, YunKoo Chung:
Real-time face verification using multiple feature combination and a support vector machine supervisor.
145-148

- Wen Gao, Shiguang Shan, Xiujuan Chai, Xiaowei Fu:
Virtual face image generation for illumination and pose insensitive face recognition.
149-152

Image and Video Coding and Analysis
- Chengjie Tu, Trac D. Tran, Jie Liang:
Error resilient pre-/post-filtering for DCT-based block coding systems.
153-156

- Aysegul Cuhadar, Sinan Tasdoken:
Multiple arbitrary shape ROI coding with zerotree based wavelet coders.
157-160

- Marie Babel, Olivier Déforges:
Lossless and lossy minimal redundancy pyramidal decomposition for scalable image compression technique.
161-164

- Jari Korhonen, Ye Wang:
Schemes for error resilient streaming of perceptually coded audio.
165-168

- Stefano Belfiore, Marco Grangetto, Enrico Magli, Gabriella Olmo:
Spatio-temporal video error concealment with perceptually optimized mode selection.
169-172

- Son Lam Phung, Douglas Chai, Abdesselam Bouzerdoum:
Adaptive skin segmentation in color images.
173-176

- Takuma Ishida, Shogo Muramatsu, Hisakazu Kikuchi, Tetsuro Kuge:
Invertible deinterlacing with variable coefficients and its lifting implementation.
177-180

- Namrata Vaswani, Amit K. Roy Chowdhury, Rama Chellappa:
Statistical shape theory for activity modeling.
181-184

- John N. Carter, Pelopidas Lappas, Robert I. Damper:
Evidence-based object tracking via global energy maximization.
185-188

- Manoranjan Paul, Manzur Murshed, Laurence Dooley:
A new real-time pattern selection algorithm for very low bit-rate video coding focusing on moving regions.
189-192

Speech and Audio Processing IV
- Ye Wang, Jian Tang, Ali Ahmaniemi, Markus Vaalgamaa:
Parametric vector quantization for coding percussive sounds in music.
193-196

- Mukund Devarajan, Fansheng Meng, Penny Hix, Stephen A. Zahorian:
HMM-neural network monophone models for computer based articulation training for the hearing impaired.
197-200

- Suryakanth V. Gangashetty, C. Chandra Sekhar, B. Yegnanarayana:
Constraint satisfaction model for enhancement of evidence in recognition of consonant-vowel utterances.
201-204

- Daniel Garcia-Romero, Julian Fiérrez-Aguilar, Joaquin Gonzalez-Rodriguez, Javier Ortega-Garcia:
Support vector machine fusion of idiolectal and acoustic speaker information in Spanish conversational speech.
205-208

- Takanobu Nishiura, Masato Nakayama, Satoshi Nakamura:
An evaluation of adaptive beamformer based on average speech spectrum for noisy speech recognition.
209-212

- Jianhua Tao, Xing Ni:
Auditive learning based Chinese F0 prediction.
213-216

- Justinian P. Rosca, Radu V. Balan, Christophe Beaugeant:
Multi-channel psychoacoustically motivated speech enhancement.
217-220

- Jin Li:
A progressive to lossless embedded audio coder (PLEAC) with reversible modulated lapped transform.
221-224

Signal Processing and Testing in Multimodal Biometrics
Multimedia Coding and Transport
- Narasinha Kamat, Ju Wang, Jonathan C. L. Liu:
A delay-efficient rerouting scheme for VoIP traffic.
245-248

- Xiaofei Liao, Hai Jin:
A new cluster-based distributed video recorder server.
249-252

- Zhihua Chen, Bobby Bodenheimer, J. Fritz Barnes:
Extending progressive meshes for use over unreliable networks.
253-256

- Christian Bachmeir, Peter Tabery, Serdar Uzumcu, Eckehard G. Steinbach:
A scalable virtual programmable real-time testbed for rapid multimedia service creation and evaluation.
257-260

- Bulent Cavusoglu, Dan Schonfeld, Rashid Ansari:
Real-time adaptive forward error correction for MPEG-2 video communications over RTP networks.
261-264

Multimedia Standards
- Chun-Chuan Yang, Chih-Wen Tien, Yung-Chi Wang:
Modeling of the non-deterministic synchronization behaviors in SMIL2.0 documents.
265-268

- Zaher Aghbari, Akifumi Makinouchi:
Extending MPEG-7 description scheme of moving regions by the semantic visual-spatio-temporal relationships.
269-272

- Jason Lukasiak, David Stirling, Nick Harders, Shane Perrow:
Performance of MPEG-7 low level audio descriptors with compressed data.
273-276

- Yick Ming Yeung, Oscar C. Au, Andy Chang:
Efficient rate control technique for JPEG2000 image coding using priority scanning.
277-280

- Jae-Gon Kim, Yong Wang, Shih-Fu Chang:
Content-adaptive utility-based video adaptation.
281-284

Face Analysis and Modeling
Segmentation, Summarization, and Structuring
- Ichiro Ide, Hiroshi Mo, Norio Katayama, Shin'ichi Satoh:
Topic-based inter-video structuring of a large-scale news video corpus.
305-308

- Ewa Kijak, Guillaume Gravier, Patrick Gros, Lionel Oisel, Frédéric Bimbot:
HMM based structuring of tennis videos using visual and audio cues.
309-312

- Lionel Brunel, Pierre Mathieu:
Fast method of segmentation and indexing MPEG1-2 flow.
313-316

- Yue Zhang, Mario A. Nascimento, Osmar R. Zaïane:
Building image mosaics: an application of content-based image retrieval.
317-320

- Wenli Zhang, Xiaomeng Wu, Shunsuke Kamijo, Masao Sakauchi:
A proposal for a video content generation support system and its application.
321-324

- Yan Liu, John R. Kender:
Fast scene segmentation using multi-level feature selection.
325-328

- Jek Charlson So Yu, Mohan S. Kankanhalli, Philippe Mulhem:
Semantic video summarization in compressed domain MPEG video.
329-332

- Xingquan Zhu, Xindong Wu:
Sequential association mining for video summarization.
333-336

- Eliza Yingzi Du, Chein-I Chang, Paul D. Thouin:
An unsupervised approach to color video thresholding.
337-340

- Darren E. Butler, Sridha Sridharan, V. Michael Bove Jr.:
Real-time adaptive background segmentation.
341-344

Rate Control and Packet Classification for Transmission
- Enrico Masala, Juan Carlos De Martin:
Analysis-by-synthesis distortion computation for rate-distortion optimized multimedia streaming.
345-348

- Yuh-Ching Wang, Jin-Jang Leou:
A rate control scheme for H.26L video transmission.
349-352

- Mei-Ling Shyu, Shu-Ching Chen, Hongli Luo:
Ensuring fairness in multimedia multicast streaming with optimal rate allocation and client buffer utilization.
353-356

- S. R. Subramanya, Jagannathan Sarangapani, Mingsheng Peng:
A scheme for fair, rate-based end-to-end congestion control of multimedia traffic in packet switched networks.
357-360

- Chi-Wah Wong, Oscar C. Au, Bojun Meng, Hong-Kwai Lam:
Perceptual rate control for low-delay video communications.
361-364

- Mei-Ling Shyu, Shu-Ching Chen, Hongli Luo:
Per-class queue management and adaptive packet drop mechanism for multimedia networking.
365-368

- Davide Quaglia, Juan Carlos De Martin:
Adaptive packet classification for constant perceptual quality of service delivery of video streams over time-varying networks.
369-372

- Qiang Liu, Jenq-Neng Hwang:
End-to-end available bandwidth estimation and time measurement adjustment for multimedia QOS.
373-376

- Lifeng Zhao, C.-C. Jay Kuo:
Buffer-constrained R-D optimized rate control for video coding.
377-380

Audio Signal Processing
- Dmitry N. Zotkin, Shihab A. Shamma, Powen Ru, Ramani Duraiswami, Larry S. Davis:
Pitch and timbre manipulations using cortical representation of sound.
381-384

- Hsuan-Huei Shih, Shrikanth S. Narayanan, C.-C. Jay Kuo:
Multidimensional humming transcription using a statistical approach for query by humming systems.
385-388

- Arvindh Krishnaswamy:
Application of pitch tracking to South Indian classical music.
389-392

- Mohammed Raad, Alfred Mertins, Ian S. Burnett:
Scalable to lossless audio compression based on perceptual set partitioning in hierarchical trees (PSPIHT).
393-396

- Ziyou Xiong, Regunathan Radhakrishnan, Ajay Divakaran, Thomas S. Huang:
Comparing MFCC and MPEG-7 audio features for feature extraction, maximum likelihood HMM and entropic prior HMM for sports audio classification.
397-400

- Ziyou Xiong, Regunathan Radhakrishnan, Ajay Divakaran, Thomas S. Huang:
Audio events detection based highlights extraction from baseball, golf and soccer games in a unified framework.
401-404

- Lie Lu, Yi Mao, Liu Wenyin, Hong-Jiang Zhang:
Audio restoration by constrained audio texture synthesis.
405-408

- Tetsuro Kitahara, Masataka Goto, Hiroshi G. Okuno:
Musical instrument identification based on F0-dependent multivariate normal distribution.
409-412

- Dreten De Koning, Werner Verhelst:
On psychoacoustic noise shaping for audio requantization.
413-416

Architecture, Implementation, and Design
- Nicolas Ventroux, Jean-François Nezan, Mickaël Raulet, Olivier Déforges:
Rapid prototyping for an optimized MPEG4 decoder implementation over a parallel heterogeneous architecture.
417-420

- Hung-Chi Fang, Tu-Chih Wang, Yu-Wei Chang, Liang-Gee Chen:
Hardware oriented rate control algorithm and implementation for realtime video coding.
421-424

- Ho-Man Tang, Michael R. Lyu, Irwin King:
Face recognition committee machine.
425-428

- Shantanu Chakrabartty, Masakazu Yagi, Tadashi Shibata, Gert Cauwenberghs:
Robust cephalometric landmark identification using support vector machines.
429-432

- Richard Kuehnel, Yuke Wang:
A method of generating uniformly distributed sequences over [0, K], where K+1 is not a power of two.
433-436

- Anand Krishnamurthy, Yiyan Tang, Cathy Xu, Yuke Wang:
An efficient implementation of multi-prime RSA on DSP processor.
437-440

- Donglai Xu, Rui Gao, Hadj Batatia:
An improved parallel architecture fro MPEG-4 motion estimation in 3G mobile applications.
441-444

- Toshiyuki Yamane, Yasunao Katayama:
An ultra-fast Reed-Solomon decoder soft-IP with 8-error correcting capability.
445-448

Multimedia Technology in Bioinformatics
- Zuyi Wang, Sun-Yuan Kung, Junying Zhang, Javed I. Khan, Jianhua Xuan, Yue Joseph Wang:
Computational intelligence approach for gene expression data mining and classification.
449-452

- Harry Hochheiser, Eric H. Baehrecke, Stephen M. Mount, Ben Shneiderman:
Dynamic querying for pattern identification in microarray and genomic data.
453-456

- Sophia R. He, Edmond J. Breen, Sybille M. N. Hunt:
Proteomics: approaches and image analysis tools for drug discovery.
457-460

- Jinwook Seo, Marina Bakay, Po Zhao, Yi-Wen Chen, Priscilla Clarkson, Ben Shneiderman, Eric P. Hoffman:
Interactive color mosaic and dendrogram displays for signal/noise optimization in microarray data analysis.
461-464

- Per B. Hojte, Xiaoxing Wang:
Registering electrophoresis images for bioinformatics study of protein.
465-468

Video Analysis and Mining
- Dong-Jun Lan, Yu-Fei Ma, Hong-Jiang Zhang:
A novel motion-based representation for video mining.
469-472

- Belle L. Tseng, Ching-Yung Lin, DongQing Zhang, John R. Smith:
Improved text overlay detection in videos using a fusion-based classifier.
473-476

- Chih-Yi Chiu, Shih-Pin Chao, Jui-Hsiang Chao, Wen-Yen Chang, Hsin-Chih Lin, Shi-Nine Yang:
Motion indexing and synthesis.
477-480

- Cees G. M. Snoek, Marcel Worring:
Time interval maximum entropy based event indexing in soccer video.
481-484

- Li-Qun Xu, Yongmin Li:
Video classification using spatial-temporal features and PCA.
485-488

Multimedia Computing Systems and Appliances
- Ju Wang, Jonathan C. L. Liu, Yishu He:
Efficient buffering control for a software-only, high-level, high-profile, MPEG-2 decoder.
489-492

- Yan Zhu, Min-You Wu, Wei Shu:
Comparison study and evaluation of overlay multicast networks.
493-496

- Yoshitaka Nakamura, Hirozumi Yamaguchi, Akihito Hiromori, Keiichi Yasumoto, Teruo Higashino, Kenichi Taniguchi:
On designing end-user multicast for multiple video sources.
497-500

- Eugenio Costamagna, Lorenzo Favalli, Francesco Tarantola:
Characterization and modeling of campus-level IP network traffic.
501-504

- Stuart Goose, Rajanikanth Tanikella, Sreedhar Kodlahalli:
Attenuator: towards preserving the original appearance of large documents when rendered on small screen mobile devices.
505-508

Fast Algorithm for Video Processing
- Keman Yu, Jiangbo Lu, Jiang Li, Shipeng Li:
Practical real-time video codec for mobile devices.
509-512

- Hyungjoon Kim, Yucel Altunbasak:
Low-complexity rate-distortion optimal macroblock mode selection for MPEG-like video coders.
513-516

- Hye-Yeon C. Tourapis, Alexis M. Tourapis:
Fast motion estimation within the H.264 codec.
517-520

- Bojun Meng, Oscar C. Au, Chi-Wah Wong, Hong-Kwai Lam:
Efficient intra-prediction mode selection for 4×4 blocks in H.264.
521-524

- Jun Xin, Ming-Ting Sun, Vincent Hsu:
Diversity-based fast block motion estimation.
525-528

Multimedia Human-Machine Interface and Interaction
- Yao-Jen Chang, Chao-Kuei Hsieh, Pei-Wei Hsu, Yung-Chang Chen:
Speech-assisted facial expression analysis and synthesis for virtual conferencing systems.
529-532

- Ashish Verma, Nitendra Rajput, L. Venkata Subramaniam:
Using viseme based acoustic models for speech driven lip synthesis.
533-536

- Atsuo Yoshitaka, Hirokazu Seki:
Detecting auditory information in concentration based on eye movement.
537-540

- Martin Zobl, Michael Geiger, Björn Schuller, Manfred K. Lang, Gerhard Rigoll:
A real-time system for hand gesture controlled operation of in-car devices.
541-544

- Olivier Pietquin, Thierry Dutoit:
Aided design of finite-state dialogue management systems.
545-548

- Laurence Devillers, Lori Lamel, Ioana Vasilescu:
Emotion detection in task-oriented spoken dialogues.
549-552

- Nils Klarlund:
Editing by voice and the role of sequential symbol systems for improved human-to-computer information rates.
553-556

- Amarnag Subramanya, Raghunandan S. Kumaran, John N. Gowdy:
Real time eye tracking for human computer interfaces.
557-560

- Alper Kanak, Engin Erzin, Yucel Yemez, A. Murat Tekalp:
Joint audio-video processing for biometric speaker identification.
561-564

- Ying Li, Shrikanth Narayanan, C.-C. Jay Kuo:
Audiovisual-based adaptive speaker identification.
565-568

Algorithms and Architectures for Multimedia Communcations
- Sumit Roy, John Ankcorn, Susie Wee:
Architecture of a modular streaming media server for content delivery networks.
569-572

- Hideaki Ito, Teruo Fukumura:
A delivery method of videos with required minimum bandwidths.
573-576

- Shiang-Chun Liou, Hsuan-Chia Lu, Kuo-Hsien Yeh:
A capable location prediction and resource reservation scheme in wireless networks for multimedia.
577-580

- Yen-Chi Lee, Yucel Altunbasak, Russell M. Mersereau:
A drift-free motion-compensated predictive encoding technique for multiple description coding.
581-584

- Enrico Magli, Massimo Mancin, Luca Merello:
Low-complexity video compression for wireless sensor networks.
585-588

- Shuhua Peng, Xiaodong Liu, Qionghai Dai, Yu Cheng:
An improved RM algorithm for preventing streaming media tasks from starvation.
589-592

- Gaurav Harit, Santanu Chaudhury, Gaurav Garg, Pramod Kumar Sharma:
A framework for video representation and transcoding using appearance spaces.
593-596

- Andrea Cavallaro, Olivier Steiger, Touradj Ebrahimi:
Semantic segmentation and description for video transcoding.
597-600

- Tu-Chih Wang, Yu-Wen Huang, Hung-Chi Fang, Liang-Gee Chen:
Performance analysis of hardware oriented algorithm modification in H.264.
601-604

Speech Recognition and Enhancement
- Ashutosh Garg, Gerasimos Potamianos, Chalapathy Neti, Thomas S. Huang:
Frame-dependent multi-stream reliability indicators for audio-visual speech recognition.
605-608

- Hideki Banno, Tetsuya Shinde, Kazuya Takeda, Fumitada Itakura:
In-car speech recognition using distributed microphones: adapting to automatically detected driving conditions.
609-612

- LiFeng Sang, Zhaohui Wu, Yingchun Yang, Wanfeng Zhang:
Automatic speaker recognition using dynamic Bayesian network.
613-616

- Phu Chien Nguyen, Masato Akagi, Tu Bao Ho:
Temporal decomposition: a promising approach to VQ-based speaker identification.
617-620

- Guillaume Lathoud, Iain McCowan:
Location based speaker segmentation.
621-624

- Shoichi Matsunaga, Atsunori Ogawa, Yoshikazu Yamaguchi, Akihiro Imamura:
Non-native English speech recognition using bilingual English lexicon and acoustic models.
625-628

- Guangji Shi, Parham Aarabi:
Robust digit recognition using phase-dependent time-frequency masking.
629-632

- Jounghoon Beh, Hanseok Ko:
A novel spectral subtraction scheme for robust speech recognition: spectral subtraction using spectral harmonics of speech.
633-636

Last update Sat May 25 03:24:49 2013
CET by the DBLP Team —
Data released under the ODC-BY 1.0 license — See also our legal information page