28. SIGIR 2005: Salvador, Bahia, Brazil
Ricardo A. Baeza-Yates, Nivio Ziviani, Gary Marchionini, Alistair Moffat, John Tait (Eds.): SIGIR 2005: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Salvador, Brazil, August 15-19, 2005. ACM 2005 ISBN 1-59593-034-5
João Candido Portinari: The Portinari project: IR helps art and culture. 1-2
Theory 1


Oren Kurland, Lillian Lee, Carmel Domshlak: Better than the real thing?: iterative pseudo-query processing using cluster-based language models. 19-26
Javed A. Aslam, Emine Yilmaz, Virgiliu Pavlu: The maximum entropy method for analyzing retrieval measures. 27-34
Relevance feedback
Ryen W. White, Ian Ruthven, Joemon M. Jose: A study of factors affecting the utility of implicit relevance feedback. 35-42
Xuehua Shen, Bin Tan, ChengXiang Zhai: Context-sensitive information retrieval using implicit feedback. 43-50
Chen Zhang, Joyce Y. Chai, Rong Jin: User term feedback in interactive text-based image retrieval. 51-58
Distributed
Matthias Bender, Sebastian Michel, Peter Triantafillou, Gerhard Weikum, Christian Zimmer: Improving collection selection with overlap awareness in P2P search engines. 67-74

Kartik Hosanagar: A utility theoretic approach to determining optimal wait times in distributed information retrieval. 91-97
Filtering
Yiming Yang, Shinjae Yoo, Jian Zhang, Bryan Kisiel: Robustness of adaptive filtering methods in a cross-benchmark evaluation. 98-105
Zhiwei Li, Bin Wang, Mingjing Li, Wei-Ying Ma: A probabilistic model for retrospective news event detection. 106-113
Gui-Rong Xue, Chenxi Lin, Qiang Yang, Wensi Xi, Hua-Jun Zeng, Yong Yu, Zheng Chen: Scalable collaborative filtering using cluster-based smoothing. 114-121
Categorization and classification
Jun Yan, Ning Liu, Benyu Zhang, Shuicheng Yan, Zheng Chen, QianSheng Cheng, Weiguo Fan, Wei-Ying Ma: OCFS: optimal orthogonal centroid feature selection for text categorization. 122-129
Wensi Xi, Edward A. Fox, Weiguo Fan, Benyu Zhang, Zheng Chen, Jun Yan, Dong Zhuang: SimFusion: measuring similarity using unified relationship matrix. 130-137
Kazuhiro Seki, Javed Mostafa: An application of text categorization methods to gene ontology annotation. 138-145
Evaluation
Kai Puolamäki, Jarkko Salojärvi, Eerika Savia, Jaana Simola, Samuel Kaski: Combining eye movements and collaborative filtering for proactive information retrieval. 146-153
Thorsten Joachims, Laura A. Granka, Bing Pan, Helene Hembrooke, Geri Gay: Accurately interpreting clickthrough data as implicit feedback. 154-161
Mark Sanderson, Justin Zobel: Information retrieval system evaluation: effort, sensitivity, and reliability. 162-169
Web search 1
Dennis Fetterly, Mark Manasse, Marc Najork: Detecting phrase-level duplication on the world wide web. 170-177
Paul-Alexandru Chirita, Wolfgang Nejdl, Raluca Paiu, Christian Kohlschütter: Using ODP metadata to personalize search. 178-185
Gui-Rong Xue, Qiang Yang, Hua-Jun Zeng, Yong Yu, Zheng Chen: Exploiting the hierarchical structure for link analysis. 186-193
Summarization
Jian-Tao Sun, Dou Shen, Hua-Jun Zeng, Qiang Yang, Yuchang Lu, Zheng Chen: Web-page summarization using clickthrough data. 194-201
Kathleen McKeown, Rebecca J. Passonneau, David K. Elson, Ani Nenkova, Julia Hirschberg: Do summaries help? 210-217
Fernando Flores: The future of media, blogs and innovation: new IR challenges? 218
Efficiency
Trevor Strohman, Howard R. Turtle, W. Bruce Croft: Optimization strategies for complex queries. 219-225
Nieves R. Brisaboa, Antonio Fariña, Gonzalo Navarro, José R. Paramá: Efficiently decodable and searchable natural language adaptive compression. 234-241
Martin Theobald, Ralf Schenkel, Gerhard Weikum: Efficient and self-tuning incremental query expansion for top-k query processing. 242-249
Categorization and supervised machine learning
Yunhua Hu, Guomao Xin, Ruihua Song, Guoping Hu, Shuming Shi, Yunbo Cao, Hang Li: Title extraction from bodies of HTML documents and its application to web page retrieval. 250-257
Dell Zhang, Xi Chen, Wee Sun Lee: Text classification with kernels on the multinomial manifold. 266-273
Shenghuo Zhu, Xiang Ji, Wei Xu, Yihong Gong: Multi-labelled classification using maximum entropy method. 274-281
Theory 2
Arjen P. de Vries, Thomas Rölleke: Relevance information: a loss of entropy but a gain for IDF? 282-289
Jianfeng Gao, Haoliang Qi, Xinsong Xia, Jian-Yun Nie: Linear discriminant model for information retrieval. 290-297
Oren Kurland, Lillian Lee: PageRank without hyperlinks: structural re-ranking using links induced by language models. 306-313
Structured data
Charles L. A. Clarke: Controlling overlap in content-oriented XML retrieval. 314-321
Christos Tryfonopoulos, Stratos Idreos, Manolis Koubarakis: Publish/subscribe functionality in IR environments using structured overlay networks. 322-329
Paul A. Viola, Mukund Narasimhan: Learning to extract information from semi-structured text using a discriminative context free grammar. 330-337
NLP

Vitor Rocha de Carvalho, William W. Cohen: On the collective classification of email "speech acts". 345-352
Multimedia
Changsheng Xu, Xi Shao, Namunu Chinthaka Maddage, Mohan S. Kankanhalli: Automatic music video summarization based on audio-visual-text analysis and alignment. 361-368
Bin Ma, Haizhou Li: A phonotactic-semantic paradigm for automatic spoken document classification. 369-376
Nicholas R. Howe, Toni M. Rath, R. Manmatha: Boosted decision trees for word recognition in handwritten document retrieval. 377-383
Question answering
Hang Cui, Min-Yen Kan, Tat-Seng Chua: Generic soft pattern models for definitional question answering. 384-391
Jimmy J. Lin: Evaluation of resources for question answering evaluation. 392-399
Hang Cui, Renxu Sun, Keya Li, Min-Yen Kan, Tat-Seng Chua: Question answering passage retrieval using dependency relations. 400-407
Web search 2
Tao Qin, Tie-Yan Liu, Xu-Dong Zhang, Zheng Chen, Wei-Ying Ma: A study of relevance propagation for web search. 408-415
Nick Craswell, Stephen E. Robertson, Hugo Zaragoza, Michael J. Taylor: Relevance weighting for query independent evidence. 416-423
Lee Wang, Chuang Wang, Xing Xie, Josh Forman, Yansheng Lu, Wei-Ying Ma, Ying Li: Detecting dominant locations from search queries. 424-431
Amit Singhal: Challenges in running a commercial search engine. 432
User studies
James Allan, Ben Carterette, Joshua Lewis: When will information retrieval be "good enough"? 433-440
Luanne Freund, Elaine G. Toms, Charles L. A. Clarke: Modeling task-genre relationships for IR in the workplace. 441-448
Jaime Teevan, Susan T. Dumais, Eric Horvitz: Personalizing search via automated analysis of interests and activities. 449-456
Diane Kelly, Vijay Deepak Dollu, Xin Fu: The loquacious user: a document-independent source of terms for query expansion. 457-464
Theory 3



Shuming Shi, Ji-Rong Wen, Qing Yu, Ruihua Song, Wei-Ying Ma: Gravitation-based model for information retrieval. 488-495
Web search 3
Berthier A. Ribeiro-Neto, Marco Cristo, Paulo Braz Golgher, Edleno Silva de Moura: Impedance coupling in content-targeted advertising. 496-503
Benyu Zhang, Hua Li, Yi Liu, Lei Ji, Wensi Xi, Weiguo Fan, Zheng Chen, Wei-Ying Ma: Improving web search results using affinity graph. 504-511
Elad Yom-Tov, Shai Fine, David Carmel, Adam Darlow: Learning to estimate query difficulty: including applications to missing content detection and distributed information retrieval. 512-519
Cross-language
Christof Monz, Bonnie J. Dorr: Iterative translation disambiguation for cross-language information retrieval. 520-527
Kornél G. Markó, Stefan Schulz, Olena Medelyan, Udo Hahn: Bootstrapping dictionaries for cross-language information retrieval. 528-535
Yi Liu, Rong Jin, Joyce Y. Chai: A maximum coherence model for dictionary-based cross-language information retrieval. 536-543
Video and image
Arnab Ghoshal, Pavel Ircing, Sanjeev Khudanpur: Hidden Markov models for automatic annotation and content-based retrieval of images and video. 544-551
Munirathnam Srikanth, Joshua Varner, Mitchell Bowden, Dan I. Moldovan: Exploiting ontologies for automatic image annotation. 552-558
Gustavo Carneiro, Nuno Vasconcelos: A database centric view of semantic image annotation and retrieval. 559-566
Posters
Eugene Agichtein, Silviu Cucerzan, Eric Brill: Analysis of factoid questions for effective relation extraction. 567-568
Javier Artiles, Julio Gonzalo, Felisa Verdejo: A testbed for people searching strategies in the WWW. 569-570
Javed A. Aslam, Emine Yilmaz, Virgiliu Pavlu: A geometric interpretation of r-precision and its correlation with average precision. 573-574
Leif Azzopardi, Mark Girolami, Malcolm Crowe: Probabilistic hyperspace analogue to language. 575-576
Claudine Santos Badue, Ramurti A. Barbosa, Paulo Braz Golgher, Berthier A. Ribeiro-Neto, Nivio Ziviani: Basic issues on the processing of web queries. 577-578
Wilma Bainbridge, Ryen W. White, Douglas W. Oard: An interface to search human movements based on geographic and chronological metadata. 579-580
Steven M. Beitzel, Eric C. Jensen, Ophir Frieder, David A. Grossman, David D. Lewis, Abdur Chowdhury, Aleksander Kolcz: Automatic web query classification using labeled and unlabeled training data. 581-582
Steven M. Beitzel, Eric C. Jensen, Ophir Frieder, Abdur Chowdhury, Greg Pass: Surrogate scoring for improved metasearch precision. 583-584
Roi Blanco, Alvaro Barreiro: Characterization of a simple case of the reassignment of document identifiers as a pattern sequencing problem. 587-588
Oisín Boydell, Barry Smyth, Cathal Gurrin, Alan F. Smeaton: Evaluating the impact of selection noise in community-based web search. 591-592
Kian Ming Adam Chai: Expectation of f-measures: tractable exact computation and some empirical observations of its properties. 593-594

Paul Ferguson, Alan F. Smeaton, Cathal Gurrin, Peter Wilkins: Top subset retrieval on large collections using sorted indices. 599-600

Jens Grivolla: Using Oracle for natural language document retrieval: an automatic query reformulation approach. 605-606
Nathalie Hernandez, Josiane Mothe, Sandra Poulain: Customizing information access according to domain and task knowledge: the ontoExplo system. 607-608
Eduard Hoenkamp, Sander van Dijk: Evaluating semantic indexing techniques through cross-language fingerprinting. 609-610
Xiangji Huang, Yan Rui Huang, Miao Wen: A dual index model for contextual information retrieval. 613-614
Eric C. Jensen, Steven M. Beitzel, David A. Grossman, Ophir Frieder, Abdur Chowdhury: Predicting query difficulty on the web by learning visual clues. 615-616
Jiwoon Jeon, W. Bruce Croft, Joon Ho Lee: Finding semantically similar questions based on their answers. 617-618
Rong Jin, Joyce Y. Chai: Study of cross lingual information retrieval using on-line translation systems. 619-620
Rieko Kadobayashi, Katsumi Tanaka: 3D viewpoint-based photo search and information browsing. 621-622
Tomoko Kajiyama, Noriko Kando, Shin'ichi Satoh: Examination and enhancement of a ring-structured graphical search interface based on usability testing. 623-624
Vijay Krishnan: Short comings of latent models in supervised settings. 625-626
Lun-Wei Ku, Li-Ying Lee, Tung-Ho Wu, Hsin-Hsi Chen: Major topic detection and its application to opinion summarization. 627-628


Jimmy J. Lin, G. Craig Murray: Assessing the term independence assumption in blind relevance feedback. 635-636
Wei-Hao Lin, Alexander G. Hauptmann: Revisiting the effect of topic set size on retrieval error. 637-638
Bicheng Liu, David J. Harper, Stuart N. K. Watt: Information sharing through rational links and viewpoint retrieval. 639-640
João Magalhães, Stefan M. Rüger: Mining multimedia salient concepts for incremental information extraction. 641-642

Jukka Perkiö, Wray L. Buntine, Henry Tirri: A temporally adaptive content-based relevance ranking algorithm. 647-648
Himanshu Sharma, Bernard J. Jansen: Automated evaluation of search engine performance via implicit user feedback. 649-650
Renxu Sun, Hang Cui, Keya Li, Min-Yen Kan, Tat-Seng Chua: Dependency relation matching for answer selection. 651-652
Songbo Tan, Xueqi Cheng, Bin Wang, Hongbo Xu, Moustafa Ghanem, Yike Guo: Using dragpushing to refine centroid text classifiers. 653-654
Dolf Trieschnigg, Wessel Kraaij: Scalable hierarchical topic detection: exploring a sample based approach. 655-656
Goldee Udani, Shachi Dave, Anthony Davis, Tim Sibley: Noun sense induction using web search results. 657-658
Jun Wang, Marcel J. T. Reinders, Reginald L. Lagendijk, Johan A. Pouwelse: Self-organizing distributed collaborative filtering. 659-660
Ho Chung Wu, Robert W. P. Luk, Kam-Fai Wong, Kui-Lam Kwok, W. J. Li: A retrospective study of probabilistic context-based retrieval. 663-664
Baoping Zhang, Yuxin Chen, Weiguo Fan, Edward A. Fox, Marcos André Gonçalves, Marco Cristo, Pável Calado: Intelligent fusion of structural and citation-based evidence for text classification. 667-668
Ying Zhang, Fei Huang, Stephan Vogel: Mining translations of OOV terms from the web through cross-lingual query expansion. 669-670
Shuigeng Zhou, Jihong Guan: On redundancy of training corpus for text categorization: a perspective of geometry. 671-672
Demos
Pedro Cano, Markus Koppenberger, Nicolas Wack: An industrial-strength content-based music recommendation system. 673
Soumen Chakrabarti, Jeetendra Mirchandani, Arnab Nandi: SPIN: searching personal information networks. 674
J. Stephen Downie, Andreas F. Ehmann, David K. Tcheng: Music-to-knowledge (M2K): a prototyping and evaluation environment for music information retrieval research. 676
Jochen L. Leidner: A wireless natural language search engine. 677
Donald Metzler, Yaniv Bernstein, W. Bruce Croft, Alistair Moffat, Justin Zobel: The recap system for identifying information flow. 678
Dragomir R. Radev, Omer Kareem, Jahna Otterbacher: Hierarchical text summarization for WAP-enabled mobile devices. 679


Ville H. Tuulos, Jukka Perkiö, Henry Tirri: Multi-faceted information retrieval system for large scale email archives. 683