25. SIGIR 2002: Tampere, Finland
SIGIR 2002: Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, August 11-15, 2002, Tampere, Finland. ACM 2002
C. J. van Rijsbergen: Landmarks in information retrieval: the message out of the bottle. 1
Web Information Retrieval

Seung-Taek Park, David M. Pennock, C. Lee Giles, Robert Krovetz: Analysis of lexical signatures for finding lost or related documents. 11-18
Wessel Kraaij, Thijs Westerveld, Djoerd Hiemstra: The Importance of Prior Probabilities for Entry Page Search. 27-34
Information Retrieval Theory
Djoerd Hiemstra: Term-specific smoothing for the language modeling approach to information retrieval: the importance of a query term. 35-41
Rong Jin, Alexander G. Hauptmann, ChengXiang Zhai: Title language model for information retrieval. 42-48
User Studies
Ryen White, Ian Ruthven, Joemon M. Jose: Finding relevant documents using top ranking sentences: an evaluation of two alternative schemes. 57-64
Mao Chen, Andrea S. LaPaugh, Jaswinder Pal Singh: Predicting category accesses for a user in a structured information space. 65-72
David A. Smith: Detecting and browsing events in unstructured text. 73-80
Filtering
Yi Zhang, James P. Callan, Thomas P. Minka: Novelty and redundancy detection in adaptive filtering. 81-88
Kian Ming Adam Chai, Hai Leong Chieu, Hwee Tou Ng: Bayesian online classifiers for text classification and filtering. 97-104
Summarization
Massih-Reza Amini, Patrick Gallinari: The use of unlabeled data to improve supervised learning for text summarization. 105-112
Hongyuan Zha: Generic summarization and keyphrase extraction using mutual reinforcement principle and sentence clustering. 113-120
Hilda Hardy, Nobuyuki Shimizu, Tomek Strzalkowski, Ting Liu, Xinyang Zhang, G. Bowden Wise: Cross-document summarization by concept classification. 121-128
Text Categorization
Noam Slonim, Nir Friedman, Naftali Tishby: Unsupervised document classification using sequential information maximization. 129-136
Takahiko Kawatani: Topic difference factor extraction between two document sets and its application to text categorization. 137-144
Yong-Bae Lee, Sung-Hyon Myaeng: Text genre classification with genre-revealing and subject-revealing features. 145-150
Cross-language Information Retrieval
Paul McNamee, James Mayfield: Comparing cross-language query expansion techniques by degrading translation resources. 159-166
Marcello Federico, Nicola Bertoldi: Statistical cross-language information retrieval using n-best query translations. 167-174
Jianfeng Gao, Ming Zhou, Jian-Yun Nie, Hongzhao He, Weijun Chen: Resolving query translation ambiguity using a decaying co-occurrence model and syntactic dependence relations. 183-190
Clustering
Xin Liu, Yihong Gong, Wei Xu, Shenghuo Zhu: Document clustering with cluster refinement and model selection capabilities. 191-198
Paul N. Bennett, Susan T. Dumais, Eric Horvitz: Probabilistic combination of text classifiers using reliability indicators: models and results. 207-214
Efficiency
Dirk Bahle, Hugh E. Williams, Justin Zobel: Efficient phrase querying with an auxiliary index. 215-221
Falk Scholer, Hugh E. Williams, John Yiannis, Justin Zobel: Compression of inverted indexes for fast query evaluation. 222-229
Bruno Pôssas, Nivio Ziviani, Wagner Meira Jr., Berthier A. Ribeiro-Neto: Set-based model: a new approach for information retrieval. 230-237
Collaborative Filtering
John F. Canny: Collaborative filtering with privacy via factor analysis. 238-245
Rickard Cöster, Martin Svensson: Inverted file search algorithms for collaborative filtering. 246-252
Andrew I. Schein, Alexandrin Popescul, Lyle H. Ungar, David M. Pennock: Methods and metrics for cold-start recommendations. 253-260
Arabic Information Retrieval

Jinxi Xu, Alexander Fraser, Ralph M. Weischedel: Empirical studies in strategies for Arabic retrieval. 269-274
Leah S. Larkey, Lisa Ballesteros, Margaret E. Connell: Improving stemming for Arabic information retrieval: light stemming and co-occurrence analysis. 275-282
Queries
David Carmel, Eitan Farchi, Yael Petruschka, Aya Soffer: Automatic query refinement using lexical affinities with maximal information gain. 283-290
Susan T. Dumais, Michele Banko, Eric Brill, Jimmy J. Lin, Andrew Y. Ng: Web question answering: is more always better? 291-298

Kimmo Koskenniemi: Is natural language an inconvenience or an opportunity for IR? 315
Evaluation
Ellen M. Voorhees, Chris Buckley: The effect of topic set size on retrieval experiment error. 316-323
Eero Sormunen: Liberal relevance criteria of TREC: counting on negligible documents? 324-330
Multimedia
Shai Shalev-Shwartz, Shlomo Dubnov, Nir Friedman, Yoram Singer: Robust temporal and spectral modeling for query by melody. 331-338
Poster session
Fuchun Peng, Xiangji Huang, Dale Schuurmans, Nick Cercone, Stephen E. Robertson: Using self-supervised word segmentation in Chinese information retrieval. 349-350
Ben Wolin: Automatic classification in product catalogs. 351-352
Chris H. Q. Ding, Xiaofeng He, Parry Husbands, Hongyuan Zha, Horst D. Simon: PageRank, HITS and a unified framework for link analysis. 353-354
Fred Damerau, Tong Zhang, Sholom M. Weiss, Nitin Indurkhya: Experiments in high-dimensional text categorization. 357-358
Xiao-Jun Yuan, Nicholas J. Belkin, Ja-Young Kim: The relationship between ASK and relevance criteria. 359-360
Vijay Boyapati: Improving hierarchical text classification using unlabeled data. 363-364
Susan Dziadosz, Raman Chandrasekar: Do thumbnail previews help users make better relevance decisions about web search results? 365-366
Fabio Ciravegna, Alexiei Dingli, Yorick Wilks, Daniela Petrelli: Amilcare: adaptive information extraction for document annotation. 367-368
Charles L. A. Clarke, Gordon V. Cormack, M. Laszlo, Thomas R. Lynam, Egidio L. Terra: The impact of corpus size on question answering performance. 369-370
Jack G. Conrad, Changwen Yang, Joanne S. Claussen: Effective collection metasearch in a hierarchical environment: global vs. localized retrieval performance. 371-372
Fabio Crestani, Pablo de la Fuente, Jesús Vegas: Experimenting with graphical user interfaces for structured document retrieval. 373-374
Koji Eguchi, Keizo Oyama, Emi Ishida, Kazuko Kuriyama, Noriko Kando: The web retrieval task and its evaluation in the third NTCIR workshop. 375-376
Mathias Géry: Non-linear reading for a structured web indexation. 379-380
Abdur Chowdhury, M. Catherine McCabe, David A. Grossman, Ophir Frieder: Document normalization revisited. 381-382
Preben Hansen, Daniela Petrelli, Jussi Karlgren, Micheline Beaulieu, Mark Sanderson: User-centered interface design for cross-language information retrieval. 383-384
Keiichiro Hoashi, Erik Zeitler, Naomi Inoue: Implementation of relevance feedback for content-based music retrieval based on user preferences. 385-386
Christopher B. Jones, Ross Purves, Anne Ruas, Mark Sanderson, Monika Sester, Marc J. van Kreveld, Robert Weibel: Spatial information retrieval and geographical ontologies: an overview of the SPIRIT project. 387-388
Gareth J. F. Jones, Steven M. Gabb: A visualisation tool for topic tracking analysis and development. 389-390
Sang-Bum Kim, Hae-Chang Rim, Heui-Seok Lim: A new method of parameter estimation for multinomial naive bayes text classifiers. 391-392
K. L. Kwok: Higher precision for two-word queries. 395-396
Birger Larsen, Peter Ingwersen: The boomerang effect: retrieving scientific documents via the network of references and citations. 397-398
Ray R. Larson: A logistic regression approach to distributed IR. 399-400
Elizabeth D. Liddy, Eileen Allen, Sarah Harwell, Susan Corieri, Ozgur Yilmazel, Necati Ercan Ozgencil, Anne Diekema, Nancy J. McCracken, Joanne Silverstein, Stuart A. Sutton: Automatic metadata generation & evaluation. 401-402
James Mayfield, Paul McNamee: Converting on-line bilingual dictionaries from human-readable to machine-readable form. 405-406
Tadashi Nomoto, Yuji Matsumoto: Modeling (in)variability of human judgments for text summarization. 407-408
Tetsuya Sakai, Stephen E. Robertson: Relative and absolute term selection criteria: a comparative study for English and Japanese IR. 411-412
Alberto Lavelli, Bernardo Magnini, Fabrizio Sebastiani: Building thematic lexical resources by term categorization. 415-416
Rong Jin, Luo Si, Alexander G. Hauptmann, James P. Callan: Language model for IR using collection information. 419-420
Ian Soboroff: Does WT10g look like the web? 423-424

Yuen-Hsien Tseng: Error correction in a Chinese OCR test collection. 429-430
Andrew Turpin, William R. Hersh: User interface effects in past batch versus user experiments. 431-432

Thijs Westerveld: Probabilistic multimedia retrieval. 437-438
Wenfeng Yang: Chinese keyword extraction based on max-duplicated strings of the documents. 439-440
Yazhong Feng, Yueting Zhuang, Yunhe Pan: A hierarchical approach: query large music database by acoustic input. 441-442
Demo session
Ryen White, Joemon M. Jose, Ian Ruthven: A system using implicit feedback and top ranking sentences to help users find relevant web documents. 446
Wolfgang Hürst: Indexing, searching, and retrieving of recorded live presentations with the AOF (authoring on the fly) search engine. 447
Heikki Keskustalo, Turid Hedlund, Eija Airio: UTACLIR: general query translation framework for several language pairs. 448
Eero Sormunen, Sakari Hokkanen, Petteri Kangaslampi, Petri Pyy, Bemmu Sepponen: Query performance analyser: a web-based tool for IR research and instruction. 450
Fabio Ciravegna, Alexiei Dingli, Yorick Wilks, Daniela Petrelli: Adaptive information extraction for document annotation in amilcare. 451
Bethina Schmitt, Michael Christoffel, Jürgen Schneider: ExWrap: semi-automatic wrapper generation by example. 452
Anselm Spoerri: Souvenir: flexible note-taking tool to pinpoint and share media highlights. 453
Hideo Joho, Mark Sanderson, Micheline Beaulieu: Hierarchical approach to term suggestion device. 454
Fredric C. Gey, Aitao Chen, Michael K. Buckland, Ray R. Larson: Translingual vocabulary mappings for multilingual information access. 455-456
Hasan Davulcu, Saikat Mukherjee, Arvind Seth, I. V. Ramakrishnan: CuTeX: a system for extracting data from text tables. 457
Prashant Choudhari, Hasan Davulcu, Abhishek Joglekar, Akshay More, Saikat Mukherjee, Supriya Patil, I. V. Ramakrishnan: YellowPager: a tool for ontology-based mining of service directories from web sources. 458



