EMNLP 2009: Singapore
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, EMNLP 2009, 6-7 August 2009, Singapore, A meeting of SIGDAT, a Special Interest Group of the ACL. ACL 2009 ISBN 978-1-932432-59-6
Front Matter.

Koen Deschacht, Marie-Francine Moens: Semi-supervised Semantic Role Labeling Using the Latent Words Language Model. 21-29
Hai Zhao, Wenliang Chen, Chunyu Kit: Semantic Dependency Parsing of NomBank and PropBank: An Efficient Integrated Approach via a Large-scale Feature Selection. 30-39
Zhifei Li, Jason Eisner: First- and Second-Order Expectation Semirings with Applications to Minimum-Risk Training on Translation Forests. 40-51
Omar Zaidan, Chris Callison-Burch: Feasibility of Human-in-the-loop Minimum Error Rate Training. 52-61
Libin Shen, Jinxi Xu, Bing Zhang, Spyros Matsoukas, Ralph M. Weischedel: Effective Use of Linguistic and Contextual Information for Statistical Machine Translation. 72-80
Fabio Massimo Zanzotto, Lorenzo Dell'Arciprete: Efficient kernels for sentence pair classification. 91-100

Makoto Miwa, Rune Sætre, Yusuke Miyao, Jun'ichi Tsujii: A Rich Feature Vector for Protein-Protein Interaction Extraction from Multiple Corpora. 121-130
Kedar Bellare, Andrew McCallum: Generalized Expectation Criteria for Bootstrapping Extractors using Record-Text Alignment. 131-140
Siddharth Patwardhan, Ellen Riloff: A Unified Model of Phrasal and Sentential Evidence for Information Extraction. 151-160
Jingjing Liu, Stephanie Seneff: Review Sentiment Scoring via a Parse-and-Paraphrase Paradigm. 161-169
Swapna Somasundaran, Galileo Namata, Janyce Wiebe, Lise Getoor: Supervised and Unsupervised Methods in Employing Discourse Relations for Improving Opinion Polarity Classification. 170-179
Ramanathan Narayanan, Bing Liu, Alok N. Choudhary: Sentiment Analysis of Conditional Sentences. 180-189
Xavier Carreras, Michael Collins: Non-Projective Parsing for Statistical Machine Translation. 200-209
Arne Mauser, Sasa Hasan, Hermann Ney: Extending Statistical Machine Translation with Discriminative and Trigger-Based Lexicon Models. 210-218
Ulf Hermjakob: Improved Word Alignment with Statistics and Linguistic Heuristics. 229-237
Daniel Ramage, David Hall, Ramesh Nallapati, Christopher D. Manning: Labeled LDA: A supervised topic model for credit attribution in multi-labeled corpora. 248-256
Zhiyuan Liu, Peng Li, Yabin Zheng, Maosong Sun: Clustering to Find Exemplar Terms for Keyphrase Extraction. 257-266
Dmitry Davidov, Ari Rappoport: Geo-mining: Discovery of Road and Transport Networks Using Directional Patterns. 267-275
Chris Callison-Burch: Fast, Cheap, and Creative: Evaluating Translation Quality Using Amazon's Mechanical Turk. 286-295
Annie Louis, Ani Nenkova: Automatically Evaluating Content Selection in Summarization without Human Models. 306-314
Linlin Li, Caroline Sporleder: Classifier Combination for Contextual Idiom Detection Without Labelled Data. 315-323
Brian Roark, Asaf Bachrach, Carlos Cardenas, Christophe Pallier: Deriving lexical and syntactic expectation-based measures for psycholinguistic modeling via incremental top-down parsing. 324-333
Rajesh Ranganath, Daniel Jurafsky, Daniel A. McFarland: It's Not You, it's Me: Detecting Flirting and its Misperception in Speed-Dates. 334-342
Ziheng Lin, Min-Yen Kan, Hwee Tou Ng: Recognizing Implicit Discourse Relations in the Penn Discourse Treebank. 343-351
Trevor Cohn, Phil Blunsom: A Bayesian Model of Syntax-Directed Tree to String Grammar Induction. 352-361
Tong Xiao, Mu Li, Dongdong Zhang, Jingbo Zhu, Ming Zhou: Better Synchronous Binarization for Machine Translation. 362-370
Daniel Galron, Sergio Penkale, Andy Way, I. Dan Melamed: Accuracy-Based Scoring for DOT: Towards Direct Error Minimization for Data-Oriented Translation. 371-380
Yuval Marton, Chris Callison-Burch, Philip Resnik: Improved Statistical Machine Translation Using Monolingually-Derived Paraphrases. 381-390
Tadashi Nomoto: A Comparison of Model Free versus Model Intensive Approaches to Sentence Compression. 391-399
Wei Lu, Hwee Tou Ng, Wee Sun Lee: Natural Language Generation with Tree Conditional Random Fields. 400-409
Ben Hachey: Multi-Document Summarisation Using Generic Relation Extraction. 420-429

Daniel Dahlmeier, Hwee Tou Ng, Tanja Schultz: Joint Learning of Preposition Senses and Semantic Roles of Prepositional Phrases. 450-458
Mitesh M. Khapra, Sapan Shah, Piyush Kedia, Pushpak Bhattacharyya: Projecting Parameters for Multilingual Word Sense Disambiguation. 459-467
Ram Boukobza, Ari Rappoport: Multi-Word Expression Identification Using Sentence Surface Features. 468-477
Ming-Hong Bai, Jia-Ming You, Keh-Jiann Chen, Jason S. Chang: Acquiring Translation Equivalences of Multiword Expressions by Normalized Correlation Frequencies. 478-486
Zhan-yi Liu, Haifeng Wang, Hua Wu, Sheng Li: Collocation Extraction Using Monolingual Word Alignment Method. 487-495
Jianfeng Gao, Qiang Wu, Chris Burges, Krysta Marie Svore, Yi Su, Nazan Khan, Shalin Shah, Hongyan Zhou: Model Adaptation via Model Interpolation and Boosting for Web Search Ranking. 505-513
Wen-Yun Yang, Yunbo Cao, Chin-Yew Lin: A Structural Support Vector Method for Extracting Contexts and Answers of Questions from Online Forums. 514-523
Huihsin Tseng, Longbin Chen, Fan Li, Ziming Zhuang, Lei Duan, Belle L. Tseng: Mining Search Engine Clickthrough Log for Matching N-gram Features. 524-533
Javier Artiles, Enrique Amigó, Julio Gonzalo: The role of named entities in Web People Search. 534-542
Zhiheng Huang, Marcus Thint, Asli Çelikyilmaz: Investigation of Question Classifier in Question Answering. 543-550
Jun Suzuki, Hideki Isozaki, Xavier Carreras, Michael Collins: An Empirical Study of Semi-supervised Structured Conditional Models for Dependency Parsing. 551-560
Richard Johansson: Statistical Bistratal Dependency Parsing. 561-569
Wenliang Chen, Jun'ichi Kazama, Kiyotaka Uchimoto, Kentaro Torisawa: Improving Dependency Parsing with Subtrees from Auto-Parsed Data. 570-579
Sajib Dasgupta, Vincent Ng: Topic-wise, Sentiment-wise, or Otherwise? Identifying the Hidden Dimension for Unsupervised Text Classification. 580-589
Yejin Choi, Claire Cardie: Adapting a Polarity Lexicon using Integer Linear Programming for Domain-Specific Sentiment Classification. 590-598
Saif Mohammad, Cody Dunne, Bonnie J. Dorr: Generating High-Coverage Semantic Orientation Lexicons From Overtly Marked Words and a Thesaurus. 599-608
Nilesh N. Dalvi, Ravi Kumar, Bo Pang, Andrew Tomkins: Matching Reviews to Objects using a Language Model. 609-618
Brian Murphy, Marco Baroni, Massimo Poesio: EEG responds to conceptual stimuli and corpus semantics. 619-627
Justin Washtell, Katja Markert: A Comparison of Windowless and Window-Based Computational Association Measures as Predictors of Syntagmatic Human Associations. 628-637
Lin Sun, Anna Korhonen: Improving Verb Clustering with Automatically Acquired Selectional Preferences. 638-647
Yumao Lu, Fuchun Peng, Gilad Mishne, Xing Wei, Benoît Dumoulin: Improving Web Search Relevance with Semantic Features. 648-657
Jong-Hoon Oh, Kiyotaka Uchimoto, Kentaro Torisawa: Can Chinese Phonemes Improve Machine Transliteration?: A Comparative Study of English-to-Chinese Transliteration Models. 658-667
Taesun Moon, Katrin Erk, Jason Baldridge: Unsupervised morphological segmentation and clustering with document boundaries. 668-677
Jurgen Van Gael, Andreas Vlachos, Zoubin Ghahramani: The infinite HMM for unsupervised PoS tagging. 678-687
Qiuye Zhao, Mitch Marcus: A Simple Unsupervised Learner for POS Disambiguation Rules Given Only a Minimal Lexicon. 688-697
Min Zhang, Haizhou Li: Tree Kernel-based SVM with Structured Syntactic Knowledge for BTG-based Phrase Reordering. 698-707
Spyros Matsoukas, Antti-Veikko I. Rosti, Bing Zhang: Discriminative Corpus Weight Estimation for Machine Translation. 708-717

Tim Miller: Word Buffering Models for Improved Speech Repair Parsing. 737-745
Robert C. Moore, Chris Quirk: Less is More: Significance-Based N-gram Selection for Smaller, Better Language Models. 746-755
Erin Fitzgerald, Frederick Jelinek, Keith Hall: Integrating sentence- and word-level error identification for disfluency correction. 765-774
Yuval Marton, Saif Mohammad, Philip Resnik: Estimating Semantic Distance Using Soft Semantic Constraints in Knowledge-Source - Corpus Hybrid Models. 775-783
Wen-tau Yih: Learning Term-weighting Functions for Similarity Measures. 793-802
Danushka Bollegala, Yutaka Matsuo, Mitsuru Ishizuka: A Relational Model of Semantic Similarity between Words using Automatically Extracted Lexical Pattern Clusters from the Web. 803-812
Laura Rimell, Stephen Clark, Mark Steedman: Unbounded Dependency Recovery for Parser Evaluation. 813-821
David A. Smith, Jason Eisner: Parser Adaptation and Projection with Quasi-Synchronous Grammar Features. 822-831
Zhongqiang Huang, Mary P. Harper: Self-Training PCFG Grammars with Latent Annotations Across Languages. 832-841
Reut Tsarfaty, Khalil Sima'an, Remko Scha: An Alternative to Head-Driven Approaches for Parsing a (Relatively) Free Word-Order Language. 842-851
Dmitry Davidov, Ari Rappoport: Enhancement of Lexical Concepts Using Cross-lingual Web Mining. 852-861
István Varga, Shoichi Yokoyama: Bilingual dictionary generation for low-resourced language pairs. 862-870
Dani Yogatama, Kumiko Tanaka-Ishii: Multilingual Spectral Clustering Using Document Similarity Propagation. 871-879
David M. Mimno, Hanna M. Wallach, Jason Naradowsky, David A. Smith, Andrew McCallum: Polylingual Topic Models. 880-889
Casey Whitelaw, Ben Hutchinson, Grace Chung, Ged Ellis: Using the Web for Language Independent Spellchecking and Autocorrection. 890-899
Paul Kidwell, Guy Lebanon, Kevyn Collins-Thompson: Statistical Estimation of Word Acquisition with Application to Readability Prediction. 900-909
Vivi Nastase, Michael Strube: Combining Collocations, Lexical and Encyclopedic Knowledge for Metonymy Resolution. 910-918
Ichiro Yamada, Kentaro Torisawa, Jun'ichi Kazama, Kow Kuroda, Masaki Murata, Stijn De Saeger, Francis Bond, Asuka Sumida: Hypernym Discovery Based on Distributional Similarity and Hierarchical Structures. 929-937
Patrick Pantel, Eric Crestan, Arkady Borkovsky, Ana-Maria Popescu, Vishnu Vyas: Web-Scale Distributional Similarity and Entity Set Expansion. 938-947
Eduard H. Hovy, Zornitsa Kozareva, Ellen Riloff: Toward Completeness in Concept Extraction and Classification. 948-957
Jacob Eisenstein, James Clarke, Dan Goldwasser, Dan Roth: Reading to Learn: Constructing Features from Semantic Abstracts. 958-967
Guodong Zhou, Fang Kong: Global Learning of Noun Phrase Anaphoricity in Coreference Resolution via Label Propagation. 978-986
Fang Kong, Guodong Zhou, Qiaoming Zhu: Employing the Centering Theory in Pronoun Resolution from the Semantic Perspective. 987-996
Octavian Popescu: Person Cross Document Coreference with Name Perplexity Estimates. 997-1006
Yang Liu, Tian Xia, Xinyan Xiao, Qun Liu: Weighted Alignment Matrices for Statistical Machine Translation. 1017-1026
Matti Kääriäinen: Sinuhe - Statistical Machine Translation using a Globally Trained Conditional Exponential Family Translation Model. 1027-1036
Hui Zhang, Min Zhang, Haizhou Li, Chew Lim Tan: Fast Translation Rule Matching for Syntax-based Statistical Machine Translation. 1037-1045
Enrique Alfonseca, Massimiliano Ciaramita, Keith Hall: Gazpacho and summer rash: lexical relationships from temporal patterns of web search queries. 1046-1055
Roy Bar-Haim, Jonathan Berant, Ido Dagan: A Compact Forest for Scalable Inference over Entailment and Paraphrase Rules. 1056-1065
Marco Dinarelli, Alessandro Moschitti, Giuseppe Riccardi: Re-Ranking Models Based-on Small Training Data for Spoken Language Understanding. 1076-1085
Anlei Dong, Yi Chang, Shihao Ji, Ciya Liao, Xin Li, Zhaohui Zheng: Empirical Exploitation of Click Data for Task Specific Ranking. 1086-1095


Andrew M. Finch, Eiichiro Sumita: Bidirectional Phrase-based Statistical Machine Translation. 1124-1132
Matthew Frampton, Jia Huang, Trung H. Bui, Stanley Peters: Real-time decision detection in multi-party dialogue. 1133-1141
Aria Haghighi, Dan Klein: Simple Coreference Resolution with Rich Syntactic and Semantic Features. 1152-1161
Tadayoshi Hara, Yusuke Miyao, Jun'ichi Tsujii: Descriptive and Empirical Approaches to Capturing Underlying Dependencies among Parsing Errors. 1162-1171
Chikara Hashimoto, Kentaro Torisawa, Kow Kuroda, Stijn De Saeger, Masaki Murata, Jun'ichi Kazama: Large-Scale Verb Entailment Acquisition from the Web. 1172-1181
Hany Hassan, Khalil Sima'an, Andy Way: A Syntactified Direct Translation Model with Linear-time Decoding. 1182-1191
Samer Hassan, Rada Mihalcea: Cross-lingual Semantic Relatedness Using Encyclopedic Knowledge. 1192-1201
Xiaodong He, Kristina Toutanova: Joint Optimization for Machine Translation System Combination. 1202-1211
Liang Huang, Wenbin Jiang, Qun Liu: Bilingually-Constrained (Monolingual) Shift-Reduce Parsing. 1222-1231
Zhiheng Huang, Guangping Zeng, Weiqun Xu, Asli Çelikyilmaz: Accurate Semantic Class Classifier for Coreference Resolution. 1232-1240
Minwoo Jeong, Chin-Yew Lin, Gary Geunbae Lee: Semi-supervised Speech Act Recognition in Emails and Forums. 1250-1259
Lun-Wei Ku, Ting-Hao Huang, Hsin-Hsi Chen: Using Morphological and Syntactic Structures for Chinese Opinion Analysis. 1260-1269
Gerasimos Lampouras, Ion Androutsopoulos: Finding Short Definitions of Terms on Web Pages. 1270-1279
Junhui Li, Guodong Zhou, Hai Zhao, Qiaoming Zhu, Peide Qian: Improving Nominal SRL in Chinese Language with Verbal SRL Information and Automatic Predicate Recognition. 1280-1288
Xiao Li: On the Use of Virtual Evidence in Conditional Random Fields. 1289-1297
Xiaojun Lin, Yang Fan, Meng Zhang, Xihong Wu, Huisheng Chi: Refining Grammars for Parsing with Hierarchical Semantic Knowledge. 1298-1307
Olena Medelyan, Eibe Frank, Ian H. Witten: Human-competitive tagging using automatic keyphrase extraction. 1318-1327
Yusuke Miyao, Jun'ichi Tsujii: Supervised Learning of a Probabilistic Lexicon of Verb Semantic Classes. 1328-1337
Christof Müller, Iryna Gurevych: A Study on the Semantic Relatedness of Query and Document Terms in Information Retrieval. 1338-1347
Preslav Nakov, Hwee Tou Ng: Improved Statistical Machine Translation for Resource-Poor Languages Using Related Resource-Rich Languages. 1358-1367
Truc-Vien T. Nguyen, Alessandro Moschitti, Giuseppe Riccardi: Convolution Kernels on Constituent, Dependency and Sequential Structures for Relation Extraction. 1378-1387
Ekaterina Ovchinnikova, Theodore Alexandrov, Tonio Wandmacher: Automatic Acquisition of the. 1388-1397
Arzucan Özgür, Dragomir R. Radev: Detecting Speculations and their Scopes in Scientific Text. 1398-1407
Michael J. Paul, Roxana Girju: Cross-Cultural Analysis of Blogs and Forums with Mixed-Collection Topic Models. 1408-1417
Adam Pauls, John DeNero, Dan Klein: Consensus Training for Consensus Decoding in Machine Translation. 1418-1427
Emily Pitler, Ken Ward Church: Using Word-Sense Disambiguation Methods to Classify Web Queries by Intent. 1428-1436
Longhua Qian, Guodong Zhou, Fang Kong, Qiaoming Zhu: Semi-Supervised Learning for Semantic Relation Classification using Stratified Sampling Strategy. 1437-1445
Changqin Quan, Fuji Ren: Construction of a Blog Emotion Corpus for Chinese Emotional Expression Analysis. 1446-1454
Ryohei Sasano, Sadao Kurohashi: A Probabilistic Model for Associative Anaphora Resolution. 1455-1464
Prakash Srinivasan, Alexander Yates: Quantifier Scope Disambiguation Using Extracted Pragmatic Knowledge: Preliminary Results. 1465-1474
Weiwei Sun, Zhifang Sui, Meng Wang, Xin Wang: Chinese Semantic Role Labeling with Shallow Parsing. 1475-1483
Hisami Suzuki, Xiao Li, Jianfeng Gao: Discovery of Term Variation in Japanese Web Search Queries. 1484-1492
Simone Teufel, Advaith Siddharthan, Colin R. Batchelor: Towards Domain-Independent Argumentative Zoning: Evidence from Chemistry and Computational Linguistics. 1493-1502
Richard C. Wang, William W. Cohen: Character-level Analysis of Semi-Structured Documents for Set Expansion. 1503-1512
Xinglong Wang, Jun'ichi Tsujii, Sophia Ananiadou: Classifying Relations for Biomedical Named Entity Disambiguation. 1513-1522
Dan Wu, Wee Sun Lee, Nan Ye, Hai Leong Chieu: Domain adaptive bootstrapping for named entity recognition. 1523-1532
Yuanbin Wu, Qi Zhang, Xuanjing Huang, Lide Wu: Phrase Dependency Parsing for Opinion Mining. 1533-1541
Naoki Yoshinaga, Masaru Kitsuregawa: Polynomial to Linear: Efficient Classification with Conjunctive Features. 1542-1551





