23. COLING 2010: Beijing, China
Chu-Ren Huang, Dan Jurafsky (Eds.): COLING 2010, 23rd International Conference on Computational Linguistics, Proceedings of the Conference, 23-27 August 2010, Beijing, China. Tsinghua University Press 2010
Front Matter.
Hassan Al-Haj, Shuly Wintner: Identifying Multi-word Expressions by Leveraging Morphological and Syntactic Idiosyncrasy. 10-18
Daniel Andrade, Tetsuya Nasukawa, Jun'ichi Tsujii: Robust Measurement and Comparison of Context Similarity for Finding Translation Pairs. 19-27
Carmen Banea, Rada Mihalcea, Janyce Wiebe: Multilingual Subjectivity: Are More Languages Better? 28-36
Alberto Barrón-Cedeño, Paolo Rosso, Eneko Agirre, Gorka Labaka: Plagiarism Detection across Distant Language Pairs. 37-45
Núria Bel, Maria Coll, Gabriela Resnik: Automatic Detection of Non-deverbal Event Nouns for Quick Lexicon Production. 46-52
Adrian Bickerstaffe, Ingrid Zukerman: A Hierarchical Classifier Applied to Multi-way Sentiment Detection. 62-70
Graeme W. Blackwood, Adrià de Gispert, William Byrne: Fluency Constraints for Minimum Bayes-Risk Decoding of Statistical Machine Translation Lattices. 71-79
André Blessing, Hinrich Schütze: Self-Annotation for fine-grained geospatial relation extraction. 80-88
Bernd Bohnet: Top Accuracy and Fast Dependency Parsing is not a Contradiction. 89-97
Bernd Bohnet, Leo Wanner, Simon Mille, Alicia Burga: Broad Coverage Multilingual Deep Sentence Generation with a Stochastic Multi-Level Realizer. 98-106
Bernard Brosseau-Villeneuve, Jian-Yun Nie, Noriko Kando: Towards an optimal weighting of context words based on distance. 107-115
Razvan C. Bunescu, Yunfeng Huang: A Utility-Driven Approach to Question Ranking in Social QA. 125-133
Xiaoyan Cai, Wenjie Li, You Ouyang, Hong Yan: Simultaneous Ranking and Clustering of Sentences: A Reinforcement Approach to Multi-Document Summarization. 134-142
Chien Chin Chen, Chen-Yuan Wu: Bipolar Person Name Identification of Topic Documents Using Principal Component Analysis. 170-178
Ying Chen, Sophia Yat Mei Lee, Shoushan Li, Chu-Ren Huang: Emotion Cause Detection with Linguistic Constructions. 179-187
Bin Chen, Jian Su, Chew Lim Tan: A Twin-Candidate Based Approach for Event Pronoun Resolution using Composite Kernel. 188-196
Sung-Pil Choi, Sung-Hyon Myaeng: Simplicity is Better: Revisiting Single Kernel PPI Extraction. 206-214
Nigel Collier, Reiko Matsuda Goodwin, John McCrae, Son Doan, Ai Kawazoe, Mike Conway, Asanee Kawtrakul, Koichi Takeuchi, Dinh Dien: An ontology-driven system for detecting global health events. 215-222
Bart Cramer, Yi Zhang: Constraining robust constructions for broad-coverage parsing with precision grammars. 223-231
Josep Maria Crego, Aurélien Max, François Yvon: Local lexical adaptation in Machine Translation through triangulation: SMT helping SMT. 232-240
Pascal Denis, Philippe Muller: Comparison of different algebras for inducing the temporal structure of texts. 250-258
Markus Dickinson: Generating Learner-Like Morphological Errors in Russian. 259-267
Mark Dredze, Paul McNamee, Delip Rao, Adam Gerber, Tim Finin: Entity Disambiguation for Knowledge Base Population. 277-285
Yajuan Duan, Long Jiang, Tao Qin, Ming Zhou, Heung-Yeung Shum: An Empirical Study on Learning to Rank of Tweets. 295-303
Nan Duan, Mu Li, Dongdong Zhang, Ming Zhou: Mixture Model-based Minimum Bayes Risk Decoding using Multiple Machine Translation Systems. 313-321
Katja Filippova: Multi-Sentence Compression: Finding Shortest Paths in Word Graphs. 322-330
Sanae Fujita, Masaaki Nagata: Enriching Dictionaries with Images from the Internet - Targeting Wikipedia and a Japanese Semantic Lexicon: Lexeed -. 331-339
Kavita Ganesan, ChengXiang Zhai, Jiawei Han: Opinosis: A Graph Based Approach to Abstractive Summarization of Highly Redundant Opinions. 340-348
Qin Gao, Francisco Guzmán, Stephan Vogel: EMDC: A Semi-supervised Approach for Word Alignment. 349-357
Jianfeng Gao, Xiaolong Li, Daniel Micol, Chris Quirk, Xu Sun: A Large Scale Ranker-Based System for Search Query Spelling Correction. 358-366
Dmitriy Genzel: Automatically Learning Source-side Reordering Rules for Large Scale Machine Translation. 376-384
Ryan Georgi, Fei Xia, William D. Lewis: Comparing Language Similarity across Genetic and Typologically-Based Groupings. 385-393
Spence Green, Christopher D. Manning: Better Arabic Parsing: Baselines, Evaluations, and Analysis. 394-402
Gintare Grigonyte, João Cordeiro, Gaël Dias, Rumen Moraliyski, Pavel Brazdil: Paraphrase Alignment for Synonym Evidence Discovery. 403-411
Sheng Guo, Naren Ramakrishnan: Finding the Storyteller: Automatic Spoiler Tagging using Linguistic Cues. 412-420
Yaakov HaCohen-Kerner, Aharon Tayeb, Natan Ben-Dror: Detection of Simple Plagiarism in Computer Science Papers. 421-429
Matthias Hartung, Anette Frank: A Structured Vector Space Model for Hidden Attribute Meaning in Adjective-Noun Phrases. 430-438
Katsuhiko Hayashi, Hajime Tsukada, Katsuhito Sudoh, Kevin Duh, Seiichi Yamamoto: Hierarchical Phrase-based Machine Translation with Word-based Reordering Model. 439-446
Yanqing He, Yu Zhou, Chengqing Zong, Huilin Wang: A Novel Reordering Model Based on Multi-layer Phrase for Statistical Machine Translation. 447-455
Verena Henrich, Erhard W. Hinrichs: Standardizing Wordnets in the ISO Standard LMF: Wordnet-LMF for GermaNet. 456-464
Julia Hockenmaier, Yonatan Bisk: Normal-form parsing for Combinatory Categorial Grammars with generalized composition and type-raising. 465-473
Gum-Won Hong, Chi-Ho Li, Ming Zhou, Hae-Chang Rim: An Empirical Study on Web Mining of Parallel Data. 474-482
Jian Huang, Pucktada Treeratpituk, Sarah M. Taylor, C. Lee Giles: Enhancing Cross Document Coreference of Web Documents with Context Similarity and Very Large Scale Text Categorization. 483-491
Mark Johnson, Katherine Demuth: Unsupervised phonemic Chinese word segmentation using Adaptor Grammars. 528-536
Laura Kallmeyer, Wolfgang Maier: Data-Driven Parsing with Probabilistic Linear Context-Free Rewriting Systems. 537-545
Rohit J. Kate, Xiaoqiang Luo, Siddharth Patwardhan, Martin Franz, Radu Florian, Raymond J. Mooney, Salim Roukos, Chris Welty: Learning to Predict Readability using Diverse Linguistic Features. 546-554
Mitesh M. Khapra, Saurabh Sohoney, Anup Kulkarni, Pushpak Bhattacharyya: Value for Money: Balancing Annotation Effort, Lexicon Building and Accuracy for Multilingual WSD. 555-563
Seokhwan Kim, Minwoo Jeong, Jonghoon Lee, Gary Geunbae Lee: A Cross-lingual Annotation Projection Approach for Relation Detection. 564-571
Su Nam Kim, Timothy Baldwin, Min-Yen Kan: Evaluating N-gram based Evaluation Metrics for Automatic Keyphrase Extraction. 572-580
Doo Soon Kim, Ken Barker, Bruce W. Porter: Improving the Quality of Text Understanding by Delaying Ambiguity Resolution. 581-589
Petr Knoth, Jakub Novotny, Zdenek Zdráhal: Automatic generation of inter-passage links based on semantic similarity. 590-598
Fang Kong, Guodong Zhou, Longhua Qian, Qiaoming Zhu: Dependency-driven Anaphoricity Determination for Coreference Resolution. 599-607
Roland Kuhn, Boxing Chen, George F. Foster, Evan Stratford: Phrase Clustering for Smoothing TM Probabilities - or, How to Extract Paraphrases from Phrase Tables. 608-616
Audrey Laroche, Philippe Langlais: Revisiting Context-based Projection Methods for Term-Translation Spotting in Comparable Corpora. 617-625
Young-Suk Lee, Bing Zhao, Xiaoqiang Luo: Constituent Reordering and Syntax Models for English-to-Japanese Statistical Machine Translation. 626-634
Shoushan Li, Sophia Yat Mei Lee, Ying Chen, Chu-Ren Huang, Guodong Zhou: Sentiment Classification and Polarity Shifting. 635-643
Bo Li, Éric Gaussier: Improving Corpus Comparability for Bilingual Lexicon Extraction from Comparable Corpora. 644-652
Fangtao Li, Chao Han, Minlie Huang, Xiaoyan Zhu, Yingju Xia, Shu Zhang, Hao Yu: Structure-Aware Review Mining and Summarization. 653-661
Mu Li, Yinggong Zhao, Dongdong Zhang, Ming Zhou: Adaptive Development Data Selection for Log-linear Model in Statistical Machine Translation. 662-670
Junhui Li, Guodong Zhou, Hongling Wang, Qiaoming Zhu: Learning the Scope of Negation via Shallow Semantic Parsing. 671-679
Thomas Lippincott, Diarmuid Ó Séaghdha, Lin Sun, Anna Korhonen: Exploring variation across biomedical subdomains. 689-697
Xiaohua Liu, Kuan Li, Bo Han, Ming Zhou, Long Jiang, Zhongyang Xiong, Changning Huang: Semantic Role Labeling for News Tweets. 698-706
Hector Llorens, Estela Saquete, Borja Navarro-Colorado: TimeML Events Recognition and Classification: Learning CRF Models with Semantic Roles. 725-733
Yue Lu, Huizhong Duan, Hongning Wang, ChengXiang Zhai: Exploiting Structured Ontology to Organize Scattered Online Opinions. 734-742
Minh-Thang Luong, Min-Yen Kan: Enhancing Morphological Alignment for Translating Highly Inflected Languages. 743-751
Erwin Marsi, Emiel Krahmer: Automatic analysis of semantic similarity in comparable text through syntactic tree matching. 752-760
Toyomi Meguro, Ryuichiro Higashinaka, Yasuhiro Minami, Kohji Dohsaka: Controlling Listening-oriented Dialogue using Partially Observable Markov Decision Processes. 761-769
Shachar Mirkin, Jonathan Berant, Ido Dagan, Eyal Shnarch: Recognising Entailment within Discourse. 770-778
Makoto Miwa, Sampo Pyysalo, Tadayoshi Hara, Jun'ichi Tsujii: Evaluating Dependency Representations for Event Extraction. 779-787
Makoto Miwa, Rune Sætre, Yusuke Miyao, Jun'ichi Tsujii: Entity-Focused Sentence Simplification for Relation Extraction. 788-796
Smruthi Mukund, Debanjan Ghosh, Rohini K. Srihari: Using Cross-Lingual Projections to Generate Semantic Role Labeled Annotated Corpus for Urdu - A Resource Poor Language. 797-805
Alena Neviarouskaya, Helmut Prendinger, Mitsuru Ishizuka: Recognition of Affect, Judgment, and Appreciation in Text. 806-814
ThuyLinh Nguyen, Stephan Vogel, Noah A. Smith: Nonparametric Word Segmentation for Machine Translation. 815-823
Joakim Nivre, Laura Rimell, Ryan T. McDonald, Carlos Gómez-Rodríguez: Evaluation of Dependency Parsers on Unbounded Dependencies. 833-841
Jong-Hoon Oh, Ichiro Yamada, Kentaro Torisawa, Stijn De Saeger: Co-STAR: A Co-training Style Algorithm for Hyponymy Relation Acquisition from Structured and Unstructured Text. 842-850
Naoaki Okazaki, Jun'ichi Tsujii: Simple and Efficient Algorithm for Approximate Dictionary Matching. 851-859
Derya Ozkan, Kenji Sagae, Louis-Philippe Morency: Latent Mixture of Discriminative Experts for Multimodal Prediction Modeling. 860-868
Makbule Gulcin Ozsoy, Ilyas Cicekli, Ferda Nur Alpaslan: Text Summarization of Turkish Texts using Latent Semantic Analysis. 869-876
Emily Pitler, Shane Bergsma, Dekang Lin, Kenneth Ward Church: Using Web-scale N-grams to Improve Base NP Parsing Performance. 886-894
Vahed Qazvinian, Dragomir R. Radev, Arzucan Özgür: Citation Summarization Through Keyphrase Extraction. 895-903
Lizhen Qu, Georgiana Ifrim, Gerhard Weikum: The Bag-of-Opinions Method for Review Rating Prediction from Sparse Text Patterns. 913-921
Md. Altaf ur Rahman, Vincent Ng: Inducing Fine-Grained Semantic Classes via Hierarchical and Collective Classification. 931-939
Sujith Ravi, Ashish Vaswani, Kevin Knight, David Chiang: Fast, Greedy Model Minimization for Unsupervised Tagging. 940-948
Michael Roth, Anette Frank: Computing EM-based Alignments of Routes and Route Directions as a Basis for Natural Language Generation. 958-966
Ksenia Shalonova, Bruno Golénia: Weakly Supervised Morphology Learning for Agglutinating Languages Using Small Training Sets. 976-983
Shuming Shi, Huibin Zhang, Xiaojie Yuan, Ji-Rong Wen: Corpus-based Semantic Class Mining: Distributional vs. Pattern-Based Approaches. 993-1001
Ekaterina Shutova, Lin Sun, Anna Korhonen: Metaphor Identification Using Verb and Noun Clustering. 1002-1010
Xiance Si, Zhiyuan Liu, Maosong Sun: Explore the Structure of Social Tags by Subsumption Relations. 1011-1019
Sebastian Spiegler, Andrew van der Spuy, Peter A. Flach: Ukwabelana - An open-source morphological Zulu corpus. 1020-1028
Sebastian Spiegler, Christian Monson: EMMA: A novel Evaluation Metric for Morphological Analysis. 1029-1037
Tomek Strzalkowski, George Aaron Broadwell, Jennifer Stromer-Galley, Samira Shaikh, Sarah M. Taylor, Nick Webb: Modeling Socio-Cultural Phenomena in Discourse. 1038-1046
Jun Sun, Min Zhang, Chew Lim Tan: Discriminative Induction of Sub-Tree Alignment using Limited Labeled Data. 1047-1055
Lin Sun, Thierry Poibeau, Anna Korhonen, Cédric Messiant: Investigating the cross-linguistic potential of VerbNet-style classification. 1056-1064
Anders Søgaard, Christian Rishøj: Semi-supervised dependency parsing using generalized tri-training. 1065-1073
George Tsatsaronis, Iraklis Varlamis, Kjetil Nørvåg: SemanticRank: Ranking Keywords and Sentences Using Semantic Graphs. 1074-1082
Daniel Tse, James R. Curran: Chinese CCGbank: extracting CCG derivations from the Penn Chinese Treebank. 1083-1091
Zhaopeng Tu, Yang Liu, Young-Sook Hwang, Qun Liu, Shouxun Lin: Dependency Forest for Statistical Machine Translation. 1092-1100
Jakob Uszkoreit, Jay Ponte, Ashok C. Popat, Moshe Dubiner: Large Scale Parallel Document Mining for Machine Translation. 1101-1109
Karthik Visweswariah, Jiri Navratil, Jeffrey S. Sorensen, Vijil Chenthamarakshan, Nandakishore Kambhatla: Syntax Based Reordering with Automatically Derived Rules for Improved Statistical Machine Translation. 1119-1127
Henning Wachsmuth, Peter Prettenhofer, Benno Stein: Efficient Statement Identification for Automatic Market Forecasting. 1128-1136
Xiaojun Wan: Towards a Unified Approach to Simultaneous Single-Document and Multi-Document Summarizations. 1137-1145
William Yang Wang, Kathleen McKeown: "Got You!": Automatic Vandalism Detection in Wikipedia with Web-based Shallow Syntactic-Semantic Modeling. 1146-1154
Kai Wang, Tat-Seng Chua: Exploiting Salient Patterns for Question Detection and Question Retrieval in Community-based Question Answering. 1155-1163
Mengqiu Wang, Christopher D. Manning: Probabilistic Tree-Edit Models with Structured Latent Variables for Textual Entailment and Question Answering. 1164-1172
Kun Wang, Chengqing Zong, Keh-Yih Su: A Character-Based Joint Model for Chinese Word Segmentation. 1173-1181
Xinyan Xiao, Yang Liu, Young-Sook Hwang, Qun Liu, Shouxun Lin: Joint Tokenization and Translation. 1200-1208
Ge Xu, Xinfan Meng, Houfeng Wang: Build Chinese Emotion Lexicons Using A Graph-based Algorithm and Multiple Resources. 1209-1217
Hui Yang, Anne N. De Roeck, Alistair Willis, Bashar Nuseibeh: A Methodology for Automatic Identification of Nocuous Ambiguity. 1218-1226
Mei Yang, Katrin Kirchhoff: Contextual Modeling for Meeting Translation Using Unsupervised Word Sense Disambiguation. 1227-1235
Yao Yao, Feng-hsi Liu: A Working Report on Statistically Modeling Dative Variation in Mandarin Chinese. 1236-1244
Naoki Yoshinaga, Masaru Kitsuregawa: Kernel Slicing: Scalable Online Training with Conjunctive Features. 1245-1253
Liang-Chih Yu, Hsiu-Min Shih, Yu-Ling Lai, Jui-Feng Yeh, Chung-Hsien Wu: Discriminative Training for Near-Synonym Substitution. 1254-1262
Fabio Massimo Zanzotto, Ioannis Korkontzelos, Francesca Fallucchi, Suresh Manandhar: Estimating Linear Models for Compositional Distributional Semantics. 1263-1271
Zhongwu Zhai, Bing Liu, Hua Xu, Peifa Jia: Grouping Product Features Using Semi-Supervised Learning with Soft-Constraints. 1272-1280
Wei Zhang, Jian Su, Chew Lim Tan, WenTing Wang: Entity Linking Leveraging Automatically Generated Annotation. 1290-1298
Shiqi Zhao, Haifeng Wang, Xiang Lan, Ting Liu: Leveraging Multiple MT Engines for Paraphrase Generation. 1326-1334
Yiping Zhou, Lan Nie, Omid Rouhani-Kalleh, Flavian Vasile, Scott Gaffney: Resolving Surface Forms to Wikipedia Topics. 1335-1343
Zhemin Zhu, Delphine Bernhard, Iryna Gurevych: A Monolingual Tree-based Translation Model for Sentence Simplification. 1353-1361
Tao Zhuang, Chengqing Zong: A Minimum Error Weighting Combination Strategy for Chinese Semantic Role Labeling. 1362-1370
Simon Zwarts, Mark Johnson, Robert Dale: Detecting Speech Repairs Incrementally Using a Noisy Channel Approach. 1371-1378
Lilja Øvrelid, Erik Velldal, Stephan Oepen: Syntactic Scope Resolution in Uncertainty Analysis. 1379-1387