LREC 2000:
Athens, Greece
Proceedings of the Second International Conference on Language Resources and Evaluation, LREC 2000, 31 May - June 2, 2000, Athens, Greece.
European Language Resources Association 2000, ISBN 2-9517408-6-7
- Gérard Bailly, Eduardo Rodríguez Banga, Alex I. C. Monaghan, Erhard Rank:
The Cost258 Signal Generation Test Array.

- Kosho Shudo, Masahito Takahashi, Yasuo Koyama, Kenji Yoshimura:
Collocations as Word Co-occurrence Restriction Data - An Application to Japanese Word Processor.

- Amit Bagga:
Enhancing the TDT Tracking Evaluation.

- Amalia Arvaniti, Mary Baltazani:
GREEK ToBI: A System for the Annotation of Greek Speech Corpora.

- Adam Kilgarriff, Joseph Rosenzweig:
English Senseval: Report and Results.

- Asunción Moreno, Robrecht Comeyne, Keith Haslam, Henk van den Heuvel, Harald Höge, Sabine Horbach, Giorgio Micca:
SALA: SpeechDat across Latin America. Results of the First Phase.

- Dan Tufis:
Using a Large Set of EAGLES-compliant Morpho-syntactic Descriptors as a Tagset for Probabilistic Tagging.

- Elliott Macklovitch, Michel Simard, Philippe Langlais:
TransSearch: A Free Translation Memory on the World Wide Web.

- Bolette Sandford Pedersen, Sanni Nimb:
Semantic Encoding of Danish Verbs in SIMPLE - Adapting a Verb Framed Model to a Satellite-framed Language.

- Hajime Mochizuki, Manabu Okumura:
A Comparison of Summarization Methods Based on Task-based Evaluation.

- Zheng Jie, Mao Yuhang:
A Word Sense Disambiguation Method Using Bilingual Corpus.

- Stavroula-Evita Fotinea, Ioannis Dologlou, Stylianos Bakamidis, Gregory Stainhaouer, George Carayannis:
Perceptual Evaluation of a New Subband Low Bit Rate Speech Compression System based on Waveform Vector Quantization and SVD Postfiltering.

- Sandro Pedrazzini, Elisabeth Maier, Dierk König:
Terms Specification and Extraction within a Linguistic-based Intranet Service.

- Eva Hajicová, Petr Sgall:
Semantico-syntactic Tagging of Very Large Corpora: the Case of Restoration of Nodes on the Underlying Level.

- Eva Hajicová, Jarmila Panevová, Petr Sgall:
Coreference in Annotating a Large Corpus.

- Peter Bennison, Lynne Bowker:
Designing a Tool for Exploiting Bilingual Comparable Corpora.

- Diana Maynard, Sophia Ananiadou:
Creating and Using Domain-specific Ontologies for Terminological Applications.

- Ellen M. Voorhees, Dawn M. Tice:
The TREC-8 Question Answering Track.

- Satoshi Sekine, Hitoshi Isahara:
IREX: IR & IE Evaluation Project in Japanese.

- Svetlana Sheremetyeva, Sergei Nirenburg:
Towards A Universal Tool For NLP Resource Acquisition.

- Junfeng Hu, Shiwen Yu:
The Multi-layer Language Knowledge Base of Chinese NLP.

- Yasmina Abbas, Marie-Luce Picard:
With WORLDTREK Family, Create, Update and Browse your Terminological World.

- Noureddine Chenfour, A. Benabbou, A. Mouradi:
Etude et Evaluation de la Di-Syllabe comme Unité Acoustique pour le Système de Synthèse Arabe PARADIS.

- Marcela Charfuelan, José Relaño-Gil, María del Carmen Rodríguez Gancedo, Daniel Tapias Merino, Luis A. Hernández Gómez:
Dialogue Annotation for Language Systems Evaluation.

- Philippe Langlais, Sébastien Sauvé, George F. Foster, Elliott Macklovitch, Guy Lapalme:
Evaluation of TRANSTYPE, a Computer-aided Translation Typing System: A Comparison of a Theoretical- and a User-oriented Evaluation Procedures.

- Gerardo Sierra, John McNaught:
Extraction of Semantic Clusters for Terminological Information Retrieval from MRDs.

- Jean-Yves Antoine, Jacques Siroux, Jean Caelen, Jeanne Villaneau, Jérôme Goulian, Mohamed Ahafhaf:
Obtaining Predictive Results with an Objective Evaluation of Spoken Dialogue Systems: Experiments with the DCR Assessment Paradigm.

- Guy Perennou, Martine de Calmès:
MHATLex: Lexical Resources for Modelling the French Pronunciation.

- Carine-Alexia Lavelle, Martine de Calmès, Guy Perennou:
Dialogue and Prompting Strategies Evaluation in the DEMON System.

- Henk van den Heuvel, Lou Boves, Khalid Choukri, Simo M. A. Goddijn, Eric Sanders:
SLR Validation: Present State of Affairs and Prospects.

- Thierry Dutoit, Michel Bagein, Fabrice Malfrère, Vincent Pagel, Alain Ruelle, Nawfal Tounsi, Dominique Wynsberghe:
EULER: an Open, Generic, Multilingual and Multi-platform Text-to-Speech System.

- Marc Swerts, Emiel Krahmer:
On the Use of Prosody for On-line Evaluation of Spoken Dialogue Systems.

- Itziar Aduriz, Eneko Agirre, Izaskun Aldezabal, Xabier Arregi, Jose Maria Arriola, Xabier Artola, Koldo Gojenola, A. Maritxalar, Kepa Sarasola, Miriam Urkia:
A Word-level Morphosyntactic Analyzer for Basque.

- Albert Russel, Hennie Brugman, Daan Broeder, Peter Wittenburg:
The EUDICO Project, Multi Media Annotation over the Internet.

- Anna Braasch, Sussi Olsen:
Towards a Strategy for a Representation of Collocations - Extending the Danish PAROLE-lexicon.

- Stavroula-Evita Fotinea, Athanassios Protopapas, Dimitris Dimitriadis, George Carayannis:
Perceptual Evaluation of Text-to-Speech Implementation of Enclitic Stress in Greek.

- Tami Rannon, Ofra Golani, Anat Goren, Sherrie Shammass, Ami Moyal:
Creation of Spoken Hebrew Databases.

- Damjan Vlaj, Janez Kaiser, Ralph Wilhelm, Ute Ziegenhain:
PLEDIT - A New Efficient Tool for Management of Multilingual Pronunciation Lexica and Batchlists.

- Rosa Estopà, Jordi Vivaldi, M. Teresa Cabré:
Use of Greek and Latin Forms for Term Detection.

- Maria Canelli, Daniele Grasso, Margaret King:
Methods and Metrics for the Evaluation of Dictation Systems: a Case Study.

- Noah A. Smith, Michael E. Jahr:
Cairo: An Alignment Visualization Tool.

- Andreas Mengel, Wolfgang Lezius:
An XML-based Representation Format for Syntactically Annotated Corpora.

- Ornella Corazzari, Nicoletta Calzolari, Antonio Zampolli:
An Experiment of Lexical-Semantic Tagging of an Italian Corpus.

- Núria Bel, Federica Busa, Nicoletta Calzolari, Elisabetta Gola, Alessandro Lenci, Monica Monachini, Antoine Ogonowski, Ivonne Peters, Wim Peters, Nilda Ruimy, Marta Villegas, Antonio Zampolli:
SIMPLE: A General Framework for the Development of Multilingual Lexicons.

- Zygmunt Vetulani:
Electronic Language Resources for Polish: POLEX, CEGLEX and GRAMLEX.

- Rainer Siemund, Harald Höge, Siegfried Kunzmann, Krzysztof Marasek:
SPEECON - Speech Data for Consumer Devices.

- Antonio Moreno, Ralph Grishman, Susana López, Fernando Sánchez, Satoshi Sekine:
A Treebank of Spanish and its Application to Parsing.

- Susanne Jekat, Lorenzo Tessiore:
End-to-End Evaluation of Machine Interpretation Systems: A Graphical Evaluation Tool.

- Xabier Artola, Arantza Díaz de Ilarraza Sánchez, Nerea Ezeiza, Koldo Gojenola, A. Maritxalar, Aitor Soroa:
A Proposal for the Integration of NLP Tools using SGML-Tagged Documents.

- Thierry Fontenelle:
A Bilingual Electronic Dictionary for Frame Semantics.

- Martin Braschler, Donna Harman, Michael Hess, Michael Kluck, Carol Peters, Peter Schäuble:
The Evaluation of Systems for Cross-language Information Retrieval.

- José Bettencourt Gonçalves, Rita Veloso:
Spoken Portuguese: Geographic and Social Varieties.

- Maria Fernanda Bacelar do Nascimento, Luísa Pereira, João Saramago:
Portuguese Corpora at CLUL.

- Antonio Moreno, Chantal Pérez:
Reusing the Mikrokosmos Ontology for Concept-based Multilingual Terminology Databases.

- Kazuhiro Kimura, Hideki Hirakawa:
Abstraction of the EDR Concept Classification and its Effectiveness in Word Sense Disambiguation.

- Alessandro Cucchiarelli, Enrico Faggioli, Paola Velardi:
Will Very Large Corpora Play For Semantic Disambiguation The Role That Massive Computing Power Is Playing For Other AI-Hard Problems?

- Shuichi Itahashi:
Guidelines for Japanese Speech Synthesizer Evaluation.

- Masumi Narita:
Constructing a Tagged E-J Parallel Corpus for Assisting Japanese Software Engineers in Writing English Abstracts.

- Hiroyuki Shinnou, Masanori Ikeya:
Extraction of Unknown Words Using the Probability of Accepting the Kanji Character Sequence as One Word.

- Rosen Ivanov:
Automatic Speech Segmentation in High Noise Condition.

- Elisa Gavieiro-Villatte, Laurent Spaggiari:
Open Ended Computerized Overview of Controlled Languages.

- Rodolfo Delmonte:
Shallow Parsing and Functional Structure in Italian Corpora.

- Dimitrios Kokkinakis, Maria Toporowska Gronostaj, Karin Warmenius:
Annotating, Disambiguating & Automatically Extending the Coverage of the Swedish SIMPLE Lexicon.

- Diana Santos, Eckhard Bick:
Providing Internet Access to Portuguese Corpora: the AC/DC Project.

- Sharon Inkelas, Aylin Küntay, C. Orhan Orgun, Ronald Sprouse:
Turkish Electronic Living Lexicon (TELL): A Lexical Database.

- Wim Goedertier, Simo M. A. Goddijn, Jean-Pierre Martens:
Orthographic Transcription of the Spoken Dutch Corpus.

- Giulia Bernardis, Hervé Bourlard, Martin Rajman, Jean-Cédric Chappelier:
Development of Acoustic and Linguistic Resources for Research and Evaluation in Interactive Vocal Information Servers.

- Guillermo Rojo, Maria Concepción Álvarez, Pilar Alvariño, Adelaida Gil, María Paula Santalla, Susana Sotelo:
An Architecture for Document Routing in Spanish: Two Language Components, Pre-processor and Parser.

- John A. Bateman, Anthony F. Hartley:
Target Suites for Evaluating the Coverage of Text Generators.

- Claire Grover, Colin Matheson, Andrei Mikheev, Marc Moens:
LT TTT - A Flexible Tokenisation Tool.

- Albert Rilliard, Véronique Aubergé:
Perception and Analysis of a Reiterant Speech Paradigm: a Functional Diagnostic of Synthetic Prosody.

- Marcello Federico, Dimitri Giordani, Paolo Coletti:
Development and Evaluation of an Italian Broadcast News Corpus.

- Marta Villegas, Núria Bel, Alessandro Lenci, Nicoletta Calzolari, Nilda Ruimy, Antonio Zampolli, Teresa Sadurní, Joan Soler:
Multilingual Linguistic Resources: From Monolingual Lexicons to Bilingual Interrelated Lexicons.

- Alessandro Lenci, Simonetta Montemagni, Vito Pirrelli, Claudia Soria:
Where Opposites Meet. A Syntactic Meta-scheme for Corpus Annotation and Parsing Evaluation.

- Paolo Allegrini, Simonetta Montemagni, Vito Pirrelli:
Controlled Bootstrapping of Lexico-semantic Classes as a Bridge between Paradigmatic and Syntagmatic Knowledge: Methodology and Evaluation.

- Rodger Kibble, Kees van Deemter:
Coreference Annotation: Whither?

- Ramón López-Cózar, Antonio J. Rubio, Jesús E. Díaz-Verdejo, Ángel de la Torre:
Evaluation of a Dialogue System Based on a Generic Model that Combines Robust Speech Understanding and Mixed-initiative Control.

- Cosmin Munteanu, Marian Boldea:
MDWOZ: A Wizard of Oz Environment for Dialog Systems Development.

- Dan Bohus, Marian Boldea:
A Web-based Text Corpora Development System.

- Byron Georgantopoulos, Stelios Piperidis:
Term-based Identification of Sentences for Text Summarisation.

- Kristine Levane, Andrejs Spektors:
Morphemic Analysis and Morphological Tagging of Latvian Corpus.

- Patrick Kremer, Laurent Schmitt:
Textual Information Retrieval Systems Test: The Point of View of an Organizer and Corpuses Provider.

- Nelleke Oostdijk:
The Spoken Dutch Corpus. Overview and First Evaluation.

- Toni Badia, Angels Egea:
A Strategy for the Syntactic Parsing of Corpora: from Constraint Grammar Output to Unification-based Processing.

- Joan Soler i Bou:
Producing LRs in Parallel with Lexicographic Description: the DCC project.

- Atsushi Fujii, Tetsuya Ishikawa:
A Novelty-based Evaluation Method for Information Retrieval.

- Ruslan Mitkov:
Towards More Comprehensive Evaluation in Anaphora Resolution.

- Joseph Polifroni, Stephanie Seneff:
Galaxy-II as an Architecture for Spoken Dialogue Evaluation.

- Marko Tadic:
Building the Croatian-English Parallel Corpus.

- Tamás Váradi:
Lexical and Translation Equivalence in Parallel Corpora.

- Daan Broeder, Hennie Brugman, Albert Russel, R. Skiba, Peter Wittenburg:
Towards a Standard for Meta-descriptions of Language Resources.

- Einar Meister, Arvo Eek, Toomas Altosaar, Martti Vainio:
Object-oriented Access to the Estonian Phonetic Database.

- Adriana Roventini, Antonietta Alonge, Nicoletta Calzolari, Bernardo Magnini, Francesca Bertagna:
ItalWordNet: a Large Semantic Database for Italian.

- Catalina Barbu:
FAST - Towards a Semi-automatic Annotation of Corpora.

- François Trouilleux, Éric Gaussier, Gabriel G. Bès, Annie Zaenen:
Coreference Resolution Evaluation Based on Descriptive Specificity.

- Dominique Dutoit:
A Text->Meaning->Text Dictionary and Process.

- Philippe Boula de Mareüil, Christophe d'Alessandro, François Yvon, Véronique Aubergé, Jacqueline Vaissière, Angélique Amelot:
A French Phonetic Lexicon with Variants for Speech and Language Processing.

- Laila Dybkjær, Morten Baun Møller, Niels Ole Bernsen, Michael Grosse, Martin Olsen, Amanda Schiffrin:
Annotating Communication Problems Using the MATE Workbench.

- Niels Ole Bernsen, Laila Dybkjær:
A Methodology for Evaluating Spoken Language Dialogue Systems and Their Components.

- Niamh Bohan, Elisabeth Breidt, Martin Volk:
Evaluating Translation Quality as Input to Product Development.

- Lars Ahrenberg, Magnus Merkel, Anna Sågvall Hein, Jörg Tiedemann:
Evaluation of Word Alignment Systems.

- Hervé Déjean:
How To Evaluate and Compare Tagsets? A Proposal.

- John White, Jennifer Doyon, Susan Talbott:
Determining the Tolerance of Text-handling Tasks for MT Output.

- Johann Gamper:
A Parallel Corpus of Italian/German Legal Texts.

- Sabine Buchholz, Antal van den Bosch:
Integrating Seed Names and ngrams for a Named Entity List and Classifier.

- Hideki Kashioka, Satosi Shirai:
Automatically Expansion of Thesaurus Entries with a Different Thesaurus.

- Daniel Zeman, Anoop Sarkar:
Learning Verb Subcategorization from Corpora: Counting Frame Subsets.

- Saso Dzeroski, Tomaz Erjavec, Jakub Zavrel:
Morphosyntactic Tagging of Slovene: Evaluating Taggers and Tagsets.

- Giorgio Micca, Alessandra Frasca, Maria-Gabriella Di Benedetto:
Cross-lingual Interpolation of Speech Recognition Models.

- Wim Peters, Ivonne Peters:
Lexicalised Systematic Polysemy in WordNet.

- Björn Gambäck, Fredrik Olsson:
Experiences of Language Engineering Algorithm Reuse.

- Jana Klímová, Jan Kocek:
Derivation in the Czech National Corpus.

- Jakub Zavrel, Walter Daelemans:
Bootstrapping a Tagged Corpus through Combination of Existing Heterogeneous Taggers.

- Barbora Hladká:
The Context (not only) for Humans.

- Lars Borin:
Something Borrowed, Something Blue: Rule-based Combination of POS Taggers.

- Jon Mills:
Screffva: A Lexicographer's Workbench.

- Philippe Alcouffe, Nicolas Gacon, Claude Roux, Frédérique Segond:
A Step toward Semantic Indexing of an Encyclopedic Corpus.

- Thomas Brey, Gerhard Hanrieder, Paul Heisterkamp, Ludwig Hitzenberger, Peter Regel-Brietzmann:
Issues in the Evaluation of Spoken Dialogue Systems - Experience from the ACCeSS Project.

- Gees C. Stein, Tomek Strzalkowski, G. Bowden Wise, Amit Bagga:
Evaluating Summaries for Multiple Documents in an Interactive Environment.

- Jorge Kinoshita:
Grammarless Bracketing in an Aligned Bilingual Corpus.

- William J. Black, John McNaught, Gian Piero Zarri, Andreas Persidis, Andrew Brasher, Luca Gilardoni, Elisa Bertino, Giovanni Semeraro, Pietro Leo:
A Semi-automatic System for Conceptual Annotation, its Application to Resource Construction and Evaluation.

- Amy Isard, David McKelvie, Andreas Mengel, Morten Baun Møller:
The MATE Workbench Annotation Tool, a Technical Description.

- Rhys James Jones, John S. Mason, Louise Helliker, Mark Pawlewski:
Recruitment Techniques for Minority Language Speech Databases: Some Observations.

- Charles L. Wayne:
Multilingual Topic Detection and Tracking: Successful Research Enabled by Corpora and Evaluation.

- Montserrat Marimon Felipe, Jordi Porta Zamorano:
PoS Disambiguation and Partial Parsing Bidirectional Interaction.

- Hamish Cunningham, Kalina Bontcheva, Valentin Tablan, Yorick Wilks:
Software Infrastructure for Language Resources: a Taxonomy of Previous Work and a Requirements Analysis.

- Nancy Ide, Patrice Bonhomme, Laurent Romary:
XCES: An XML-based Encoding Standard for Linguistic Corpora.

- Iason Demiros, Sotiris Boutsis, Voula Giouli, Maria Liakata, Harris Papageorgiou, Stelios Piperidis:
Named Entity Recognition in Greek Texts.

- Sotiris Boutsis, Prokopis Prokopidis, Voula Giouli, Stelios Piperidis:
A Robust Parser for Unrestricted Greek Text.

- Matej Rojc, Zdravko Kacic:
A Computational Platform for Development of Morphologic and Phonetic Lexica.

- Constantin Orasan, Ramesh Krishnamurthy:
An Open Architecture for the Construction and Administration of Corpora.

- Matej Rojc, Zdravko Kacic:
Design of Optimal Slovenian Speech Corpus for Use in the Concatenative Speech Synthesis System.

- Constantin Orasan:
CLinkA A Coreferential Links Annotator.

- Adam Kilgarriff, Colin Yallop:
What's in a Thesaurus?

- Harris Papageorgiou, Prokopis Prokopidis, Voula Giouli, Stelios Piperidis:
A Unified POS Tagging Architecture and its Application to Greek.

- Patrice Bonhomme, Patrice Lopez:
Resources for Lexicalized Tree Adjoining Grammars and XML Encoding: TagML.

- Andreas Witt, Harald Lüngen, Dafydd Gibbon:
Enhancing Speech Corpus Resources with Multiple Lexical Tag Layers.

- Steven Bird, David Day, John S. Garofolo, John Henderson, Christophe Laprun, Mark Liberman:
ATLAS: A Flexible and Extensible Architecture for Linguistic Annotation.

- Pavel A. Skrelin, Tatiana Y. Sherstinova:
Models of Russian Text/Speech Interactive Databases for Supporting of Scientific, Practical and Cultural Researches.

- Lluís de Yzaguirre, Marta Ribas, Jordi Vivaldi, M. Teresa Cabré:
Some Technical Aspects about Aligning Near Languages.

- Tony McEnery, Paul Baker, Lou Burnard:
Corpus Resources and Minority Language Engineering.

- Brigitte Krenn:
CDB - A Database of Lexical Collocations.

- Marilyn A. Walker, Lynette Hirschman, John S. Aberdeen:
Evaluation for Darpa Communicator Spoken Dialogue Systems.

- Edouard Geoffrois, Claude Barras, Steven Bird, Zhibiao Wu:
Transcribing with Annotation Graphs.

- Massimo Poesio:
Annotating a Corpus to Develop and Evaluate Discourse Entity Realization Algorithms: Issues and Preliminary Results.

- Steven Bird, Peter Buneman, Wang Chiew Tan:
Towards a Query Language for Annotation Graphs.

- Catherine Macleod, Nancy Ide, Ralph Grishman:
The American National Corpus: A Standardized Resource for American English.

- Martha Palmer, Hoa Trang Dang, Joseph Rosenzweig:
Semantic Tagging for the Penn Treebank.

- Kiril Ribarov:
Rule-based Tagging: Morphological Tagset versus Tagset of Analytical Functions.

- Kiril Ribarov:
The (Un)Deterministic Nature of Morphological Context.

- David Day, Alan Goldschen, John Henderson:
A Framework for Cross-Document Annotation.

- Peggy Cadel, Hélène Ledouble:
Extraction of Concepts and Multilingual Information Schemes from French and English Economics Documents.

- Eric Breck, John D. Burger, Lisa Ferro, Lynette Hirschman, David House, Marc Light, Inderjeet Mani:
How to Evaluate Your Question Answering System Every Day ... and Still Get Real Work Done.

- Daniela Oppermann, Susanne Burger, Karl Weilhammer:
What are Transcription Errors and Why are They made?

- Barbara Di Eugenio:
On the Usage of Kappa to Evaluate Agreement on Coding Tasks.

- Le Sun, Youbing Jin, Lin Du, Yufang Sun:
Automatic Extraction of English-Chinese Term Lexicons from Noisy Bilingual Corpora.

- Christopher Cieri, Mark Liberman:
Issues in Corpus Creation and Distribution: The Evolution of the Linguistic Data Consortium.

- Christopher Cieri, David Graff, Mark Liberman, Nii Martey, Stephanie Strassel:
Large, Multilingual, Broadcast News Corpora for Cooperative Research in Topic Detection and Tracking: The TDT-2 and TDT-3 Corpus Efforts.

- Yuji Matsumoto, Tatsuo Yamashita:
Using Machine Learning Methods to Improve Quality of Tagged Corpora and Learning Models.

- Stephanie Strassel, David Graff, Nii Martey, Christopher Cieri:
Quality Control in Large Annotation Projects Involving Multiple Judges: The Case of the TDT Corpora.

- Takehito Utsuro:
Learning Preference of Dependency between Japanese Subordinate Clauses and its Evaluation in Parsing.

- Lin-Shan Lee, Lee-Feng Chien:
Live Lexicons and Dynamic Corpora Adapted to the Network Resources for Chinese Spoken Language Processing Applications in an Internet Era.

- Lori S. Levin, Boris Bartlog, Ariadna Font Llitjós, Donna Gates, Alon Lavie, Dorcas Wallace, Taro Watanabe, Monika Woszczyna:
Lessons Learned from a Task-based Evaluation of Speech-to-Speech Machine Translation.

- Frank Van Eynde, Jakub Zavrel, Walter Daelemans:
Part of Speech Tagging and Lemmatisation for the Spoken Dutch Corpus.

- Karl Weilhammer, Daniela Oppermann, Susanne Burger:
The Influence of Scenario Constraints on the Spontaneity of Speech. A Comparison of Dialogue Corpora.

- Leonardo Lesmo, Vincenzo Lombardo:
Automatic Assignment of Grammatical Relations.

- Bernardo Magnini, Gabriela Cavaglia:
Integrating Subject Field Codes into WordNet.

- Cristina Bosco, Vincenzo Lombardo, Daniela Vassallo, Leonardo Lesmo:
Building a Treebank for Italian: a Data-driven Annotation Schema.

- Kyongho Min, William H. Wilson, Yoo-Jin Moon:
Typographical and Orthographical Spelling Error Correction.

- Jana Klímová, Karel Pala:
Application of WordNet ILR in Czech Word-formation.

- Byeongchang Kim, Jin-Seok Lee, Jeongwon Cha, Geunbae Lee:
POSCAT: A Morpheme-based Speech Corpus Annotation Tool.

- Uwe Quasthoff, Christian Wolff:
A Flexible Infrastructure for Large Monolingual Corpora.

- Byung-Ju Kang, Key-Sun Choi:
Automatic Transliteration and Back-transliteration by Decision Tree Learning.

- Klaus Ries, Lori S. Levin, Liza Valle, Alon Lavie, Alex Waibel:
Shallow Discourse Genre Annotation in CallHome Spanish.

- Anne Abeillé, Lionel Clément, Alexandra Kinyon:
Building a Treebank for French.

- Paola Merlo, Suzanne Stevenson:
Establishing the Upper Bound and Inter-judge Agreement of a Verb Classification Task.

- Nadjet Bouayad-Agha:
Layout Annotation in a Corpus of Patient Information Leaflets.

- Dominique Vaufreydaz, C. Bergamini, Jean-François Serignat, Laurent Besacier, Mohammad Akbar:
A New Methodology for Speech Corpora Definition from Internet Documents.

- Luisa Bentivogli, Emanuele Pianta, Fabio Pianesi:
Coping with Lexical Gaps when Building Aligned Multilingual Wordnets.

- Young-Soog Chae, Key-Sun Choi:
Design and Construction of Knowledge base for Verb using MRD and Tagged Corpus.

- Young-Soog Chae, Key-Sun Choi:
Introduction of KIBS (Korean Information Base System) Project.

- John A. Bateman, Elke Teich, Geert-Jan M. Kruijff, Ivana Kruijff-Korbayová, Serge Sharoff, Hana Skoumalová:
Resources for Multilingual Text Generation in Three Slavic Languages.

- Dafydd Gibbon, Thorsten Trippel:
A Multi-view Hyperlexicon Resource for Speech and Language System Development.

- Lynne J. Cahill, Christy Doran, Roger Evans, Rodger Kibble, Chris Mellish, Daniel S. Paiva, Mike Reape, Donia Scott, Neil Tipper:
Enabling Resource Sharing in Language Generation: an Abstract Reference Architecture.

- Zdravko Kacic, Bogomir Horvat, Aleksandra Zögling:
Issues in Design and Collection of Large Telephone Speech Corpus for Slovenian Language.

- Christophe Jouis:
ARC A3: A Method for Evaluating Term Extracting Tools and/or Semantic Relations between Terms from Corpora.

- Richard F. E. Sutcliffe, Sadao Kurohashi:
A Parallel English-Japanese Query Collection for the Evaluation of On-Line Help Systems.

- Dan Tufis, Péter Dienes, Csaba Oravecz, Tamás Váradi:
Principled Hidden Tagset Design for Tiered Tagging of Hungarian.

- Felisa Verdejo, Julio Gonzalo, Anselmo Peñas, Fernando López-Ostenero, David Fernández-Amorós:
Evaluating Wordnets in Cross-language Information Retrieval: the ITEM Search Engine.

- Dafydd Gibbon, Ana Paula Quirino Simões, Martin Matthiesen:
An Optimised FS Pronunciation Resource Generator for Highly Inflecting Languages.

- Gabriel Illouz:
Sublanguage Dependent Evaluation: Toward Predicting NLP performances.

- Jan-Torsten Milde, Markus Reinsch:
The Universal XML Organizer: UXO.

- Helka Folch, Serge Heiden, Benoit Habert, Serge Fleury, Gabriel Illouz, Pierre Lafon, Julien Nioche, Sophie Prevost:
TyPTex: Inductive Typological Text Classification by Multivariate Statistical Analysis for NLP Systems Tuning/Evaluation.

- Davide Turcato, Janine Toole, Stavroula Tsiplakou, Trude Heift, Paul McFetridge:
An Approach to Lexical Development for Inflectional Languages.

- Luzia Wittmann, Ricardo Daniel Santos Faro Marques Ribeiro, Tânia Pêgo, Fernando Batista:
Some Language Resources and Tools for Computational Processing of Portuguese at INESC.

- Takehito Utsuro, Manabu Sassano:
Minimally Supervised Japanese Named Entity Recognition: Resources and Evaluation.

- Joyce Yue Chai:
Evaluation of a Generic Lexical Semantic Resource in Information Extraction.

- Jim Talley:
The Establishment of Motorola's Human Language Data Resource Center: Addressing the Criticality of Language Resources in the Industrial Setting.

- Katsunobu Itou, Kiyohiro Shikano, Tatsuya Kawahara, Kazuya Takeda, Atsushi Yamada, Akinori Ito, Takehito Utsuro, Tetsunori Kobayashi, Nobuaki Minematsu, Mikio Yamamoto, Shigeki Sagayama, Akinobu Lee:
IPA Japanese Dictation Free Software Project.

- Kikuo Maekawa, Hanae Koiso, Sadaoki Furui, Hitoshi Isahara:
Spontaneous Speech Corpus of Japanese.

- Sean Boisen, Michael Crystal, Richard M. Schwartz, Rebecca Stone, Ralph M. Weischedel:
Annotating Resources for Information Extraction.

- Thierry Declerck, Alexander Werner Jachmann, Hans Uszkoreit:
The New Edition of the Natural Language Software Registry (an Initiative of ACL hosted at DFKI).

- Jong-mi Kim:
Design Methodology for Bilingual Pronunciation Dictionary.

- Constandina Economou, Spyros Raptis, Gregory Stainhaouer:
LEXIPLOIGISSI: An Educational Platform for the Teaching of Terminology in Greece.

- Malgorzata Marciniak, Agnieszka Mykowiecka, Anna Kupsc, Adam Przepiórkowski:
An HPSG-Annotated Test Suite for Polish.

- Finn Tore Johansen, Narada D. Warakagoda, Børge Lindberg, Gunnar Lehtinen, Zdravko Kacic, Andrej Zgank, Kjell Elenius, Giampiero Salvi:
The COST 249 SpeechDat Multilingual Reference Recogniser.

- Marianna Katsoyannou, Eleni Efthimiou:
Terminology Encoding in View of Multifunctional NLP Resources.

- Key-Sun Choi, Young-Soog Chae:
Terminology in Korea: KORTERM.

- Gaëlle Birocheau:
Morphological Tagging to Resolve Morphological Ambiguities.

- Sonja Nießen, Franz Josef Och, Gregor Leusch, Hermann Ney:
An Evaluation Tool for Machine Translation: Fast Evaluation for MT Research.

- Fiammetta Namer, Georgette Dal:
GéDériF: Automatic Generation and Analysis of Morphologically Constructed Lexical Resources.

- Josué Ndamba, Jean Silence Bayamboussa:
Le Programme Compalex (COMPAraison LEXicale).

- David Graff, Steven Bird:
Many Uses, Many Annotations for Large Speech Corpora: Switchboard and TDT as Case Studies.

- Gerhard Budin, Alan K. Melby:
Accessibility of Multilingual Terminological Resources - Current Problems and Prospects for the Future.

- Bilel Gargouri, Mohamed Jmaiel, Abdelmajid Ben Hamadou:
Using a Formal Approach to Evaluate Grammars.

- Alvin F. Martin, Mark A. Przybocki:
Design Issues in Text-Independent Speaker Recognition Evaluation.

- Fei Xia, Martha Palmer, Nianwen Xue, Mary Ellen Okurowski, John Kovarik, Fu-Dong Chiou, Shizhe Huang, Tony Kroch, Mitchell P. Marcus:
Developing Guidelines and Ensuring Consistency for Chinese Text Annotation.

- Jerneja Gros, France Mihelic, Simon Dobrisek, Tomaz Erjavec, Mario Zganec:
Corpora of Slovene Spoken Language for Multi-lingual Applications.

- Ergina Kavallieratou, Nikos Liolios, E. Koutsogeorgos, Nikos Fakotakis, George K. Kokkinakis:
GRUHD: A Greek database of Unconstrained Handwriting.

- France Mihelic, Jerneja Gros, Elmar Nöth, Volker Warnke:
Labeling of Prosodic Events in Slovenian Speech Database GOPOLIS.

- Catia Cucchiarini, Johan Van Hoorde, Elisabeth D'Halleweyn:
NL-Translex: Machine Translation for Dutch.

- Jaroslava Hlavácová:
Rarity of Words in a Language and in a Corpus.

- Ángel Martín Municio, Guillermo Rojo, Fernando Sánchez León, Octavio Pinillos:
Language Resources Development at the Spanish Royal Academy.

- Irina Prodanof, Amedeo Cappelli, Lorenzo Moretti:
Reusability as Easy Adaptability: A Substantial Advance in NL Technology.

- Andrew Bredenkamp, Berthold Crysmann, Mirela Petrea:
Looking for Errors: A Declarative Formalism for Resource-adaptive Language Checking.

- Martin Gellerstam, Yvonne Cederholm, Torgny Rasmark:
The Bank of Swedish.

- George Tambouratzis, Stella Markantonatou, Nikolaos Hairetakis, George Carayannis:
Automatic Style Categorisation of Corpora in the Greek Language.

- Aristomenis Thanopoulos, Nikos Fakotakis, George Kokkinakis:
Automatic Extraction of Semantic Similarity of Words from Raw Technical Texts.

- Hélène Bonneau-Maynard, Laurence Devillers, Sophie Rosset:
Predictive Performance of Dialog Systems.

- Penny Labropoulou, Elena Mantzari, Harris Papageorgiou, Maria Gavrilidou:
Automatic Generation of Dictionary Definitions from a Computational Lexicon.

- Nicole Beringer, Marcia Neff:
Regional Pronunciation Variants for Automatic Segmentation.

- Mario Refice, Michelina Savino, Marco Altieri, Roberto Altieri:
SegWin: a Tool for Segmenting, Annotating, and Controlling the Creation of a Database of Spoken Italian Varieties.

- Klaus Bengler:
Automotive Speech-Recognition - Success Conditions Beyond Recognition Rates.

- Wolfgang Menzel, Eric Atwell, Patrizia Bonaventura, Daniel Herron, Peter Howarth, Rachel Morton, Clive Souter:
The ISLE Corpus of Non-Native Spoken English.

- Kallirroi Georgila, Nikos Fakotakis, George Kokkinakis:
A Graphical Parametric Language-Independent Tool for the Annotation of Speech Corpora.

- Georges Vignaux:
The PAROLE Program.

- Stéphane Chaudiron, Khalid Choukri, Audrey Mance, Valérie Mapelli:
For a Repository of NLP Tools.

- Jeffrey Allen, Khalid Choukri:
Survey of Language Engineering Needs: a Language Resources Perspective.

- Jo Calder:
Interarbora and Thistle - Delivering Linguistic Structure by the Internet.

- George Demetriou, Robert J. Gaizauskas:
Automatically Augmenting Terminological Lexicons from Untagged Text.

- Andrea Setzer, Robert J. Gaizauskas:
Annotating Events and Temporal Information in Newswire Texts.

- Bonnie J. Dorr, Gina-Anne Levow, Dekang Lin, Scott C. Thomas:
Chinese-English Semantic Resource Construction.

- Vera Fluhr-Semenova, Christian Fluhr, Stéphanie Brisson:
Production of NLP-oriented Bilingual Language Resources from Human-oriented dictionaries.

- Justus C. Roux, Elizabeth C. Botha, Johan A. du Preez:
Developing a Multilingual Telephone Based Information System in African Languages.

- Roberto Basili, Maria Teresa Pazienza, Michele Vindigni, Fabio Massimo Zanzotto:
Tuning Lexicons to New Operational Scenarios.

- José A. R. Fonollosa, Asunción Moreno:
SpeechDat-Car Fixed Platform.

- Thorsten Brants:
Inter-annotator Agreement for a German Newspaper Corpus.

- Thorsten Brants, Oliver Plaehn:
Interactive Corpus Annotation.

- Tomaz Erjavec, Roger Evans, Nancy Ide, Adam Kilgarriff:
The Concede Model for Lexical Databases.

- Nick Hatzigeorgiu, Maria Gavrilidou, Stelios Piperidis, George Carayannis, Anastasia Papakostopoulou, Athanassia Spiliotopoulou, Anna Vacalopoulou, Penny Labropoulou, Elena Mantzari, Harris Papageorgiou, Iason Demiros:
Design and Implementation of the Online ILSP Greek Corpus.

- Saturnino Luz:
A Software Toolkit for Sharing and Accessing Corpora Over the Internet.

- Ülle Viks:
Tools for the Generation of Morphological Entries in Dictionaries.

- Paula Guerreiro:
Improving Lexical Databases with Collocational Information: Data from Portuguese.

- Kiyoaki Shirai, Hozumi Tanaka, Takenobu Tokunaga:
Semi-automatic Construction of a Tree-annotated Corpus Using an Iterative Learning Statistical Language Model.

- Marilyn Mason:
Issues from Corpus Analysis that have influenced the On-going Development of Various Haitian Creole Text- and Speech-based NLP Systems and Applications.

- David Portabella, Albert Febrer, Asunción Moreno:
NaniTrans: a Speech Labelling Tool.

- Sanda M. Harabagiu, Steven J. Maiorano:
Acquisition of Linguistic Patterns for Knowledge-based Information Extraction.

- Elisabeth D'Halleweyn, Erwin Dewallef, Jeannine Beeken:
A Platform for Dutch in Human Language Technologies.

- Marilyn A. Walker, Candace A. Kamm, Julie E. Boland:
Developing and Testing General Models of Spoken Dialogue System Performance.

- Claude de Loupy, Marc El-Bèze:
Using Few Clues Can Compensate the Small Amount of Resources Available for Word Sense Disambiguation.

- George K. Mikros, George Carayannis:
Modern Greek Corpus Taxonomy.

- Patrick Paroubek:
Language Resources as by-Product of Evaluation: The MULTITAG Example.

- Judith L. Klavans, Nina Wacholder, David Kirk Evans:
Evaluation of Computational Linguistic Techniques for Identifying Significant Topics for Browsing Applications.

- Satoshi Nakamura, Kazuo Hiyane, Futoshi Asano, Takanobu Nishiura, Takeshi Yamada:
Acoustical Sound Database in Real Environments for Sound Scene Understanding and Hands-Free Speech Recognition.

- George Demetriou, Eric Atwell, Clive Souter:
Using Lexical Semantic Knowledge from Machine Readable Dictionaries for Domain Independent Language Modelling.

- Luca Cristoforetti, Marco Matassoni, Maurizio Omologo, Piergiorgio Svaizer, Enrico Zovato:
Annotation of a Multichannel Noisy Speech Corpus.

- John Kontos, Ioanna Malagardi, Spyros Fountoukis:
ARISTA Generative Lexicon for Compound Greek Medical Terms.

- Knut Hofland:
A Self-Expanding Corpus Based on Newspapers on the Web.

- Janne Bondi Johannessen, Anders Nøklestad, Kristin Hagen:
A Web-based Advanced and User Friendly System: The Oslo Corpus of Tagged Norwegian Texts.

- Nick Campbell:
COCOSDA - a Progress Report.

- Ivonne Peters, Wim Peters:
The Treatment of Adjectives in SIMPLE: Theoretical Observations.

- Christine Michel:
Cardinal, Nominal or Ordinal Similarity Measures in Comparative Evaluation of Information Retrieval Process.

- Laurie E. Damianos, Jill L. Drury, Tari Lin Fanderclai, Lynette Hirschman, Jeffrey L. Kurtz, Beatrice T. Oshika:
Evaluating Multi-party Multi-modal Systems.

- Claudia Kunze:
Extension and Use of GermaNet, a Lexical-Semantic Database.

- Serge A. Yablonsky:
Russian Monitor Corpora: Composition, Linguistic Encoding and Internet Publication.

- Ann A. Copestake, Dan Flickinger:
An Open Source Grammar Development Environment and Broad-coverage English Grammar Using HPSG.

- Maosong Sun, Honglin Sun, Changning Huang, Zhang Pu, Xing Hongbing, Zhou Qiang:
Hua Yu: A Word-segmented and Part-Of-Speech Tagged Chinese Corpus.

- Asunción Moreno, Børge Lindberg, Christoph Draxler, Gaël Richard, Khalid Choukri, Stephan Euler, Jeffrey Allen:
SPEECHDAT-CAR. A Large Speech Database for Automotive Environments.

- Giovanna Turrini, Laura Cignoni, Alessandro Paccosi:
Addizionario: an Interactive Hypermedia Tool for Language Learning.

- Khalid Choukri, Audrey Mance, Valérie Mapelli:
Recent Developments within the European Language Resources Association (ELRA).

Last update Sat May 18 15:12:05 2013 CET by the DBLP Team. Data released under the ODC-BY 1.0 license.