COLING/ACL 2006 CD | CONFERENCE ONLINE | COLING/ACL 2006 ONLINE | ACL ONLINE

Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics

General Chair
Nicoletta Calzolari (Istituto di Linguistica Computazionale – CNR, Italy)

Program Committee Co-Chairs
Claire Cardie (Cornell University, USA)
Pierre Isabelle (National Research Council of Canada, Canada)

Full proceedings volume 1 (PDF)
Full proceedings volume 2 (PDF)
Schedule and Author index (HTML)
Bibliography (BibTeX)
Live website


Front matter bib
Combination of Arabic Preprocessing Schemes for Statistical Machine Translation
Fatiha Sadat and Nizar Habash
pp. 1–8 bib
Going Beyond AER: An Extensive Analysis of Word Alignments and Their Impact on MT
Necip Fazil Ayan and Bonnie J. Dorr
pp. 9–16 bib
Unsupervised Topic Modelling for Multi-Party Spoken Discourse
Matthew Purver, Konrad P. Körding, Thomas L. Griffiths and Joshua B. Tenenbaum
pp. 17–24 bib
Minimum Cut Model for Spoken Lecture Segmentation
Igor Malioutov and Regina Barzilay
pp. 25–32 bib
Bootstrapping Path-Based Pronoun Resolution
Shane Bergsma and Dekang Lin
pp. 33–40 bib
Kernel-Based Pronoun Resolution with Structured Syntactic Knowledge
Xiaofeng Yang, Jian Su and Chew Lim Tan
pp. 41–48 bib
A Finite-State Model of Human Sentence Processing
Jihyun Park and Chris Brew
pp. 49–56 bib
Acceptability Prediction by Means of Grammaticality Quantification
Philippe Blache, Barbara Hemforth and Stéphane Rauzy
pp. 57–64 bib
Discriminative Word Alignment with Conditional Random Fields
Phil Blunsom and Trevor Cohn
pp. 65–72 bib
Named Entity Transliteration with Comparable Corpora
Richard Sproat, Tao Tao and ChengXiang Zhai
pp. 73–80 bib
Extracting Parallel Sub-Sentential Fragments from Non-Parallel Corpora
Dragos Stefan Munteanu and Daniel Marcu
pp. 81–88 bib
Estimating Class Priors in Domain Adaptation for Word Sense Disambiguation
Yee Seng Chan and Hwee Tou Ng
pp. 89–96 bib
Ensemble Methods for Unsupervised WSD
Samuel Brody, Roberto Navigli and Mirella Lapata
pp. 97–104 bib
Meaningful Clustering of Senses Helps Boost Word Sense Disambiguation Performance
Roberto Navigli
pp. 105–112 bib
Espresso: Leveraging Generic Patterns for Automatically Harvesting Semantic Relations
Patrick Pantel and Marco Pennacchiotti
pp. 113–120 bib
Modeling Commonality among Related Classes in Relation Extraction
GuoDong Zhou, Jian Su and Min Zhang
pp. 121–128 bib
Relation Extraction Using Label Propagation Based Semi-Supervised Learning
Jinxiu Chen, Donghong Ji, Chew Lim Tan and Zhengyu Niu
pp. 129–136 bib
Polarized Unification Grammars
Sylvain Kahane
pp. 137–144 bib
Partially Specified Signatures: A Vehicle for Grammar Modularity
Yael Cohen-Sygal and Shuly Wintner
pp. 145–152 bib
Morphology-Syntax Interface for Turkish LFG
Özlem Çetinoğlu and Kemal Oflazer
pp. 153–160 bib
PCFGs with Syntactic and Prosodic Indicators of Speech Repairs
John Hale, Izhak Shafran, Lisa Yung, Bonnie Dorr, Mary Harper, Anna Krasnyanskaya, Matthew Lease, Yang Liu, Brian Roark, Matthew Snover and Robin Stewart
pp. 161–168 bib
Dependency Parsing of Japanese Spoken Monologue Based on Clause Boundaries
Tomohiro Ohno, Shigeki Matsubara, Hideki Kashioka, Takehiko Maruyama and Yasuyoshi Inagaki
pp. 169–176 bib
Trace Prediction and Recovery with Unlexicalized PCFGs and Slash Features
Helmut Schmid
pp. 177–184 bib
Learning More Effective Dialogue Strategies Using Limited Dialogue Move Features
Matthew Frampton and Oliver Lemon
pp. 185–192 bib
Dependencies between Student State and Speech Recognition Problems in Spoken Tutoring Dialogues
Mihai Rotaru and Diane J. Litman
pp. 193–200 bib
Learning the Structure of Task-Driven Human-Human Dialogs
Srinivas Bangalore, Giuseppe Di Fabbrizio and Amanda Stent
pp. 201–208 bib
Semi-Supervised Conditional Random Fields for Improved Sequence Segmentation and Labeling
Feng Jiao, Shaojun Wang, Chi-Hoon Lee, Russell Greiner and Dale Schuurmans
pp. 209–216 bib
Training Conditional Random Fields with Multivariate Evaluation Measures
Jun Suzuki, Erik McDermott and Hideki Isozaki
pp. 217–224 bib
Approximation Lasso Methods for Language Modeling
Jianfeng Gao, Hisami Suzuki and Bin Yu
pp. 225–232 bib
Automated Japanese Essay Scoring System based on Articles Written by Experts
Tsunenori Ishioka and Masayuki Kameda
pp. 233–240 bib
A Feedback-Augmented Method for Detecting Errors in the Writing of Learners of English
Ryo Nagata, Atsuo Kawai, Koichiro Morihiro and Naoki Isu
pp. 241–248 bib
Correcting ESL Errors Using Phrasal SMT Techniques
Chris Brockett, William B. Dolan and Michael Gamon
pp. 249–256 bib
Graph Transformations in Data-Driven Dependency Parsing
Jens Nilsson, Joakim Nivre and Johan Hall
pp. 257–264 bib
Learning to Generate Naturalistic Utterances Using Reviews in Spoken Dialogue Systems
Ryuichiro Higashinaka, Rashmi Prasad and Marilyn A. Walker
pp. 265–272 bib
Measuring Language Divergence by Intra-Lexical Comparison
T. Mark Ellison and Simon Kirby
pp. 273–280 bib
Enhancing Electronic Dictionaries with an Index Based on Associations
Olivier Ferret and Michael Zock
pp. 281–288 bib
Guiding a Constraint Dependency Parser with Supertags
Kilian Foth, Tomas By and Wolfgang Menzel
pp. 289–296 bib
Efficient Unsupervised Discovery of Word Categories Using Symmetric Patterns and High Frequency Words
Dmitry Davidov and Ari Rappoport
pp. 297–304 bib
Bayesian Query-Focused Summarization
Hal Daumé III and Daniel Marcu
pp. 305–312 bib
Expressing Implicit Semantic Relations without Supervision
Peter D. Turney
pp. 313–320 bib
Hybrid Parsing: Using Probabilistic Models as Predictors for a Symbolic Parser
Kilian A. Foth and Wolfgang Menzel
pp. 321–328 bib
Error Mining in Parsing Results
Benoît Sagot and Éric de La Clergerie
pp. 329–336 bib
Reranking and Self-Training for Parser Adaptation
David McClosky, Eugene Charniak and Mark Johnson
pp. 337–344 bib
Automatic Classification of Verbs in Biomedical Texts
Anna Korhonen, Yuval Krymolowski and Nigel Collier
pp. 345–352 bib
Selection of Effective Contextual Information for Automatic Synonym Acquisition
Masato Hagiwara, Yasuhiro Ogawa and Katsuhiko Toyama
pp. 353–360 bib
Scaling Distributional Similarity to Large Corpora
James Gorman and James R. Curran
pp. 361–368 bib
Extractive Summarization using Inter- and Intra- Event Relevance
Wenjie Li, Mingli Wu, Qin Lu, Wei Xu and Chunfa Yuan
pp. 369–376 bib
Models for Sentence Compression: A Comparison across Domains, Training Requirements and Evaluation Measures
James Clarke and Mirella Lapata
pp. 377–384 bib
A Bottom-Up Approach to Sentence Ordering for Multi-Document Summarization
Danushka Bollegala, Naoaki Okazaki and Mitsuru Ishizuka
pp. 385–392 bib
Learning Event Durations from Event Descriptions
Feng Pan, Rutu Mulkar and Jerry R. Hobbs
pp. 393–400 bib
Automatic Learning of Textual Entailments with Cross-Pair Similarities
Fabio Massimo Zanzotto and Alessandro Moschitti
pp. 401–408 bib
An Improved Redundancy Elimination Algorithm for Underspecified Representations
Alexander Koller and Stefan Thater
pp. 409–416 bib
Integrating Syntactic Priming into an Incremental Probabilistic Parser, with an Application to Psycholinguistic Modeling
Amit Dubey, Frank Keller and Patrick Sturt
pp. 417–424 bib
A Fast, Accurate Deterministic Parser for Chinese
Mengqiu Wang, Kenji Sagae and Teruko Mitamura
pp. 425–432 bib
Learning Accurate, Compact, and Interpretable Tree Annotation
Slav Petrov, Leon Barrett, Romain Thibaux and Dan Klein
pp. 433–440 bib
Semi-Supervised Learning of Partial Cognates Using Bilingual Bootstrapping
Oana Frunza and Diana Inkpen
pp. 441–448 bib
Direct Word Sense Matching for Lexical Substitution
Ido Dagan, Oren Glickman, Alfio Gliozzo, Efrat Marmorshtein and Carlo Strapparava
pp. 449–456 bib
An Equivalent Pseudoword Solution to Chinese Word Sense Disambiguation
Zhimao Lu, Haifeng Wang, Jianmin Yao, Ting Liu and Sheng Li
pp. 457–464 bib
Improving the Scalability of Semi-Markov Conditional Random Fields for Named Entity Recognition
Daisuke Okanohara, Yusuke Miyao, Yoshimasa Tsuruoka and Jun’ichi Tsujii
pp. 465–472 bib
Factorizing Complex Models: A Case Study in Mention Detection
Radu Florian, Hongyan Jing, Nanda Kambhatla and Imed Zitouni
pp. 473–480 bib
Segment-Based Hidden Markov Models for Information Extraction
Zhenmei Gu and Nick Cercone
pp. 481–488 bib
A DOM Tree Alignment Model for Mining Parallel Data from the Web
Lei Shi, Cheng Niu, Ming Zhou and Jianfeng Gao
pp. 489–496 bib
QuestionBank: Creating a Corpus of Parse-Annotated Questions
John Judge, Aoife Cahill and Josef van Genabith
pp. 497–504 bib
Creating a CCGbank and a Wide-Coverage CCG Lexicon for German
Julia Hockenmaier
pp. 505–512 bib
Improved Discriminative Bilingual Word Alignment
Robert C. Moore, Wen-tau Yih and Andreas Bode
pp. 513–520 bib
Maximum Entropy Based Phrase Reordering Model for Statistical Machine Translation
Deyi Xiong, Qun Liu and Shouxun Lin
pp. 521–528 bib
Distortion Models for Statistical Machine Translation
Yaser Al-Onaizan and Kishore Papineni
pp. 529–536 bib
A Study on Automatically Extracted Keywords in Text Categorization
Anette Hulth and Beáta B. Megyesi
pp. 537–544 bib
A Comparison and Semi-Quantitative Analysis of Words and Character-Bigrams as Features in Chinese Text Categorization
Jingyang Li, Maosong Sun and Xian Zhang
pp. 545–552 bib
Exploiting Comparable Corpora and Bilingual Dictionaries for Cross-Language Text Categorization
Alfio Gliozzo and Carlo Strapparava
pp. 553–560 bib
A Progressive Feature Selection Algorithm for Ultra Large Feature Spaces
Qi Zhang, Fuliang Weng and Zhe Feng
pp. 561–568 bib
Annealing Structural Bias in Multilingual Weighted Grammar Induction
Noah A. Smith and Jason Eisner
pp. 569–576 bib
Maximum Entropy Based Restoration of Arabic Diacritics
Imed Zitouni, Jeffrey S. Sorensen and Ruhi Sarikaya
pp. 577–584 bib
An Iterative Implicit Feedback Approach to Personalized Search
Yuanhua Lv, Le Sun, Junlin Zhang, Jian-Yun Nie, Wan Chen and Wei Zhang
pp. 585–592 bib
The Effect of Translation Quality in MT-Based Cross-Language Information Retrieval
Jiang Zhu and Haifeng Wang
pp. 593–600 bib
A Comparison of Document, Sentence, and Term Event Spaces
Catherine Blake
pp. 601–608 bib
Tree-to-String Alignment Template for Statistical Machine Translation
Yang Liu, Qun Liu and Shouxun Lin
pp. 609–616 bib
Incorporating Speech Recognition Confidence into Discriminative Named Entity Recognition of Speech Data
Katsuhito Sudoh, Hajime Tsukada and Hideki Isozaki
pp. 617–624 bib
Exploiting Syntactic Patterns as Clues in Zero-Anaphora Resolution
Ryu Iida, Kentaro Inui and Yuji Matsumoto
pp. 625–632 bib
Self-Organizing n-gram Model for Automatic Word Spacing
Seong-Bae Park, Yoon-Shik Tae and Se-Young Park
pp. 633–640 bib
Concept Unification of Terms in Different Languages for IR
Qing Li, Sung-Hyon Myaeng, Yun Jin and Bo-yeong Kang
pp. 641–648 bib
Word Alignment in English-Hindi Parallel Corpus Using Recency-Vector Approach: Some Studies
Niladri Chatterjee and Saumya Agrawal
pp. 649–656 bib
Extracting Loanwords from Mongolian Corpora and Producing a Japanese-Mongolian Bilingual Dictionary
Badam-Osor Khaltar, Atsushi Fujii and Tetsuya Ishikawa
pp. 657–664 bib
An Unsupervised Morpheme-Based HMM for Hebrew Morphological Disambiguation
Meni Adler and Michael Elhadad
pp. 665–672 bib
Contextual Dependencies in Unsupervised Word Segmentation
Sharon Goldwater, Thomas L. Griffiths and Mark Johnson
pp. 673–680 bib
MAGEAD: A Morphological Analyzer and Generator for the Arabic Dialects
Nizar Habash and Owen Rambow
pp. 681–688 bib
Noun Phrase Chunking in Hebrew: Influence of Lexical and Morphological Features
Yoav Goldberg, Meni Adler and Michael Elhadad
pp. 689–696 bib
Multi-Tagging for Lexicalized-Grammar Parsing
James R. Curran, Stephen Clark and David Vadas
pp. 697–704 bib
Guessing Parts-of-Speech of Unknown Words Using Global Information
Tetsuji Nakagawa and Yuji Matsumoto
pp. 705–712 bib
A Clustered Global Phrase Reordering Model for Statistical Machine Translation
Masaaki Nagata, Kuniko Saito, Kazuhide Yamamoto and Kazuteru Ohashi
pp. 713–720 bib
A Discriminative Global Training Algorithm for Statistical MT
Christoph Tillmann and Tong Zhang
pp. 721–728 bib
Phoneme-to-Text Transcription System with an Infinite Vocabulary
Shinsuke Mori, Daisuke Takuma and Gakuto Kurata
pp. 729–736 bib
Automatic Generation of Domain Models for Call-Centers from Noisy Transcriptions
Shourya Roy and L Venkata Subramaniam
pp. 737–744 bib
Proximity in Context: An Empirically Grounded Computational Model of Proximity for Processing Topological Spatial Expressions
John D. Kelleher, Geert-Jan M. Kruijff and Fintan J. Costello
pp. 745–752 bib
Machine Learning of Temporal Relations
Inderjeet Mani, Marc Verhagen, Ben Wellner, Chong Min Lee and James Pustejovsky
pp. 753–760 bib
An End-to-End Discriminative Approach to Machine Translation
Percy Liang, Alexandre Bouchard-Côté, Dan Klein and Ben Taskar
pp. 761–768 bib
Semi-Supervised Training for Statistical Word Alignment
Alexander Fraser and Daniel Marcu
pp. 769–776 bib
Left-to-Right Target Generation for Hierarchical Phrase-Based Translation
Taro Watanabe, Hajime Tsukada and Hideki Isozaki
pp. 777–784 bib
You Can’t Beat Frequency (Unless You Use Linguistic Knowledge) – A Qualitative Evaluation of Association Measures for Collocation and Term Extraction
Joachim Wermter and Udo Hahn
pp. 785–792 bib
Ontologizing Semantic Relations
Marco Pennacchiotti and Patrick Pantel
pp. 793–800 bib
Semantic Taxonomy Induction from Heterogenous Evidence
Rion Snow, Daniel Jurafsky and Andrew Y. Ng
pp. 801–808 bib
Names and Similarities on the Web: Fact Extraction in the Fast Lane
Marius Paşca, Dekang Lin, Jeffrey Bigham, Andrei Lifchits and Alpa Jain
pp. 809–816 bib
Weakly Supervised Named Entity Transliteration and Discovery from Multilingual Comparable Corpora
Alexandre Klementiev and Dan Roth
pp. 817–824 bib
A Composite Kernel to Extract Relations between Entities with Both Flat and Structured Features
Min Zhang, Jie Zhang, Jian Su and GuoDong Zhou
pp. 825–832 bib
Japanese Dependency Parsing Using Co-Occurrence Information and a Combination of Case Elements
Takeshi Abekawa and Manabu Okumura
pp. 833–840 bib
Answer Extraction, Semantic Clustering, and Extractive Summarization for Clinical Question Answering
Dina Demner-Fushman and Jimmy Lin
pp. 841–848 bib
Discovering Asymmetric Entailment Relations between Verbs Using Selectional Preferences
Fabio Massimo Zanzotto, Marco Pennacchiotti and Maria Teresa Pazienza
pp. 849–856 bib
Event Extraction in a Plot Advice Agent
Harry Halpin and Johanna D. Moore
pp. 857–864 bib
An All-Subtrees Approach to Unsupervised Parsing
Rens Bod
pp. 865–872 bib
Advances in Discriminative Parsing
Joseph Turian and I. Dan Melamed
pp. 873–880 bib
Prototype-Driven Grammar Induction
Aria Haghighi and Dan Klein
pp. 881–888 bib
Exploring Correlation of Dependency Relation Paths for Answer Extraction
Dan Shen and Dietrich Klakow
pp. 889–896 bib
Question Answering with Lexical Chains Propagating Verb Arguments
Adrian Novischi and Dan Moldovan
pp. 897–904 bib
Methods for Using Textual Entailment in Open-Domain Question Answering
Sanda Harabagiu and Andrew Hickl
pp. 905–912 bib
Using String-Kernels for Learning Semantic Parsers
Rohit J. Kate and Raymond J. Mooney
pp. 913–920 bib
A Bootstrapping Approach to Unsupervised Detection of Cue Phrase Variants
Rashid M. Abdalla and Simone Teufel
pp. 921–928 bib
Semantic Role Labeling via FrameNet, VerbNet and PropBank
Ana-Maria Giuglea and Alessandro Moschitti
pp. 929–936 bib
Multilingual Legal Terminology on the Jibiki Platform: The LexALP Project
Gilles Sérasset, Francis Brunet-Manquat and Elena Chiocchetti
pp. 937–944 bib
Leveraging Reusability: Cost-Effective Lexical Acquisition for Large-Scale Ontology Translation
G. Craig Murray, Bonnie J. Dorr, Jimmy Lin, Jan Hajič and Pavel Pecina
pp. 945–952 bib
Accurate Collocation Extraction Using a Multilingual Parser
Violeta Seretan and Eric Wehrli
pp. 953–960 bib
Scalable Inference and Training of Context-Rich Syntactic Translation Models
Michel Galley, Jonathan Graehl, Kevin Knight, Daniel Marcu, Steve DeNeefe, Wei Wang and Ignacio Thayer
pp. 961–968 bib
Modelling Lexical Redundancy for Machine Translation
David Talbot and Miles Osborne
pp. 969–976 bib
Empirical Lower Bounds on the Complexity of Translational Equivalence
Benjamin Wellington, Sonjia Waxmonsky and I. Dan Melamed
pp. 977–984 bib
A Hierarchical Bayesian Language Model Based On Pitman-Yor Processes
Yee Whye Teh
pp. 985–992 bib
A Phonetic-Based Approach to Chinese Chat Text Normalization
Yunqing Xia, Kam-Fai Wong and Wenjie Li
pp. 993–1000 bib
Discriminative Pruning of Language Models for Chinese Word Segmentation
Jianfeng Li, Haifeng Wang, Dengjun Ren and Guohua Li
pp. 1001–1008 bib
Novel Association Measures Using Web Search with Double Checking
Hsin-Hsi Chen, Ming-Shun Lin and Yu-Chuan Wei
pp. 1009–1016 bib
Semantic Retrieval for the Accurate Identification of Relational Concepts in Massive Textbases
Yusuke Miyao, Tomoko Ohta, Katsuya Masuda, Yoshimasa Tsuruoka, Kazuhiro Yoshida, Takashi Ninomiya and Jun’ichi Tsujii
pp. 1017–1024 bib
Exploring Distributional Similarity Based Models for Query Spelling Correction
Mu Li, Muhua Zhu, Yang Zhang and Ming Zhou
pp. 1025–1032 bib
Robust PCFG-Based Generation Using Automatically Acquired LFG Approximations
Aoife Cahill and Josef van Genabith
pp. 1033–1040 bib
Incremental Generation of Spatial Referring Expressions in Situated Dialog
John D. Kelleher and Geert-Jan M. Kruijff
pp. 1041–1048 bib
Learning to Predict Case Markers in Japanese
Hisami Suzuki and Kristina Toutanova
pp. 1049–1056 bib
Are These Documents Written from Different Perspectives? A Test of Different Perspectives Based on Statistical Distribution Divergence
Wei-Hao Lin and Alexander Hauptmann
pp. 1057–1064 bib
Word Sense and Subjectivity
Janyce Wiebe and Rada Mihalcea
pp. 1065–1072 bib
Improving QA Accuracy by Question Inversion
John Prager, Pablo Duboue and Jennifer Chu-Carroll
pp. 1073–1080 bib
Reranking Answers for Definitional QA Using Language Modeling
Yi Chen, Ming Zhou and Shilong Wang
pp. 1081–1088 bib
Highly Constrained Unification Grammars
Daniel Feinstein and Shuly Wintner
pp. 1089–1096 bib
A Polynomial Parsing Algorithm for the Topological Model: Synchronizing Constituent and Dependency Grammars, Illustrated by German Word Order Phenomena
Kim Gerdes and Sylvain Kahane
pp. 1097–1104 bib
Stochastic Language Generation Using WIDL-Expressions and its Application in Machine Translation and Summarization
Radu Soricut and Daniel Marcu
pp. 1105–1112 bib
Learning to Say It Well: Reranking Realizations by Predicted Synthesis Quality
Crystal Nakatsu and Michael White
pp. 1113–1120 bib
An Effective Two-Stage Model for Exploiting Non-Local Dependencies in Named Entity Recognition
Vijay Krishnan and Christopher D. Manning
pp. 1121–1128 bib
Learning Transliteration Lexicons from the Web
Jin-Shea Kuo, Haizhou Li and Ying-Kuei Yang
pp. 1129–1136 bib
Punjabi Machine Transliteration
M.G. Abbas Malik
pp. 1137–1144 bib
Multilingual Document Clustering: An Heuristic Approach Based on Cognate Named Entities
Soto Montalvo, Raquel Martínez, Arantza Casillas and Víctor Fresno
pp. 1145–1152 bib
Time Period Identification of Events in Text
Taichi Noro, Takashi Inui, Hiroya Takamura and Manabu Okumura
pp. 1153–1160 bib
Optimal Constituent Alignment with Edge Covers for Semantic Projection
Sebastian Padó and Mirella Lapata
pp. 1161–1168 bib
Utilizing Co-Occurrence of Answers in Question Answering
Min Wu and Tomek Strzalkowski
pp. 1169–1176 bib