Machine Translation Archive

Subject Index

to methods, techniques, issues and topics

Publications 2005-2009

[Click for later publications since 2010; for earlier publications: 2000-2004, 1990-1999, before 1990]

To return to home page click here

 

Adjectives

(2008) Bruno Cartoni: Lexical resources for automatic translation of constructed neologisms: the case study of relational adjectives. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 57KB]

Adverbs

(2008) Elliott Macklovitch, Guy Lapalme, & Fabrizio Gotti: TransSearch: what are translators looking for?  AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.412-419. [PDF, 757KB]

Alignment

(2009) Sergio Barrachina, Oliver Bender, Francisco Casacuberta, Jorge Civera, Elsa Cubel, Shahram Khadivi, Antonio Lagarda, Hermann Ney, Jesús Tomás, Enrique Vidal, & Juan-Miguel Vilar: Statistical approaches to computer-assisted translation. Computational Linguistics 35 (1), pp.3-28. [PDF, 252KB]

(2009) Reza Bosagh Zadeh: Building strong multilingual aligned corpora. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.176-181. [PDF, 497KB]

(2009) Gosse Bouma, Sergio Duarte, & Zahurul Islam: Cross-lingual alignment and completion of Wikipedia templates. NAACL-HLT-2009: The Third International Workshop on Cross Lingual Information Access: Addressing the Information Need of Multilingual Societies (CLIAWS3), Proceedings of the Workshop, June 4, 2009, Boulder, Colorado; pp.21-29. [PDF, 160KB]

(2009) Jamie Brunning, Adrià de Gispert & William Byrne: Context-dependent alignment models for statistical machine translation.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.110-118. [PDF, 226KB]

(2009) Guy De Pauw, Peter Waiganjo Wagacha, & Gilles-Maurice de Schryver: The SAWA corpus: a parallel corpus English – Swahili. Proceedings of the EACL Workshop on Language Technologies for African Languages (AfLaT 2009), Athens, Greece, 31 March 2009; pp.9-16. [PDF, 129KB]

(2009) Thi-Ngoc-Diep Do, Viet-Bac Le, Brigitte Bigi, Laurent Besacier, & Eric Castelli: Mining a comparable text corpus for a Vietnamese-French statistical machine translation system.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.165-172. [PDF, 152KB]

(2009) Sittichai Jiampojamarn, Aditya Bhargava, Qing Dou, Kenneth Dwyer, & Grzegorz Kondrak: DirecTL: a language-independent approach to transliteration. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.28-31. [PDF, 128KB]

(2009) Adrien Lardilleux, Jonathan Chevelu, Yves Lepage, Ghislain Putois, & Julien Gosme: Lexicons or phrase tables? An investigation in sampling-based multilingual alignment. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.45-52. [PDF, 345KB]; presentation [PDF of PPT, 256KB]

(2009) Yves Lepage, Adrien Lardilleux, & Julien Gosme: The GREYC translation memory for the IWSLT 2009 evaluation campaign: one step beyond translation memory. IWSLT 2009: Proceedings of the International Workshop on Spoke n Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 45-49. [PDF, ]; presentation [PDF of PPT,

(2009) Chi-Ho Li, Xiaodong He, Yupeng Liu, & Ning Xi: Incremental HMM alignment for MT system combination. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.949-957. [PDF, 559KB]

(2009) Samat Mamitimin & Min Hou: Chinese-Uyghur sentence alignment: an approach based on anchor sentences. [ACL-IJCNLP-2009] Proceedings of the 2nd Workshop on Building and Using Comparable Corpora, Suntec, Singapore, 6 August 2009; pp.38-45. [PDF, 215KB]

(2009) David Mareček: Analysis and alignment of parallel data in TectoMT. Third Machine Translation Marathon, Prague, Czech Republic, 26-30 January 2009; pp.57-58 [PDF, 122KB]

(2009) Toshiaki Nakazawa & Sadao Kurohashi: Statistical phrase alignment model using dependency relation probability.  Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.10-18. [PDF, 670KB]

(2009) Hideharu Okuma, Kazuo Hara, Masashi Shimbo, & Yuji Matsumoto: Bypassed alignment graph for learning coordination in Japanese sentences. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Short Papers, Suntec, Singapore, 4 August 2009; pp.5-8. [PDF, 474KB]

(2009) Vladimir Pervouchine, Haizhou Li, & Bo Lin: Transliteration alignment. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.136-144. [PDF, 572KB]

(2009) Emmanuel Prochasson, Emmanuel Morin & Kyo Kageura: Anchor points for bilingual lexicon extraction from small comparable corpora. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.284-291. [PDF, 138KB]

(2009) Antti-Veikko I.Rosti, Bing Zhang, Spyros Matsoukas, & Richard Schwartz: Incremental hypothesis alignment with flexible matching for building confusion networks: BBN system description for WMT09 system combination task. Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.61-65. [PDF, 115KB]

(2009) Ken Samuel, Alan Rubenstein, Sherri Condon & Alex Yeh: Name matching between Chinese and Roman scripts: machine complements human. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.28-31. [PDF, 454KB]

(2009) Felipe Sánchez-Martínez & Mikel L.Forcada: Inferring shallow-transfer machine translation rules from small parallel corpora. Journal of Artificial Intelligence Research 34, pp. 605-635. [PDF, 706KB]

(2009) Felipe Sánchez-Martínez & Andy Way: Marker-based filtering of bilingual phrase pairs for SMT. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.144-151. [PDF , 386KB]

(2009) Ruhi Sarikaya, Sameer Maskey, Rong Zhang & Ea-Ee Jan: Iterative sentence-pair extraction from quasi-parallel corpora for machine translation. Interspeech 2009: 10th Annual Conference of the International Speech Communication Association, 6-10 September 2009, Brighton, UK; abstract [PDF]

 (2009) Anders Søgaard: On the complexity of alignment problems in two synchronous grammar formalisms.  Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.60-68. [PDF, 161KB]

(2009) Anders Søgaard & Jonas Kuhn: Empirical lower bounds on alignment error rates in syntax-based machine translation.  Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.19-27. [PDF, 138KB]

(2009) Sanghoun Song & Francis Bond: Online search interface for the Sejong Korean-Japanese bilingual corpus and auto-interpretation of phrase alignment. ACL-IJCNLP 2009: Third Linguistic Annotation Workshop (LAW III), Proceedings of the workshop, 6-7 August 2009, Suntec, Singapore; pp.146-149. [PDF, 153KB]

(2009) Jun Sun, Min Zhang, & Chew Lim Tan: A non-contiguous tree sequence alignment-based model for statistical machine translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.914-922. [PDF, 459KB]

(2009) Masao Utiyama & Hitoshi Isahara: Mining patents for parallel corpora. In: Cyril Goutte, Nicola Cancedda, Marc Dymetman, & George Foster (eds.): Learning machine translation. (Cambridge, Mass.: The MIT Press, 2009); pp.41-58.

(2009) Balakrishnan Vardarajan & Delip Rao: Î-extension Hidden Markov Models and weighted transducers for machine transliteration. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.120-123. [PDF, 129KB]

(2009) Fan Yang, Jun Zhao, & Kang Liu: A Chinese-English organization name translation system using heuristic web mining and asymmetric alignment. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.387-395. [PDF, 178KB]

(2009) Ventsislav Zhechev: Unsupervised generation of parallel treebanks through sub-tree alignment. Prague Bulletin of Mathematical Linguistics 91, 2009; pp.89-98. [PDF, 478KB]  [presentation at MT Marathon 2009, 3281KB]

(2008) Necip Fazil Ayan, Jing Zheng, & Wen Wang: Improving alignments for better confusion networks for combining machine translation systems. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.33-40. [PDF, 228KB]

(2008) Alberto Barrón-Cedeño, Paolo Rosso, David Pinto, & Alfons Juan: On cross-lingual plagiarism analysis using a statistical model. ECAI 2008 Workshop on Uncovering Plagiarism, Authorship and Social Software Misuse, July 22, Patras, Greece; 5pp. [PDF, 132KB]

(2008) Marta R.Costa-jussà, José A.R.Fonollosa, & Enric Monte: Using reordering in statistical machine translation based on recursive alignment block classification.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 96KB]

(2008) John DeNero & Dan Klein: The complexity of phrase alignment problems. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Short papers, June 16-17, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 25-28. [PDF, 180KB]

(2008) John DeNero, Alexandre Bouchard-Côté, & Dan Klein: Sampling alignment structure under a Bayesian translation model.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.314-323. [PDF, 356KB]

(2008) Markus Dreyer, Jason R.Smith, & Jason Eisner: Latent-variable modeling of string transductions with finite-state methods.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.1080-1089. [PDF, 513KB]

(2008) Chris Dyer, Alex Mont, Aaron Cordova, & Jimmy Lin: SMT model building with MapReduce. Second Machine Translation Marathon, Wandlitz, Berlin, Germany, 12-17 May 2008; 91 slides. [PDF, 1850KB]

(2008) Mark Fishel & Heiki-Jaan Kaalep: Experiments on processing overlapping parallel corpora. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 5pp. [PDF, 186KB]

(2008) Surya Ganesh, Sree Harsha, Prasad Pingali, & Vasudeva Varma: Statistical transliteration for cross language information retrieval using HMM alignment and CRF. IJCNLP 2008: 2nd International Workshop on Cross-Lingual Information Access (CLIA) Proceedings of the workshop, 11 January 2008, Hyderabad, India; pp.42-47. [PDF, 127KB]

(2008) Xiaodong He, Mei Yang, Jianfeng Gao, Patrick Nguyen, & Robert Moore: Indirect HMM-based hypothesis alignment for combining outputs from machine translation systems.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.98-107. [PDF, 280KB]

(2008) Young-Sook Hwang, YoungKil Kim, & SangKyu Park: Paraphrasing depending on bilingual context toward generalization of translation knowledge. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.327-334. [PDF, 952KB]

(2008) Einav Itamar & Alon Itai: Using movie subtitles for creating a large-scale bilingual corpora. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 120KB]

(2008) Damianos Karakos, Jason Eisner, Sanjeev Khudanpur, & Markus Dreyer: Machine translation system combination using ITG-based alignments. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Short papers, June 16-17, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 81-84. [PDF, 65KB]

(2008) Adrien Lardilleux & Yves Lepage: Multilingual alignments by monolingual string differences.  Coling 2008:  22nd International Conference on Computational Linguistics, Posters and demonstrations, 18-22 August 2008, Manchester UK; pp.55-58. [PDF, 164KB]

(2008) Adrien Lardilleux & Yves Lepage: A truly multilingual, high coverage, accurate, yet simple, subsentential alignment method. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.125-132. [PDF, 633KB]

(2008) Yves Lepage, Adrien Lardilleux, Julien Gosme, & Jean-Luc Manguin: The GREYC machine translation system for the IWSLT 2008 evaluation campaign. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp. 39-45. [PDF, 184KB]; presentation [PDF, 142KB]

(2008) Yanjun Ma, John Tinsley, Hany Hassan, Jinhua Du, & Andy Way: Exploiting alignment techniques in MATREX: the DCU machine translation system for IWSLT 2008. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp. 26-33. [PDF, 198KB]; presentation [PDF, 559KB]

(2008) Lieve Macken, Els Lefever, & Veronique Hoste:  Linguistically-based sub-sentential alignment for terminology extraction from a bilingual automotive corpus. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.529-536. [PDF, 145KB]

(2008) Kazuaki Maeda, Xiaoyi Ma, & Stephanie Strassel: Creating sentence-aligned parallel text corpora from a large archive of potential parallel text using BITS and Champollion.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 68KB]

(2008) David Mareček, Zdeněk Žabokrtský, & Václav Novák: Automatic alignment of Czech and English deep syntactic dependency trees. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.104-113. [PDF, 586KB]

(2008) Toshiaki Nakazawa & Sadao Kurohashi: Linguistically-motivated tree-based probabilistic phrase alignment. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.163-171. [PDF, 603KB]

(2008) Daniel Ortiz-Martínez, Ismael García-Varea, & Francisco Casacuberta: Phrase-level alignment generation using a smoothed loglinear phrase-based statistical alignment model. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.160-169. [PDF, 536KB]

(2008) Antti-Veikko I. Rosti, Bing Zhang, Spyros Matsoukas, & Richard Schwartz: Incremental hypothesis alignment for building confusion networks with application to machine translation system combination. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.183-186. [PDF, 61KB]

(2008) Lei Shi & Ming Zhou: Improved sentence alignment on parallel web pages using a stochastic tree alignment model.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.505-513. [PDF, 483KB]

(2008) Anders Søgaard: Learning context-sensitive synchronous rules. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.170-175. [PDF, 525KB]

(2008) Zhiyi Song & Stephanie Strassel: Entity translation and alignment in the ACE-07 ET task. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 135KB]

(2008) Kathrin Spreyer, Jonas Kuhn, & Bettina Schrader: Identification of comparable argument-head relations in parallel corpora.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 138KB]

(2008) Jörg Tiedemann: Synchronizing translated movie subtitles. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 5pp. [PDF, 171KB]

(2008) Julia Trushkina, Lieve Macken, & Hans Paulussen: Sentence alignment in DPC: maximizing precision, minimizing human effort.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 52KB]

(2008) Clare R.Voss, Matthew Aguirre, Jeffrey Micher, Richard Chang, Jamal Laoudi, & Reginald Hobbs: Boosting performance of weak MT engines automatically: using MT output to align segments & build statistical post-editors. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.192-201. [PDF, 506KB]

(2008) Clare R. Voss, Jamal Laoudi, & Jeffrey Micher: Exploitation of an Arabic language resource for MT evaluation: using Buckwalter-based lookup tool to augment CMU alignment algorithm.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 246KB]

(2008) Min Zhang, Hongfei Jiang, Aiti Aw, Haizhou Li, Chew Lim Tan, & Sheng Li: A tree sequence alignment-based tree-to-tree translation model. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 559-567. [PDF, 317KB]

(2007) Fernando Diaz & Donald Metzler: Pseudo-aligned multilingual corpora. IJCAI-07: Twentieth International Joint conference on Artificial Intelligence, Hyderabad, India, 6-12 January 2007; pp.2727-2732. [PDF, 126KB]

(2007) Dayne Freitag & Shahram Khadivi: A sequence alignment model based on the averaged perceptron. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 238-247. [PDF, 180KB]

(2007) Mary Hearne, John Tinsley, Ventsislav Zhechev, & Andy Way: Capturing translational divergences with a statistical tree-to-tree aligner. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.85-94 [PDF, 394KB]; presentation [PDF, 462KB]

(2007) Sarvnaz Karimi, Falk Scholer, & Andrew Turpin: Collapsed consonant and vowel models: new approaches for English-Persian transliteration and back-transliteration.  ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 648-655 [PDF, 195KB]

(2007) Jae Dong Kim & Stephan Vogel: Iterative refinement of lexicon and phrasal alignment. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.281-288 [PDF, 121KB]

(2007) Tadashi Kumano, Hideki Tanaka, & Takenobu Tokunaga: Extracting phrasal alignments from comparable corpora by using joint probability SMT model. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.95-103 [PDF, 366KB]

(2007) Patrik Lambert, Rafael E.Banchs, & Josep M.Crego: Discriminative alignment training without annotated data for machine translation. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; Companion volume, pp.85-88 [PDF, 114KB]

(2007) Ding Liu & Daniel Gildea: Source-language features and maximum correlation training for machine translation evaluation. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.41-48 [PDF, 201KB]

(2007) Lieve Macken: Analysis of translational correspondence in view of sub-sentential alignment.  METIS-II Workshop: New Approaches to Machine Translation, Centre for Computational Linguistics, Katholieke Universiteit Leuven, Belgium, 11 January 2007; 9pp. [PDF, 182KB]

(2007) Toshiaki Nakazawa, Yu Kun, & Sadao Kurohashi: Structural phrase alignment based on consistency criteria. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.337-344 [PDF, 202KB]

(2007) Nasredine Semmar & Christian Fluhr: Arabic to French sentence alignment: exploration of a cross-language information retrieval approach. ACL 2007: proceedings of the Workshop on Computational Approaches to Semitic Languages: Common Issues and Resources, Prague, Czech Republic, 28 June 2007; pp. 73-80 [PDF, 156KB]

(2007) John Tinsley, Ventsislav Zhechev, Mary Hearne, & Andy Way: Robust language pair-independent sub-tree alignment. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.467-474 [PDF, 1310KB]

(2007) Akira Ushioda: Phrase alignment for integration of SMT and RBMT resources. MT Summit XI Workshop on patent translation, 11 September 2007, Copenhagen, Denmark; pp.8-12. [PDF, 66KB]

(2007) Akira Ushioda: Phrase alignment based on bilingual parsing. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.241-250 [PDF, 312KB]

(2007) Min Zhang, Hongfei Jiang, Ai Ti Aw, Jun Sun, Sheng Li, & Chew Lim Tan: A tree-to-tree alignment-based model for statistical machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.535-542 [PDF, 243KB]

(2007) Qibo Zhu, Diana Inkpen & Ash Asudeh: Automatic extraction of translations from web-based bilingual materials [abstract]. Machine Translation 21 (3), September 2007; pp.139-163.

(2006) Alexandru Ceauşu, Dan Ştefănescu, & Dan Tufiş: Acquis Communautaire sentence alignment using support vector machines.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.2134-2137 [PDF, 381KB]

(2006) Brooke Cowan, Ivona Kučerová, & Michael Collins: A discriminative model for tree-to-tree translation.  EMNLP-2006: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, Australia, July 2006; pp. 232-241. [PDF, 338KB]

(2006) Hal Daumé III & Daniel Marcu: Induction of word and phrase alignments for automatic document summarization. Computational Linguistics 31 (4), pp. 505-530. [PDF, 507KB]

(2006) John DeNero, Dan Gillick, James Zhang, & Dan Klein: Why generative phrase models underperform surface heuristics.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 31-38 [PDF, 393KB]

(2006)Yonggang Deng & William Byrne: MTTK: an alignment toolkit for statistical machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 265-268 [PDF, 135KB]

(2006) Adria de Gispert & José B.Mariño: Linguistic tuple segmentation in n-gram-based statistical machine translation. Interspeech 2006: ICSLP Ninth International Conference on  Spoken Language Processing, Pittsburgh, PA, USA, September 17-21, 2006, paper 1049; abstract [PDF, 90KB]

(2006) Silvia Hansen-Schirra, Stella Neumann, & Mihaela Vela : Multi-dimensional annotation and alignment in an English-German translation corpus. EACL-2006: Proceedings of the 5th Workshop on NLP and XML (NLPXML-2006): Multi-dimensional Markup in Natural Language Processing, April 4, 2006, Trento, Italy; pp.35-42. [PDF, 77KB]

(2006) Mark Hopkins & Jonas Kuhn: A framework for incorporating alignment information in parsing. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Cross-Language Knowledge Induction Workshop, Trento, Italy, April 3, 2006; pp.9-16 [PDF, 311 KB]

(2006) Radu Ion, Alexandru Ceauşu, & Dan Tufiş: Dependency-based phrase alignment.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.1290-1293 [PDF, 430KB]

(2006) Ding Liu & Daniel Gildea: Stochastic iterative alignment for machine translation evaluation.  Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.539-546. [PDF, 201KB]

(2006) Xiaoyi Ma: Champollion: a robust parallel text sentence aligner.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.489-492 [PDF, 313KB]

(2006) Arne Mauser, Evgeny Matusov, & Hermann Ney: Training a statistical machine translation system without GIZA++.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.714-720 [PDF, 325KB]

(2006) Karolina Owczarzak, Declan Groves, Josef Van Genabith, & Andy Way: Contextual bitext-derived paraphrases in automatic MT evaluation. HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 86-93 [PDF, 222KB]

(2006) Gábor Pohl: English-Hungarian NP alignment in MetaMorpho TM. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.69-74 [PDF, 81KB]

(2006) J.A.Sánchez & J.M.Benedí: Stochastic inversion transduction grammars for obtaining word phrases for phrase-based statistical machine translation. HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 130-133 [PDF, 91KB]

(2006) Bettina Schrader: ATLAS – a new text alignment architecture. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.715-722. [PDF, 139KB]

(2006) Nasredine Semmar & Christian Fluhr: Using cross-language information retrieval for sentence alignment.  The Challenge of Arabic for NLP/MT. International conference at the British Computer Society, London, 23 October 2006; pp.95-104. [PDF, 254KB]

(2006) David A. Smith & Jason Eisner: Quasi-synchronous grammars: alignment by soft projection of syntactic dependencies.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 23-30 [PDF, 207KB]

(2006) Dan Ştefănescu & Dan Tufiş: Aligning multilingual thesauri. LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.475-478 [PDF, 461KB]

(2006) Mihaela Vela & Silvia Hansen-Schirra: The use of multi-level annotation and alignment for the translator. Translating and the Computer 28: proceedings of the Twenty-eighth International Conference on Translating and the Computer, 16-17 November 2006, London. (London: Aslib, 2006); 19pp. [PDF, 187KB]

(2006) David Vilar, Maja Popovic, & Hermann Ney: AER: do we need to “improve” our alignments? International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2006], November 27-28, 2006, Kyoto, Japan; pp. 205-212 [PDF, 108KB]

(2006) Benjamin Wellington, Sonja Waxmonsky, & I.Dan Melamed: Empirical lower bounds on the complexity of translational equivalence. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.977-984. [PDF, 121KB]

(2006) Martin Volk, Sofia Gustafson-Capková, Joakim Lundborg, Torsten Marek, Yvonne Samuelsson, & Frida Tidström: XML-based phrase alignment in parallel treebanks. EACL-2006: Proceedings of the 5th Workshop on NLP and XML (NLPXML-2006): Multi-dimensional Markup in Natural Language Processing, April 4, 2006, Trento, Italy; pp.93-96. [PDF, 103KB]

(2006) Jia Xu, Richard Zens, & Hermann Ney: Partitioning parallel documents using binary segmentation. HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 78-85 [PDF, 433KB]

(2006) Hao Zhang & Daniel Gildea: Efficient search for inversion transduction grammar.  EMNLP-2006: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, Australia, July 2006; pp. 224-231. [PDF, 172KB]

(2005) proceedings of ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005

(2005) Yonggang Deng & William Byrne: HMM word and phrase alignment for statistical machine translation.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 169-176. [PDF, 189KB]

(2005) Emmanuel Giguet: Multi-grained alignment of parallel texts with endogenous resources. International workshop: Modern approaches in translation technologies, Borovets, Bulgaria, 24 September 2005; p.12-17 [PDF, 194KB]

(2005) Declan Groves & Andy Way: Hybrid data-driven models of machine translation [abstract]. Machine Translation 19 (3-4), 2005; pp. 301-323.

(2005) Sanjika Hewavitharana, Stephan Vogel, & Alex Waibel: Augmenting a statistical translation system with a translation memory. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 126132. [PDF, 74KB]

(2005) Jae Dong Kim, Ralf D. Brown, Peter J. Jansen, & Jaime G. Carbonell: Symmetric probabilistic alignment for example-based translation. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 153-159. [PDF, 63KB]

(2005) Philipp Koehn: Europarl: a parallel corpus for statistical machine translation. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.79-86. [PDF, 123KB]

(2005) Takeshi Kutsumi, Takehiko Yoshimi, Katsunori Kotani, Ichiko Sata, & Hitoshi Isahara: Selection of entries for a bilingual dictionary from aligned translation equivalents using support vector machines. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.11-16. [PDF, 253KB]

(2005) Jong-Hoon Oh & Key-Sun Choi: Automatic extraction of English-Korean translations for constituents of technical terms. IJCNLP-05: Second International Joint Conference on Natural Language Processing, 11-13 October 2005, Jeju Island, Republic of Korea; pp.67-72. [PDF, 226KB]

(2005) Stephan Vogel: PESA: phrase pair extraction as sentence splitting. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.251-258. [PDF, 192KB]

(2005) Jia Xu, Richard Zens, & Hermann Ney: Sentence segmentation using IBM word alignment model 1. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 280-287. [PDF, 145KB]

(2005) Hao Zhang & Daniel Gildea: Stochastic lexicalized inversion transduction grammar for alignment. ACL-2005: 43rd Annual meeting of the Association for Computational Linguistics, University of Michigan, Ann Arbor, 25-30 June 2005; pp. 4787-482. [PDF, 102KB]

(2005) Ying Zhang & Stephan Vogel: An efficient phrase-to-phrase alignment model for arbitrarily long phrase and large corpora. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 294-301. [PDF, 187KB]

(2005) Yujie Zhang, Kiyotaka Uchimoto, Qing Ma, & Hitoshi Isahara: Building an annotated Japanese-Chinese parallel corpus – a part of NICT multilingual corpora. IJCNLP-05: Second International Joint Conference on Natural Language Processing, 11-13 October 2005, Jeju Island, Republic of Korea; pp.85-90. [PDF, 936KB]

(2005) Yujie Zhang, Kiyotaka Uchimoto, Qing Ma, & Hitoshi Isahara: Building an annotated Japanese-Chinese parallel corpus – a part of NICT multilingual corpora. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.71-78. [PDF, 1139KB]

Analogies and analogical modelling (see also Example-based methods)

(2009) Philippe Langlais, François Yvon & Pierre Zweigenbaum: Improvements in analogical learning: application to translating multi-terms of the medical domain.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.487-495. [PDF, 172KB]

(2009) Harold Somers, Sandipan Dandapat, & Sundip Kumar Naskar: A review of EBMT using proportional analogies. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.53-60. [PDF, 337KB]

(2007) Etienne Denoual: Analogical translation of unknown words in a statistical machine translation framework. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.135-141 [PDF, 94KB]

(2007) Philippe Langlais & Alexandre Patry: Translating unknown words using analogical learning. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 877-886. [PDF, 155KB]

(2005) Yves Lepage & Etienne Denoual: ALEPH: an EBMT system based on the preservation of proportional analogies between sentences across langauges. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2005], 24-25 October, 2005, Pittsburgh, PA, USA; 8pp. [PDF, 388KB]

(2005) Yves Lepage & Etienne Denoual: Purest ever example-based machine translation: detailed presentation and assessment [abstract]. Machine Translation 19 (3-4), 2005; pp.251-282.

Analysis see Parsing; Semantic analysis; Syntactic analyis

Anaphora resolution

 (2007) Keh-Yi Su: [review of] Ruslan Mitkov, Anaphora resolution. Machine Translation 21 (2), June 2007; pp.135-137. [see publication]

(2005) Shigeko Nariyama, Eric Nichols, Francis Bond, Takaaki Tanaka, & Hiromi Nakaiwa: Extracting representative arguments from dictionaries for resolving zero pronouns. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.3-10. [PDF, 665KB]

Annotation

(2009) Enrique Amigó, Jesús Giménez, Julio Gonzalo, & Felisa Verdejo: The contribution of linguistic features to automatic machine translation evaluation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.306-314. [PDF, 518KB]

(2009) Ondřej Bojar & Zdeněk Žabokrtský: CzEng 0.9: large parallel treebank with rich annotation. Prague Bulletin of Mathematical Linguistics, no.92, December 2009; pp.63-83 [PDF, 217KB]

(2009) E.Boldrini, S.Ferrández, R.Izquierdo, D.Tomás, & J.L.Vicedo: A parallel corpus labeled using open and restricted domain ontologies [abstract]. CICLING 2009: 10th International Conference on Intelligent Text Processing and Computational Linguistics, Mexico City, Mexico, March 1-7, 2009; 1p. [PDF, 65KB]

(2009) Boxing Chen, George Foster, & Roland Kuhn: Phrase translation model enhanced with association based features. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.25-32. [PDF, 158KB]

(2009) David Chiang, Kevin Knight, & Wei Wang: 11,001 new features for statistical machine translation. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.218-226. [PDF, 114KB]

(2009) Jinho D.Choi, Martha Palmer, & Nianwen Xue: Using parallel Propbanks to enhance word-alignments. ACL-IJCNLP 2009: Third Linguistic Annotation Workshop (LAW III), Proceedings of the workshop, 6-7 August 2009, Suntec, Singapore; pp.121-124. [PDF, 139KB]

(2009) Jakob Elming, Nizar Habash, & Josep M.Crego: Combination of statistical word alignments based on multiple preprocessing schemes.  In: Cyril Goutte, Nicola Cancedda, Marc Dymetman, & George Foster (eds.) Learning machine translation. (Cambridge, Mass.: The MIT Press, 2009); pp.93-110.

(2009) David Farwell & Lluís Padró: FreeLing: from a multilingual open-source analyzer suite to an EBMT platform. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.37-43. [PDF, 372KB]; presentation [PDF of PPT, 331KB]

(2009) Christian Federmann, Silke Theison, Andreas Eisele, Hans Uszkoreit, Yu Chen, Michael Jellinghaus, & Sabine Hunsicker: Translation combination using factored word substitution.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.70-74. [PDF, 100KB]

(2009) Alexander Fraser, Renjing Wang, & Hinrich Schütze: Rich bitext projection features for parse reranking. EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.282-290. [PDF, 132KB]

(2009) Christian Hardmeier & Martin Volk: Using linguistic annotations in statistical machine translation of film subtitles. In: Kristina Jokinen and Eckhard Bick (eds.) Proceedings of the 17th Nordic Conference of Computational Linguistics (NODALIDA 2009), 15-16 May 2009, Odense, Denmark; pp.57-64. [PDF,  158KB]

(2009) Hieu Hoang & Philipp Koehn: Improving mid-range reordering using templates of factors. EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.372-379. [PDF, 184KB]

(2009) Michal Hrusecky, Tomas Caithaml, & Chris Dyer: Feature function overhaul. Third Machine Translation Marathon, Prague, Czech Republic, 26-30 January 2009; 13pp. [PDF, 88KB]

(2009) Philipp Koehn: Discriminative training and factored translation models. Third Machine Translation Marathon, Prague, Czech Republic, 26-30 January 2009; pp.69-81 [PDF, 75 slides, 530KB]

(2009) Germán Sanchis-Trilles, Mauro Cettolo, Nicola Bertoldi, & Marcello Federico: Online language model adaptation for spoken dialog translation. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 160-167. [PDF, 349KB]; presentation [PDF of PPT, 250KB]

(2009) Sanghoun Song & Francis Bond: Online search interface for the Sejong Korean-Japanese bilingual corpus and auto-interpretation of phrase alignment. ACL-IJCNLP 2009: Third Linguistic Annotation Workshop (LAW III), Proceedings of the workshop, 6-7 August 2009, Suntec, Singapore; pp.146-149. [PDF, 153KB]

(2009) Sara Stymne: A comparison of merging strategies for translation of German compounds. Proceedings of the EACL 2009 Student Research Workshop, Athens, Greece, 2 April 2009; pp.61-69. [PDF, 111KB]

(2008) Carmen Banea, Rada Mihalcea, Janyce Wiebe, & Samer Hassan: Multilingual subjectivity analysis using machine translation. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.127-135. [PDF, 15403KB]

(2008) David Chiang, Yuval Marton, & Philip Resnik: Online large-margin training of syntactic and structural translation features.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.224-233. [PDF, 162KB]

(2008) Emil Ettelaie, Panayiotis G.Georgiou, & Shrikanth S.Narayanan: Towards unsupervised training of the classifier-based speech translator. Interspeech 2008: 9th Annual Conference of the International Speech Communication Association, Brisbane, Australia, September 22-26, 2008; pp.2739-2742; abstract [PDF, 58KB]

(2008) Meghan Lammie Glenn, Stephanie Strassel, Lauren Friedman, & Haejoong Lee: Management of large annotation projects involving multiple human judges: a case study of GALE machine translation post-editing. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 144KB]

(2008) João Graça, Joana Paulo Pardal, Luisa Coheur, & Diamantino Caseiro: Building a golden collection of parallel multi-language word alignments. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 301KB]

(2008) Reginald Hobbs, Jamal Laoudi, & Clare R.Voss: MTriage: web-enabled software for the creation, machine translation, and annotation of smart documents.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 433KB]

(2008) Jin-Ji Li, Dong-Il Kim, & Jong-Hyeok Lee: Annotation guidelines for Chinese-Korean word alignment.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 195KB]

(2008) Dekang Lin, Shaojun Zhao, Benjamin Van Durme, & Marius Paşca: Mining parenthetical translations from the web by word alignment. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 994-1002. [PDF, 419KB]

(2008) Qun Liu, Zhongjun He, Yang Liu, & Shouxun Lin: Maximum entropy based rule selection model for syntax-based statistical machine translation.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.89-97. [PDF, 916KB]

(2008) Beáta Megyesi, Bengt Dahlqvist, Eva Pettersson, & Joakim Nivre: Swedish-Turkish parallel treebank. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 80KB]

(2008) Petya Osenova, Kiril Simov, & Eelco Mossel: Language resources for semantic document annotation and crosslingual retrieval.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 465KB]

(2008) Doaa Samy & Ana González-Ledesma: Pragmatic annotation of discourse markers in a multilingual parallel corpus (Arabic-Spanish-English).  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 301KB]

(2008) Virach Sornlertlamvanich, Thatsanee Charoenporn, Chumpol Mokarat, Hammam Riza, Hitoshi Isahara, & Purev Jaimal: Synset assignment for bi-lingual dictionary with limited resource. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.673-678. [PDF, 549KB]

(2008) Virach Sornlertlamvanich, Thatsanee Charoenporn, Suphanut Thayaboon, Chumpol Mokarat, & Hitoshi Isahara: Enhanced tools for online collaborative language resource development. IJCNLP 2008: Sixth Workshop on Asian Language Resources, Proceedings of the workshop, 11-12 January 2008, Hyderabad, India; pp.105-106. [PDF, 310KB]

(2008) Deyi Xiong, Min Zhang, Aiti Aw, & Haizhou Li: Linguistically annotated BTG for statistical machine translation. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.1009-1016. [PDF, 141KB]

(2008) Deyi Xiong, Min Zhang, Aiti Aw, & Haizhou Li: A linguistically annotated reordering model for BTG-based statistical machine translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Short papers, June 16-17, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 149-152. [PDF, 319KB]

(2008) Yujie Zhang, Zhulong Wang, Kiyotaka Uchimoto, Qing Ma, & Hitoshi Isahara: Word alignment annotation in a Japanese-Chinese parallel corpus.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 5pp. [PDF, 195KB]

(2008) Jingbo Zhu, Huizhen Wang, & Eduard Hovy: Learning a stopping criterion for active learning for word sense disambiguation and text classification. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.366-372. [PDF, 421KB]

(2007) Matthias Buch-Kromann: Computing translation units and quantifying parallelism in parallel dependency treebanks. ACL 2007: proceedings of the Linguistic Annotation Workshop, Prague, Czech Republic, 28-29 June 2007; pp.69-76 [PDF, 361KB]

(2007) Philipp Koehn & Hieu Hoang: Factored translation models. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 868-876. [PDF, 225KB]

(2007) Lluís Màrquez, Luis Villarejo, M.A.Martí, & Mariona Taulé: SemEval-2007 task 09: multilevel semantic annotation of Catalan and Spanish. ACL 2007: proceedings of the 4th International  Workshop on Semantic Evaluations (SemEval-2007), Prague, Czech Republic, 23-24 June 2007; pp.42-47 [PDF, 89KB]

(2007) Roser Morante & Bertjan Busser: ILK2: semantic role labelling for Catalan and Spanish using TiMBL. ACL 2007: proceedings of the 4th International  Workshop on Semantic Evaluations (SemEval-2007), Prague, Czech Republic, 23-24 June 2007; pp.183-186 [PDF, 120KB]

(2007) Martin Volk, Joakim Lundborg, & Maël Mettler: A search tool for parallel treebanks. ACL 2007: proceedings of the Linguistic Annotation Workshop, Prague, Czech Republic, 28-29 June 2007; pp.85-92 [PDF, 270KB]

(2006) Juri Apresjan, Igor Boguslavsky, Boris Iomdin, Leonid Iomdin, Andrei Sannikov, & Victor Sizov: A syntactically and semantically tagged corpus of Russian: state of the art and prospects.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.1378-1381 [PDF, 596KB]

(2006) Dan Flickinger: Identifying complex phenomena in a corpus via a treebank lens. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.125-129 [PDF, 89KB]

(2006) Silvia Hansen-Schirra, Stella Neumann, & Mihaela Vela : Multi-dimensional annotation and alignment in an English-German translation corpus. EACL-2006: Proceedings of the 5th Workshop on NLP and XML (NLPXML-2006): Multi-dimensional Markup in Natural Language Processing, April 4, 2006, Trento, Italy; pp.35-42. [PDF, 77KB]

(2006) Eduard Hovy, Mitchell Marcus, Martha Palmer, Lance Ramshaw & Ralph Weischedel: OntoNotes: the 90% solution.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 57-60 [PDF, 44KB]

(2006) Svetla Koeva, Svetlozara Lesseva, & Maria Todorova: Bulgarian sense tagged corpus.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.79-86. [PDF, 494KB]

(2006) Ivana Kruijff-Korbayová, Klára Chvátalová, & Oana Postolache: Annotation guidelines for Czech-English word alignment.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.1256-1261 [PDF, 608KB]

(2006) Mohamed Maamouri, Ann Bies, & Seth Kulick: Diacritization: a challenge to Arabic treebank annotation and parsing.  The Challenge of Arabic for NLP/MT. International conference at the British Computer Society, London, 23 October 2006; pp.35-47 [PDF, 223KB]

(2006) Márton Miháltz & Gábor Pohl: Exploiting parallel corpora for supervised word sense disambiguation in English-Hungarian machine translation.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.1294-1297 [PDF, 339KB]

(2006) Owen Rambow, Bonnie Dorr, David Farwell, Rebecca Green, Nizar Habash, Stephen Helmreich, Eduard Hovy, Lori Levin, Keith J.Miller, Teruko Mitamura, Florence Reeder, & Advaith Siddharthan: Parallel syntactic annotation of multiple languages.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.559-564 [PDF, 286KB]

(2006) E. Saquete, P.Martínez-Barco, R.Muñoz, M.Negri, M.Speranza, & R.Sprugnoli: Multilingual extension of a temporal expression normalizer using annotated corpora. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Cross-Language Knowledge Induction Workshop, Trento, Italy, April 3, 2006; pp.1-8 [PDF, 261 KB]

(2006) Serge Sharoff, Bogdan Babych, Paul Rayson, Olga Mudraya, & Scott Piao: ASSIST: automated semantic assistance for translators. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Posters and demonstrations, Trento, Italy, April 5-6, 2006; pp.139-142 [PDF, 69KB]

(2006) Mihaela Vela & Silvia Hansen-Schirra: The use of multi-level annotation and alignment for the translator. Translating and the Computer 28: proceedings of the Twenty-eighth International Conference on Translating and the Computer, 16-17 November 2006, London. (London: Aslib, 2006); 19pp. [PDF, 187KB]

(2006) Yuk Wah Wong & Raymond J. Mooney: Learning for semantic parsing with statistical machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 439-446 [PDF, 205KB]

(2005) Jesús Cardeñosa, Carolina Gallardo & Luis Iraola: An XML-UNL model for knowledge-based annotation. In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.300-308 [abstract, PDF, 99KB]

(2005) Youjin Chung & Jong-Hyeok Lee: Practical word-sense disambiguation using co-occurring concept codes [abstract]. Machine Translation 19 (1),2005; pp.59-82.

(2005) Jesús Giménez & Lluís Màrquez: Combining linguistic data views for phrase-based SMT. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 145-148. [PDF, 73KB]

(2005) Carol Nichols & Rebecca Hwa: Word alignment and cross-lingual resource acquisition. ACL-2005: Interactive Poster and Demonstration Sessions, University of Michigan, Ann Arbor, June 2005; pp. 69-72. [PDF, 331KB]

(2005) Yujie Zhang, Kiyotaka Uchimoto, Qing Ma, & Hitoshi Isahara: Building an annotated Japanese-Chinese parallel corpus – a part of NICT multilingual corpora. IJCNLP-05: Second International Joint Conference on Natural Language Processing, 11-13 October 2005, Jeju Island, Republic of Korea; pp.85-90. [PDF, 936KB]

(2005) Yujie Zhang, Kiyotaka Uchimoto, Qing Ma, & Hitoshi Isahara: Building an annotated Japanese-Chinese parallel corpus – a part of NICT multilingual corpora. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.71-78. [PDF, 1139KB]

Aspect

(2007) Yang Ye, Karl-Michael Schneider, & Steven Abney: Aspect marker generation in English-to-Chinese machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.521-527 [PDF, 129KB]

ASR see Speech recognition

Automatic evaluation see Evaluation measures and metrics

Bayesian methods

(2009) Adrià de Gispert, Sami Virpioja, Mikko Kurimo, & William Byrne: Minimum Bayes risk combination of translation hypotheses from alternative morphological decompositions.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.73-76. [PDF, 72KB]

(2009) John DeNero, David Chiang, & Kevin Knight: Fast consensus decoding over translation forests. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.567-575. [PDF, 650KB]

(2009) Shankar Kumar, Wolfgang Macherey, Chris Dyer & Franz Och: Efficient minimum error rate training and minimum Bayes-risk decoding for translation hypergraphs and lattices. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.163-171. [PDF, 342KB]

(2009) Zhifei Li, Jason Eisner, & Sanjeev Khudanpur: Variational decoding for statistical machine translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.593-601. [PDF, 217KB]

Belief representation

(2009) Thomas Deselaers, Saša Hasan, Oliver Bender, & Hermann Ney: A deep learning approach to machine transliteration.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.233-241. [PDF, 201KB]

Bibliography

  (2008) Chan Sin-wai: A topical bibliography of computer(-aided) translation. Hong Kong: Chinese University Press. xxxix, 1505pp.

(2007) Duoxiu Qian: [review of] Chan Sin-wai (ed.) A topical bibliography of computer (aided) translation. Machine Translation 21 (3), September 2007; pp.183-186. [see publication]

 (2005) Martina Schwanke: Bibliography on machine translation. Magdeburg, June 2005. [PDF, 360KB]

Bilingual corpora [see also Example-based methods, Multilingual corpora]

(2009) Sadaf Abdul-Rauf & Holger Schwenk: Exploiting comparable corpora with TER and TERp. [ACL-IJCNLP-2009] Proceedings of the 2nd Workshop on Building and Using Comparable Corpora, Suntec, Singapore, 6 August 2009; pp.46-54. [PDF, 186KB]

(2009) Sadaf Abdul-Rauf & Holger Schwenk: On the use of comparable corpora to improve SMT performance. EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.16-23. [PDF, 100KB]

(2009) R.Basili, D.De Cao, D.Croce, B.Coppola, & A.Moschitti: Cross-language frame semantic transfer in bilingual corpora [abstract]. CICLING 2009: 10th International Conference on Intelligent Text Processing and Computational Linguistics, Mexico City, Mexico, March 1-7, 2009; 1p. [PDF, 76KB]

(2009) Ondřej Bojar & Zdeněk Žabokrtský: CzEng 0.9: large parallel treebank with rich annotation. Prague Bulletin of Mathematical Linguistics, no.92, December 2009; pp.63-83 [PDF, 217KB]

(2009) E.Boldrini, S.Ferrández, R.Izquierdo, D.Tomás, & J.L.Vicedo: A parallel corpus labeled using open and restricted domain ontologies [abstract]. CICLING 2009: 10th International Conference on Intelligent Text Processing and Computational Linguistics, Mexico City, Mexico, March 1-7, 2009; 1p. [PDF, 65KB]

(2009) Budiono, Hammam Riza, & Charil Hakim: Resource report: building parallel text corpora for multi-domain translation system. ACL-IJCNLP-2009: 7th Workshop on Asian Language Resources (ALR-7), Proceedings of the workshop, 6-7 August 2009, Suntec, Singapore; pp. 92-95. [PDF, 243KB]

(2009) Han-Bin Chen, Jian-Cheng Wu & Jason S.Chang: Learning bilingual linguistic reordering model for statistical machine translation.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.254-262. [PDF, 333KB]

(2009) Ken Church: Repetition and language models and comparable corpora [abstract]. ACL-IJCNLP-2009: Proceedings of the 2nd Workshop on Building and Using Comparable Corpora, Suntec, Singapore, 6 August 2009; p.1. [PDF, 61KB]

(2009) Guy De Pauw, Peter Waiganjo Wagacha, & Gilles-Maurice de Schryver: The SAWA corpus: a parallel corpus English – Swahili. Proceedings of the EACL Workshop on Language Technologies for African Languages (AfLaT 2009), Athens, Greece, 31 March 2009; pp.9-16. [PDF, 129KB]

(2009) Thi-Ngoc-Diep Do, Viet-Bac Le, Brigitte Bigi, Laurent Besacier, & Eric Castelli: Mining a comparable text corpus for a Vietnamese-French statistical machine translation system.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.165-172. [PDF, 152KB]

(2009) Loic Dugast, Jean Senellart & Philipp Koehn: Selective addition of corpus-extracted phrasal lexical rules to a rule-based machine translation system. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.222-229. [PDF, 183KB]

(2009) Izaskun Fernandez, Iñaki Alegria, & Nerea Ezeiza: Using Wikipedia for named-entity translation. [SEPLN 2009] SALTMIL 2009, Donostia-San Sebastián, Spain. “Information retrieval and information extraction for less resourced languages”, Donostia-San Sebastián, September 7; pp.27-35. [PDF, 550KB]

(2009) Alexander Fraser, Renjing Wang, & Hinrich Schütze: Rich bitext projection features for parse reranking. EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.282-290. [PDF, 132KB]

(2009) Lorraine Goeuriot, Emmanuel Morin & Béatrice Daille: Compilation of specialized comparable corpora in French and Japanese. [ACL-IJCNLP-2009] Proceedings of the 2nd Workshop on Building and Using Comparable Corpora, Suntec, Singapore, 6 August 2009; pp.55-63. [PDF, 186KB]

(2009) Xiwu Han, Hanzhang Li, & Tiejun Zhao: Train the machine with what it can learn – corpus selection for SMT. [ACL-IJCNLP-2009] Proceedings of the 2nd Workshop on Building and Using Comparable Corpora, Suntec, Singapore, 6 August 2009; pp.27-33. [PDF, 184KB]

(2009) Liang Huang, Wenbin Jiang & Qun Liu: Bilingually-constrained (monolingual) shift-reduce parsing. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1222-1231. [PDF, 169KB]

 (2009) Tatsuya Ishisaka, Kazuhide Yamamoto, Masao Utiyama, & Eiichiro Sumita: Development of a Japanese-English software manual parallel corpus. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.254-259. [PDF, 87KB]

(2009) Heng Ji: Mining name translations from comparable corpora by creating bilingual information networks. [ACL-IJCNLP-2009] Proceedings of the 2nd Workshop on Building and Using Comparable Corpora, Suntec, Singapore, 6 August 2009; pp.34-47. [PDF, 120KB]

(2009) Long Jiang, Shiquan Yang, Ming Zhou, Xiaohua Liu, & Qingsheng Zhu: Mining bilingual data from the web with adaptively learnt patterns. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.870-878. [PDF, 306KB]

(2009) Jesse Saba Kirchner, Justin Nuger, & Yi Zhang: An extensible crosslinguistic readability framework. ACL-IJCNLP-2009: Proceedings of the 2nd Workshop on Building and Using Comparable Corpora, Suntec, Singapore, 6 August 2009; pp.11-18. [PDF, 215KB]

(2009) A.Kumaran, K.Saravanan, Naren Datha, B.Ashok, & Vikram Dendi: WikiBABEL: a Wiki-style platform for creation of parallel data. Proceedings of the ACL-IJCNLP 2009 Sofware Demonstrations, Suntec, Singapore, 3 August 2009; pp.29-32. [PDF, 769KB]

(2009) Bin Lu, Benjamin K.Tsou, Jingbo Zhu, Tao Jiang, & Oi Yee Kwong: The construction of a Chinese-English patent parallel corpus. MT Summit XII: Third Workshop on Patent Translation, August 30, 2009, Ottawa, Ontario, Canada; pp. 17-24. [PDF, 181KB]

(2009) Paul McNamee, James Mayfield, & Charles Nicholas: Translation corpus source and size in bilingual retrieval. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.25-28. [PDF, 177KB]

(2009) David Mareček: Analysis and alignment of parallel data in TectoMT. Third Machine Translation Marathon, Prague, Czech Republic, 26-30 January 2009; pp.57-58 [PDF, 122KB]

(2009) Preslav Nakov & Hwee Tou Ng: Improved statistical machine translation for resource-poor languages using related resource-rich languages. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1358-1367. [PDF, 209KB]

(2009) Sylwia Ozdowska & Andy Way: Optimal bilingual data for French-English PB-SMT. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.96-103. [PDF, 224KB]

(2009) Emmanuel Prochasson, Emmanuel Morin & Kyo Kageura: Anchor points for bilingual lexicon extraction from small comparable corpora. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.284-291. [PDF, 138KB]

(2009) Ibrahim M.Saleh & Nizar Habash: Automatic extraction of lemma-based bilingual dictionaries for morphologically rich languages. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 622KB]

(2009) Felipe Sánchez-Martínez & Mikel L.Forcada: Inferring shallow-transfer machine translation rules from small parallel corpora. Journal of Artificial Intelligence Research 34, pp. 605-635. [PDF, 706KB]

(2009) Ruhi Sarikaya, Sameer Maskey, Rong Zhang & Ea-Ee Jan: Iterative sentence-pair extraction from quasi-parallel corpora for machine translation. Interspeech 2009: 10th Annual Conference of the International Speech Communication Association, 6-10 September 2009, Brighton, UK; abstract [PDF]

(2009) R.Mahesh K.Sinha: Automated mining of names using parallel Hindi-English corpus. ACL-IJCNLP-2009: 7th Workshop on Asian Language Resources (ALR-7), Proceedings of the workshop, 6-7 August 2009, Suntec, Singapore; pp. 11-18. [PDF, 312KB]

(2009) R.Mahesh K.Sinha: Mining complex predicates in Hindi using a parallel Hindi-English corpus. [ACL-IJCNLP-2009] Proceedings of the 2009 Workshop on Multiword Expressions, ACL-IJCNLP 2009, Suntec, Singapore, 6 August 2009; pp.40-46. [PDF, 112KB]

(2009) Benjamin Snyder, Tahira Naseem, & Regina Barzilay: Unsupervised multilingual grammar induction. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.73-81. [PDF, 273KB]

(2009) Sanghoun Song & Francis Bond: Online search interface for the Sejong Korean-Japanese bilingual corpus and auto-interpretation of phrase alignment. ACL-IJCNLP 2009: Third Linguistic Annotation Workshop (LAW III), Proceedings of the workshop, 6-7 August 2009, Suntec, Singapore; pp.146-149. [PDF, 153KB]

(2009) John Tinsley & Andy Way: Automatically generated parallel treebanks and their exploitability in machine translation [abstract]. Machine Translation 23 (1), February 2009; pp.1-22.

(2009) Masao Utiyama & Hitoshi Isahara: Mining patents for parallel corpora. In: Cyril Goutte, Nicola Cancedda, Marc Dymetman, & George Foster (eds.) Learning machine translation. (Cambridge, Mass.: The MIT Press, 2009); pp.41-58.

(2009) Kun Yu & Junichi Tsujii: Bilingual dictionary extraction from Wikipedia. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 379-386. [PDF, 984KB]

(2009) Kun Yu & Junichi Tsujii: Extracting bilingual dictionary from comparable corpora with dependency heterogeneity. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.121-124. [PDF, 132KB]

(2009) Qibo Zhu, Diana Inkpen & Ash Asudeh: Inducing translations from officially published materials in Canadian government websites. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 176-183. [PDF, 167KB]

(2008) Takehashi Abekawa & Kyo Kageura: Constructing a corpus that indicates patterns of modification between draft and final translations by human translators. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 65KB]

(2008) Farag Ahmed & Andreas Nürnberger: Arabic/English word translation disambiguation using parallel corpora and matching  schemes.  EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.6-11. [PDF, 616KB]

(2008) Marianna Apidianaki: Translation-oriented word sense induction based on parallel corpora. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 232KB]

(2008) Bogdan Babych, Serge Sharoff, & Anthony Hartley: Generalising lexical translation strategies for MT using comparable corpora.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 5pp. [PDF, 412KB]

(2008) Ondřej Bojar, Miroslav Janiček, Zdeněk Žabokrtský, Pavel Češka, & Peter Beňa: CzEng 0.7: parallel corpus with community-supplied translations. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 73KB]

(2008) David Burkett & Dan Klein: Two languages are better than one (for syntactic parsing).  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.877-886. [PDF, 405KB]

(2008) Chris Callison-Burch: Syntactic constraints on paraphrases extracted from parallel corpora. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.196-205. [PDF, 187KB]

(2008) Jorge Civera & Alfons Juan-Ciscar: Bilingual text classification using the IBM 1 translation model.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 77KB]

(2008) Alain Désilets, Benoit Farley, Marta Stojanovic, & Geneviève Patenaude: WeBiText: building large heterogeneous translation memories from parallel web content. Translating and the Computer 30, 27-28 November 2008, London; 11pp. [PDF, 470KB]

(2008) Mark Fishel & Heiki-Jaan Kaalep: Experiments on processing overlapping parallel corpora. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 5pp. [PDF, 186KB]

(2008) Atsushi Fujii, Masao Utiyama, Mikio Yamamoto, & Takehito Utsuro: Toward the evaluation of machine translation using patent information.  AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.97-106. [PDF, 649KB]

(2008) Atsushi Fujii, Masao Utiyama, Mikio Yamamoto, & Takehito Utsuro: Producing a test collection for patent machine translation in the seventh NTCIR workshop.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 172KB]

(2008) Le An Ha, Gabriela Fernandez, Ruslan Mitkov, & Gloria Corpas: Mutual bilingual terminology extraction. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 87KB]

(2008) Olivier Hamon & Djamel Mostefa: The impact of reference quality on automatic MT evaluation.  Coling 2008:  22nd International Conference on Computational Linguistics, Posters and demonstrations, 18-22 August 2008, Manchester UK; pp.39-42. [PDF, 66KB]

(2008) Young-Sook Hwang, YoungKil Kim, & SangKyu Park: Paraphrasing depending on bilingual context toward generalization of translation knowledge. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.327-334. [PDF, 952KB]

(2008) Cong-Phap Huynh, Christian Boitet, & Hervé Blanchon: SECTra_w.1: an online collaborative system for evaluating, post-editing and presenting MT translation corpora.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 824KB]

(2008) Alon Lavie, Alok Parlikar, & Vamshi Ambati: Syntax-driven learning of sub-sentential translation equivalents and translation rules from parsed parallel corpora. Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.87-95. [PDF, 1610KB]

(2008) Bo Li & Juan Liu: Mining Chinese-English parallel corpora from the web. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.847-852. [PDF, 457KB]

(2008) Lieve Macken, Els Lefever, & Veronique Hoste:  Linguistically-based sub-sentential alignment for terminology extraction from a bilingual automotive corpus. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.529-536. [PDF, 145KB]

(2008) Elliott Macklovitch, Guy Lapalme, & Fabrizio Gotti: TransSearch: what are translators looking for?  AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.412-419. [PDF, 757KB]

(2008) Kazuaki Maeda, Xiaoyi Ma, & Stephanie Strassel: Creating sentence-aligned parallel text corpora from a large archive of potential parallel text using BITS and Champollion.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 68KB]

(2008) Michael Mohler & Rada Mihalcea: BABYLON parallel text builder: gathering parallel texts for low-density languages. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 63KB]

(2008) Rogelio Nazar, Leo Wanner, & Jorge Vivaldi: Two-step flow in bilingual lexicon extraction from unrelated corpora. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.140-149. [PDF, 537KB]

(2008) Achim Ruopp & Fei Xia: Finding parallel texts on the web using cross-language information retrieval.  IJCNLP 2008: 2nd International Workshop on Cross-Lingual Information Access (CLIA) Proceedings of the workshop, 11 January 2008, Hyderabad, India; pp.18-25. [PDF, 239KB]

(2008) K.Saravanan & A.Kumaran: Some experiments in mining named entitiy transliteration pairs from comparable corpora. IJCNLP 2008: 2nd International Workshop on Cross-Lingual Information Access (CLIA) Proceedings of the workshop, 11 January 2008, Hyderabad, India; pp.26-33. [PDF, 805KB]

(2008) Lei Shi & Ming Zhou: Improved sentence alignment on parallel web pages using a stochastic tree alignment model.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.505-513. [PDF, 483KB]

(2008) Kathrin Spreyer, Jonas Kuhn, & Bettina Schrader: Identification of comparable argument-head relations in parallel corpora.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 138KB]

(2008) Keiji Yasuda, Ruiqiang Zhang, Hirofumi Yamamoto, & Eiichiro Sumita: Method of selecting training data to build a compact and efficient translation model. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.655-660. [PDF, 447KB]

(2008) Shiqi Zhao, Haifeng Wang, Ting Liu, & Sheng Li: Pivot approach for extracting paraphrase patterns from bilingual corpora. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 780-788. [PDF, 188KB]

(2007) Julia Aymerich & Hermes Camelo: Automatic extraction of entries for a machine translation dictionary using bitexts. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.21-27 [PDF, 88KB]

(2007) Bogdan Babych, Anthony Hartley, & Serge Sharoff: A dynamic dictionary for discovering indirect translation equivalents.  Translating and the Computer 29. Proceedings of the twenty-ninth international conference on Translating and the Computer, 29-30 November 2007 (London: Aslib, 2007); 10pp. [PDF, 150KB]

(2007) Matthias Buch-Kromann: Breaking the barrier of context-freeness: towards a linguistically adequate probabilistic dependency model of parallel texts. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.31-40 [PDF, 432KB]; presentation [PDF, 1285KB]; presentation [PDF, 1285KB]

(2007) Matthias Buch-Kromann: Computing translation units and quantifying parallelism in parallel dependency treebanks. ACL 2007: proceedings of the Linguistic Annotation Workshop, Prague, Czech Republic, 28-29 June 2007; pp.69-76 [PDF, 361KB]

(2007) Gloria Corpas Pastor: Lost in specialised translation: the corpus as an inexpensive and under-exploited aid for language service providers. Translating and the Computer 29. Proceedings of the twenty-ninth international conference on Translating and the Computer, 29-30 November 2007 (London: Aslib, 2007); 18pp. [PDF, 159KB]

(2007) Gloria Corpas Pastor & Miriam Seghiri: Specialized corpora for translators: a quantitative method to determine representativeness. Translation Journal 11 (3), July 2007; 8pp. [PDF, 232KB]

 (2007) Alain Désilets, Caroline Barrière, & Jean Quirion: Making WikiMedia resources more useful for translators. Wikimania 2007: the international Wikimedia conference, Taipei, Taiwan, 30 August 2007; 27 slides [PDF, 704KB]

(2007) Pablo Gamallo Otero: Learning bilingual lexicons from comparable English and Spanish corpora. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.191-197 [PDF, 509KB]

(2007) Ulrich Germann: Two tools for creating and visualizing sub-sentential alignments of parallel text. ACL 2007: proceedings of the Linguistic Annotation Workshop, Prague, Czech Republic, 28-29 June 2007; pp.121-124 [PDF, 247KB]

(2007) Xiaoguang Hu, Haifeng Wang, & Hua Wu: Using RBMT systems to produce bilingual corpus for SMT. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 287-295. [PDF, 131KB]

(2007) Masaki Itagaki, Takako Aikawa, & Xiaodong He: Automatic validation of terminology translation consistency with statistical method.  MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.269-274 [PDF, 416KB]

(2007) Krzysztof Jassem & Tomasz Kowalski: Machine translation using scarce bilingual corpora. TASK Quarterly 11, no.1-2, 21-33. [PDF, 216KB]

(2007) J. Howard Johnson, Joel Martin, George Foster & Roland Kuhn: Improving translation quality by discarding most of the phrasetable. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 967-975. [PDF, 182KB]

(2007) Heiki-Jaan Kaalep & Kaarel Veskis: Comparing parallel corpora and evaluating their quality. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.275-279 [PDF, 164KB]

(2007) Caroline Lavecchia, Kamel Smaïli, & David Langlois: Building a bilingual dictionary from movie subtitles based on inter-lingual triggers. Translating and the Computer 29. Proceedings of the twenty-ninth international conference on Translating and the Computer, 29-30 November 2007 (London: Aslib, 2007); 19pp. [PDF, 139KB]

(2007) Caroline Lavecchia, Kamel Smaïli, David Langlois, & Jean-Paul Haton: Using inter-lingual triggers for machine translation. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2829-2832; abstract [PDF, 60KB]

(2007) Yajuan Lü, Jin Huang & Qun Liu: Improving statistical machine translation performance by training data selection and optimization. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 343-350. [PDF, 235KB]

(2007) Lieve Macken, Julia Trushkina, & Lidia Rura: Dutch parallel corpus: MT corpus and translator’s aid. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.313-320 [PDF, 132KB]

(2007) E.Morin, B.Daille, K.Takeuchi, & K.Kageura: Bilingual terminology mining – using brain, not brawn comparable corpora.  ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 664-671 [PDF, 143KB]

(2007) Jin’ichi Murakami, Masato Tokuhisa, & Satoru Ikehara: Statistical machine translation using large J/E parallel corpus and long phrase tables.  IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 6pp. [PDF, 69KB]; presentation [PDF, 304KB]

(2007) Hwe Tou Ng & Yee Seng Chan: SemEval-2007 task 11: English lexical sample task via English-Chinese parallel text. ACL 2007: proceedings of the 4th International  Workshop on Semantic Evaluations (SemEval-2007), Prague, Czech Republic, 23-24 June 2007; pp.54-58 [PDF, 84KB]

(2007) Chris Quirk, Raghavendra Udupa U., & Arul Menezes: Generative models of noisy translations with applications to parallel fragment extraction. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.377-384 [PDF, 249KB]

(2007) Monika Rosińska: Collecting Polish-German parallel corpora in the Internet. Proceedings of the International Multiconference on Computer Science and Information Technology, Wisla, Poland, 15-17 October 2007; pp.285-292. [PDF, 550KB]

(2007) Masao Utiyama & Hitoshi Isahara: A Japanese-English patent parallel corpus. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.475-482 [PDF, 113KB]

(2007) Vincent Vandeghinste: Removing the distinction between a translation memory, a bilingual dictionary and a parallel corpus. Translating and the Computer 29. Proceedings of the twenty-ninth international conference on Translating and the Computer, 29-30 November 2007 (London: Aslib, 2007); 21pp. [PDF, 90KB]

(2007) Antal van den Bosch, Nicolas Stroppa, & Andy Way: A memory-based classification approach to marker-based EBMT.  METIS-II Workshop: New Approaches to Machine Translation, Centre for Computational Linguistics, Katholieke Universiteit Leuven, Belgium, 11 January 2007; 10pp. [PDF, 251KB]

(2007) Martin Volk, Joakim Lundborg, & Maël Mettler: A search tool for parallel treebanks. ACL 2007: proceedings of the Linguistic Annotation Workshop, Prague, Czech Republic, 28-29 June 2007; pp.85-92 [PDF, 270KB]

(2007) Qibo Zhu, Diana Inkpen & Ash Asudeh: Automatic extraction of translations from web-based bilingual materials [abstract]. Machine Translation 21 (3), September 2007; pp.139-163.

(2006) Iñaki Alegria, Nerea Ezeiza, & Izaskun Fernandez: Named entities translation based on comparable corpora. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Workshop on Multi-word expressions in a Multilingual Context, Trento, Italy, April 3, 2006; pp.1-8 [PDF, 455KB]

(2006) Saba Amsalu: Data-driven Amharic-English bilingual lexicon acquisition . LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.281-286 [PDF, 366KB]

(2006) Marco Baroni, Adam Kilgarriff, Jan Pomikálek, & Pavel Rychlý: WebBootCaT: instant domain-specific corpora to support human translators. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.247-252 [PDF, 191KB]

(2006) Ondřej Bojar & Zdeněk Žabokrtský: CzEng: Czech-English parallel corpus: release version 0.5. Prague Bulletin of Mathematical Linguistics, no.86, 2006; pp.59-62. [PDF, 92KB]

(2006) Helena M.Caseli, Maria das Graças V.Nunes, & Mikel L.Forcada: Automatic induction of bilingual resources from aligned parallel corpora: application to shallow-transfer machine translation [abstract]. Machine Translation 20 (4),2006; pp.227-245.

(2006) A.Casillas, A. Díaz de Illarraza, J.Igartua, R. Martínez, & K. Sarasola: Compilation and structuring of a Spanish-Basque parallel corpus.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.55-58. [PDF, 172KB]

(2006) Lea Cyrus: Building a resource for studying translation shifts.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.697-702 [PDF, 358KB]

(2006) Andreas Eisele: Parallel corpora and phrase-based statistical machine translation for new language pairs via multiple intermediaries.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.845-848 [PDF, 329KB]

(2006) Tomaž Erjavec: The English-Slovene ACQUIS corpus. LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.2138-2141 [PDF, 365KB]

(2006) Dan Flickinger: Identifying complex phenomena in a corpus via a treebank lens. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.125-129 [PDF, 89KB]

(2006) Ken’ichi Fukushima, Kenjiro Taura, & Takashi Chikayama: A fast and accurate method for detecting English-Japanese parallel texts. Coling-ACL 2006: Proceedings of the Workshop on Multilingual Language Resources and Interoperability, Sydney, July 2006; pp.60-67. [PDF, 258KB]

(2006) Silvia Hansen-Schirra, Stella Neumann, & Mihaela Vela : Multi-dimensional annotation and alignment in an English-German translation corpus. EACL-2006: Proceedings of the 5th Workshop on NLP and XML (NLPXML-2006): Multi-dimensional Markup in Natural Language Processing, April 4, 2006, Trento, Italy; pp.35-42. [PDF, 77KB]

(2006) Krzysztof Jassem & Kowalski Tomasz: An algorithm for extracting translation rules from scarce bilingual corpora. Proceedings of the International Multiconference on Computer Science and Information Technology, vol.1: XXII Autumn Meeting of Polish Information Processing Society, November 6-10, 2006, Wisla, Poland; pp.67-73. [PDF, 368KB]

(2006) Beáta Bandmann Megyesi, Anna Sågvall Hein, & Éva Csató Johanson: Building a Swedish-Turkish parallel corpus.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.2130-2133 [PDF, 692KB]

(2006) Márton Miháltz & Gábor Pohl: Exploiting parallel corpora for supervised word sense disambiguation in English-Hungarian machine translation.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.1294-1297 [PDF, 339KB]

(2006) Dragos Stefan Munteanu & Daniel Marcu: Extracting parallel sub-sentential fragments from non-parallel corpora. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.81-88. [PDF, 1598KB]

(2006) Dragos Stefan Munteanu & Daniel Marcu: Improving machine translation performance by exploiting non-parallel corpora. Computational Linguistics 31 (4), pp. 477-504 [PDF, 1060KB]

(2006) G. Craig Murray, Bonnie J. Dorr, Jimmy Lin, Jan Hajič, & Pavel Pecina: Leveraging recurrent phrase structure in large-scale ontology translation. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.141-150 [PDF, 686KB]

(2006) Sylwia Ozdowska: Projecting POS tags and syntactic dependencies from English and French to Polish in aligned corpora. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Cross-Language Knowledge Induction Workshop, Trento, Italy, April 3, 2006; pp.53-60 [PDF, 366 KB]

(2006) Michael Paul & Eiichiro Sumita: Exploiting variant corpora for machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 113-116 [PDF, 86KB]

(2006) Viktor Pekar, Ruslan Mitkov, Dimitar Blagoev, & Andrea Mulloni: Finding translations for low-frequency words in comparable corpora [abstract].  Machine Translation 20 (4),2006; pp.247-266.

(2006) Alicia Pérez, Inés Torres, Francisco Casacuberta, & Víctor Guijarrubia: A Spanish-Basque weather forecast corpus for probabilistic speech translation.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.99-102. [PDF, 227KB]

(2006) Serge Sharoff, Bogdan Babych, & Anthony Hartley: Using comparable corpora to solve problems difficult for human translators. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.739-746. [PDF, 250KB]

(2006) Serge Sharoff, Bogdan Babych, & Anthony Hartley: Using collocations from comparable corpora to find translation equivalents.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.465-470 [PDF, 1104KB]

(2006) Serge Sharoff: Translation as problem-solving: uses of comparable corpora.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Third International Workshop on Language Resources for Translation Work, Research & Training (LR4Trans-III), Genoa, Italy, 28 May 2006; pp.23-28. [PDF, 914KB]

(2006) Tao Tao, Su-Youn Yoon, Andrew Fister, Richard Sproat, & ChengXiang Zhai: Unsupervised name entity transliteration using temporal and phonetic correlation.  EMNLP-2006: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, Australia, July 2006; pp. 250-257. [PDF, 139KB]

(2006) Gregor Thurmair: Using corpus information to improve MT quality. LREC-2006: Fifth International Conference on Language Resources and Evaluation. Third International Workshop on Language Resources for Translation Work, Research & Training (LR4Trans-III), Genoa, Italy, 28 May 2006; pp.45-48. [PDF, 371KB]

(2006) Hitomi Tohyama & Shigeki Matsubara: Collection of simultaneous interpreting patterns by using bilingual spoken monologue corpus. LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.2564-2569 [PDF, 552KB]

(2006) Mihaela Vela & Silvia Hansen-Schirra: The use of multi-level annotation and alignment for the translator. Translating and the Computer 28: proceedings of the Twenty-eighth International Conference on Translating and the Computer, 16-17 November 2006, London. (London: Aslib, 2006); 19pp. [PDF, 187KB]

(2006) Haifeng Wang, Hua Wu, & Zhanyi Liu: Word alignment for languages with scarce resources using bilingual corpora of other language pairs. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.874-881. [PDF, 155KB]

(2006) Xinglong Wang & David Martinez: Word sense disambiguation using automatically translated sense examples. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Cross-Language Knowledge Induction Workshop, Trento, Italy, April 3, 2006; pp.45-52 [PDF, 272 KB]

(2006) Benjamin Wellington, Sonja Waxmonsky, & I.Dan Melamed: Empirical lower bounds on the complexity of translational equivalence. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.977-984. [PDF, 121KB]

 (2006) Michael Wilkinson: Compiling corpora for use as translation resources. Translation Journal 10 (1), January 2006; 7pp. [PDF, 132KB]

(2006) Jia Xu, Richard Zens, & Hermann Ney: Partitioning parallel documents using binary segmentation. HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 78-85 [PDF, 433KB]

(2005) proceedings of ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005

(2005) Alison Alvarez, Lori Levin, Robert Frederking, Erik Peterson & Jeff Good: Semi-automated elicitation corpus generation . MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.388-395. [PDF, 131KB]

(2005) Naoki Asanoma, Setsuo Yamada, Osamu Furuse, & Masahiro Oku: Building a conversation corpus by text derivation from "germ dialogs". 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 27-32. [PDF, 82KB]

(2005) Colin Bannard & Chris Callison-Burch: Paraphrasing with bilingual parallel corpora. ACL-2005: 43rd Annual meeting of the Association for Computational Linguistics, University of Michigan, Ann Arbor, 25-30 June 2005; pp. 597-604. [PDF, 196KB]

(2005) Chris Brockett & William B.Dolan: Support vector machines for paraphrase identification and corpus construction. IJCNLP-05: Third International Workshop on Paraphrasing (IWP 2005). Proceedings of the workshop, 12 October 2005, Jeju Island, Korea; pp. 1-8. [PDF, 115KB]

(2005) Martin Čmejrek, Jan Cuřín, Jan Hajič, & Jiří Havelka: Prague Czech-English dependency treebank: resource for structure-based MT. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 73-78. [PDF, 66KB]

(2005) Etienne Denoual: The influence of example-data: homogeneity on EBMT quality MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Second Workshop on Example-Based Machine Translation; pp.35-42. [PDF, 404KB]

(2005) John Fry: Assembling a parallel corpus from RSS news feeds MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Second Workshop on Example-Based Machine Translation; pp.59-62. [PDF, 303KB]

(2005) Pablo Gamallo Otero: Extraction of translation equivalents from parallel corpora using sense-sensitive contexts. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 97-102. [PDF, 52KB]

(2005) Emmanuel Giguet: Multi-grained alignment of parallel texts with endogenous resources. International workshop: Modern approaches in translation technologies, Borovets, Bulgaria, 24 September 2005; p.12-17 [PDF, 194KB]

(2005) Ebba Gustavii: Target language preposition selection - an experiment with transformation based learning and aligned bilingual data. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 112-118. [PDF, 45KB]

(2005) Fei Huang, Ying Zhang, & Stephan Vogel: Mining key phrase translations from web corpora. HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 483-490. [PDF, 310KB]

 (2005) Zhenglin Jin & Caroline Barrière: Exploring sentence variations with bilingual corpora. Corpus Linguistics 2005. Birmingham, UK, 14-17 July 2005; 14pp. [PDF, 226KB]

(2005) Toshiyuki Kanamaru, Masaki Murata, Kow Kuroda, & Hitoshi Isahara: Obtaining Japanese lexical units for semantic frames from Berkeley FrameNet using a bilingual corpus. IJCNLP-05: Sixth International Workshop on Linguistically Interpreted Corpora (LINC 2005). Proceedings of the workshop, 15 October 2005, Jeju Island, Korea; pp. 11-20. [PDF, 106KB]

(2005) Chunyu Kit, Xiaoyue Liu, KingKui Siu, & Jonathan J.Webster: Harvesting the bitexts of the laws of Hong Kong from the web. IJCNLP-05: Fifth Workshop on Asian Language Resources (ALR-05). Proceedings of the workshop, 14 October 2005, Jeju Island, Korea; pp. 71-78. [PDF, 665KB]

(2005) Grzegorz Kondrak: Cognates and word alignment in bitexts. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.305-312. [PDF, 179KB]

(2005) Jonas Kuhn: Parsing word-aligned parallel corpora in a grammar induction context. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 17-25. [PDF, 150KB]

(2005) Yves Lepage & Etienne Denoual: The ‘purest’ EBMT system ever built: no variables, no templates, no training, examples, just examples, only examples MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Second Workshop on Example-Based Machine Translation; pp.81-90. [PDF, 400KB]

(2005) Karin Müller: Revealing phonological similarities between related languages from automatically generated parallel corpora.  ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 33-40. [PDF, 114KB]

(2005) Sylwia Ozdowska: Using bilingual dependencies to align words in English/French parallel corpora.  ACL-2005: Student Research Workshop, University of Michigan, Ann Arbor, June 2005; pp. 127-132. [PDF, 89KB]

(2005) Isamu Okada, Shinichiro Miyazawa, Kazunari Ishida, Nobuhiko Shimizu, & Toshizumi Ohta: Quality analysis of patent parallel corpus by the scale MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Workshop on Patent Translation; pp.29-34. [PDF, 82KB]

(2005) Sitthaa Phaholphinyo, Teerapong Modhiran, Nattapol Kritsuthikul, & Thepchai Supnithi: A practical of memory-based approach for improving accuracy of MT. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.41-46. [PDF, 297KB]

(2005) Masatsugu Tonoike, Mitsuhiro Kida, Toshihiro Takagi, Yauhiro Sasaki, Takehito Utsuro, & Satoshi Sato: Effect of domain-specific corpus in compositional translation estimation for technical terms. IJCNLP-05: Second International Joint Conference on Natural Language Processing, 11-13 October 2005, Jeju Island, Republic of Korea; pp.114-119. [PDF, 193KB]

(2005) Michael Wilkinson: Discovering translation equivalents in a tourism corpus by means of fuzzy searching. Translation Journal 9 (4), October 2005; 6pp. [PDF, 182KB]

(2005) Toon Witkam: A new road to automatic translation. [Utrecht: private publication, 26 November 2005]. 13pp. [PDF, 177KB]

(2005) Yujie Zhang, Kiyotaka Uchimoto, Qing Ma, & Hitoshi Isahara: Building an annotated Japanese-Chinese parallel corpus – a part of NICT multilingual corpora. IJCNLP-05: Second International Joint Conference on Natural Language Processing, 11-13 October 2005, Jeju Island, Republic of Korea; pp.85-90. [PDF, 936KB]

(2005) Yujie Zhang, Kiyotaka Uchimoto, Qing Ma, & Hitoshi Isahara: Building an annotated Japanese-Chinese parallel corpus – a part of NICT multilingual corpora. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.71-78. [PDF, 1139KB]

(2005) Yujie Zhang, Qun Liu, Qing Ma, & Hitoshi Isahara: A multi-aligner for Japanese-Chinese parallel corpora. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.133-140. [PDF, 420KB]

(2005) Bing Zhao & Alex Waibel: Learning a log-linear model with bilingual phrase-pair features for statistical machine translation. IJCNLP-05: Fourth SIGHAN  Workshop on Chinese Language Processing. Proceedings of the workshop, 14-15 October 2005, Jeju Island, Korea; pp. 79-86. [PDF, 1301KB]

Bi-text see Bilingual corpora

Book reviews

(2009) Paul Bennett: [review of] Andrew Radford, Martin Atkinson, David Britain, Harald Clahsen, Andrew Spencer: Linguistics: an introduction, 2nd ed. Machine Translation 23 (1), February 2009; pp.65-67. [see publication]

(2009) Phil Blunsom: Book review: Learning machine translation, [by] Cyril Goutte, Nicola Cancedda, Marc Dymetman, and George Foster (eds.). Computational Linguistics 35 (4), pp. 637-640 [PDF, 51KB]

(2009) Philipp Koehn: [review of] Cyril Goutte, Nicola Cancedda, Marc Dymetman, George Foster (eds.) Learning machine translation. Machine Translation 23 (4), November 2009; pp.269-271. [see publication]

(2009) Sara Morrissey: [review of] Yorick Wilks: Machine translation: its scope and limits. Machine Translation 23 (4), November 2009; pp.265-267. [see publication]

(2008) Joseba Abaitua: [review of] Khurshid Ahmad, Christopher Brewster, Mark Stevenson (eds.) Words and intelligence I: Selected papers by Yorick Wilks. Machine Translation 22 (3), September 2008; pp.175-180. [see publication]

(2008) Iñaki Alegria & Kepa Sarasola: [review of] Sylviane Cardey, Peter Greenfield, Séverine Vienney (eds.) Machine translation, controlled languages and specialised languages. Machine Translation 22 (4), December 2008; pp.259-261. [see publication]

(2007) Núria Bel: Review of Dubkjaer, Laila; Hemsen, Holmer; Minker, Wolfgang (eds.) Evaluation of text and speech systems.  Machine Translation 21 (1), March 2007; pp.73-76. [see publication]

(2007) Michael Cronin: [review of] Albert Branchadell and Lovell Margaret West (eds.) Less translated languages. Machine Translation 21 (1), March 2007; pp.69-71. [see publication]

(2007) Duoxiu Qian: [review of] Chan Sin-wai (ed.) A topical bibliography of computer (aided) translation. Machine Translation 21 (3), September 2007; pp.183-186. [see publication]

(2007) Horacio Rodríguez: [review of] Kornai, András, Mathematical linguistics. Machine Translation 21 (4),December 2007; pp.253-256. [see publication]

(2007) Keh-Yi Su: [review of] Ruslan Mitkov, Anaphora resolution. Machine Translation 21 (2), June 2007; pp.135-137. [see publication]

(2006) Aoife Cahill: [review of] Harry Bunt, John Carroll, Giorgio Satta (eds.), New developments in parsing technology. Machine Translation 20 (2), 2006; pp.139-142. [see publication]

(2006) Marcello Federico: [review of] Mark Johnson, Sanjeev P.Khudanpur, Mari Ostendorf and Roni Rosenfeld (eds.), Mathematical foundations of speech and language processing. Machine Translation 20 (3), 2006; pp.223-225. [see publication]

(2006) Daniel Hardt: [review of] Marjorie J.McShane, A theory of ellipsis. Machine Translation 20 (3), 2006; pp.221-222. [see publication]

(2006) Dorothy Kenny: [review of] Chiew Kin Quah, Translation and technology. Machine Translation 20 (4), 2006; pp.291-295. [see publication]

(2006) Dorothy Kenny: [review of] Uwe Reinke, Translation memories. Systeme – Konzepte – Linguistische Optimierung. Machine Translation 20 (4), 2006; pp.305-309. [see publication]

(2006) Tracy Holloway King: [review of] Hristo Georgiev, English algorithmic grammar. Machine Translation 20 (3),2006; pp.217-220. [see publication]

(2006) Thomas Koller: [review of] Emanuela Cresti and Massimo Moneglia (eds.), C-ORAL-ROM, Integrated reference corpora for spoken Romance languages. Machine Translation 20 (4), 2006; pp.297-300. [see publication]

(2006) Nicolas Stroppa: [review of] Walter Daelemans and Antal van den Bosch, Memory-based language processing. Machine Translation 20 (2), 2006; pp.143-145.  [see publication]

(2006) Joachim Wagner: [review of] Nadja Nesselhauf, Collocations in a learner corpus. Machine Translation 20 (4), 2006; pp.301-303. [see publication]

(2005) John Hutchins: Example-based machine translation: a review and commentary [abstract]. Machine Translation 19 (3-4), 2005; pp.197-211.

(2005) Derek Lewis: Books received. In: Machine Translation Review, issue 14: December 2005; pp.6-9.

Bootstrapping see index of applications

Bracketing transduction grammar see Transduction grammars

Bridge language see Pivot language

Capitalization see Written forms

Case-based methods

(2005) J.Akshay Iyer & Pushpak Bhattacharyya: Using semantic information to improve case retrieval in case-based reasoning systems. In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.347-358 [abstract, PDF, 15KB]

Case grammar and case frames

(2009) Eneko Agirre, Aitziber Atutxa, Gorka Labaka, Mikel Lersundi, Aingeru Mayor, & Kepa Sarasola: Use of rich linguistic information to translate prepositions and grammatical cases to Basque. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.58-65. [PDF, 340KB]

(2009) R.Basili, D.De Cao, D.Croce, B.Coppola, & A.Moschitti: Cross-language frame semantic transfer in bilingual corpora [abstract]. CICLING 2009: 10th International Conference on Intelligent Text Processing and Computational Linguistics, Mexico City, Mexico, March 1-7, 2009; 1p. [PDF, 76KB]

(2009) Karel Pala & Aleš Horák: Multilingual features of complex valency frames. (IIS-2009) Recent Advances in Intelligent Information Systems, ed. M.A.Klopotek, A. Przepiórkowski, S.T. Wierzchon, K. Trojanowski, Kraków, Poland, June 15-18, 2009; pp.41-49. [PDF, 145KB]

(2009) Ananthakrishnan Ramanathan, Hansraj Choudhary, Avishek Ghosh, & Pushpak Bhattacharyya: Case markers and morphology: addressing the crux of the fluency problem in English-Hindi SMT. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.800-808. [PDF, 221KB]

(2009) Dekai Wu & Pascale Fung: Can semantic role labeling improve SMT? EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.218-225. [PDF, 1141KB]

(2009) Shoichi Yokoyama & Masumi Okuyama: Translation disambiguation of patent sentences using case frames. MT Summit XII: Third Workshop on Patent Translation, August 30, 2009, Ottawa, Ontario, Canada; pp. 25-32. [PDF, 75KB]

(2007) Sanae Fujita & Francis Bond: A method of creating new valency entries [abstract]. Machine Translation 21 (1), March 2007; pp.1-28.

(2007) Pascale Fung, Zhaojun Wu, Yongsheng Yang, & Dekai Wu: Learning bilingual semantic frames: shallow semantic parsing vs. semantic role projection. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.75-84 [PDF, 477KB]

(2007) Kristina Toutanova & Hisami Suzuki: Generating case markers in machine translation. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.49-56 [PDF, 168KB]

(2005) Kuntal Dey & Pushpak Bhattacharyya: Universal Networking Language based analysis and generation for Bengali case structure constructs. In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.215-229 [abstract, PDF, 88KB]

(2005) Brian Murphy & Carl Vogel: A comparative evaluation of UNL participant relations using a five-language parallel corpus. In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.64-76 [abstract, PDF, 137KB]

(2005) Shigeko Nariyama, Eric Nichols, Francis Bond, Takaaki Tanaka, & Hiromi Nakaiwa: Extracting representative arguments from dictionaries for resolving zero pronouns. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.3-10. [PDF, 665KB]

Categorial grammar

(2009) Rejwanul Haque, Sudip Kumar Naskar, Yanjun Ma & Andy Way: Using supertags as source language context in SMT. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.234-241. [PDF, 247KB]

(2009) Hany Hassan, Khalil Sima’an, & Andy Way: A syntactified direct translation model with linear-time decoding. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1182-1191. [PDF, 163KB]

(2008) Prachya Boonkwan & Thepchai Supnithi: Memory-inductive categorical grammar: an approach to gap resolution in analytic-language translation. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.80-87. [PDF, 339KB]

(2007) Alexandra Birch, Miles Osborne, & Philipp Koehn: CCG supertags in factored statistical machine translation. ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 9-16 [PDF, 175KB]

(2007) Hany Hassan, Khalil Sima’an, & Andy Way: Supertagged phrase-based statistical machine translation.  ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 288-295 [PDF, 175KB]

(2007) Michael White, Rajakrishnan Rajkumar, & Scott Martin: Towards broad coverage surface realization with CCG. MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.22-30 [PDF, 2020KB]

Character alignment

(2009) Dipankar Bose & Sudeshna Sarkar: Learning multi character alignment rules and classification of training data for transliteration. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.61-64. [PDF, 289KB]

(2009) Jörg Tiedemann: Character-based PSMT for closely related languages. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.12-19 [PDF, 399KB]

Chart parsing

(2009) Martin Čmejrek, Bowen Zhou, & Bing Xiang: Enriching SCFG rules directly from efficient bilingual chart parsing. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 136-143. [PDF, 315KB]; presentation [PDF of PPT, 982KB]

(2009) Selçuk Köprü & Adnan Yazıcı: Lattice parsing to integrate speech recognition and rule-based machine translation.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.469-477. [PDF, 185KB]

(2009) Zhifei Li & Jason Eisner: First- and second-order expectation semirings with applications to minimum-risk training on translation forests. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.40-51. [PDF, 406KB]

(2009) Yang Liu, Yajuan Lü & Qun Liu: Improving tree-to-tree translation with packed forests.  [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.558-566. [PDF, 151KB]

(2009) Hui Zhang, Min Zhang, Haizhou Li, Aiti Aw, Chew Lim Tan: Forest-based tree sequence to string  translation model. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.172-180. [PDF, 254KB]

(2008) Haitao Mi, Liang Huang, & Qun Liu: Forest-based translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 192-199. [PDF, 185KB]

(2008) Haitao Mi & Liang Huang: Forest-based translation rule extraction.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.206-214. [PDF, 170KB]

(2007) Liang Huang & David Chiang: Forest rescoring: faster decoding with integrated language models. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 144-151 [PDF, 215KB]

(2007) Yang Liu, Yun Huang, Qun Liu, & Shouxun Lin: Forest-to-string statistical translation rules. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 704-711 [PDF, 198KB]

(2006) Andreas Zollmann & Ashish Venugopal: Syntax augmented machine translation via chart parsing.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 138-141 [PDF, 99KB]

Chunks and chunking see Segmentation

Closely related languages

(2009) Vishal Goyal & Gurpreet Singh Lehal: Evaluation of Hindi to Punjabi machine translation system. International Journal of Computer Science Issues, vol.4, no.1 (2009), pp.36-39. [PDF, 101KB]

(2009) Petr Homola, Vladislav Kuboň, & Jernej Vičič: Shallow transfer between Slavic languages. (IIS-2009) Recent Advances in Intelligent Information Systems, ed. M.A.Klopotek, A. Przepiórkowski, S.T. Wierzchon, K. Trojanowski, Kraków, Poland, June 15-18, 2009; pp.219-232. [PDF, 390KB]

(2009) Preslav Nakov & Hwee Tou Ng: Improved statistical machine translation for resource-poor languages using related resource-rich languages. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1358-1367. [PDF, 209KB]

(2009) R.Mahesh K.Sinha: Developing English-Urdu machine translation via Hindi. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 7pp. [PDF, 643KB]

(2009) Jörg Tiedemann: Character-based PSMT for closely related languages. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.12-19 [PDF, 399KB]

(2008) Peter A. Chew & Ahmed Abdelali: The effects of language relatedness on multilingual information retrieval: a case study with Indo-European and Semitic languages. IJCNLP 2008: 2nd International Workshop on Cross-Lingual Information Access (CLIA) Proceedings of the workshop, 11 January 2008, Hyderabad, India; pp.1-9. [PDF, 196KB]

(2008) Vishal Goyal & Gurpreet Singh Lehal: Comparative study of Hindi and Punjabi language scripts. Nepalese Linguistics, Journal of the Linguistics Society of Nepal, vol. 23 (2008), pp. 67-82. [PDF, 290KB]

(2008) Petr Homola: A distributed database for mobile NLP applications. ACL-08: HLT. Workshop on Mobile Language Processing. Proceedings of the workshop, June 20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 27-28. [PDF, 71KB]

(2008) Petr Homola & Vladislav Kuboň: Improving machine translation between closely related Romance languages . EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.72-77. [PDF, 527KB]

(2008) Gurpreet Singh Josan & Gurpreet Singh Lehal: A Punjabi to Hindi machine translation system. Coling 2008:  22nd International Conference on Computational Linguistics, Posters and demonstrations, 18-22 August 2008, Manchester UK; pp.157-160. [PDF, 190KB]

(2008) M.G.Abbas Malik, Christian Boitet, & Pushpak Bhattacharyya: Hindi Urdu machine transliteration using finite-state transducers.  Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.537-544. [PDF,774KB]

(2008) Jan Nouza, Jan Silovsky, Jindrich Zdansky, Petr Cerva, Martin Kroul, & Josef Chaloupka: Czech-to-Slovak adapted broadcast news transcription system. Interspeech 2008: 9th Annual Conference of the International Speech Communication Association, Brisbane, Australia, September 22-26, 2008; pp.2683-2686; abstract [PDF, 57KB]

(2008) Daniel Zeman & Philip Resnik: Cross-language parser adaptation between related languages. IJCNLP 2008: Workshop on NLP for Less Privileged Languages. Proceedings of the workshop, 11 January 2008, Hyderabad, India; pp.35-42. [PDF, 150KB]

(2007) Bogdan Babych, Anthony Hartley, & Serge Sharoff: Translating from under-resourced languages: comparing direct transfer against pivot translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.29-35 [PDF, 197KB]

(2007) A.Cüneyd Tantuğ, Eşref Adali, & Kemal Oflazer: A MT system from Turkmen to Turkish employing finite state and statistical methods. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.459-465 [PDF, 399KB]

(2007) A.Cüneyd Tantuğ, Eşref Adali, & Kemal Oflazer: Machine translation between Turkic languages. ACL 2007: proceedings of demo and poster sessions, Prague, Czech Republic, June 2007; pp. 189-192 [PDF, 135KB]

(2007) Jernej Vicic: Rapid development of RBMT systems for related languages. Translating and the Computer 29. Proceedings of the twenty-ninth international conference on Translating and the Computer, 29-30 November 2007 (London: Aslib, 2007); 12pp. [PDF, 154KB]

(2007) David Vilar, Jan-T. Peter, & Hermann Ney: Can we translate letters?  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 33-39 [PDF, 124KB]

(2006) Carme Armentano i Oller & Mikel L. Forcada: Open-source machine translation between small languages: Catalan and Aranese Occitan.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.51-54. [PDF, 62KB]

(2006) Boštan Dvořák, Petr Homola, & Vladislav Kuboň: Exploiting similarity in the MT into a minority language.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.59-64. [PDF, 379KB]

(2006) Brock Pytlik & David Yarowsky: Machine translation for languages lacking bitext via multilingual gloss transduction. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.156-165 [PDF, 267KB]

(2006) Kevin P. Scannell: Machine translation for closely related language pairs.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.103-107. [PDF, 139KB]

(2005) Antonio M.Corbi-Bellot, Mikel L. Forcada, Sergio Ortíz-Rojas, Juan Antonio Pérez-Ortiz, Gema Ramírez-Sánchez, Felipe Sánchez-Martínez, Iñaki Alegria, Aingeru Mayor, & Kepa Sarasola: An open-source shallow-transfer machine translation engine for the Romance languages of Spain. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 79-86. [PDF, 130KB]

(2005) Karin Müller: Revealing phonological similarities between related languages from automatically generated parallel corpora.  ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 33-40. [PDF, 114KB]

Code switching see Dialects and registers

Cognates

(2007) Ruslan Mitkov, Viktor Pekar, Dimitar Blagoev, & Andrea Mulloni: Methods for extracting and classifying pairs of cognates and false friends [abstract]. Machine Translation 21 (1), March 2007; pp.29-53.

(2007) Preslav Nakov & Marti Hearst: UCB system description for the WMT 2007 shared task. ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 212-215 [PDF, 129KB]

(2006) Jirka Hana, Anna Feldman, Chris Brew, & Luiz Amaral: Tagging Portuguese with a Spanish tagger using cognates. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Cross-Language Knowledge Induction Workshop, Trento, Italy, April 3, 2006; pp.33-40 [PDF, 253KB]

(2006) Charles Schafer: Novel probabilistic finite-state transducers for cognate and transliteration modeling. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.203-212 [PDF, 250KB]

(2005) Grzegorz Kondrak: Cognates and word alignment in bitexts. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.305-312. [PDF, 179KB]

Cohesion see Reference

Collocations see Multiword expressions

Combination systems (see also Multi-engine systems)

(2009) Takako Aikawa & Achim Ruopp: Chained system: a linear combination of different types of statistical machine translation systems. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.184-189. [PDF, 229KB]

(2009) Boxing Chen, Min Zhang, Haizhou Li & Aiti Aw: A comparative study of hypothesis alignment and its improvement for machine translation system combination. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.941-948. [PDF, 119KB]

(2009) Yi-Chang Chen & Chia-Ping Chen: A framework for machine translation output combination. ROCLING 2009: Proceedings of the 21st Conference on Computational Linguistics and Speech Processing, Taichung, Taiwan, 2009; pp.309-317. [PDF, 342KB]

(2009) Nan Duan, Mu Li, Tong Xiao, & Ming Zhou: The feature subspace method for SMT system combination. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1096-1104. [PDF, 416KB]

(2009) Christian Federmann, Silke Theison, Andreas Eisele, Hans Uszkoreit, Yu Chen, Michael Jellinghaus, & Sabine Hunsicker: Translation combination using factored word substitution.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.70-74. [PDF, 100KB]

(2009) Yang Feng, Yang Liu, Haitao Mi, Qun Liu, & Yajuan Lü: Lattice-based system combination for statistical machine translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1105-1113. [PDF, 161KB]

(2009) Adrià de Gispert, Sami Virpioja, Mikko Kurimo, & William Byrne: Minimum Bayes risk combination of translation hypotheses from alternative morphological decompositions.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.73-76. [PDF, 72KB]

(2009) Xiaodong He & Kristina Toutanova: Joint optimization for machine translation system combination. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1202-1211. [PDF, 337KB]

(2009) Kenneth Heafield, Greg Hanneman, & Alon Lavie: Machine translation system combination with flexible word ordering. Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.56-60. [PDF, 104KB]

(2009) Almut Silja Hildebrand & Stephan Vogel: CMU system combination for WMT’09. Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.47-50. [PDF, 101KB]

(2009) Maxim Khalilov & José A.R.Fonollosa: N-gram-based statistical machine translation versus syntax augmented machine translation: comparison and system combination.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.424-432. [PDF, 564KB]

(2009) Gregor Leusch, Evgeny Matusov, & Hermann Ney: The RWTH system combination system for WMT 2009.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.51-55. [PDF, 173KB]

(2009) Chi-Ho Li, Xiaodong He, Yupeng Liu, & Ning Xi: Incremental HMM alignment for MT system combination. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.949-957. [PDF, 559KB]

(2009) Yang Liu, Haitao Mi, Yang Feng, & Qun Liu: Joint decoding with multiple translation models. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.55-63. [PDF, 145KB]

(2009) Yong Zhao & Xiaodong He: Using n-gram based features for machine translation system combination.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.205-208. [PDF, 357KB]

(2009) NIST 2009 open machine translation evaluation (MT09). Official release of results: Combination tests . [NIST, 2009]; [HTML, 114KB]

(2008) Almut Silja Hildebrand & Stephan Vogel: Combination of machine translation systems via hypothesis selection from combined n-best lists. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.254-261 [PDF, 681KB]

Compositionality

(2008) Béatrice Daille & Emmanuel Morin: A effective compositional model for lexical alignment. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp 95-102. [PDF, 401KB]

(2008) Nikesh Garera & David Yarowsky: Translating compounds by learning component gloss translation models via multiple languages. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.403-410. [PDF, 497KB]

(2005) Masatsugu Tonoike, Mitsuhiro Kida, Toshihiro Takagi, Yauhiro Sasaki, Takehito Utsuro, & Satoshi Sato: Effect of domain-specific corpus in compositional translation estimation for technical terms. IJCNLP-05: Second International Joint Conference on Natural Language Processing, 11-13 October 2005, Jeju Island, Republic of Korea; pp.114-119. [PDF, 193KB]

Compound nouns see Nouns and noun phrases

Concatenation grammar see Transduction grammars

Concordances

 (2009) Stéphane Huet, Julien Bourdaillet, & Philippe Langlais: TS3: an improved version of the bilingual concordancer TransSearch. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.20-27 [PDF, 678KB]

(2008) Elliott Macklovitch, Guy Lapalme, & Fabrizio Gotti: TransSearch: what are translators looking for?  AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.412-419. [PDF, 757KB]

(2007) Gloria Corpas Pastor: Lost in specialised translation: the corpus as an inexpensive and under-exploited aid for language service providers. Translating and the Computer 29. Proceedings of the twenty-ninth international conference on Translating and the Computer, 29-30 November 2007 (London: Aslib, 2007); 18pp. [PDF, 159KB]

Conditional random fields

(2009) Eiji Aramaki & Takeshi Abekawa: Fast decoding and easy implementation: transliteration as sequential labeling. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.65-68. [PDF, 413KB]

(2009) Asif Ekbal & Sivajii Banyopadhyay: Voted NER system using appropriate unlabeled data. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.202-210. [PDF, 98KB]

(2009) Spence Green, Conal Sathi & Christopher D.Manning: NP subject detection in verb-initial Arabic clauses. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp.  [PDF, 729KB]

(2009) Matti Kääriäinen: Sinuhe – statistical machine translation using a globally trained conditional exponential family translation model. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1027-1036. [PDF, 159KB]

(2009) Mitesh M.Khapra & Pushpak Bhattacharyya: Improving transliteration accuracy using word-origin detection and lexicon lookup. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.84-87. [PDF, 114KB]

(2009) Sravana Reddy & Sonjia Waxmonsky: Substring-based transliteration with conditional random fields. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.92-95. [PDF, 90KB]

(2009) Praneeth Shishtla, Surya Ganesh V, Sethuramalingam Subramaniam, & Vasudeva Varma: A language-independent transliteration schema using character aligned models at NEWS 2009. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.40-43. [PDF, 114KB]

(2009) Dong Yang, Paul Dixon, Yi-Cheng Pan, Tasuku Oonishi, Masanobu Nakamura & Sadaoki Furui: Combining a two-step conditional random field and a joint source channel model for machine transliteration. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.72-75. [PDF, 110KB]

Confidence measures

(2009) Jennifer DeCamp: What is missing in user-centric MT? MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 489-485. [PDF, 454KB]

(2009) Fei Huang: Confidence measure for word alignment. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.932-940. [PDF, 727KB]

(2009) Gregor Leusch & Hermann Ney: Edit distances with block movements and error rate confidence estimates [abstract]. Machine Translation 23 (2/3), September 2009; pp.129-140.

(2009) Sylvain Raybaud, Caroline Lavecchia, David Langlois, & Kamel Smaïli: Word- and sentence-level confidence measures for machine translation. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.104-111. [PDF, 305KB]

(2009) Sylvain Raybaud, David Langlois & Kamel Smaili: Efficient combination of confidence measures for machine translation. Interspeech 2009: 10th Annual Conference of the International Speech Communication Association, 6-10 September 2009, Brighton, UK; abstract [PDF]

(2009) Kuniko Saito & Kenji Imamura: Tag confidence measure for semi-automatically updating. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.168-176. [PDF, 204KB]

(2009) Lucia Specia, Nicola Cancedda, Marc Dymetman, Marco Turchi, & Nello Cristianini: Estimating the sentence-level quality of machine translation systems. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.28-35 [PDF, 379KB]

(2009) Lucia Specia, Craig Saunders, Marco Turchi, Zhuoran Wang & John Shawe-Taylor: Improving the confidence of machine translation quality estimates. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.136-143. [PDF, 82KB]

(2009) Lucia Specia, Nicola Cancedda, Marc Dymetman, Craig Saunders, Marco Turchi, Nello Cristianini, Zhuoran Wang & John Shawe-Taylor: Sentence-level confidence estimation for MT. SMART Workshop at EACL 2009, Barcelona, Spain, 13 May 2009. 35 slides. [PDF of PPT, 549KB]

(2008) Nguyen Bach, Qin Gao, & Stephan Vogel: Improving word alignment with language model based confidence scores. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.151-154. [PDF, 607KB]

(2008) Youssef Kadri & Jian-Yun Nie: A comparative study for query translation using linear combination and confidence measure. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.181-188. [PDF, 429KB]

(2007) Alberto Sanchis, Alfons Juan, & Enrique Vidal: Estimation of confidence measures for machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.407-412 [PDF, 129KB]

(2007) Nicola Ueffing & Hermann Ney: Word-level confidence estimation for machine translation. Computational Linguistics 33 (1), pp. 9-40. [PDF, 279KB]

(2006) Toshiyuki Takezawa & Tohru Shimizu: Performance improvement of dialog speech translation by rejecting unreliable utterances. Interspeech 2006: ICSLP Ninth International Conference on  Spoken Language Processing, Pittsburgh, PA, USA, September 17-21, 2006, paper 1100; abstract [PDF, 78KB]

(2005) Nicola Ueffing & Hermann Ney: Word-class confidence estimation for machine translation using phrase-based translation models.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 763-770. [PDF, 336KB]

(2005) Nicola Ueffing & Hermann Ney: Application of word-level confidence measures in interactive statistical machine translation. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 262-270. [PDF, 173KB]

Confusion networks (see also Combination systems, Lattices and networks, Multi-engine systems)

(2009) Boxing Chen, Min Zhang, Haizhou Li & Aiti Aw: A comparative study of hypothesis alignment and its improvement for machine translation system combination. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.941-948. [PDF, 119KB]

(2009) Yang Feng, Yang Liu, Haitao Mi, Qun Liu, & Yajuan Lü: Lattice-based system combination for statistical machine translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1105-1113. [PDF, 161KB]

(2009) Xiaodong He & Kristina Toutanova: Joint optimization for machine translation system combination. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1202-1211. [PDF, 337KB]

(2009) Gregor Leusch, Evgeny Matusov, & Hermann Ney: The RWTH system combination system for WMT 2009.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.51-55. [PDF, 173KB]

(2009) Chi-Ho Li, Xiaodong He, Yupeng Liu, & Ning Xi: Incremental HMM alignment for MT system combination. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.949-957. [PDF, 559KB]

(2009) Zhifei Li & Sanjeev Khudanpur: Efficient extraction of oracle-best translations from hypergraphs.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.9-12. [PDF, 112KB]

(2009) Antti-Veikko I.Rosti, Bing Zhang, Spyros Matsoukas, & Richard Schwartz: Incremental hypothesis alignment with flexible matching for building confusion networks: BBN system description for WMT09 system combination task. Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.61-65. [PDF, 115KB]

(2009) Holger Schwenk, Loïc Barrault, Yannick Estève, & Patrik Lambert: LIUM’s statistical machine translation system for IWSLT 2009.  IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 65-70. [PDF, 344KB]; poster [PDF, 117KB]

(2009) Yong Zhao & Xiaodong He: Using n-gram based features for machine translation system combination.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.205-208. [PDF, 357KB]

(2008) Necip Fazil Ayan, Jing Zheng, & Wen Wang: Improving alignments for better confusion networks for combining machine translation systems. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.33-40. [PDF, 228KB]

(2008) Boxing Chen, Min Zhang, Aiti Aw, & Haizhou Li: Regenerating hypotheses for statistical machine translation. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.105-112. [PDF, 99KB]

(2008) Damianos Karakos, Jason Eisner, Sanjeev Khudanpur, & Markus Dreyer: Machine translation system combination using ITG-based alignments. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Short papers, June 16-17, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 81-84. [PDF, 65KB]

(2008) Gregor Leusch, Evgeny Matusov, & Hermann Ney: Complexity of finding the BLEU-optimal hypothesis in a confusion network.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.839-847. [PDF, 255KB]

(2008) Antti-Veikko I. Rosti, Bing Zhang, Spyros Matsoukas, & Richard Schwartz: Incremental hypothesis alignment for building confusion networks with application to machine translation system combination. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.183-186. [PDF, 61KB]

(2007) Nicola Bertoldi: Spoken language translation through confusion network decoding. First Machine Translation Marathon, Edinburgh, April 16-20, 2007; 32pp. [PDF of PPT presentation, 499KB]

(2007) Roldano Cattoni, Nicola Bertoldi, & Marcello Federico: Punctuating confusion networks for speech translation. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2453-2456; abstract [PDF, 23KB]

(2007) Daniele Falavigna, Nicola Bertoldi, Fabio Brugnara, Roldano Cattoni, Mauro Cettolo, Boxing Chen, Marcello Federico, Diego Giuliani, Roberto Gretter, Deepa Gupta, & Dino Seppi: The IRST English-Spanish translation system for European Parliament speeches. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2833-2836; abstract [PDF, 23KB]

Context-based methods

(2009) Jamie Brunning, Adrià de Gispert & William Byrne: Context-dependent alignment models for statistical machine translation.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.110-118. [PDF, 226KB]

(2009) Sander Canisius & Antal van den Bosch: A constraint satisfaction approach to machine translation. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.182-189. [PDF, 533KB]

(2009) Jaime Carbonell: New paradigms for machine translation. Translingual Europe 2009, May 13-14, Prague, Czech Republic; 35pp. [PDF of PPT, 462KB]

(2009) Marta R.Costa-jussà & Rafael E.Banchs: Barcelona Media SMT system description for the IWSLT 2009: introducing source context information. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 24-28. [PDF, 265KB]; poster [PDF, 281KB[

(2009) Jinhua Du, Yanjun Ma & Andy Way: Source-side context-informed hypothesis alignment for combining outputs from machine translation systems. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.230-27. [PDF, 277KB]

(2009) Rejwanul Haque, Sandipan Dandapat, Ankit Kumar Srivastava, Sudip Kumar Naskar & Andy Way: English-Hindi transliteration using context-informed PB-SMT: the DCU system for NEWS 2009. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.104-107. [PDF, 98KB]

(2009) Ruhi Sarikaya, Sameer Maskey, Rong Zhang & Ea-Ee Jan: Iterative sentence-pair extraction from quasi-parallel corpora for machine translation. Interspeech 2009: 10th Annual Conference of the International Speech Communication Association, 6-10 September 2009, Brighton, UK; abstract [PDF]

(2009) Libin Shen, Jinxi Xu, Bing Zhang, Spyros Matsoukas & Ralph Weischedel: Effective use of linguistic and contextual information for statistical machine translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.72-80. [PDF, 122KB]

(2009) Rie Tanaka, Yohei Murakami, & Toru Ishida: Context-based approach for pivot translation services. IJCAI-09: Twenty-first International Joint conference on Artificial Intelligence, Pasadena, Californai, USA, 11-17 July 2009; pp.1555-1561. [PDF, 505KB]

(2009) Maarten van Gompel, Antal van den Bosch, & Peter Berck: Extending memory-based machine translation to phrases. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.61-68. [PDF, 207KB]

(2009) Ying Zhang & Nguyen Bach: Virtual Babel: towards context-aware machine translation in virtual worlds. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 387-394. [PDF, 290KB]

(2008) Ralf D.Brown: Exploiting document-level context for data-driven machine translation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.46-55. [PDF, 496KB]

(2008) Maxine Carpuat & Dekai Wu: Evaluation of context-dependent phrasal translation lexicons for statistical machine translation. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 122KB]

(2008) Kevin Gimpel & Noah A. Smith: Rich source-side context for statistical machine translation. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.9-17. [PDF, 211KB]

(2007) Marine Carpuat & Dekai Wu: Context-dependent phrasal translation lexicons for statistical machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.73-80 [PDF, 425KB]

(2007) Jesús Giménez & Lluís Màrquez: Context-aware discriminative phrase selection for statistical machine translation.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 159-166 [PDF, 1046KB]

(2007) Nicolas Stroppa, Antal van den Bosch, & Andy Way: Exploiting source similarity for SMT using context-informed features. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.231-240 [PDF, 309KB]; presentation [PDF, 149KB]

(2006) Jaime Carbonell, Steve Klein, David Miller, Michael Steinbaum, Tomer Grassiany, & Jochen Frei: Context-based machine translation. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.19-28 [PDF, 338KB]

(2006) Andrei Popescu-Belis, Paula Estrella, Margaret King, & Nancy Underwood: A model for context-based evaluation of language processing systems and its application to machine translation evaluation.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.691-696 [PDF, 536KB]

(2005) Young-Sook Hwang & Yutaka Sasaki: Context-dependent SMT model using bilingual verb-noun collocation. ACL-2005: 43rd Annual meeting of the Association for Computational Linguistics, University of Michigan, Ann Arbor, 25-30 June 2005; pp. 549-556. [PDF, 1005KB]

Context-free grammars

(2009) Martin Čmejrek, Bowen Zhou, & Bing Xiang: Enriching SCFG rules directly from efficient bilingual chart parsing. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 136-143. [PDF, 315KB]; presentation [PDF of PPT, 982KB]

(2009) Liang Huang, Hao Zhang, David Gildea & Kevin Knight: Binarization of synchronous context-free grammars. Computational Linguistics 35 (4), pp. 559-595 [PDF, 837KB]

(2009) Hongfei Jiang, Muyun Yang, Tiejun Zhao, Sheng Li & Bo Wang: A statistical machine translation model based on a syntactic synchronous grammar. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Short Papers, Suntec, Singapore, 4 August 2009; pp.125-128. [PDF, 514KB]

(2009) Maxim Khalilov & José A.R.Fonollosa: N-gram-based statistical machine translation versus syntax augmented machine translation: comparison and system combination.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.424-432. [PDF, 564KB]

(2009) Shankar Kumar, Wolfgang Macherey, Chris Dyer & Franz Och: Efficient minimum error rate training and minimum Bayes-risk decoding for translation hypergraphs and lattices. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.163-171. [PDF, 342KB]

(2009) Anders Søgaard & Jonas Kuhn: Empirical lower bounds on alignment error rates in syntax-based machine translation.  Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.19-27. [PDF, 138KB]

(2009) Ashish Venugopal, Andreas Zollmann, Noah A.Smith, & Stephan Vogel: Preference grammars: softening syntactic constraints to improve statistical machine translation. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.37-45. [PDF, 212KB]

(2007) Ashish Venugopal, Andreas Zollmann, & Stephan Vogel: An efficient two-pass approach to synchronous-CFG driven statistical MT. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.500-507 [PDF, 279KB]

(2007) Francisco Oliveira, Fai Wong, Ka-Seng Leong, Chio-Kin Tong, & Ming-Chui Dong: Query translation for cross-language information retrieval by parsing constraint synchronous grammar. International Conference on Machine Learning and Cybernetics, 19-22, August, 2007, Hong Kong; 6pp. [PDF, 218KB]

(2007) Hao Zhang & Daniel Gildea: Factorization of synchronous context-free grammars in linear time. SSST, NAACL-HLT-2007 AMTA Workshop on  Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.25-32 [PDF, 151KB]

Conversation see Dialogue

Coordination

(2009) Karin Harbusch & Gerard Kempen: Generating clausal coordinate ellipsis multilingually: a uniform approach based on postediting. Proceedings of the 12th European Workshop on Natural Language Generation, March 30-31, 2009, Athens, Greece; pp.138-145. [PDF, 389KB]

(2009) Hideharu Okuma, Kazuo Hara, Masashi Shimbo, & Yuji Matsumoto: Bypassed alignment graph for learning coordination in Japanese sentences. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Short Papers, Suntec, Singapore, 4 August 2009; pp.5-8. [PDF, 474KB]

Corpora see Bilingual corpora, Monolingual corpora, Multilingual corpora

Corpus-based MT see Context-based methods, Example-based methods, Statistical MT

Cyrillic coding and transliteration

(2008) Karine Megerdoomian & Dan Parvaz: Low-density language bootstrapping: the case of Tajiki Persian.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 331KB]

Data elicitation

(2008) Jonathan H. Clark, Robert Frederking, & Lori Levin: Toward active learning in data selection: automatic discovery of language features during elicitation.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 141KB]

(2008) Christian Monson, Ariadna Font Llitjós, Vamshi Ambati, Lori Levin, Alon Lavie, Alison Alvarez, Roberto Aranovich, Jaime Carbonell, Robert Frederking, Erik Peterson, & Katharina Probst: Linguistic structure and bilingual informants help induce machine translation of lesser-resourced languages. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 141KB]

(2007) Alison Alvarez, Lori Levin, Robert Frederking, & Jill Lehman: An assessment of language elicitation without the supervision of a linguist. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.1-10 [PDF, 415KB]; presentation [PDF, 412KB]

(2005) Alison Alvarez, Lori Levin, Robert Frederking, Erik Peterson & Jeff Good: Semi-automated elicitation corpus generation . MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.388-395. [PDF, 131KB]

Data-oriented translation (see also Example-based methods)

(2009) Daniel Galron, Sergio Penkale, Andy Way, & I.Dan Melamed: Accuracy-based scoring for DOT: towards direct error minimization for data-oriented translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.371-380. [PDF, 164KB]

(2009) Vincent Vandeghinste: Tree-based target language modeling. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.152-159. [PDF, 391KB]

(2007) Rens Bod: Unsupervised syntax-based machine translation: the contribution of discontiguous phrases. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.51-56 [PDF, 87KB]

(2007) John Tinsley, Ventsislav Zhechev, Mary Hearne, & Andy Way: Robust language pair-independent sub-tree alignment. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.467-474 [PDF, 1310KB]

(2006) Mary Hearne & Andy Way: Disambiguation strategies for data-oriented translation. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.59-68 [PDF, 164KB]

Dates

(2009) Julien Gosme, Yves Lepage & Adrien Lardilleux: Translation of sublanguages by subgrammars. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.77-84. [PDF, 116KB]

Decomposition see Segmentation

Dependency relations

(2009) Nguyen Bach, Qin Gao, & Stephan Vogel: Source-side dependency tree reordering models and subtree movements and constraints. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 17-24. [PDF, 211KB]

(2009) Wanxiang Che, Zhenghua Li, Yongqiang Li, Yuhang Guo, Bing Qin, & Ting Liu: Multingual dependency-based syntactic and semantic parsing. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.49-54. [PDF, 633KB]

(2009) David Farwell & Lluís Padró: FreeLing: from a multilingual open-source analyzer suite to an EBMT platform. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.37-43. [PDF, 372KB]; presentation [PDF of PPT, 331KB]

(2009) Michel Galley & Christopher D.Manning: Quadratic-time dependency parsing for machine translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.773-781. [PDF, 235KB]

(2009) Nikesh Garera, Chris Callison-Burch & David Yarowsky: Improving translation lexicon induction from monolingual corpora via dependency contexts and part-of-speech equivalences. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning, June 4-5, 2009, Boulder, Colorado; pp.129-137. [PDF, 450KB]

(2009) Kevin Gimpel & Noah A.Smith: Feature-rich translation by quasi-synchronous lattice parsing. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.219-228. [PDF, 329KB]

(2009) Jan Hajič, Massimiliano Ciaramita, Richard Johansson, Daisuke Kawahara, Maria Antònia Marti, Lluís Màrquez, Adam Meyers, Joakim Nivre, Sebastian Padó, Jan Štěpánek, Pavel Straňák, Mihai Surdeanu, Nainwen Xue, & Yi Zhang: The CoNLL-2009 Shared task: syntactic and semantic dependencies in multiple languages. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.1-18. [PDF, 220KB]

(2009) Jan Hajič: Tree-based machine translation using syntax and semantics. Translingual Europe 2009, May 13-14, Prague, Czech Republic; 23pp. [PDF of PPT, 543KB]

(2009) Saša Hasan & Hermann Ney: Comparison of extended lexicon models in search and rescoring for SMT. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.17-20. [PDF, 167KB]

(2009) Hany Hassan, Khalil Sima’an, & Andy Way: A syntactified direct translation model with linear-time decoding. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1182-1191. [PDF, 163KB]

(2009) Yifan He & Andy Way: Learning labeled dependencies in machine translation evaluation. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.44-51. [PDF, 426KB]

(2009) Jeremy G.Kahn, Matthew Snover, & Mari Ostendorf: Expected dependency pair match: predicting translation quality with expected syntactic structure [abstract]. Machine Translation 23 (2/3), September 2009; pp.169-179.

(2009) Sadao Kurohashi: Fully syntactic example-based machine translation [abstract]. Invited talk [at] Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; p.1. [PDF, 22KB]; presentation [PDF of PPT, 2353KB]

(2009) Roser Morante, Vincent Van Asch, & Antal van den Bosch: Joint memory-based learning of syntactic and semantic dependencies in multiple languages. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.25-30. [PDF, 108KB]

(2009) Hwidong Na, Jin-Ji Li, Jungi Kim, & Jong-Hyeok Lee: Improving fluency by reordering target constituents using MST parser in English-to-Japanese phrase-based SMT. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.276-283. [PDF, 381KB]

(2009) Toshiaki Nakazawa & Sadao Kurohashi: Statistical phrase alignment model using dependency relation probability.  Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.10-18. [PDF, 670KB]

(2009) Martin Popel & Zdeněk Žabokrtský: Improving English-Czech tectogrammatical MT. Prague Bulletin of Mathematical Linguistics, no.92, December 2009; pp.115-134 [PDF, 265KB]

(2009) Han Ren, Donghong Ji, Jing Wan, & Mingyao Zhang: Parsing syntactic and semantic dependencies for multiple languages with a pipeline approach. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.97-102. [PDF, 228KB]

(2009) James Smith & Stephen Clark: EBMT for SMT: a new EBMT-SMT hybrid. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.3-10. [PDF, 144KB]; presentation [PDF of PPT, 564KB]

(2009) Ankit K.Srivastava & Andy Way: Using percolated dependencies for phrase extraction in SMT. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.316-323. [PDF, 91KB]

(2009) Oscar Täckström: Multilingual semantic parsing with a pipeline of linear classifiers. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.103-108. [PDF, 103KB]

(2009) Yotaro Watanabe, Masayuki Asahara, & Yuji Matsumoto: Multilingual syntactic-semantic dependency parsing with three-stage approximate max-margin linear models. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.103-108. [PDF, 727KB]

(2009) Deyi Xiong, Min Zhang, Aiti Aw & Haizhou Li: A source dependency model for statistical machine translation. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 371-378. [PDF, 87KB]

(2009) Peng Xu, Jaeho Kang, Michael Ringgard, & Franz Och: Using a dependency parser to improve SMT for subject-object-verb languages. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.245-253. [PDF, 273KB]

(2009) Kun Yu & Junichi Tsujii: Extracting bilingual dictionary from comparable corpora with dependency heterogeneity. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.121-124. [PDF, 132KB]

(2009) Zdeněk Žabokrtský & Martin Popel: Hidden Markov tree model in dependency-based machine translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Short Papers, Suntec, Singapore, 4 August 2009; pp.145-148. [PDF, 154KB]

(2009) Zdeněk Žabokrtský: TectoMT: software framework for developing MT systems (and other NLP applications). Third Machine Translation Marathon, Prague, Czech Republic, 26-30 January 2009; pp.51-56 [PDF, 570KB]

(2009) Sina Zarrieß & Jonas Kuhn: Exploiting translational correspondences for pattern-independent MWE identification.  [ACL-IJCNLP-2009] Proceedings of the 2009 Workshop on Multiword Expressions, ACL-IJCNLP 2009, Suntec, Singapore, 6 August 2009; pp.23-30. [PDF, 132KB]

(2009) Hai Zhao, Wenliang Chen, Jun’ichi Kazama, Kiyotaka Uchimoto, & Kentaro Torisawa: Multilingual dependency learning: exploiting rich features for tagging syntactic and semantic dependencies. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.61-66. [PDF, 364KB]

(2009) Hai Zhao, Yan Song, Chunyu Kit, & Guodong Zhou: Cross language dependency parsing using a bilingual lexicon. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.55-63. [PDF, 547KB]

(2009) Jing Zheng, Necip Fazil Ayan, Wen Wang, & David Burkett: Using syntax in large-scale audio document translation. Interspeech 2009: 10th Annual Conference of the International Speech Communication Association, 6-10 September 2009, Brighton, UK; abstract [PDF]

(2008) Kalika Bali, Sankaran Baskaran, & A.Kumaran: A dependency treelet-based phrasal SMT: evaluation and issues in English-Hindi language pair.  ICON-2008: 6th International Conference on Natural Language Processing, CDAC Pune, India, 20-22 December 2008; 10pp. [PDF, 292KB]

(2008) Ondřej Bojar: Problems of deep syntactic English-to-Czech MT. Second Machine Translation Marathon, Wandlitz, Berlin, Germany, 12-17 May 2008; 33 slides. [PDF, 248KB]

(2008) Ondřej Bojar & Adam Lopez: Tree-based translation. [tutorial handout at] Second Machine Translation Marathon, Wandlitz, Berlin, Germany, 12-17 May 2008; 7pp. [PDF, 248KB]

(2008) Mauro Cettolo, Marcello Federico, Daniele Pighin, & Nicola Bertoldi: Shallow-syntax phrase-based translation: joint versus factored string-to-chunk models. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp. 56-64. [PDF, 176KB]

(2008) Colin Cherry: Cohesive phrase-based decoding for statistical machine translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 72-80. [PDF, 179KB]

(2008) Yvette Graham & Josef van Genabith: Packed rules for automatic transfer-rule induction. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.57-65. [PDF, 1052KB]

(2008) Yuqing Guo, Josef van Genabith, & Haifeng Wang: Dependency-based n-gram models for general purpose sentence realisation. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.297-304. [PDF, 138KB]

(2008) Young-Sook Hwang, YoungKil Kim, & SangKyu Park: Paraphrasing depending on bilingual context toward generalization of translation knowledge. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.327-334. [PDF, 952KB]

(2008) Jason Katz-Brown & Michael Collins: Syntactic reordering in preprocessing for Japanese®English translation: MIT system description for NTCIR-7 patent translation task. Proceedings of NTCIR-7 Workshop Meeting, December 16-19, 2008, Tokyo, Japan; pp. 409-414. [PDF, 424KB]

(2008) Yanjun Ma, Sylwia Ozdowska, Yanli Sun, & Andy Way: Improving word alignment using syntactic dependencies. Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.69-77. [PDF, 181KB]

(2008) David Mareček, Zdeněk Žabokrtský, & Václav Novák: Automatic alignment of Czech and English deep syntactic dependency trees. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.104-113. [PDF, 586KB]

(2008) Aurélien Max, Rafik Makhloufi, & Philippe Langlais: Explorations in using grammatical dependencies for contextual phrase translation disambiguation. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.114-119. [PDF, 568KB]

(2008) Toshiaki Nakazawa & Sadao Kurohashi: Linguistically-motivated tree-based probabilistic phrase alignment. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.163-171. [PDF, 603KB]

(2008) Vassilina Nikoulina & Marc Dymetman: Experiments in discriminating phrase-based translations on the basis of syntactic coupling features. Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.55-60. [PDF, 98KB]

(2008) Joakim Nivre, Igor M.Boguslavsky, & Leonid L.Iomdin: Parsing the SynTagRus treebank of Russian.  Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.641-648. [PDF, 266KB]

(2008) Matt Post & Daniel Gildea: Parsers as language models for statistical machine translation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.172-181. [PDF, 538KB]

(2008) Libin Shen, Jinxi Xu, & Ralph Weischedel: A new string-to-dependency machine translation algorithm with a target dependency language model. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 577-585. [PDF, 147KB]

(2008) Zdeněk Žabokrtský, Jan Ptáček, & Petr Pajas: TectoMT: highly modular MT system with tectogrammatics used as transfer layer. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.167-170. [PDF, 136KB]

(2008) Daniel Zeman & Philip Resnik: Cross-language parser adaptation between related languages. IJCNLP 2008: Workshop on NLP for Less Privileged Languages. Proceedings of the workshop, 11 January 2008, Hyderabad, India; pp.35-42. [PDF, 150KB]

(2007) Eckard Bick: Dan2eng: wide-coverage Danish-English machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.37-43 [PDF, 621KB]

(2007) Bernd Bohnet: The induction and evaluation of word order rules using corpora based on the two concepts of topological models. MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.38-45 [PDF, 1835KB]

(2007) Matthias Buch-Kromann: Breaking the barrier of context-freeness: towards a linguistically adequate probabilistic dependency model of parallel texts. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.31-40 [PDF, 432KB]; presentation [PDF, 1285KB]

(2007) Matthias Buch-Kromann: Computing translation units and quantifying parallelism in parallel dependency treebanks. ACL 2007: proceedings of the Linguistic Annotation Workshop, Prague, Czech Republic, 28-29 June 2007; pp.69-76 [PDF, 361KB]

(2007) Pi-Chuan Chang & Kristina Toutanova: A discriminative syntactic word order model for machine translation. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 9-16 [PDF, 235KB]

(2007) Nizar Habash: Syntactic preprocessing for statistical machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.215-222 [PDF, 118KB]

(2007) Irene Langkilde-Geary: Declarative syntactic processing of natural language using concurrent constraint programming and probabilistic dependency modeling. MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.55-63 [PDF, 1969KB]

(2007) Dennis N. Mehay & Chris Brew: BLEUÂTRE: flattening syntactic dependencies for MT evaluation. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.122-131 [PDF, 370KB]; presentation [PDF, 802KB]

(2007) Arul Menezes & Chris Quirk: Using dependency order templates to improve generality in translation.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 1-8 [PDF, 153KB]

(2007) Toshiaki Nakazawa, Yu Kun, & Sadao Kurohashi: Structural phrase alignment based on consistency criteria. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.337-344 [PDF, 202KB]

(2007) Joakim Nivre: Data-driven dependency parsing across languages and domains: perspectives from the CoNLL 2007 Shared Task. IWPT 2007: Proceedings of the 10th International Conference on Parsing Technologies, June 23-24, 2007, Prague, Czech Republic; pp.168-170. [PDF, 69KB]

(2007) Karolina Owczarzak & Josef van Genabith: Evaluating machine translation with LFG dependencies [abstract]. Machine Translation 21 (2), June 2007; pp.95-119.

(2007) Karolina Owczarzak, Josef van Genabith, & Andy Way: Labelled dependencies in machine translation evaluation.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 104-111 [PDF, 239KB]

(2007) Karolina Owczarzak, Josef van Genabith, & Andy Way: Dependency-based automatic evaluation for machine translation. SSST, NAACL-HLT-2007 AMTA Workshop on Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.80-87 [PDF, 245KB]

(2007) Deyi Xiong, Qun Liu, & Shouxun Lin: A dependency treelet string correspondence model for statistical machine translation.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 40-47 [PDF, 411KB]

(2006) Yafa Al-Raheb, A.Akrout, J. van Genabith, & J. Dichy: DCU 250 Arabic dependency bank: an LFG gold standrad resource for the Arabic Penn treebank.  The Challenge of Arabic for NLP/MT. International conference at the British Computer Society, London, 23 October 2006; pp.105-116. [PDF, 355KB]

(2006) Juri Apresjan, Igor Boguslavsky, Boris Iomdin, Leonid Iomdin, Andrei Sannikov, & Victor Sizov: A syntactically and semantically tagged corpus of Russian: state of the art and prospects.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.1378-1381 [PDF, 596KB]

(2006) Giuseppe Attardi: Experiments with a multilanguage non-projective dependency parser. CoNLL-X: Proceedings of the 10th Conference on Computational Natural Language Learning, New York City, June 2006; pp.166-170. [PDF, 104KB]

(2006) Sabine Buchholz & Erwin Marsi: CoNLL-X shared task on multilingual dependency parsing. CoNLL-X: Proceedings of the 10th Conference on Computational Natural Language Learning, New York City, June 2006; pp.149-164. [PDF, 158KB]

(2006) Yuchang Cheng, Masayuki Asahara & Yuji Matsumoto: Multi-lingual dependency parsing at NAIST.  CoNLL-X: Proceedings of the 10th Conference on Computational Natural Language Learning, New York City, June 2006; pp.191-195. [PDF, 192KB]

(2006) Colin Cherry & Dekang Lin: Soft syntactic constraints for word alignment through discriminative training. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.105-112. [PDF, 153KB]

(2006) Colin Cherry & Dekang Lin: A comparison of syntactically motivated word alignment spaces. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Trento, Italy, April 3-7, 2006; pp.145-152 [PDF, 150KB]

(2006) Simon Corston-Oliver & Anthony Aue: Dependency parsing with reference to Slovene, Spanish and Swedish. CoNLL-X: Proceedings of the 10th Conference on Computational Natural Language Learning, New York City, June 2006; pp.196-200. [PDF, 286KB]

(2006) Simon Corston-Oliver, Anthony Aue, Kevin Duh & Eric Ringger: Multilingual dependency parsing using Bayes point machines.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 160-167 [PDF, 178KB]

(2006) Yuan Ding & Martha Palmer: Better learning and decoding for syntax based SMT using PSDIG. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.37-45 [PDF, 391KB]

(2006) Eva Hajičová: Old linguists never die, they only get obligatorily deleted. Computational Linguistics 32 (4), pp. 457-469. [PDF, 98KB]

(2006) Saša Hasan, Oliver Bender, & Hermann Ney: Reranking translation hypotheses using structural properties.  EACL-2006: Proceedings of the Workshop on Learning Structured Informationa in Natural Language Applications, April 3, 2006, Trento, Italy; pp.41-48. [PDF, 100KB]

(2006) Radu Ion, Alexandru Ceauşu, & Dan Tufiş: Dependency-based phrase alignment.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.1290-1293 [PDF, 430KB]

(2006) Richard Johansson & Pierre Nugues: Investigating multilingual dependency parsing. CoNLL-X: Proceedings of the 10th Conference on Computational Natural Language Learning, New York City, June 2006; pp.206-210. [PDF, 75KB]

(2006) Philippe Langlais & Fabrizio Gotti: EBMT by tree-phrasing [abstract]. Machine Translation 20 (1),2006; pp.1-23.

(2006) Ryan McDonald, Kevin Lerman, & Fernando Pereira: Multilingual dependency analysis with a two-stage discriminative parser. CoNLL-X: Proceedings of the 10th Conference on Computational Natural Language Learning, New York City, June 2006; pp.216-220. [PDF, 79KB]

(2006) Arul Menezes, Kristina Toutanova, & Chris Quirk: Microsoft Research treelet translation system: NAACL 2006 Europarl evaluation.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 158-161 [PDF, 676KB]

(2006) Sylwia Ozdowska: Projecting POS tags and syntactic dependencies from English and French to Polish in aligned corpora. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Cross-Language Knowledge Induction Workshop, Trento, Italy, April 3, 2006; pp.53-60 [PDF, 366 KB]

(2006) Christopher Quirk & Arul Menezes: Dependency treelet translation: the convergence of statistical and example-based machine translation? [abstract] Machine Translation 20 (1),2006; pp.43-65.

(2006) Chris Quirk & Simon Corston-Oliver: The impact of parse quality on syntactically-informed statistical machine translation.  EMNLP-2006: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, Australia, July 2006; pp. 62-69. [PDF, 203KB]

(2006) Sebastian Riedel, Ruket Ҫakıcı, & Ivan Meza-Ruiz: Multi-lingual dependency parsing with incremental integer linear programming. CoNLL-X: Proceedings of the 10th Conference on Computational Natural Language Learning, New York City, June 2006; pp.226-230. [PDF, 72KB]

(2006) Koichiro Ryu, Shigeki Matsubara, & Yasuyoshi Inagaki: Simultaneous English-Japanese spoken language translation based on incremental dependency parsing and transfer. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.683-690. [PDF, 420KB]

(2006) David A. Smith & Jason Eisner: Quasi-synchronous grammars: alignment by soft projection of syntactic dependencies.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 23-30 [PDF, 207KB]

(2006) Otakar Smrž: Tips and tricks of the Prague Arabic dependency treebank.  The Challenge of Arabic for NLP/MT. International conference at the British Computer Society, London, 23 October 2006; pp.25-34. [PDF, ]

(2006) Toon Witkam: History and heritage of the DLT (Distributed Language Technology) project. [Utrecht: private publication, 2006]. 11pp. [PDF, 170KB]

(2006) Hao Zhang & Daniel Gildea: Inducing word alignments with bilexical synchronous trees. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.953-960. [PDF, 156KB]

(2005) Etienne Blanc: About and around the French enconverter and the French deconverter. In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.157-166 [abstract, PDF, 21KB]

(2005) Ondřej Bojar & Jan Hajič: Extracting translations verb frames. International workshop: Modern approaches in translation technologies, Borovets, Bulgaria, 24 September 2005; p.2-6 [PDF, 207KB]

(2005) Yuan Ding & Martha Palmer: Machine translation using probabilistic synchronous dependency insertion grammars.  ACL-2005: 43rd Annual meeting of the Association for Computational Linguistics, University of Michigan, Ann Arbor, 25-30 June 2005; pp. 541-548. [PDF, 369KB]

(2005) Heidi J. Fox: Dependency-based statistical machine translation.  ACL-2005: Student Research Workshop, University of Michigan, Ann Arbor, June 2005; pp. 91-96. [PDF, 101KB]

(2005) Philippe Langlais, Fabrizio Gotti, Didier Bourigault, & Claude Coulombe: EBMT by tree-phrasing: a pilot study MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Second Workshop on Example-Based Machine Translation; pp.71-80. [PDF, 387KB]

(2005) Arul Menezes & Chris Quirk: Dependency treelet translation: the convergence of statistical and example-based machine-translation? MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Second Workshop on Example-Based Machine Translation; pp.99-108. [PDF, 314KB]

(2005) Arul Menezes & Chris Quirk: Microsoft Research treelet translation system: IWSLT evaluation. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2005], 24-25 October, 2005, Pittsburgh, PA, USA; 4pp. [PDF, 240KB]

(2005) Sylwia Ozdowska: Using bilingual dependencies to align words in English/French parallel corpora.  ACL-2005: Student Research Workshop, University of Michigan, Ann Arbor, June 2005; pp. 127-132. [PDF, 89KB]

(2005) Chris Quirk, Arul Menezes, & Colin Cherry: Dependency treelet translation: syntactically informed phrasal SMT.  ACL-2005: 43rd Annual meeting of the Association for Computational Linguistics, University of Michigan, Ann Arbor, 25-30 June 2005; pp. 271-279. [PDF, 118KB]

(2005) Toon Witkam: A new road to automatic translation. [Utrecht: private publication, 26 November 2005]. 13pp. [PDF, 177KB]

Diacritics

(2009) Hamdy S.Mubarak, Mohamed Al Sharqawy & Esraa Al Masry: Diacritization and transliteration of proper nouns from Arabic to English.  MEDAR 2009: 2nd International Conference on Arabic Language Resources & Tools, 22-23 April 2009, Cairo, Egypt; pp.256-259. [PDF, 367KB]

(2008) Tim Schlippe, ThuyLinh Nguyen, & Stephan Vogel: Diacritization as a machine translation problem and as a sequence labeling problem. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.270-278 [PDF, 701KB]

(2007) Mona Diab, Mahmoud Ghoneim, & Nizar Habash: Arabic diacritization in the context of statistical machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.143-149 [PDF, 105KB]

(2007) Nizar Habash & Owen Rambow: Arabic diacritization through full morphological tagging. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; Companion volume, pp.53-56 [PDF, 99KB]

(2006) Mohamed Maamouri, Ann Bies, & Seth Kulick: Diacritization: a challenge to Arabic treebank annotation and parsing.  The Challenge of Arabic for NLP/MT. International conference at the British Computer Society, London, 23 October 2006; pp.35-47 [PDF, 223KB]

Dialects and registers

 (2009) Ahmed Abdelali, Steve Helmreich, & Ron Zacharski: Investigations on standard Arabic geographical classification.  CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 193KB]

(2009) Rodolfo Delmonte, Antonella Bristot, Sara Tonelli, & Emanuele Pianta: English/Veneto resource poor machine translation with STILVEN.  ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.82-89 [abstract]

(2008) Thamar Solorio & Yang Liu: Part-of-speech tagging for English-Spanish code-switched text. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.1051-1060. [PDF, 129KB]

(2007) Jing-Shin Chang & Chun-Kai Kung: A Chinese-to-Chinese statistical machine translation model for mining synonymous simplified-traditional Chinese terms. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.81-87 [PDF, 314KB]

(2007) Mona Diab & Nizar Habash: Arabic dialect processing tutorial. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.5-6 [PDF, 122KB]

(2007) Yves Scherrer: Adaptive string distance measures for bilingual dialect lexicon induction. ACL 2007: proceedings of the Student Research Worksho, Prague, Czech Republic, June 2007; pp. 55-60 [PDF, 122KB]

(2006) Mona Diab & Nizar Habash: Arabic dialect processing.  Tutorial at AMTA 2006 conference, August 8, 2006, Cambridge, Massachusetts, USA; 61pp. [PDF of PPT presentation, 1514KB]

(2006) Nizar Habash & Owen Rambow: MAGEAD: a morphological analyzer and generator for the Arabic dialects. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.681-688. [PDF, 112KB]

(2006) Rebecca Hwa, Carol Nichols & Khalil Sima’an: Corpus variations for translation lexicon induction. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.74-81 [PDF, 178KB]

(2006) Keith J.Miller & Michelle Vanni: Formal vs. informal: register-differentiated Arabic MT evaluation in the PLATO paradigm.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.161-166 [PDF, 296KB]

(2005) Nizar Habash, Owen Rambow & George Kiraz: Morphological analysis and generation for Arabic dialects.  ACL-2005: Workshop on Computational Approaches to Semitic Languages, University of Michigan, Ann Arbor, 29 June 2005; pp. 17-24. [PDF, 121KB]

Dialogue

(2009) Beth Ann Hockey & Manny Rayner: Using paraphrases of deep semantic representations to support regression testing in spoken dialogue systems. NAACL-HLT-2009: Software Engineering, Testing and Quality Assurance for Natural Language Processing (SETQA-NLP 2009), Proceedings of the Workshop, June 5, 2009, Boulder, Colorado; pp.14-21. [PDF, 139KB]

(2009) Germán Sanchis-Trilles, Mauro Cettolo, Nicola Bertoldi, & Marcello Federico: Online language model adaptation for spoken dialog translation. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 160-167. [PDF, 349KB]; presentation [PDF of PPT, 250KB]

(2008) Sherri Condon, Jon Phillips, Christy Doran, John Aberdeen, Dan Parvaz, Beatrice Oshika, Greg Sanders, & Craig Schlendoff: Applying automated metrics to speech translation dialogs.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 178KB]

(2008) Farzad Ehsani, Jim Kimzey, Elaine Zuber, Demitrios Master, & Karen Sudre: Speech to speech translation for nurse patient interaction. Coling 2008: Proceedings of the Workshop on Speech Processing for Safety Critical Translation and Pervasive Applications, 23 August 2008, Manchester, UK; pp.54-59. [PDF, 92KB]

(2008) Andrew Finch & Eiichiro Sumita: Dynamic model interpolation for statistical machine translation. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.208-215. [PDF, 428KB]

(2008) Long Jiang & Ming Zhou: Generating Chinese couplets using a statistical MT approach. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.377-384. [PDF, 284KB]

(2008) Azza Abdel Monem, Khaled Shaalan, Ahmed Rafea, & Hoda Baraka: Generating Arabic text in multilingual speech-to-speech machine translation framework [abstract]. Machine Translation 22 (4), December 2008; pp.205-258.

(2008) Vivek Kumar Rangarajan Sridhar, Shrikanth Narayanan, & Srinivas Bangalore: Enriching spoken language translation with dialog acts. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Short papers, June 16-17, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 225-228. [PDF, 383KB]

(2007) Kristin Precoda, Jing Zheng, Dimitra Vergyri, Horacio Franco, Colleen Richey, Andreas Kathol, & Sachin Kajarekar: IraqComm: a next generation translation system. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2841-2844; abstract [PDF, 23KB]

(2007) Sharath Rao, Ian Lane, & Tanja Schultz: Improving spoken language translation by automatic disfluency removal: evidence from conversational speech transcripts. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.385-389 [PDF, 140KB]

(2007) Sharath Rao, Ian Lane, & Tanja Schultz: Optimizing sentence segmentation for spoken language translation. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2845-2848; abstract [PDF, 23KB]

(2007) Harold Somers: Theoretical and methodological issues regarding the use of language technologies for patients with limited English proficiency. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.206-213 [PDF, 637KB]; presentation [PDF, 464KB]

(2007) Ying Zhang & Stephan Vogel: PanDoRA: a large-scale two-way statistical machine translation system for hand-held devices. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.543-550 [PDF, 472KB]

(2006) Farzad Ehsani, Jim Kimzey, Demetrios Master, Karen Lesea & Hunil Park: Speech to speech translation for medical triage in Korean.  HLT-NAACL 2006: Proceedings of the  Workshop on Medical Speech Translation, 9 June 2006, New York, NY, USA; pp.17-23 [PDF, 96KB]

(2006) Emil Ettelaie, Panayiotis G.Georgiou, & Shrikanth Narayanan: Cross-lingual dialog model for speech to speech translation. Interspeech 2006: ICSLP Ninth International Conference on  Spoken Language Processing, Pittsburgh, PA, USA, September 17-21, 2006, paper 1858; abstract [PDF, 74KB]

(2006) Diana Inkpen, Muath Alzghool, Gareth J.F.Jones & Douglas W.Oard: Investigating cross-language speech retrieval for a spontaneous conversational speech collection.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 61-64 [PDF, 43KB]

(2006) Djamal Mostefa, Olivier Hamon, & Khalid Choukri: Evaluation of automatic speech recognition and speech language translation within TC-STAR: results from the first evaluation campaign.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.149-154 [PDF, 388KB]

(2006) Manny Rayner, Pierrette Bouillon, Nikos Chatzichrisafis, Marianne Santaholma, Marianne Starlander, Beth Ann Hockey, Yukie Nakao, Hitoshi Isahara & Kyoko Kanzaki: MedSLT: a limited-domain unidirectional grammar-based medical speech translator.  HLT-NAACL 2006: Proceedings of the  Workshop on Medical Speech Translation, 9 June 2006, New York, NY, USA; pp.44-47 [PDF, 75KB]

(2006) Stephanie Seneff, Chao Wang & John Lee: Combining linguistic and statistical methods for bi-directional English Chinese translation in the flight domain. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.213-222 [PDF, 152KB]

(2006) Toshiyuki Takezawa & Tohru Shimizu: Performance improvement of dialog speech translation by rejecting unreliable utterances. Interspeech 2006: ICSLP Ninth International Conference on  Spoken Language Processing, Pittsburgh, PA, USA, September 17-21, 2006, paper 1100; abstract [PDF, 78KB]

(2005) Naoki Asanoma, Setsuo Yamada, Osamu Furuse, & Masahiro Oku: Building a conversation corpus by text derivation from "germ dialogs". 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 27-32. [PDF, 82KB]

(2005) Emil Ettelaie, Sudeep Gandhe, Panayiotis Georgiou, Kevin Knight, Daniel Marcu, Shrikanth Narayanan, David Traum, & Robert Belvin: Transonics: a practical speech-to-speech translator for English-Farsi medical dialogues.  ACL-2005: Interactive Poster and Demonstration Sessions, University of Michigan, Ann Arbor, June 2005; pp. 89-92  [PDF, 352KB]

(2005) Manny Rayner, Nikos Chatzichrisafis, Pierrette Bouillon, Yukie Nakao, Hitoshi Isahara, Kyoko Kanzaki, Beth Ann Hockey, Marianne Santaholma, & Marianne Starlander: Japanese speech understanding using grammar specialization. HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; Demonstration abstracts, pp. 26-27. [PDF, 54KB]

(2005) Mutsuko Tomokiyo, Gérard Chollet, & Solange Hollard: Studies of emotional expression sin oral dialogues: towards an extension of Universal Networking Language. In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.286-299 [abstract, PDF, 13KB]

Dictionaries see Lexical resources

Direct translation approach

(2009) Hany Hassan, Khalil Sima’an, & Andy Way: A syntactified direct translation model with linear-time decoding. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1182-1191. [PDF, 163KB]

(2008) Gurpreet Singh Josan & Gurpreet Singh Lehal: A Punjabi to Hindi machine translation system. Coling 2008:  22nd International Conference on Computational Linguistics, Posters and demonstrations, 18-22 August 2008, Manchester UK; pp.157-160. [PDF, 190KB]

(2007) Bogdan Babych, Anthony Hartley, & Serge Sharoff: Translating from under-resourced languages: comparing direct transfer against pivot translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.29-35 [PDF, 197KB]

(2007) A.Cüneyd Tantuğ, Eşref Adali, & Kemal Oflazer: A MT system from Turkmen to Turkish employing finite state and statistical methods. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.459-465 [PDF, 399KB]

Disambiguation (see also Word sense disambiguation)

(2009) Pi-Chuan Chang, Dan Jurafsky, & Christopher D.Manning: Disambiguating “DE” for Chinese-English machine translation.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.215-223. [PDF, 276KB]

(2009) Alexander Fraser, Renjing Wang, & Hinrich Schütze: Rich bitext projection features for parse reranking. EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.282-290. [PDF, 132KB]

(2009) Gan Jin: Some problems in a French-Chinese machine translation system.  ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.138-142 [abstract]

(2009) Shoichi Yokoyama & Masumi Okuyama: Translation disambiguation of patent sentences using case frames. MT Summit XII: Third Workshop on Patent Translation, August 30, 2009, Ottawa, Ontario, Canada; pp. 25-32. [PDF, 75KB]

(2009) Zhifei Li, Jason Eisner, & Sanjeev Khudanpur: Variational decoding for statistical machine translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.593-601. [PDF, 217KB]

(2008) Eiji Aramaki, Takeshi Imai, Kengo Miyo, & Kazuhiko Ohe: Orthographic disambiguation incorporating transliterated probability. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.48-55. [PDF, 525KB]

(2008) Victoria Fossum & Kevin Knight: Using bilingual Chinese-English word alignments to resolve PP-attachment ambiguity in English. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.245-253 [PDF, 595KB]

(2008) Aurélien Max, Rafik Makhloufi, & Philippe Langlais: Explorations in using grammatical dependencies for contextual phrase translation disambiguation. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.114-119. [PDF, 568KB]

(2008) Wolodja Wentland, Johannes Knopp, Carina Silberer, & Matthias Hartung: Building a multilingual lexical resource for named entity disambiguation, translation and transliteration.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 451KB]

(2007) Marine Carpuat & Dekai Wu: Context-dependent phrasal translation lexicons for statistical machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.73-80 [PDF, 425KB]

(2007) Marine Carpuat & Dekai Wu: How phrase sense disambiguation outperforms word sense disambiguation for statistical machine translation. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.43-52 [PDF, 313KB]

(2006) Mohammed A. Attia: An ambiguity-controlled morphological analyzer for modern standard Arabic modelling finite state networks.  The Challenge of Arabic for NLP/MT. International conference at the British Computer Society, London, 23 October 2006; pp.48-67. [PDF, 290KB]

(2006) Mary Hearne & Andy Way: Disambiguation strategies for data-oriented translation. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.59-68 [PDF, 164KB]

(2006) Ted Pedersen, Anagha Kulkarni, Roxana Angheluta, Zornitsa Kozareva, & Thamar Solorio: Improving name discrimination: a language salad approach. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Cross-Language Knowledge Induction Workshop, Trento, Italy, April 3, 2006; pp.25-32 [PDF, 268 KB]

(2006) Marcus Sammer, Kobi Reiter, Stephen Soderland, Katrin Kirchhoff & Oren Etzioni: Ambiguity reduction for machine translation: human-computer collaboration.  AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.193-202 [PDF, 247KB]

(2005) Igor M.Boguslavsky, Leonid L.Iomd