Machine Translation Archive

Subject Index

to methods, techniques, issues and topics

Publications since 2005

Part 2: N-Z

Names and named entities

(2010) Gosse Bouma: Cross-lingual ontology alignment using EuroWordNet and Wikipedia. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1023-1028. [PDF, 336KB]

(2010) Raghavendra Udupa & Mitesh Khapra: Improving the multilingual user experience of Wikipedia using cross-language name search. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.492-500. [PDF, 246KB]

(2009) see [ACL-IJCNLP 2009] NEWS 2009. Named Entities Workshop: shared task on transliteration. Proceedings of the Workshop, Suntec, Singapore, 7 August 2009

(2009) Kedar Bellare, Koby Crammer, & Dayne Freitag: Loss-sensitive discriminative training of machine transliteration models. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Proceedings of the Student Research Workshop and Doctoral Consortium, Boulder, Colorado, June 1, 2009;  pp.61-65. [PDF, 117KB]

(2009) Ming-Wei Chang, Dan Goldwasser, Dan Roth, & Yuancheng Tu: Unsupervised constraint driven learning for transliteration discovery. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.299-307. [PDF, 580KB]

(2009) Loic Dugast, Jean Senellart, & Philipp Koehn: Statistical post editing and dictionary extraction: Systran/Edinburgh submissions for ACL-WMT2009.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.110-114. [PDF, 245KB]

(2009) David Farwell & Lluís Padró: FreeLing: from a multilingual open-source analyzer suite to an EBMT platform. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.37-43. [PDF, 372KB]; presentation [PDF of PPT, 331KB]

(2009) Tatiana Gornostay & Inguna Skadiņa: English-Latvian toponym processing: translation strategies and linguistic patterns. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.81-87. [PDF, 266KB]

(2009) Filip Graliński, Krzysztof Jassem, & Michał Marcińczuk: An environment for named entity recognition and translation. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.88-95. [PDF, 430KB]

(2009) Filip Graliński, Krzysztof Jassem, & Michał Marcińczuk: Named entity recognition in machine anonymization. (IIS-2009) Recent Advances in Intelligent Information Systems, ed. M.A.Klopotek, A. Przepiórkowski, S.T. Wierzchon, K. Trojanowski, Kraków, Poland, June 15-18, 2009; pp.77-89. [PDF, 144KB]

(2009) Honglei Guo, Huijia Zhu, Zhili Guo, Xiaoxun Zhang, Xian Wu & Zhong Su: Domain adaptation with latent semantic association for named entity recognition. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.281-289. [PDF, 203KB]

(2009) Heng Ji: Mining name translations from comparable corpora by creating bilingual information networks. [ACL-IJCNLP-2009] Proceedings of the 2nd Workshop on Building and Using Comparable Corpora, Suntec, Singapore, 6 August 2009; pp.34-47. [PDF, 120KB]

(2009) Alexandre Klementiev & Dan Roth: Named entity transliteration and discovery in multilingual corpora. In: Cyril Goutte, Nicola Cancedda, Marc Dymetman, & George Foster (eds.) Learning machine translation. (Cambridge, Mass.: The MIT Press, 2009); pp.79-92.

(2009) Maoxi Li, Jiajun Zhang, Yu Zhou, & Chengqing Zong: The CASIA statistical machine translation system for IWSLT 2009. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 83-90. [PDF, 317KB]; presentation [PDF of PPT, 463KB]

(2009) Djamel Mostefa, Mariama Laïb, Stéphane Chaudiron, Khalid Choukri, & Gaël de Chalendar: A multilingual named entity corpus for Arabic, English and French.  MEDAR 2009: 2nd International Conference on Arabic Language Resources & Tools, 22-23 April 2009, Cairo, Egypt; pp.213-216. [PDF, 571KB]

(2009) Hamdy S.Mubarak, Mohamed Al Sharqawy & Esraa Al Masry: Diacritization and transliteration of proper nouns from Arabic to English.  MEDAR 2009: 2nd International Conference on Arabic Language Resources & Tools, 22-23 April 2009, Cairo, Egypt; pp.256-259. [PDF, 367KB]

(2009) Peter Nabende: Evaluation of dynamic Bayesian network models for entity name transliteration. Benelearn 2009: proceedings of the 18th Annual Belgian-Dutch Conference on Machine Learning, Tilburg, May 2009; pp.99-100. [PDF, 151KB]

(2009) Sara Noeman: Transliteration using phrase based SMT approach on substrings.  MEDAR 2009: 2nd International Conference on Arabic Language Resources & Tools, 22-23 April 2009, Cairo, Egypt; pp.199-204. [PDF, 348KB]

(2009) Bruno Pouliquen & Ralf Steinberger: Automatic construction of multilingual name dictionaries.  In: Cyril Goutte, Nicola Cancedda, Marc Dymetman, & George Foster (eds.) Learning machine translation. (Cambridge, Mass.: The MIT Press, 2009); pp.59-78.

(2009) Sujith Ravi & Kevin Knight: Learning phoneme mappings for transliteration without parallel data. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.37-45. [PDF, 198KB]

(2009) Aarthi Reddy, Richard Rose, Hani Safadi, Samuel Larkin & Gilles Boulianne: Incorporating knowledge of source language text in a system for dictation of document translations. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 112-119. [PDF, 167KB]

(2009) R.Mahesh K.Sinha: Automated mining of names using parallel Hindi-English corpus. ACL-IJCNLP-2009: 7th Workshop on Asian Language Resources (ALR-7), Proceedings of the workshop, 6-7 August 2009, Suntec, Singapore; pp. 11-18. [PDF, 312KB]

(2009) Raghavendra Udupa, K.Saravanan, A.Kumaran, & Jagadeesh Jagarlamudi: MINT: a method for effective and scalable mining of named entity transliterations from large comparable corpora. EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.799-807. [PDF, 1120KB]

(2009) Fan Yang, Jun Zhao, & Kang Liu: A Chinese-English organization name translation system using heuristic web mining and asymmetric alignment. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.387-395. [PDF, 178KB]

(2008) Bogdan Babych & Anthony Hartley: Sensitivity of automated MT evaluation metrics on higher quality MT output: BLEU vs task-based evaluation methods. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 57KB]

(2008) Sivaji Bandyopadhyay: Multilingual named entity recognition. IJCNLP 2008: Workshop on NER for South and South East Asian Languages, Proceedings of the workshop, 12 January 2008, Hyderabad, India; pp.3-4. [PDF, 37KB]

(2008) Stephan Busemann & Yajing Zhang: Identifying foreign person names in Chinese text. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 255KB]

(2008) Asif Ekbal, Rejwanul Haque, Amitava Das, Venkateswarlu Poka, & Sivaji Bandyopadhyay: Language independent named entity recognition in Indian languages. IJCNLP 2008: Workshop on NER for South and South East Asian Languages, Proceedings of the workshop, 12 January 2008, Hyderabad, India; pp.33-40. [PDF, 160KB]

(2008) Dan Goldwasser & Dan Roth: Active sample selection for named entity transliteration. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Short papers, June 16-17, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 53-56. [PDF, 538KB]

(2008) Dan Goldwasser & Dan Roth: Transliteration as constrained optimization.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.353-362. [PDF, 645KB]

(2008) Ulf Hermjakob, Kevin Knight, & Hal Daumé III: Name translation in statistical machine translation: learning when to transliterate. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 389-397. [PDF, 107KB]

(2008) Fei Huang, Ahmad Emami, & Imed Zitouni: When Harry met Harri, [Arabic name] and [Chinese name]: cross-lingual name spelling normalization.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.391-399. [PDF, 165KB]

(2008) Shasha Liao: Combining source and target language information for name tagging of machine translation output. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the student research workshop, June 16, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 19-24. [PDF, 98KB]

(2008) Inderjeet Mani, Alex Yeh, & Sherri Condon: Learning to match names across languages. Coling 2008: Proceedings of the 2nd Workshop on Multi-source, Multilingual Information Extraction and Summarization, 23 August 2008, Manchester, UK; pp.2-9. [PDF, 182KB]

(2008) Denis Maurel: Prolexbase: a multilingual relational lexical database of proper names. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 5pp. [PDF, 145KB]

(2008) Alexander E. Richman & Patrick Schone: Mining Wiki resources for multilingual named entity recognition. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 1-9. [PDF, 134KB]

(2008) K.Saravanan & A.Kumaran: Some experiments in mining named entity transliteration pairs from comparable corpora. IJCNLP 2008: 2nd International Workshop on Cross-Lingual Information Access (CLIA) Proceedings of the workshop, 11 January 2008, Hyderabad, India; pp.26-33. [PDF, 805KB]

(2008) Zhiyi Song & Stephanie Strassel: Entity translation and alignment in the ACE-07 ET task. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 135KB]

(2008) Kristen Summers & Diane Chandler: Language processing for analysis and investigation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.454-456. [PDF, 647KB]

(2008) Jean Tavernier, Rosa Cowan, & Michelle Vanni: Holy Moses! Leveraging existing tools and resources for entity translation.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 5pp. [PDF, 74KB]

(2008) Yu-Chun Wang, Richard Tzong-Han Tsai, & Wen-Lian Hsu: Learning patterns from the web to translate named entities for cross language information retrieval.  IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.281-288. [PDF, 464KB]

(2008) Wolodja Wentland, Johannes Knopp, Carina Silberer, & Matthias Hartung: Building a multilingual lexical resource for named entity disambiguation, translation and transliteration.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 451KB]

(2008) Imed Zitouni & Radu Florian: Mention detection crossing the language barrier. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.600-609. [PDF, 149KB]

(2007) Bogdan Babych & Anthony Hartley: Sensitivity of automated models for MT evaluation: proximity-based vs. performance-based methods. MT Summit XI Workshop: Automatic procedures in MT evaluation, 11 September 2007, Copenhagen, Denmark, [Proceedings]; 22pp. [PDF of PPT presentation, 150KB]

(2007) Guihong Cao, Jianfeng Gao, & Jian-Yun Nie: A system to mine large-scale bilingual dictionaries from monolingual web pages. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.57-64 [PDF, 539KB]

(2007) Chu-Ren Huang, Petr Šimon, & Shu-Kai Hsieh: Automatic discovery of named entity variants – grammar-driven approaches to non-alphabetical transliterations. ACL 2007: proceedings of demo and poster sessions, Prague, Czech Republic, June 2007; pp. 153-156 [PDF, 246KB]

(2007) Long Jiang, Ming Zhou, Lee-Feng Chien & Cheng Niu: Named entity translation with web mining and transliteration. IJCAI-07: Twentieth International Joint conference on Artificial Intelligence, Hyderabad, India, 6-12 January 2007; pp.1629-1634. [PDF, 491KB]

(2007) Sarvnaz Karimi, Andrew Turpin, & Falk Scholer: Corpus effects on the evaluation of automated transliteration systems. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 640-647 [PDF, 1215KB]

(2007) Sarvnaz Karimi, Falk Scholer, & Andrew Turpin: Collapsed consonant and vowel models: new approaches for English-Persian transliteration and back-transliteration.  ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 648-655 [PDF, 195KB]

(2007) Haizhou Li, Khe Chai Sim, Jin-Shea Kuo, & Minghui Dong: Semantic transliteration of personal names. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 120-127 [PDF, 152KB]

(2007)  S.Saleem, K.Subramanian, R.Prasad, David Stallard, Chia-Lin Kao, P.Natarajan, & R.Suleiman: Improvements in machine translation for English/Iraqi speech translation. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2445-2448; abstract [PDF, 23KB]

(2007) Khaled Shaalan & Hafsa Raza: Person name entity recognition for Arabic. ACL 2007: proceedings of the Workshop on Computational Approaches to Semitic Languages: Common Issues and Resources, Prague, Czech Republic, 28 June 2007; pp. 17-24 [PDF, 591KB]

(2007) Tarek Sherif & Grzegorz Kondrak: Substring-based transliteration. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 944-951 [PDF, 152KB]

(2007)  Jian-Cheng Wu & Jason S.Chang: Learning to find English to Chinese transliterations on the web. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 858-857. [PDF, 187KB]

(2007) Chien-Cheng Wu & Jason S.Chang: Learning to find transliteration on the Web. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; Demonstration Program, pp.21-22 [PDF, 1621KB]

(2007) Su-Youn Yoon, Kyoung-Young Kim, & Richard Sproat: Multilingual transliteration using feature based phonetic method. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 112-119 [PDF, 269KB]

(2007) Bing Zhao, Nguyen Bach, Ian Lane & Stephan Vogel: A log-linear block transliteration model based on bi-stream HMMs. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.364-371 [PDF, 453KB]

(2006) Iñaki Alegria, Nerea Ezeiza, & Izaskun Fernandez: Named entities translation based on comparable corpora. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Workshop on Multi-word expressions in a Multilingual Context, Trento, Italy, April 3, 2006; pp.1-8 [PDF, 455KB]

(2006) Marine Carpuat, Yihai Shen, Xiaofeng Yu, & Dekai Wu: Toward integrating word sense and entity disambiguation into statistical machine translation. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2006], November 27-28, 2006, Kyoto, Japan; pp. 37-44  [PDF, 330KB]

(2006) Conrad Chen & Hsin-Hsi Chen: A high-accurate Chinese-English NE backward translation system combining both lexical information and web statistics. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.81-88. [PDF, 313KB]

(2006) Sherri Condon & Keith J.Miller: Name transliteration: current methods, applications, and evaluation in name transliteration and name translation. Tutorial at AMTA 2006 conference, August 8, 2006, Cambridge, Massachusetts, USA; 46pp. [PDF of PPT presentation, 1103KB]

(2006) Sherri Condon & Keith J. Miller: What’s in a name: current methods, applications, and evaluation in multilingual name search and matching. [Abstract for tutorial, in] HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 299-300 [PDF, 20KB]

(2006) Asif Ekbal, Sudip Kumar Naskar, & Sivaji Bandyopadhyay: A modified joint source-channel model for transliteration. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.191-198. [PDF, 159KB]

(2006) Andrew T. Freeman, Sherri L. Condon & Christopher M. Ackerman: Cross linguistic name matching in English and Arabic: a “one to many mapping” extension of the Levenshtein edit distance algorithm. HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 471-478 [PDF, 371KB]

(2006) Jianfeng Gao, Mu Li, Andi Wu, & Chang-Ning Huang: Chinese word segmentation and named entity recognition: a pragmatic approach. Computational Linguistics 31 (4), pp. 531-574. [PDF, 1638KB]

(2006) Mehdi M. Kashani, Fred Popowich, & Fatiha Sadat: Automatic transliteration of proper nouns from Arabic to English. The Challenge of Arabic for NLP/MT. International conference at the British Computer Society, London, 23 October 2006; pp.76-83. [PDF, 179KB]

(2006) Alexandre Klementiev & Dan Roth: Weakly supervised named entity transliteration and discovery from multilingual comparable corpora. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.817-824. [PDF, 173KB]

(2006) Ted Pedersen, Anagha Kulkarni, Roxana Angheluta, Zornitsa Kozareva, & Thamar Solorio: Improving name discrimination: a language salad approach. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Cross-Language Knowledge Induction Workshop, Trento, Italy, April 3, 2006; pp.25-32 [PDF, 268KB]

(2006) Charles Schafer: Novel probabilistic finite-state transducers for cognate and transliteration modeling. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.203-212 [PDF, 250KB]

(2006) Richard Sproat, Tao Tao, & ChenXiang Zhai: Named entity transliteration with comparable corpora. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.73-80. [PDF, 232KB]

(2006) Dmitry Zelenko & Chinatsu Aone: Discriminative methods for transliteration.  EMNLP-2006: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, Australia, July 2006; pp. 612-617. [PDF, 342KB]

(2005) Ondřej Bojar, Petr Homola, & Vladislav Kuboň: Problems of reusing an existing MT system. IJCNLP-05: Second International Joint Conference on Natural Language Processing, 11-13 October 2005, Jeju Island, Republic of Korea; pp.179-184. [PDF, 73KB]

(2005) Hany Hassan & Jeffrey Sorensen: An integrated approach for Arabic-English named entity translation. ACL-2005: Workshop on Computational Approaches to Semitic Languages, University of Michigan, Ann Arbor, 29 June 2005; pp. 87-93. [PDF, 108KB]

(2005) Fei Huang: Cluster-specific named entity transliteration. HLT-EMNLP-2005: Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 435-444. [PDF, 169KB]

(2005) Fei Huang, Ying Zhang, & Stephan Vogel: Mining key phrase translations from web corpora. HLT-EMNLP-2005: Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 483-490. [PDF, 310KB]

(2005) Thamar Solorio: Exploiting named entity taggers in a second language. ACL-2005: Student Research Workshop, University of Michigan, Ann Arbor, June 2005; pp.25-30. [PDF, 70KB]

(2005) Gregor Thurmair: Improving machine translation quality. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit: invited paper [PDF, 209KB]

Natural language understanding

(2009) Younès Bahou, Lamia Hadrich Belguith, & Abdelmajid Ben Hamadou: Disfluency and out-of-vocabulary word processing in Arabic speech understanding. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 825KB]

Neologisms see Unknown words

Networks see Lattices and networks, Semantic analysis and representation

Neural networks

(2010) Holger Schwenk: Continuous-space language models for statistical machine translation. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.137-146. [PDF, 151KB]

(2006) Holger Schwenk, Marta R. Costa-jussà & José A. R. Fonollosa: Continuous space language models for the IWSLT 2006 task. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2006], November 27-28, 2006, Kyoto, Japan; pp. 166-173 [PDF, 165KB]

(2006) Holger Schwenk, Daniel Dechelotte, & Jean-Luc Gauvain: Continuous space language models for statistical machine translation. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.723-730. [PDF, 508KB]

(2005) Necip Fazil Ayan, Bonnie J. Dorr, & Christof Monz: NeurAlign: combining word alignments using neural networks. HLT-EMNLP-2005: Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 65-72. [PDF, 139KB]

(2005) Philippe Langlais, Thomas Leplus, Simona Gandrabur, & Guy Lapalme: From the real world to real words: the METEO case. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 166-175. [PDF, 102KB]

Non-contiguous relations see Discontiguous relations

Nouns and noun phrases

(2010) Soma Paul, Prashant Mathur, & Sushant Kishore: Syntactic construct: an aid for translating English nominal compound into Hindi. Proceedings of the NAACL HLT 2010 Workshop on Extracting and Using Constructions in Computational Linguistics, Los Angeles, CA, June 2010; pp.32-38. [PDF, 107KB]

(2010) Gábor Recski, András Rung, Attila Zséder, & András Kornai: NP alignment in bilingual corpora. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3379-3382. [PDF, 305KB]

(2009) Lars Bungum & Stephan Oepen: Automatic translation of Norwegian noun compounds. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.136-143. [PDF, 465KB]

(2009) Pi-Chuan Chang, Dan Jurafsky, & Christopher D.Manning: Disambiguating “DE” for Chinese-English machine translation.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.215-223. [PDF, 276KB]

(2009) Chris Dyer: Using a maximum entropy model to build segmentation lattices for MT. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.406-414. [PDF, 173KB]

(2009) Spence Green, Conal Sathi & Christopher D.Manning: NP subject detection in verb-initial Arabic clauses. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp.  [PDF, 729KB]

(2009) Le Thi Sinh: French-Vietnamese noun phrase translation.  ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.158-163 [abstract]

(2009) Svetlana Sheremetyeva: On extracting multiword NP terminology for MT. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.205-212. [PDF, 215KB]

(2009) Sara Stymne: A comparison of merging strategies for translation of German compounds. Proceedings of the EACL 2009 Student Research Workshop, Athens, Greece, 2 April 2009; pp.61-69. [PDF, 111KB]

(2008) Nikesh Garera & David Yarowsky: Translating compounds by learning component gloss translation models via multiple languages. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.403-410. [PDF, 497KB]

(2008) Sara Stymne & Maria Holmqvist: Processing of Swedish compounds for phrase-based statistical machine translation. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.182-191. [PDF, 532KB]

(2007) Roxana Girju: Improving the interpretation of noun phrases with cross-linguistic information. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 568-575 [PDF, 156KB]

(2007) Sanjika Hewavitharana, Alon Lavie, & Stephan Vogel: Experiments with a noun-phrase driven statistical machine translation system. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.247-253 [PDF, 241KB]

(2007) Maria Holmqvist, Sara Stymne, & Lars Ahrenberg: Getting to know Moses: initial experiments on German-English factored translation. ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 181-184 [PDF, 123KB]

(2007) Masaki Itagaki, Takako Aikawa, & Xiaodong He: Automatic validation of terminology translation consistency with statistical method.  MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.269-274 [PDF, 416KB]

(2007) Rebecca Nesson & Stuart M.Shieber: Extraction phenomena in synchronous TAG syntax and semantics. SSST, NAACL-HLT-2007 AMTA Workshop on  Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.9-16 [PDF, 696KB]

(2006) Chris Brockett, William B.Dolan, & Michael Gamon: Correcting ESL errors using phrasal SMT techniques.  Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.249-256. [PDF, 109KB]

(2006) Michael Carl & Ecaterina Rascu: A dictionary lookup strategy for translating of discontinuous phrases. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; pp.49-57 [PDF, 197KB]

(2006) Gábor Pohl: English-Hungarian NP alignment in MetaMorpho TM. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; pp.69-74 [PDF, 81KB]

(2006) Bettina Schrader: Non-probabilistic alignment of rare German and English nominal expressions.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.1274-1277 [PDF, 251KB]

(2005) Patrik Lambert & Rafael Banchs: Data inferred multi-word expressions for statistical machine translation. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.396-403. [PDF, 182KB]

Number (singular, plural)

(2008) Jonathan H. Clark, Robert Frederking, & Lori Levin: Toward active learning in data selection: automatic discovery of language features during elicitation.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 141KB]

Numbers and numerals

(2008) Dongdong Zhang, Mu Li, Nan Duan, Chi-Ho Li, & Ming Zhou: Measure word generation for English-Chinese SMT systems. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 89-96. [PDF, 237KB]

Ontology see Knowledge representation

Oracle extraction see Confusion networks

Out of vocabulary see Unknown words

Parallel text corpora see Bilingual corpora, Multilingual corpora

Paraphrasing

(2010) Wilker Aziz, Marc Dymetman, Shachar Mirkin, Lucia Specia, Nicola Cancedda, & Ido Dagan: Learning an expert from human annotations in statistical machine translation: the case of out-of-vocabulary words. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 585KB]; presentation: 21 slides [PDF, 777KB]

(2010) Michael Denkowski, Hasan Al-Haj, & Alon Lavie: Turker-assisted paraphrasing for English-Arabic machine translation. Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon’s Mechanical Turk, Los Angeles, CA, June 2010; pp.66-70. [PDF, 237KB]

(2010) Soma Paul, Prashant Mathur, & Sushant Kishore: Syntactic construct: an aid for translating English nominal compound into Hindi. Proceedings of the NAACL HLT 2010 Workshop on Extracting and Using Constructions in Computational Linguistics, Los Angeles, CA, June 2010; pp.32-38. [PDF, 107KB]

(2009) Anabela Barreiro & Luís Miguel Cabral: ReEscreve: a translator-friendly multi-purpose paraphrasing software tool. MT Summit XII - Workshop: Beyond Translation Memories: New Tools for Translators MT, August 29, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 594KB]; presentation, 37pp. [PDF of PPT, 1344KB]

(2009) Chris Dyer, Hendra Setiawan, Yuval Marton, & Philip Resnik: The University of Maryland statistical machine translation system for the Fourth Workshop on Machine Translation.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.145-149. [PDF, 120KB]

(2009) Beth Ann Hockey & Manny Rayner: Using paraphrases of deep semantic representations to support regression testing in spoken dialogue systems. NAACL-HLT-2009: Software Engineering, Testing and Quality Assurance for Natural Language Processing (SETQA-NLP 2009), Proceedings of the Workshop, June 5, 2009, Boulder, Colorado; pp.14-21. [PDF, 139KB]

(2009) Yuval Marton, Chris Callison-Burch, & Philip Resnik: Improved statistical machine translation using monolingually-derived paraphrases. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.381-390. [PDF, 160KB]

(2009) Shachar Mirkin, Lucia Specia, Nicola Cancedda, Ido Dagan, Marc Dymetman, & Idan Szpektor: Source-language entailment modeling for translating unknown terms. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.791-799. [PDF, 152KB]

(2009) Bo Wang, Tiejun Zhao, Muyun Yang, & Sheng Li: References extension for the automatic evaluation of MT by syntactic hybridization.  Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.37-44. [PDF, 378KB]

(2008) Francis Bond, Eric Nichols, Darren Scott Appling, & Michael Paul: Improving statistical machine translation by paraphrasing the training data. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp.150-157 [PDF, 132KB]; presentation [PDF, 189KB]

(2008) Chris Callison-Burch, Trevor Cohn, & Mirella Lapata: ParaMetric: an automatic evaluation metric for paraphrasing. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.97-104. [PDF, 232KB]

(2008) Chris Callison-Burch: Syntactic constraints on paraphrases extracted from parallel corpora. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.196-205. [PDF, 187KB]

(2008) Atsushi Fujita & Satoshi Sato: A probabilistic model for measuring grammaticality and similarity of automatically generated paraphrases of predicate phrases. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.225-232. [PDF, 498KB]

(2008) Young-Sook Hwang, YoungKil Kim, & SangKyu Park: Paraphrasing depending on bilingual context toward generalization of translation knowledge. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.327-334. [PDF, 952KB]

(2008) Nitin Madnani, Philip Resnik, Bonnie J.Dorr, & Richard Schwartz: Are multiple reference translations necessary? Investigating the value of paraphrased reference translations in parameter optimization. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.143-152. [PDF, 655KB]

(2008) Aurélien Max & Michael Zock: Looking up phrase rephrasings via a pivot language. Coling 2008: Proceedings of the Workshop on Cognitive Aspects of the Lexicon, 24 August 2008, Manchester, UK; pp.77-84. [PDF, 257KB]

(2008) Preslav Nakov: Improving English-Spanish statistical machine translation: experiments in domain adaptation, sentence paraphrasing, tokenization, and recasing. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.147-150. [PDF, 110KB]

(2008) Shiqi Zhao, Haifeng Wang, Ting Liu, & Sheng Li: Pivot approach for extracting paraphrase patterns from bilingual corpora. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 780-788. [PDF, 188KB]

(2008) Shiqi Zhao, Cheng Niu, Ming Zhou, Ting Liu, & Sheng Li: Combining multiple resources to improve SMT-based paraphrasing model. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 1021-1029. [PDF, 188KB]

(2007) Nitin Madnani, Necip Fazil Ayan, Philip Resnik, & Bonnie J.Dorr: Using paraphrases for parameter tuning in statistical machine translation. ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 120-127 [PDF, 148KB]

(2006) Chris Callison-Burch, Philipp Koehn & Miles Osborne: Improved statistical machine translation using paraphrases.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 17-24 [PDF, 178KB]

(2006) David Kauchak & Regina Barzilay: Paraphrasing for automatic evaluation. HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 455-462 [PDF, 196KB]

(2006) Karolina Owczarzak, Declan Groves, Josef Van Genabith, & Andy Way: Contextual bitext-derived paraphrases in automatic MT evaluation. HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 86-93 [PDF, 222KB]

(2006) Kiyotaka Uchimoto, Naoko Hayashida, Toru Ishida, & Hitoshi Isahara: Automatic detection and semi-automatic revision of non-machine-translatable parts of a sentence.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.703-708 [PDF, 330KB]

(2006) Liang Zhou, Chin-Yew Lin, & Eduard Hovy: Re-evaluating machine translation results with paraphrase support.  EMNLP-2006: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, Australia, July 2006; pp. 77-84. [PDF, 324KB]

(2005) Colin Bannard & Chris Callison-Burch: Paraphrasing with bilingual parallel corpora. ACL-2005: 43rd Annual meeting of the Association for Computational Linguistics, University of Michigan, Ann Arbor, 25-30 June 2005; pp. 597-604. [PDF, 196KB]

(2005) Chris Brockett & William B.Dolan: Support vector machines for paraphrase identification and corpus construction. IJCNLP-05: Third International Workshop on Paraphrasing (IWP 2005). Proceedings of the workshop, 12 October 2005, Jeju Island, Korea; pp. 1-8. [PDF, 115KB]

(2005) Michael Carl, Ecaterina Rascu, & Paul Schmidt: Using template-grammars for shake & bake paraphrasing. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 66-72. [PDF, 107KB]

(2005) William B.Dolan & Chris Brockett: Automatically constructing a corpus of sentential paraphrases. IJCNLP-05: Third International Workshop on Paraphrasing (IWP 2005). Proceedings of the workshop, 12 October 2005, Jeju Island, Korea; pp. 9-16. [PDF, 115KB]

(2005) Andrew Finch, Young-Sook Hwang, & Eiichiro Sumita: Using machine translation evaluation techniques to determine sentence-level semantic equivalence. IJCNLP-05: Third International Workshop on Paraphrasing (IWP 2005). Proceedings of the workshop, 12 October 2005, Jeju Island, Korea; pp. 17-24. [PDF, 111KB]

(2005) Atsushi Fujita & Kentaro Inui: A class-oriented approach to building a paraphrase corpus. IJCNLP-05: Third International Workshop on Paraphrasing (IWP 2005). Proceedings of the workshop, 12 October 2005, Jeju Island, Korea; pp. 25-32. [PDF, 300KB]

(2005) Richard Power & Donia Scott: Automatic generation of large-scale paraphrases. IJCNLP-05: Third International Workshop on Paraphrasing (IWP 2005). Proceedings of the workshop, 12 October 2005, Jeju Island, Korea; pp. 73-79. [PDF, 49KB]

(2005) Dekai Wu: Recognizing paraphrases and textual entailment using inversion transduction grammars.  ACL-2005: Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization, University of Michigan, Ann Arbor, 29 June 2005; pp. 25-30. [PDF, 113KB]

Parentheses

(2008) Dekang Lin, Shaojun Zhao, Benjamin Van Durme, & Marius Paşca: Mining parenthetical translations from the web by word alignment. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 994-1002. [PDF, 419KB]

Parsing see Syntactic analysis, Tagging

Particles

(2009) Pi-Chuan Chang, Dan Jurafsky, & Christopher D.Manning: Disambiguating “DE” for Chinese-English machine translation.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.215-223. [PDF, 276KB]

(2009) Craig Kopris: Endoclitics in Pashto: can they really do that?  CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 6pp. [PDF, 485KB]

(2008) Arul Menezes & Chris Quirk: Syntactic models for structural word insertion and deletion.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.735-744. [PDF, 178KB]

(2005) R. Mahesh K. Sinha & Anil Thakur: Handling ki in Hindi for Hindi-English MT. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.356-363. [PDF, 289KB]

Parts of speech see Tagging

Pattern matching methods (see also Example-based methods, Learning and training)

(2009) Gonzalo Iglesias, Adrià de Gispert, Eduardo R.Banga & William Byrne: Rule filtering by pattern for efficient hierarchical translation. EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.380-388. [PDF, 109KB]

(2009) Long Jiang, Shiquan Yang, Ming Zhou, Xiaohua Liu, & Qingsheng Zhu: Mining bilingual data from the web with adaptively learnt patterns. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.870-878. [PDF, 306KB]

(2009) Jin’ichi Murakami, Masato Tokuhisa, & Satoru Ikehara: Statistical machine translation adding pattern-based machine translation in Chinese-English translation. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 107-112. [PDF, 317KB]; presentation [PDF of PPT, 75KB]

(2008) Takeshi Abekawa & Kyo Kageura: Constructing a corpus that indicates patterns of modification between draft and final translations by human translators. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 65KB]

(2008) Takeshi Abekawa & Kyo Kageura: What prompts translators to modify draft translations? An analysis of basic modification patterns for use in the automatic notification of awkwardly translated text. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.241-248. [PDF, 393KB]

(2008) Adam David Lopez: Machine translation by pattern matching. PhD thesis, University of Edinburgh, 2008. 138pp. [PDF, 1171KB]

(2008) Adam Lopez: Tera-scale translation models via pattern matching. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.505-512. [PDF, 86KB]

(2008) Adam Lopez: Translation by pattern matching. Second Machine Translation Marathon, Wandlitz, Berlin, Germany, 12-17 May 2008; 109 slides. [PDF, 174KB]

(2008) Yu-Chun Wang, Richard Tzong-Han Tsai, & Wen-Lian Hsu: Learning patterns from the web to translate named entities for cross language information retrieval.  IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.281-288. [PDF, 464KB]

(2008) Shiqi Zhao, Haifeng Wang, Ting Liu, & Sheng Li: Pivot approach for extracting paraphrase patterns from bilingual corpora. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 780-788. [PDF, 188KB]

(2007) Stella Markantonatou, Sokratis Sofianopoulos, Vassiliki Spilioti, Marina Vassiliou, & Olga Yannoutsou: An MT system embedding pattern knowledge.  METIS-II Workshop: New Approaches to Machine Translation, Centre for Computational Linguistics, Katholieke Universiteit Leuven, Belgium, 11 January 2007; 8pp. [PDF, 195KB]

(2007) Dong Zhou, Mark Truran, Tim Brailsford & Helen Ashman: NTCIR-6 experiments using pattern matched translation extraction. Proceedings of NTCIR-6 Workshop Meeting, May 15-18, 2007, Tokyo, Japan; pp.145-151. [PDF, 130KB]

(2006) Josep M. Crego, Adrià de Gispert, Patrik Lambert, Marta R. Costa-jussà, Maxim Khalilov, Rafael E.Banchs, José B.Mariño, & José A.R.Fonollosa: N-gram-based SMT system enhanced with reordering patterns.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 162-165 [PDF, 209KB]

(2006) Stella Markantonatou, Sokratis Sofianopoulos, Vassiliki Spilioti, George Tambouratzis, Marina Vassiliou, & Olga Yannoutsou: Using patterns for machine translation. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; pp.239-245 [PDF, 237KB]

(2006) Hitomi Tohyama & Shigeki Matsubara: Collection of simultaneous interpreting patterns by using bilingual spoken monologue corpus. LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.2564-2569 [PDF, 552KB]

(2005) Niladri Chatterjee, Shailly Goyal, & Anjali Naithani: Resolving pattern ambiguity for English to Hindi machine translation using WordNet. International workshop: Modern approaches in translation technologies, Borovets, Bulgaria, 24 September 2005; pp.18-25 [PDF, 245KB]

(2005) Christina Lioma & Iadh Ounis: Deploying part-of-speech patterns to enhance statistical phrase-based machine translation resources.  ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 163-166. [PDF, 118KB]

(2005) Adam Lopez & Philip Resnik: Pattern visualization for machine translation output.  HLT-EMNLP-2005: Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; Demonstration abstracts, pp. 12-13. [PDF, 160KB]

(2005) R. Mahesh K. Sinha & Anil Thakur: Divergence patterns in machine translation between Hindi and English. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.346-353. [PDF, 288KB]

(2005) Miki Sasaki & Toshiki Murata: A pattern-based machine translation system - Yakushite Net MT engine. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2005], 24-25 October, 2005, Pittsburgh, PA, USA; 7pp. [PDF, 112KB]

Perceptron

(2010) Cristina España-Bonet & Lluís Màrquez: Robust estimation of feature weights in statistical machine translation. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 618KB]

(2009) Dayne Freitag & Zhiqiang Wang: Name transliteration with bidirectional perceptron edit models. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.132-135. [PDF, 96KB]

(2009) Alexandre Patry & Philippe Langlais: Prediction of words in statistical machine translation using a multilayer perceptron. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 101-111. [PDF, 337KB]

(2007) Dayne Freitag & Shahram Khadivi: A sequence alignment model based on the averaged perceptron. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 238-247. [PDF, 180KB]

Personal names see Names [named entities]

Phonology and phonetics

(2009) Colin Cherry & Hisami Suzuki: NEWS 2009 machine transliteration shared task system description: transliteration with letter-to-phoneme technology. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.69-71. [PDF, 289KB]

(2009) Priyanka Gupta & Vishal Goyal: Implementation of rule based algorithm for sandhi-vicheda of compound Hindi words. International Journal of Computer Science Issues, vol.3 (2009), pp. 45-49. [PDF, 122KB]

(2009) Oi Yee Kwong: Graphemic approximation of phonological context for English-Chinese transliteration. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.186-193. [PDF, 258KB]

(2009) Oi Yee Kwong: Phonological context approximation and homophone treatment for NEWS 2009 English-Chinese transliteration shared task. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.76-79. [PDF, 178KB]

(2009) Antoine Laurent, Paul Deléglise, & Sylvain Meignier: Grapheme to phoneme conversion using an SMT system. Interspeech 2009: 10th Annual Conference of the International Speech Communication Association, 6-10 September 2009, Brighton, UK; abstract [PDF]

(2009) Mohammad Mehdi Arab & Ali Azimizadeh: Construction of a Persian letter-to-sound conversion system based on classification and regression tree. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 676KB]

(2009) Jong-Hoon Oh, Kiyotaka Uchimoto, & Kentaro Torisawa: Can Chinese phonemes improve machine transliteration?: a comparative study of English-to-Chinese transliteration models. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.658-667. [PDF, 181KB]

(2009) Taraka Rama, Anil Kumar Singh, & Sudheer Kolachina: Modeling letter-to-phoneme conversion as a phrase based statistical machine translation problem with minimum error rate training.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Proceedings of the Student Research Workshop and Doctoral Consortium, Boulder, Colorado, June 1, 2009;  pp.90-95. [PDF, 404KB]

(2009) Sujith Ravi & Kevin Knight: Learning phoneme mappings for transliteration without parallel data. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.37-45. [PDF, 198KB]

(2009) Sravana Reddy & Sonjia Waxmonsky: Substring-based transliteration with conditional random fields. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.92-95. [PDF, 90KB]

(2008) Jin-Shea Kuo, Haizhou Li, & Chih-Lung Lin: Mining transliterations from web query results: an incremental approach. IJCNLP 2008: Sixth SIGHAN Workshop on Chinese Language Processing, Proceedings of the workshop, 11-12 January 2008, Hyderabad, India; pp.16-23. [PDF, 400KB]

(2007) Su-Youn Yoon, Kyoung-Young Kim, & Richard Sproat: Multilingual transliteration using feature based phonetic method. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 112-119 [PDF, 269KB]

(2006) Jin-Shea Kuo, Haizhou Li, & Ying-Kuei Yang: Learning transliteration lexicons from the Web. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.1129-1136. [PDF, 428KB]

(2006) Tao Tao, Su-Youn Yoon, Andrew Fister, Richard Sproat, & ChengXiang Zhai: Unsupervised named entity transliteration using temporal and phonetic correlation.  EMNLP-2006: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, Australia, July 2006; pp. 250-257. [PDF, 139KB]

(2005) Karin Müller: Revealing phonological similarities between related languages from automatically generated parallel corpora.  ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 33-40. [PDF, 114KB]

Phrase-based MT see Statistical MT

Pivot language (see also Interlingua methods)

(2010) Mitesh M.Khapra, A.Kumaran, & Pushpak Bhattacharyya: Everybody loves a rich cousin: an empirical study of transliteration through bridge languages.  NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.420-428. [PDF, 157KB]

(2009) Yu Chen, Martin Kay, & Andreas Eisele: Intersecting multilingual data for faster and better statistical translations.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.128-136. [PDF, 159KB]

(2009) Nizar Habash & Jun Hu: Improving Arabic-Chinese statistical machine translation using English as pivot language. Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.173-181. [PDF, 265KB]

(2009) Philipp Koehn, Alexandra Birch & Ralf Steinberger: 462 machine translation systems for Europe. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 65-72. [PDF, 204KB]

(2009) Yuval Marton, Chris Callison-Burch, & Philip Resnik: Improved statistical machine translation using monolingually-derived paraphrases. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.381-390. [PDF, 160KB]

(2009) Michael Paul, Hirofumi Yamamoto, Eiichiro Sumita, & Satoshi Nakamura: On the importance of pivot language selection for statistical machine translation.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.221-224. [PDF, 114KB]

(2009) Rie Tanaka, Yohei Murakami, & Toru Ishida: Context-based approach for pivot translation services. IJCAI-09: Twenty-first International Joint conference on Artificial Intelligence, Pasadena, California, USA, 11-17 July 2009; pp.1555-1561. [PDF, 505KB]

(2009) Varga István & Yokoyama Shoichi: Bilingual dictionary generation for low-resourced language pairs. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.862-870. [PDF, 199KB]

(2009) Hua Wu & Haifeng Wang: Revisiting pivot language approach for machine translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.154-162. [PDF, 459KB]

(2008) see IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA.

(2008) Nicola Bertoldi, Madalina Barbaiani, Marcello Federico, & Roldano Cattoni: Phrase-based statistical machine translation with pivot languages.  IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp.143-149 [PDF, 208KB]; presentation [PDF, 230KB]

(2008) Simon Carter, Christof Monz, & Sirvan Yahyaei: The QMUL system description for IWSLT 2008. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp.104-107. [PDF, 161KB]

(2008) Yu Chen, Andreas Eisele, & Martin Kay: Improving statistical machine translation efficiency by triangulation.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 122KB]

(2008) Maxim Khalilov, Marta R. Costa-jussà, Carlos A.Henríquez Q., José A.R.Fonollosa, Adolfo Hernández H., José B.Mariño, Rafael E. Banchs, Chen Boxing, Min Zhang, Aiti Aw, & Haizhou Li: The TALP&I2R SMT systems for IWSLT 2008. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp.116-123 [PDF, 236KB]

(2008) Aurélien Max & Michael Zock: Looking up phrase rephrasings via a pivot language. Coling 2008: Proceedings of the Workshop on Cognitive Aspects of the Lexicon, 24 August 2008, Manchester, UK; pp.77-84. [PDF, 257KB]

(2008) Coşkun Mermer, Hamza Kaya, Ömer Farukhan Güneş, & Mehmet Uğur Doğan: The TÜBİTAK-UEKAE statistical machine translation system for IWSLT 2008. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp.138-142 [PDF, 239KB]; presentation [PDF, 179KB]

(2008) Jessica Ramírez, Masayuki Asahara, & Yuji Matsumoto: Japanese-Spanish thesaurus construction using English as a pivot. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.473-480. [PDF, 787KB]

(2008) Takashi Tsunakawa, Naoaki Okazaki, & Jun’ichi Tsujii: Building a bilingual lexicon using phrase-based statistical machine translation via a pivot language. Coling 2008:  22nd International Conference on Computational Linguistics, Posters and demonstrations, 18-22 August 2008, Manchester UK; pp.127-130. [PDF, 115KB]

(2008) Takashi Tsunakawa, Naoaki Okazaki, & Jun’ichi Tsujii: Building bilingual lexicons using lexical translation probabilities via pivot languages. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 66KB]

(2008) Haifeng Wang, Hua Wu, Xiaoguang Hu, Zhanyi Liu, Jianfeng Li, Dengjun Ren, & Zhengyu Niu: The TCH machine translation system for IWSLT 2008. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp.124-131 [PDF, 242KB]; presentation [PDF, 135KB]

(2008) Shiqi Zhao, Haifeng Wang, Ting Liu, & Sheng Li: Pivot approach for extracting paraphrase patterns from bilingual corpora. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 780-788. [PDF, 188KB]

(2007) Bogdan Babych, Anthony Hartley, & Serge Sharoff: Translating from under-resourced languages: comparing direct transfer against pivot translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.29-35 [PDF, 197KB]

(2007) Trevor Cohn & Mirella Lapata: Machine translation by triangulation: making effective use of multi-parallel corpora. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 728-735 [PDF, 197KB]

(2007) Varga István & Yokoyama Shoichi: Japanese-Hungarian dictionary generation using ontology resources. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.483-490 [PDF, 167KB]

(2007) Vitaly Klyuev: OASIS at NTCIR-6: on-line query translation for Chinese-Japanese cross-lingual information retrieval. Proceedings of NTCIR-6 Workshop Meeting, May 15-18, 2007, Tokyo, Japan; pp.85-91. [PDF, 264KB]

(2007) Shankar Kumar, Franz Och, & Wolfgang Macherey: Improving word alignment with bridge languages. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 42-50. [PDF, 183KB]

(2007) Masatoshi Tsuchiya, Ayu Purwarianti, Toshiyuki Wakita, & Seiichi Nakagawa: Expanding Indonesian-Japanese small translation dictionary using a pivot language. ACL 2007: proceedings of demo and poster sessions, Prague, Czech Republic, June 2007; pp. 197-200 [PDF, 567KB]

(2007) Masao Utiyama & Hitoshi Isahara: A comparison of pivot methods for phrase-based statistical machine translation. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.484-491 [PDF, 495KB]

(2007) Hua Wu & Haifeng Wang: Pivot language approach for phrase-based statistical machine translation. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 856-863 [PDF, 205KB]

(2006) Kisuh Ahn & Matthew Frampton: Automatic generation of translation dictionaries using intermediary languages. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Cross-Language Knowledge Induction Workshop, Trento, Italy, April 3, 2006; pp.41-44 [PDF, 278KB]

(2006) Andreas Eisele: Parallel corpora and phrase-based statistical machine translation for new language pairs via multiple intermediaries.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.845-848 [PDF, 329KB]

(2006) Adrià de Gispert & José B. Mariño: Catalan-English statistical machine translation without parallel corpus: bridging through Spanish.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.65-68. [PDF, 64KB]

(2005) Frederic C.Gey: How similar are Chinese and Japanese for cross-language information retrieval? Proceedings of NTCIR-5 Workshop Meeting, December 6-9, 2005, Tokyo, Japan; 4pp. [PDF, 343KB]

(2005) Kui-Lam Kwok, Sora Choi, Norbert Dinstl, & Peter Deng: NTCIR-5 Chinese, English, Korean cross-language retrieval experiments using PIRCS. Proceedings of NTCIR-5 Workshop Meeting, December 6-9, 2005, Tokyo, Japan; 8pp. [PDF, 455KB]

(2005) Tetsuji Nakagawa: NTCIR-5 CLIR experiments at Oki. Proceedings of NTCIR-5 Workshop Meeting, December 6-9, 2005, Tokyo, Japan; 6pp. [PDF, 303KB]

Post-editing see index of applications

Postpositions

(2010) Taiji Nagasaka, Ran Shimanouchi, Akiko Sakamoto, Takafumi Suzuki, Yohei Morishita, Takehito Utsuro, & Suguru Matsuyoshi: Utilizing semantic equivalence classes of Japanese functional expressions in translation rule acquisition from parallel patent sentences. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1777-1785. [PDF, 942KB]

(2008) Sibansu Mukhopadhyay, Biswath Barik, & Sudeshna Sarkar: Problems of transfer grammar: the Hindi postpositional correlatives in Bangla. ICON-2008: 6th International Conference on Natural Language Processing, CDAC Pune, India, 20-22 December 2008; 10pp. [PDF, 180KB]

(2006) Masatoshi Tsuchiya, Takao Shime, Toshihiro Takagi, Takehito Utsuro, Kiyotaka Uchimoto, Suguru Matsuyoshi, Satoshi Sato, & Seiichi Nakagawa: Chunking Japanese compound functional expressions by machine learning. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Workshop on Multi-word expressions in a Multilingual Context, Trento, Italy, April 3, 2006; pp. 25-32 [PDF, 793KB]

Pragmatics in MT

(2010) Fangzhong Su & Katja Markert: Word sense subjectivity for cross-lingual lexical substitution. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.357-360. [PDF, 109KB]

(2008) Carmen Banea, Rada Mihalcea, Janyce Wiebe, & Samer Hassan: Multilingual subjectivity analysis using machine translation. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.127-135. [PDF, 15403KB]

(2006) David Farwell & Stephen Helmreich: Pragmatics-based MT and the translation of puns. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; pp.187-193 [PDF, 157KB]

Predicates and arguments see Semantic analysis and representation, Syntactic analysis and representation

Predictive methods see Text prediction

Pre-editing see index of applications

Prepositional phrases

(2008) Victoria Fossum & Kevin Knight: Using bilingual Chinese-English word alignments to resolve PP-attachment ambiguity in English. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.245-253 [PDF, 595KB]

(2008) Elliott Macklovitch, Guy Lapalme, & Fabrizio Gotti: TransSearch: what are translators looking for?  AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.412-419. [PDF, 757KB]

(2006) Martin Volk: How bad is the problem of PP-attachment? A comparison of English, German and Swedish. Proceedings of the Third ACL-SIGSEM Workshop on Prepositions, Trento, Italy, April 2006; pp.81-88. [PDF, 133KB]

(2005) Rajat Kumar Mohanty, Ashish Francis Almeida & Pushpak Bhattacharyya: Prepositional phrase attachment and interlingua. In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.241-253 [abstract, PDF, 15KB]

Prepositions

(2010) Md.Zahurul Islam, Jörg Tiedemann & Andreas Eisele: English to Bangla phrase-based machine translation.  EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 601KB]

(2009) Eneko Agirre, Aitziber Atutxa, Gorka Labaka, Mikel Lersundi, Aingeru Mayor, & Kepa Sarasola: Use of rich linguistic information to translate prepositions and grammatical cases to Basque. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.58-65. [PDF, 340KB]

(2009) Hendra Setiawan, Min-Yen Kan, Haizhou Li, & Philip Resnik: Topological ordering of function words in hierarchical phrase-based translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.324-332. [PDF, 504KB]

(2008) Arul Menezes & Chris Quirk: Syntactic models for structural word insertion and deletion.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.735-744. [PDF, 178KB]

(2007) Samar Husain, Dipti Misra Sharma, & Manohar Reddy: Simple preposition correspondence: a problem in English to Indian language machine translation. ACL 2007: proceedings of the 4th ACL-SIGSEM Workshop on Prepositions, Prague, Czech Republic, 28 June 2007; pp.51-58 [PDF, 104KB]

(2007) Hendra Setiawan, Min-Yen Kan, & Haizhou Li: Ordering phrases with function words. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 712-719 [PDF, 213KB]

(2006) Sudip Kumar Naskar & Sivaji Bandyopadhyay: Handling of prepositions in English to Bengali machine translation. Proceedings of the Third ACL-SIGSEM Workshop on Prepositions, Trento, Italy, April 2006; pp.89-94. [PDF, 208KB]

(2005) Ebba Gustavii: Target language preposition selection - an experiment with transformation based learning and aligned bilingual data. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 112-118. [PDF, 45KB]

Probabilistic methods see Statistical analysis

Pronouns

(2008) Prachya Boonkwan & Thepchai Supnithi: Memory-inductive categorial grammar: an approach to gap resolution in analytic-language translation. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.80-87. [PDF, 339KB]

Proposed systems

(2009) Jaime Carbonell: New paradigms for machine translation. Translingual Europe 2009, May 13-14, Prague, Czech Republic; 35pp. [PDF of PPT, 462KB]

(2009) Hans Uszkoreit: Hybrid machine translation. Translingual Europe 2009, May 13-14, Prague, Czech Republic; 16pp. [PDF of PPT, 1407KB]

(2007) Hitoshi Isahara, Sadao Kurohashi, Jun’ichi Tsujii, Kiyotaka Uchimoto, Hiroshi Nakagawa, Hiroyuki Kaji, & Shun’ichi Kikuchi: Development of a Japanese-Chinese machine translation system.  MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.263-267 [PDF, 161KB]

(2007) Philipp Koehn: EuroMatrix – machine translation for all European languages. Invited talk at MT Summit XI, 10-14 September 2007, Copenhagen, Denmark; 33pp. [PDF of PPT presentation, 478KB]

Punctuation

(2008) Coşkun Mermer, Hamza Kaya, Ömer Farukhan Güneş, & Mehmet Uğur Doğan: The TÜBİTAK-UEKAE statistical machine translation system for IWSLT 2008. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp.138-142 [PDF, 239KB]; presentation [PDF, 179KB]

(2007) Roldano Cattoni, Nicola Bertoldi, & Marcello Federico: Punctuating confusion networks for speech translation. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2453-2456; abstract [PDF, 23KB]

(2006) Evgeny Matusov, Arne Mauser, & Hermann Ney: Automatic sentence segmentation and punctuation prediction for spoken language translation. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2006], November 27-28, 2006, Kyoto, Japan; pp. 158-165 [PDF, 117KB]

(2007) Evgeny Matusov, Dustin Hillard, Mathew Magimai-Doss, Dilek Hakkani-Tür, Mari Ostendorf, & Hermann Ney: Improving speech translation with automatic boundary prediction. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2449-2452; abstract [PDF, 61KB]

Quality improvement techniques (see also Interactive methods)

(2008) Nguyen Bach, Qin Gao, & Stephan Vogel: Improving word alignment with language model based confidence scores. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.151-154. [PDF, 607KB]

(2008) Almut Silja Hildebrand & Stephan Vogel: Combination of machine translation systems via hypothesis selection from combined n-best lists. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.254-261 [PDF, 681KB]

(2008) Reginald Hobbs, Jamal Laoudi, & Clare R.Voss: MTriage: web-enabled software for the creation, machine translation, and annotation of smart documents.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 433KB]

(2007) Ariadna Font Llitjós, Jaime Carbonell, & Alon Lavie: Improving transfer-based MT systems with automatic refinements. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.183-190 [PDF, 339KB]

(2007) Ariadna Font Llitjós & William A. Ridmann: The inner works of an automatic rule refiner for machine translation. METIS-II Workshop: New Approaches to Machine Translation, Centre for Computational Linguistics, Katholieke Universiteit Leuven, Belgium, 11 January 2007; 10pp. [PDF, 412KB]

(2007) Hans-Udo Stadler & Ursula Peter-Spörndli: The quest for machine translation quality at CLS Communication. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.435-442 [PDF, 115KB]

(2006) David Kauchak: Contributions to research on machine translation. PhD thesis, University of California, San Diego, 2006. xiv,92pp. [PDF, 491KB]

(2006) Ariadna Font-Llitjós: Can the Internet help improve machine translation?  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 219-222 [PDF, 135KB]

(2006) Bart Mellebeek, Karolina Owczarzak, Declan Groves, Josef Van Genabith, & Andy Way: A syntactic skeleton for statistical machine translation. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; pp.195-202

(2006) Dragos Stefan Munteanu & Daniel Marcu: Improving machine translation performance by exploiting non-parallel corpora. Computational Linguistics 31 (4), pp. 477-504 [PDF, 1060KB]

(2006) Karolina Owczarzak, Bart Mellebeek, Declan Groves, Josef Van Genabith & Andy Way: Wrapper syntax for example-based machine translation.  AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.148-155 [PDF, 248KB]

(2006) Gregor Thurmair: Using corpus information to improve MT quality. LREC-2006: Fifth International Conference on Language Resources and Evaluation. Third International Workshop on Language Resources for Translation Work, Research & Training (LR4Trans-III), Genoa, Italy, 28 May 2006; pp.45-48. [PDF, 371KB]

(2005) Bogdan Babych: Information extraction technology in machine translation: IE methods for improving and evaluating MT quality. PhD thesis, University of Leeds, Centre for Translation Studies, March 2005. 186pp. [PDF, 859KB]

(2005) Bart Mellebeek, Anna Khasin, Karolina Owczarzak, Josef Van Genabith, & Andy Way: Improving online machine translation systems. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.290-297. [PDF, 203KB]

(2005) Zhu Jiang & Wang Haifeng: The effect of adding rules into the rule-based MT system. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.298-304. [PDF, 378KB]

(2005) Lars Ahrenberg: Codified close translation as a standard for MT. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 13-22. [PDF, 69KB]

(2005) Bart Mellebeek, Anna Khasin, Josef Van Genabith, & Andy Way: TransBooster: boosting the performance of wide-coverage machine translation systems. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 189-197. [PDF, 65KB]

(2005) R. Mahesh K. Sinha: Integrating CAT and MT in AnglaBharti-II architecture. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 235-244. [PDF, 205KB]

Quality measures see Evaluation measures and metrics

Questions see Interrogatives and questions

Rapid development of MT systems

(2010) William D.Lewis: Haitian Creole: how to build and ship an MT engine from scratch in 4 days, 17 hours, & 30 minutes. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 518KB]; presentation: 24 slides [PDF, 630KB]

(2010) William Lewis: Haitian Creole: developing MT for a low data language. Translingual Europe 2010, Hotel Maritim, Berlin, Germany, Monday June 7th 2010; 24pp. [PDF, 3425KB]

(2010) Andrejs Vasiljevs: LetsMT! – towards cloud based service for MT generation. Translingual Europe 2010, Hotel Maritim, Berlin, Germany, Monday June 7th 2010; 15pp. [PDF, 368KB]

(2010) Jernej Vičič & Petr Homola: Speeding up the implementation process of a shallow transfer machine translation system. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 624KB]

(2008) Toni Badia, Maite Melero, & Oriol Valentin: Rapid deployment of a new METIS language pair: Catalan-English. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 122KB]

(2008) Christopher Dyer, Aaron Cordova, Alex Mont, & Jimmy Lin: Fast, easy, and cheap: construction of statistical machine translation models with MapReduce. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.199-207. [PDF, 228KB]

(2008) C.-L. Kao, S.Saleem, R.Prasad, F.Choi, P.Natarajan, D.Stallard, K.Krstovski, & M.Kamali: Rapid development of an English/Farsi speech-to-speech translation system. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp.166-173 [PDF, 143KB]; presentation [PDF, 95KB]

(2008) Christian Monson, Ariadna Font Llitjós, Vamshi Ambati, Lori Levin, Alon Lavie, Alison Alvarez, Roberto Aranovich, Jaime Carbonell, Robert Frederking, Erik Peterson, & Katharina Probst: Linguistic structure and bilingual informants help induce machine translation of lesser-resourced languages. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 141KB]

(2007) Nguyen Bach, Matthias Eck, Paisarn Charoenpornsawat, Thilo Köhler, Sebastian Stüker, ThuyLinh Nguyen, Roger Hsiao, Alex Waibel, Stephan Vogel, Tanja Schultz, & Alan W.Black: The CMU TransTac 2007 eyes-free two-way speech-to-speech translation system. IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 8pp. [PDF, 489KB]; presentation [PDF, 1043KB]

(2007) Tanja Schultz, Alan W.Black, Sameer Badaskar, Matthew Hornyak, & John Kominek: SPICE: web-based tools for rapid language adaptation in speech processing systems. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2125-2128; abstract [PDF, 23KB]

(2007) Sylvain Surcin, Elke Lange, & Jean Senellart: Rapid development of new language pairs at SYSTRAN. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.443-449 [PDF, 114KB]

(2007) Svetlana Sheremetyeva: On portability of resources for a quick ramp up of multilingual MT of patent claims. MT Summit XI Workshop on patent translation, 11 September 2007, Copenhagen, Denmark; pp.28-33. [PDF, 66KB]

(2006) Ahmed Abdelali, James Cowie, Steve Helmreich, Wanying Jin, Maria Pilar Milagros, Bill Ogden, Hamid Mansouri Rad & Ron Zacharski: Guarani: a case study in resource development for quick ramp-up MT. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.1-9 [PDF, 344KB]

(2005) Herman A. Engelbrecht & Tanja Schultz: Rapid development of an Afrikaans English speech-to-speech translator. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2005], 24-25 October, 2005, Pittsburgh, PA, USA; 8pp. [PDF, 171KB]

(2005) Hemali Majithia, Philip Rennart & Evelyne Tzoukermann: Rapid ramp-up for statistical machine translation: minimal training for maximal coverage. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.438-444. [PDF, 160KB]

Reduplicatives

(2005) R. M. K. Sinha & Anil Thakur: Dealing with replicative words in Hindi for machine translation to English. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.157-164. [PDF, 298KB]

Reference (see also Anaphora resolution)

(2009) David Farwell & Lluís Padró: FreeLing: from a multilingual open-source analyzer suite to an EBMT platform. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.37-43. [PDF, 372KB]; presentation [PDF of PPT, 331KB]

(2008) Colin Cherry: Cohesive phrase-based decoding for statistical machine translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 72-80. [PDF, 179KB]

(2008) Imed Zitouni & Radu Florian: Mention detection crossing the language barrier. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.600-609. [PDF, 149KB]

(2007) Anja Belz, Albert Gatt, Ehud Reiter & Jette Viethen: The attribute selection for generation of referring expressions challenge. [Introduction to Shared Task Evaluation Challenge.] MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.73-74 [PDF, 438KB]

(2007) Anja Belz & Albert Gatt: The attribute selection for GRE challenge: overview and evaluation results. MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.75-83 [PDF, 438KB]

(2007) Bernd Bohnet: IS-FBN, IS-FBS, IS-IAC: the adaptation of two classic algorithms for the generation of referring expressions in order to produce expressions like humans do. MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.84-86 [PDF, 676KB]

(2007) Raquel Hervás & Pablo Gervás: NIL: attribute selection for matching the task corpus using relative attribute groupings obtained from the test data. MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.87-89 [PDF, 694KB]

(2007) J.D.Kelleher: DIT: frequency based incremental attribute selection for GRE. MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.90-91 [PDF, 502KB]

(2007) Advaith Siddharthan & Ann Copestake: Evaluating an open-domain GRE algorithm on closed domains. System IDs: CAM-B, CAM-T, CAM-BU and CAM-TU. MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.92-94 [PDF, 727KB]

(2007) Philipp Spanger, Kurosawa Takahiro, & Tokunaga Takenobu: TITCH: attribute selection based on discrimination power and frequency. MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.98-100 [PDF, 688KB]

(2007) Mariët Theune, Pascal Touset, Jette Viethen, & Emiel Krahmer: Cost-based attribute selection for GRE (GRAPH-SC/GRAPH-FP). MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.95-97 [PDF, 721KB]

(2007) Kees van Deemter & Albert Gatt: Content determination in GRE: evaluating the evaluator. MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.101-103 [PDF, 754KB]

Reference translations see Evaluation measures and metrics

Registers see Dialects and registers

Re-ordering (see also Generation, Syntactic transfer)

(2010) P.V.S.Avinesh: A data mining approach to learn reorder rules for SMT. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Student Research Workshop… June 2, 2010, Los Angeles, California; pp.52-57. [PDF, 114KB]

(2010) Marta R.Costa-jussà & José A.R.Fonollosa: Using linear interpolation and weighted reordering hypotheses in the Moses system. LREC 2010: proceedings of the seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1712-1718. [PDF, 389KB]

(2010) Jinhua Du & Andy Way: The impact of source-side syntactic reordering on hierarchical phrase-based SMT. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 746KB]; presentation: 34 slides [PDF, 540KB]

(2010) Chris Dyer & Philip Resnik: Context-free reordering, finite-state translation. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.858-866. [PDF, 272KB]

(2010) Niyu Ge: A direct syntax-driven reordering model for phrase-based machine translation. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.849-857. [PDF, 161KB]

(2010) Maxim Khalilov & Khalil Sima’an: Source reordering using MaxEnt classifiers and supertags. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 605KB]

(2009) Nguyen Bach, Qin Gao, & Stephan Vogel: Source-side dependency tree reordering models and subtree movements and constraints. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 17-24. [PDF, 211KB]

(2009) Ibrahim Badr, Rabih Zbib, & James Glass: Syntactic phrase reordering for English-to-Arabic statistical machine translation.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.86-93. [PDF, 115KB]

(2009) Srinivas Bangalore, Stephan Kanthak, & Patrick Haffner: Statistical machine translation through global lexical selection and sentence reconstruction.  In: Cyril Goutte, Nicola Cancedda, Marc Dymetman, & George Foster (eds.) Learning machine translation. (Cambridge, Mass.: The MIT Press, 2009); pp.185-203.

(2009) Alexandra Birch, Phil Blunsom & Miles Osborne: A quantitative analysis of reordering phenomena. Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.197-205. [PDF, 808KB]

(2009) Pi-Chuan Chang, Huihsin Tseng, Dan Jurafsky, & Christopher D.Manning: Discriminative reordering with Chinese grammatical features. Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.51-59. [PDF, 558KB]

(2009) Han-Bin Chen, Jian-Cheng Wu & Jason S.Chang: Learning bilingual linguistic reordering model for statistical machine translation.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.254-262. [PDF, 333KB]

(2009) John DeNero, Adam Pauls, & Dan Klein: Asynchronous binarization for synchronous grammars. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Short Papers, Suntec, Singapore, 4 August 2009; pp.141-144. [PDF, 183KB]

(2009) Arantza Díaz de Ilarraza, Gorka Labaka, & Kepa Sarasola: Reordering on Spanish-Basque SMT. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 207-213. [PDF, 327KB]

(2009) Jakob Elming & Nizar Habash: Syntactic reordering for English-Arabic phrase-based machine translation. Proceedings of the EACL 2009 Workshop on Computational Approaches to Semitic Languages, Athens, Greece, 31 March 2009; pp.69-77. [PDF, 155KB]

(2009) Spence Green, Conal Sathi & Christopher D.Manning: NP subject detection in verb-initial Arabic clauses. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp.  [PDF, 729KB]

(2009) Kei Hashimoto, Hirofumi Yamamoto, Hideo Okuma, Eiichiro Sumita, & Keiichi Tokuda: Reordering model using syntactic information of a source tree for statistical machine translation.  Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.69-77. [PDF, 383KB]

(2009) Kenneth Heafield, Greg Hanneman, & Alon Lavie: Machine translation system combination with flexible word ordering. Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.56-60. [PDF, 104KB]

(2009) Hieu Hoang & Philipp Koehn: Improving mid-range reordering using templates of factors. EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.372-379. [PDF, 184KB]

(2009) Maxim Khalilov & José A.R.Fonollosa: N-gram-based statistical machine translation versus syntax augmented machine translation: comparison and system combination.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.424-432. [PDF, 564KB]

(2009) Maxim Khalilov, José A.R.Fonollosa, & Mark Dras: A new subtree-transfer approach to syntax-based reordering for statistical machine translation. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.197-204. [PDF, 1078KB]

(2009) Philipp Koehn & Barry Haddow: Edinburgh’s submission to all tracks of the WMT2009 shared task with reordering and speed improvements to Moses.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.160-164. [PDF, 105KB]

(2009) Jin-Ji Li, Jungi Kim, Dong-Il Kim, & Jong-Hyeok Lee: Chinese syntactic reordering for adequate generation of Korean verbal phrases in Chinese-to-Korean SMT.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.190-196. [PDF, 323KB]

(2009) Hwidong Na, Jin-Ji Li, Jungi Kim, & Jong-Hyeok Lee: Improving fluency by reordering target constituents using MST parser in English-to-Japanese phrase-based SMT. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.276-283. [PDF, 381KB]

(2009) Toshiaki Nakazawa & Sadao Kurohashi: Statistical phrase alignment model using dependency relation probability.  Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.10-18. [PDF, 670KB]

(2009) Vinh Van Nguyen, Akira Shimazu, Minh Le Nguyen & Thai Phuong Nguyen: Improving a lexicalized hierarchical reordering model using maximum entropy. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 89-96. [PDF, 172KB]

(2009) Yizhao Ni, Craig J.Saunders, Sandor Szedmak, & Mahesan Niranjan: Handling phrase reorderings for machine translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 4 August 2009; pp.241-244. [PDF, 465KB]

(2009) Jan Niehues, Teresa Herrmann, Muntsin Kolss & Alex Waibel: The Universität Karlsruhe translation system for the EACL-WMT 2009. Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.80-84. [PDF, 92KB]

(2009) Jan Niehues & Muntsin Kolss: A POS-based model for long-range reorderings in SMT. Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.206-214. [PDF, 200KB]

(2009) Maja Popović, David Vilar, Daniel Stein, Evgeny Matusov & Hermann Ney: The RWTH machine translation system for WMT 2009.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.66-69. [PDF, 210KB]

(2009) Hendra Setiawan, Min-Yen Kan, Haizhou Li, & Philip Resnik: Topological ordering of function words in hierarchical phrase-based translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.324-332. [PDF, 504KB]

(2009) Roy Tromble & Jason Eisner: Learning linear ordering problems for better translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1007-1016. [PDF, 178KB]

(2009) Peng Xu, Jaeho Kang, Michael Ringgaard, & Franz Och: Using a dependency parser to improve SMT for subject-object-verb languages. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.245-253. [PDF, 273KB]

(2009) Sirvan Yahyaei & Christof Monz: Decoding by dynamic chunking for statistical machine translation. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.160-167. [PDF, 127KB]

(2009) Hui Zhang, Min Zhang, Haizhou Li, Aiti Aw, & Chew Lim Tan: Forest-based tree sequence to string translation model. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.172-180. [PDF, 254KB]

(2009) Min Zhang & Haizhou Li: Tree kernel-based SVM with structured syntactic knowledge for BTG-based phrase reordering. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.698-707. [PDF, 294KB]

(2008) Marta R. Costa-jussà & José A.R.Fonollosa: Computing multiple weighted reordering hypotheses for a statistical machine translation phrase-based system. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.82-88. [PDF, 623KB]

(2008) Marta R.Costa-jussà, José A.R.Fonollosa, & Enric Monte: Using reordering in statistical machine translation based on recursive alignment block classification.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 96KB]

(2008) Josep M. Crego & Nizar Habash: Using shallow syntax information to improve word alignment and reordering for SMT. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.53-61. [PDF, 200KB]

(2008) Jakob Elming: Syntactic reordering integrated with phrase-based SMT. Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.46-54. [PDF, 141KB]

(2008) Jakob Elming: Syntactic reordering integrated with phrase-based SMT. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.209-216. [PDF, 140KB]

(2008) Michel Galley & Christopher D.Manning: A simple and effective hierarchical phrase reordering model. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.848-856. [PDF, 230KB]

(2008) Niyu Ge, Abe Ittycheriah, & Kishore Papineni: Multiple reorderings in phrase-based machine translation. Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.61-68. [PDF, 1-6KB]

(2008) Yanqing He & Chengqing Zong: A generalized reordering model for phrase-based statistical machine translation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.117-124. [PDF, 661KB]

(2008) Hieu Hoang: Short range re-ordering using POS tags. Second Machine Translation Marathon, Wandlitz, Berlin, Germany, 12-17 May 2008; 24 slides. [PDF, 66KB]

(2008) Jason Katz-Brown & Michael Collins: Syntactic reordering in preprocessing for Japanese→English translation: MIT system description for NTCIR-7 patent translation task. Proceedings of NTCIR-7 Workshop Meeting, December 16-19, 2008, Tokyo, Japan; pp. 409-414. [PDF, 424KB]

(2008) Maxim Khalilov, Adolfo Hernández H., Marta R. Costa-jussà, Josep M. Crego, Carlos A. Henríquez Q., Patrik Lambert, José A.R. Fonollosa, José B. Mariño, & Rafael E. Banchs: The TALP-UPC ngram-based statistical machine translation systems for ACL-WMT 2008. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.127-130. [PDF, 128KB]

(2008) Vassilina Nikoulina & Marc Dymetman: Experiments in discriminating phrase-based translations on the basis of syntactic coupling features. Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.55-60. [PDF, 98KB]

(2008) Vassilina Nikoulina & Marc Dymetman: Using syntactic coupling features for discriminating phrase-based translations (WMT-08 shared translation task). ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.159-162. [PDF, 68KB]

(2008) Slav Petrov, Aria Haghighi, & Dan Klein: Coarse-fine syntactic machine translation using language projections.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.108-116. [PDF, 306KB]

(2008) Deyi Xiong, Min Zhang, Aiti Aw, & Haizhou Li: A linguistically annotated reordering model for BTG-based statistical machine translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Short papers, June 16-17, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 149-152. [PDF, 319KB]

(2008) Vineet Yadav, Sriram Venkatapathy, & Karthik Gali: Comparative analysis of phrasal cohesion of English-Hindi with English-French. ICON-2008: 6th International Conference on Natural Language Processing, CDAC Pune, India, 20-22 December 2008; 10pp. [PDF, 76KB]

(2008) Hirofumi Yamamoto, Hideo Okuma, & Eiichiro Sumita: Imposing constraints from the source tree on ITG constraints for SMT. Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.1-9. [PDF, 171KB]

(2008) Mikio Yamamoto, Jyunya Norimatsu, Mitsuru Koshikawa, Takahiro Fukutomi, Taku Nishio, Kugatsu Sadamitsu, & Takehito Utsuro: Diversion of hierarchical phrases as reordering templates. Proceedings of NTCIR-7 Workshop Meeting, December 16-19, 2008, Tokyo, Japan; pp. 466-470. [PDF, 917KB]

(2008) Dakun Zhang, Le Sun, & Wenbo Li: A structured prediction approach for statistical machine translation. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.649-654. [PDF, 521KB]

(2008) Jiajun Zhang, Chengqing Zong, & Shoushan Li: Sentence type based reordering model for statistical machine translation. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.1089-1096. [PDF, 151KB]

(2008) Min Zhang, Hongfei Jiang, Aiti Aw, Haizhou Li, Chew Lim Tan, & Sheng Li: A tree sequence alignment-based tree-to-tree translation model. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 559-567. [PDF, 317KB]

(2008) Bing Zhao & Yaser Al-Onaizan: Generalizing local and non-local word-reordering patterns for syntax-based machine translation.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.572-581. [PDF, 538KB]

(2007) Srinivas Bangalore, Patrick Haffner, & Stephan Kanthak: Statistical machine translation through global lexical selection and sentence reconstruction. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 152-159 [PDF, 458KB]

(2007) Pi-Chuan Chang & Kristina Toutanova: A discriminative syntactic word order model for machine translation. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 9-16 [PDF, 235KB]

(2007) Marta R. Costa-jussà, Josep M.Crego, Patrik Lambert, Maxim Khalilov, José A.R.Fonollosa, José B.Mariño, & Rafael E.Banchs: Ngram-based statistical machine translation enhanced with multiple weighted reordering hypotheses.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 167-170 [PDF, 208KB]

(2007) Marta R.Costa-jussà & José A.R. Fonollosa: Analysis of statistical and morphological classes to generate weighted reordering hypotheses on a statistical machine translation system.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 171-176 [PDF, 424KB]

(2007) John DeNero & Dan Klein: Tailoring word alignments to syntactic machine translation. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 17-24 [PDF, 415KB]

(2007) Markus Dreyer, Keith Hall, & Sanjeev Khudanpur: Comparing reordering constraints for SMT using efficient BLEU oracle computation. SSST, NAACL-HLT-2007 AMTA Workshop on Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.103-110 [PDF, 206KB]

(2007) Chi-Ho Li, Dongdong Zhang, Mu Li, Ming Zhou, Minghui Li, & Yi Guan: A probabilistic approach to syntax-based reordering for statistical machine translation. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 720-727 [PDF, 7105KB]

(2007) Kay Rottmann & Stephan Vogel: Word reordering in statistical machine translation with a POS-based distortion model. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.171-180 [PDF, 3725KB]; presentation [PDF, 1129KB]

(2007) Germán Sanchis & Francisco Casacuberta: Reordering via n-best lists for Spanish-Basque translation. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.191-198 [PDF, 309KB]; poster [PDF, 115KB]

(2007) Hendra Setiawan, Min-Yen Kan, & Haizhou Li: Ordering phrases with function words. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 712-719 [PDF, 213KB]

(2007) Chao Wang, Michael Collins, & Philipp Koehn: Chinese syntactic reordering for statistical machine translation.  EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 737-745. [PDF, 161KB]

(2007) Dongdong Zhang, Mu Li, Chi-Ho Li, & Ming Zhou: Phrase reordering model integrating syntactic knowledge for SMT. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 533-540. [PDF, 325KB]

(2007) Yuqi Zhang, Richard Zens, & Hermann Ney: Improved chunk-level reordering for statistical machine translation.  IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 8pp. [PDF, 255KB]; presentation [PDF, 1644KB]

(2007) Yuqi Zhang, Richard Zens, & Hermann Ney: Chunk-level reordering of source language sentences with automatically learned rules for statistical machine translation. SSST, NAACL-HLT-2007 AMTA Workshop on Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.1-8 [PDF, 165KB]

(2007) Simon Zwarts & Mark Dras: Syntax-based word reordering in phrase-based statistical machine translation: why does it work? MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.559-566 [PDF, 98KB]

(2006) Josep M. Crego, Adrià de Gispert, Patrik Lambert, Marta R. Costa-jussà, Maxim Khalilov, Rafael E.Banchs, José B.Mariño, & José A.R.Fonollosa: N-gram-based SMT system enhanced with reordering patterns.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 162-165 [PDF, 209KB]

(2006) Masaaki Nagata, Kuniko Saito, Kazuhide Yamamoto, & Kazuteru Ohashi: A clustered global phrase reordering model for statistical machine translation. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.713-720. [PDF, 182KB]

(2006) Deyi Xiong, Qun Liu, & Shouxun Lin: Maximum entropy based phrase reordering model for statistical machine translation. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.521-528. [PDF, 420KB]

(2006) Richard Zens & Hermann Ney: Discriminative reordering models for statistical machine translation.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 55-63 [PDF, 153KB]

(2006) Hao Zhang & Daniel Gildea: Inducing word alignments with bilexical synchronous trees. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.953-960. [PDF, 156KB]

(2006) Andreas Zollmann & Ashish Venugopal: Syntax augmented machine translation via chart parsing.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 138-141 [PDF, 99KB]

(2006) Simon Zwarts & Mark Dras: This phrase-based SMT system is out of order: generalized word reordering in machine translation. Australasian Language Technology Workshop 2006 (ALTW2006): Proceedings of the Workshop, 30 November - 1 December 2006, University of Sydney; pp.149-156. [PDF, 86KB]

(2005) Stephan Kanthak, David Vilar, Evgeny Matusov, Richard Zens, & Hermann Ney: Novel reordering approaches in phrase-based statistical machine translation. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 167-174. [PDF, 172KB]

Reusability

(2009) Kepa Sarasola: Matxin: developing sustainable machine translation for a less-resourced language [abstract]. Proceedings of the First International Workshop on Free/Open-Source Rule-Based Machine Translation, 2-3 November 2009, Universitat d’Alacant, Alacant, Spain; ed. Juan Antonio Pérez-Ortiz, Felipe Sánchez-Martínez, Francis M.Tyers; pp. 1.

(2009) Kevin Unhammer & Trond Trosterud: Reuse of free resources in machine translation between Nynorsk and Bokmål. Proceedings of the First International Workshop on Free/Open-Source Rule-Based Machine Translation, 2-3 November 2009, Universitat d’Alacant, Alacant, Spain; ed. Juan Antonio Pérez-Ortiz, Felipe Sánchez-Martínez, Francis M.Tyers; pp. 35-42. [PDF, 219KB]

(2008) I.Alegria, X.Arregi, A.Diaz de Ilarraza, G.Labaka, M.Lersundi, A.Mayor, & K.Sarasola: Strategies for sustainable MT for Basque: incremental design, reusability, standardization and open source. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.59-64. [PDF, 74KB]

(2008) Stephan Oepen: Grammar-based processing and re-usable hybrid MT – an experience report in the wilderness. Second Machine Translation Marathon, Wandlitz, Berlin, Germany, 12-17 May 2008; 32 slides. [PDF, 254KB]

(2008) Matthew Snover, Bonnie Dorr, & Richard Schwartz: Language and translation model adaptation using comparable corpora. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.857-866. [PDF, 132KB]

(2007) Svetlana Sheremetyeva: On portability of resources for a quick ramp up of multilingual MT of patent claims. MT Summit XI Workshop on patent translation, 11 September 2007, Copenhagen, Denmark; pp.28-33. [PDF, 66KB]

(2006) Jan Tore Lønning & Stephan Oepen: Re-usable tools for precision machine translation.  Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.53-56. [PDF, 213KB]

(2006) G. Craig Murray, Bonnie J. Dorr, Jimmy Lin, Jan Hajič, & Pavel Pecina: Leveraging reusability: cost-effective lexical acquisition for large-scale ontology translation. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.945-952. [PDF, 159KB]

(2006) Bogdan Sacaleanu & Günter Neumann: Cross-cutting aspects of cross-language question answering systems. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Workshop on Multilingual Question Answering (MLQA06), Trento, Italy, April 4, 2006; pp.15-22 [PDF, 652KB]

(2005) Ondřej Bojar, Petr Homola, & Vladislav Kuboň: Problems of reusing an existing MT system. IJCNLP-05: Second International Joint Conference on Natural Language Processing, 11-13 October 2005, Jeju Island, Republic of Korea; pp.179-184. [PDF, 73KB]

(2005) Ondřej Bojar, Petr Homola & Vladislav Kuboň: An MT system recycled. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.380-387. [PDF, 242KB]

Reversibility

(2009) Andrew Finch & Eiichiro Sumita: Bidirectional phrase-based statistical machine translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1124-1132. [PDF, 164KB]

(2007) Pierrette Bouillon, Glenn Flores, Marianne Starlander, Nikos Chatzichrisafis, Marianne Santaholma, Nikos Tsourakis, Manny Rayner, & Beth Ann Hockey: A bidirectional grammar-based medical speech translator. ACL 2007: Proceedings of the Workshop on Grammar-based approaches to spoken language processing (SPEECHGRAM 2007), June 29, 2007, Prague, Czech Republic; pp.41-48. [PDF, 216KB]

(2005) Michel Carl, Paul Schmidt, & Jörg Schütz: Reversible template-based Shake & Bake generation. MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Second Workshop on Example-Based Machine Translation; pp.17-25. [PDF, 367KB]

Rhetorical Structure Theory see Discourse representation

Role and reference grammar

(2009) Yasser Salem & Brian Nolan: Designing an XML lexicon architecture for Arabic machine translation based on role and reference grammar.  MEDAR 2009: 2nd International Conference on Arabic Language Resources & Tools, 22-23 April 2009, Cairo, Egypt; pp.221-229. [PDF, 691KB]

Rule-based methods (see also Interlingua methods, Transfer-based MT)

(2010) Yu Chen & Andreas Eisele: Hierarchical hybrid translation between English and German. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 570KB]; presentation: 18 slides [PDF, 1652KB]

(2010) Yu Chen & Andreas Eisele: Integrating a rule-based with a hierarchical translation system. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1746-1752. [PDF, 327KB]

(2010) Marta R.Costa-jussà, Mireia Farrús, José B.Mariño, & José A.R.Fonollosa: Automatic and human evaluation study of a rule-based and a statistical Catalan-Spanish machine translation systems. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1706-1711. [PDF, 335KB]

(2010) Julia Epiphantseva: PROMT: flexible and efficient management of translation quality. Translingual Europe 2010, Hotel Maritim, Berlin, Germany, Monday June 7th 2010; 27pp. [PDF, 689KB]

(2010) Mikel L.Forcada: Apertium: free/open-source rule-based machine translation. Presentation at Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 29 January, Dublin, Ireland; 38 slides. [PDF of PPT, 152KB]

(2010) Víctor M.Sánchez-Cartagena & Juan Antonio Pérez-Ortiz: Tradubi: open-source social translation for the Apertium machine translation platform. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.47-56. [PDF, 223KB]

(2010) Francis M.Tyers: Rule-based Breton to French machine translation. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 553KB]

(2010) Francis M.Tyers, Felipe Sánchez-Martínez, Sergio Ortiz-Rojas, & Mikel L.Forcada: Free/open-source resources in the Apertium platform for machine translation research and development. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.67-76. [PDF, 116KB]

(2009) Juan Antonio Pérez-Ortiz, Felipe Sánchez-Martínez, Francis M.Tyers (eds.): Proceedings of the First International Workshop on Free/Open-Source Rule-Based Machine Translation, 2-3 November 2009, Universitat d’Alacant, Alacant, Spain

(2009) Eneko Agirre, Aitziber Atutxa, Gorka Labaka, Mikel Lersundi, Aingeru Mayor, & Kepa Sarasola: Use of rich linguistic information to translate prepositions and grammatical cases to Basque. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.58-65. [PDF, 340KB]

(2009) Julia Aymerich & Hermes Camelo: The machine translation maturity model at PAHO. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 403-409. [PDF, 44KB]

(2009) Loic Dugast, Jean Senellart & Philipp Koehn: Selective addition of corpus-extracted phrasal lexical rules to a rule-based machine translation system. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.222-229. [PDF, 183KB]

(2009) Raymond Flournoy & Christine Duran: Machine translation and document localization at Adobe: from pilot to production. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 425-428. [PDF, 123KB]

(2009) Daniel Grasmick: contribution to panel “Will there be a winner?” Translingual Europe 2009, May 13-14, Prague, Czech Republic; 3pp. [PDF of PPT, 402KB]

(2009) Maxim Khalilov & José A.R.Fonollosa: N-gram-based statistical machine translation versus syntax augmented machine translation: comparison and system combination.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.424-432. [PDF, 564KB]

(2009) Selçuk Köprü & Adnan Yazıcı: Lattice parsing to integrate speech recognition and rule-based machine translation.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.469-477. [PDF, 185KB]

(2009) A.-L.Lagarda, V.Alabau, F.Casacuberta, R.Silva, & E.Díaz-de-Liaño: Statistical post-editing of a rule-based machine translation system. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.217-220. [PDF, 83KB]

(2009) Jin’ichi Murakami, Masato Tokuhisa, & Satoru Ikehara: Statistical machine translation adding pattern-based machine translation in Chinese-English translation. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 107-112. [PDF, 317KB]; presentation [PDF of PPT, 75KB]

(2009) Attila Novak: MorphoLogic’s submission for the WMT 2009 shared task.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.155-159. [PDF, 163KB]

(2009) Manny Rayner, Paula Estrella, Pierrette Bouillon, Beth Ann Hockey & Yukie Nakao: Using artificially generated data to evaluate statistical machine translation. ACL-IJCNLP 2009: Workshop on Grammar Engineering Across Frameworks (GEAF 2009), Proceedings of the workshop, 6 August 2009, Suntec, Singapore; pp.54-62. [PDF, 214KB]

(2009) Chakaveh Saedi, Mehrnoush Shamsfard, & Yasaman Motazedi: Automatic translation between English and Persian texts. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 1001KB]

(2009) Felipe Sánchez-Martínez & Mikel L.Forcada: Inferring shallow-transfer machine translation rules from small parallel corpora. Journal of Artificial Intelligence Research 34, pp. 605-635. [PDF, 706KB]

(2009) Holger Schwenk, Sadaf Abdul-Rauf, Loïc Barrault, & Jean Senellart: SMT and SPE machine translation system for WMT’09.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.130-134. [PDF, 73KB]

(2009) Francis M.Tyers: Rule-based augmentation of training data in Breton-French statistical machine translation. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.213-217. [PDF, 227KB]

(2009) Francis M. Tyers & Kevin Donnelly: apertium-cy: a collaboratively-developed RBMT system for Welsh to English. Prague Bulletin of Mathematical Linguistics 91, 2009; pp.57-66. [PDF, 134KB]  [presentation at MT Marathon 2009, 1376KB]

(2009) Francis M.Tyers, Linda Wiechetek, & Trond Trosterud: Developing prototypes for machine translation between two Sámi languages. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.120-127. [PDF, 340KB]

(2009) Hans Uszkoreit: Hybrid machine translation. Translingual Europe 2009, May 13-14, Prague, Czech Republic; 16pp. [PDF of PPT, 1407KB]

(2009) Hans Uszkoreit: Will there be a winner? [contribution to panel]. Translingual Europe 2009, May 13-14, Prague, Czech Republic; 5pp. [PDF of PPT, 1626KB]

(2009) Eric Wehrli, Luka Nerima & Yves Scherrer: Deep linguistic multilingual translation and bilingual dictionaries.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.90-94. [PDF, 71KB]

(2009) Eric Wehrli, Violeta Seretan, Luka Nerima, & Lorenza Russo: Collocations in a rule-based MT system: a case study evaluation of their translation adequacy. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.128-135. [PDF, 375KB]

(2008) A.S.Andreyeva: Lexical-functional correspondences and their use in the system of machine translation ETAP-3. Coling 2008: Proceedings of the Workshop on Cognitive Aspects of the Lexicon, 24 August 2008, Manchester, UK; pp.64-72. [PDF, 164KB]

(2008) Prachya Boonkwan & Thepchai Supnithi: Memory-inductive categorial grammar: an approach to gap resolution in analytic-language translation. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.80-87. [PDF, 339KB]

(2008) Jonathan H. Clark, Robert Frederking, & Lori Levin: Inductive detection of language features via clustering minimal pairs: towards feature-rich grammars in machine translation.  Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.78-86. [PDF, 246KB]

(2008) A. Diaz de Ilarraza, G. Labaka, & K. Sarasola: Statistical post-editing: a valuable method in domain adaptation of RBMT systems for less-resourced languages. MATMT 2008: Mixing Approaches to Machine Translation, Donostia-San Sebastian [Spain], February 14th 2008: Proceedings; pp. 35-40. [PDF, 364KB]

(2008) Loïc Dugast, Jean Senellart, & Philipp Koehn: Can we relearn an RBMT system? ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.175-178. [PDF, 87KB]

(2008) Andreas Eisele: Hybrid architectures for machine translation. Second Machine Translation Marathon, Wandlitz, Berlin, Germany, 12-17 May 2008; 29 slides. [PDF, 1167KB]

(2008) Andreas Eisele, Christian Federmann, Hans Uszkoreit, Hervé Saint-Amand, Martin Kay, Michael Jellinghaus, Sabine Hunsicker, Teresa Herrmann, & Yu Chen: Hybrid machine translation architectures within and beyond the EuroMatrix project. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.27-34. [PDF, 552KB]

(2008) Andreas Eisele, Christian Federmann, Hervé Saint-Amand, Michael Jellinghaus, Teresa Herrmann, & Yu Chen: Using Moses to integrate multiple rule-based machine translation engines into a hybrid system. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.179-182. [PDF, 90KB]

(2008) Petra Gieselmann: Architecture of the Lucy translation system. Second Machine Translation Marathon, Wandlitz, Berlin, Germany, 12-17 May 2008; 28 slides. [PDF, 225KB]

(2008) Petr Homola: A distributed database for mobile NLP applications. ACL-08: HLT. Workshop on Mobile Language Processing. Proceedings of the workshop, June 20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 27-28. [PDF, 71KB]

(2008) Tatsuya Izuha, Akira Kumano, & Yuka Kuroda: Toshiba rule-based machine translation system at NTCIR-7 PAT MT. Proceedings of NTCIR-7 Workshop Meeting, December 16-19, 2008, Tokyo, Japan; pp. 430-434. [PDF, 545KB]

(2008) Janne Bondi Johannessen, Torbjørn Nordgård, & Lars Nygaard: Evaluation of linguistics-based translation.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 249KB]

(2008) Brianna Laugher & Ben MacLeod: Machine translation for Indonesian and Tagalog. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.397-401. [PDF, 518KB]

(2008) Stephan Oepen: Grammar-based processing and re-usable hybrid MT – an experience report in the wilderness. Second Machine Translation Marathon, Wandlitz, Berlin, Germany, 12-17 May 2008; 32 slides. [PDF, 254KB]

(2008) Nicola Ueffing, Jens Stephan, Evgeny Matusov, Loïc Dugast, George Foster, Roland Kuhn, Jean Senellart, & Jin Yang: Tighter integration of rule-based and statistical MT in serial system combination. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.913-920. [PDF, 205KB]

(2008) Clare R.Voss, Matthew Aguirre, Jeffrey Micher, Richard Chang, Jamal Laoudi, & Reginald Hobbs: Boosting performance of weak MT engines automatically: using MT output to align segments & build statistical post-editors. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.192-201. [PDF, 506KB]

(2007) Pierrette Bouillon, Glenn Flores, Marianne Starlander, Nikos Chatzichrisafis, Marianne Santaholma, Nikos Tsourakis, Manny Rayner, & Beth Ann Hockey: A bidirectional grammar-based medical speech translator. ACL 2007: Proceedings of the Workshop on Grammar-based approaches to spoken language processing (SPEECHGRAM 2007), June 29, 2007, Prague, Czech Republic; pp.41-48. [PDF, 216KB]

(2007) Loïc Dugast, Jean Senellart, & Philipp Koehn: Statistical post-editing on  SYSTRAN’s rule-based translation system.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 220-223 [PDF, 128KB]

(2007) Yu Chen, Andreas Eisele, Christian Federmann, Eva Hasler, Michael Jellinghaus, & Silke Theison: Multi-engine machine translation with an open-source decoder for statistical machine translation.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 193-196 [PDF, 96KB]

(2007) Andreas Eisele: Hybrid machine translation: combining rule-based and statistical MT systems. First Machine Translation Marathon, Edinburgh, April 16-20, 2007; 36pp. [PDF of PPT presentation, 777KB]

(2007) Ehara Terumasa: Rule based machine translation combined with statistical post editor for Japanese to English patent translation. MT Summit XI Workshop on patent translation, 11 September 2007, Copenhagen, Denmark; pp.13-18. [PDF, 355KB]

(2007) Eduard Hovy: Investigating why BLEU penalizes non-statistical systems. MT Summit XI Workshop: Automatic procedures in MT evaluation, 11 September 2007, Copenhagen, Denmark, [Proceedings]; 10pp. [PDF of PPT presentation, 265KB]

(2007) Xiaoguang Hu, Haifeng Wang, & Hua Wu: Using RBMT systems to produce bilingual corpus for SMT. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 287-295. [PDF, 131KB]

(2007) Pierre Isabelle, Cyril Goutte, & Michel Simard: Domain adaptation of MT systems through automatic post-editing.  MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.255-261 [PDF, 151KB]

(2007) Stephan Oepen, Erik Velldal, Jan Tore Lønning, Paul Meurer, Victoria Rosén, & Dan Flickinger: Towards hybrid quality-oriented machine translation – on linguistics and probabilities in MT. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.144-153 [PDF, 328KB]

(2007) Anna Sågvall Hein: Rule-based and statistical machine translation with a focus on Swedish [abstract]. Invited talk at TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; 1p. [PDF, 15KB]

(2007) Michel Simard, Nicola Ueffing, Pierre Isabelle, & Roland Kuhn: Rule-based translation with statistical phrase-based post-editing. ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 203-206 [PDF, 130KB]

(2007) Sylvain Surcin, Elke Lange, & Jean Senellart: Rapid development of new language pairs at SYSTRAN. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.443-449 [PDF, 114KB]

(2006) Krzysztof Jassem & Tomasz Kowalski: An algorithm for extracting translation rules from scarce bilingual corpora. Proceedings of the International Multiconference on Computer Science and Information Technology, vol.1: XXII Autumn Meeting of Polish Information Processing Society, November 6-10, 2006, Wisla, Poland; pp.67-73. [PDF, 368KB]

(2006) Gregor Thurmair: Using corpus information to improve MT quality. LREC-2006: Fifth International Conference on Language Resources and Evaluation. Third International Workshop on Language Resources for Translation Work, Research & Training (LR4Trans-III), Genoa, Italy, 28 May 2006; pp.45-48. [PDF, 371KB]

(2006) Jean Senellart & John S. White: First strategies for integrating hybrid approaches into established systems (contribution to panel on “Hybrid machine translation”).  AMTA 2006: 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; [PDF of PPT presentation, 68KB]

(2006) Jiang Zhu & Haifeng Wang: The effect of translation quality in MT-based cross-language information retrieval. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.593-600. [PDF, 110KB]

(2005) Zhu Jiang & Wang Haifeng: The effect of adding rules into the rule-based MT system. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.298-304. [PDF, 378KB]

(2005) Gábor Prószéky: An approach to machine translation via the rule-to-rule hypothesis. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 219-225. [PDF, 147KB]

(2005) Manny Rayner, Pierrette Bouillon, Nikos Chatzichrisafis, Beth Ann Hockey, Marianne Santaholma, Marianne Starlander, Hitoshi Isahara, Kyoko Kanzaki, & Yukie Nakao: A methodology for comparing grammar-based and robust approaches to speech understanding. Interspeech 2005 - Eurospeech: 9th European  Conference on  Speech Communication and Technology, Lisbon, Portugal, September 4-8, 2005; pp.1877-1880 [PDF, 52KB]; abstract [PDF, 55KB]

Scarce resources (see also Language resources, Rapid development of MT)

(2010) Sisay Adugna & Andreas Eisele: English-Oromo machine translation: an experiment using a statistical approach. LREC 2010: proceedings of the seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2196-2199. [PDF, 368KB]

(2010) Vamshi Ambati, Stephen Vogel, & Jaime Carbonell: Active learning and crowd-sourcing for machine translation. LREC 2010: proceedings of the seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2169-2174. [PDF, 436KB]

(2010) Ondřej Bojar, Pavel Straňák, & Daniel Zeman: Data issues in English-to-Hindi machine translation. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1771-1777. [PDF, 557KB]

(2010) Md.Zahurul Islam, Jörg Tiedemann & Andreas Eisele: English to Bangla phrase-based machine translation.  EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 601KB]

(2010) William Lewis: Haitian Creole: developing MT for a low data language. Translingual Europe 2010, Hotel Maritim, Berlin, Germany, Monday June 7th 2010; 24pp. [PDF, 3425KB]

(2010) William D.Lewis: Haitian Creole: how to build and ship an MT engine from scratch in 4 days, 17 hours, & 30 minutes. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 518KB]; presentation: 24 slides [PDF, 630KB]

(2010) Jan Niehues & Alex Waibel: Domain adaptation in statistical machine translation using factored translation models. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 7pp. [PDF, 581KB]; presentation: 27 slides [PDF, 887KB]

(2010) Yulia Tsvetkov & Shuly Wintner: Automatic acquisition of parallel corpora from websites with dynamic content.  LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3389-3392. [PDF, 447KB]

(2010) Francis M.Tyers: Rule-based Breton to French machine translation. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 553KB]

(2010) Andrejs Vasiljevs: LetsMT! – towards cloud based service for MT generation. Translingual Europe 2010, Hotel Maritim, Berlin, Germany, Monday June 7th 2010; 15pp. [PDF, 368KB]

(2009) Eneko Agirre, Aitziber Atutxa, Gorka Labaka, Mikel Lersundi, Aingeru Mayor, & Kepa Sarasola: Use of rich linguistic information to translate prepositions and grammatical cases to Basque. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.58-65. [PDF, 340KB]

(2009) Vamshi Ambati & Jaime Carbonell: Proactive learning for building machine translation systems for minority languages. NAACL-HLT-2009: Active Learning for Natural Language Processing (ALNLP-09), Proceedings of the workshop, June 5, 2009, Boulder, Colorado; pp.58-61. [PDF, 101KB]

(2009) Rodolfo Delmonte, Antonella Bristot, Sara Tonelli, & Emanuele Pianta: English/Veneto resource poor machine translation with STILVEN.  ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.82-89 [abstract]

(2009) Dmitriy Genzel, Klaus Macherey & Jakob Uszkoreit: Creating a high-quality machine translation system for a low-resource language: Yiddish. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 41-48. [PDF, 157KB]

(2009) Hendrik J.Groenewald & Wildrich Fourie: Introducing the Autshumato integrated translation environment. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.190-196. [PDF, 410KB]

(2009) Gholamreza Haffari, Maxim Roy, & Anoop Sarkar: Active learning for statistical phrase-based machine translation. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.415-423. [PDF, 190KB]

(2009) Preslav Nakov & Hwee Tou Ng: Improved statistical machine translation for resource-poor languages using related resource-rich languages. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1358-1367. [PDF, 209KB]

(2009) Kepa Sarasola: Matxin: developing sustainable machine translation for a less-resourced language [abstract]. Proceedings of the First International Workshop on Free/Open-Source Rule-Based Machine Translation, 2-3 November 2009, Universitat d’Alacant, Alacant, Spain; ed. Juan Antonio Pérez-Ortiz, Felipe Sánchez-Martínez, Francis M.Tyers; pp. 1.

(2009) Stephen Soderland, Christopher Lim, Mausam, Bo Qin, Oren Etzioni, & Jonathan Pool: Lemmatic machine translation. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.128-135. [PDF, 214KB]

(2009) Jörg Tiedemann: Translating questions for cross-lingual QA. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.112-119. [PDF, 370KB]

(2009) Varga István & Yokoyama Shoichi: Bilingual dictionary generation for low-resourced language pairs. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.862-870. [PDF, 199KB]

(2008) Steve DeNeefe, Ulf Hermjakob & Kevin Knight: Overcoming vocabulary sparsity in MT using lattices. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.89-96. [PDF, 492KB]

(2008) Emil Ettelaie, Panayiotis G.Georgiou, & Shrikanth S.Narayanan: Mitigation of data sparsity in classifier-based translation. Coling 2008: Proceedings of the Workshop on Speech Processing for Safety Critical Translation and Pervasive Applications, 23 August 2008, Manchester, UK; pp.1-4. [PDF, 138KB]

(2008) Reginald L.Hobbs & Clare R.Voss: Designing and executing machine translation workflows through the Kepler framework. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.380-389.  [PDF of PPT presentation, 1204KB]

(2008) Andreas Kathol & Jing Zheng: Strategies for building a Farsi-English SMT system from limited resources. Interspeech 2008: 9th Annual Conference of the International Speech Communication Association, Brisbane, Australia, September 22-26, 2008; pp.2731-2734; abstract [PDF, 62KB]

(2008) Karine Megerdoomian & Dan Parvaz: Low-density language bootstrapping: the case of Tajiki Persian.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 331KB]

(2008) Rada Mihalcea & Vivi Nastase: How to add a new language on the NLP map: building resources and tools for languages with scarce resources. [Abstract of tutorial at] IJCNLP 2008: Sixth SIGHAN Workshop on Chinese Language Processing, Proceedings of the workshop, 11-12 January 2008, Hyderabad, India; p.938. [PDF, 249KB]

(2008) Michael Mohler & Rada Mihalcea: BABYLON parallel text builder: gathering parallel texts for low-density languages. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 63KB]

(2008) Christian Monson, Ariadna Font Llitjós, Vamshi Ambati, Lori Levin, Alon Lavie, Alison Alvarez, Roberto Aranovich, Jaime Carbonell, Robert Frederking, Erik Peterson, & Katharina Probst: Linguistic structure and bilingual informants help induce machine translation of lesser-resourced languages. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 141KB]

(2008) Virach Sornlertlamvanich: Cross language resource sharing. IJCNLP 2008: Workshop on NLP for Less Privileged Languages. Proceedings of the workshop, 11 January 2008, Hyderabad, India; pp.3-4. [PDF, 37KB]

(2008) Virach Sornlertlamvanich, Thatsanee Charoenporn, Chumpol Mokarat, Hammam Riza, Hitoshi Isahara, & Purev Jaimal: Synset assignment for bi-lingual dictionary with limited resource. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.673-678. [PDF, 549KB]

(2008) Kathrin Spreyer, Jonas Kuhn, & Bettina Schrader: Identification of comparable argument-head relations in parallel corpora.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 138KB]

(2008) Vincent Vandeghinste, Peter Dirix, Ineke Schuurman, Stella Markantonatou, Sokratis Sofianopoulos, Marina Vassiliou, Olga Yannoutsou, Toni Badia, Maite Melero, Gemma Boleda, Michael Carl, & Paul Schmidt: Evaluation of a machine translation system for low resource languages: METIS-II.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 78KB]

(2008) Daniel Zeman & Philip Resnik: Cross-language parser adaptation between related languages. IJCNLP 2008: Workshop on NLP for Less Privileged Languages. Proceedings of the workshop, 11 January 2008, Hyderabad, India; pp.35-42. [PDF, 150KB]

(2008) Imed Zitouni & Radu Florian: Mention detection crossing the language barrier. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.600-609. [PDF, 149KB]

(2007) Alison Alvarez, Lori Levin, Robert Frederking, & Jill Lehman: An assessment of language elicitation without the supervision of a linguist. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.1-10 [PDF, 415KB]; presentation [PDF, 412KB]

(2007) Bogdan Babych, Anthony Hartley, & Serge Sharoff: Translating from under-resourced languages: comparing direct transfer against pivot translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.29-35 [PDF, 197KB]

(2007) Krzysztof Jassem & Tomasz Kowalski: Machine translation using scarce bilingual corpora. TASK Quarterly 11, no.1-2, 21-33. [PDF, 216KB]

(2007) Chris Quirk, Raghavendra Udupa U., & Arul Menezes: Generative models of noisy translations with applications to parallel fragment extraction. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.377-384 [PDF, 249KB]

(2006) proceedings of 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, LREC-2006: Fifth International Conference on Language Resources and Evaluation, Genoa, Italy, 23 May 2006. [PDF, 5391KB]

(2006) Andreas Eisele: Parallel corpora and phrase-based statistical machine translation for new language pairs via multiple intermediaries.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.845-848 [PDF, 329KB]

(2006) Emmanuel Giguet & Pierre-Sylvain Luquet: Multilingual lexical database generation from parallel texts in 20 European languages with endogenous resources. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.271-278. [PDF, 427KB]

(2006) Krzysztof Jassem & Tomasz Kowalski: An algorithm for extracting translation rules from scarce bilingual corpora. Proceedings of the International Multiconference on Computer Science and Information Technology, vol.1: XXII Autumn Meeting of Polish Information Processing Society, November 6-10, 2006, Wisla, Poland; pp.67-73. [PDF, 368KB]

(2006) Ayu Purwarianti, Masatoshi Tsuchiya, & Seiichi Nakagawa: Indonesian-Japanese CLIR using only limited resource. Coling-ACL 2006: Proceedings of the workshop on How can computational linguistics improve information retrieval? Sydney, July 2006; pp.1-8. [PDF, 80KB]

(2006) Jason Riesa, Behrang Mohit, Kevin Knight, & Daniel Marcu: Building an English-Iraqi Arabic machine translation system for spoken utterances with limited resources. Interspeech 2006: ICSLP Ninth International Conference on Spoken Language Processing, Pittsburgh, PA, USA, September 17-21, 2006, paper 2012; abstract [PDF, 80KB]

(2006) Jason Riesa & David Yarowsky: Minimally supervised morphological segmentation with applications to machine translation. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.185-192 [PDF, 613KB]

(2006) Vincent Vandeghinste, Ineke Schuurman, Michael Carl, Stella Markantonatou, & Toni Badia: METIS-II: machine translation for low-resource languages.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.1284-1289 [PDF, 409KB]

(2006) Haifeng Wang, Hua Wu, & Zhanyi Liu: Word alignment for languages with scarce resources using bilingual corpora of other language pairs. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.874-881. [PDF, 155KB]

(2005) Hiroshi Echizen-ya, Kenji Araki, & Yoshio Momouchi: Automatic acquisition of bilingual rules for extraction of bilingual word pairs from parallel corpora. ACL-SIGLEX-2005: Workshop on Deep Lexical Acquisition, University of Michigan, Ann Arbor, 30 June 2005; pp. 87-96.  [PDF, 726KB]

(2005) Andreas Kathol, Kristin Precoda, Dimitra Vergyri, Wen Wang, & Susanne Riehemann: Speech translation for low-resource languages: the case of Pashto. Interspeech 2005 - Eurospeech: 9th European  Conference on  Speech Communication and Technology, Lisbon, Portugal, September 4-8, 2005; pp.2273-2276; abstract [PDF, 44KB]

(2005) Jonas Kuhn: Parsing word-aligned parallel corpora in a grammar induction context. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 17-25. [PDF, 150KB]

(2005) Adam Lopez & Philip Resnik: Improved HMM alignment models for languages with scarce resources.  ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 83-86. [PDF, 103KB]

(2005) Joel Martin, Rada Mihalcea, & Ted Pedersen: Word alignment for languages with scarce resources. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 65-74. [PDF, 170KB]

(2005) Maja Popovic & Hermann Ney: Exploiting phrasal lexica and additional morpho-syntactic language resources for statistical machine translation with scarce training data. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 212-218. [PDF, 65KB]

Segmentation (see also Morphological analysis, Tagging)

(2010)  Marta R.Costa-jussà, Vidas Daudaravicius, & Rafael E.Banchs: Integration of statistical collocation segmentations in a phrase-based statistical machine translation system. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 975KB]

(2010) Mark Fishel & Harri Kirik: Linguistically motivated unsupervised segmentation for machine translation. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1741-1745. [PDF, 308KB]

(2010) Yiou Wang, Kiyotaka Uchimoto, Jun’ichi Kazama, Canasai Kruengkrai, & Kentaro Torisawa: Adapting Chinese word segmentation for machine translation based on short units.  LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1758-1764. [PDF, 409KB]

(2010) Deyi Xiong, Min Zhang, & Haizhou Li: Learning translation boundaries for phrase-based decoding. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.136-144. [PDF, 450KB]

(2009) Arianna Bisazza & Marcello Federico: Morphological pre-processing for Turkish to English statistical machine translation. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 129-135. [PDF, 373KB]; presentation [PDF of PPT, 302KB]

(2009) Fethi Bougares, Laurent Besacier, & Hervé Blanchon: LIG approach for IWSLT09: using multiple morphological segmenters for spoken language translation of Arabic. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 60-64. [PDF, 377KB]; presentation [PDF of PPT, 668KB]

(2009) Lars Bungum & Stephan Oepen: Automatic translation of Norwegian noun compounds. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.136-143. [PDF, 465KB]

(2009) Tagyoung Chung & Daniel Gildea: Unsupervised tokenization for machine translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.718-726. [PDF, 197KB]

(2009) Arantza Díaz de Ilarraza, Gorka Labaka, & Kepa Sarasola: Relevance of different segmentation options on Spanish-Basque SMT. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.74-80. [PDF, 390KB]

(2009) Chris Dyer: Using a maximum entropy model to build segmentation lattices for MT.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.406-414. [PDF, 173KB]

(2009) Chris Dyer, Hendra Setiawan, Yuval Marton, & Philip Resnik: The University of Maryland statistical machine translation system for the Fourth Workshop on Machine Translation.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.145-149. [PDF, 120KB]

(2009) David Farwell & Lluís Padró: FreeLing: from a multilingual open-source analyzer suite to an EBMT platform. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.37-43. [PDF, 372KB]; presentation [PDF of PPT, 331KB]

(2009) Alexander Fraser: Experiments in morphosyntactic processing for translating to and from German.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.115-119. [PDF, 75KB]

(2009) Satoshi Kamatani, Tetsuro Chino & Kazuo Sumita: Hybrid spoken language translation using sentence splitting based on syntax structure. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 57-64. [PDF, 167KB]

(2009) Maoxi Li, Jiajun Zhang, Yu Zhou, & Chengqing Zong: The CASIA statistical machine translation system for IWSLT 2009. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 83-90. [PDF, 317KB]; presentation [PDF of PPT, 463KB]

(2009) Yanjun Ma & Andy Way: Bilingually motivated domain-adapted word segmentation for statistical machine translation.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.549-557. [PDF, 325KB]

(2009) Yanjun Ma, Tsuyoshi Okita, Özlem Çetinoğlu, Jinhua Du, & Andy Way: Low-resource machine translation using MaTrEx: the DCU machine translation system for IWSLT 2009. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 29-36. [PDF, 305KB]; presentation [PDF of PPT, 466KB]

(2009) Preslav Nakov, Chang Liu, Wei Lu, & Hwee Tou Ng: The NUS statistical machine translation system for IWSLT 2009. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 91-98. [PDF, 338KB]; presentation [PDF of PPT, 787KB]

(2009) Maja Popović, David Vilar, Daniel Stein, Evgeny Matusov & Hermann Ney: The RWTH machine translation system for WMT 2009.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.66-69. [PDF, 210KB]

(2009) Mehrnoush Shamsfard, Soheila Kiani & Yaser Shahedi: STeP-1: standard text preparation for Persian language. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 659KB]

(2009) Massoud Sharifi-Atashgah & Mahmood Bijankhan: Corpus-based analysis for multi-token units in Persian. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 817KB]

(2009) Ankit Srivastava, Sergio Penkale, Declan Groves, & John Tinsley: Evaluating syntax-driven approaches to phrase extraction for MT.  Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.19-28. [PDF, 250KB]; presentation [PDF of PPT, 329KB]

(2009) Sara Stymne: A comparison of merging strategies for translation of German compounds. Proceedings of the EACL 2009 Student Research Workshop, Athens, Greece, 2 April 2009; pp.61-69. [PDF, 111KB]

(2009) Maarten van Gompel, Antal van den Bosch, & Peter Berck: Extending memory-based machine translation to phrases. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.61-68. [PDF, 207KB]

(2009) Dong Yang, Paul Dixon, Yi-Cheng Pan, Tasuku Oonishi, Masanobu Nakamura & Sadaoki Furui: Combining a two-step conditional random field and a joint source channel model for machine transliteration. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.72-75. [PDF, 110KB]

(2009) Fan Yang, Jun Zhao, & Kang Liu: A Chinese-English organization name translation system using heuristic web mining and asymmetric alignment. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.387-395. [PDF, 178KB]

(2008) Ibrahim Badr, Rabih Zbib, & James Glass: Segmentation for English-to-Arabic statistical machine translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Short papers, June 16-17, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 153-156. [PDF, 89KB]

(2008) Ming-Hong Bai, Keh-Jiann Chen, & Jason S.Chang: Improving word alignment by adjusting Chinese word segmentation. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.249-256. [PDF, 560KB]

(2008) Graeme Blackwood, Adrià de Gispert, & William Byrne: Phrasal segmentation models for statistical machine translation. Coling 2008:  22nd International Conference on Computational Linguistics, Posters and demonstrations, 18-22 August 2008, Manchester UK; pp.19-22. [PDF, 112KB]

(2008) Pi-Chuan Chang, Michel Galley, & Christopher D. Manning: Optimizing Chinese word segmentation for machine translation performance. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.224-232. [PDF, 165KB]

(2008) Steve DeNeefe, Ulf Hermjakob & Kevin Knight: Overcoming vocabulary sparsity in MT using lattices. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.89-96. [PDF, 492KB]

(2008) Nikesh Garera & David Yarowsky: Translating compounds by learning component gloss translation models via multiple languages. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.403-410. [PDF, 497KB]

(2008) Jonghoon Lee, Donghyeon Lee & Gary Geunbae Lee: A transformation-based sentence-splitting method for statistical machine translation. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; 6pp. [PDF, 195KB]

(2008) Preslav Nakov: Improving English-Spanish statistical machine translation: experiments in domain adaptation, sentence paraphrasing, tokenization, and recasing. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.147-150. [PDF, 110KB]

(2008) Stefan Riezler, Yi Liu, & Alexander Vasserman: Translating queries into snippets for improved query expansion.  Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.737-744. [PDF, 137KB]

(2008) Benjamin Snyder & Regina Barzilay: Unsupervised multilingual learning for morphological segmentation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 737-745. [PDF, 254KB]

(2008) Sara Stymne & Maria Holmqvist: Processing of Swedish compounds for phrase-based statistical machine translation. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.182-191. [PDF, 532KB]

(2008) Jia Xu, Jianfeng Gao, Kristina Toutanova, & Hermann Ney: Bayesian semi-supervised Chinese word segmentation for statistical machine translation. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.1017-1024. [PDF, 137KB]

(2008) Ruiqiang Zhang, Keiji Yasuda, & Eiichiro Sumita: Chinese word segmentation and statistical machine translation. ACM Transactions on Speech and Language Processing 5 (2), art.4. [abstract, PDF, 13KB]

(2008) Ruiqiang Zhang & Eiichiro Sumita: Chinese unknown word translation by subword re-segmentation. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.225-232. [PDF, 360KB]

(2008) Ruiqiang Zhang, Keiji Yasuda, & Eiichiro Sumita: Improved statistical machine translation by multiple Chinese word segmentation. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.216-223. [PDF, 250KB]

(2007) Yi Chang, Ying Zhang, Stephan Vogel, & Jie Yang: Enhancing image-based Arabic document translation using noisy channel correction model. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.89-95 [PDF, 157KB]

(2007) Christian Fügen & Muntsin Kolss: The influence of utterance chunking on machine translation performance. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2837-2840; abstract [PDF, 61KB]

(2007) Yanjun Ma, Nicolas Stroppa, & Andy Way: Alignment-guided chunking. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.114-121 [PDF, 357KB]; presentation [PDF, 377KB]

(2007) Alicia Pérez, Víctor Guijarrubia, Raquel Justo, M. Inés Torres, & Francisco Casacuberta: A comparison of linguistically and statistically enhanced models for speech-to-speech machine translation.  IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 8pp. [PDF, 1234KB]; presentation [PDF, 2546KB]

(2007) Sharath Rao, Ian Lane, & Tanja Schultz: Optimizing sentence segmentation for spoken language translation. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2845-2848; abstract [PDF, 23KB]

(2007) Antal van den Bosch, Nicolas Stroppa, & Andy Way: A memory-based classification approach to marker-based EBMT.  METIS-II Workshop: New Approaches to Machine Translation, Centre for Computational Linguistics, Katholieke Universiteit Leuven, Belgium, 11 January 2007; 10pp. [PDF, 251KB]

(2007) Yuqi Zhang, Richard Zens, & Hermann Ney: Chunk-level reordering of source language sentences with automatically learned rules for statistical machine translation. SSST, NAACL-HLT-2007 AMTA Workshop on Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.1-8 [PDF, 165KB]

(2006) Jianfeng Gao, Mu Li, Andi Wu, & Chang-Ning Huang: Chinese word segmentation and named entity recognition: a pragmatic approach. Computational Linguistics 31 (4), pp. 531-574. [PDF, 1638KB]

(2006) Adria de Gispert & José B.Mariño: Linguistic tuple segmentation in n-gram-based statistical machine translation. Interspeech 2006: ICSLP Ninth International Conference on  Spoken Language Processing, Pittsburgh, PA, USA, September 17-21, 2006, paper 1049; abstract [PDF, 90KB]

(2006) Howard Johnson, Fatiha Sadat, George Foster, Roland Kuhn, Michel Simard, Eric Joanis, & Samuel Larkin: PORTAGE, with smoothed phrase tables and segment choice models.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 134-137 [PDF, 118KB]

(2006) Roland Kuhn, Denis Yuen, Michel Simard, Patrick Paul, George Foster, Eric Joanis, & Howard Johnson: Segment choice models: feature-rich models for global distortion in statistical machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 25-32 [PDF, 95KB]

(2006) Evgeny Matusov, Arne Mauser, & Hermann Ney: Automatic sentence segmentation and punctuation prediction for spoken language translation. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2006], November 27-28, 2006, Kyoto, Japan; pp. 158-165 [PDF, 117KB]

(2006) Bart Mellebeek, Karolina Owczarzak, Josef Van Genabith & Andy Way: Multi-engine machine translation by recursive sentence decomposition. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.110-118 [PDF, 145KB]

(2006) Jason Riesa & David Yarowsky: Minimally supervised morphological segmentation with applications to machine translation. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.185-192 [PDF, 613KB]

(2006) Jia Xu, Richard Zens, & Hermann Ney: Partitioning parallel documents using binary segmentation. HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 78-85 [PDF, 433KB]

(2005) Etienne Denoual & Yves Lepage: BLEU in characters: towards automatic MT evaluation in languages without word delimiters. IJCNLP-05: Second International Joint Conference on Natural Language Processing, 11-13 October 2005, Jeju Island, Republic of Korea; pp.79-84. [PDF, 495KB]

(2005) John Henderson & William Morgan: Gaming fluency: evaluating the bounds and expectations of segment-based translation memory. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 175-182. [PDF, 194KB]

(2005) Krisda Khankasikam & Nuttanart Muansuwan: Thai word segmentation a lexical semantic approach. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.331-338. [PDF, 155KB]

(2005) Evgeny Matusov, Gregor Leusch, Oliver Bender, & Hermann Ney: Evaluating machine translation output with automatic sentence segmentation. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2005], 24-25 October, 2005, Pittsburgh, PA, USA; 7pp. [PDF, 113KB]

(2005) Chris Quirk, Arul Menezes, & Colin Cherry: Dependency treelet translation: syntactically informed phrasal SMT.  ACL-2005: 43rd Annual meeting of the Association for Computational Linguistics, University of Michigan, Ann Arbor, 25-30 June 2005; pp. 271-279. [PDF, 118KB]

(2005) Jia Xu, Evgeny Matusov, Richard Zens, & Hermann Ney: Integrated Chinese word segmentation in statistical machine translation. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2005], 24-25 October, 2005, Pittsburgh, PA, USA; 7pp. [PDF, 105KB]

(2005) Jia Xu, Richard Zens, & Hermann Ney: Sentence segmentation using IBM word alignment model 1. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 280-287. [PDF, 145KB]

(2005) Ying Zhang & Stephan Vogel: Competitive grouping in integrated phrase segmentation and alignment model.  ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 159-162. [PDF, 437KB]

Semantic analysis and representation (see also Disambiguation, Latent semantic analysis, Treebanks)

(2010) Benoît Gaillard, Malek Boualem, & Olivier Collin: Query translation using Wikipedia-based resources for analysis and disambiguation. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 579KB]

(2010) Chi-kiu Lo & Dekai Wu: Evaluating machine translation utility via semantic role labels. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2873-2877. [PDF, 417KB]

(2010) Yashar Mehdad, Matteo Negri, & Marcello Federico: Towards cross-lingual textual entailment.  NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.321-324. [PDF, 230KB]

(2010) Taiji Nagasaka, Ran Shimanouchi, Akiko Sakamoto, Takafumi Suzuki, Yohei Morishita, Takehito Utsuro, & Suguru Matsuyoshi: Utilizing semantic equivalence classes of Japanese functional expressions in translation rule acquisition from parallel patent sentences. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1777-1785. [PDF, 942KB]

(2010) Ineke Schuurman & Vincent Vandeghinste: Cultural aspects of spatiotemporal analysis in multilingual applications. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3308-3313. [PDF, 318KB]

(2010) Lonneke van der Plas, Tanja Samardžić, & Paola Merlo: Cross-lingual validity of PropBank in the manual annotation of French. [ACL-2010] Proceedings of the Fourth Linguistic Annotation Workshop, Uppsala, Sweden, 15-16 July 2010; pp.113-117. [PDF, 102KB]

(2010) Billy Tak-Ming Wong: Semantic evaluation of machine translation.  LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2884-2888. [PDF, 574KB]

(2009) Marianna Apidianaki: Data-driven semantic analysis for multilingual WSD and lexical selection in translation.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.77-85. [PDF, 364KB]

(2009) Anders Björkelund, Love Hafdell, & Pierre Nugues: Multilingual semantic role labeling.  CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.43-48. [PDF, 313KB]

(2009) Xavier Blanco: Remarks about linguistic analysis, normalization and translation of Spanish “What to do in case of fire” texts. ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.43-48 [abstract]

(2009) Marine Carpuat: One translation per discourse. NAACL-HLT-2009 (SEW-2009): Semantic Evaluations: Recent Achievements and Future Directions, Proceedings of the workshop, June 4, 2009, Boulder, Colorado; pp.19-27. [PDF, 112KB]

(2009) Pi-Chuan Chang, Huihsin Tseng, Dan Jurafsky, & Christopher D.Manning: Discriminative reordering with Chinese grammatical features. Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.51-59. [PDF, 558KB]

(2009) Wanxiang Che, Zhenghua Li, Yongqiang Li, Yuhang Guo, Bing Qin, & Ting Liu: Multilingual dependency-based syntactic and semantic parsing. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.49-54. [PDF, 633KB]

(2009) Dmitry Davidov & Ari Rappoport: Enhancements of lexical concepts using cross-lingual web mining. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.852-861. [PDF, 162KB]

(2009) Dmitry Davidov & Ari Rappoport: Translation and extension of concepts across languages. EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.175-183. [PDF, 141KB]

(2009) Beate Dorow, Florian Laws, Lukas Michelbacher, Christian Scheible, & Jason Utt: A graph-theoretic algorithm for automatic extension of translation lexicons. Proceedings of the EACL 2009 Workshop on GEMS: Geometrical Models of Natural Language Semantics, Athens, Greece, 31 March 2009; pp.91-95. [PDF, 81KB]

(2009) Andrea Gesmundo, James Henderson, Paola Merlo, & Ivan Titov: A latent variable model of synchronous syntactic-semantic parsing for multiple languages. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.37-42. [PDF, 120KB]

(2009) Jesús Giménez & Lluís Màrquez: On the robustness of syntactic and semantic features for automatic MT evaluation.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.250-258. [PDF, 171KB]

(2009) Jan Hajič: Tree-based machine translation using syntax and semantics. Translingual Europe 2009, May 13-14, Prague, Czech Republic; 23pp. [PDF of PPT, 543KB]

(2009) Beth Ann Hockey & Manny Rayner: Using paraphrases of deep semantic representations to support regression testing in spoken dialogue systems. NAACL-HLT-2009: Software Engineering, Testing and Quality Assurance for Natural Language Processing (SETQA-NLP 2009), Proceedings of the Workshop, June 5, 2009, Boulder, Colorado; pp.14-21. [PDF, 139KB]

(2009) Baoli Li, Martin Emms, Saturnino Luz, & Carl Vogel: Exploring multilingual semantic role labeling. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.73-78. [PDF, 140KB]

(2009) Pierre Mahé & Nicola Cancedda: Linguistically enriched word-sequence kernels for discriminative language modeling.  In: Cyril Goutte, Nicola Cancedda, Marc Dymetman, & George Foster (eds.) Learning machine translation. (Cambridge, Mass.: The MIT Press, 2009); pp.111-128.

(2009) Ivan Meza-Ruiz & Sebastian Riedel: Multilingual semantic role labelling with Markov logic. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.85-90. [PDF, 150KB]

(2009) Sebastian Padó, Michel Galley, Dan Jurafsky & Chris Manning: Robust machine translation evaluation with entailment features. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.297-305. [PDF, 437KB]

(2009) Sebastian Padó, Michel Galley, Dan Jurafsky, & Christopher D.Manning: Textual entailment features for machine translation evaluation.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.37-41. [PDF, 378KB]

(2009) Davide Picca, Alfio Massimiliano Gliozzo, & Simone Campora: Bridging languages by supersense entity tagging. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.136-142. [PDF, 159KB]

(2009) Ananthakrishnan Ramanathan, Hansraj Choudhary, Avishek Ghosh, & Pushpak Bhattacharyya: Case markers and morphology: addressing the crux of the fluency problem in English-Hindi SMT. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.800-808. [PDF, 221KB]

(2009) Han Ren, Donghong Ji, Jing Wan, & Mingyao Zhang: Parsing syntactic and semantic dependencies for multiple languages with a pipeline approach. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.97-102. [PDF, 228KB]

(2009) Bernard Scott & Anabela Barreiro: OpenLogos MT and the SAL representation language. Proceedings of the First International Workshop on Free/Open-Source Rule-Based Machine Translation, 2-3 November 2009, Universitat d’Alacant, Alacant, Spain; ed. Juan Antonio Pérez-Ortiz, Felipe Sánchez-Martínez, Francis M.Tyers; pp. 19-26. [PDF, 253KB]

(2009) Oscar Täckström: Multilingual semantic parsing with a pipeline of linear classifiers. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.103-108. [PDF, 103KB]

(2009) Yotaro Watanabe, Masayuki Asahara, & Yuji Matsumoto: Multilingual syntactic-semantic dependency parsing with three-stage approximate max-margin linear models. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.103-108. [PDF, 727KB]

(2009) Dekai Wu & Pascale Fung: Can semantic role labeling improve SMT? EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.218-225. [PDF, 1141KB]

(2009) Dekai Wu & Pascale Fung: Semantic roles for SMT: a hybrid two-pass model.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.13-16. [PDF, 286KB]

(2008) David Ellis, Mathias Creutz, Timo Honkela, & Mikko Kurimo: Speech to speech machine translation: Biblical chatter from Finnish to English.  IJCNLP 2008: Workshop on NLP for Less Privileged Languages. Proceedings of the workshop, 11 January 2008, Hyderabad, India; pp. 123-130. [PDF, 104KB]

(2008) Emil Ettelaie, Panayiotis G.Georgiou, & Shrikanth S.Narayanan: Towards unsupervised training of the classifier-based speech translator. Interspeech 2008: 9th Annual Conference of the International Speech Communication Association, Brisbane, Australia, September 22-26, 2008; pp.2739-2742; abstract [PDF, 58KB]

(2008) Nikesh Garera & David Yarowsky: Minimally supervised multilingual taxonomy and translation lexicon induction. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.465-472. [PDF, 356KB]

(2008) Gerhard Kremer, Andrea Abel, & Marco Baroni: Cognitively salient relations for multilingual lexicography. Coling 2008: Proceedings of the Workshop on Cognitive Aspects of the Lexicon, 24 August 2008, Manchester, UK; pp.94-101. [PDF, 84KB]

(2008) Ding Liu & Daniel Gildea: Improved tree-to-string transducer for machine translation. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.62-69. [PDF, 208KB]

(2008) Rajat Kumar Mohanty & Pushpak Bhattacharyya: Lexical resources for semantic extraction.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 5pp. [PDF, 148KB]

(2008) Rajat Mohanty, Sandeep Limaye, M.Krishna, & Pushpak Bhattacharyya: Semantic graph from English sentences. ICON-2008: 6th International Conference on Natural Language Processing, CDAC Pune, India, 20-22 December 2008; 10pp. [PDF, 173KB]

(2008) Petya Osenova, Kiril Simov, & Eelco Mossel: Language resources for semantic document annotation and crosslingual retrieval.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 465KB]

(2008) Manny Rayner, Pierrette Bouillon, Beth Ann Hockey, & Yukie Nakao: Almost flat functional semantics for speech translation.  Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.713-720. [PDF, 96KB]

(2008) Yorick Wilks: On whose shoulders? Computational Linguistics 34(4), pp. 471-486. [PDF]

(2007) Anja Belz & Sebastian Varges (eds.): MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark. Proceedings; prelims and contents  [PDF, 655KB]

(2007) Pascale Fung, Zhaojun Wu, Yongsheng Yang, & Dekai Wu: Learning bilingual semantic frames: shallow semantic parsing vs. semantic role projection. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.75-84 [PDF, 477KB]

(2007) Roxana Girju: Improving the interpretation of noun phrases with cross-linguistic information. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 568-575 [PDF, 156KB]

(2007) Haizhou Li, Khe Chai Sim, Jin-Shea Kuo, & Minghui Dong: Semantic transliteration of personal names. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 120-127 [PDF, 152KB]

(2007) Lluís Màrquez, Luis Villarejo, M.A.Martí, & Mariona Taulé: SemEval-2007 task 09: multilevel semantic annotation of Catalan and Spanish. ACL 2007: proceedings of the 4th International  Workshop on Semantic Evaluations (SemEval-2007), Prague, Czech Republic, 23-24 June 2007; pp.42-47 [PDF, 89KB]

(2007) Roser Morante & Bertjan Busser: ILK2: semantic role labelling for Catalan and Spanish using TiMBL. ACL 2007: proceedings of the 4th International  Workshop on Semantic Evaluations (SemEval-2007), Prague, Czech Republic, 23-24 June 2007; pp.183-186 [PDF, 120KB]

(2007) Marcus Sammer & Stephen Soderland: Building a sense-distinguished multilingual lexicon from monolingual corpora and bilingual lexicons. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.399-406 [PDF, 263KB]

(2007) Yuk Wah Wong & Raymond J.Mooney: Generation by inverting a semantic parser that uses statistical machine translation. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.172-179 [PDF, 192KB]

(2006) Thierry Declerck, Asunción Gómez Pérez, Ovidiu Vela, Zeno Gantner, & David Manzano-Macho: Multilingual lexical semantic resources for ontology translation. LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.1492-1495 [PDF, 248KB]

(2006) Gaolin Fang, Hao Yu, & Fumihito Nishino: Chinese-English term translation mining based on semantic prediction.  Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.199-206. [PDF, 240KB]

(2006) Eduard Hovy: A gentle introduction to ontologies.  Tutorial at AMTA 2006 conference, August 8, 2006, Cambridge, Massachusetts, USA; 60pp. [PDF of PPT presentation, 2050KB]

(2006) Eduard Hovy, Mitchell Marcus, Martha Palmer, Lance Ramshaw & Ralph Weischedel: OntoNotes: the 90% solution.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 57-60 [PDF, 44KB]

(2006) Kentaro Inui, Toru Hirano, Ryu Iida, Atsushi Fujita, & Yuji Matsumoto: Augmenting a semantic verb lexicon with large scale collection of example sentences.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.365-368 [PDF, 283KB]

(2006) Svetla Koeva, Svetlozara Lesseva, & Maria Todorova: Bulgarian sense tagged corpus.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.79-86. [PDF, 494KB]

(2006) Qing Li, Sung-Hyon Myaeng, Yun Jin, & Be-yeong Kang: Concept unification of terms in different languages for IR. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.641-648. [PDF, 234KB]

(2006) Jan Tore Lønning & Stephan Oepen: Re-usable tools for precision machine translation.  Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.53-56. [PDF, 213KB]

(2006) Irina Matveeva & Gina-Anne Levow: Computing term translation probabilities with generalized latent semantic analysis. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Posters and demonstrations, Trento, Italy, April 5-6, 2006; pp.151-154 [PDF, 90KB]

(2006) Stephan Oepen & Jan Tore Lønning: Discriminant-based MRS banking.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.1250-1255 [PDF, 430KB]

(2006) Serge Sharoff, Bogdan Babych, Paul Rayson, Olga Mudraya, & Scott Piao: ASSIST: automated semantic assistance for translators. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Posters and demonstrations, Trento, Italy, April 5-6, 2006; pp.139-142 [PDF, 69KB]

(2006) Sara Stymne & Lars Ahrenberg: A bilingual grammar for translation of English-Swedish verb frame divergences. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; pp.9-17 [PDF, 139KB]

(2006) Yuk Wah Wong & Raymond J. Mooney: Learning for semantic parsing with statistical machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 439-446 [PDF, 205KB]

(2006) Eiko Yamamoto, Kyoko Kanzaki, & Hitoshi Isahara: Detection of inconsistencies in concept classifications in a large dictionary: toward an improvement of the EDR electronic dictionary.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.2325-2330 [PDF, 569KB]

(2005) Jesús Cardeñosa, Alexander Gelbukh, & Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Research on Computing Science, 12). Mexico City: National Polytechnic Institute; 443pp. [full set of abstracts]

(2005) Mike Dillinger: Interlinguas and semantic roles. MT Summit X, Phuket, Thailand, September 12, 2005, Tutorial Note. [PDF of PPT presentation, 1945KB]

(2005) Andrew Finch, Young-Sook Hwang, & Eiichiro Sumita: Using machine translation evaluation techniques to determine sentence-level semantic equivalence. IJCNLP-05: Third International Workshop on Paraphrasing (IWP 2005). Proceedings of the workshop, 12 October 2005, Jeju Island, Korea; pp. 17-24. [PDF, 111KB]

(2005) Dan Flickinger, Jan Tore Lønning, Helge Dyvik, Stephan Oepen, & Francis Bond: SEM-I rational MT: enriching deep grammars with a semantic interface for scalable machine translation. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.165-172. [PDF, 249KB]

(2005) Alfio Gliozzo & Carlo Strapparava: Cross language text categorization by acquiring multilingual domain models from comparable corpora. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 9-16. [PDF, 259KB]

(2005) Toshiyuki Kanamaru, Masaki Murata, Kow Kuroda, & Hitoshi Isahara: Obtaining Japanese lexical units for semantic frames from Berkeley FrameNet using a bilingual corpus. IJCNLP-05: Sixth International Workshop on Linguistically Interpreted Corpora (LINC 2005). Proceedings of the workshop, 15 October 2005, Jeju Island, Korea; pp. 11-20. [PDF, 106KB]

(2005) Rajat Kumar Mohanty, Anupama Dutta, & Pushpak Bhattacharyya: Semantically relatable sets: building blocks for representing semantics. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.101-108. [PDF, 157KB]

(2005) Shigeko Nariyama, Eric Nichols, Francis Bond, Takaaki Tanaka, & Hiromi Nakaiwa: Extracting representative arguments from dictionaries for resolving zero pronouns. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.3-10. [PDF, 665KB]

(2005) Einat H. Nir & Geoffrey L. Melnick: Constant-sense connection paths. MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Workshop on Patent Translation; pp.21-28. [PDF, 147KB]

(2005) Diganta Saha & Sivaji Bandyopadhyay: