Machine Translation Archive

Index of Languages

Languages A to D: publications since 2000

[For languages E click here; for languages F to L click here, for languages M to Z click here]

[For publications before 2000 click here]

 

African languages

(2009) Anna Borovikov, Eugene Borovikov, Bradley Colquitt, & Kristen Summers: The EDEAL project for automated processing of African languages.  MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 546-549. [PDF, 73KB]

(2009) Hendrik J.Groenewald & Wildrich Fourie: Introducing the Autshumato integrated translation environment. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.190-196. [PDF , 410KB]

(2008) Manal Amin: Pan African localization network. [PDF, 35KB]

 

Afrikaans to Dutch

(2011) Pim Otte & Francis M.Tyers: Rapid rule-based machine translation between Dutch and Afrikaans. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.153-160. [PDF, 317KB]

Afrikaans to English

(2005) Herman A. Engelbrecht & Tanja Schultz: Rapid development of an Afrikaans English speech-to-speech translator. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2005], 24-25 October, 2005, Pittsburgh, PA, USA; 8pp. [PDF, 171KB]

Ainu to Japanese

(2005) Hiroshi Echizen-ya, Kenji Araki, & Yoshio Momouchi: Automatic acquisition of bilingual rules for extraction of bilingual word pairs from parallel corpora. ACL-SIGLEX-2005: Workshop on Deep Lexical Acquisition, University of Michigan, Ann Arbor, 30 June 2005; pp. 87-96.  [PDF, 726KB]

Amazigh

(2010) Fadoua Ataa Allah & Siham Boulaknadel: Light morphology processing for Amazighe language. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, Valletta, Malta, 17 May 2010; pp. 32-35. [PDF, 278KB]

(2010) Mohamed Outahajala, Lahbib Zekouar, Paolo Rosso, & M.Antònia Martí: Tagging Amazigh with AnCoraPipe. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, Valletta, Malta, 17 May 2010; pp. 52-56. [PDF, 326KB]

American Sign Language

(2007) Matt Huenerfauth, Liming Zhao, Erdan Gu, & Jan Allbeck: Design and evaluation of an American Sign Language generator. ACL 2007: Proceedings of the Workshop on Embodied Language Processing, June 29, 2007, Prague, Czech Republic; pp.51-58. [PDF, 204KB]

(2005) Matt Huenerfauth: American Sign Language generation: multimodal NLG with multiple linguistic channels. ACL-2005: Student Research Workshop, University of Michigan, Ann Arbor, June 2005; pp. 37-42. [PDF, 91KB]

(2004) Matt Huenerfauth: Spatial and planning models of ASL classifier predicates for machine translation; TMI-2004: proceedings of the Tenth Conference on Theoretical and Methodological Issues in Machine Translation, October 4-6, 2004, Baltimore, Maryland, USA; pp.65-74. [PDF, 190KB]

(2004) Matt Huenerfauth: A multi-path architecture for machine translation of English texts into American Sign Language animation. HLT-NAACL 2004: Human Language Technology conference and North American Chapter of the Association for Computational Linguistics annual meeting, May 2-7, 2004, The Park Plaza Hotel, Boston, USA – Proceedings of the Student Research Workshop. 6pp. [PDF, 272KB]

(2004) Emiko Suzuki, Mariko Horikoshi, & Kyoko Kakihana: Bilingual sign language dictionary for learning a second sign language without learning the target spoken language. Coling 2004: Proceedings of the Workshop on Multilingual Linguistic Resources (MLR2004), August 28th 2004, University of Geneva, Switzerland; pp.93-96. [PDF, 1015KB]

(2000) Liwei Zhao, Karin Kipper, William Schuler, Christian Vogler, Norman Badler, & Martha Palmer: A machine translation system from English to American Sign Language. Envisioning machine translation in the information future: 4th conference of the Association for Machine Translation in the Americas, AMTA 2000, Cuernavaca,Mexico, October 2000; ed. John S. White (Berlin: Springer Verlag, 2000); pp.54-67. [go to publisher details]

Amharic

(2010) Michael Gasser: A dependency grammar for Amharic. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, 17 May 2010, Valletta, Malta; 9pp. [PDF, 179KB]

(2006) Saba Amsalu & Sisay Fissaha Adafre: Machine translation for Amharic: where are we.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.47-50. [PDF, 88KB]

(2004) Sisay Fissaha Adafre: Formal analysis of some aspects of Amharic noun phrases 9th EAMT Workshop, "Broadening horizons of machine translation and its applications", 26-27 April 2004, Malta; pp. 43-52. [PDF, 198KB]

(2003) Sisay Fissaha & Johann Haller: Application of corpus-based techniques to Amharic texts. MT Summit IX -- workshop: Machine translation for semitic languages, New Orleans, USA, 23 September 2003 [PDF, 430KB]

Amharic to English

(2011) Michael Gasser: Towards synchronous extensible dependency grammar. Proceedings of the Second International Workshop on Free/Open-Source Rule-Based Machine Translation, Barcelona, Spain, January 20-21, 2011, ed. F.Sánchez-Martínez and J.A.Pérez-Ortiz; pp.3-10. [PDF, 265KB]

(2008) Atelach Alemu Argaw: Word sense discrimination using context vector similarity. SLTC 2008: Second Swedish Language Technology Conference, November 20-21, 2008, Stockholm; pp.65-66. [PDF, 322KB]

(2006) Ahmed Abdelali, James Cowie, Steve Helmreich, Wanying Jin, Maria Pilar Milagros, Bill Ogden, Hamid Mansouri Rad & Ron Zacharski: Guarani: a case study in resource development for quick ramp-up MT. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.1- 9 [PDF, 344KB]

(2006) Saba Amsalu: Data-driven Amharic-English bilingual lexicon acquisition . LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.281-286 [PDF, 366KB]

Arabic (see also Iraqi Arabic)

(2011) Alex Rudnick: A resource-light approach to learning verb valencies. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 26 January, 2011; 2pp. [PDF, 54KB]; presentation: 31 slides [PDF of PPT, 697KB]

(2010) Mohamed Altantawy, Nizar Habash, Owen Rambow, & Ibrahim Saleh: Morphological analysis and generation of Arabic nouns: a morphemic functional approach. LREC 2010: proceedings of the seventh international conference on Language Resources and Evaluation, organised by ELRA, 17-23 May 2010, Valletta, Malta; pp.851-858. [PDF,                899KB]

(2010) Yassine Benajiba & Mona Diab: A web application for dialectal Arabic text annotation. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, Valletta, Malta, 17 May 2010; pp. 91-98. [PDF, 747KB]

(2010) Lynne Cahill: A syllable-based approach to verbal morphology in Arabic. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, Valletta, Malta, 17 May 2010; pp. 19-26. [PDF, 747KB]

(2010) Mona Diab, Nizar Habash, Owen Rambow, Mohamed Altantawy, & Yassine Benajiba: COLABA: Arabic dialect annotation and processing. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, Valletta, Malta, 17 May 2010; pp. 66-74. [PDF, 747KB]

(2010) Kais Dukes & Nizar Habash: Morphological annotation of Quranic Arabic. LREC 2010: proceedings of the seventh international conference on Language Resources and Evaluation, organised by ELRA, 17-23 May 2010, Valletta, Malta; pp.2530-2536. [PDF, 995KB]

(2010) Ahmed El Kholy & Nizar Habash: Techniques for Arabic morphological detokenization and orthographic denormalization. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, Valletta, Malta, 17 May 2010; pp. 45-51. [PDF, 354KB]

(2010) Spence Green & Christopher D.Manning: Better Arabic parsing: baselines, evaluations, and analysis. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.394-402. [PDF, 148KB]

(2010) Nizar Y.Habash: Introduction to Arabic natural language processing. (Synthesis Lectures on Human Language Technologies.) San Rafael, CA: Morgan & Claypool Publishers, 2010. 187pp. [abstract]

(2010) Saab Mansour: MorphTagger: HMM-based Arabic segmentation for statistical machine translation. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.321-327. [PDF, 369KB]

(2010) Ines Turki Khemakhem, Salma Jamoussi, & Abdelmajid Ben Hamadou: Arabic morpho-syntactic feature disambiguation in a translation context. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28  August 2010; pp.61-65. [PDF, 153KB]

(2010) Yuval Marton, Nizar Habash, & Owen Rambow: Improving Arabic dependency parsing with lexical and inflectional morphological features. Proceedings of the NAACL HLT 2010 First Workshop on Statistical Parsing of Morphologically-Rich Languages, pages 13–21, Los Angeles, CA, USA, June 2010; pp.13-21. [PDF, 187KB]

(2010) Hassan Sawaf: Arabic dialect handling in hybrid machine translation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 8pp. [PDF, 146KB]

(2009) Ahmed Abdelali, Steve Helmreich, & Ron Zacharski: Investigations on standard Arabic geographical classification.  CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 193KB]

(2009) Younès Bahou, Lamia Hadrich Belguith, & Abdelmajid Ben Hamadou: Disfluency and out-of-vocabulary word processing in Arabic speech understanding. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 825KB]

(2009) Romaric Besançon, Djamel Mostefa, Ismaïl Timimi, Stéphane Chaudiron, Mariama Laïb, & Khalid Choukri: Arabic, English and French: three languages in a filtering systems evaluation project. MEDAR 2009: 2nd International Conference on Arabic Language Resources & Tools, 22-23 April 2009, Cairo, Egypt; pp.163-167. [PDF, 465KB]

(2009) Khalid Choukri: MEDAR – Mediterranean Arabic language and speech technology: an intermediate report on the MEDAR survey of actors, projects, products. MEDAR 2009: 2nd International Conference on Arabic Language Resources & Tools, 22-23 April 2009, Cairo, Egypt; pp.186-192. [PDF, 445KB]

(2009) Selçuk Köprü & Jude Miller: A unification based approach to the morphological analysis and generation of Arabic. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 615KB]

(2009) Bente Maegaard, M.Attia, K.Choukri, S.Krauwer, C.Mokbel, & M.Yaseen: MEDAR: Arabic language technology, state-of-the-art and a cooperation roadmap. MEDAR 2009: 2nd International Conference on Arabic Language Resources & Tools, 22-23 April 2009, Cairo, Egypt; pp.168-174. [PDF, 374KB]

(2009) Djamel Mostefa, Mariama Laïb, Stéphane Chaudiron, Khalid Choukri, & Gaël de Chalendar: A multilingual named entity corpus for Arabic, English and French.  MEDAR 2009: 2nd International Conference on Arabic Language Resources & Tools, 22-23 April 2009, Cairo, Egypt; pp.213-216. [PDF, 571KB]

(2009) Khaled Shaalan, Azza Abdel Monem & Ahmed Rafea: Syntactic generation of Arabic in interlingua-based machine translation framework. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 574KB]

 (2008) Manal Amin: Pan African localization network. Translating and the Computer 30, 27-28 November 2008, London; 6pp. [PDF, 35KB]

(2008) Bente Maegaard, M.Atiyya, K.Choukri, S.Krauwer, C.Mokbel, & M.Yaseen: MEDAR – collaboration between European and Mediterranean Arabic partners to support the development of language technology for Arabic. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; pp. 3609-3614. [PDF, 235KB]

(2008) Azza Abdel Monem, Khaled Shaalan, Ahmed Rafea, & Hoda Baraka: Generating Arabic text in multilingual speech-to-speech machine translation framework [abstract]. Machine Translation 22 (4), December 2008; pp.205-258.

(2008) Doaa Samy & Ana González-Ledesma: Pragmatic annotation of discourse markers in a multilingual parallel corpus (Arabic-Spanish-English).  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 301KB]

(2008) Tim Schlippe, ThuyLinh Nguyen, & Stephan Vogel: Diacritization as a machine translation problem and as a sequence labeling problem. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.270-278 [PDF, 701KB]

(2008) Benjamin Snyder & Regina Barzilay: Unsupervised multilingual learning for morphological segmentation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 737-745. [PDF, 254KB]

(2007) Pierrette Bouillon, Sonia Halimi, Manny Rayner, & Beth Ann Hockey: Adapting a medical speech to speech translation system (MedSLT) to Arabic. ACL 2007: proceedings of the Workshop on Computational Approaches to Semitic Languages: Common Issues and Resources, Prague, Czech Republic, 28 June 2007; pp. 41-48 [PDF, 145KB]

(2007) Mathias Creutz, Teemu Hirsimäki, Mikko Kurimo, Antti Puurula, Janne Pylkkönen, Vesa Siivola, Matti Varjokallio, Ebru Arısoy, Murat Saraçlar, & Andreas Stolcke: Analysis of morph-based speech recognition and the modeling of out-of-vocabulary words across languages. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.380-387 [PDF, 136KB]

(2007) Mona Diab, Mahmoud Ghoneim, & Nizar Habash: Arabic diacritization in the context of statistical machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.143-149 [PDF, 105KB]

(2007) Mona Diab & Nizar Habash: Arabic dialect processing tutorial. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.5-6 [PDF, 122KB]

(2007) Jakob Elming & Nizar Habash: Combination of statistical word alignments based on multiple preprocessing schemes. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; Companion volume, pp.25-28 [PDF, 123KB]

(2007) Nizar Habash & Owen Rambow: Arabic diacritization through full morphological tagging. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; Companion volume, pp.53-56 [PDF, 99KB]

(2007) Einat Minkov, Kristina Toutanova, & Hisami Suzuki: Generating complex morphology for machine translation. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 128-135 [PDF, 219KB]

(2007) Khaled Shaalan & Hafsa Raza: Person name entity recognition for Arabic. ACL 2007: proceedings of the Workshop on Computational Approaches to Semitic Languages: Common Issues and Resources, Prague, Czech Republic, 28 June 2007; pp. 17-24 [PDF, 591KB]

(2006) proceedings of conference The Challenge of Arabic for NLP/MT. British Computer Society, London, 23 October 2006.

(2006) Sherri Condon & Keith J. Miller: What’s in a name: current methods, applications, and evaluation in multilingual name search and matching. [Abstract for tutorial, in] HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 299-300 [PDF, 20KB]

(2006) Simon Corston-Oliver, Anthony Aue, Kevin Duh & Eric Ringger: Multilingual dependency parsing using Bayes point machines.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 160-167 [PDF, 178KB]

(2006) Mona Diab & Nizar Habash: Arabic dialect processing.  Tutorial at AMTA 2006 conference, August 8, 2006, Cambridge, Massachusetts, USA; 61pp. [PDF of PPT presentation, 1514KB]

(2006) Nizar Habash & Owen Rambow: MAGEAD: a morphological analyzer and generator for the Arabic dialects. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.681-688. [PDF, 112KB]

(2006) Nizar Habash & Fatiha Sadat: Arabic preprocessing schemes for statistical machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 49-52 [PDF, 72KB]

(2006) Rebecca Hwa, Carol Nichols & Khalil Sima’an: Corpus variations for translation lexicon induction. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.74-81 [PDF, 178KB]

(2006) Dragos Stefan Munteanu & Daniel Marcu: Improving machine translation performance by exploiting non-parallel corpora. Computational Linguistics 31 (4), pp. 477-504 [PDF, 1060KB]

(2006) Jason Riesa & David Yarowsky: Minimally supervised morphological segmentation with applications to machine translation. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.185-192 [PDF, 613KB]

(2006) Doaa Samy, Antonio Moreno Sandoval, Jose M. Guirao, & Enrique Alfonseca: Building a parallel multilingual corpus (Arabic-Spanish-English).  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.2176-2181 [PDF, 473KB]

(2005) Daoud Maher Daoud: Arabic generation in the framework of the Universal Networking Language.  In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.195-209 [abstract, PDF, 58KB]

(2005) Gregory Grefenstette, Nasredine Semmar, & Faïza Elkateb-Gara: Modifying a natural language processing system for European languages to treat Arabic in information processing and information retrieval applications.  ACL-2005: Workshop on Computational Approaches to Semitic Languages, University of Michigan, Ann Arbor, 29 June 2005; pp. 31-38. [PDF,

(2005) Nizar Habash & Owen Rambow: Arabic tokenization, part-of-speech tagging and morphological disambiguation in one fell swoop.  ACL-2005: 43rd Annual meeting of the Association for Computational Linguistics, University of Michigan, Ann Arbor, 25-30 June 2005; pp. 573-580. [PDF, 107KB]

(2005) Nizar Habash, Owen Rambow & George Kiraz: Morphological analysis and generation for Arabic dialects.  ACL-2005: Workshop on Computational Approaches to Semitic Languages, University of Michigan, Ann Arbor, 29 June 2005; pp. 17-24. [PDF, 121KB]

(2004) Martin Kay: Arabic script-based languages deserve to be studied linguistically. Coling 2004: Workshop on Computational Approaches to Arabic Script-based Languages, University of Geneva, Geneva, Switzerland, August 28, 2004; 1pp. [PDF, 13KB]

(2004) Bente Maegaard: NEMLAR – an Arabic language resources project. LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.109-112. [PDF, 279KB]

(2004) Bente Maegaard: The NEMLAR project on Arabic language resources 9th EAMT Workshop, "Broadening horizons of machine translation and its applications", 26-27 April 2004, Malta; pp.124-128. [PDF, 123KB]

(2004) Mark Pedersen, Domenyk Eades, Samir K.Amin & Lakshmi Prakash: Relative clauses in Hindi and Arabic: a Paninian dependency grammar analysis. Coling’04 workshop: Proceedings Recent Advances in Dependency Grammar, August 28, 2004, Geneva, Switzerland; 8pp. [PDF, 115KB]

(2003) Joseph Dichy & Ali Farghaly: Roots & patterns vs. stems plus grammar-lexis specifications: on what basis should a multilingual database centred on Arabic be built? MT Summit IX -- workshop: Machine translation for semitic languages, New Orleans, USA, 23 September 2003 [PDF, 258KB]

(2003) Ali Farghaly & Jean Senellart: Intuitive coding of the Arabic lexicon. MT Summit IX -- workshop: Machine translation for semitic languages, New Orleans, USA, 23 September 2003 [PDF, 168KB]

(2003) Alex Fraser: Issues in Arabic MT MT Summit IX -- workshop: Machine translation for semitic languages, New Orleans, USA, 23 September 2003 [PDF of PPT presentation, 62KB]

(2003) Bassam Haddad & Mustafa Yaseen: Towards semantic composition of Arabic: a λ-DRT based approach. MT Summit IX -- workshop: Machine translation for semitic languages, New Orleans,USA, 23 September 2003 [PDF, 162KB]

(2003) Eman Othman, Khaled Shaalan & Ahmed Rafea: A chart parser for analyzing modern standard Arabic sentence. MT Summit IX -- workshop: Machine translation for semitic languages, New Orleans, USA, 23 September 2003 [PDF, 186KB]

(2002) Yaser Al-Onaizan & Kevin Knight: Machine transliteration of names in Arabic text; ACL-2002 workshop "Computational approaches to Semitic languages", July 2002, Philadelphia, USA; 13pp. [PDF, 245KB]

(2002) Douglas W.Oard, Frederic C. Gey, & Bonnie J. Dorr: Evaluating Arabic retrieval from English or French queries: the TREC-2001 cross-language information retrieval track. LREC-2002: Third International Conference on Language Resources and Evaluation. Workshop Arabic language resources and evaluation: status and prospects, Las Palmas de Gran Canaria, Spain, 1 June 2002; 6pp. [PDF, 183KB]

Arabic to Chinese

(2009) Nizar Habash & Jun Hu: Improving Arabic-Chinese statistical machine translation using English as pivot language. Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.173-181. [PDF, 265KB]

Arabic to English

(2011) Hala Almaghout, Jie Jiang, & Andy Way: CCG contextual labels in hierarchical phrase-based SMT. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.281-288. [PDF, 317KB]; presentation, 25 slides [PDF, 964KB]

 (2011) JacobAndreas, Nizar Habash, & Owen Rambow: Fuzzy syntactic reordering for phrase-based statistical machine translation. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.227-236. [PDF, 221KB]

(2011) Nguyen Bach, Fei Huang, & Yaser Al-Onaizan: Goodness: a method for measuring machine translation confidence. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.211-219. [PDF, 976KB]

(2011) Kfir Bar & Nachum Dershowitz: Using verb paraphrases for Arabic-to-English example-based translation. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 23 January, 2011; 3pp. [PDF, 94KB];  presentation: 27 slides [PDF of PPT, 1356KB]

(2011) Luisa Bentivogli, Marcello Federico, Giovanni Moretti, & Michael Paul: Getting expert quality from the crowd for machine translation evaluation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.521-528. [PDF, 337KB]

(2011) Mauro Cettolo, Nicola Bertoldi, & Marcello Federico: Methods for smoothing the optimizer instability in SMT. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.32-39. [PDF, 150KB]

(2011) Marcello Federico, Arianna Bisazza & Christian Hardmeier: Morphological processing and word reordering for statistical MT of highly inflected languages [abstract]. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 24 January, 2011; presentation: 37 slides [PDF of PPT, 835KB]

(2011) Sanjika Hewavitharana & Stephan Vogel: Extracting parallel phrases from comparable data. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.61-68. [PDF, 217KB]

(2011) Matthias Huck, David Vilar, Daniel Stein, & Hermann Ney: Advancements in Arabic-to-English hierarchical machine translation. [EAMT  2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.273-280. [PDF, 396KB]; presentation, 23 slides [PDF]

(2011) Alon Lavie & Hassan Al-Haj: The impact of Arabic morphological segmentation on broad-scale phrase-based SMT [abstract]. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 26 January, 2011; presentation: 38 slides [PDF of PPT, 362KB]

(2011) Christof Monz: Statistical machine translation with local language models. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.869-879. [PDF, 407KB]

(2011) Jan Niehues, Teresa Herrmann, Stephan Vogel, & Alex Waibel: Wider context by using bilingual langauge models in machine translation. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.198-206. [PDF, 570KB]

(2011) Jason Riesa, Ann Irvine, & Daniel Marcu: Feature-rich language-independent syntax-based alignment for statistical machine translation. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.497-507. [PDF, 1053KB]

(2011) Hassan Sajjad, Alexander Fraser, & Helmut Schmid: An algorithm for unsupervised transliteration mining with an application to word alignment. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.430-439. [PDF, 171KB]

(2011) Lucia Specia, Najeh Hajlaoui, Catalina Hallett, & Wilker Aziz: Predicting machine translation adequacy. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.513-520. [PDF, 193KB]

(2011) Nadi Tomeh, Alexandre Allauzen, & François Yvon: Discriminative weighted alignment matrices for statistical machine translation. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.305-312. [PDF, 413KB]; presentation, 38 slides [PDF]

(2011) Bing Xiang & Abraham Ittycheriah: Discriminative feature-tied mixture modeling for statistical machine translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.424-428. [PDF, 85KB]

(2010) Jinxi Xu & Antti-Veikko I.Rosti: Combining unsupervised and supervised alignments for MT: an empirical study.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.667-673. [PDF, 74KB]

(2011) Bing Zhao, Young-Suk Lee, Xiaoqiang Luo, & Liu Li: Learning to transform and select elementary trees for improved syntax-based machine translations. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.846-855. [PDF, 711KB]

(2010) Hala Almaghout, Jie Jiang, & Andy Way: The DCU machine translation systems for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.37-44. [PDF, 621KB]

(2010) Kfir Bar & Nachum Dershowitz: Tel Aviv University’s system description for IWSLT 2010.  Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.169-174. [PDF, 399KB]

(2010) Kfir Bar & Nachum Dershowitz: Using synonyms for Arabic-to-English example-based translation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 83KB]

(2010) Yassine Benajiba, Imed Zitouni, Mona Diab, & Paolo Rosso: Arabic named entity recognition: using features extracted from noisy data. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.281-285. [PDF, 122KB]

(2010) Laurent Besacier, Haitem Afli, Do Thi Ngoc Diep, Hervé Blanchon, & Marion Potet: LIG statistical machine translation systems for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.99-104. [PDF, 417KB]

 (2010) Arianna Bisazza & Marcello Federico: Chunk-based verb reordering in VSO sentences for Arabic-English statistical machine translation.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 235-243. [PDF, 372KB]

(2010) Arianna Bisazza, Ioannis Klasinas, Mauro Cettolo, & Marcello Federico: FBK @ IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.53-58. [PDF, 346KB]

(2010) Marine Carpuat, Yuval Marton, & Nizar Habash: Improving Arabic-to-English statistical machine translation by reordering post-verbal subjects for alignment. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.178-183. [PDF, 108KB]

(2010) Marine Carpuat, Yuval Marton, & Nizar Habash: Reordering matrix post-verbal subjects for Arabic-to-English SMT. TALN 2010. Proceedings of Traitement Automatique du Langage Naturel, 19-23 juillet 2010. Montréal, Canada. 10pp. [PDF, 287KB] 

(2010) Simon Carter & Christof Monz: Discriminative syntactic reranking for statistical machine translation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 143KB]

(2010) David Chiang: Learning to translate with source and target syntax. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.1443-1452. [PDF, 357KB]

(2010) Marta R.Costa-jussà & José R.Fonollosa: Using linear interpolation and weighted reordering hypotheses in the Moses system. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1712-1718. [PDF, 389KB]

(2010) John DeNero, Shankar Kumar, Ciprian Chelba, & Franz Och: Model combination for machine translation. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.975-983. [PDF, 573KB]

(2010) Chris Dyer & Philip Resnik: Context-free reordering, finite-state translation. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.858-866. [PDF, 272KB]

 (2010) Cristina España-Bonet & Lluís Màrquez: Robust estimation of feature weights in statistical machine translation. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 618KB]

(2010) Qin Gao, Francisco Guzman, & Stephan Vogel: EMDC: a semi-supervised approach for word alignment. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.349-357. [PDF, 231KB]

(2010) Qin Gao, Nguyen Bach, & Stephan Vogel: A semi-supervised word alignment algorithm with partial manual alignments. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 1-10. [PDF, 278KB]

(2010) Julien Gosme, Wigdan Mekki, Fathi Debili, Yves Lepage, & Nadine Lucas: The GREYC/LLACAN machine translation systems for the IWSLT 2010 campaign. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.59-65. [PDF, 307KB]

(2010) Spence Green, Michel Galley, & Christopher D.Manning: Improved models of distortion cost for statistical machine translation.  NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.867-875. [PDF, 223KB]

(2010) Stephen Grimes, Xuansong Li, Ann Bies, Seth Kulick, Xiaoyi Ma, & Stephanie Strassel: Creating Arabic-English parallel word-aligned treebank corpora at LDC. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, Valletta, Malta, 17 May 2010; pp.102-107. [PDF, 747KB]

(2010) Kenneth Heafield & Alon Lavie: Combining machine translation output with open source: the Carnegie Mellon multi-engine machine translation scheme. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.27-36. [PDF, 133KB]

(2010) Kenneth Heafield & Alon Lavie: Voting on n-grams for machine translation system combination. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 9pp. [PDF, 128KB]

(2010) Matthias Huck, Martin Ratajczak, Patrick Lehnen, & Hermann Ney: A comparison of various types of extended lexicon models for statistical machine translation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 8pp. [PDF, 130KB]

 (2010) Philipp Koehn: Enabling monolingual translators: post-editing vs.options. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.537-545. [PDF,

(2010) A.Kumaran, Mitesh M.Khapra, & Haizhou Li: Report of NEWS 2010 transliteration mining shared task. NEWS 2010: Proceedings of the 2010 Named Entities Workshop, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.21-28. [PDF, 598KB]

(2010) A.Kumaran, Mitesh M.Khapra, & Haizhou Li: Whitepaper of NEWS 2010 shared task on transliteration mining. NEWS 2010: Proceedings of the 2010 Named Entities Workshop, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.29-38. [PDF, 677KB]

(2010) Haizhou Li, A.Kumaran, Min Zhang, & Vladimir Pervouchine: Report of NEWS 2010 transliteration generation shared task. NEWS 2010: Proceedings of the 2010 Named Entities Workshop, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.1-11. [PDF, 411KB]

(2010) Haizhou Li, A.Kumaran, Min Zhang, & Vladimir Pervouchine: Whitepaper of NEWS 2010 shared task on transliteration generation. NEWS 2010: Proceedings of the 2010 Named Entities Workshop, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.12-20. [PDF,  182KB]

(2010) Frank Liberato, Behrang Mohit, & Rebecca Hwa: Improving phrase-based translation with prototypes of short phrases.  NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.301-304. [PDF, 106KB]

(2010) Saab Mansour, Stephan Peitz, David Vilar, Joern Wuebker, & Hermann Ney: The RWTH Aachen machine translation system for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.163-168. [PDF, 248KB]

(2010) Evgeny Matusov & Selçuk Köprü: AppTek’s APT machine translation system for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.29-36. [PDF, 336KB]

(2010) ThuyLinh Nguyen, Stephan Vogel, & Noah A.Smith: Nonparametric word segmentation for machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.815-823. [PDF, 267KB]

(2010) Michael Paul, Marcello Federico, & Sebastian Stüker: Overview of the IWSLT 2010 evaluation campaign. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.3-27. [PDF, 558KB]

(2010) Narges Sharif Razavian & Stephan Vogel: Fixed length word suffix for factored statistical machine translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.147-150. [PDF, 268KB]

(2010) Jason Riesa & Daniel Marcu: Hierarchical search for word alignment. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.157-166. [PDF, 413KB]

(2010) Brian Roberson: Multi-language desktop suite. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; abstract

(2010) Kashif Shah, Loïc Barrault, & Holger Schwenk: Translation model adaptation by resampling. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 392-399. [PDF, 176KB]

(2010) Wade Shen, Tim Anderson, Ray Slyh, & A.Ryan Aminzadeh: The MIT-LL/AFRL IWSLT-2010 MT system. Proceedings of the 7th International Workshop on Spoken L anguage Translation, 2-3 December 2010, Paris, France; pp.127-134. [PDF, 341KB]; presentation: 11 slides [PDF of PPT, 422KB]

(2010) Kristen Summers & Hassan Sawaf: User-generated system for critical document triage and exploitation – version 2011. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; abstract

(2010) Nadi Tomeh, Alexandre Allauzen, Guillaume Wisniewski & François Yvon: Refining word alignment with discriminative training. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 203KB]

(2010) Ines Turki Khemakhem, Salma Jamoussi, & Abdelmajid Ben Hamadou: The MIRACL Arabic-English statistical machine translation system for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.119-125. [PDF, 1083KB]

(2010) Sirvan Yahyaei & Christoph Monz: Dynamic distortion in a discriminative reordering model for statistical machine translation. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.353-360. [PDF, 460KB]

(2010) Sirvan Yahyaei & Christof Monz: The QMUL system description for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.157-162. [PDF, 303KB]

(2010) Rabih Zbib, Spyros Matsoukas, Richard Schwartz, & John Makhoul: Decision trees for lexical smoothing in statistical machine translation. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the work shop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 428-437. [PDF, 2766KB]

(2009) Sadaf Abdul-Rauf & Holger Schwenk: Exploiting comparable corpora with TER and TERp. [ACL-IJCNLP-2009] Proceedings of the 2nd Workshop on Building and Using Comparable Corpora, Suntec, Singapore, 6 August 2009; pp.46-54. [PDF, 186KB]

(2009) Kedar Bellare, Koby Crammer, & Dayne Freitag: Loss-sensitive discriminative training of machine transliteration models. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Proceedings of the Student Research Workshop and Doctoral Consortium, Boulder, Colorado, June 1, 2009;  pp.61-65. [PDF, 117KB]

(2009) Nicola Bertoldi, Arianna Bisazza, Mauro Cettolo, Germán Sanchis-Trilles, & Marcello Federico: FBK @ IWSLT 2009. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 37-44. [PDF, 346KB]; poster [PDF, 411KB]

(2009) Alexandra Birch, Phil Blunsom & Miles Osborne: A quantitative analysis of reordering phenomena. Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.197-205. [PDF, 808KB]

(2009) Reza Bosagh Zadeh: Building strong multilingual aligned corpora. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.176-181. [PDF, 497KB]

(2009) Fethi Bougares, Laurent Besacier, & Hervé Blanchon: LIG approach for IWSLT09: using multiple morphological segmenters for spoken language translation of Arabic. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 60-64. [PDF, 377KB]; presentation [PDF of PPT, 668KB]

(2009) Jamie Brunning, Adrià de Gispert & William Byrne: Context-dependent alignment models for statistical machine translation.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.110-118. [PDF, 226KB]

(2009) Sherri Condon, Gregory A.Sanders, Dan Parvaz, Alan Rubenstein, Christy Doran, John Aberdeen, & Beatrice Oshika: Normalization for automated metrics: English and Arabic speech translation. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 33-40. [PDF, 326KB]

(2009) Marta R.Costa-jussà & Rafael E.Banchs: Barcelona Media SMT system description for the IWSLT 2009: introducing source context information. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 24-28. [PDF, 265KB]; poster [PDF, 281KB[

(2009) Steve DeNeefe & Kevin Knight: Synchronous tree adjoining machine translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.727-736. [PDF, 398KB]

(2009) Thomas Deselaers, Saša Hasan, Oliver Bender, & Hermann Ney: A deep learning approach to machine transliteration.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.233-241. [PDF, 201KB]

(2009) Jakob Elming, Nizar Habash, & Josep M.Crego: Combination of statistical word alignments based on multiple preprocessing schemes.  In: Cyril Goutte, Nicola Cancedda, Marc Dymetman, & George Foster (eds.) Learning machine translation. (Cambridge, Mass.: The MIT Press, 2009); pp.93-110.

(2009) Adrià de Gispert, Sami Virpioja, Mikko Kurimo, & William Byrne: Minimum Bayes risk combination of translation hypotheses from alternative morphological decompositions.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.73-76. [PDF, 72KB]

(2009) Spence Green, Conal Sathi & Christopher D.Manning: NP subject detection in verb-initial Arabic clauses. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp.  [PDF, 729KB]

(2009) Nizar Habash: REMOOV: a tool for online handling of out-of-vocabulary words in machine translation. MEDAR 2009: 2nd International Conference on Arabic Language Resources & Tools, 22-23 April 2009, Cairo, Egypt; pp.217-220. [PDF, 342KB]

(2009) Nizar Habash, Bonnie Dorr, & Christof Monz: Symbolic-to-statistical hybridization: extending generation-heavy machine translation [abstract]. Machine Translation 23 (1), February 2009; pp.23-63.

(2009) Ulf Hermjakob: Improved word alignment with statistics and linguistic heuristics. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.229-237. [PDF, 107KB]

(2009) Fei Huang: Confidence measure for word alignment. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.932-940. [PDF, 727KB]

(2009) Gonzalo Iglesias, Adrià de Gispert, Eduardo R.Banga & William Byrne: Hierarchical phrase-based translation with weighted finite state transducers. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.433-441. [PDF, 170KB]

(2009) Gonzalo Iglesias, Adrià de Gispert, Eduardo R.Banga & William Byrne: Rule filtering by pattern for efficient hierarchical translation. EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.380-388. [PDF, 109KB]

(2009) Maxim Khalilov, José A.R.Fonollosa, & Mark Dras: Compiling hierarchical word reordering and decoding in phrase-based statistical machine translation.  Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.78-86. [PDF, 1161KB]

(2009) Maxim Khalilov & José A.R.Fonollosa: N-gram-based statistical machine translation versus syntax augmented machine translation: comparison and system combination.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.424-432. [PDF, 564KB]

(2009) Yves Lepage, Adrien Lardilleux, & Julien Gosme: The GREYC translation memory for the IWSLT 2009 evaluation campaign: one step beyond translation memory. IWSLT 2009: Proceedings of the International Workshop on Spoke n Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 45-49. [PDF, ]; presentation [PDF of PPT,

(2009) Spyros Matsoukas, Antti-Veikko I.Rosti, & Bing Zhang: Discriminative corpus weight estimation for machine translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.708-717. [PDF, 116KB]

(2009) Evgeny Matusov, Gregor Leusch, & Hermann Ney: Learning to combine machine translation systems.  In: Cyril Goutte, Nicola Cancedda, Marc Dymetman, & George Foster (eds.) Learning machine translation. (Cambridge, Mass.: The MIT Press, 2009); pp.257-276.

(2009) Arne Mauser, Saša Hasan & Hermann Ney: Extending statistical machine translation with discriminative and trigger-based lexicon models. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.210-218. [PDF, 266KB]

(2009) Coşkun Mermer, Hamza Kaya, & Mehmet Uğur Doğan: The TÜBİTAK-UEKAE statistical machine translation system for IWSLT 2009. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 113-117. [PDF, 457KB]; poster [PDF, 2530KB]

(2009) Hamdy S.Mubarak, Mohamed Al Sharqawy & Esraa Al Masry: Diacritization and transliteration of proper nouns from Arabic to English.  MEDAR 2009: 2nd International Conference on Arabic Language Resources & Tools, 22-23 April 2009, Cairo, Egypt; pp.256-259. [PDF, 367KB]

(2009) Sara Noeman: Transliteration using phrase based SMT approach on substrings.  MEDAR 2009: 2nd International Conference on Arabic Language Resources & Tools, 22-23 April 2009, Cairo, Egypt; pp.199-204. [PDF, 348KB]

(2009) Sebastian Padó, Michel Galley, Dan Jurafsky & Chris Manning: Robust machine translation evaluation with entailment features. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.297-305. [PDF, 437KB]

(2009) Michael Paul: Overview of the IWSLT 2009 evaluation campaign. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 1-18. [PDF, 398KB]; presentation [PDF of PPT, 481KB]

(2009) Ibrahim M.Saleh & Nizar Habash: Automatic extraction of lemma-based bilingual dictionaries for morphologically rich languages. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 622KB]

(2009) Yasser Salem & Brian Nolan: Designing an XML lexicon architecture for Arabic machine translation based on role and reference grammar.  MEDAR 2009: 2nd International Conference on Arabic Language Resources & Tools, 22-23 April 2009, Cairo, Egypt; pp.221-229. [PDF, 691KB]

(2009) Holger Schwenk, Loïc Barrault, Yannick Estève, & Patrik Lambert: LIUM’s statistical machine translation system for IWSLT 2009.  IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 65-70. [PDF, 344KB]; poster [PDF, 117KB]

(2009) Hendra Setiawan, Min-Yen Kan, Haizhou Li, & Philip Resnik: Topological ordering of function words in hierarchical phrase-based translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.324-332. [PDF, 504KB]

(2009) Libin Shen, Jinxi Xu, Bing Zhang, Spyros Matsoukas & Ralph Weischedel: Effective use of linguistic and contextual information for statistical machine translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.72-80. [PDF, 122KB]

(2009) Wade Shen, Brian Delaney, A.Ryan Aminzadeh, Tim Anderson, & Ray Slyh: The MIT-LL/AFRL IWSLT-2009 MT system. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 71-78. [PDF, 398KB]; presentation [PDF of PPT, 1730KB]

(2009) Mei Yang, Amittai Axelrod, Kevin Duh, & Katrin Kirchhoff: The University of Washington machine translation system for IWSLT 2009. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 124-128. [PDF, 264KB]; poster [PDF, 1111KB]

(2009) The 2009 NIST machine translation evaluation plan (MT09). [NIST, 2009]; 8pp. [PDF,360KB]

(2009) NIST 2009 open machine translation evaluation (MT09). Official release of results: Arabic-English. [NIST, 2009]; [HTML, 301KB]

(2009) NIST 2009 open machine translation evaluation (MT09). Official release of results: Combination tests. [NIST, 2009]; HTML, 114KB]

(2009) NIST 2009 open machine translation evaluation (MT09). Official release of results: Progress test. [NIST, 2009]; [HTML, 108KB]

(2008) Farag Ahmed & Andreas Nürnberger: Arabic/English word translation disambiguation using parallel corpora and matching  schemes.  EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.6-11. [PDF, 616KB]

(2008) Farag Ahmed & Andreas Nürnberger: Arabic/English word translation disambiguation approach based on naïve Bayesian classifier. Proceedings of the International Multiconference on Computer Science and Information Technology, Wisla, Poland, 20-22 October 2008; pp.331-338. [PDF, 318KB]

(2008) L.Besacier, A.Ben-Youssef, & H.Blanchon: The LIG Arabic/English speech translation system at IWSLT08. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp. 58-62. [PDF, 212KB]; presentation [PDF, 91KB]

(2008) Graeme Blackwood, Adrià de Gispert, Jamie Brunning, & William Byrne: Large-scale statistical machine translation with weighted finite state transducers. FSMNLP 2008: Seventh International Workshop on Finite-State Methods and Natural Language Processing, Ispra, Italy, 11-12 September 2009; 12pp. [PDF, 361KB]

(2008) Ralf D.Brown: Exploiting document-level context for data-driven machine translation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.46-55. [PDF, 496KB]

(2008) Simon Carter, Christof Monz, & Sirvan Yahyaei: The QMUL system description for IWSLT 2008. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp.104-107. [PDF, 161KB]

(2008) David Chiang, Yuval Marton, & Philip Resnik: Online large-margin training of syntactic and structural translation features.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.224-233. [PDF, 162KB]

(2008) Josep M. Crego & Nizar Habash: Using shallow syntax information to improve word alignment and reordering for SMT. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.53-61. [PDF, 200KB]

(2008) Steve DeNeefe, Ulf Hermjakob & Kevin Knight: Overcoming vocabulary sparsity in MT using lattices. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.89-96. [PDF, 492KB]

(2008) Jennifer Doyon, Christine Doran, C.Donald Means, & Domenique Parr: Automated machine translation improvement through post-editing techniques: analyst and translator experiments. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.346-353. [PDF, 712KB]

(2008) Kevin Duh & Katrin Kirchhoff: Beyond log-linear models: boosted minimum error rate training for n-best re-ranking. ACL 2008 HLT Short Papers, June 2008, Columbus, Ohio; pp.37-40. [PDF, 102KB]

(2008) Christopher Dyer, Smaranda Muresan, & Philip Resnik: Generalizing word lattice translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 1012-1020 . [PDF, 201KB]

(2008) Kathleen Egan, Francis Kubala, & Allen Sears: User-centered development and implementation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.354-363. [PDF of PPT presentation, 1017KB]

(2008) Victoria Fossum, Kevin Knight, & Steven Abney: Using syntax to improve word alignment precision for syntax-based machine translation. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.44-52. [PDF, 150KB]

(2008) Michel Galley & Christopher D.Manning: A simple and effective hierarchical phrase reordering model. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.848-856. [PDF, 230KB]

(2008) Nizar Habash: Four techniques for online handling of out-of-vocabulary words in Arabic-English statistical machine translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Short papers, June 16-17, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 57-60. [PDF, 81KB]

(2008) Ulf Hermjakob, Kevin Knight, & Hal Daumé III: Name translation in statistical machine translation: learning when to transliterate. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 389-397. [PDF, 107KB]

(2008) Maxim Khalilov, Maria R. Costa-jussà, Carlos A.Henríquez Q., José A.R.Fonollosa, Adolfo Hernández H., José B.Mariño, Rafael E. Banchs, Chen Boxing, Min Zhang, Aiti Aw, & Haizhou Li: The TALP&I2R SMT systems for IWSLT 2008. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp.116-123 [PDF, 236KB]

(2008) Kriste Krstovski, Michael Decerbo, Rohit Prasad, David Stallard, Shirin Saleem, & Premkumar Natarajan: A wearable headset speech-to-speech translation system. ACL-08: HLT. Workshop on Mobile Language Processing. Proceedings of the workshop, June 20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 10-12. [PDF, 117KB]

(2008) Jonghoon Lee & Gary Geunbae Lee: POSTECH machine translation system for IWSLT 2008 evaluation campaign. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp.98-103 [PDF, 424KB]; presentation [PDF, 280KB]

(2008) Yves Lepage, Adrien Lardilleux, Julien Gosme, & Jean-Luc Manguin: The GREYC machine translation system for the IWSLT 2008 evaluation campaign. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp. 39-45. [PDF, 184KB]; presentation [PDF, 142KB]

(2008) Yanjun Ma, John Tinsley, Hany Hassan, Jinhua Du, & Andy Way: Exploiting alignment techniques in MATREX: the DCU machine translation system for IWSLT 2008. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp. 26-33. [PDF, 198KB]; presentation [PDF, 559KB]

(2008) Yuval Marton & Philip Resnik: Soft syntactic constraints for hierarchical phrased-based translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 1003-1011. [PDF, 153KB]

(2008) Coşkun Mermer, Hamza Kaya, Ömer Farukhan Güneş, & Mehmet Uğur Doğan: The TÜBÍTAK-UEKAE statistical machine translation system for IWSLT 2008. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp.138-142 [PDF, 239KB; presentation [PDF, 179KB]

(2008) ThuyLinh Nguyen & Stephan Vogel: Context-based Arabic morphological analysis for machine translation. CoNLL: proceedings of the  Twelfth Conference on Computational Natural Language Learning, 16-17 August 2008, Manchester, UK; pp.135-142. [PDF, 123KB]

(2008) Michael Paul: Overview of the IWSLT 2008 evaluation campaign. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp. 1-17. [PDF, 262KB]; presentation [PDF, 765KB]

(2008) John F.Pitrelli, Burn L.Lewis, Edward A.Epstein, Martin Franz, Daniel Kiecza, Jerome L.Quinn, Ganesh Ramaswamy, Amit Srivastava, & Paola Virga: Aggregating distributed STT, MT, and information extraction engines: the GALE interoperability-demo system. Interspeech 2008: 9th Annual Conference of the International Speech Communication Association, Brisbane, Australia, September 22-26, 2008; pp.2743-2746; abstract [PDF, 63KB]

(2008) John F.Pitrelli, Burn L.Lewis, Edward A.Epstein, Jerome L.Quinn, & Ganesh Ramaswamy: A data format enabling interoperation of speech recognition, translation and information extraction engines: the GALE type system. Interspeech 2008: 9th Annual Conference of the International Speech Communication Association, Brisbane, Australia, September 22-26, 2008; pp.1654-1657; abstract [PDF, 67KB]

(2008) Allison L.Powell & Allison Blodgett: The use of machine-generated transcripts during human translation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.427-434. [PDF, 649KB]

(2008) Hassan Sawaf, Braddock Gaskill, & Michael Veronis: Hybrid machine translation applied to media monitoring. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.440-447. [PDF, 963KB]

(2008) Holger Schwenk, Yannick Estève, & Sadaf Abdul Rauf: The LIUM Arabic/English statistical machine translation system for IWSLT 2008. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp. 63-68. [PDF, 120KB]; presentation [PDF, 49KB]

(2008) Holger Schwenk & Philipp Koehn: Large and diverse language models for statistical machine translation. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.661-666. [PDF, 364KB]

(2008) Wade Shen, Brian Delaney, Tim Anderson, & Ray Slyh: The MIT-LL/AFRL IWSLT-2008 MT system. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp. 69-76. [PDF, 208KB]; presentation [PDF, 607KB]

(2008) Matthew Snover, Bonnie Dorr, & Richard Schwartz: Language and translation model adaptation using comparable corpora. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.857-866. [PDF, 132KB]

(2008) Zhiyi Song & Stephanie Strassel: Entity translation and alignment in the ACE-07 ET task. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 135KB]

(2008) Michael Subotin: Generalizing local translation models. Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.28-36. [PDF, 148KB]

(2008) Jean Tavernier, Rosa Cowan, & Michelle Vanni: Holy Moses! Leveraging existing tools and resources for entity translation.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 5pp. [PDF, 74KB]

(2008) Christoph Tillmann: A rule-driven dynamic programming decoder for statistical MT. Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.37-45. [PDF, 152KB]

(2008) Roy W.Tromble, Shankar Kumar, Franz Och, & Wolfgang Machery: Lattice minimum Bayes-risk decoding for statistical machine translation. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.620-629. [PDF, 285KB]

(2008) Carol Van Ess-Dykema, Helen G. Gigley, Stephen Lewis, & Emily Vancho Bannister: Embedding technology at the front end of a human translation workflow: an NVTC vision. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.457-463. [PDF, 725KB] 

(2008) David Vilar, Daniel Stein, Yuqi Zhang, Evgeny Matusov, Arne Mauser, Oliver Bender, Saab Mansour, & Hermann Ney: The RWTH machine translation system for IWSLT 2008. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp.108-115 [PDF, 228KB]; presentation [PDF, 74KB]

(2008) Clare R. Voss, Jamal Laoudi, & Jeffrey Micher: Exploitation of an Arabic language resource for MT evaluation: using Buckwalter-based lookup tool to augment CMU alignment algorithm.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 246KB]

(2008) Brian A. Weiss, Craig Schlenoff, Greg Sanders, Michelle P.Steves, Sherri Condon, Jon Phillips, & Dan Parvaz: Performance evaluation of speech translation systems.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 123KB]

(2008) Andreas Zollmann, Ashish Venugopal, Franz Och, & Jay Ponte: A systematic comparison of phrase-based, hierarchical and syntax-augmented statistical MT. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.1145-1152. [PDF, 153KB]

(2008) NIST: MetricsMATR challenge. [NIST, 2008]. 6pp. [PDF, 35KB]

(2008) The 2008 NIST open machine translation evaluation plan (MT08). [NIST, 2008]; 7pp. [PDF, 360KB]

(2008) NIST 2008 open machine translation evaluation – (MT08). Official evaluation results. [NIST, 2008] [HTML]

(2007) Nguyen Bach, Matthais Eck, Paisarn Charoenpornsawat, Thilo Köhler, Sebastian Stüker, ThuyLinh Nguyen, Roger Hsiao, Alex Waibel, Stephan Vogel, Tanja Schultz, & Alan W.Black: The CMU TransTac 2007 eyes-free two-way speech-to-speech translation system. IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 8pp. [PDF, 489KB]; presentation [PDF, 1043KB]

(2007) L. Besacier, A.Mahdhaoui, & V-B.Le: The LIG Arabic/English speech translation system at IWSLT07. IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 5pp. [PDF, 305KB]; presentation [PDF, 313KB]

(2007) Kfir Bar, Y.Choueka, & N.Dershowitz: An Arabic to English example-based translation system. ICTIS 2007: Information and Communication Technologies International Symposium. Workshop on Arabic natural language processing, 3-5 April 2007, Fez, Morocco; pp. 355-359 [PDF, 165KB]

(2007) Yi Chang, Ying Zhang, Stephan Vogel, & Jie Yang: Enhancing image-based Arabic document translation using noisy channel correction model. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.89-95 [PDF, 157KB]

(2007) Christopher J.Dyer: The University of Maryland translation system for IWSLT 2007. IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 6pp. [PDF, 174KB]

(2007) Nicola Ehling, Richard Zens, & Hermann Ney: Minimum Bayes risk decoding for BLEU.  ACL 2007: proceedings of demo and poster sessions, Prague, Czech Republic, June 2007; pp. 101-104 [PDF, 334KB]

(2007) Cameron S.Fordyce: Overview of the IWSLT 2007 evaluation campaign. IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 12pp. [PDF, 923KB]; presentation [PDF, 579KB]

(2007) Nizar Habash: Syntactic preprocessing for statistical machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.215-222 [PDF, 118KB]

(2007) Olivier Hamon: Experiences and conclusions from the CESTA evaluation project. MT Summit XI Workshop: Automatic procedures in MT evaluation, 11 September 2007, Copenhagen, Denmark, [Proceedings]; 22pp. [PDF of PPT presentation, 108KB]

(2007) Hany Hassan, Yanjun Ma, & Andy Way: MaTrEx: the DCU machine translation system for IWSLT 2007. IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 6pp. [PDF, 113KB]; presentation [PDF, 185KB]

(2007) Hany Hassan, Khalil Sima’an, & Andy Way: Supertagged phrase-based statistical machine translation.  ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 288-295 [PDF, 175KB]

(2007) Sanjika Hewavitharana, Alon Lavie, & Stephan Vogel: Experiments with a noun-phrase driven statistical machine translation system. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.247-253 [PDF, 241KB]

(2007) Fei Huang & Kishore Papineni: Hierarchical system combination for machine translation. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 277-286. [PDF, 158KB]

(2007) Abraham Ittycheriah & Salim Roukos: Direct translation model 2. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.57-64 [PDF, 238KB]

(2007) Douglas Jones, Martha Herzog, Hussny Ibrahim, Arvind Jairam, Wade Shen, Edward Gibson, & Michael Emonts: ILR-based MT comprehension test with multi-level questions. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; Companion volume, pp.77-80 [PDF, 67KB]

(2007) Mehdi M.Kashani, Eric Joanis, Roland Kuhn, George Foster, & Fred Popowich: Integration of an Arabic transliteration module into a statistical machine translation system.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp.17-24 [PDF, 145KB]

(2007) Jae Dong Kim & Stephan Vogel: Iterative refinement of lexicon and phrasal alignment. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.281-288 [PDF, 121KB]

(2007) Katrin Kirchhoff & Mei Yang: The University of Washington machine translation system for the IWSLT 2007 competition.  IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 6pp. [PDF, 106KB]; presentation [PDF, 505KB]

(2007) Katrin Kirchhoff, Owen Rambow, Nizar Habash, & Mona Diab: Semi-automatic error analysis for large-scale statistical machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.289-296 [PDF, 192KB]

(2007) Shankar Kumar, Franz Och, & Wolfgang Macherey: Improving word alignment with bridge languages. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 42-50. [PDF, 183KB]

(2007) Patrik Lambert, Marta R.Costa-jussà, Josep M.Crego, Maxim Khalilov, José B.Mariño, Rafael E.Banchs, José A.R.Fonollosa, & Holger Schwenk: The TALP ngram-based SMT system for IWSLT 2007. IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 7pp. [PDF, 312KB]; presentation [PDF, 584KB]

(2007) Ian Lane, Andreas Zollmann, Thuy Linh Nguyen, Nguyen Bach, Ashish Venugopal, Stephan Vogel, Kay Rottmann, Ying Zhang, & Alex Waibel: The CMU-UKA statistical machine translation systems for IWSLT 2007. IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 7pp. [PDF, 150KB]; presentation [PDF, 393KB]

(2007) Yves Lepage & Adrien Lardilleux: The GREYC machine translation system for the IWSLT 2007 evaluation campaign. IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 7pp. [PDF, 312KB]; presentation [PDF, 584KB]

(2007) Jonathan May & Kevin Knight: Syntactic re-alignment models for machine translation. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 360-368. [PDF, 180KB]

(2007) Coşkun Mermer, Hamza Kaya, & Mehmet Uğur Doğan: The TÜBÍTAK-UEKAE statistical machine translation system for IWSLT 2007.  IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 4pp. [PDF, 108KB]; presentation [PDF, 202KB]

(2007) Aaron B.Phillips: Sub-phrasal matching and structural templates in example-based MT. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.163-170 [PDF, 295KB]

(2007) Aaron B. Phillips, Violetta Cavalli-Sforza, & Ralf D. Brown: Improving example-based machine translation through morphological generalization and adaptation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.369-375 [PDF, 115KB]

(2007) Sharath Rao, Ian Lane, & Tanja Schultz: Optimizing sentence segmentation for spoken language translation. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2845-2848; abstract [PDF, 23KB]

(2007) Antti-Veikko I. Rosti, Spyros Matsoukas, & Richard Schwartz: Improved word-level system combination for machine translation. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 312-319 [PDF, 135KB]

(2007) Ruhi Sarikaya, Yonggang Deng, & Yuqing Gao: Context dependent word modeling for statistical machine translation using part-of-speech tags. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2821-2824; abstract [PDF, 23KB]

(2007) Wade Shen, Brian Delaney, Tim Anderson, & Ray Slyh: The MIT-LL/AFRL IWSLT-2007 MT system.  IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 8pp. [PDF, 174KB]; presentation [PDF, 623KB]

(2007) Tarek Sherif & Grzegorz Kondrak: Bootstrapping a stochastic transducer for Arabic-English transliteration extraction. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 864-871 [PDF, 189KB]

(2007) Tarek Sherif & Grzegorz Kondrak: Substring-based transliteration. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 944-951 [PDF, 152KB]

(2007) Christoph Tillmann & Tong Zhang: A block bigram prediction model for statistical machine translation. ACM Transactions on Speech and Language Processing 4 (3), art.6. [abstract, PDF, 12KB]

(2007) Taro Watanabe, Jun Suzuki, Katsuhito Sudoh, Hajime Tsukada, & Hideki Isozaki: Larger feature set approach for machine translation in IWSLT 2007.  IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 8pp. [PDF, 154KB]; presentation [PDF, 1303KB]

(2007) Taro Watanabe, Jun Suzuki, Hajime Tsukada, & Hideki Isozaki: Online large-margin training for statistical machine translation. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 764-773. [PDF, 199KB]

(2007) Ying Zhang & Stephan Vogel: PanDoRA: a large-scale two-way statistical machine translation system for hand-held devices. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.543-550 [PDF, 472KB]

(2007) Bing Zhao, Nguyen Bach, Ian Lane & Stephan Vogel: A log-linear block transliteration model based on bi-stream HMMs. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.364-371 [PDF, 453KB]

(2006) proceedings of  International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2006], November 27-28, 2006, Kyoto, Japan

(2006) Azzah Al-Maskari & Mark Sanderson: The affect of machine translation on the performance of Arabic-English QA system. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Workshop on Multilingual Question Answering (MLQA06), Trento, Italy, April 4, 2006; pp.9-14 [PDF, 452KB]

(2006) Yaser Al-Onaizan & Kishore Papineni: Distortion models for statistical machine translation. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.529-536. [PDF, 135KB]

(2006) Roldano Cattoni, Nicola Bertoldi, Mauro Cettolo, Boxing Chen, & Marcello Federico: A web-based demonstrator of a multi-lingual phrase-based translation system. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Posters and demonstrations, Trento, Italy, April 5-6, 2006; pp.91-94 [PDF, 184KB]

(2006) Anas El Isbihani, Shahram Khadivi, Oliver Bender, & Hermann Ney: Morpho-syntactic Arabic preprocessing for Arabic-to-English statistical machine translation.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 15-22 [PDF, 156KB]

(2006) Andrew T. Freeman, Sherri L. Condon & Christopher M. Ackerman: Cross linguistic name matching in English and Arabic: a “one to many mapping” extension of the Levenshtein edit distance algorithm. HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 471-478 [PDF, 371KB]

(2006) Adria de Gispert & José B.Mariño: Linguistic tuple segmentation in n-gram-based statistical machine translation. Interspeech 2006: ICSLP Ninth International Conference on  Spoken Language Processing, Pittsburgh, PA, USA, September 17-21, 2006, paper 1049; abstract [PDF, 90KB]

(2006) Nizar Habash, Bonnie Dorr & Christof Monz: Challenges in building an Arabic-English GHMT system with SMT components. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.56-65 [PDF, 116KB]

(2006) Nizar Habash, Clinton Mah, Sabiha Imran, Randy Calistri-Yeh, & Páraic Sheridan: Design, construction and validation of an Arabic-English conceptual interlingua for cross-lingual information retrieval. LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.107-112 [PDF, 408KB]

(2006) Sattar Izwaini: Problems of Arabic machine translation: evaluation of three systems. The Challenge of Arabic for NLP/MT. International conference at the British Computer Society, London, 23 October 2006; pp.118-148. [PDF, 281KB]

(2006) Mehdi M. Kashani, Fred Popowich, & Fatiha Sadat: Automatic transliteration of proper nouns from Arabic to English. The Challenge of Arabic for NLP/MT. International conference at the British Computer Society, London, 23 October 2006; pp.76-83. [PDF, 179KB]

(2006) Michael C. McCord: MT for social impact (contribution to panel on “Machine translation for social impact”).  AMTA 2006: 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; [PDF of PPT presentation, 1022KB]

(2006) Keith J.Miller & Michelle Vanni: Formal vs. informal: register-differentiated Arabic MT evaluation in the PLATO paradigm.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.161-166 [PDF, 296KB]

(2006) Agam Patel & Dragomir R.Radev: Lexical similarity can distinguish between automatic and manual translations. LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.1230-1235 [PDF, 326KB]

(2006) Fatiha Sadat & Nizar Habash: Combination of Arabic preprocessing schemes for statistical machine translation.  Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.1-8. [PDF, 104KB]

(2006) Rami B. Safadi: Social impact of translation via SMS (contribution to panel on “Machine translation for social impact”).  AMTA 2006: 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; [PDF of PPT presentation, 70KB]

(2006) Christoph Tillmann & Tong Zhang: A discriminative global training algorithm for statistical MT. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.721-728. [PDF, 179KB]

(2006) Dmitry Zelenko & Chinatsu Aone: Discriminative methods for transliteration.  EMNLP-2006: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, Australia, July 2006; pp. 612-617. [PDF, 342KB]

(2006) Weizhong Zhu, Bowen Zhou, Charles Prosser, Pavel Krbec, & Yuqing Gao: Recent advances of IBM’s handheld speech translation system. Interspeech 2006: ICSLP Ninth International Conference on  Spoken Language Processing, Pittsburgh, PA, USA, September 17-21, 2006, paper 1590; abstract [PDF, 93KB]

(2006) Andreas Zollmann, Ashish Venugopal, & Stefan Vogel: Bridging the inflection morphology gap for Arabic statistical machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 201-204 [PDF, 147KB]

(2006) The 2006 NIST machine translation evaluation plan (MT06). [NIST, 2006]; 6pp. [PDF,92KB]

(2005) Jill Burstein & Daniel Marcu: Translation exercise assistant: automated generation of translation exercises for native-Arabic speakers learning English. HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; Demonstration abstracts, pp. 16-17. [PDF, 53KB]

(2005) Yonggang Deng & William Byrne: HMM word and phrase alignment for statistical machine translation.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 169-176. [PDF, 189KB]

(2005) Matthias Eck & Chiori Hori: Overview of the IWSLT 2005 Evaluation Campaign. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2005], 24-25 October, 2005, Pittsburgh, PA, USA; 22pp. [PDF, 256KB]

(2005) Hany Hassan & Jeffrey Sorensen: An integrated approach for Arabic-English named entity translation. ACL-2005: Workshop on Computational Approaches to Semitic Languages, University of Michigan, Ann Arbor, 29 June 2005; pp. 87-93. [PDF, 108KB]

(2005) Sanjika Hewavitharana, Bing Zhao, Almut Silja Hildebrand, Matthias Eck, Chiori Hori, Stephan Vogel, & Alex Waibel: The CMU statistical machine translation system for IWSLT2005. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2005], 24-25 October, 2005, Pittsburgh, PA, USA; 8pp. [PDF, 93KB]

(2005) Thomas Hoeft, Nick Cramer, M.L.Gregory, & Elizabeth Hetzler: Bridging the gap between technology and users: leveraging machine translation in a visual data triage tool.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; Demonstration abstracts, pp. 2-3. [PDF, 77KB]

(2005) Abraham Ittycheriah & Salim Roukos: A maximum entropy word aligner for Arabic-English machine translation.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 33-40. [PDF, 327KB]

(2005) Shyamsundar Jayaraman & Alon Lavie: Multi-engine machine translation guided by explicit word matching.  ACL-2005: Interactive Poster and Demonstration Sessions, University of Michigan, Ann Arbor, June 2005; pp. 101-104. [PDF, 117KB]

(2005) Douglas Jones, Edward Gibson, Wade Shen, Neil Granoien, Martha Herzog, Douglas Reynolds, & Clifford Weinstein: Measuring human readability of machine generated text: three case studies in speech recognition and machine translation. Proceedings of 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), March 18-23, 2005, Philadelphia, PA, USA; vol.5, pp. 1009-1012 [PDF, 233KB]

(2005) Shankar Kumar & William Byrne: Local phrase reordering models for statistical machine translation.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 161-168. [PDF, 193KB]

(2005) Young-Suk Lee: IBM statistical machine translation for spoken languages. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2005], 24-25 October, 2005, Pittsburgh, PA, USA; 8pp. [PDF, 1029KB]

(2005) Advaith Siddharthan & Kathleen McKeown: Improving multilingual summarization: using redundancy in the input to correct MT errors.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 33-40. [PDF, 255KB]

(2005) Hajime Tsukada, Taro Watanabe, Jun Suzuki, Hideto Kazawa, & Hideki Isozaki: The NTT statistical machine translation system for IWSLT2005. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2005], 24-25 October, 2005, Pittsburgh, PA, USA; 6pp. [PDF, 65KB]

(2005) Richard Zens, Oliver Bender, Saša Hasan, Shahram Khadivi, Evgeny Matusov, Jia Xu, Yuqi Zhang, & Hermann Ney: The RWTH phrase-based statistical machine translation system. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2005], 24-25 October, 2005, Pittsburgh, PA, USA; 8pp. [PDF, 114KB]

(2005) Muhammad Raji Zughoul & Awatef Miz’il Abu-Alshaar: English/Arabic/English machine translation: a historical perspective. Meta 50 (3), 2005; pp.1022-1041. [PDF, 173KB]

(2005) The 2005 NIST machine translation evaluation plan (MT-05). [NIST, 2005]; 6pp. [PDF, 51KB]

(2005) NIST 2005 machine translation evaluation. Official results. [NIST, 2005]; 6pp. [PDF, 38KB]

(2004) Young-Suk Lee: Morphological analysis for statistical machine translation. HLT-NAACL 2004: Human Language Technology conference and North American Chapter of the Association for Computational Linguistics annual meeting, May 2-7, 2004, The Park Plaza Hotel, Boston, USA. Short Papers; pp. 57-60. [PDF, 150KB]

(2004) Daniel Marcu, Alex Fraser, William Wong, & Kevin Knight: Language Weaver Arabic -> English MT [abstract for demonstration]. Coling 2004: Workshop on Computational Approaches to Arabic Script-based Languages, University of Geneva, Geneva, Switzerland, August 28, 2004; 1pp. [PDF, 123KB]

(2004) Dragos Stefan Munteanu, Alexander Fraser, & Daniel Marcu: Improved machine translation performance via parallel sentence extraction from comparable corpora.  HLT-NAACL 2004: Human Language Technology conference and North American Chapter of the Association for Computational Linguistics annual meeting, May 2-7, 2004, The Park Plaza Hotel, Boston, USA; pp. 265-272. [PDF, 1125KB]

(2004) M. Vanni, C.R.Voss, & C. Tate: Ground truth, reference truth & “omniscient truth” – parallel phrases in parallel texts for MT evaluation.  LREC-2004. Workshop, 25th May 2004: The amazing utility of parallel and comparable corpora; pp. 10-13. [PDF, 352KB]

(2004) The 2004 NIST machine translation evaluation plan (MT-04). [NIST, 2004]; 3pp. [PDF, 125KB]

(2003) Y. Al-Onaizan, R.Florian, M.Franz, H.Hassan, Y.S.Lee, S.McCarley, K.Papineni, S.Roukos, J.Sorensen, C.Tillmann, T.Ward, & F.Xia: TIPS: a translingual information processing system HLT-NAACL 2003: conference combining Human Language Technology conference series and the North American Chapter of the Association for Computational Linguistics conference series,  May 27 – June 1,  2003, Edmonton, Canada; Demonstrations, pp.1-2 [PDF, 284KB]

(2003) Leslie Barrett: Considerations of methodology and human factors in rating a suite of translated sentences. "Towards systematizing MT evaluation": a workshop on machine translation evaluation at the MT Summit IX, New Orleans, USA, 27 September 2003; pp.13-19. [PDF, 296KB]

(2003) Charles Schafer & David Yarowsky: A two-level syntax-based approach to Arabic-English statistical machine translation. MT Summit IX -- workshop: Machine translation for semitic languages, New Orleans, USA, 23 September 2003 [PDF, 403KB]

(2003) Charles Schafer & David Yarowsky: Statistical machine translation using coercive two-level syntactic transduction EMNLP-2003: proceedings of the 2003  conference on Empirical Methods in Natural Language Processing, a meeting of SIGDAT, a special interest group of the ACL, held in conjunction with ACL-03,  11-12 July  2003, Sapporo, Japan; 8pp. [PDF, 363KB]

(2003) David Stallard, John Makhoul, Frederick Choi, Ehry Macrostie, Premkumar Natarajan, Richard Schwartz, & Bushra Zawaydeh: Design and evaluation of a limited two-way speech translator. Eurospeech 2003 - Interspeech 2003 8th European  Conference on  Speech Communication and Technology, Geneva, Switzerland, September 1-4, 2003; pp.2221-2224; abstract [PDF, 34KB]

(2003) Alicia Tribble & Stephan Vogel: The CMU Arabic-to-English statistical MT system MT Summit IX -- workshop: Machine translation for semitic languages, New Orleans, USA, 23 September 2003 [PDF from PPT, 255KB]

(2003) Alex Waibel, Ahmed Badran, Alan W.Black, Robert Frederking, Donna Gates, Alon Lavie, Lori Levin, Kevin Lenzo, Laura Mayfield Tomokiyo, Juergen Reichert, Tanja Schultz, Dorcas Wallace, Monika Woszczyna, & Jing Zhang: SPEECHALATOR: two-way speech-to-speech translation in your hand HLT-NAACL 2003: conference combining Human Language Technology conference series and the North American Chapter of the Association for Computational Linguistics conference series,  May 27 – June 1,  2003, Edmonton, Canada; Demonstrations, pp.29-30 [PDF, 25KB]

(2003) Alex Waibel, Ahmed Badran, Alan W.Black, Robert Frederking, Donna Gates, Alon Lavie, Lori Levin, Kevin A.Lenzo, Laura Mayfield Tomokiyo, Jurgen Reichert, Tanja Schultz, Dorcas Wallace, Monika Woszczyna, & Jing Zhang: Speechalator: two-way speech-to-speech translation on a consumer PDA. Eurospeech 2003 - Interspeech 2003 8th European  Conference on  Speech Communication and Technology, Geneva, Switzerland, September 1-4, 2003; pp.369-372; abstract [PDF, 34KB]

(2003) The 2003 NIST machine translation evaluation plan (MT-03). [NIST, 2003]; 3pp. [PDF, 173KB]

(2002) Yaser Al-Onaizan & Kevin Knight: Named entity translation. HLT 2002: Human Language Technology Conference: proceedings of the second international conference on human language technology research, March 24-27, 2002, San Diego, California; ed. Mitchell Marcus [San Francisco, CA: Morgan Kaufmann for DARPA]; pp. 122-124. Extended abstract. [PDF, 148KB]

(2002) Yaser Al-Onaizan & Kevin Knight: Translating named entities using monolingual and bilingual resources; ACL-2002: 40th Annual meeting of the Association for Computational , July 2002, Philadelphia, USA; pp.400-408 [PDF, 186KB]

(2002) Alon Lavie, Stephan Vogel, Alex Waibel, Ulrich Germann, Kevin Knight, Daniel Marcu, Young-Suk Lee, Kishore Papineni, Salim Roukos, Franz Josef Och, Moussa Bamba, Chris Cieri, Shudong Huang, Florence Reeder, George Doddington: DARPA TIDES MT group meeting, Marina del Rey, Jan 25, 2002; 9pp. [PDF of PPT, 540KB]

(2002) Scott Shepard, Sean Colbath, Kathleen Egan, & Francis Kubala: Oasis Translator's Aide. HLT 2002: Human Language Technology Conference: proceedings of the second international conference on human language technology research, March 24-27, 2002, San Diego, California; ed. Mitchell Marcus [San Francisco, CA: Morgan Kaufmann for DARPA]; pp. 389-390. [PDF, 122KB]

Arabic to French

(2011) Souhir Gahbiche-Braham, Hélène Bonneau-Maynard, & François Yvon: Two ways to use a noisy parallel news corpus for improving statistical machine translation. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.44-51. [PDF, 176KB]

(2010) Saab Mansour: MorphTagger: HMM-based Arabic segmentation for statistical machine translation. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.321-327. [PDF, 369KB]

(2010) Ines Turki Khemakhem, Salma Jamoussi, & Abdelmajid Ben Hamadou: Arabic morpho-syntactic feature disambiguation in a translation context. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28  August 2010; pp.61-65. [PDF, 153KB]

(2010) Nasredine Semmar & Laib Meriama: Using a hybrid word alignment approach for automatic construction and updating of Arabic to French lexicons. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, Valletta, Malta, 17 May 2010; pp. 114-119. [PDF, 309KB]

(2009) Holger Schwenk & Jean Senellart: Translation model adaptation for an Arabic/French news translation system by lightly-supervised training. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.38-315. [PDF, 134KB]

(2008) Saša Hasan & Hermann Ney: A multi-genre SMT system for Arabic to French. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 103KB]

(2008) NIST: MetricsMATR challenge. [NIST, 2008]. 6pp. [PDF, 35KB]

(2007) Nasredine Semmar & Christian Fluhr: Arabic to French sentence alignment: exploration of a cross-language information retrieval approach. ACL 2007: proceedings of the Workshop on Computational Approaches to Semitic Languages: Common Issues and Resources, Prague, Czech Republic, 28 June 2007; pp. 73-80 [PDF, 156KB]

(2006)  Saša Hasan, Anas El Isbihani  & Hermann Ney: Creating a large-scale Arabic to French statistical machine translation system. LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.855-858 [PDF, 312KB]

(2002) Mathieu Guidère: Towards a corpus-based machine translation for standard Arabic. Translation Journal 6 (1), January 2002; 12pp. [PDF, 173KB]

Arabic to Hebrew

(2011) Reshef Shilon, Nizar Habash, Alon Lavie, & Shuly Wintner: Machine translation between Hebrew and Arabic: needs, challenges and preliminary solutions. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 23 January, 2011; 1p. [PDF, 54KB]

(2010) Reshef Shilon, Nizar Habash, Alon Lavie, & Shuly Wintner: Machine translation between Hebrew and Arabic: needs, challenges and preliminary solutions. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 141KB]

Arabic to Italian

  (2011) Mauro Cettolo, Nicola Bertoldi, & Marcello Federico: Bootstrapping Arabic-Italian SMT through comparable texts and pivot translation. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.249-256. [PDF, 354KB]; presentation, 12 slides [PDF]

Arabic to Japanese

(2008) Pierrette Bouillon, Sonia Halimi, Yukie Nakao, Kyoko Kanzaki, Hitoshi Isahara, Nikos Tsourakis, Marianne Starlander, Beth Ann Hockey, & Manny Rayner: Developing non-European translation pairs in a medium-vocabulary medical speech translation system. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 122KB]

Aramaic

(2008) Benjamin Snyder & Regina Barzilay: Unsupervised multilingual learning for morphological segmentation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 737-745. [PDF, 254KB]

Asturian

(2009) David Farwell & Lluís Padró: FreeLing: from a multilingual open-source analyzer suite to an EBMT platform. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.37-43. [PDF, 372KB]; presentation [PDF of PPT, 331KB]

Australian languages

(2007) Simon Zwarts & Mark Dras: Statistical machine translation of Australian aboriginal languages: morphological analysis with languages of differing morphological richness. Australasian Language Technology Workshop 2007 (ALTW2007): Proceedings of the Workshop, 10-11 December 2007, Melbourne Zoo; pp.134-142. [PDF, 297KB]

Azerbaijani

(2009) Rauf Fatullayev, Sevinc Mammadova, & Abulfat Fatullayev: Translating composite sentences in Azerbaijani-English MT system.  ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.104-113 [abstract]

(2008) Rauf Fatullayev, Ali Abbasov, & Abulfat Fatullayev: “DILMANC” is the 1st MT system for Azerbaijani. SLTC 2008: Second Swedish Language Technology Conference, November 20-21, 2008, Stockholm; pp.63-64. [PDF, 267KB]

(2008) Rauf Fatullayev, Ali Abbasov, & Abulfat Fatullayev: Peculiarities of the development of the dictionary for the MT system from Azerbaijani. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.35-40. [PDF, 587KB]

(2008) Rauf Fatullayev, Ali Abbasov, & Abulfat Fatullayev: Set of active suffix chains and its role in development of the MT system for Azerbaijani. Proceedings of the International Multiconference on Computer Science and Information Technology, Wisla, Poland, 20-22 October 2008; pp.363-368. [PDF, 288KB]

Bangla see Bengali

Basque

(2009) proceedings of SALTMIL 2009, “Information retrieval and information extraction for less resourced languages”, Donostia-San Sebastián, September 7 2009. [PDF, 16458KB]

(2008) I.Alegria, X.Arregi, A.Diaz de Ilarraza, G.Labaka, M.Lersundi, A.Mayor, & K.Sarasola: Strategies for sustainable MT for Basque: incremental design, reusability, standardization and open source. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.59-64. [PDF, 74KB]

(2006) Nicolas Stroppa, Declan Groves, Andy Way & Kepa Sarasola: Example-based machine translation of the Basque language. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.232-241 [PDF, 168KB]

(2004) Eneko Agirre & David Martinez: The Basque Country University system: English and Basque tasks. [ACL 2004] Proceedings of SENSEVAL-3 Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, 25-26 July 2004, Barcelona. Spain; 5pp. [PDF, 94KB]

(2002) Eneko Agirre, Mikel Lersundi, & David Martinez: A multi-lingual approach to disambiguate prepositions and case suffixes; ACL-2002 SIGLEX/SENSEVAL workshop on Word Sense Disambiguation "Recent successes and future directions", July 2002, Philadelphia, USA; pp. 1-8 [PDF, 166KB]

(2001) Guillermo Barrutieta, Joseba Abaitua & Josuha Díaz: Gross-grained RST through XML metadata for multilingual document generation. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp.39-42. [PDF, 151KB]

Basque to English

(2010) Xabier Saralegi & Maddalen Lopez de Lacalle: Dictionary and monlingual corpus-based query translation for Basque-English CLIR. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1353-1358. [PDF, 284KB]

(2009) Izaskun Fernandez, Iñaki Alegria, & Nerea Ezeiza: Using Wikipedia for named-entity translation. [SEPLN 2009] SALTMIL 2009, Donostia-San Sebastián, Spain. “Information retrieval and information extraction for less resourced languages”, Donostia-San Sebastián, September 7; pp.27-35. [PDF, 550KB]

Basque to Spanish

(2011) Alicia Pérez, M.Inés Torres, & Francisco Casacuberta: Steps taken in Spanish-Basque speech translation using stochastic finite-state transducers. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 23-27 January, 2011; 2pp. [PDF, 90KB]

(2006) Iñaki Alegria, Nerea Ezeiza, & Izaskun Fernandez: Named entities translation based on comparable corpora. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Workshop on Multi-word expressions in a Multilingual Context, Trento, Italy, April 3, 2006; pp.1-8 [PDF, 455KB]

(2006) A.Casillas, A. Díaz de Illarraza, J.Igartua, R. Martínez, & K. Sarasola: Compilation and structuring of a Spanish-Basque parallel corpus.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.55-58. [PDF, 172KB]

(2004) Francisco Nevado, Francisco Casacuberta, & Josu Landa: Translation memories enrichment by statistical bilingual segmentation. LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.335-338. [PDF, 354KB]

Bengali

(2009) Asif Ekbal & Sivajii Banyopadhyay: Voted NER system using appropriate unlabeled data. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.202-210. [PDF, 98KB]

(2009) Abu Zaher Md. Faridee & Francis M.Tyers: Development of a morphological analyser for Bengali. Proceedings of the First International Workshop on Free/Open-Source Rule-Based Machine Translation, 2-3 November 2009, Universitat d’Alacant, Alacant, Spain; ed. Juan Antonio Pérez-Ortiz, Felipe Sánchez-Martínez, Francis M.Tyers; pp. 43-50. [PDF, 351KB]

(2005) Kuntal Dey & Pushpak Bhattacharyya: Universal Networking Language based analysis and generation for Bengali case structure constructs. In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.215-229 [abstract, PDF, 88KB]

Bengali to English

(2009) Gholamreza Haffari, Maxim Roy, & Anoop Sarkar: Active learning for statistical phrase-based machine translation. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.415-423. [PDF, 190KB]

(2008) Sivaji Bandyopadhyay, Tapabrata Mondal, Sudip Kumar Naskar, Asif Ekbal, Rejwanul Haque, & Srinivasa Rao Godavarthy: Bengali, Hindi and Telegu to English ad-hoc bilingual task. IJCNLP 2008: 2nd International Workshop on Cross-Lingual Information Access (CLIA) Proceedings of the workshop, 11 January 2008, Hyderabad, India; abstract, p.66. [PDF, 12KB]

(2008) Debasis Mandal, Sandipan Dandapat, Mayank Gupta, Pratyush Banerjee, & Sudeshna Sarkar: Bengali and Hindi to English CLIR evaluation. IJCNLP 2008: 2nd International Workshop on Cross-Lingual Information Access (CLIA) Proceedings of the workshop, 11 January 2008, Hyderabad, India; abstract, p.65. [PDF, 12KB]

(2006) Asif Ekbal, Sudip Kumar Naskar, & Sivaji Bandyopadhyay: A modified joint source-channel model for transliteration. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.191-198. [PDF, 159KB]

Berber language see Amazigh

Brazilian Portuguese

(2011) Luís Marujo, Nuno Grazina, Tiago Luís, Wang Ling, Luísa Coheur, & Isabel Trancoso: BP2EP – adaptation of Brazilian Portuguese texts to European Portuguese. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.129-136. [PDF, 352KB]

Brazilian Portuguese to English (see also Portuguese to English)

(2011) Daniel Emilio Beck: Syntax-based statistical machine translation using tree automata and tree transducers. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Student Session, Portland, Oregon, June 19-24, 2011; pp.36-40. [PDF, 124KB]

(2010) Helena de Medeiros Caseli, Bruno Akio Sugiyama, & Junia Coutinho Anacleto: Using common sense to generate culturally contextualized machine translation. Proceedings of the NAACL HLT 2010 Young Investigators Workshop on Computational Approaches to Languages of the Americas, Los Angeles, CA, June 2010; pp.24-31. [PDF, 353KB]

(2010) Lucia Silva: Fine-tuning in Brazilian Portuguese-English statistical transfer machine translation: verbal tenses.  NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Student Research Workshop… June 2, 2010, Los Angeles, California; pp.58-63. [PDF, 251KB]

(2008) Helena M. Caseli, Maria das Graças V.Nunes, & Mikel L. Forcada: From free shallow monolingual resources to machine translation systems: easing the task. MATMT 2008: Mixing Approaches to Machine Translation, Donostia-San Sebastian [Spain], February 14th 2008: Proceedings; pp. 41-48. [PDF, 454KB]

(2006) Helena M.Caseli, Maria das Graças V.Nunes, & Mikel L.Forcada: Automatic induction of bilingual resources from aligned parallel corpora: application to shallow-transfer machine translation [abstract]. Machine Translation 20 (4),2006; pp.227-245.

Brazilian Portuguese to Spanish

(2008) Helena M. Caseli, Maria das Graças V.Nunes, & Mikel L. Forcada: From free shallow monolingual resources to machine translation systems: easing the task. MATMT 2008: Mixing Approaches to Machine Translation, Donostia-San Sebastian [Spain], February 14th 2008: Proceedings; pp. 41-48. [PDF, 454KB]

Breton to French

(2010) Francis M.Tyers: Rule-based Breton to French machine translation. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 553KB]

(2009) Francis M.Tyers: Rule-based augmentation of training data in Breton-French statistical machine translation. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.213-217. [PDF , 227KB]

British Sign Language

(2003) Ian Marshall & Éva Sáfár: 'A prototype text to British Sign Language (BSL) translation system' ACL-2003: 41st Annual meeting of the Association for Computational Linguistics, July 7-12, 2003, Sapporo, Japan. [PDF, 204KB]

Bulgarian

(2011) Svetla Koeva: Furthering natural language processing in Bulgaria. META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 40 slides [PDF of PPT, 3276KB]

(2006) Svetla Koeva, Svetlozara Lesseva, & Maria Todorova: Bulgarian sense tagged corpus.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.79-86. [PDF, 494KB]

(2006) Ted Pedersen, Anagha Kulkarni, Roxana Angheluta, Zornitsa Kozareva, & Thamar Solorio: Improving name discrimination: a language salad approach. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Cross-Language Knowledge Induction Workshop, Trento, Italy, April 3, 2006; pp.25-32 [PDF, 268 KB]

(2001) Anthony Hartley, Donia Scott, John Bateman & Danail Dochev: AGILE - a system for multilingual generation of technical instructions. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp. 145-150. [PDF, 384KB]

(2000) Hristo Tanev & Ruslan Mitkov: LINGUA – a robust architecture for text processing and anaphora resolution in Bulgarian. MT2000: machine translation and multilingual applications in the new millennium: international conference at the University of Exeter, 20-22 November 2000, organised by the British Computer Society. [London: BCS]; 8pp. [PDF, 1780KB]

Bulgarian to Macedonian

(2011) Tihomir Rangelov: Rule-based machine translation between Bulgarian and Macedonian. Proceedings of the Second International Workshop on Free/Open-Source Rule-Based Machine Translation, Barcelona, Spain, January 20-21, 2011, ed. F.Sánchez-Martínez and J.A.Pérez-Ortiz; pp.53-59. [PDF, 164KB]

Catalan

(2010) Guillem Massó & Toni Badia: Dealing with sign language morphemes in statistical machine translation. [LREC 2010] 4th Workshop on the Representation and Processing of Sign Languages: Corpora and Sign Language  Technologies, Malta, May 2010; pp.154-157. [PDF, 543KB]

(2009) David Farwell & Lluís Padró: FreeLing: from a multilingual open-source analyzer suite to an EBMT platform. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.37-43. [PDF, 372KB]; presentation [PDF of PPT, 331KB]

(2007) Lluís Màrquez, Luis Villarejo, M.A.Martí, & Mariona Taulé: SemEval-2007 task 09: multilevel semantic annotation of Catalan and Spanish. ACL 2007: proceedings of the 4th International  Workshop on Semantic Evaluations (SemEval-2007), Prague, Czech Republic, 23-24 June 2007; pp.42-47 [PDF, 89KB]

(2007) Roser Morante & Bertjan Busser: ILK2: semantic role labelling for Catalan and Spanish using TiMBL. ACL 2007: proceedings of the 4th International  Workshop on Semantic Evaluations (SemEval-2007), Prague, Czech Republic, 23-24 June 2007; pp.183-186 [PDF, 120KB]

(2004) Victoria Arranz, Núria Castell, Josep Maria Crego, Jesús Giménez, Adrià de Gispert, & Patrick Lambert: Bilingual connections for trilingual corpora: an XML approach.  LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.1459-1462. [PDF, 674KB]

(2003) Xavier Carreras, Luís Màrquez & Luís Padró: Named entity recognition for Catalan using Spanish resources. EACL 2003: 10th Conference of the European Chapter of the Association for Computational Linguistics, April 12-17, 2003, Budapest, Hungary. Proceedings; pp.43-50 [PDF, 431KB]

(2003) David Conejero, Jesus Gimenez, Victoria Arranz, Antonio Bonafonte, Neus Pascual, Nuria Castell, & Asunción Moreno: Lexica and corpora for speech-to-speech translation: a trilingual approach. Eurospeech 2003 - Interspeech 2003 8th European  Conference on  Speech Communication and Technology, Geneva, Switzerland, September 1-4, 2003; pp.1593-1596; abstract [PDF, 34KB]

(2003) Lluís Màrquez, Adrià de Gispert, Xavier Carreras, & Lluís Padró: Low-cost named entity classification for Catalan: exploiting multilingual resources and unlabeled data.  ACL 2003 Workshop on Multilingual and Mixed-language Named Entity Recognition, July 12, 2003, Sapporo, Japan; 8pp. [PDF, 80KB]

Catalan to English

(2006) Adrià de Gispert & José B. Mariño: Catalan-English statistical machine translation without parallel corpus: bridging through Spanish.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.65-68. [PDF, 64KB]

(2006) Maite Melero, Antoni Oliver, & Toni Badia: Automatic multilingual subtitling in the eTITLE project. Translating and the Computer 28: proceedings of the Twenty-eighth International Conference on Translating and the Computer, 16-17 November 2006, London. (London: Aslib, 2006); 18pp. [PDF, 149KB]

(2006) Joaquim Moré & Salvador Climent: A cheap MT-evaluation method based on Internet searches. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.19-26 [PDF, 260KB]

(2005) Victoria Arranz, Elisabet Comelles, & David Farwell: The FAME speech-to-speech translation system for Catalan, English, and Spanish. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.195-202. [PDF, 185KB]

(2005) Maja Popovic & Hermann Ney: Exploiting phrasal lexica and additional morpho-syntactic language resources for statistical machine translation with scarce training data. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 212-218. [PDF, 65KB]

(2004) Maja Popović & Hermann Ney: Towards the use of word stems and suffixes for statistical machine translation.  LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.1585-1588. [PDF, 534KB]

Catalan to Greek

(2009) Marcel Puig Portella & Eleni Papadopoulou: Treatment of the imperative forms in the machine translation between Catalan, Spanish and Greek.  ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.198-202 [abstract]

Catalan to Occitan

(2006) Carme Armentano i Oller & Mikel L. Forcada: Open-source machine translation between small languages: Catalan and Aranese Occitan.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.51-54. [PDF, 62KB]

Catalan to Spanish

(2011) Carlos A.Henríquez Q., José B.Mariño, & Rafael E.Banchs: Deriving translation units using small additional corpora. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.121-128. [PDF, 438KB]

 (2011) Xavier Ivars-Ribes & Victor M.Sánchez-Cartagena: A widely used machine translation service and its migration to a free/open-source solution: the case of Softcatalà. Proceedings of the Second International Workshop on Free/Open-Source Rule-Based Machine Translation, Barcelona, Spain, January 20-21, 2011, ed. F.Sánchez-Martínez and J.A.Pérez-Ortiz; pp.61-68. [PDF, 385KB]

(2010) Marta R.Costa-jussà, Mireia Farrús, José B.Mariño, & José A.R.Fonollosa: Automatic and human evaluation study of a rule-based and a statistical Catalan-Spanish machine translation systems. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1706-1711. [PDF, 335KB]

(2010) Mireia Farrús, Marta R.Costa-jussà, José B.Mariño, & José A.R.Fonollosa: Linguistic-based evaluation criteria to identift statistical machine translation errors. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 7pp. [PDF, 606KB]

(2009) Mireia Farrús, Marta R.Costa-jussà, Marc Poch, Adolfo Hernández, & José B.Mariño: Improving a Catalan-Spanish statistical translation system using morphosyntactic knowledge. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.52-57. [PDF, 399KB]

(2009) Marcel Puig Portella & Eleni Papadopoulou: Treatment of the imperative forms in the machine translation between Catalan, Spanish and Greek.  ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.198-202 [abstract]

(2009) Felipe Sánchez-Martínez & Mikel L.Forcada: Inferring shallow-transfer machine translation rules from small parallel corpora. Journal of Artificial Intelligence Research 34, pp. 605-635. [PDF, 706KB]

(2009) Luis Villarejo Muñoz, Sergio Ortiz Rojas, & Mireia Ginestí Rosell: Joint efforts to further develop and incorporate Apertium into the document management flow at Universitat Oberta de Catalunya. Proceedings of the First International Workshop on Free/Open-Source Rule-Based Machine Translation, 2-3 November 2009, Universitat d’Alacant, Alacant, Spain; ed. Juan Antonio Pérez-Ortíz, Felipe Sánchez-Martínez, Francis M.Tyers; pp. 75-82. [PDF, 270KB]

(2007) Joaquim Moré López & Salvador Climent Roca: A cheap MT evaluation method based on the notion of machine translationness.  METIS-II Workshop: New Approaches to Machine Translation, Centre for Computational Linguistics, Katholieke Universiteit Leuven, Belgium, 11 January 2007; 8pp. [PDF, 285KB]

(2007) David Vilar, Jan-T. Peter, & Hermann Ney: Can we translate letters?  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 33-39 [PDF, 124KB]

(2006) Rafael Banchs, Antonio Bonafonte, & Javier Pérez: Acceptance testing of a spoken language translation system. LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.2074-2079 [PDF, 262KB]

(2006) Maite Melero, Antoni Oliver, & Toni Badia: Automatic multilingual subtitling in the eTITLE project. Translating and the Computer 28: proceedings of the Twenty-eighth International Conference on Translating and the Computer, 16-17 November 2006, London. (London: Aslib, 2006); 18pp. [PDF, 149KB]

(2006) Joaquim Moré & Salvador Climent: A cheap MT-evaluation method based on Internet searches. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.19-26 [PDF, 260KB]

(2005) Victoria Arranz, Elisabet Comelles, & David Farwell: The FAME speech-to-speech translation system for Catalan, English, and Spanish. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.195-202. [PDF, 185KB]

(2005) Juan Alberto Alonso: Machine translation for Catalan<->Spanish: the real case for productive MT. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 23-26. [PDF, 147KB]

(2004) Felipe Sánchez-Martínez, Juan Antonio Pérez-Ortiz & Mikel L. Forcada: Cooperative unsupervised training of the part-of-speech taggers in a bidirectional machine translation system; TMI-2004: proceedings of the Tenth Conference on Theoretical and Methodological Issues in Machine Translation, October 4-6, 2004, Baltimore, Maryland, USA; pp.135-144. [PDF, 92KB]

(2004) Victoria Arranz, Elisabet Comelles, David Farwell, Climent Nadeu, Jaume Padrell, Albert Febrer, Dorcas Alexander, & Kay Peterson: A speech-to-speech translation system for Catalan, Spanish, and English. Machine translation: from real users to research: 6th conference of the Association for Machine Translation in the Americas, AMTA 2004, Washington, DC, September 28 – October 2, 2004; ed. Robert E.Frederking and Kathryn B.Taylor (Berlin: Springer Verlag, 2004); pp. 7-16. [go to publisher details]

(2003) Salvador Climent, Joaquim Moré & Antoni Oliver: Building an environment for unsupervised automatic email translation.  Controlled language translation, EAMT-CLAW-03, Dublin City University, 15-17 May 2003 [PDF, 274KB]

(2002) Toni Badia, Gemma Boleda, Carme Colominas, Agnès González, Mireia Garmendia, Martí Quixal: BancTrad: a web interface for integrated access to parallel annotated corpora. LREC-2002: Third International Conference on Language Resources and Evaluation. Workshop: Language resources for translation work and research, Las Palmas Canary Islands, 27 May 2002; pp.15-19. [PDF, 106KB]

(2001) R. Canals-Marote, A. Esteve-Guillén, A. Garrido-Alenda, M.I. Guardiola-Savall, A. Iturraspe-Bellver, S. Montserrat-Buendia, S. Ortiz-Rojas, H. Pastor-Pina, P.M. Pérez-Antón & M.L. Forcada: The Spanish<>Catalan machine translation system interNOSTRUM. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp. 73-76. [PDF, 196KB]

(2001) Alberto Sanchis, David Picó, Joan Miquel del Val, Ferran Fabregat, Jesús Tomás, Moisés Pastor, Francisco Casacuberta & Enrique Vidal: A morphological analyser for machine translation based on finite-state transducers. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp.305-309. [PDF, 48KB]

Cebuano

(2003) Douglas W.Oard, David Doermann, Bonnie Dorr, Daqing He, Philip Resnik, Amy Weinberg, William Byrne, Sanjeev Khudanpur, David Yarowsky, Anton Leuski, Philipp Koehn, & Kevin Knight: Desparately seeking Cebuano. HLT-NAACL 2003: conference combining Human Language Technology conference series and the North American Chapter of the Association for Computational Linguistics conference series,  May 27 – June 1,  2003, Edmonton, Canada; 3pp. [PDF, 21KB]

Celtic languages [see also Breton, Gaelic, Welsh]

(2006) Delyth Prys: The BLARK matrix and its relation to the language resources situation for the Celtic languages.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.31-32. [PDF, 37KB]

Chechen to English

(2006) Ahmed Abdelali, James Cowie, Steve Helmreich, Wanying Jin, Maria Pilar Milagros, Bill Ogden, Hamid Mansouri Rad & Ron Zacharski: Guarani: a case study in resource development for quick ramp-up MT. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.1- 9 [PDF, 344KB]

Chinese

(2011) Wenjing Lang, Qiaoli Zhou, Guiping Zhang, & Dongfeng Cai: Multi-stage Chinese dependency parsing based on dependency direction. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.64-71. [PDF, 128KB]

(2010) Huang Heyan: Researches on MT and language information processing of CCLIE & BIT. Translingual Europe 2010, Hotel Maritim, Berlin, Germany, Monday June 7th 2010; 37pp. [PDF, 7229KB]

(2010) Yanli Sun, Sharon O’Brien, Minako O’Hagan, & Fred Hollowood: A novel statistical pre-processing model for rule-based machine translation system.  EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 790KB]; presentation: 28 slides [PDF, 580KB]

(2010) Yiou Wang, Kiyotaka Uchimoto, Jun’ichi Kazama, Canasai Kruengkrai, & Kentaro Torisawa: Adapting Chinese word segmentation for machine translation based on short units.  LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1758-1764. [PDF, 409KB]

(2009) Honglei Guo, Huijia Zhu, Zhili Guo, Xiaoxun Zhang, Xian Wu & Zhong Su: Domain adaptation with latent semantic association for named entity recognition. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.281-289. [PDF, 203KB]

(2009) Chiori Hori, Sakriani Sakti, Michael Paul, Noriyuki Kimura, Yutaka Ashikari, Ryosuke Isotani, Eiichiro Sumita, & Satoshi Nakamura: Network-based speech-to-speech translation. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; p. 168. [PDF, 286KB]; presentation [PDF of PPT, 485KB]

(2009) Liang Huang, Wenbin Jiang & Qun Liu: Bilingually-constrained (monolingual) shift-reduce parsing. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1222-1231. [PDF, 169KB]

(2009) Tsai Yi-Jung: Research into the practicality of machine translation of administrative documents.  ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.256-259 [abstract]

(2008) Stephan Busemann & Yajing Zhang: Identifying foreign person names in Chinese text. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 255KB]

(2008) Hailong Cao, Masao Utiyama & Eiichiro Sumita: Chinese word segmentation: adaptation for statistical machine translation. CWMT’2008: the 4th China Workshop on Machine Translation, November 27-28, 2008, Beijing; 15 slides [PDF of PPT, 128KB]

(2008) Pi-Chuan Chang, Michel Galley, & Christopher D. Manning: Optimizing Chinese word segmentation for machine translation performance. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.224-232. [PDF, 165KB]

(2008) Yuqing Guo, Josef van Genabith, & Haifeng Wang: Dependency-based n-gram models for general purpose sentence realisation. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.297-304. [PDF, 138KB]

(2008) Long Jiang & Ming Zhou: Generating Chinese couplets using a statistical MT approach. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.377-384. [PDF, 284KB]

(2008) Jia Xu, Jianfeng Gao, Kristina Toutanova, & Hermann Ney: Bayesian semi-supervised Chinese word segmentation for statistical machine translation. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.1017-1024. [PDF, 137KB

(2008) Ruiqiang Zhang, Keiji Yasuda, & Eiichiro Sumita: Improved statistical machine translation by multiple Chinese word segmentation. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.216-223. [PDF, 250KB]

(2007) Jing-Shin Chang & Chun-Kai Kung: A Chinese-to-Chinese statistical machine translation model for mining synonymous simplified-traditional Chinese terms. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.81-87 [PDF, 314KB]

(2007) Chu-Ren Huang, Petr Šimon, & Shu-Kai Hsieh: Automatic discovery of named entity variants – grammar-driven approaches to non-alphabetical transliterations. ACL 2007: proceedings of demo and poster sessions, Prague, Czech Republic, June 2007; pp. 153-156 [PDF, 246KB]

(2007) Haizhou Li, Khe Chai Sim, Jin-Shea Kuo, & Minghui Dong: Semantic transliteration of personal names. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 120-127 [PDF, 152KB]

(2006) A.Bonafonte, H.Höge, I.Kiss, A.Moreno, U.Ziegenhain, H.van den Heuvel, H.-U.Hain, X.S.Wang, M.N.Garcia: TC-STAR: specifications of language resources and evaluation for speech synthesis. LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.311-314 [PDF, 272KB]

(2006) Simon Corston-Oliver, Anthony Aue, Kevin Duh & Eric Ringger: Multilingual dependency parsing using Bayes point machines.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 160-167 [PDF, 178KB]

(2006) Jianfeng Gao, Mu Li, Andi Wu, & Chang-Ning Huang: Chinese word segmentation and named entity recognition: a pragmatic approach. Computational Linguistics 31 (4), pp. 531-574. [PDF, 1638KB]

(2006) Qing Li, Sung-Hyon Myaeng, Yun Jin, & Be-yeong Kang: Concept unification of terms in different languages for IR. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.641-648. [PDF, 234KB]

(2006) Djamal Mostefa, Olivier Hamon, & Khalid Choukri: Evaluation of automatic speech recognition and speech language translation within TC-STAR: results from the first evaluation campaign.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.149-154 [PDF, 388KB]

(2006) Dragos Stefan Munteanu & Daniel Marcu: Improving machine translation performance by exploiting non-parallel corpora. Computational Linguistics 31 (4), pp. 477-504 [PDF, 1060KB]

(2006) Scott S.L.Piao, Guangfan Sun, Paul Rayson, & Qi Yuan: Automatic extraction of Chinese multiword expressions with a statistical tool. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Workshop on Multi-word expressions in a Multilingual Context, Trento, Italy, April 3, 2006; pp.17-24 [PDF, 396KB]

(2005) Huang Heyan: Research & development of multi-lingual machine translation and applications. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit: invited paper [PDF of PPT presentation, 550KB]

(2005) Liu Qun, Hou Hongxu, Lin Shouxun, Qian Yueliang, Zhang Yujie, & Isahara Hitoshi: Introduction to China’s HTRDP machine translation evaluation. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit: invited paper; pp.i-18-22 [PDF, 310KB], also PDF of PPT presentation [502KB]

(2005) Xiadong Shi & Yidong Chen: A UNL deconverter for Chinese. In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.167-174 [abstract, PDF, 13KB]

(2005) Jia Xu, Evgeny Matusov, Richard Zens, & Hermann Ney: Integrated Chinese word segmentation in statistical machine translation. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2005], 24-25 October, 2005, Pittsburgh, PA, USA; 7pp. [PDF, 105KB]

(2004) Pu-Jen Cheng, Yi-Cheng Pan, Wen-Hsiang Lu, & Lee-Feng Chien: Creating multilingual translation lexicons with regional variations using web corpora.  ACL 2004: 42nd annual meeting of the Association for Computational Linguistics: Proceedings of the conference, 21-26 July 2004, Barcelona, Spain; pp. 534-541. [PDF, 283KB]

(2004) Key-Sun Choi, Hee-Sook Bae, Wonseok Kang, Juho Lee, Eunhe Kim, Hekyeong Kim, Donghee Kim, Youngbin Song, & Hyosik Shin: Korean-Chinese-Japanese multilingual wordnet with shared semantic hierarchy.  LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.1131-1134. [PDF, 518KB]

(2003) Kuniko Saito & Masaaki Nagata: Multi-language named-entity recognition system based on HMM. ACL 2003 Workshop on Multilingual and Mixed-language Named Entity Recognition, July 12, 2003, Sapporo, Japan; 8pp. [PDF, 353KB]

(2003) Nianwen Xue & Seth Kulick: Automatic predicate argument structure analysis of the Penn Chinese Treebank MT Summit IX, New Orleans, USA, 23-27 September 2003; pp.418-425. [PDF,53KB]

(2002)  Aman Chiu & Björn Jernudd: Chinese IT terminology management in Hong Kong.  In: Chan Sin-wai (ed.) Translation and Information Technology (Hong Kong: Chinese University Press, 2002); pp.92-114.

(2002) K.L.Kwok: NTCIR-3 Chinese, cross language retrieval experiments using PIRCS. NTCIR Workshop 3: Proceedings of the Third NTCIR Workshop on Research in Information Retrieval, Automatic Text Summarization and Question Answering, October 8-10, 2002, Tokyo, Japan; 7pp. [PDF, 192KB]

(2001) Takako Aikawa, Maite Melero, Lee Schwartz & Andi Wu : Generation for multilingual MT. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp. 9-14. [PDF, 210KB]

(2001) K.L.Kwok: NTCIR-2 Chinese, cross language retrieval experiment using PIRCS. NTCIR Workshop 2: Proceedings of the Second NTCIR Workshop on Research in Chinese & Japanese Text retrieval and Text Summarization, March 7-9, 2001, Tokyo, Japan; 8pp. [PDF, 202KB]

(2001) Wong Fai, Mao Yuhang, Dong QingFu, & Qi Yihong: Automatic translation: overcome barriers between European and Chinese languages. Proceedings  of First International UNL Open Conference. SuZhou China, 2001. 8pp. [PDF, 311KB]

(2000) Lew R. Micklesen: Erwin Reifler and machine translation at the University of Washington. In: W.John Hutchins (ed.) Early years in machine translation: memoirs and biographies of pioneers (Amsterdam/Philadelphia: John Benjamins Publishing Company, 2000); pp.21-38.

(2000) Hua Wu, Taiyi Huang, Chengqing Zong,& Bo Xu: Chinese generation in a spoken dialogue translation system Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July -4 August 2000; pp. 1141-1145 [PDF,.398KB]

Chinese to English

(2011) Hala Almaghout, Jie Jiang, & Andy Way: CCG contextual labels in hierarchical phrase-based SMT. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.281-288. [PDF, 317KB]; presentation, 25 slides [PDF, 964KB]

(2011) Elizabeth Baran & Nianwen Xue: Singular or plural? Exploiting parallel corpora for Chinese number prediction. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.207-214. [PDF, 187KB]

(2011) Yufeng Chen & Chengqing Zong: A semantic-specific model for Chinese named entity translation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.138-146. [PDF, 282KB]

(2011) John DeNero & Klaus Macherey: Model-based aligner combination using dual decomposition. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.420-429. [PDF, 315KB]

    (2011) Nan Duan, Mu Li, & Ming Zhou: Hypothesis mixture decoding for statistical machine translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1258-1267. [PDF, 1027KB]

(2011) Nan Duan, Mu Li, Ming Zhou, & Lei Cui: Improving phrase extraction via MBR phrase scoring and pruning. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.189-197. [PDF, 302KB]

(2011) Chris Dyer, Jonathan Clark, Alon Lavie, & Noah A.Smith: Unsupervised word alignment with arbitary features. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.201-210. [PDF, 303KB]

(2011) Qin Gao & Stephan Vogel: Corpus expansion for statistical machine translation with semantic role label substitution rules. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.294-298. [PDF, 151KB]

 (2011) Qin Gao & Stephan Vogel: Utilizing target-side semantic role labels to assist hierarchical phrase-based machine translation. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.107-115. [PDF, 352KB]

(2011) Kevin Gimpel & Noah A.Smith: Quasi-synchronous phrase dependency grammars for machine translation. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.474-485. [PDF, 360KB]

(2011) Zhengxian Gong, Min Zhang, & Guodong Zhou: Cache-based document-level statistical machine translation. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.909-919. [PDF, 330KB]

 (2011) Zhengxian Gong, Guodong Zhou, & Liangyou Li: Improve SMT with source-side “topic-document” distributions. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.496-501. [PDF, 297KB]

 (2011) Greg Hanneman & Alon Lavie: Automatic category label coarsening for syntax-based machine translation. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.98-106. [PDF, 264KB]

(2011) Gonzalo Iglesias, Cyril Allauzen, William Byrne, Adrià de Gispert, & Michael Riley: Hierarchical phrase-based translation representations. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.1373-1383. [PDF, 334KB]

(2011) Ea-Ee Jan, Niyu Ge, Shih-Hsiang Lin, & Berlin Chen: An effective and robust framework for transliteration exploration. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.1332-1340. [PDF, 204KB]

(2011) Hongfei Jiang: Expanding the applications of MT in patent translation. Invited talk at [MT Summit XIII] 4th Workshop on Patent Translation, Shoichi Yokoyama (ed,), Xiamen, China, September 23, 2011; pp.17-36. [PDF, 7898KB]

(2011) Li Zezhong, Hideto Ikeda, & Nguyen Thanh Hung: Exploiting multi-features for word alignment in patent MT. [MT Summit XIII] 4th Workshop on Patent Translation, Shoichi Yokoyama (ed,), Xiamen, China, September 23, 2011; pp.86-90. [PDF, 238KB]

(2011) Xianhua Li, Yajuan Lü, Yao Meng, Qun Liu, & Hao Yu: Feedback selecting of manually acquired rules using automatic evaluation. [MT Summit XIII] 4th Workshop on Patent Translation, Shoichi Yokoyama (ed,), Xiamen, China, September 23, 2011; pp.52-59. [PDF, 185KB]

(2011) Zhifei Li, Jason Eisner, Ziyuan Wang, Sanjeev Khudanpur, & Brian Roark: Minimum imputed risk: unsupervised discriminative training for machine translation. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.920-929. [PDF, 236KB]

 (2011) Wang Ling, João Graça, David Martins de Matos, Isabel Trancoso, & Alan Black: Discriminative phrase-based lexicalized reordering models using weighted reordering graphs. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.47-55. [PDF, 275KB]

(2011) Wang Ling, Tiago Luís, João Graça, Luísa Coheur, & Isabel Trancoso: Reordering modeling using weighted alignment matrices. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.450-454. [PDF, 121KB]

 (2011) Feifan Liu, Fei Liu, & Yang Liu: Learning from Chinese-English parallel data for Chinese tense prediction. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.1116-1124. [PDF, 556KB]

(2011) Lemao Liu, Tiejun Zhao, Chao Wang, & Hailong Cao: A unified and discriminative soft syntactic constraint model for hierarchical phrase-based translation.  MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.253-260. [PDF, 197KB]

(2011) Shujie Liu, Chi-Ho Li, & Ming Zhou: A unified SMT framework combining MIRA and MERT. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.181-188. [PDF, 233KB]

(2011) Shujie Liu, Chi-Ho Li & Ming Zhou: Statistic machine translation boosted with spurious word deletion. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.72-79. [PDF, 239KB]

(2011) Yang Liu, Qun Liu, & Yajuan Lü: Adjoining tree-to-string translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1278-1287. [PDF, 159KB]

(2011) Yupeng Liu, Tiejun Zhao, & Sheng Li: Hypergraph training and decoding of system combination in SMT. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.570-577 [PDF, 235KB]

(2011) Zhanyi Liu, Haifeng Wang, Hua Wu, Ting Liu, & Sheng Li: Reordering with source language collocations. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1036-1044. [PDF, 462KB]

(2011) Chi-kiu Lo & Dekai Wu: A radically simple, effective annotation and alignment methodology for semantic frame based SMT and MT evaluation. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; pp.58-65. [PDF, 898KB]

 (2011) Bin Lu, Ka Po Chow, & Benjamin K.Tsou: The cultivation of a Chinese-English-Japanese trilingual parallel corpus from comparable patents. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.472-479. [PDF, 207KB]

(2011) Jeff Ma & Spyros Matsoukas: Building a statistical machine translation system for translating patent documents. [MT Summit XIII] 4th Workshop on Patent Translation, Shoichi Yokoyama (ed,), Xiamen, China, September 23, 2011; pp.79-85. [PDF, 145KB]

 (2011) Wei-Yun Ma & Kathleen McKeown: System combination for machine translation based on text-to-text generation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.546-553. [PDF,

(2011) Christof Monz: Statistical machine translation with local language models. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.869-879. [PDF, 407KB]

 (2011) Hwidong Na & Jong-Hyeok Lee: Multi-word unit dependency forest-based translation rule extraction. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.41-51. [PDF, 466KB]

(2011) Emmanuel Prochasson & Pascale Fung: Rare word translation extraction from aligned comparable documents. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1327-1335. [PDF, 157KB]

(2011) Jason Riesa, Ann Irvine, & Daniel Marcu: Feature-rich language-independent syntax-based alignment for statistical machine translation. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.497-507. [PDF, 1053KB]

(2011) Linfeng Song, Haitao Mi, Yajuan Lü, & Qun Liu: Bagging-based system combination for domain adaptation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.293-298. [PDF,

(2011) Jun Suzuki, Kevin Duh, & Masaaki Nagata: Distributed minimum error rate training of SMT using particle swarm optimization. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.649-657. [PDF, 1101KB]

(2011) Liang Tian, Fai Wong, & Sam Chao: Word alignment using GIZA++ on Windows. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.369-372. [PDF, 192KB]

(2011) Zhaopeng Tu, Yang Liu, Qun Liu, & Shouxun Lin: Extracting hierarchical rules from a weighted alignment matrix. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.1294-1303. [PDF, 238KB]

(2011) Ashish Vaswani, Haitao Mi, Liang Huang & David Chiang: Rule Markov models for fast tree-to-string translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.856-864. [PDF, 122KB]

(2011) Wei Wei & Bo Xu: Effective use of discontinuous phrases for hierarchical phrase-based translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.397-404. [PDF, 148KB]

(2011) Shumin Wu & Martha Palmer: Semantic mapping using automatic word alignment and semantic role labeling. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.21-30. [PDF, 200KB]

(2011) Ning Xi, Guangchao Tang, Boyuan Li, & Yinggong Zhao: Word alignment combination over multiple word segmentation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Student Session, Portland, Oregon, June 19-24, 2011; pp.1-5. [PDF, 329KB]

(2011) Bing Xiang, Niyu Ge, & Abraham Ittycheriah: Improving reordering for statistical machine translation with smoothed priors and syntactic features. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.61-69. [PDF, 200KB]

(2011) Tong Xiao, Jingbo Zhu, Shujie Yao, & Hao Zhang: Document-level consistency verification in machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.131-138. [PDF, 126KB]

(2011) Xinyan Xiao, Yang Liu, Qun Liu, & Shouxun Lin: Fast generation of translation forest for large-scale SMT discriminative training. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.880-888. [PDF, 818KB]

(2011) Jun Xie, Haitao Mi & Qun Liu: A novel dependency-to-string model for statistical machine translation. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.216-226. [PDF, 992KB]

(2011) Deyi Xiong, Min Zhang, & Haizhou Li: Enhancing language models in statistical machine translation with backward n-grams and mutual information triggers. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1288-1297. [PDF, 620KB]

(2011) Jinxi Xu & Jinying Chen: How much can we gain from supervised word alignment?  ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.165-169. [PDF, 71KB]

 (2011) Feifei Zhai, Jiajun Zhang, Yu Zhou, & Chengqing Zong: Simple but effective approaches to improving tree-to-tree model. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.261-268. [PDF, 225KB]

(2011) Jiajun Zhang, Feifei Zhai, & Chenqing Zong: Augmenting string-to-tree translation models with fuzzy use of source-side syntax. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.204-215. [PDF, 317KB]

 (2011) Yinggong Zhao, Shujie Liu, Yangsheng Ji, Jiajun Chen, & Guodong Zhou: Transductive minimum error rate training for statistical machine translation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.641-648. [PDF, 214KB]

(2011) Zhongguang Zheng, Yao Meng, & Hao Yu: Lexical-based reordering model for hierarchical phrase-based machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.389-396. [PDF, 385KB]

(2011) Jingbo Zhu & Tong Xiao: Improving decoding generalization for tree-to-string translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.418-423. [PDF, 311KB]

(2011) Andreas Zollmann & Stephan Vogel: A word-class approach to labeling PSCFG rules for machine translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1-11. [PDF, 230KB]

(2010) Hala Almaghout, Jie Jiang, & Andy Way: CCG augmented hierarchical phrase-based machine translation. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.211-218. [PDF, 653KB]

(2010) Hala Almaghout, Jie Jiang, & Andy Way: The DCU machine translation systems for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.37-44. [PDF, 621KB]

(2010) Stacey Bailey: Data preparation for machine translation customization. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; abstract

(2010) Pratyush Banerjee, Jinhua Du, Sudip Naskar, Baoli Li, Andy Way, & Josef van Genabith: Combining multi-domain statistical machine translation models using automatic classifiers. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 304KB]

(2010) Hailong Cao & Eiichiro Sumita: Filtering syntactic constraints for statistical machine translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.17-21. [PDF, 65KB]

(2010) Hailong Cao, Andrew Finch, & Eiichiro Sumita: Syntactic constraints on phrase extraction for phrase-based machine translation. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28 August 2010; pp.28-33. [PDF, 84KB]

(2010) Boxing Chen, George Foster, & Roland Kuhn: Bilingual sense similarity for statistical machine translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.834-843. [PDF, 185KB]

(2010) Boxing Chen, George Foster, & Roland Kuhn: Fast consensus hypothesis regeneration for machine translation. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 11-16. [PDF, 120KB]

(2010) Yufeng Chen, Chenqing Zong, & Keh-Yih Su: On jointly recognizing and aligning bilingual named entities. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.631-639. [PDF, 151KB]

(2010) David Chiang: Learning to translate with source and target syntax. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.1443-1452. [PDF, 357KB]

(2010) Tagyoung Chung & Daniel Gildea: Effects of empty categories on machine translation.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.636-645. [PDF, 198KB]

(2010) Lei Cui, Dongdong Zhang, Mu Li, Ming Zhou, & Tiejun Zhao: Hybrid decoding: decoding with partial hypotheses combination over multiple SMT systems. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.214-222. [PDF, 497KB]

(2010) Lei Cui, Dongdong Zhang, Mu Li, Ming Zhou, & Tiejun Zhao: A joint rule selection model for hierarchical phrase-based translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.6-11. [PDF, 213KB]

(2010) John DeNero & Dan Klein: Discriminative modeling of extraction sets for machine translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.1453-1463. [PDF, 616KB]

(2010) John DeNero, Shankar Kumar, Ciprian Chelba, & Franz Och: Model combination for machine translation. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.975-983. [PDF, 573KB]

(2010) Jinhua Du & Andy Way: A discriminative latent variable-based “DE” classifier for Chinese-English SMT. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.286-294. [PDF, 698KB]

(2010) Jinhua Du & Andy Way: The impact of source-side syntactic reordering on hierarchical phrase-based SMT. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 746KB]; presentation: 34 slides [PDF, 540KB]

(2010) Jinhua Du & Andy Way: Using TERp to augment the system combination for SMT. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 9pp. [PDF, 177KB]

(2010) Xiangyu Duan, Rafael E.Banchs, Jun Lang, Deyi Xiong, Aiti Aw, Min Zhang, & Haizhou Li: I2R’s machine translation system for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.67-72. [PDF, 391KB]

(2010) Nan Duan, Hong Sun, & Ming Zhou: Translation model generalization using probability averaging for machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.304-312. [PDF, 296KB]

(2010) Nan Duan, Mu Li, Dongdong Zhang, & Ming Zhou: Mixture model-based minimum Bayes risk decoding using multiple machine translation systems. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.313-321. [PDF, 372KB]

(2010) Xiangyu Duan, Min Zhang, & Haizhou Li: Pseudo-word for phrase-based machine translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.148-156. [PDF, 275KB]

(2010) Chris Dyer & Philip Resnik: Context-free reordering, finite-state translation. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.858-866. [PDF, 272KB]

(2010) Michel Galley & Christopher D.Manning: Accurate non-hierarchical phrase-based translation.  NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.966-974. [PDF, 433KB]

(2010) Qin Gao & Stephan Vogel: Consensus versus expertise: a case study of word alignment with Mechanical Turk. Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon’s Mechanical Turk, Los Angeles, CA, June 2010; pp.30-34. [PDF, 389KB]

(2010) Qin Gao, Francisco Guzman, & Stephan Vogel: EMDC: a semi-supervised approach for word alignment. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.349-357. [PDF, 231KB]

(2010) Qin Gao, Nguyen Bach, & Stephan Vogel: A semi-supervised word alignment algorithm with partial manual alignments. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 1-10. [PDF, 278KB]

(2010) Guillem Gascó, Vicent Alabau, Jesús Andrés-Ferrer, Jesús González-Rubio, Martha-Alicia Rocha, Germán Sanchis-Trilles, Francisco Casacuberta, Jorge González, & Joan-Andreu Sánchez: ITI-UPV system description for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.85-92. [PDF, 414KB]; presentation by Germán Sanchis-Trilles: 16 slides [PDF of PPT, 481KB]

 (2010) Guillem Gascó Mora & Joan Andreu Sánchez Peiró: Syntax augmented inversion transduction grammars for machine translation [abstract]. CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 62KB]

(2010) Adrià de Gispert, Juan Pino, & William Byrne: Hierarchical phrase-based translation grammars extracted from alignment posterior probabilities.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.545-554. [PDF, 162KB]

(2010) Chooi-Ling Goh, Taro Watanabe, Michael Paul, Andrew Finch, & Eiichiro Sumita: The NICT translation  system for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.139-146. [PDF, 379KB]

(2010) Yanqing He, Yu Zhou, Chengqing Zong, & Huilin Wang: A novel reordering model based on multi-layer phrase for statistical machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.447-455. [PDF, 234KB]

(2010) Zhongjun He, Yao Meng, & Hao Yu: Extending the hierachical phrase based model with maximum entropy based BTG. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 9pp. [PDF, 104KB]

(2010) Zhongjun He, Yao Meng, & Hao Yu: Learning phrase boundaries for hierarchical phrase-based translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.383-390. [PDF, 378KB]

(2010) Zhongjun He, Yao Meng, & Hao Yu: Maximum entropy based phrase reordering for hierarchical phrase-based translation.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.555-563. [PDF, 144KB]

(2010) Degen Huang, Lian Zhao, Lishuang Li, & Haitao Yu: Mining large-scale comparable corpora from Chinese-English news collections. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.472-480. [PDF, 255KB]

(2010) Jie Jiang, Jinhua Du, & Andy Way: Improved phrase-based SMT with syntactic reordering patterns learned from lattice scoring. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 257KB]

 (2010) Jie Jiang, Andy Way, & Julie Carson-Berndsen: Lattice score based data cleaning for phrase-based statistical machine translation.  EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 631KB]

(2010) Jie Jiang, Jinhua Du, & Andy Way: Source-side syntactic reordering patterns with functional words for improved phrase-based SMT. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28 August 2010; pp.19-27. [PDF, 208KB]

(2010) Weimin Jiang: Pre-editing for machine translation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; abstract

(2010) Maxim Khalilov & Khalil Sima’an: The ILLC-UvA SMT system for IWSLT 2010.  Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.197-203. [PDF, 1209KB]

(2010) Jae Dong Kim, Ralf D.Brown, & Jaime G.Carbonell: Chunk-based EBMT. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 624KB]

(2010) Philipp Koehn: Enabling monolingual translators: post-editing vs.options. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.537-545. [PDF,

(2010) Roland Kuhn, Boxing Chen, George Foster, & Evan Stratford: Phrase clustering for smoothing TM probabilities – or, how to extract paraphrases from phrase tables. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.608-616. [PDF, 328KB]

(2010) Mikko Kurimo, William Byrne, John Dines, Philip N.Garner, Matthew Gibson, Yong Guan, Teemu Hirsimäki, Reima Karhila, Simon King, Hui Liang, Keiichiro Oura, Lakshmi Saheer, Matt Shannon, Sayaka Shiota, Jilei Tian, Keiichi Tokuda, Mirjam Wester, Yi-Jian Wu, & Junichi Yamagishi: Personalising speech-to-speech translation in the EMIME project. Proceedings of the ACL 2010 System Demonstrations, Uppsala, Sweden, 13 July 2010; pp.48-53. [PDF, 679KB]

(2010) Patrik Lambert, Simon Petitrenaud, Yanjun Ma & Andy Way: Statistical analysis of alignment characteristics for phrase-based machine translation. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 588KB]; presentation: 26 slides [PDF, 133KB]

(2010) Samuel Larkin, Boxing Chen, George Foster, Ulrich Germann, Eric Joanis, Howard Johnson, & Roland Kuhn: Lessons from NRC’s Portage system at WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 127-132. [PDF, 124KB]

(2010) Chi-Ho Li, Nan Duan, Yinggong Zhao, Shujie Liu, Lei Cui, Mei-yuh Hwang, Amittai Axelrod, Jianfeng Gao, Yaodong Zhang, & Li Deng: The MSRA machine translation system for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.135-138. [PDF, 254KB]; presentation: 17 slides [PDF of PPT, 477KB]

(2010) Mu Li, Yinggong Zhao, Dongdong Zhang, & Ming Zhou: Adaptive development data selection for log-linear model in statistical machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.662-670. [PDF, 213KB]

(2010) Xuansong Li, Niyu Ge, Stephen Grimes, Stephanie M.Strassel, & Kazuaki Maeda: Enriching word alignment with linguistic tags. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2189-2195. [PDF, 509KB]

(2010) Zhifei Li, Ziyuan Wang, Sanjeev Khudanpur, & Jason Eisner: Unsupervised discriminative language model training for machine translation using simulated confusion sets. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.656-664. [PDF, 348KB]

(2010) Wang Ling, Tiago Luís, João Graça, Luísa Coheur & Isabel Trancoso: The INESC-ID machine translation system for the IWSLT 2010.  Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.81-84. [PDF, 227KB]

(2010) Wang Ling, Tiago Luís, João Graça, Luísa Coheur & Isabel Trancoso: Towards a general and extensible phrase-extraction algorithm. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.313-320. [PDF, 382KB]

(2010) Shui Liu, Sheng Li, Tiejun Zhao, Min Zhang, & Pengyuan Liu: Head-modifier relation based non-lexical reordering model for phrase-based translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.748-756. [PDF, 297KB]

(2010) Shujie Liu, Chi-Ho Li, & Ming Zhou: Improved discriminative ITG alignment using hierarchical phrase pairs and semi-supervised training. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.730-738. [PDF, 1050KB]

(2010) Yang Liu & Qun Liu: Joint parsing and translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.707-715. [PDF, 211KB]

(2010) Yang Liu, Qun Liu, & Shouxun Lin: Discriminative word alignment by linear modeling. Computational Linguistics 36 (3), pp. 303-339 [PDF, 327KB]

(2010) Zhanyi Liu, Haifeng Wang, Hua Wu, & Sheng Li: Improving statistical machine translation with monolingual collocation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.825-833. [PDF, 405KB]

(2010) Chi-kiu Lo & Dekai Wu: Semantic vs. syntactic vs. n-gram structure for machine translation evaluation. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28  August 2010; pp.52-60. [PDF, 401KB]

(2010) Bin Lu, Tao Jiang, Kapo Chow, & Benjamin K. Tsou: Building a large English-Chinese parallel corpus from comparable patents and its experimental application to SMT.  [LREC 2010] Proceedings of the 3rd Workshop on Building and Using Comparable Corpora, Malta, 22 May 2010; pp.42-49. [PDF, 271KB]

(2010) Yanjun Ma & Andy Way: HMM word-to-phrase alignment with dependency constraints. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28  August 2010; pp.101-109. [PDF, 191KB]

(2010) Spyros Martzoukos & Christof Monz: The UvA system description for IWSLT 2010.  Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.205-208. [PDF, 232KB]

(2010) Haitao Mi & Qun Liu: Constituency to dependency translation with forests. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.1433-1442. [PDF, 190KB]

(2010) Haitao Mi, Liang Huang, & Qun Liu: Machine translation with lattices and forests. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.837-845. [PDF, 268KB]

(2010) Hwidong Na & Jong-Hyeok Lee: The POSTECH’s statistical machine translation system for the IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.153-156. [PDF, 290KB]

(2010) ThuyLinh Nguyen, Stephan Vogel, & Noah A.Smith: Nonparametric word segmentation for machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.815-823. [PDF, 267KB]

(2010) Michael Paul, Andrew Finch, & Eiichiro Sumita: Integration of multiple bilingually-learned segmentation schemes into statistical machine translation. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 400-408. [PDF, 256KB]

(2010) Michael Paul, Marcello Federico, & Sebastian Stüker: Overview of the IWSLT 2010 evaluation campaign. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.3-27. [PDF, 558KB]

(2010) Adam Pauls, Dan Klein, David Chiang, & Kevin Knight: Unsupervised syntactic alignment with inversion traduction grammars. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.118-126. [PDF, 766KB]

(2010) Jocelyn Phillips, Carol Van Ess-Dykema, Timothy Allison & Laurie Gerber: Parallel corpus development at NVTC. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 7pp. [PDF, 173KB]; abstract

(2010) Philip Resnik, Olivia Buzek, Chang Hu, Yakov Kronrod, Alex Quinn, & Benjamin B.Bederson: Improving translation via targeted paraphrasing. [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.127-137. [PDF, 548KB]

(2010) Yan Song, Chunyu Kit, & Hai Zhao: Reranking with multiple features for better transliteration. NEWS 2010: Proceedings of the 2010 Named Entities Workshop, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.62-65. [PDF, 395KB]

(2010) Jinsong Su, Yang Liu, Haitao Mi, Hongmei Zhao, Yajuan Lü, & Qun Liu: Dependency-based bracketing transduction grammar for statistical machine translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1185-1193. [PDF, 226KB]

(2010) Jinsong Su, Yang Liu, Yajuan Lü, Haitao Mi, & Qun Liu: Learning lexicalized reordering models from reordering graphs. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.12-16. [PDF, 1365KB]

(2010) Jun Sun, Min Zhang, & Chew Lim Tan: Discriminative induction of sub-tree alignment using limited labeled data. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.1047-1055. [PDF, 370KB]

(2010) Jun Sun, Min Zhang, & Chew Lim Tan: Exploring syntactic structural features for sub-tree alignment using bilingual tree kernels. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.306-315. [PDF, 286KB]

(2010) Zhaopeng Tu, Yang Liu, Young-Sook Hwang, Qun Liu, & Shouxun Lin: Dependency forest for statistical machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.1092-1100. [PDF, 215KB]

(2010) Wei Wang, Jonathan May, Kevin Knight, & Daniel Marcu: Re-structuring, re-labeling, and re-aligning for syntax-based machine translation. Computational Linguistics 36 (2), pp. 247-277 [PDF, 984KB]

(2010) Zhiyang Wang, Yajuan Lü, Qun Liu, & Young-Sook Hwang: Better filtration and augmentation for hierarchical phrase-based translation rules. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.142-146. [PDF, 135KB]

(2010) Shumin Wu, Jinho D.Choi, & Martha Palmer: Detecting cross-lingual semantic similarity using parallel PropBanks. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 9pp. [PDF, 274KB]

(2010) Tong Xiao, Jingbo Zhu, Muhua Zhu, & Huizhen Wang: Boosting-based system combination for machine translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.739-748. [PDF, 380KB]

(2010) Tong Xiao, Jingbo Zhu, Hao Zhang, & Muhua Zhu: An empirical study of translation rule extraction with multiple parsers. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1345-1353. [PDF, 388KB]

(2010) Xinyan Xiao, Yang Liu, Young-Sook Hwang, Qun Liu, & Shouxun Lin: Joint tokenization and translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.1200-1208. [PDF, 342KB]

(2010) Deyi Xiong, Min Zhang, & Haizhou Li: Learning translation boundaries for phrase-based decoding. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.136-144. [PDF, 450KB]

(2010) Hao Xiong, Jun Xie, Hui Yu, Kai Liu, Wei Luo, Haitao Mi, Yang Liu, Yajuan Lü, & Qun Liu: The ICT statistical machine translation system for IWSLT 2010.  Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.73-79. [PDF, 510KB]

(2010) Jinxi Xu & Antti-Veikko I.Rosti: Combining unsupervised and supervised alignments for MT: an empirical study.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.667-673. [PDF, 74KB]

(2010) Gae-won You, Seung-won Hwang, Young-In Song, Long Jiang, & Zaiqing Nie: Mining name translations from entity graph mapping.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.430-439. [PDF, 1542KB]

(2010) Hui Zhang, Min Zhang, Haizhou Li, & Eng Siong Chang: Non-isomorphic forest pair translation. [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.440-450. [PDF, 307KB]

(2010) Shiqi Zhao, Haifeng Wang, Xiang Lan, & Ting Liu: Leveraging multiple MT engines for paraphrase generation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.1326-1334. [PDF, 597KB]

(2010) Andreas Zollmann & Stephan Vogel: New parameterizations and features for PSCFG-based machine translation. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28  August 2010; pp.110-117. [PDF, 181KB]

(2009) Joshua S.Albrecht, Rebecca Hwa, & G.Elisabeta Marai: Correcting automatic translations through collaborations between MT and monolingual target-language users.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.60-68. [PDF, 178KB]

(2009) Ming-Hong Bai, Jia-Ming You, Keh-Jiann Chen, & Jason S.Chang: Acquiring translation equivalences of multiword expressions by normalized correlation frequencies. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.478-486. [PDF, 371KB]

(2009) Sandra Bailly & Colin McCarney: Patent machine translation – Proctor & Gamble experience. IPWARE Summit 2009: International conference and exhibition on Software for Intellectual Property, 21-23 October 2009, Saint-Raphaël, France; abstract, 1pp. [PDF, 97KB]; presentation, 32pp. [PDF of PPT, 2069KB]

(2009) Nicola Bertoldi, Arianna Bisazza, Mauro Cettolo, Germán Sanchis-Trilles, & Marcello Federico: FBK @ IWSLT 2009. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 37-44. [PDF, 346KB]; poster [PDF, 411KB]

(2009) Alexandra Birch, Phil Blunsom & Miles Osborne: A quantitative analysis of reordering phenomena. Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.197-205. [PDF, 808KB]

(2009) Jamie Brunning, Adrià de Gispert & William Byrne: Context-dependent alignment models for statistical machine translation.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.110-118. [PDF, 226KB]

(2009) Chris Callison-Birch: Fast, cheap, and creative: evaluating translation quality using Amazon’s Mechanical Turk. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.286-295. [PDF, 289KB]

(2009) Pi-Chuan Chang, Dan Jurafsky, & Christopher D.Manning: Disambiguating “DE” for Chinese-English machine translation.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.215-223. [PDF, 276KB]

(2009) Pi-Chuan Chang, Huihsin Tseng, Dan Jurafsky, & Christopher D.Manning: Discriminative reordering with Chinese grammatical features. Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.51-59. [PDF, 558KB]

(2009) Boxing Chen, Min Zhang, Haizhou Li & Aiti Aw: A comparative study of hypothesis alignment and its improvement for machine translation system combination. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.941-948. [PDF, 119KB]

(2009) Han-Bin Chen, Jian-Cheng Wu & Jason S.Chang: Learning bilingual linguistic reordering model for statistical machine translation.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.254-262. [PDF, 333KB]

(2009) Yi-Chang Chen & Chia-Ping Chen: A framework for machine translation output combination. ROCLING 2009: Proceedings of the 21st Conference on Computational Linguistics and Speech Processing, Taichung, Taiwan, 2009; pp.309-317. [PDF, 342KB]

(2009) David Chiang, Kevin Knight, & Wei Wang: 11,001 new features for statistical machine translation. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.218-226. [PDF, 114KB]

(2009) Jinho D.Choi, Martha Palmer, & Nianwen Xue: Using parallel Propbanks to enhance word-alignments. ACL-IJCNLP 2009: Third Linguistic Annotation Workshop (LAW III), Proceedings of the workshop, 6-7 August 2009, Suntec, Singapore; pp.121-124. [PDF, 139KB]

(2009) Tagyoung Chung & Daniel Gildea: Unsupervised tokenization for machine translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.718-726. [PDF, 197KB]

(2009) Marta R.Costa-jussà & Rafael E.Banchs: Barcelona Media SMT system description for the IWSLT 2009: introducing source context information. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 24-28. [PDF, 265KB]; poster [PDF, 281KB[

(2009) Josep M. Crego & François Yvon: Gappy translation units under left-to-right SMT decoding. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.66-73. [PDF, 551KB]

(2009) Jennifer DeCamp: What is missing in user-centric MT? MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 489-485. [PDF, 454KB]

(2009) Jinhua Du, Yanjun Ma & Andy Way: Source-side context-informed hypothesis alignment for combining outputs from machine translation systems. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.230-27. [PDF, 277KB]

(2009) Nan Duan, Mu Li, Tong Xiao, & Ming Zhou: The feature subspace method for SMT system combination. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1096-1104. [PDF, 416KB]

(2009) Xiangyu Duan, Deyi Xiong, Hui Zhang, Min Zhang, & Haizhou Li: I2R’s machine translation system for IWSLT 2009. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 50-54. [PDF, 294KB]; poster [PDF, 40KB]

(2009) Yang Feng, Yang Liu, Haitao Mi, Qun Liu, & Yajuan Lü: Lattice-based system combination for statistical machine translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1105-1113. [PDF, 161KB]

(2009) Michel Galley & Christopher D.Manning: Quadratic-time dependency parsing for machine translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.773-781. [PDF, 235KB]

(2009) Wei Gao, John Blitzer, Ming Zhou, & Kam-Fai Wong: Exploiting bilingual information to improve web search. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.1075-1083. [PDF,

(2009) Guillem Gascó & Joan Andreu Sánchez: UPV translation system for IWSLT 2009. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 118-123. [PDF, 331KB]; presentation [PDF of PPT, 603KB]

(2009) Jesús Giménez & Lluís Màrquez: On the robustness of syntactic and semantic features for automatic MT evaluation.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.250-258. [PDF, 171KB]

(2009) Julien Gosme, Yves Lepage & Adrien Lardilleux: Translation of sublanguages by subgrammars. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.77-84. [PDF, 116KB]

(2009) Aria Haghighi, John Blitzer, John DeNero & Dan Klein: Better word alignments with supervised ITG models. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.306-314. [PDF, 301KB]

(2009) Saša Hasan & Hermann Ney: Comparison of extended lexicon models in search and rescoring for SMT. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.17-20. [PDF, 167KB]

(2009) Zhongjun He, Yao Meng, Yajuan Lü, Hao Yu, & Qun Liu: Reducing SMT rule table with monolingual key phrase. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Short Papers, Suntec, Singapore, 4 August 2009; pp.121-124. [PDF, 120KB]

(2009) Fei Huang: Confidence measure for word alignment. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.932-940. [PDF, 727KB]

(2009) Gonzalo Iglesias, Adrià de Gispert, Eduardo R.Banga & William Byrne: Hierarchical phrase-based translation with weighted finite state transducers. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.433-441. [PDF, 170KB]

(2009) Heng Ji: Mining name translations from comparable corpora by creating bilingual information networks. [ACL-IJCNLP-2009] Proceedings of the 2nd Workshop on Building and Using Comparable Corpora, Suntec, Singapore, 6 August 2009; pp.34-47. [PDF, 120KB]

(2009) Hongfei Jiang, Muyun Yang, Tiejun Zhao, Sheng Li & Bo Wang: A statistical machine translation model based on a syntactic synchronous grammar. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Short Papers, Suntec, Singapore, 4 August 2009; pp.125-128. [PDF, 514KB]

(2009) Hongfei Jiang, Sheng Li, Muyun Yang, & Tiejun Zhao: A study of translation rule classification for syntax-based statistical machine translation.  Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.45-50. [PDF, 466KB]

(2009) Long Jiang, Shiquan Yang, Ming Zhou, Xiaohua Liu, & Qingsheng Zhu: Mining bilingual data from the web with adaptively learnt patterns. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.870-878. [PDF, 306KB]

 (2009) Maxim Khalilov, José A.R.Fonollosa, & Mark Dras: A new subtree-transfer approach to syntax-based reordering for statistical machine translation. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.197-204. [PDF, 1078KB]

(2009) Patrik Lambert, Yanjun Ma, Sylwia Ozdowska & Andy Way: Tracking relevant alignment characteristics for machine translation. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.268-275. [PDF, 443KB]

(2009) Yves Lepage, Adrien Lardilleux, & Julien Gosme: The GREYC translation memory for the IWSLT 2009 evaluation campaign: one step beyond translation memory. IWSLT 2009: Proceedings of the International Workshop on Spoke n Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 45-49. [PDF, ]; presentation [PDF of PPT,

(2009) Chi-Ho Li, Xiaodong He, Yupeng Liu, & Ning Xi: Incremental HMM alignment for MT system combination. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.949-957. [PDF, 559KB]

(2009) Maoxi Li, Jiajun Zhang, Yu Zhou, & Chengqing Zong: The CASIA statistical machine translation system for IWSLT 2009. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 83-90. [PDF, 317KB]; presentation [PDF of PPT, 463KB]

(2009) Mu Li, Nan Duan, Dongdong Zhang, Chi-Ho Li, & Ming Zhou: Collaborative decoding: partial hypothesis re-ranking using translation consensus between decoders. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.585-592. [PDF, 354KB]

(2009) Yang Liu, Yajuan Lü & Qun Liu: Improving tree-to-tree translation with packed forests.  [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.558-566. [PDF, 151KB]

(2009) Yang Liu, Haitao Mi, Yang Feng, & Qun Liu: Joint decoding with multiple translation models. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.55-63. [PDF, 145KB]

(2009) Chi-kiu Lo & Dekai Wu: HKUST statistical machine translation experiments for CWMT 2009. CWMT 2009: the 5th China Workshop on Machine Translation, Nanjing, China, October 16-17, 2009; 5pp. [PDF, 85KB]

(2009) Bin Lu, Benjamin K.Tsou, Jingbo Zhu, Tao Jiang, & Oi Yee Kwong: The construction of a Chinese-English patent parallel corpus. MT Summit XII: Third Workshop on Patent Translation, August 30, 2009, Ottawa, Ontario, Canada; pp. 17-24. [PDF, 181KB]

(2009) Yanjun Ma, Tsuyoshi Okita, Özlem Çetinoğlu, Jinhua Du, & Andy Way: Low-resource machine translation using MaTrEx: the DCU machine translation system for IWSLT 2009. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 29-36. [PDF, 305KB]; presentation [PDF of PPT, 466KB]

(2009) Evgeny Matusov, Gregor Leusch, & Hermann Ney: Learning to combine machine translation systems.  In: Cyril Goutte, Nicola Cancedda, Marc Dymetman, & George Foster (eds.) Learning machine translation. (Cambridge, Mass.: The MIT Press, 2009); pp.257-276.

(2009) Arne Mauser, Saša Hasan & Hermann Ney: Extending statistical machine translation with discriminative and trigger-based lexicon models. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.210-218. [PDF, 266KB]<