Machine Translation Archive

 

Index to applications, uses, tools, etc.

 

Publications since 2010

 

Click here for publications 2005-2009, publications 2000-2004, publications 1990-1999, and publications before 1990

 

[To return to home page click here]

 

Abbreviations

(2011) Deana L.Pennell & Yang Liu: A character-level machine translation approach for normalization of SMS abbreviations. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.974-982. [PDF, 2553KB]

Abstracts

(2011) Bruno Pouliquen & Christophe Mazenc: COPPA, CLIR and TAPTA: three tools to assist in overcoming the patent barrier at WIPO. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.24-30. [PDF, 896KB]

Administrative documents

(2010) Michael Jellinghaus, Alexandros Poulis, & David Kolovratnik: Exodus – exploring SMT for EU institutions.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 110-114. [PDF, 109KB]

Air travel see Travel and tourism

Algorithms see Programming and algorithms

Audiovisual media see Multimodal (and multimedia) applications

Authoring tools

 (2010) Jennifer Beaupre & Kent Taylor: Content quality for better MT: a practical guide to quality at the source. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 3pp. [PDF, 69KB]

(2010) Christoph Rösener: Computational linguistics in the translator’s workflow—combining authoring tools and translation memory systems. Proceedings of the NAACL HLT 2010 Workshop on Computational Linguistics and Writing, Los Angeles, CA, June 2010; pp.1-6. [PDF, 2233KB]

Authorship

(2011) Steffen Hedegaard & Jakob Grue Simonsen: Lost in translation: authorship attribution using frame semantics. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.65-70. [PDF, 109KB]

Automobile industry

(2011) Johannes Bursch: Corporate language management at Daimler AG – role and challenges.  META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 19 slides [PDF of PPT, 2242KB]

(2010) Daniel Andrade, Tetsuya Nasukawa, & Jun’ichi Tsujii: Robust measurement and comparison of context similarity for finding translation pairs. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.19-27. [PDF, 709KB]

Awards

(2011) Georg Rehm & Josef van Genabith: META prize and META seal of recognition: award ceremony.  META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 18 slides [PDF of PPT, 1442 KB]

Back translation

(2011) Fabio Alves, Adrianna Pagano, & Igor L. da Silva: Modeling (un)packing of meaning in translation: insights from effortful text production. Proceedings of the 8th international NLPSC workshop. Special theme: Human-machine interaction in translation, Copenhagen Business School, 20-21 August 2011; ed.Bernadette Sharp, Michael Zock, Michael Carl, Arnt Lykke Jakobsen (Copenhagen Studies in Language 41), Frederiksberg: Samfundslitteratur, 2011; pp.153-163. [PDF, 1064KB]

(2011) Chang Hu, Philip Resnik, Yakov Kronrod, Vladimir Eidelman, Olivia Buzek, & Benjamin B.Bederson: The value of monolingual crowdsourcing in a real-world translation scenrio: simulation using Haitian Creole emergency SMS messages. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.399-404 . [PDF, 164KB]

(2011) Zhifei Li, Jason Eisner, Ziyuan Wang, Sanjeev Khudanpur, & Brian Roark: Minimum imputed risk: unsupervised discriminative training for machine translation. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.920-929. [PDF, 236KB]

(2010) Milam Aiken & Mina Park: The efficacy of round-trip translation for MT evaluation. Translation Journal 14 (1), January 2010; 10pp. [PDF, 149KB]

(2010) Yanli Sun, Sharon O’Brien, Minako O’Hagan, & Fred Hollowood: A novel statistical pre-processing model for rule-based machine translation system.  EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 790KB]; presentation: 28 slides [PDF, 580KB]

Benefits and advantages see User experiences and needs

Bible

 (2011) Rafael E.Banchs & Marta R.Costa-jussà: A semantic feature for statistical machine translation. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.126-134. [PDF, 576KB]

(2010)  Marta R.Costa-jussà, Vidas Daudaravicius, & Rafael E.Banchs: Integration of statistical collocation segmentations in a phrase-based statistical machine translation system. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 975KB]

Bootstrapping

 (2011) Mauro Cettolo, Nicola Bertoldi, & Marcello Federico: Bootstrapping Arabic-Italian SMT through comparable texts and pivot translation. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.249-256. [PDF, 354KB]; presentation, 12 slides [PDF]

(2011) Mitesh M.Khapra, Salil Joshi, Arindam Chatterjee, & Pushpak Bhattacharyya: Together we can: bilingual bootstrapping for WSD. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.561-569. [PDF, 137KB]

(2011) Manny Rayner, Paula Estrella, & Pierrette Bouillon: Bootstrapping a statistical speech translator from a rule-based one. Proceedings of the Second International Workshop on Free/Open-Source Rule-Based Machine Translation, Barcelona, Spain, January 20-21, 2011, ed. F.Sánchez-Martínez and J.A.Pérez-Ortiz; pp.21-28. [PDF, 199KB]

(2010) Lieve Macken & Walter Daelemans: A chunk-driven bootstrapping approach to extracting translation patterns [abstract]. CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 59KB]

(2010) Manny Rayner, Paula Estrella, & Pierrette Bouillon: A bootstrapped interlingua-based SMT architecture. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 611KB]; presentation: 30 slides [PDF, 737KB]

Broadcast transcripts (see also News texts)

(2010) Hassan Sawaf: Arabic dialect handling in hybrid machine translation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 8pp. [PDF, 146KB]

Chatrooms and discussion fora

(2011) Johanka Spoustová & Miroslav Spousta: Comparable fora. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.96-101. [PDF, 80KB]

Clothing and dress

(2011) Masamichi Ideue, Kazahide Yamamoto, Masao Utiyama, & Eiichiro Sumita: A comparison of unsupervised bilingual term extraction methods using phrase tables. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.346-351. [PDF, 160KB]

Cloud computing

(2011) Andrejs Vasiļjevs, Raivis Skadiņš, & Jörg Tiedemann: LetsMT!: cloud-based platform for building user tailored machine translation engines. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.507-511. [PDF, 211KB]

(2010) Graham Wilcock: Cloud computing for the humanities. Human Language Technologies—The Baltic Perspective, 4th International Conference, Riga, Latvia, October 7-8, 2010; 28 slides [PDF of PPT, 2265KB]

(2011) LetsMT! Platform for online sharing of training data and building user tailored MT. (European Machine Translation Projects.) [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; p.337. [PDF, 66KB]

(2010) Andrejs Vasiljevs: LetsMT! – towards cloud based service for MT generation. Translingual Europe 2010, Hotel Maritim, Berlin, Germany, Monday June 7th 2010; 15pp. [PDF, 368KB]

Collaboration [see also Crowd sourcing]

(2011) Maciej Ogrodniczuk & Adam Przepiórkowski: Polish LRTs: CESAR’s story.  META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 14 slides [PDF of PPT, 324KB]

(2011) Marko Tadić: The CESAR project: enabling LRT for 70m+ speakers.  META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 24 slides [PDF of PPT, 810KB]

(2011) Hans Uszkoreit: Where do we stand? META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 10 slides [PDF of PPT, 2420KB]

(2011) Andrejs Vasiļjevs, Raivis Skadiņš, & Jörg Tiedemann: LetsMT!: cloud-based platform for building user tailored machine translation engines. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.507-511. [PDF, 211KB]

(2011) Na Ye & Guiping Zhang: Study on the impact factors of the translators’ post-editing efficiency in a collaborative translation environment. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.579-586. [PDF, 277KB]

(2011) LetsMT! Platform for online sharing of training data and building user tailored MT. (European Machine Translation Projects.) [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; p.337. [PDF, 66KB]

(2010) proceedings of META-FORUM 2010: Challenges for multilingual Europe, November 17/18 2010, Brussels, Belgium.

(2010) Takeshi Abekawa, Masao Utiyama, Eiichiro Sumita, & Kyo Kageura: Community-based construction of draft and final translation corpus through a translation hosting site Minna no Hon’yaku (MNH).  LREC 2010: proceedings of the seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3662-3669. [PDF, 1695KB]

(2010) Alain Désilets: Collaborative translation: technology, crowdsourcing, and the translator perspective. Introduction to workshop at AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31, 2010; 2pp. [PDF, 114KB]

(2010) Alice Dijkstra: Dutch/Flemish HLT cooperation.  Translingual Europe 2010, Hotel Maritim, Berlin, Germany, Monday June 7th 2010, Panel on Small Languages; 9pp. [PDF, 440KB]

(2010) Jo Drugan & Bogdan Babych: Shared resources, shared values? Ethical implications of sharing translation resources. JEC 2010: Second joint EM+/CNGL WorkshopBringing MT to the user: research on integrating MT in the translation industry”, AMTA 2010, Denver, Colorado, November 4, 2010; pp.3-9. [PDF, 9,433KB]

(2010) A.Kumaran, Naren Datha, B.Ashok, K.Saravanan, Anil Ande, Ashwani Sharma, Sridhar Vedantham, Vidya Natampally, Vikram Dendu & Sandor Maurice: WikiBABEL: a system for multilingual Wikipedia content. Contribution to workshop of ‘Collaborative translation’ at AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31, 2010; 5pp. [PDF, 316KB]

(2010) Georg Rehm: META-NET and META-SHARE: an overview. Human Language Technologies – the Baltic Perpective, 4th International Conference, Riga, Latvia, October 8, 2010; 52 slides [PDF of PPT, 9770KB]

(2010) Antoine Rey: GlobalSight MT integration. Translingual Europe 2010, Hotel Maritim, Berlin, Germany, Monday June 7th 2010; 5pp. [PDF, 1048KB]

(2010) Víctor M.Sánchez-Cartagena & Juan Antonio Pérez-Ortiz: Tradubi: open-source social translation for the Apertium machine translation platform. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.47-56. [PDF, 223KB]

(2010) Willem Stoeller: Community translation. Translingual Europe 2010, Hotel Maritim, Berlin, Germany, Monday June 7th 2010; 19pp. [PDF, 4871KB]

(2010) Josef van Genabith: EuroMatrixPlus – evaluation, localisation, open source. Translingual Europe 2010, Hotel Maritim, Berlin, Germany, Monday June 7th 2010; 24pp. [PDF, 2818KB]

(2010) Andrejs Vasiljevs: LetsMT! – towards cloud based service for MT generation. Translingual Europe 2010, Hotel Maritim, Berlin, Germany, Monday June 7th 2010; 15pp. [PDF, 368KB]

(2010) Andrejs Vasiļjevs, Tatiana Gornostoy, & Raivis Skadiņš: LetsMT!  - online platform for sharing training data and building user-tailored machine translation. Human Language Technologies – the Baltic Perpective, 4th International Conference, Riga, Latvia, October 8, 2010; 18 slides [PDF of PPT, 727KB]

(2010) Chris Wendt: Better translations with user collaboration – integrated MT at Microsoft. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 5pp. [PDF, 807KB]

Commercial systems (see also Economics of MT)

(2011) Jochen Hummel: Visibility for the European industrial LT landscape at META-FORUM 2011.  META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 11 slides [PDF of PPT, 3786KB]

(2011) Rose Lockwood: LT from research to innovation: big foot, giant squid, little fishes.  META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 13 slides [PDF of PPT, 1975KB]

(2010) Technical showcase at AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, November 2, 2010

Computational complexity (see also Modularity)

(2011) Kenneth Heafield: KenLM: faster and smaller language model queries. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.187-197. [PDF, 169KB]

(2011) Gonzalo Iglesias, Cyril Allauzen, William Byrne, Adrià de Gispert, & Michael Riley: Hierarchical phrase-based translation representations. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.1373-1383. [PDF, 334KB]

(2010) Holger Schwenk: Continuous-space language models for statistical machine translation. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.137-146. [PDF, 151KB]

Computer science and information technology texts

(2010) Osamuyimen Stewart, David Lubensky, Scott Macdonald, & Julie Marcotte: Using machine translation for localization of electronic support content: evaluating end-user satisfaction. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 6pp. [PDF, 39KB]

Consumer services see Information services

Controlled languages

 (2010) Aarne Ranta: Multilingual packages of controlled languages: an introduction to GF. CNL2010: 2nd Workshop on Controlled Natural Language, Marettimo Island, Sicily, September 13-15, 2010; 83 slides [PDF of PPT, 1140KB]

(2010) Aarne Ranta, Krasimir Angelov, & Thomas Hallgren: Tools for multilingual grammar-based translation on the web. Proceedings of the ACL 2010 System Demonstrations, Uppsala, Sweden, 13 July 2010; pp.66-71. [PDF, 313KB]

 (2010) Irina Temnikova: Cognitive evaluation approach for controlled language post-editing experiment. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3485-3490. [PDF, 631KB]

Cross-language information retrieval see Information retrieval

Crowd sourcing

(2011) Luisa Bentivogli, Marcello Federico, Giovanni Moretti, & Michael Paul: Getting expert quality from the crowd for machine translation evaluation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.521-528. [PDF, 337KB]

(2011) Karën Fort, Gilles Adda, & K.Bretonnel Cohen: Amazon Mechanical Turk: gold mine or coal mine?  Computational Linguistics 37 (2), pp. 413-420 [PDF, 96KB]

(2011) Chang Hu, Philip Resnik, Yakov Kronrod, Vladimir Eidelman, Olivia Buzek, & Benjamin B.Bederson: The value of monolingual crowdsourcing in a real-world translation scenrio: simulation using Haitian Creole emergency SMS messages. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.399-404 . [PDF, 164KB]

 (2011) Matteo Negri, Luisa Bentivogli, Yashar Mehdad, Danilo Giampiccolo, & Alessandro Marchetti: Divide and conquer: crowdsourcing the creation of cross-lingual textual entailment corpora.  [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.670-679. [PDF, 493KB]

 (2011) Omar F.Zaidan & Chris Callison-Burch: Crowdsourcing translation: professional quality from non-professionals. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1220-1229. [PDF, 241KB]

(2010) Gilles Adda & Joseph Mariani: Language resources & Amazon Mechanical Turk: ethical, legal and other issues. LREC 2010: Le gal Issues for Sharing Language Resources - LISLR2010 Workshop, 17 May 2010, Valletta, Malta; 21slides. [PDF of PPT, 244KB]

(2010) Vamshi Ambati, Stephen Vogel, & Jaime Carbonell: Active learning and crowd-sourcing for machine translation. LREC 2010: proceedings of the seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2169-2174. [PDF, 436KB]

(2010) Vamshi Ambati & Stephan Vogel: Can crowds build parallel corpora for machine translation systems? Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon’s Mechanical Turk, Los Angeles, CA, June 2010; pp.62-65. [PDF, 89KB]

(2010) Michael Denkowski, Hasan Al-Haj, & Alon Lavie: Turker-assisted paraphrasing for English-Arabic machine translation. Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon’s Mechanical Turk, Los Angeles, CA, June 2010; pp.66-70. [PDF, 237KB]

(2010) Michael Denkowski & Alon Lavie: Exploring normalization techniques for human judgments of machine translation adequacy collected using Amazon Mechanical Turk. Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon’s Mechanical Turk, Los Angeles, CA, June 2010; pp.57-61. [PDF, 248KB]

(2010) Alain Désilets: Collaborative translation: technology, crowdsourcing, and the translator perspective. Introduction to workshop at AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31, 2010; 2pp. [PDF, 114KB]

(2010) Bill Dolan: Building partnerships with language communities: the importance of shared technology and shared data. META-FORUM 2010: Challenges for multilingual Europe, November 17/18 2010, Brussels, Belgium; 38 slides [PDF of PPT, 1245KB]

(2010) Qin Gao & Stephan Vogel: Consensus versus expertise: a case study of word alignment with Mechanical Turk. Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon’s Mechanical Turk, Los Angeles, CA, June 2010; pp.30-34. [PDF, 389KB]

(2010) Yakov Kronrod, Philip Resnik, Olivia Buzek, Chang Hu, Alex Quinn, & Benjamin B.Bederson: Improving translation via targeted paraphrasing. Contribution to workshop of ‘Collaborative translation’ at AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31, 2010; 4pp. [PDF, 79KB]

(2010) Robert Munro, Steven Bethard, Victor Kuperman, Vicky Tzuyin Lai, Robin Melnick, Christopher Potts, Tyler Schnoebelen, & Harry Tily: Crowdsourcing and language studies: the new generation of linguistic data. Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon’s Mechanical Turk, Los Angeles, CA, June 2010; pp.122-130. [PDF, 310KB]

(2010) Robert Munro: Crowdsourcing translation for emergency response in Haiti: the global collaboration of local knowledge. Contribution to workshop of ‘Collaborative translation’ at AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31, 2010; 4pp. [PDF, 331KB]

(2010) Mike O’Malley: The challenges of distributed parallel corpora. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; abstract

(2010) Willem Stoeller: Community translation. Translingual Europe 2010, Hotel Maritim, Berlin, Germany, Monday June 7th 2010; 19pp. [PDF, 4871KB]

(2010) Jost Zetzsche: Crowdsourcing and the professional translator. Contribution to workshop of ‘Collaborative translation’ at AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31, 2010; 1p. [PDF, 54KB]

Cultural aspects see Social and cultural aspects

Customization and implementation

(2011) Ryamond Flournoy: MT use within the enterprise: encouraging adoption via a unified MT API.  MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.234-238. [PDF, 42KB]

(2011) Yanli Sun, Juan Li, & Yi Li: Deploying MT into a localisation workflow: pains and gains. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.239-243. [PDF, 83KB]

(2011) Andrejs Vasiļjevs, Raivis Skadiņš, & Inguna Skadiņa: Towards application of user-tailored machine translation in localization.  Proceedings of the Third Joint EM+/CNGL Workshop “Bringing MT to the User: Research Meets Translators” (JEC ’11), Luxembourg, 14 October 2011; pp.23-31. [PDF, 1896KB]

(2011) Andy Way, Kenny Holden, Lee Ball, & Gavin Wheeldon: SmartMATE: online self-serve access to state-of-the-art SMT.  Proceedings of the Third Joint EM+/CNGL Workshop “Bringing MT to the User: Research Meets Translators” (JEC ’11), Luxembourg, 14 October 2011; pp.43-52. [PDF, 1175KB]

(2010) Sven C.Andrä & Jörg Schütz: Effectual MT within a translation workflow panopticon. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 4pp. [PDF, 169KB]

(2010) Olga Beregovaya & Alex Yanishevsky: PROMT at PayPal: enterprise-scale MT deployment for financial industry content. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 4pp. [PDF, 253KB]

(2010) Heidi Depraetere & Pablo Vazquez: Machine translation engine selection in the enterprise. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 4pp. [PDF, 480KB]; presentation: 14 slides [PDF, 1631KB]

(2010) Fred Hollowood: 2010 the industrialisation of MT. Translingual Europe 2010, Hotel Maritim, Berlin, Germany, Monday June 7th 2010; 8pp. [PDF, 224KB]

(2010) Michael Ladwig: Language NOW. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; abstract

(2010) Jenny Lu: Machine translation (MT) @ CA Technologies: where can MT be best successful and what are the best MT engines for various languages. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 3pp. [PDF, 108KB]

(2010) Kirti Vashee & Rustin Gibbs: Scenarios for customizing an SMT engine based on availability of data. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 4pp. [PDF, 153KB]

 (2010) Chris Wendt: Better translations with user collaboration – integrated MT at Microsoft. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 5pp. [PDF, 807KB]

 (2010) E.Yuste, M.Herranz, A.-L.Lagarda, L.Tarazón, I.Sánchez-Cortina, & F.Casacuberta: PangeaMT – putting open standards to work…well. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 8pp. [PDF, 480KB]

Decipherment

(2011) Jacob Aron: Cracking the code of machine translation. New Scientist, vol.210, issue 2817, June 2011; 2pp.

(2011) Taylor Berg-Kirkpatrick & Dan Klein: Simple effective decipherment via combinatorial optimization. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.313-321. [PDF, 299KB]

(2011) Kevin Knight, Beáta Megyesi & Christiane Schaefer: The Copiale cipher. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.110-119. [PDF, 657KB]

(2011) John Markoff: Translation tool cracked a 1700s code. New York Times, November 13, 2011. [PDF, 65KB]

(2011) Sujith Ravi & Kevin Knight: Deciphering foreign language. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.12-21. [PDF, 225KB]

(2010) Eric Corlett & Gerald Penn: An exact A* method for deciphering letter-substitution ciphers. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.1040-1047. [PDF, 132KB]

(2010) Benjamin Snyder, Regina Barzilay, & Kevin Knight: A statistical model for lost language decipherment. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.1048-1057. [PDF, 764KB]

Dictation

(2011) Eva Safar: Machine translation with corpus linguistics and HPSG. International Workshop on Sign Language Translation and Avatar Technology (SLTAT), 10-11 January 2011, Federal Ministry of Labour and Social Affairs, Berlin, Germany; 12 slides [PDF of PPT, 288KB]

Disabled persons

(2011) EASTIN-CL: crosslingual and multimodal search in a portal for support of assisted living. (European Machine Translation Projects.) [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; p.331. [PDF, 138KB]

Document management and organization

(2011) Johannes Bursch: Corporate language management at Daimler AG – role and challenges.  META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 19 slides [PDF of PPT, 2242KB]

(2011) Yanli Sun, Juan Li, & Yi Li: Deploying MT into a localisation workflow: pains and gains. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.239-243. [PDF, 83KB]

Dynamic programming

(2011) Alexander M.Rush & Michael Collins: Exact decoding of syntactic translation models through Lagrangian relaxation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.72-82. [PDF, 203KB]

Economic aspects

(2010) Heidi Depraetere & Pablo Vazquez: Machine translation engine selection in the enterprise. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 4pp. [PDF, 480KB]; presentation: 14 slides [PDF, 1631KB]

(2010) John Dixon: Machine translation in an enterprise environment. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 5pp. [PDF, 126KB]

(2010) Doug Strock: Practical uses of MT at Global Language Translations and Consulting: a case study of MT use for profit. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 3pp. [PDF, 24KB]

(2010) Lori Thicke: What’s driving machine translation and why should we care? EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 11pp. [PDF of PPT presentation, 1539KB]; presentation: 23 slides [PDF, 1568KB]

Edit distance

(2011) Taylor Berg-Kirkpatrick & Dan Klein: Simple effective decipherment via combinatorial optimization. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.313-321. [PDF, 299KB]

(2010) Sergio Penkale, Yanjun Ma, Daniel Galron, & Andy Way: Accuracy-based scoring for phrase-based statistical machine translation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 281KB]

Electronics texts

(2010) Radu Soricut & Abdessamad Echihabi: TrustRank: inducing trust in automatic translations via ranking. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.612-621. [PDF, 222KB]

Emergency services

(2011) Chang Hu, Philip Resnik, Yakov Kronrod, Vladimir Eidelman, Olivia Buzek, & Benjamin B.Bederson: The value of monolingual crowdsourcing in a real-world translation scenrio: simulation using Haitian Creole emergency SMS messages. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.399-404 . [PDF, 164KB]

(2011) William D.Lewis, Robert Munro, & Stephan Vogel: Crisis MT: developing a cookbook for MT in crisis situations. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.501-511. [PDF, 115KB]

(2010) Robert Munro: Crowdsourcing translation for emergency response in Haiti: the global collaboration of local knowledge. Contribution to workshop of ‘Collaborative translation’ at AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31, 2010; 4pp. [PDF, 331KB]

 (2010) Irina Temnikova: Cognitive evaluation approach for controlled language post-editing experiment. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3485-3490. [PDF, 631KB]

Error detection and correction

(2011) Vicent Alabau, Alberto Sanchis, & Francisco Casacuberta: Improving on-line handwritten recognition using translation models in multimodal interactive machine translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.389-394. [PDF, 417KB]

(2011) Sankaranarayanan Ananthakrishnan, Shiv Vitaladevuni, Rohit Prasad, & Prem Natarajan: Source error-projection for sample selection in phrase-based SMT for resource-poor languages. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.819-827. [PDF, 425KB]

(2011) Nguyen Bach, Fei Huang, & Yaser Al-Onaizan: Goodness: a method for measuring machine translation confidence. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.211-219. [PDF, 976KB]

(2011) Ahmed El Kholy & Nizar Habash: Automatic error analysis for morphologically rich languages. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.225-232. [PDF, 219KB]

 (2011) David Mareček, Rudolf Rosa, Petra Galuščáková, & Ondřej Bojar: Two-step translation with grammatical post-processing. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.426-432. [PDF, 119KB]

(2011) Maja Popović: Hjerson: an open source tool for automatic error classification of machine translation output. Sixth Machine Translation Marathon, 5-10 September 2011, Trento; Prague Bulletin of Mathematical Linguistics, no.96, October 2011; pp.59-67. [PDF, 121KB]; presentation, 20 slides [PDF of PPT, 226KB]

(2011) Maja Popović & Aljoscha Burchardt: From human to automatic error classification for machine translation output. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.265-272. [PDF, 183KB]; presentation, 16 slides [PDF, 385KB]

(2011) Sara Stymne: BLAST: a tool for error analysis of machine translation output. Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, Portland, Oregon, June 19-24, 2011; pp.56-61. [PDF, 128KB]

(2011) Sara Stymne: Pre- and postprocessing for statistical machine translation into Germanic languages. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Student Session, Portland, Oregon, June 19-24, 2011; pp.12-17. [PDF, 93KB]

(2011) Kaveh Taghipour, Shahram Khadivi, & Jia Xu: Parallel corpus refinement as an outlier detection algorithm. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.414-421. [PDF, 213KB]

(2011) Dan Zeman, Mark Fishel, Jan Berka, & Ondřej Bojar: Addicter: what’s wrong with my translations? Sixth Machine Translation Marathon, 5-10 September 2011, Trento; Prague Bulletin of Mathematical Linguistics, no.96, October 2011; pp.79-88. [PDF, 276KB]; presentation, 20 slides [PDF of PPT, 359KB]

(2010) Nicola Bertoldi, Mauro Cettolo, & Marcello Federico: Statistical machine translation of texts with misspelled words. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.412-419. [PDF, 160KB]

(2010) Graeme Blackwood, Adrià de Gispert & William Byrne: Fluency constraints for minimum Bayes-risk decoding of statistical machine translation lattices. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.71-79. [PDF, 547KB]

(2010) Mireia Farrús, Marta R.Costa-jussà, José B.Mariño, & José A.R.Fonollosa: Linguistic-based evaluation criteria to identift statistical machine translation errors. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 7pp. [PDF, 606KB]

(2010) Kristen Parton & Kathleen McKeown: MT error detection for cross-lingual question answering. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.946-954. [PDF, 161KB]

(2010) Philip Resnik, Olivia Buzek, Chang Hu, Yakov Kronrod, Alex Quinn, & Benjamin B.Bederson: Improving translation via targeted paraphrasing. [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.127-137. [PDF, 548KB]

(2010) Sara Stymne & Lars Ahrenberg: Using a grammar checker for evaluation and postprocessing of statistical machine translation. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2175-2181. [PDF,

(2010) Guillaume Wisniewski, Alexandre Allauzen, & François Yvon: Assessing phrase-based translation models with oracle decoding. [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.933-943. [PDF, 188KB]

(2010) Deyi Xiong, Min Zhang, & Haizhou Li: Error detection for statistical machine translation using linguistic features. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.604-611. [PDF, 520KB]

(2010) Huichao Xue & Rebecca Hwa: Syntax-driven machine translation as a model of ESL revision. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1373-1381. [PDF, 258KB]

Evaluation of systems see index of methods

Examples of MT output

(2011) Omar F.Zaidan & Chris Callison-Burch: Crowdsourcing translation: professional quality from non-professionals. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1220-1229. [PDF, 241KB]

(2010) Danielle Belopotosky: Putting Google to the test. New York Times, March 9, 2010. [PDF, 169KB]

File interchange and exchange

(2011) Seiji Okura, Yuji Yamamoto, Hajime Ito, Michael Kato, Miwako Shimazu, & Francis Bond: UTX 1.11, a simple and open user dictionary/terminology standard, and its effectiveness with multiple MT systems. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.587-594. [PDF, 95KB]

(2011) Stelios Piperidis: META-SHARE: an open resource exchange infrastructure for stimulating research and innovation.  META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 37 slides [PDF of PPT, 5302KB]

(2010) Jinhua Du, Johann Roturier, & Andy Way: TMX markup: a challenge when adapting SMT to the localisation environment. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 656KB]

(2010) Stelios Piperidis: META-SHARE: the open resource exchange facility. META-FORUM 2010: Challenges for multilingual Europe, November 17/18 2010, Brussels, Belgium; 32 slides [PDF of PPT, 6471KB]

Financial texts

(2010) Olga Beregovaya & Alex Yanishevsky: PROMT at PayPal: enterprise-scale MT deployment for financial industry content. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 4pp. [PDF, 253KB]

Fluency see Readability and comprehension

Geography and geology texts

(2010) Martin Volk, Noah Bubenhofer, Adrian Althaus, Maya Bangerter, Lenz Furrer, & Beni Ruef: Challenges in building a multilingual Alpine heritage corpus. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1653-1659. [PDF, 1296KB]

Globalization

(2010) Doris Albisser: Born global – how to communicate effectively in an international environment. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 2pp. [PDF, 476KB]; presentation: 27 slides [PDF, 553KB]

(2010) Sven C.Andrä & Jörg Schütz: Employing machine translation in glocalization tasks – a use case study. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 611KB]; presentation: 35 slides [PDF, 111KB]

(2010) Lori Thicke: What’s driving machine translation and why should we care? EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 11pp. [PDF of PPT presentation, 1539KB]; presentation: 23 slides [PDF, 1568KB]

Government documentation

(2010) Jennifer DeCamp: Language Technology Resource Center. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 7pp. [PDF, 718KB]; presentation: 9 slides [PDF, 341KB]

(2010) Andreas Eisele & Yu Chen: MultiUN: a multilingual corpus from United Nation documents. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2868-2872. [PDF, 333KB]

Government intelligence and MT

(2010) Nicholas Bemish: Use of HLT tools within the US government. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; abstract

(2010) Sean Colbath: Terminology management for web monitoring. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; abstract

(2010) Kathleen Egan: Cross lingual Arabic blog alerting (COLABA). AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; abstract

(2010) Jocelyn Phillips, Carol Van Ess-Dykema, Timothy Allison & Laurie Gerber: Parallel corpus development at NVTC. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 7pp. [PDF, 173KB]; abstract

(2010) Chuck Simmons: Foreign Media Collaboration Framework (FMCF). AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; abstract

Government policy and MT

(2011) Roberto Cencioni: 3 years already and still a long way to go…  [EC support for LT] META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 18 slides [PDF of PPT, 192KB]

(2011) Svetla Koeva: Furthering natural language processing in Bulgaria. META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 40 slides [PDF of PPT, 3276KB]

(2011) Conrad Toft: Interinstitutional Language Technology Watch: language technology at the European institutions.  META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 19 slides [PDF of PPT, 627KB]

(2011) Hans Uszkoreit: The Strategic Research Agenda for multilingual Europe: summary of discussions and plan for completion.  META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 14 slides [PDF of PPT, 1321KB]

(2011) Hans Uszkoreit: Where do we stand? META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 10 slides [PDF of PPT, 2420KB]

(2010) Roberto Cencioni: EC Info-session on upcoming calls and further actions in support of language technology and innovation. META-FORUM 2010: Challenges for multilingual Europe, November 17/18 2010, Brussels, Belgium; 35 slides [PDF of PPT, 229KB]

(2010) Alice Dijkstra: Dutch/Flemish HLT cooperation.  Translingual Europe 2010, Hotel Maritim, Berlin, Germany, Monday June 7th 2010, Panel on Small Languages; 9pp. [PDF, 440KB]

(2010) Sabine Kirchmeier-Andersen: Linguistic diversity and language change – future challenges for MT. Translingual Europe 2010, Hotel Maritim, Berlin, Germany, Monday June 7th 2010; 34pp. [PDF, 3094KB]

(2010) Swaran Lata: Human language computing in Indian languages – a holistic perspective. META-FORUM 2010: Challenges for multilingual Europe, November 17/18 2010, Brussels, Belgium; 74 slides [PDF of PPT, 7783KB]

Grammar checkers

(2010) Stéphane Huet, Julien Bourdaillet, Alexandre Patry, & Philippe Langlais: The RALI machine translation system for WMT 2010. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 103-109. [PDF, 115KB]

(2010) Sara Stymne & Lars Ahrenberg: Using a grammar checker for evaluation and postprocessing of statistical machine translation. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2175-2181. [PDF,

Hand-held devices see Mobile devices

Hardware see Computer equipment and accessories

Health see Medical texts

Human Language Technology

(2010) Alice Dijkstra: Dutch/Flemish HLT cooperation.  Translingual Europe 2010, Hotel Maritim, Berlin, Germany, Monday June 7th 2010, Panel on Small Languages; 9pp. [PDF, 440KB]

(2010) Huang Heyan: Researches on MT and language information processing of CCLIE & BIT. Translingual Europe 2010, Hotel Maritim, Berlin, Germany, Monday June 7th 2010; 37pp. [PDF, 7229KB]

 (2010) Rūta Marcinkevičienė & Daiva Vitkutė-Adžgauskienė: Developing the human language technology infrastructure in Lithuania.  Human Language Technologies—The Baltic Perspective, 4th International Conference, Riga, Latvia, October 7-8, 2010; 24 slides [PDF of PPT, 6452KB]

 (2010) Einar Meister, Jaak Vilo & Neeme Kahusk: National programme for Estonian language technology: a pre-final summary. Human Language Technologies—The Baltic Perspective, 4th International Conference ,Riga, Latvia, October 7-8, 2010; 24 slides [PDF of PPT, 543KB]

 (2010) Iguna Skadiņa, Ilze Auziņa, Normunds Grūzītis, Kristīna Levāne-Petrova, Gunta Nešpore, Raivis Skadiņš, & Andrejs Vasiļjevs: Language resources and technology for humanities in Latvia 2004-2010.  Human Language Technologies—The Baltic Perspective, 4th International Conference, Riga, Latvia, October 7-8, 2010; 30 slides [PDF of PPT, 1204KB]

Human translation and MT (see also Translation tools and aids for translators, User experiences and needs)

(2010) Giselle de Almeida & Sharon O’Brien: Analysing post-editing performance: correlations with years of translation experience. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 511KB]

(2010) Josep Bonet: Is machine translation ripe for EU translators? Invited talk at: 7th International Workshop on Spoken Language Translation, 2December 2010, Paris, France; 25 slides. [PDF of PPT, 1033KB]

(2010) Nicholas Hartmann: Real voices: what translators do and why we need to keep doing it. Keynote speech at AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 7pp. [PDF, 63KB]

(2010) Nikiforos Karamanis, Saturnino Luz, & Gavin Doherty: Translation practice in the workplace and machine translation. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 542KB]

(2010) Daniel Marcu: Creating value at the boundary between humans and machines. Invited talk at Second joint EM+/CNGL WorkshopBringing MT to the user: research on integrating MT in the translation industry”, AMTA 2010, Denver, Colorado, November 4, 2010; p.1: abstract [PDF, 59KB]

(2010) Chris Wendt: Better translations with user collaboration – integrated MT at Microsoft. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 5pp. [PDF, 807KB]

(2010) Jost Zetzsche: Hostile takeover? Welcome addition? Machine translation enters the world of the translator. Translation Journal 14 (3) July 2010; 2pp. [PDF, 141KB]

Humanitarian aid

(2011) William D.Lewis, Robert Munro, & Stephan Vogel: Crisis MT: developing a cookbook for MT in crisis situations. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.501-511. [PDF, 115KB]

(2010) Lori Thicke: Translation technology for fighting world poverty. Translingual Europe 2010, Hotel Maritim, Berlin, Germany, Monday June 7th 2010; 20pp. [PDF, 12614KB]

Humanities

(2010) Graham Wilcock: Cloud computing for the humanities. Human Language Technologies—The Baltic Perspective, 4th International Conference, Riga, Latvia, October 7-8, 2010; 28 slides [PDF of PPT, 2265KB]

(2010) Iguna Skadiņa, Ilze Auziņa, Normunds Grūzītis, Kristīna Levāne-Petrova, Gunta Nešpore, Raivis Skadiņš, & Andrejs Vasiļjevs: Language resources and technology for humanities in Latvia 2004-2010.  Human Language Technologies—The Baltic Perspective, 4th International Conference, Riga, Latvia, October 7-8, 2010; 30 slides [PDF of PPT, 1204KB]

Images see Visual information

Implementation of MT see Customization and implementation

Information extraction

(2011) Jan Berka, Martin Černý, & Ondřej Bojar: Quiz-based evaluation of machine translation. Prague Bulletin of Mathematical Linguistics, no.95, April 2011; pp.77-86. [PDF, 190KB]

(2011) Matthew Snover, Xiang Li, Wen-Pin Lin, Zheng Chen, Suzanne Tamang, Mingmin Ge, Adam Lee, Qi Li, Hao Li, Sam Anzaroot, & Heng Ji: Cross-lingual slot filling from comparable corpora. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.110-119. [PDF, 271KB]

(2010) Sean Colbath: Terminology management for web monitoring. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; abstract

(2010) Adam Lee, Marissa Passantino, Heng Ji, Guijun Qi, & Thomas Huang: Enhancing multi-lingual information extraction via cross-media inference and fusion.  [PDF, 296KB]

(2010) Yashar Mehdad, Matteo Negri, & Marcello Federico: Towards cross-lingual textual entailment.  NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.321-324. [PDF, 230KB]

(2010) Kristen Summers & Hassan Sawaf: User-generated system for critical document triage and exploitation – version 2011. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; abstract

Information retrieval (see also Keywords and keyphrases, Semantic Web)

(2011) Rohit Bharadwa  j G. & Vasudeva Varma: Language-independent context aware query translation using Wikipedia. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.145-150. [PDF, 195KB]

(2011) Duo Ding: Integrate multilingual web search results using cross-lingual topic models. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.20-24. [PDF, 367KB]

(2011) Michael Elhadad, Meni Adler, Yoav Goldberg, & Rafi Cohen: Topic models for morphologically rich languages and their usage to explore multilingual corpora [abstract]. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 23 January, 2011; presentation: 64 slides [PDF of PPT, 655KB]

 (2011) Manaal Faruqui, Prasenjit Majumder, & Sebastian Padó: Soundex-based translation correction in Urdu-English cross-language information retrieval. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.25-29. [PDF, 326KB]

(2011) Marcello Federico: [review] Cross-language information retrieval [by] Jian-Yun Nie. Computational Linguistics 37 (2), pp. 411-412 [PDF, 37KB]

 (2011) Amir Hazem, Emmanuel Morin & Sebastian Peña Saldarriaga: Bilingual lexicon extraction from comparable corpora as metasearch. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.35-43. [PDF, 159KB]

(2011) Théo Hoffenberg & Christophe Brun-Franc: An innovative platform to allow full translation of internet sites. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.41-45. [PDF, 316KB]

(2011) Jagadeesh Jagarlamudi, Hal Daumé III, & Raghavendra Udupa: From bilingual dictionaries to interlingual document representations. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.147-152. [PDF, 112KB]

(2011) Paul McNamee, James Mayfield, Dawn Lawrie, Douglas W.Oard, & David Doerman: Cross-language entity linking. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.255-263. [PDF, 409KB]

(2011) Alexandre Patry & Philippe Langlais: Identifying parallel documents from a large bilingual collection of texts: application to parallel article extraction in Wikipedia. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.87-95. [PDF, 147KB]

(2011) Bruno Pouliquen & Christophe Mazenc: COPPA, CLIR and TAPTA: three tools to assist in overcoming the patent barrier at WIPO. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.24-30. [PDF, 896KB]

(2011) Nasredine Semmar & Dhouha Bouamor: A new hybrid machine translation approach using cross-language information retrieval and only target text corpora. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; pp.75-79. [PDF, 898KB]

(2011) Simon Shi, Pascale Fung, Emmanuel Prochasson, Chi-kiu Lo, & Dekai Wu: Mining parallel documents using low bandwidth and high precision CLIR from the heterogeneous web. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.420-428. [PDF, 512KB]

(2011) Marco Turchi: Multilingual text mining and machine translation activities carried out at the EC’s Joint Research Center. Invited talk at: Sixth Machine Translation Marathon, 5-10 September 2011, Trento, Italy; 52 slides [PDF of PPT, 4286KB]

(2011) Cross-language information retrieval in QUAERO-MSSE project. (European Machine Translation Projects.) [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; p.355. [PDF, 76KB]

(2011) EASTIN-CL: crosslingual and multimodal search in a portal for support of assisted living. (European Machine Translation Projects.) [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; p.331. [PDF, 138KB]

(2010) Daniel Andrade, Tetsuya Nasukawa, & Jun’ichi Tsujii: Robust measurement and comparison of context similarity for finding translation pairs. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.19-27. [PDF, 709KB]

(2010) Danish Contractor, Govind Kothari, Tanveer A.Faruquie, L.Venkata Subramaniam, & Sumit Negi: Handling noisy queries in cross language FAQ retrieval. [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.87-96. [PDF, 258KB]

(2010) Manoj K.Chinnakotla, Karthik Raman, & Pushpak Bhattacharyya: Multilingual pseudo-relevance feedback: performance study of assisting languages. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.1346-1356. [PDF, 1175KB]

(2010) Achille Falaise, David Rouquet, Didier Schwab, Hervé Blanchon, & Christian Boitet: Ontology driven content extraction using interlingual annotation of texts in the OMNIA project. [Coling 2010] Proceedings of the 4th Workshop on Cross Lingual Information Access, Beijing, China, 28 August 2010; pp.52-60. [PDF, 359KB]

(2010) Benoît Gaillard, Malek Boualem, & Olivier Collin: Query translation using Wikipedia-based resources for analysis and disambiguation. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 579KB]

(2010) Najeh Hajlaoui & Christian Boitet: Multilinguization and personalization of NL-based systems. [Coling 2010] Proceedings of the 4th Workshop on Cross Lingual Information Access, Beijing, China, 28 August 2010; pp.79-87. [PDF, 453KB]

(2010) Rod Holland: Exploring the AFPAK web. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; abstract

(2010) Degen Huang, Lian Zhao, Lishuang Li, & Haitao Yu: Mining large-scale comparable corpora from Chinese-English news collections. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.472-480. [PDF, 255KB]

(2010) Petr Knoth, Trevor Collins, Elsa Sklavounou, & Zdenek Zdrahal: Facilitating cross-language retrieval and machine translation by multilingual domain ontologies. [LREC 2010] Workshop on Supporting eLearning with Language Resources  and Semantic Data, Valletta, Malta, 22 May 2010; 42 slides. [PDF of PPT, 440KB]

(2010) Yashar Mehdad, Matteo Negri, & Marcello Federico: Towards cross-lingual textual entailment.  NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.321-324. [PDF, 230KB]

(2010) John C.Platt, Kristina Toutanova, & Wen-tau Yih: Translingual document representations from discriminative projections. [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.251-261. [PDF, 319KB]

(2010) Stefan Riezler & Yi Liu: Query rewriting using monolingual statistical machine translation. Computational Linguistics 36 (3), pp. 569-582 [PDF, 145KB]

(2010) Fatiha Sadat: Using co-occurrence tendencies to improve cross-language information retrieval. JADT 2010: 10th International Conference on Statistical Analysis of Textual Data, 9-11 juin 2010, Rome, Italie; pp.1101-1111. [PDF, 1846KB]

(2010) Tetsuya Sakai: Multilinguality at NTCIR, and moving on…[abstract]. [Coling 2010] Proceedings of the 4th Workshop on Cross Lingual Information Access, Beijing, China, 28 August 2010; p.2. [PDF, 56KB]

(2010) Xabier Saralegi & Maddalen Lopez de Lacalle: Dictionary and monlingual corpus-based query translation for Basque-English CLIR. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1353-1358. [PDF, 284KB]

(2010) Raghavendra Udupa & Mitesh Khapra: Improving the multilingual user experience of Wikipedia using cross-language name search. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.492-500. [PDF, 246KB]

(2010) Vishal Vachhani, Manoj K.Chinnakotla, Mitesh M.Khapra, & Pushpak Bhattacharyya: More languages, more MAP?: a study of multiple assisting languages in multilingual PRF. [Coling 2010] Proceedings of the 4th Workshop on Cross Lingual Information Access, Beijing, China, 28 August 2010; pp.70-75. [PDF, 1230KB]

(2010) Yuejie Zhang, Yang Wang, Lei Cen, Yanxia Su, Cheng Jin, Xiangyang Xue, & Jianping Fan: Fusion of multiple features and ranking SVM for web-based English-Chinese OOV term translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1435-1443. [PDF, 551KB]

Information services [see also Translation services]

(2011) Pratyush Banerjee, Sudip Kumar Naskar, Johann Roturier, Andy Way, & Josef van Genabith: Domain adaptation in statistical machine translation of user-forum data using component-level mixture modelling. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.285-292. [PDF, 137KB]

(2011) Joseph Mariani: The future European multilingual information society.  META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 26 slides [PDF of PPT, 11,176KB]

(2011) László Podhorányi: Intelligent customer service through language and speech technology at Vodafone Hungary.  META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 11 slides [PDF of PPT, 4231KB]

 (2011) Johann Roturier & Anthony Bensadoun: Evaluation of MT systems to translate user generated content. MT  Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.244-251. [PDF, 284KB]

(2011) Conrad Toft: Interinstitutional Language Technology Watch: language technology at the European institutions.  META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 19 slides [PDF of PPT, 627KB]

 (2011) MORMED – multilingual organic information management in the medical domain. (European Machine Translation Projects.) [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; p.345. [PDF, 71KB]

(2011)  LIWP – EU language industry web platform. (European Machine Translation Projects.) [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; p.339. [PDF, 40KB]

(2010) Arif Bramantoro, Ulrich Schäfer, & Toru Ishida: Towards an integrated architecture for composite language services and multiple linguistic processing components. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3506-3511. [PDF, 375KB]

(2010) Jennifer DeCamp: Language Technology Resource Center. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 7pp. [PDF, 718KB]; presentation: 9 slides [PDF, 341KB]

(2010) Philippe Wacker: Vision group “Media and information services”. META-FORUM 2010: Challenges for multilingual Europe, November 17/18 2010, Brussels, Belgium; 15 slides [PDF of PPT, 125KB]]

Information technology texts see Computer science and information technology texts

Instant messaging see Mobile devices

Internationalization see Localization

Internet [see also Chatroom language, Online MT, World Wide Web]

(2010) Yan Tian: YanFa: an online automatic scoring and intelligent feedback system of student English-Chinese translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Demonstrations volume; pp.25-28. [PDF, 227KB]

Keywords and keyphrases

(2010) Chen Yuncong & Pascale Fung: Unsupervised synthesis of multilingual Wikipedia articles. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.197-205. [PDF, 1040KB]

(2010) Degen Huang, Lian Zhao, Lishuang Li, & Haitao Yu: Mining large-scale comparable corpora from Chinese-English news collections. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.472-480. [PDF, 255KB]

(2010) Liang Tian, Fai Wong, & Sam Chao: An improvement of translation quality with adding key-words in parallel corpus. Ninth International Conference on Machine Learning and Cybernetics Qingdao, China, 11-14 July 2010; pp. 1273-1278. [PDF, 226KB]

Koran see Quran

Language teaching and learning

(2011) Chi-Ho Li, Shujie Liu, Chenguang Wang, & Ming Zhou: ENGtube: an integrated subtitle environment for ESL. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.373-377. [PDF, 360KB]

(2010) Manny Rayner, Pierrette Bouillon, Nikos Tsourakis, Johanna Gerlach, Maria Georgescul, Yukie Nakao, & Claudia Baur: A multilingual CALL game based on speech translation. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1531-1538. [PDF, 573KB]

(2010) Yan Tian: YanFa: an online automatic scoring and intelligent feedback system of student English-Chinese translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Demonstrations volume; pp.25-28. [PDF, 227KB]

(2010) J.Tomás, A.Canovas, J.Lloret, M.García Pineda, & J.L.Abad: Speech translation in pedagogical environment using additional sources of knowledge.  LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1521-1524. [PDF, 442KB]

(2010) Huichao Xue & Rebecca Hwa: Syntax-driven machine translation as a model of ESL revision. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1373-1381. [PDF, 258KB]

Language technology

(2011) Roberto Cencioni: 3 years already and still a long way to go…  [EC support for LT] META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 18 slides [PDF of PPT, 192KB]

(2011) Rose Lockwood: LT from research to innovation: big foot, giant squid, little fishes.  META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 13 slides [PDF of PPT, 1975KB]

(2011) Joseph Mariani: The future European multilingual information society.  META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 26 slides [PDF of PPT, 11,176KB]

(2011) László Podhorányi: Intelligent customer service through language and speech technology at Vodafone Hungary.  META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 11 slides [PDF of PPT, 4231KB]

(2011) Conrad Toft: Interinstitutional Language Technology Watch: language technology at the European institutions.  META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 19 slides [PDF of PPT, 627KB]

(2011) Hans Uszkoreit: Where do we stand? META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 10 slides [PDF of PPT, 2420KB]

(2011) Hans Uszkoreit: The Strategic Research Agenda for multilingual Europe: summary of discussions and plan for completion.  META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 14 slides [PDF of PPT, 1321KB]

(2011) Josef van Genabith: An overview of multilingual LT research in Europe. META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 15 slides [PDF of PPT, 2052KB]

(2011) Tamás Váradi: Hungarian language technology - from platform to alliance.  META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 19 slides [PDF of PPT, 1069KB]

(2011) Andrejs Vasiljevs: The META-NET whitepaper series on European languages.  META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 17 slides [PDF of PPT, 1598KB]

(2010) Hanna Klimek: Towards a business forum for language technology stakeholders. Introduction to session “Language Technology in Practice: Needs – Problems – Wishes”. META-FORUM 2010: Challenges for multilingual Europe, November 17/18 2010, Brussels, Belgium; 8 slides [PDF of PPT, 9403KB]

Lectures and speeches

(2011) Paul Maergner, Ian Lane, & Alex Waibel: Unsupervised vocabulary selection for domain-independent simultaneous lecture translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.89-96. [PDF, 926KB]

(2011) Nick Ruiz & Marcello Federico: Topic adaptation for lecture translation through bilingual latent semantic models. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.294-302. [PDF, 159KB]

(2010) Masaki Murata, Tomohiro Ohno, Shigeki Matsubara, & Yasuyoshi Inagaki: Construction of chunk-aligned bilingual lecture corpus for simultaneous machine translation. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1765-1770. [PDF, 398KB]

(2010) Michael Paul, Marcello Federico, & Sebastian Stüker: Overview of the IWSLT 2010 evaluation campaign. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.3-27. [PDF, 558KB]

Legal aspects

(2010) Gilles Adda & Joseph Mariani: Language resources & Amazon Mechanical Turk: ethical, legal and other issues. LREC 2010: Le gal Issues for Sharing Language Resources - LISLR2010 Workshop, 17 May 2010, Valletta, Malta; 21slides. [PDF of PPT, 244KB]

(2010) John Hendrik Weitzmann & Prodromos Tsiavos: Language resources and legal issues: problems and solutions for basic and industrial research.  META-FORUM 2010: Challenges for multilingual Europe, November 17/18 2010, Brussels, Belgium; 39 slides [PDF of PPT, 1428KB]

(2010) Erik Ketzan: Translation licensing and copyright issues under United States law. LREC 2010: Legal Issues for Sharing Language Resources - LISLR2010 Workshop, 17 May 2010, Valletta, Malta; 21slides. [PDF of PPT, 107KB]

Legal texts

(2011) Michael Elhadad, Meni Adler, Yoav Goldberg, & Rafi Cohen: Topic models for morphologically rich languages and their usage to explore multilingual corpora [abstract]. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 23 January, 2011; presentation: 64 slides [PDF of PPT, 655KB]

Localization

(2011) Frédéric Blain, Jean Senellart, Holger Schwenk, Mirko Plitt, & Johann Roturier: Qualitative analysis of post-editing for high quality machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.164-171. [PDF, 218KB]

(2011) Tomáš Hudík & Achim Ruopp: The integration of Moses into localization industry. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.47-53. [PDF, 449KB]

(2011) Yanjun Ma, Yifan He, & Josef van Genabith: From the confidence estimation of machine translation to the integration of MT and translation memory. Tutorial at MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.13-15. [PDF, 75KB]

(2011) Mirko Plitt & Ventsislav Zhechev: Productive use of MT in localization. Tutorial at MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.15-17. [PDF, 75KB]

(2011) Raivis Skadiņš, Maris Puriņš, Inguna Skadiņa, & Andrejs Vasiļjevs: Evaluation of SMT in localization to under-resourced inflected language. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.35-40. [PDF, 287KB]; presentation, 17 slides [PDF, 796KB]

(2011) Yanli Sun, Juan Li, & Yi Li: Deploying MT into a localisation workflow: pains and gains. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.239-243. [PDF, 83KB]

(2011) Andrejs Vasiļjevs, Raivis Skadiņš, & Inguna Skadiņa: Towards application of user-tailored machine translation in localization.  Proceedings of the Third Joint EM+/CNGL Workshop “Bringing MT to the User: Research Meets Translators” (JEC ’11), Luxembourg, 14 October 2011; pp.23-31. [PDF, 1896KB]

(2010) Doris Albisser: Born global – how to communicate effectively in an international environment. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 2pp. [PDF, 476KB]; presentation: 27 slides [PDF, 553KB]

(2010) Giselle de Almeida & Sharon O’Brien: Analysing post-editing performance: correlations with years of translation experience. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 511KB]

(2010) Sven C.Andrä & Jörg Schütz: Employing machine translation in glocalization tasks – a use case study. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 611KB]; presentation: 35 slides [PDF, 111KB]

(2010) Olga Beregovaya & Alex Yanishevsky: PROMT at PayPal: enterprise-scale MT deployment for financial industry content. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 4pp. [PDF, 253KB]

(2010) Ying-ting Chuang: The concepts of globalization and localization. Translation Journal 14 (3), July 2010; 5pp. [PDF, 100KB]

(2010) Stephen Curran: An open service framework for next generation localisation. LREC 2010: Web Services and Processing Pipelines in HLT - WSPP2010 Workshop, 17 May 2010, Valletta, Malta; 20 slides. [PDF of PPT, 538KB]

(2010) John Dixon: Machine translation in an enterprise environment. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 5pp. [PDF, 126KB]

(2010) Jinhua Du, Johann Roturier, & Andy Way: TMX markup: a challenge when adapting SMT to the localisation environment. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 656KB]

(2010) Raymond Flournoy & Jeff Rueppel: One technology: many solutions. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 6pp. [PDF, 50KB]

(2010) Nikiforos Karamanis, Saturnino Luz, & Gavin Doherty: Translation practice in the workplace and machine translation. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 542KB]

(2010) François Masselot, Petra Ribiczey, & Gema Ramírez-Sánchez: Using the Apertium Spanish-Brazilian Portuguese machine translation system for localization. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 577KB]; presentation: 23 slides [PDF, 569KB]

(2010) Mirko Plitt & François Masselot: A productivity test of statistical machine translation post-editing in a typical localization context. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.7-16. [PDF, 387KB]

(2010) Jörg Porsiel: Preliminary findings of the vision group “Translation and localisation”. META-FORUM 2010: Challenges for multilingual Europe, November 17/18 2010, Brussels, Belgium; 13 slides [PDF of PPT,1692KB]

(2010) Achim Ruopp: The Moses for Localization open source project. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 4pp. [PDF, 111KB]

(2010) Osamuyimen Stewart, David Lubensky, Scott Macdonald, & Julie Marcotte: Using machine translation for localization of electronic support content: evaluating end-user satisfaction. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 6pp. [PDF, 39KB]

(2010) Jaap van der Meer: Let a thousand MT systems bloom. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; abstract: 1p. [PDF, 460KB]

Machine aids for translators see Translation tools and aids for translators

Marketing, sales and product support texts see Business texts

Medical texts

(2011) Han-Bin Chen, Hen-Hsen Huang, Jengwei Tjiu, Ching-Ting Tan, & Hsin-His Chen: Identification and translation of significant patterns for cross-domain SMT applications. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.277-284. [PDF, 137KB]

(2011) Raphael Cohen, Yoav Goldberg, & Michael Elhadad: Transliterated pairs acquisition in medical Hebrew. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 23 January, 2011; 3pp. [PDF, 36KB];  presentation: 37 slides [PDF of PPT, 315KB]

(2011) Michael Elhadad, Meni Adler, Yoav Goldberg, & Rafi Cohen: Topic models for morphologically rich languages and their usage to explore multilingual corpora [abstract]. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 23 January, 2011; presentation: 64 slides [PDF of PPT, 655KB]

(2011) Michael Gasser: Towards synchronous extensible dependency grammar. Proceedings of the Second International Workshop on Free/Open-Source Rule-Based Machine Translation, Barcelona, Spain, January 20-21, 2011, ed. F.Sánchez-Martínez and J.A.Pérez-Ortiz; pp.3-10. [PDF, 265KB]

 (2011) Amir Hazem, Emmanuel Morin & Sebastian Peña Saldarriaga: Bilingual lexicon extraction from comparable corpora as metasearch. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.35-43. [PDF, 159KB]

 (2011) Mitesh M.Khapra, Salil Joshi, & Pushpak Bhattacharyya: It takes two to tango: a bilingual unsupervised approach for estimating sense distributions using expectation maximization. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.695-704. [PDF, 280KB]

(2011) Emmanuel Morin & Emmanuel Prochasson: Bilingual lexicon extraction from comparable corpora enhanced with parallel corpora. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.27-34. [PDF, 125KB]

(2011) María-Dolores Olvera-Lobo & Juncal Guttiérez-Artacho: Multilingual question-answering system in biomedical domain on the Web: an evalution. CLEF 2011: Conference on Multilingual and Multimodal Iinformation Access Evaluation, 19-22 September 2011, Amsterdam; 12slides [PDF of PPT, 413KB]

(2011) Manny Rayner, Paula Estrella, & Pierrette Bouillon: Bootstrapping a statistical speech translator from a rule-based one. Proceedings of the Second International Workshop on Free/Open-Source Rule-Based Machine Translation, Barcelona, Spain, January 20-21, 2011, ed. F.Sánchez-Martínez and J.A.Pérez-Ortiz; pp.21-28. [PDF, 199KB]

 (2011) Mark Seligman & Mike Dillinger: Real-time multi-media translation for healthcare: a usability study. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.595-602. [PDF, 574KB]

 (2011) Marianne Starlander & Paula Estrella: Looking for the best evaluation method for interlingua-based spoken language translation in the medical domain. Proceedings of the 8th international NLPSC workshop. Special theme: Human-machine interaction in translation, Copenhagen Business School, 20-21 August 2011; ed.Bernadette Sharp, Michael Zock, Michael Carl, Arnt Lykke Jakobsen (Copenhagen Studies in Language 41), Frederiksberg: Samfundslitteratur, 2011; pp.81-92. [PDF, 905KB]

(2011) MORMED – multilingual organic information management in the medical domain. (European Machine Translation Projects.) [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; p.345. [PDF, 71KB]

(2010) Marta R.Costa-jussà, Mireia Farrús, José B.Mariño, & José A.R.Fonollosa: Automatic and human evaluation study of a rule-based and a statistical Catalan-Spanish machine translation systems. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1706-1711. [PDF, 335KB]

(2010) Audrey Laroche & Philippe Langlais: Revisiting context-based projection methods for term-translation spotting in comparable corpora. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.617-625. [PDF, 224KB]

(2010) Sara Morrissey, Harold Somers, Robert Smith, Shane Gilchrist & Sandipan Dandapat: Building a sign language corpus for use in machine translation. [LREC 2010] 4th Workshop on the Representation and Processing of Sign Languages: Corpora and Sign Language  Technologies, Malta, May 2010; pp.172-177. [PDF, 588KB]

(2010) Paula Paiva: Corpus representativeness in the selection of medical terms to be used in translation memory tools [abstract]. UCCTS 2010: Using Corpora in Contrastive and Translation Studies, Edge Hill University, UK, 27-29 July 2010; p.15. [PDF, 91KB]

Meetings

(2010) Yang Mei & Katrin Kirchhoff: Contextual modeling for meeting translation using unsupervised word sense disambiguation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.1227-1235. [PDF, 461KB]

Messaging see Mobile devices

Mobile devices

(2011) Vladimir Eidelman, Kristy Hollingshead, & Philip Resnik: Noisy SMS machine translation in low-density languages. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.344-350. [PDF, 133KB]

(2011) Andrew Finch, Wei Song, Kumiko Tanana-Ishii, & Eiichiro Sumita: Source language generation from pictures for machine translation on mobile devices. Proceedings of the 8th international NLPSC workshop. Special theme: Human-machine interaction in translation, Copenhagen Business School, 20-21 August 2011; ed.Bernadette Sharp, Michael Zock, Michael Carl, Arnt Lykke Jakobsen (Copenhagen Studies in Language 41), Frederiksberg: Samfundslitteratur, 2011; pp.263-274. [PDF, 1504KB]

(2011) Sanjika Hewavitharana, Nguyen Bach, Qin Gao, Vamshi Ambati, & Stephan Vogel: CMU Haitian Creole-English translation system for WMT 2011. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.386-392. [PDF, 122KB]

(2011) Chang Hu, Philip Resnik, Yakov Kronrod, Vladimir Eidelman, Olivia Buzek, & Benjamin B.Bederson: The value of monolingual crowdsourcing in a real-world translation scenrio: simulation using Haitian Creole emergency SMS messages. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.399-404 . [PDF, 164KB]

(2011) Deana L.Pennell & Yang Liu: A character-level machine translation approach for normalization of SMS abbreviations. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.974-982. [PDF, 2553KB]

(2011) Sara Stymne: Spell checking techniques for replacment of unknown words and data cleaning for Haitian Creole SMS translation. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.470-477. [PDF, 107KB]

(2011) Fai Wong, Francisco Oliveira, Sam Chao, & Chi-Wai Tang: Handheld machine translation system based on constraint synchronous grammar. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.439-446. [PDF, 158KB]

(2010) Danish Contractor, Govind Kothari, Tanveer A.Faruquie, L.Venkata Subramaniam, & Sumit Negi: Handling noisy queries in cross language FAQ retrieval. [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.87-96. [PDF, 258KB]

(2010) Mikko Kurimo, William Byrne, John Dines, Philip N.Garner, Matthew Gibson, Yong Guan, Teemu Hirsimäki, Reima Karhila, Simon King, Hui Liang, Keiichiro Oura, Lakshmi Saheer, Matt Shannon, Sayaka Shiota, Jilei Tian, Keiichi Tokuda, Mirjam Wester, Yi-Jian Wu, & Junichi Yamagishi: Personalising speech-to-speech translation in the EMIME project. Proceedings of the ACL 2010 System Demonstrations, Uppsala, Sweden, 13 July 2010; pp.48-53. [PDF, 679KB]

Monolingual use

(2011) Chang Hu, Philip Resnik, Yakov Kronrod, Vladimir Eidelman, Olivia Buzek, & Benjamin B.Bederson: The value of monolingual crowdsourcing in a real-world translation scenrio: simulation using Haitian Creole emergency SMS messages. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.399-404 . [PDF, 164KB]

(2010) Philipp Koehn: Enabling monolingual translators: post-editing vs.options. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.537-545. [PDF,

(2010) Yakov Kronrod, Philip Resnik, Olivia Buzek, Chang Hu, Alex Quinn, & Benjamin B.Bederson: Improving translation via targeted paraphrasing. Contribution to workshop of ‘Collaborative translation’ at AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31, 2010; 4pp. [PDF, 79KB]

(2010) Philip Resnik, Olivia Buzek, Chang Hu, Yakov Kronrod, Alex Quinn, & Benjamin B.Bederson: Improving translation via targeted paraphrasing. [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.127-137. [PDF, 548KB]

Mountaineering

(2010) Martin Volk, Noah Bubenhofer, Adrian Althaus, Maya Bangerter, Lenz Furrer, & Beni Ruef: Challenges in building a multilingual Alpine heritage corpus. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1653-1659. [PDF, 1296KB]

Multilingual communication (see also Globalization, Localization)

(2011) Joseph Mariani: The future European multilingual information society.  META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 26 slides [PDF of PPT, 11,176KB]

(2011) Hans Uszkoreit: The Strategic Research Agenda for multilingual Europe: summary of discussions and plan for completion.  META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 14 slides [PDF of PPT, 1321KB]

(2011) Hans Uszkoreit: Where do we stand? META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 10 slides [PDF of PPT, 2420KB]

(2010) Swaran Lata: Human language computing in Indian languages – a holistic perspective. META-FORUM 2010: Challenges for multilingual Europe, November 17/18 2010, Brussels, Belgium; 74 slides [PDF of PPT, 7783KB]

   (2010) Alex Waibel: Preliminary findings of the “Interactive systems” vision group. META-FORUM 2010: Challenges for multilingual Europe, November 17/18 2010, Brussels, Belgium; 15 slides [PDF of PPT, 1059KB]

Multimodal (and multimedia) applications (see also Visual information)

(2011) Mark Seligman & Mike Dillinger: Real-time multi-media translation for healthcare: a usability study. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.595-602. [PDF, 574KB]

(2011) EASTIN-CL: crosslingual and multimodal search in a portal for support of assisted living. (European Machine Translation Projects.) [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; p.331. [PDF, 138KB]

(2010) Gerard de Melo & Gerhard Weikum: Providing multilingual, multimodal answers to lexical database queries. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.348-355. [PDF, 1169KB]

(2010) Yota Georgakopoulou: Challenges for the audiovisual industry in the digital age: accessibility and multilingualism. META-FORUM 2010: Challenges for multilingual Europe, November 17/18 2010, Brussels, Belgium; 16pp. [PDF, 830KB]

(2010) Adam Lee, Marissa Passantino, Heng Ji, Guijun Qi, & Thomas Huang: Enhancing multi-lingual information extraction via cross-media inference and fusion.  [PDF, 296KB]

(2010) Diana Santos, Luís Miguel Cabral, Corina Forascu, Pamela Forner, Frederic Gey, Katrin Lamm, Thomas Mandl, Petya Osenova, Anselmo Peñas, Álvaro Rodrigo, Julia Schulz, Yvonne Skalban, & Erik Tjong Kim Sang: GikiCLEF: crosscultural issues in multilingual information access. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2346-2353. [PDF, 408KB]

(2010) Philippe Wacker: Vision group “Media and information services”. META-FORUM 2010: Challenges for multilingual Europe, November 17/18 2010, Brussels, Belgium; 15 slides [PDF of PPT, 125KB]]

News texts (see also Syndicated feeds)

(2011) Hala Almaghout, Jie Jiang, & Andy Way: CCG contextual labels in hierarchical phrase-based SMT. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.281-288. [PDF, 317KB]; presentation, 25 slides [PDF, 964KB]

(2011) Mauro Cettolo, Nicola Bertoldi, & Marcello Federico: Bootstrapping Arabic-Italian SMT through comparable texts and pivot translation. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.249-256. [PDF, 354KB]; presentation, 12 slides [PDF]

(2011) Alexander Fraser, Marion Weller, Aoife Cahill, & Fabienne Fritzinger: Morphological generation of German for SMT. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 26 January, 2011; 2pp. [PDF, 21KB];  presentation: 44 slides [PDF of PPT, 307KB]

(2011) Souhir Gahbiche-Braham, Hélène Bonneau-Maynard, & François Yvon: Two ways to use a noisy parallel news corpus for improving statistical machine translation. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.44-51. [PDF, 176KB]

(2011) Zhengxian Gong, Guodong Zhou, & Liangyou Li: Improve SMT with source-side “topic-document” distributions. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.496-501. [PDF, 297KB]

(2011) Sylvain Raybaud, David Langlois, & Kamel Smaïli: Broadcast news speech-to-text translation experiments. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.378-381. [PDF, 64KB]

(2011) Christian Rishøj & Anders Søgaard: Factored translation with unsupervised word clusters. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.447-451. [PDF, 163KB]

(2011) Antonio Toral, Federico Gaspari, Sudip Kumar Naskar, & Andy Way: Comparative evaluation of research vs. online MT systems. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.13-20. [PDF, 163KB]; presentation, 22 slides [PDF, 173KB]

(2011) Marco Turchi: Multilingual text mining and machine translation activities carried out at the EC’s Joint Research Center. Invited talk at: Sixth Machine Translation Marathon, 5-10 September 2011, Trento, Italy; 52 slides [PDF of PPT, 4286KB]

(2010) Marta R.Costa-jussà, Mireia Farrús, José B.Mariño, & José A.R.Fonollosa: Automatic and human evaluation study of a rule-based and a statistical Catalan-Spanish machine translation systems. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1706-1711. [PDF, 335KB]

(2010) Vladimir Eidelman, Chris Dyer, & Philip Resnik: The University of Maryland statistical machine translation system for the Fifth Workshop on Machine Translation.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 72-76. [PDF, 115KB]

(2010) Thoudam Doren Singh & Sivaji Bandyopadhyay: Manipuri-English example based machine translation system [abstract]. CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 64KB]

(2010) İlknur Durgar El-Kahlout & François Yvon: The pay-offs of preprocessing for German-English statistical machine translation. [PDF, 301KB]

(2010) Andrew Finch & Eiichiro Sumita: A Bayesian model of bilingual segmentation for transliteration. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.259-266. [PDF, 511KB]

(2010) Carlos A.Henríquez Q., Marta R.Costa-jussà, Vidas Daudaravicius, Rafael E.Banchs, & José B.Mariño: Using collocation segmentation to augment the phrase table. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 98-102. [PDF, 1398KB]

(2010) Hieu Hoang & Philipp Koehn: Improved translation with source syntax labels. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 409-417. [PDF, 1079KB]

(2010) Degen Huang, Lian Zhao, Lishuang Li, & Haitao Yu: Mining large-scale comparable corpora from Chinese-English news collections. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.472-480. [PDF, 255KB]

(2010) Adam Pauls, Dan Klein, David Chiang, & Kevin Knight: Unsupervised syntactic alignment with inversion traduction grammars. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.118-126. [PDF, 766KB]

(2010) Juan Pino, Gonzalo Iglesias, Adrià de Gispert, Graeme Blackwood, Jamie Brunning, & William Byrne: The CUED HiFST system for the WMT10 translation shared task. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 155-160. [PDF, 97KB]

(2010) Marion Potet, Laurent Besacier, & Hervé Blanchon: The LIG machine translation system for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 161-166. [PDF, 81KB]

(2010) Germán Sanchis-Trilles, Jesús Andrés-Ferrer, Guillem Gascó, Jesús González-Rubio, Pascual Martínez-Gómez, Martha-Alicia Rocha, Joan-Andreu Sánchez, & Francisco Casacuberta: UPV-PRHLT English-Spanish system for WMT10. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 172-176. [PDF, 90KB]

(2010) Libin Shen, Jinxi Xu, & Ralph Weischedel: String-to-dependency statistical machine translation. Computational Linguistics 36 (4), pp. 649-671 [PDF, 233KB]

(2010) Daniel Stein, Jens Forster, Uwe Zelle, Philippe Dreuw, & Hermann Ney: RWTH-Phoenix: analysis of the German sign language corpus. [LREC 2010] 4th Workshop on the Representation and Processing of Sign Languages: Corpora and Sign Language  Technologies, Malta, May 2010; pp.225-230. [PDF, 2678KB]

(2010) Jörg Tiedemann: To cache or not to cache? Experiments with adaptive models in statistical machine translation.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 189-194. [PDF, 127KB]

Official documents

(2010) Tom Vanallemeersch: Belgisch Staatsblad corpus: retrieving French-Dutch sentences from official documents. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3413-3416. [PDF, 273KB]

Online MT

(2011) Sankaranarayanan Ananthakrishnan, Rohit Prasad, & Prem Natarajan: On-line language model biasing for statistical machine translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.445-449. [PDF, 95KB]

(2011) Marija Brkić, Sanja Seljan, & Maja Matetić: Machine translation evaluation for Croatian-English and English-Croation language pairs. Proceedings of the 8th international NLPSC workshop. Special theme: Human-machine interaction in translation, Copenhagen Business School, 20-21 August 2011; ed.Bernadette Sharp, Michael Zock, Michael Carl, Arnt Lykke Jakobsen (Copenhagen Studies in Language 41), Frederiksberg: Samfundslitteratur, 2011; pp.93-104. [PDF, 768KB]

(2011) Alexandru Ceauşu, John Tinsley, Jian Zhang, & Andy Way: Experiments on domain adaptation for patent machine translation in the PLuTO project. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.21-28. [PDF, 832KB]; presentation, 16 slides [PDF, 824KB]

(2011) Heidi Depraetere, Joachim Van den Bogaert, & Joeri Van de Walle: Bologna translation service: online translation of course syllabi and study programmes in English. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.29-34. [PDF, 164KB]

(2011) Théo Hoffenberg & Christophe Brun-Franc: An innovative platform to allow full translation of internet sites. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.41-45. [PDF, 316KB]

(2011) Xavier Ivars-Ribes & Victor M.Sánchez-Cartagena: A widely used machine translation service and its migration to a free/open-source solution: the case of Softcatalà. Proceedings of the Second International Workshop on Free/Open-Source Rule-Based Machine Translation, Barcelona, Spain, January 20-21, 2011, ed. F.Sánchez-Martínez and J.A.Pérez-Ortiz; pp.61-68. [PDF, 385KB]

(2011) Aarne Ranta: MOLTO: multilingual on-line translation. Invited talk at: Second International Workshop on Free/Open-Source Rule-Based Machine Translation, Barcelona, Spain, January 20-21, 2011, ed. F.Sánchez-Martínez and J.A.Pérez-Ortiz; p.1. [PDF, 12KB]

(2011) László Tihanyi: Introducing iTranslate4 – internet translation systems for all European languages.  META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 16 slides [PDF of PPT, 2441KB]

(2011) Antonio Toral, Federico Gaspari, Sudip Kumar Naskar, & Andy Way: Comparative evaluation of research vs. online MT systems. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.13-20. [PDF, 163KB]; presentation, 22 slides [PDF, 173KB]

(2011) Andy Way, Kenny Holden, Lee Ball, & Gavin Wheeldon: SmartMATE: online self-serve access to state-of-the-art SMT.  Proceedings of the Third Joint EM+/CNGL Workshop “Bringing MT to the User: Research Meets Translators” (JEC ’11), Luxembourg, 14 October 2011; pp.43-52. [PDF, 1175KB]

(2011) MOLTO – multilingual online translation. (European Machine Translation Projects.) [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; p.341. [PDF, 106KB]

(2011) iTranslate4: Internet translation systems for all European languages. (European Machine Translation Projects.) [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; p.335. [PDF, 82KB]

(2010) Tim Adams: Finally, you are speaking my language. The Observer, 19 December 2010: The new review, pp.22-23. [PDF, 22KB]

(2010) Miguel Helft: Google’s computing power refines translation tool. New York Times, March 8, 2010. [PDF, 49KB]

(2010) Luciana Cecilia Ramos: Post-editing free machine translation: from a language vendor’s perspective. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 5pp. [PDF, 37KB]

(2010) Aarne Ranta: Multilingual packages of controlled languages: an introduction to GF. CNL2010: 2nd Workshop on Controlled Natural Language, Marettimo Island, Sicily, September 13-15, 2010; 83 slides [PDF of PPT, 1140KB]

(2010) Víctor M.Sánchez-Cartagena & Juan Antonio Pérez-Ortiz : ScaleMT : a free/open-source framework for building scalable machine translation web services. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.97-106. [PDF, 140KB]

(2010) iTranslate4.eu: Internet translators for all European languages. (European Community supported project.) Presented at EAMT 2010: 14th Annual conference of the European Association for Machine Translation, 28 May 2010, Saint-Raphaël, France. 1p. [PDF, 843KB]

(2010) Andrejs Vasiļjevs, Tatiana Gornostoy, & Raivis Skadiņš: LetsMT!  - online platform for sharing training data and building user-tailored machine translation. Human Language Technologies – the Baltic Perpective, 4th International Conference, Riga, Latvia, October 8, 2010; 18 slides [PDF of PPT, 727KB]

(2010) Jost Zetzsche: Consider the Luddites. Translation Journal 14 (2) April 2010; 4pp. [PDF, 150KB]

Open source

(2011) Steven Abney & Steven Bird: Towards a data model for the universal corpus. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.120-127. [PDF, 118KB]

(2011) Martha Dís Brandt, Hrafn Loftsson, Hlynur Sigurþórsson, & Francis M.Tyers: Apertium-IceNLP: a rule-based Icelandic to English machine translation system. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.217-224. [PDF, 332KB]; presentation, 29 slides [PDF]

(2011) Josep M.Crego, François Yvon, & José B.Mariño: Ncode: an open source bilingual n-gram SMT toolkit. Sixth Machine Translation Marathon, 5-10 September 2011, Trento; Prague Bulletin of Mathematical Linguistics, no.96, October 2011; pp.49-58. [PDF, 190KB]; presentation, 59 slides [PDF of PPT, 878KB]

(2011) Philipp Koehn: Moses statistical machine translation system.  META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 12 slides [PDF of PPT, 492KB]

(2011) Aaron B.Phillips & Ralf D.Brown: Training machine translation with a second-order Taylor approximation of weighted translation instances. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.40-47. [PDF, 198KB]

(2011) Stelios Piperidis: META-SHARE: an open resource exchange infrastructure for stimulating research and innovation.  META-FORUM 2011: Solutions for multilingual Europe, June 27/28 2011, Hotel Marriott, Budapest, Hungary; 37 slides [PDF of PPT, 5302KB]

(2011) Maja Popović: Hjerson: an open source tool for automatic error classification of machine translation output. Sixth Machine Translation Marathon, 5-10 September 2011, Trento; Prague Bulletin of Mathematical Linguistics, no.96, October 2011; pp.59-67. [PDF, 121KB]; presentation, 20 slides [PDF of PPT, 226KB]

(2011) Felipe Sánchez-Martínez and Juan Antonio Pérez-Ortiz (eds.): Proceedings of the Second International Workshop on Free/Open-Source Rule-Based Machine Translation, 20-21 January 2011, Barcelona, Spain

(2011) Daniel Stein, David Vilar, Stephan Peitz, Markus Freitag, Matthias Huck, & Hermann Ney: A guide to Jane, an open source hierarchical translation toolkit. Prague Bulletin of Mathematical Linguistics, no.95, April 2011; pp.5-18. [PDF, 192KB]

(2011) Andrejs Vasiļjevs, Raivis Skadiņš, & Jörg Tiedemann: LetsMT!: cloud-based platform for building user tailored machine translation engines. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.507-511. [PDF, 211KB]

(2011) Jonathan Weese, Juri Ganitkevitch, Chris Callison-Burch, Matt Post, & Adam Lopez: Joshua 3.0: syntax-based machine translation with the Thrax grammar extractor. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.478-484. [PDF, 164KB]

(2011) Omar F.Zaidan: MAISE: a flexible, configurable, extensible open source package for mass AI system evaluation. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.130-134. [PDF, 108KB]

(2011) [LIHMT 2011] Introductions, and About the OpenMT-2 project; 1p. [PDF, 124KB]

(2010) proceedings of Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010.

(2010) Loïc Barrault: MANY: open source MT system combination at WMT’10. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 271-275. [PDF, 101KB]

(2010) Nicola Bertoldi: IRSTLM toolkit. Presentation at Fifth Machine Translation Marathon, 13-18 September 2010, University of Le Mans; 34 slides. [PDF, 381KB]

(2010) Anton Bryl & Josef van Genabith: f-align: an open-source alignment tool for LFG f-structures. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 8pp. [PDF, 229KB]

(2010) Dana Dannélls & John J.Camilleri: Verb morphology of Hebrew and Maltese – towards an open source type theoretical resource grammar in GF. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, Valletta, Malta, 17 May 2010; pp. 57-61. [PDF, 326KB]

(2010) Jo Drugan & Bogdan Babych: Shared resources, shared values? Ethical implications of sharing translation resources. JEC 2010: Second joint EM+/CNGL WorkshopBringing MT to the user: research on integrating MT in the translation industry”, AMTA 2010, Denver, Colorado, November 4, 2010; pp.3-9. [PDF, 9,433KB]

(2010) Chris Dyer, Adam Lopez, Juri Ganitkevitch, Jonathan Weese, Ferhan Ture, Phil Blunsom, Hendra Setiawan, Vladimir Eidelman, & Philip Resnik: cdec: a decoder, alignment, and learning framework for finite-state and context-free translation models. Proceedings of the ACL 2010 System Demonstrations, Uppsala, Sweden, 13 July 2010; pp.7-12. [PDF, 171KB]

(2010) Christian Federmann: Appraise: an open-source toolkit for manual phrase-based evaluation of translations. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1731-1734. [PDF, 368KB]

(2010) Christian Federmann & Andreas Eisele: MT Server Land: an open-source MT architecture. Fifth Machine Translation Marathon, 13-18 September, University of Le Mans; Prague Bulletin of Mathematical Linguistics, no.94, September 2010; pp.57-66. [PDF, 226KB]; presentation [PDF, 509KB]

(2010) Mikel L.Forcada, Boyan Ivanov Bonev, Sergio Ortiz Rojas, Juan Antonio Pérez Ortiz, Gema Ramírez Sánchez, Felipe Sánchez Martínez, Carme Armentano-Oller, Marco A.Montava, & Francis M.Tyers: Documentation of the open-source shallow-transfer machine translation platform Apertium; ed. Mireia Ginestí Rosell. Departament de Llenguatges i Sistemes Informàtics, Universitat d’Alacant, March 10, 2010; 214pp. [PDF, 700KB]

(2010) Mikel L.Forcada: Free/open-source machine translation: the Apertium platform. Translingual Europe 2010, Hotel Maritim, Berlin, Germany, Monday June 7th 2010; 17pp. [PDF, 105KB]

(2010) Philipp Koehn & Hieu Hoang: Moses: machine translation with open source software. Tutorial at AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, November 4, 2010; 29 slides [PDF of PPT, 520KB]

(2010) Zhifei Li, Chris Callison-Burch, Chris Dyer, Juri Ganitkevitch, Ann Irvine, Sanjeev Khudanpur, Lane Schwartz, Wren N.G.Thornton, Ziyuan Wang, Jonathan Weese, & Omar F.Zaidan: Joshua 2.0: a toolkit for parsing-based machine translation with syntax, semirings, discriminative training and other goodies. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 133-137. [PDF, 101KB]

(2010) Aaron B.Phillips: The Cunei machine translation platform for WMT’10. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 149-154. [PDF, 121KB]

(2010) Stelios Piperidis: META-SHARE: the open resource exchange facility. META-FORUM 2010: Challenges for multilingual Europe, November 17/18 2010, Brussels, Belgium; 32 slides [PDF of PPT, 6471KB]

(2010) Ting Qian, Kristy Hollingshead, Su-youn Yoon, Kyoung-young Kim, & Richard Sproat: A Python toolkit for universal transliteration. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2897-2901. [PDF, 576KB]

(2010) Aarne Ranta, Krasimir Angelov, & Thomas Hallgren: Tools for multilingual grammar-based translation on the web. Proceedings of the ACL 2010 System Demonstrations, Uppsala, Sweden, 13 July 2010; pp.66-71. [PDF, 313KB]

(2010) Manny Rayner, Pierrette Bouillon, Nikos Tsourakis, Johanna Gerlach, Maria Georgescul, Yukie Nakao, & Claudia Baur: A multilingual CALL game based on speech translation. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1531-1538. [PDF, 573KB]

(2010) Antoine Rey: GlobalSight MT integration. Translingual Europe 2010, Hotel Maritim, Berlin, Germany, Monday June 7th 2010; 5pp. [PDF, 1048KB]

(2010) Achim Ruopp: The Moses for Localization open source project. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 4pp. [PDF, 111KB]

(2010 Lane Schwartz: Reproducible results in parsing-based machine translation: the JHU shared task submission.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 177-182. [PDF, 101KB]

(2010) Daniel Stein, David Vilar, Stephan Peitz, & Hermann Ney: Jane: a guide to RWTH’s hierarchical machine translation toolkit. Presentation at Fifth Machine Translation Marathon, 13-18 September, University of Le Mans; 29 slides [PDF, 175KB]

(2010) Josef van Genabith: EuroMatrixPlus – evaluation, localisation, open source. Translingual Europe 2010, Hotel Maritim, Berlin, Germany, Monday June 7th 2010; 24pp. [PDF, 2818KB]

(2010) David Vilar, Daniel Stein, Matthias Huck, & Hermann Ney: Jane: open source hierarchical translation, extended with reordering and lexicon models. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 262-270. [PDF, 136KB]

(2010) E.Yuste, M.Herranz, A.-L.Lagarda, L.Tarazón, I.Sánchez-Cortina, & F.Casacuberta: PangeaMT – putting open standards to work…well. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 8pp. [PDF, 480KB]

Optical character recognition see Character recognition

Patents

(2011) Proceedings of the 4th Workshop on Patent Translation, Shoi chi Yokoyama (ed,), MT Summit XIII workshop, Xiamen, China, September 23, 2011.

(2011) Alexandru Ceauşu, John Tinsley, Jian Zhang, & Andy Way: Experiments on domain adaptation for patent machine translation in the PLuTO project. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.21-28. [PDF, 832KB]; presentation, 16 slides [PDF, 824KB]

(2011) Chooi-Ling Goh, Takashi Onishi, & Eiichiro Sumita: Rule-based reordering constraints for phrase-based SMT. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.113-120. [PDF, 446KB]

(2011) Isao Goto, Masao Utiyama, Takashi Onishi, & Eiichiro Sumita: A comparison study of parsers for patent translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.448-455. [PDF, 156KB]

(2011) Bin Lu, Ka Po Chow, & Benjamin K.Tsou: The cultivation of a Chinese-English-Japanese trilingual parallel corpus from comparable patents. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.472-479. [PDF, 207KB]

(2011) Bruno Pouliquen & Christophe Mazenc: COPPA, CLIR and TAPTA: three tools to assist in overcoming the patent barrier at WIPO. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.24-30. [PDF, 896KB]

(2011) Bruno Pouliquen, Christophe Mazenc & Aldo Iorio: Tapta: a user-driven translation system for patent documents based on domain-aware statistical machine translation. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.5-12. [PDF, 342KB]; presentation, 15 slides [PDF, 1642KB]

(2011) Patrick Simianer, Katharina Wäschle, & Stefan Riezler: Multi-task minimum error rate training for SMT. Sixth Machine Translation Marathon, 5-10 September 2011, Trento; Prague Bulletin of Mathematical Linguistics, no.96, October 2011; pp.99-108. [PDF, 138KB]; presentation, 45 slides [PDF of PPT, 295KB]

(2011) Linfeng Song, Haitao Mi, Yajuan Lü, & Qun Liu: Bagging-based system combination for domain adaptation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.293-298. [PDF,

(2011) Katsuhito Sudoh, Xianchao Wu, Kevin Duh, Hajime Tsukada, & Masaaki Nagata: Post-ordering in statistical machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.316-323. [PDF, 329KB]

(2011) Wolfgang Täger: The sentence-aligned European patent corpus. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.177-184. [PDF, 234KB]

(2011) PLuTO – patent language translations online. (European Machine Translation Projects.) [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; p.351. [PDF, 110KB]

(2010) Georg Artelsmair: Recent developments in machine translation policy at the European Patent Office. META-FORUM 2010: Challenges for multilingual Europe, November 17/18 2010, Brussels, Belgium; 22 slides [PDF of PPT, 709KB]

(2010) Patrizia Biani: The evolution of the European machine translation programme at the European Patent Office. Translingual Europe 2010, Hotel Maritim, Berlin, Germany, Monday June 7th 2010; 20pp. [PDF, 4965KB]

(2010) Patrizia Biani: The evolution of the European machine translation programme at the European Patent Office. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; abstract: 1 p. [PDF, 466KB]; presentation: 19 slides [PDF, 498KB]

(2010) Hiroyuki Kaji, Takashi Tsunakawa, & Daisuke Okada: Using comparable corpora to adapt a translation model to domains. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2182-2188. [PDF, 432KB]

(2010) Bin Lu, Tao Jiang, Kapo Chow, & Benjamin K. Tsou: Building a large English-Chinese parallel corpus from comparable patents and its experimental application to SMT.  [LREC 2010] Proceedings of the 3rd Workshop on Building and Using Comparable Corpora, Malta, 22 May 2010; pp.42-49. [PDF, 271KB]

(2010) Taiji Nagasaka, Ran Shimanouchi, Akiko Sakamoto, Takafumi Suzuki, Yohei Morishita, Takehito Utsuro, & Suguru Matsuyoshi: Utilizing semantic equivalence classes of Japanese functional expressions in translation rule acquisition from parallel patent sentences. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1777-1785. [PDF, 942KB]

(2010) John Tinsley: PluTO: patent language translation online. (European Community supported project.) Presented at EAMT 2010: 14th Annual conference of the European Association for Machine Translation, 28 May 2010, Saint-Raphaël, France. 4 slides. [PDF, 1389KB]

(2010) John Tinsley, Andy Way, & Páraic Sheridan: PLuTO: MT for online patent translation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 8pp. [PDF, 111KB]

Plagiarism

(2010) Alberto Barrón-Cedeño, Paolo Rosso, Eneko Agirre, & Gorka Labaka: Plagiarism detection across distant language pairs. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.37-45. [PDF, 274KB]

Poetry

(2010) Dmitriy Genzel, Jakob Uszkoreit, & Franz Och: “Poetic” statistical machine translation: rhyme and meter. [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.158-166. [PDF, 124KB]

Portable devices see Mobile devices

Portals for MT services

(2011) Andy Way, Kenny Holden, Lee Ball, & Gavin Wheeldon: SmartMATE: online self-serve access to state-of-the-art SMT.  Proceedings of the Third Joint EM+/CNGL Workshop “Bringing MT to the User: Research Meets Translators” (JEC ’11), Luxembourg, 14 October 2011; pp.43-52. [PDF, 1175KB]

Post-editing and post-processing [see also Crowd sourcing]

(2011) Hanna Béchara, Yanjun Ma, & Josef van Genabith: Statistical post-editing for a statistical MT system. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.308-315. [PDF, 75KB]

(2011) Frédéric Blain, Jean Senellart, Holger Schwenk, Mirko Plitt, & Johann Roturier: Qualitative analysis of post-editing for high quality machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.164-171. [PDF, 218KB]

(2011) Michael Carl, Barbara Dragsted, Jakob Elming, Daniel Hardt, & Arnt Lykke Jakobsen: The process of post-editing: a pilot study. Proceedings of the 8th international NLPSC workshop. Special theme: Human-machine interaction in translation, Copenhagen Business School, 20-21 August 2011; ed.Bernadette Sharp, Michael Zock, Michael Carl, Arnt Lykke Jakobsen (Copenhagen Studies in Language 41), Frederiksberg: Samfundslitteratur, 2011; pp.131-142. [PDF, 1078KB]

(2011) Ann Clifton & Anoop Sarkar: Combining morpheme-based machine translation with post-processing morpheme prediction. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.32-42. [PDF, 287KB]

(2011) Miquel Esplà, Felipe Sánchez-Martínez, & Mikel L.Forcada: Using word alignments to assist computer-aided translation users by marking which target-side words to change or keep unedited. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.81-88. [PDF, 381KB]; presentation, 41 slides [PDF]

(2011) Miquel Esplà-Gomis, Felipe Sánchez-Martínez & Mikel L.Forcada: Using machine translation in computer-aided translation to suggest the target-side words to change. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.172-179. [PDF,160KB]

(2011) Federico Gaspari, Antonio Toral & Sudip Kumar Naskar: User-focused task-oriented MT evaluation for wikis: a case study. Proceedings of the Third Joint EM+/CNGL Workshop “Bringing MT to the User: Research Meets Translators” (JEC ’11), Luxembourg, 14 October 2011; pp.13-22. [PDF, 961KB]

(2011) Vishal Goyal & Gurpreet Singh Lehal: Hindi to Punjabi machine translation system. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, Portland, Oregon, June 19-24, 2011; pp.1-6. [PDF, 247KB]

(2011) Chang Hu, Philip Resnik, Yakov Kronrod, Vladimir Eidelman, Olivia Buzek, & Benjamin B.Bederson: The value of monolingual crowdsourcing in a real-world translation scenrio: simulation using Haitian Creole emergency SMS messages. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.399-404 . [PDF, 164KB]

(2011) Yanjun Ma, Yifan He, & Josef van Genabith: From the confidence estimation of machine translation to the integration of MT and translation memory. Tutorial at MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.13-15. [PDF, 75KB]

(2011) David Mareček, Rudolf Rosa, Petra Galuščáková, & Ondřej Bojar: Two-step translation with grammatical post-processing. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.426-432. [PDF, 119KB]

(2011) Marion Potet, Raphaël Rubino, Benjamin Lecouteux, Stéphane Huet, Hervé Blanchon, Laurent Besacier, & Fabrice Lefèvre: The LIGA (LIG/LIA) machine translation system for WMT 2011. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.440-446. [PDF, 92KB]

(2011) Marion Potet, Emmanuelle Esperança-Rodier, Hervé Blanchon, & Laurent Besacier: Preliminary experiments on using users’ post-editions to enhance a SMT system. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.161-168. [PDF, 346KB]

(2011) Luciana M.Scocco: Machine translation aid to the test: a study on the benefits of post-editing for translation into the weaker language. Dissertation submitted as part requirement for Master of Arts Translation and Interpreting Studies, University of Western Sydney, 4th November 2011; 14pp. [PDF, 170KB]

(2011) Lucia Specia: Exploiting objective annotations for measuring translation post-editing effort. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.73-80. [PDF, 365KB]; presentation: 94 slides [PDF]

(2011) Sara Stymne: Pre- and postprocessing for statistical machine translation into Germanic languages. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Student Session, Portland, Oregon, June 19-24, 2011; pp.12-17. [PDF, 93KB]

(2011) Katsuhito Sudoh, Xianchao Wu, Kevin Duh, Hajime Tsukada, & Masaaki Nagata: Post-ordering in statistical machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.316-323. [PDF, 329KB]

(2011) Hirokazu Suzuki: Automatic post-editing based on SMT and its selective application by sentence-level automatic quality evaluation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.156-163. [PDF, 725KB]

(2011) Arda Tezcan & Vincent Vandeghinste: SMT-CAT integration in a technical domain: handling XML markup using pre & post-processing methods. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.55-62. [PDF, 343KB]

(2011) Arnaud Vié, Luis Villarejo Muñoz, Mireia Farrús Cabeceran, & Jimmy O’Regan: Apertium advanced web interface: a first step towards interactivity and language tools convergence. Proceedings of the Second International Workshop on Free/Open-Source Rule-Based Machine Translation, Barcelona, Spain, January 20-21, 2011, ed. F.Sánchez-Martínez and J.A.Pérez-Ortiz; pp.45-51. [PDF, 280KB]

(2011) Lucas Nunes Vieira & Lucia Specia: A review of translation tools from a post-editing perspective. Proceedings of the Third Joint EM+/CNGL Workshop “Bringing MT to the User: Research Meets Translators” (JEC ’11), Luxembourg, 14 October 2011; pp.33-42. [PDF, 801KB]

(2011) Na Ye & Guiping Zhang: Study on the impact factors of the translators’ post-editing efficiency in a collaborative translation environment. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.579-586. [PDF, 277KB]

(2011) Omar F.Zaidan & Chris Callison-Burch: Crowdsourcing translation: professional quality from non-professionals. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1220-1229. [PDF, 241KB]

(2010) Giselle de Almeida & Sharon O’Brien: Analysing post-editing performance: correlations with years of translation experience. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 511KB]

(2010) Adriana Beaton & Gabriela Contreras: Sharing the Continental Airlines and SDL post-editing experience. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 3pp. [PDF, 33KB]

(2010) Christian Boitet, Huynh Cong Phap, Nguyen Hong Thai, & Valérie Bellynck: The iMAG concept: multilingual access gateway to an elected web sites with incremental quality increase through collaborative post-edition of MT pretranslations. TALN 2010. Proceedings of Traitement Automatique du Langage Naturel, 19-23 juillet 2010. Montréal, Canada. 8pp. [PDF, 1585KB]

(2010) Ilse Depraetere: What counts as useful advice in a university post-editing training context? Report on a case study. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 7pp. [PDF, 560KB]

(2010) Jinhua Du, Johann Roturier, & Andy Way: TMX markup: a challenge when adapting SMT to the localisation environment. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 656KB]

(2010) Daniel Hardt & Jakob Elming: Incremental re-training for post-editing SMT. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 191KB]

(2010) Yifan He, Yanjun Ma, Johann Roturier, Andy Way, & Josef van Genabith: Improving the post-editing experience using translation recommendation: a user study. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 136KB]

(2010) Yifan He, Yanjun Ma, Andy Way, & Josef van Genabith: Integrating n-best SMT outputs into a TM system. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.374-382. [PDF, 629KB]

(2010) Philipp Koehn: Enabling monolingual translators: post-editing vs.options. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.537-545. [PDF,

(2010) Jae-Hee Lee, Seung-Wook Lee, Gumwon Hong, Young-Sook Hwang, Sang-Bum Kim, & Hae-Chang Rim: A post-processing approach to statistical word alignment reflecting alignment tendency between part-of-speeches. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.623-629. [PDF, 438KB]

 (2010) Elliott Macklovitch & Laurie Gerber: MT postediting showcase. At AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31, 2010; 8 slides

 (2010) Sharon O’Brien: Introduction to post-editing: who, what, how and where to next? Tutorial at AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 30 slides [PDF of PPT, 269KB]

 (2010) Mirko Plitt & François Masselot: A productivity test of statistical machine translation post-editing in a typical localization context. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.7-16. [PDF, 387KB]

 (2010) Luciana Cecilia Ramos: Post-editing free machine translation: from a language vendor’s perspective. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 5pp. [PDF, 37KB]

 (2010) Víctor M.Sánchez-Cartagena & Juan Antonio Pérez-Ortiz: Tradubi: open-source social translation for the Apertium machine translation platform. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.47-56. [PDF, 223KB]

(2010) Barbara Scott & Adriana Beaton: Evaluating vendors for machine translation and post-editing. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 2pp. [PDF, 30KB]

 (2010) Lucia Specia & Atefeh Farzindar: Estimating machine translation post-editing effort with HTER. JEC 2010: Second joint EM+/CNGL WorkshopBringing MT to the user: research on integrating MT in the translation industry”, AMTA 2010, Denver, Colorado, November 4, 2010; pp.33-41. [PDF, 178KB]

 (2010) Midori Tatsumi & Johann Roturier: Source text characteristics and technical and temporal post-editing effort: what is their relationship? JEC 2010: Second joint EM+/CNGL WorkshopBringing MT to the user: research on integrating MT in the translation industry”, AMTA 2010, Denver, Colorado, November 4, 2010; pp.43-51. [PDF, 10,492KB]

 (2010) Irina Temnikova: Cognitive evaluation approach for controlled language post-editing experiment. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3485-3490. [PDF, 631KB]

(2010) Carol Van Ess-Dykema, Jocelyn Phillips, Florence Reeder, & Laurie Gerber: Paralinguist assessment decision factors for machine translation output: a case study. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 7pp. [PDF, 180KB]; abstract

(2010) Chris Wendt: Better translations with user collaboration – integrated MT at Microsoft. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 5pp. [PDF, 807KB]

Pre-editing and Pre-processing (see also Controlled languages)

(2011) Vishal Goyal & Gurpreet Singh Lehal: Hindi to Punjabi machine translation system. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, Portland, Oregon, June 19-24, 2011; pp.1-6. [PDF, 247KB]

(2011) Maxim Khalilov & Khalil Sima’an: ILLC-UvA translation system for EMNLP-WMT 2011. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.413-419. [PDF, 591KB]

(2011) Ma Jianjun, Huang Degen, Liu Haixia, & Sheng Wenfeng: POS tagging of English particles for machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.57-63. [PDF, 222KB]

(2011) Santanu Pal, Tanmoy Chakraborty, & Sivaji Bandyopadhyay: Handling multiword expressions in phrase-based statistical machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.215-224. [PDF, 161KB]

(2011) Sara Stymne: Definite noun phrases in statistical machine translation into Scandinavian languages. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.289-296. [PDF, 336KB]; presentation, 50 slides [PDF]

 (2011) Sara Stymne: Pre- and postprocessing for statistical machine translation into Germanic languages. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Student Session, Portland, Oregon, June 19-24, 2011; pp.12-17. [PDF, 93KB]

 (2011) Arda Tezcan & Vincent Vandeghinste: SMT-CAT integration in a technical domain: handling XML markup using pre & post-processing methods. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.55-62. [PDF, 343KB]

(2011) Arnaud Vié, Luis Villarejo Muñoz, Mireia Farrús Cabeceran, & Jimmy O’Regan: Apertium advanced web interface: a first step towards interactivity and language tools convergence. Proceedings of the Second International Workshop on Free/Open-Source Rule-Based Machine Translation, Barcelona, Spain, January 20-21, 2011, ed. F.Sánchez-Martínez and J.A.Pérez-Ortiz; pp.45-51. [PDF, 280KB]

(2010) Alexandre Allauzen, Josep M.Crego, İlknur Durgar El-Kahlout, Le Hai-Son, Guillaume Wisniewski, & François Yvon: LIMSI @ IWSLT 2010.  Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.105-112. [PDF, 386KB]

(2010) Jennifer Beaupre & Kent Taylor: Content quality for better MT: a practical guide to quality at the source. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 3pp. [PDF, 69KB]

(2010) Arianna Bisazza & Marcello Federico: Chunk-based verb reordering in VSO sentences for Arabic-English statistical machine translation.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 235-243. [PDF, 372KB]

(2010) İlknur Durgar El-Kahlout & François Yvon: The pay-offs of preprocessing for German-English statistical machine translation. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.251-258. [PDF, 301KB]

(2010) Susan Howlett & Mark Dras: Dual-path phrase-based statistical machine translation. Australasian Language Technology Workshop 2010 (ALTA Workshop 2010), 9-10 December 2008, Melbourne, Australia; pp.32-40. [PDF, 633KB]

(2010) Hideki Isozaki, Katsuhito Sudoh, Hajime Tsukada, & Kevin Duh: Head finalization: a simple reordering rule for SOV languages.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 133-137. [PDF, 321KB]

(2010) Weimin Jiang: Pre-editing for machine translation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; abstract

(2010) Maxim Khalilov & Khalil Sima’an: Source reordering using MaxEnt classifiers and supertags. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 605KB]

(2010) Hassan Sawaf: Arabic dialect handling in hybrid machine translation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 8pp. [PDF, 146KB]

(2010) Sara Stymne, Maria Holmqvist, & Lars Ahrenberg: Vs and OOVs: two problems for translation between German and English. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 183-188. [PDF, 105KB]

(2010) Yanli Sun, Sharon O’Brien, Minako O’Hagan, & Fred Hollowood: A novel statistical pre-processing model for rule-based machine translation system.  EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 790KB]; presentation: 28 slides [PDF, 580KB]

(2010) Karthik Visweswariah, Jiri Navratil, Jeffrey Sorensen, Vijil Chenthamarakshan, & Nanda Kambhatla: Syntax based reordering with automatically derived rules for improved statistical machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.1119-1127. [PDF, 169KB]

(2010) Ventsislav Zhechev & Josef van Genabith: Seeding statistical machine translation with translation memory output through tree-based structural alignment. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28 August 2010; pp.43-51. [PDF, 510KB]

Programming and algorithms [see also Bootstrapping, Computer equipment and accessories, Dynamic programming, Parallel processing]

(2011) Alexandre Allauzen, Hélène Bonneau-Maynard, Hai-Son Le, Aurélien Max, Guillaume Wisniewski, François Yvon, Gilles Adda, Josep M.Crego, Adrien Lardilleux, Thomas Lavergne, & Artem Sokolov: LIMSI @ WMT11. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.309-315. [PDF, 107KB]

(2011) Ergun Biçici & Deniz Yuret: Instance selection for machine translation using feature decay algorithms. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.272-283. [PDF, 244KB]

(2011) Mauro Cettolo, Nicola Bertoldi, & Marcello Federico: Methods for smoothing the optimizer instability in SMT. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.32-39. [PDF, 150KB]

(2011) Yin-Wen Chang & Michael Collins: Exact decoding of phrase-based translation models through Lagrangin relaxation. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.26-37. [PDF, 312KB]

(2011) Boxing Chen, Roland Kuhn, George Foster, & Howard Johnson: Unpacking and transforming feature functions: new ways to smooth phrase tables.  MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.269-275. [PDF, 158KB]

(2011) Fabien Cromieres & Sadao Kurohashi: Efficient retrieval of tree translation examples for syntax-based machine translation.  [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.508-518. [PDF, 616KB]

(2011) Jacob Devlin, Antti-Veikko I.Rosti, Shankar Ananthakrishnan, & Spyros Matsoukas: System combination using discriminative cross-adaptation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.667-675. [PDF, 627KB]

(2011) Michel Galley & Chris Quirk: Optimal search for minimum error rate training. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.38-49. [PDF, 1280KB]

(2011) Sumukh Ghodke, Steven Bird, & Rui Zhang: A breadth-first representation for tree matching in large scale forest-based translation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.785-793. [PDF, 941KB]

(2011) Barry Haddow, Abhishek Arun, & Philipp Koehn: SampleRank training for phrase-based machine translation. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.261-271. [PDF, 194KB]

(2011) Greg Hanneman, Michelle Burroughs, & Alon Lavie: A general-purpose rule extractor for SCFG-based machine translation. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.135-144. [PDF, 267KB]

(2011) Kenneth Heafield: KenLM: faster and smaller language model queries. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.187-197. [PDF, 169KB]

(2011) Gonzalo Iglesias, Cyril Allauzen, William Byrne, Adrià de Gispert, & Michael Riley: Hierarchical phrase-based translation representations. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.1373-1383. [PDF, 334KB]

(2011) Mitesh M.Khapra, Salil Joshi, & Pushpak Bhattacharyya: It takes two to tango: a bilingual unsupervised approach for estimating sense distributions using expectation maximization. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.695-704. [PDF, 280KB]

(2011) Xianhua Li, Yajuan Lü, Yao Meng, Qun Liu, & Hao Yu: Feedback selecting of manually acquired rules using automatic evaluation. [MT Summit XIII] 4th Workshop on Patent Translation, Shoichi Yokoyama (ed,), Xiamen, China, September 23, 2011; pp.52-59. [PDF, 185KB]

(2011) Shujie Liu, Chi-Ho Li, & Ming Zhou: A unified SMT framework combining MIRA and MERT. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.181-188. [PDF, 233KB]

(2011) J.Scott McCarley, Abraham Ittycheriah, Salim Roukos, Bing Xiang, & Jian-ming Xu: A correction model for word alignments.  [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.889-898. [PDF, 236KB]

(2011) Adam Pauls & Dan Klein: Faster and smaller n-gram language models. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.258-267. [PDF, 246KB]

(2011) Christian Rishøj & Anders Søgaard: Factored translation with unsupervised word clusters. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.447-451. [PDF, 163KB]

(2011) Germán Sanchis-Trilles, Daniel Ortiz-Martínez, Jesús González-Rubio, Jorge González, & Francisco Casacuberta: Bilingual segmentation for phrasetable pruning in statistical machine translation. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.257-264. [PDF, 374KB]; presentation, 15 slides [PDF]

(2011) Markus Saers & Dekai Wu: Reestimation of reified rules in semiring parsing and biparsing. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.70-78. [PDF, 237KB]

(2011) Thomas Schoenemann: Regularizing mono- and bi-word models for word alignment. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.1313-1322. [PDF, 286KB]

(2011) Christophe Servan & Holger Schwenk: Optimising multiple metrics with MERT. Sixth Machine Translation Marathon, 5-10 September 2011, Trento; Prague Bulletin of Mathematical Linguistics, no.96, October 2011; pp.109-117. [PDF, 141KB]; presentation, 20 slides [PDF of PPT, 535KB]

(2011) Xiadong Shi, Xiang Zhu, & Yidong Chen: Improving the hierarchical phrase-based translation model. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.383-388. [PDF, 174KB]

(2011) Jun Suzuki, Kevin Duh, & Masaaki Nagata: Distributed minimum error rate training of SMT using particle swarm optimization. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.649-657. [PDF, 1101KB]

(2011) Kaveh Taghipour, Shahram Khadivi, & Jia Xu: Parallel corpus refinement as an outlier detection algorithm. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.414-421. [PDF, 213KB]

(2011) Ming Tan, Wenli Zhou, Lei Zheng, & Shaojun Wang: A large scale distributed syntactic, semantic and lexical language model for machine translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.201-210. [PDF, 192KB]

(2011) Kristina Toutanova & Michel Galley: Why initialization matters for IBM Model 1: multiple optima and non-strict convexity. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.461-466. [PDF, 181KB]

(2011) Ashish Venugopal, Jakob Uszkoreit, David Talbot, Franz J.Och, & Juri Ganitkevitch: Watermarking the outputs of structured prediction with an application in statistical machine translation. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.1363-1372. [PDF, 293KB]

(2011) Xinyan Xiao, Yang Liu, Qun Liu, & Shouxun Lin: Fast generation of translation forest for large-scale SMT discriminative training. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.880-888. [PDF, 818KB]

(2011) Daqi Zheng, Yifan He, Yang Liu & Qun Liu: Maximum rank correlation training for statistical machine translation.  MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.48-55. [PDF, 184KB]

(2010) Abhishek Arun, Barry Haddow, & Philipp Koehn: A unified approach to minimum risk training and decoding. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 365-374. [PDF, 225KB]

(2010) Loïc Barrault: Open source machine translation system combination. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.147-155. [PDF, 163KB]

(2010) Ergun Biçici & Deniz Yuret: L1 regularized regression for reranking and system combination in machine translation. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 282-289. [PDF, 197KB]

(2010) Graeme Blackwood, Adrià de Gispert, & William Byrne: Efficient path counting transducers for minimum Bayes-risk decoding of statistical machine translation lattices. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.27-32. [PDF, 288KB]

(2010) Ralf D.Brown: Taming structured perceptrons on wild feature vectors. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 384-391. [PDF, 132KB]

(2010) Bill Byrne: Hierarchical phrase-based translation with weighted finite state transducers. Invited talk at: 7th International Workshop on Spoken Language Translation, 2 December 2010, Paris, France; 46 slides. [PDF of PPT, 1266KB]

(2010) Jonathan H.Clark, Jonathan Weese, Byung Gyu Ahn, Andreas Zollmann, Qin Gao, Kenneth Heafield, & Alon Lavie: The machine translation toolpack for LoonyBin: an automated management of experimental machine translation hyperworkflows.  Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.117-126. [PDF, 153KB]

(2010) Martin Čmejrek & Bowen Zhou: Two methods for extending hierarchical rules from the bilingual chart parsing. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.180-188. [PDF, 200KB]

(2010) Lei Cui, Dongdong Zhang, Mu Li, Ming Zhou, & Tiejun Zhao: A joint rule selection model for hierarchical phrase-based translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.6-11. [PDF, 213KB]

(2010) Nan Duan, Hong Sun, & Ming Zhou: Translation model generalization using probability averaging for machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.304-312. [PDF, 296KB]

(2010) Kevin Duh, Katsuhito Sudoh, Hajime Tsukada, Hideki Isozaki, & Masaaki Nagata: N-best reranking by multitask learning. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 375-383. [PDF, 184KB]

(2010) Marc Dymetman & Nicola Cancedda: Intersecting hierarchical and phrase-based models of translation: formal aspects and algorithms. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28 August 2010; pp.1-9. [PDF, 354KB]

(2010) Cristina España-Bonet & Lluís Màrquez: Robust estimation of feature weights in statistical machine translation. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 618KB]

(2010) Yang Feng, Haitao Mi, Yang Liu, & Qun Liu: An efficient shift-reduce decoding algorithm for phrase-based machine translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.285-293. [PDF, 175KB]

(2010) Mikel L.Forcada, Boyan Ivanov Bonev, Sergio Ortiz Rojas, Juan Antonio Pérez Ortiz, Gema Ramírez Sánchez, Felipe Sánchez Martínez, Carme Armentano-Oller, Marco A.Montava, & Francis M.Tyers: Documentation of the open-source shallow-transfer machine translation platform Apertium; ed. Mireia Ginestí Rosell. Departament de Llenguatges i Sistemes Informàtics, Universitat d’Alacant, March 10, 2010; 214pp. [PDF, 700KB]

(2010) Qin Gao, Francisco Guzman, & Stephan Vogel: EMDC: a semi-supervised approach for word alignment. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.349-357. [PDF, 231KB]

(2010) Andrea Gesmundo & James Henderson: Faster cube pruning. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.267-274. [PDF, 437KB]

(2010) Spence Green, Michel Galley, & Christopher D.Manning: Improved models of distortion cost for statistical machine translation.  NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.867-875. [PDF, 223KB]

(2010) Barry Haddow: Adding multi-threaded decoding to Moses. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.57-66. [PDF, 136KB]

(2010) Christian Hardmeier: Fast and extensible phrase scoring for statistical machine translation. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.87-96. [PDF, 123KB]

(2010) Kenneth Heafield & Alon Lavie: Combining machine translation output with open source: the Carnegie Mellon multi-engine machine translation scheme. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.27-36. [PDF, 133KB]

(2010) Fei Huang & Bing Xiang: Feature-rich discriminative phrase rescoring for SMT. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.492-500. [PDF, 260KB]

(2010) Liang Huang & Haitao Mi: Efficient incremental decoding for tree-to-string translation.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.273-283. [PDF, 266KB]

(2010) Philipp Koehn & Jean Senellart: Fast approximate string matching with suffix arrays and A* parsing. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 9pp. [PDF, 178KB]

(2010) Prasanth Kolachina, Sriram Venkatapathy, Srinivas Bangalore, Sudheer Kolachina, & Avinesh PVS: Phrase-based decoding using a discriminative model. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28 August 2010; pp.34-42. [PDF, 184KB]

(2010) Roland Kuhn, Boxing Chen, George Foster, & Evan Stratford: Phrase clustering for smoothing TM probabilities – or, how to extract paraphrases from phrase tables. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.608-616. [PDF, 328KB]

(2010) Abby Levenberg, Chris Callison-Burch, & Miles Osborne: Stream-based translation models for statistical machine translation.  NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.394-402. [PDF, 184KB]

(2010) Mu Li, Yinggong Zhao, Dongdong Zhang, & Ming Zhou: Adaptive development data selection for log-linear model in statistical machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.662-670. [PDF, 213KB]

(2010) Peng Li, Maosong Sun, & Ping Xue: Fast-Champollion: a fast and robust sentence alignment algorithm. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.710-718. [PDF, 267KB]

(2010) Peng Li, Maosong Sun & Ping Xue: Fast-Champollion: a fast and robust sentence alignment algorithm [abstract]. CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 59KB]

(2010) Wang Ling, Tiago Luís, João Graça, Luísa Coheur & Isabel Trancoso: Towards a general and extensible phrase-extraction algorithm. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.313-320. [PDF, 382KB]

(2010) Yang Liu & Liang Huang: Tree-based and forest-based translation. ACL 2010:Tutorial abstracts, Uppsala, Sweden, 11 July, 2010; p.2. [PDF, 58KB]

(2010) Andreas Maletti & Giorgio Satta: Parsing and translation algorithms based on weighted extended tree transducers. Proceedings of the 2010 Workshop on Applications of Tree Automata in Natural Language Processing, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.19-27. [PDF, 190KB]

(2010) Pascual Martínez Gómez, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda, & Germán Sanchis-Trilles: A deterministic annealing-based training algorithm for statistical machine translation models.  EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 7pp. [PDF, 712KB]

(2010) Behrang Mohit, Rebecca Hwa, & Alon Lavie: Using variable decoding weight for language model in statistical machine translation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 165KB]

(2010) Robert C.Moore & William Lewis: Intelligent selection of language model training data. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.220-224. [PDF, 176KB]

(2010) Sylwia Ozdowska & Vincent Claveau: Inferring syntactic rules for word alignment through inductive logic programming. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.749-754. [PDF, 370KB]

(2010) Alicia Pérez, M.Inés Torres, & Francisco Casacuberta: Potential scope of a fully-integrated architecture for speech translation. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 448KB]

(2010) Benjamin Roth, Andrew McCallum, Marc Dymetman & Nicola Cancedda: Machine translation using overlapping alignments and SampleRank. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 7pp. [PDF, 138KB]

(2010) Baskaran Sankaran, Ajeet Grewal, & Anoop Sarkar: Incremental decodings for phrase-based statistical machine translation.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 216-223. [PDF, 137KB]

(2010) Thomas Schoenemann: Computing optimal alignments for the IBM-3 translation model. CoNLL-2010: Fourteenth Conference on Computational Natural Language Learning, Proceedings of the conference, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp.98-106. [PDF, 130KB]

(2010) Lane Schwartz & Chris Callison-Burch: Hierarchical phrase-based grammar extraction in Joshua: suffix arrays and prefix trees. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.157-166. [PDF, 136KB]

(2010) Holger Schwenk: Continuous-space language models for statistical machine translation. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.137-146. [PDF, 151KB]

(2010) Kashif Shah, Loïc Barrault, & Holger Schwenk: Translation model adaptation by resampling. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 392-399. [PDF, 176KB]

(2010) Libin Shen, Jinxi Xu, & Ralph Weischedel: String-to-dependency statistical machine translation. Computational Linguistics 36 (4), pp. 649-671 [PDF, 233KB]

(2010) Drahomira (Johanka) Spoustová & Miroslav Spousta: Dependency parsing as a sequence labeling task. Prague Bulletin of Mathematical Linguistics, no.94, September 2010; pp.7-14. [PDF, 271KB]

(2010) Zhiyang Wang, Yajuan Lü, Qun Liu, & Young-Sook Hwang: Better filtration and augmentation for hierarchical phrase-based translation rules. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.142-146. [PDF, 135KB]

(2010) Sirvan Yahyaei & Christoph Monz: Dynamic distortion in a discriminative reordering model for statistical machine translation. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.353-360. [PDF, 460KB]

(2010) Dong Yang, Paul Dixon, & Sadaoki Furui: Jointly optimizing a two-step conditional random field model for machine transliteration and its fast decoding algorithm. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.275-280. [PDF, 115KB]

Psychological aspects

(2011) Fumiko Kano Glückstad: Application of classical psychology theory to terminological ontology alignment. Proceedings of the 8th international NLPSC workshop. Special theme: Human-machine interaction in translation, Copenhagen Business School, 20-21 August 2011; ed.Bernadette Sharp, Michael Zock, Michael Carl, Arnt Lykke Jakobsen (Copenhagen Studies in Language 41), Frederiksberg: Samfundslitteratur, 2011; pp.227-238. [PDF, 1187KB]

Quality control

(2010) Jennifer Beaupre & Kent Taylor: Content quality for better MT: a practical guide to quality at the source. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 3pp. [PDF, 69KB]

(2010) Daniel Marcu, Kathleen Egan, Chuck Simmons, & Ning-Ning Mahlmann: Utilizing automated translation with quality scores to increase productivity. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; abstract

(2010) Carol Van Ess-Dykema, Jocelyn Phillips, Florence Reeder, & Laurie Gerber: Paralinguist assessment decision factors for machine translation output: a case study. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 7pp. [PDF, 180KB]; abstract

Question-answering

(2011) María-Dolores Olvera-Lobo & Juncal Guttiérez-Artacho: Multilingual question-answering system in biomedical domain on the Web: an evalution. CLEF 2011: Conference on Multilingual and Multimodal Iinformation Access Evaluation, 19-22 September 2011, Amsterdam; 12slides [PDF of PPT, 413KB]

(2011) Guangyou Zhou, Li Cai, Jun Zhao, & Kang Liu: Phrase-based translation model for question retrieval in community question answer archives. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.653-662. [PDF, 759KB]

(2010) Pamela Forner, Danilo Giampiccolo, Bernardo Magnini, Anselmo Peñas, Álvaro Rodrigo, & Richard Sutcliffe: Evaluating multilingual question answering systems at CLEF. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2774-2781. [PDF, 314KB]

(2010) Lola García-Santiago, Maria-Dolores Olvera-Lobo: Automatic web translators as part of a multilingual question-answering (QA) system: translation of questions. Translation Journal 14 (1), January 2010; 5pp. [PDF, 117KB]

(2010) Yashar Mehdad, Matteo Negri, & Marcello Federico: Towards cross-lingual textual entailment.  NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.321-324. [PDF, 230KB]

(2010) Gerard de Melo & Gerhard Weikum: Providing multilingual, multimodal answers to lexical database queries. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.348-355. [PDF, 1169KB]

(2010) Kristen Parton & Kathleen McKeown: MT error detection for cross-lingual question answering. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.946-954. [PDF, 161KB]

(2010) Diana Santos, Lu