Machine Translation Archive

Subject Index

to methods, techniques, issues and topics

Publications since 2010

[Click here for earlier publications: 2005-2009, 2000-2004, 1990-1999, before 1990]

To return to home page click here

Alignment [see also Character alignment, Statistical analysis, Word alignment]

(2011) Mohit Bansal, Chris Quirk, & Robert C.Moore: Gappy phrasal alignment by agreement . ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1308-1317. [PDF, 215KB]

(2011) Anja Belz & Eric Kow: Unsupervised alignment of comparable data and text resources. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.102-109. [PDF, 143KB]

(2011) Dhouha Bouamor, Nasredine Semmar, & Pierre Zweigenbaum: Improved statistical machine translation using multiword expressions. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; pp.15-20. [PDF, 345KB]

(2011) Houda Bouamor, Aurélien Max, & Anne Vilnat: Monolingual alignment by edit rate computation on sentential paraphrase pairs. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.395-400. [PDF, 138KB]

(2011) Chenhui Chu, Toshiaki Nakazawa, & Sadao Kurohashi: Japanese-Chinese phrase alignment using common Chinese characters information. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.464-471. [PDF, 347KB]

(2011) Rashmi Gangadharaiah: Reducing asymmetry between language-pairs to improve alignment and translation quality. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.1346-1350. [PDF, 239KB]

(2011) Sanjika Hewavitharana & Stephan Vogel: Extracting parallel phrases from comparable data. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.61-68. [PDF, 217KB]

(2011) Mark Hopkins, Greg Langmead & Tai Vo: Extraction programs: a unified approach to translation rule extraction. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.523-532. [PDF, 552KB]

(2011) Radu Ion, Alexandru Ceauşu, & Elena Irimia: An expectation maximization algorithm for textual unit alignment. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.128-135. [PDF, 1035KB]

(2011) Jagadeesh Jagarlamudi, Hal Daumé III, & Raghavendra Udupa: From bilingual dictionaries to interlingual document representations. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.147-152. [PDF, 112KB]

 (2011) Wang Ling, Tiago Luís, João Graça, Luísa Coheur, & Isabel Trancoso: Reordering modeling using weighted alignment matrices. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.450-454. [PDF, 121KB]

(2011) Chi-kiu Lo & Dekai Wu: A radically simple, effective annotation and alignment methodology for semantic frame based SMT and MT evaluation. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; pp.58-65. [PDF, 898KB]

(2011) Toshiaki Nakazawa & Sadao Kurohashi: Bayesian subtree alignment model based on dependency trees. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.794-802. [PDF, 790KB]

(2011) Graham Neubig, Taro Watanabe, Eiichiro Sumita, Shinsuke Mori, & Tatsuya Kawahara: An unsupervised model for joint phrase alignment and extraction. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.632-641. [PDF, 782KB]

 (2011) Jason Naradowsky & Kristina Toutanova: Unsupervised bilingual morpheme segmentation and alignment with context-rich hidden semi-Markov models. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.895-904. [PDF, 285KB]

(2011) Kashif Shah, Loïc Barrault, & Holger Schwenk: Parametric weighting of parallel data for statistical machine translation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.1323-1331. [PDF, 754KB]

(2011) Wolfgang Täger: The sentence-aligned European patent corpus. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.177-184. [PDF, 234KB]

(2011) Dan Zeman, Mark Fishel, Jan Berka, & Ondřej Bojar: Addicter: what’s wrong with my translations? Sixth Machine Translation Marathon, 5-10 September 2011, Trento; Prague Bulletin of Mathematical Linguistics, no.96, October 2011; pp.79-88. [PDF, 276KB]; presentation, 20 slides [PDF of PPT, 359KB]

(2011) Min Zhang, Xiangyu Duan, Ming Liu, Yunqing Xia, & Haizhou Li: Joint alignment and artificial data generation: an empirical study of pivot-based machine transliteration. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.1207-1215. [PDF, 319KB]

(2010) Lars Ahrenberg: Alignment-based profiling of Europarl data in an English-Swedish parallel corpus. LREC 2010: proceedings of the seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3398-3404. [PDF, 349KB]

(2010) Stacey Bailey: Data preparation for machine translation customization. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; abstract

(2010) Ondřej Bojar, Pavel Straňák, & Daniel Zeman: Data issues in English-to-Hindi machine translation. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1771-1777. [PDF, 557KB]

(2010) Gosse Bouma: Cross-lingual ontology alignment using EuroWordNet and Wikipedia. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1023-1028. [PDF, 336KB]

(2010) Fabienne Braune & Alexander Fraser: Improved unsupervised sentence alignment for symmetrical and asymmetrical parallel corpora. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.81-89. [PDF, 228KB]

(2010) Anton Bryl & Josef van Genabith: f-align: an open-source alignment tool for LFG f-structures. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 8pp. [PDF, 229KB]

(2010) Mauro Cettolo, Marcello Federico, & Nicola Bertoldi: Mining parallel fragments from comparable texts. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.227-234. [PDF, 393KB]

(2010) Yufeng Chen, Chenqing Zong, & Keh-Yih Su: On jointly recognizing and aligning bilingual named entities. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.631-639. [PDF, 151KB]

(2010) John DeNero & Dan Klein: Discriminative modeling of extraction sets for machine translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.1453-1463. [PDF, 616KB]

(2010) Do Thi Ngoc Diep, Laurent Besacier, & Eric Castelli: A fully unsupervised approach for mining parallel data from comparable corpora. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 856KB]; presentation: 30 slides [PDF, 2621KB]

(2010) Chris Dyer, Adam Lopez, Juri Ganitkevitch, Jonathan Weese, Ferhan Ture, Phil Blunsom, Hendra Setiawan, Vladimir Eidelman, & Philip Resnik: cdec: a decoder, alignment, and learning framework for finite-state and context-free translation models. Proceedings of the ACL 2010 System Demonstrations, Uppsala, Sweden, 13 July 2010; pp.7-12. [PDF, 171KB]

(2010) Miquel Esplà-Gomis & Mikel L.Forcada: Combining content-based and URL-based heuristics to harvest aligned bitexts from multilingual sites with Bitextor. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.77-86. [PDF, 160KB]

(2010) Mark Fishel & Heiki-Jaan Kaalep: CorporAl: a method and tool for handling overlapping parallel corpora. Fifth Machine Translation Marathon, 13-18 September, University of Le Mans; Prague Bulletin of Mathematical Linguistics, no.94, September 2010; pp.67-76. [PDF, 149KB]; presentation [PDF, 1037KB]

(2010) Carmen Heger, Joern Wuebker, David Vilar, & Hermann Ney: A combination of hierarchical systems with forced alignments from phrase-based systems. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.291-297. [PDF, 377KB]

 (2010) Elena Irimia & Alexandru Ceausu: Dependency-based translation equivalents for factored machine translation [abstract]. CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 60KB]

(2010) Jie Jiang, Andy Way, & Julie Carson-Berndsen: Lattice score based data cleaning for phrase-based statistical machine translation.  EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 631KB]

(2010) Maxim Khalilov & Khalil Sima’an: Source reordering using MaxEnt classifiers and supertags. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 605KB]

(2010) Patrik Lambert, Simon Petitrenaud, Yanjun Ma & Andy Way: Statistical analysis of alignment characteristics for phrase-based machine translation. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 588KB]; presentation: 26 slides [PDF, 133KB]

(2010) Peng Li, Maosong Sun, & Ping Xue: Fast-Champollion: a fast and robust sentence alignment algorithm. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.710-718. [PDF, 267KB]

 (2010) Peng Li, Maosong Sun & Ping Xue: Fast-Champollion: a fast and robust sentence alignment algorithm [abstract]. CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 59KB]

(2010) Wang Ling, Tiago Luís, João Graça, Luísa Coheur & Isabel Trancoso: Towards a general and extensible phrase-extraction algorithm. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.313-320. [PDF, 382KB]

(2010) Shujie Liu, Chi-Ho Li, & Ming Zhou: Discriminative pruning for discriminative ITG alignment. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.316-324. [PDF, 972KB]

(2010) Minh-Thang Luong & Min-Yen Kan: Enhancing morphological alignment for translating highly inflected languages. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.743-751. [PDF, 610KB]

(2010) Yanjun Ma & Andy Way: HMM word-to-phrase alignment with dependency constraints. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28  August 2010; pp.101-109. [PDF, 191KB]

(2010) Adam Pauls, Dan Klein, David Chiang, & Kevin Knight: Unsupervised syntactic alignment with inversion traduction grammars. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.118-126. [PDF, 766KB]

(2010) Benjamin Roth, Andrew McCallum, Marc Dymetman & Nicola Cancedda: Machine translation using overlapping alignments and SampleRank. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 7pp. [PDF, 138KB]

(2010) Markus Saers, Joakim Nivre, & Dekai Wu: Linear inversion transduction grammar alignments as a second translation path. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 167-171. [PDF, 111KB]

(2010) Thomas Schoenemann: Computing optimal alignments for the IBM-3 translation model. CoNLL-2010: Fourteenth Conference on Computational Natural Language Learning, Proceedings of the conference, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp.98-106. [PDF, 130KB]

(2010) Rico Sennrich & Martin Volk: MT-based sentence alignment for OCR-generated parallel texts. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 174KB]

(2010) Jason R.Smith, Chris Quirk, & Kristina Toutanova: Extracting parallel sentences from comparable corpora using document level alignment. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.403-411. [PDF, 313KB]

(2010) Anders Søgaard: Can inversion transduction grammars generate hand alignments? EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 6pp. [PDF, 602KB]; presentation by Michael Carl: 19 slides [PDF, 122KB]

(2010) Daniel Stein, Christoph Schmidt, & Hermann Ney: Sign language machine translation overkill. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.337-344. [PDF, 2052KB]

(2010) Jun Sun, Min Zhang, & Chew Lim Tan: Exploring syntactic structural features for sub-tree alignment using bilingual tree kernels. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.306-315. [PDF, 286KB]

(2010) Jörg Tiedemann: Lingua-Align: an experimental toolkit for automatic tree-to-tree alignment.  LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.736-743. [PDF, 296KB]

(2010) Karthik Visweswariah, Jiri Navratil, Jeffrey Sorensen, Vijil Chenthamarakshan, & Nanda Kambhatla: Syntax based reordering with automatically derived rules for improved statistical machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.1119-1127. [PDF, 169KB]

(2010) Ventsislav Zhechev & Josef van Genabith: Maximising TM performance through sub-tree alignment and SMT. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 293KB]

(2010) Ventsislav Zhechev & Josef van Genabith: Seeding statistical machine translation with translation memory output through tree-based structural alignment. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28 August 2010; pp.43-51. [PDF, 510KB]

Analogies and analogical modelling (see also Example-based methods)

 (2011) Kota Takeya & Yves Lepage: Marker-based chunking for analogy-based translation of chunks.  MT Summit  XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.338-345. [PDF, 171KB]

(2010) Dmitry Davidov & Ari Rappoport: Automated translation of semantic relationships. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.241-249. [PDF, 196KB]

Analysis see Parsing; Semantic analysis; Syntactic analyis

Anaphora resolution

 (2011) Elizabeth Baran & Nianwen Xue: Singular or plural? Exploiting parallel corpora for Chinese number prediction. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.207-214. [PDF, 187KB]

(2010) Christian Hardmeier & Marcello Federico: Modelling pronominal anaphora in statistical machine translation. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.283-289. [PDF, 353KB]

(2010) Ronan Le Nagard & Philipp Koehn: Aiding pronouns translation with co-reference resolution. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 252-261. [PDF, 146KB]

Annotation and features

(2011) Hala Almaghout, Jie Jiang, & Andy Way: CCG contextual labels in hierarchical phrase-based SMT. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.281-288. [PDF, 317KB]; presentation, 25 slides [PDF, 964KB]

(2011) Vamshi Ambati, Sanjika Hewavitharana, Stephan Vogel, & Jaime Carbonell: Active learning with multiple annotations for comparable data classification task. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.69-77. [PDF, 201KB]

(2011) Eleftherios Avramidis: DFKI system combination with sentence ranking at ML4HMT-2011. [LIHMT] ML4HMT 2011: Shared Task on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid Machine Translation, 19th November, Barcelona Media, Barcelona; pp.99-103. [PDF, 260KB]

(2011) Alexandru Ceausu: Rich morpho-syntactic descriptors for factored machine translation with highly inflected languages as target. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 24 January, 2011; 2pp. [PDF, 53KB]; presentation: 32 slides [PDF of PPT, 349KB]

(2011) Boxing Chen, Roland Kuhn, George Foster, & Howard Johnson: Unpacking and transforming feature functions: new ways to smooth phrase tables.  MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.269-275. [PDF, 158KB]

(2011) Christian Chiarcos & Tomaž Erjavec: OWL/DL formalization of the MULTEXT-East morphosyntactic specifications. Proceedings of the Fifth LAW Workshop (LAW V), Portland, Oregon, USA, 23-24 June 2011; pp.11-20. [PDF, 118KB]

(2011) Qin Gao & Stephan Vogel: Corpus expansion for statistical machine translation with semantic role label substitution rules. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.294-298. [PDF, 151KB]

(2011) Yifan He, Yanjun Ma, Andy Way, & Josef van Genabith: Rich linguistic features for translation memory-inspired consistent translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.456-463. [PDF, 159KB]

(2011) Philipp Koehn: Factored models for morphology [abstract]. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 26 January, 2011; presentation: 49 slides [PDF of PPT, 425KB]

(2011) Chi-kiu Lo & Dekai Wu: A radically simple, effective annotation and alignment methodology for semantic frame based SMT and MT evaluation. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; pp.58-65. [PDF, 898KB]

(2011) Erwin Marsi, André Lynum, Lars Bungum, & Björn Gambäck: Word translation disambiguation without parallel texts. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; pp.66-74. [PDF, 898KB]

(2011) Matteo Negri, Luisa Bentivogli, Yashar Mehdad, Danilo Giampiccolo, & Alessandro Marchetti: Divide and conquer: crowdsourcing the creation of cross-lingual textual entailment corpora.  [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.670-679. [PDF, 493KB]

(2011) Kemal Oflazer: Syntax-to-morphology mapping in factored SMT [abstract]. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 27 January, 2011; presentation: 60 slides [PDF of PPT, 491KB]

(2011) Tsuyoshi Okita & Andy Way: MWE-sensitive word alignment in factored translation model. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 27 January, 2011; 2pp. [PDF, 42KB]; presentation: 51 slides [PDF of PPT, 650KB]

(2011) Eva Safar: Machine translation with corpus linguistics and HPSG. International Workshop on Sign Language Translation and Avatar Technology (SLTAT), 10-11 January 2011, Federal Ministry of Labour and Social Affairs, Berlin, Germany; 12 slides [PDF of PPT, 288KB]

(2011) Lonneke van der Plas, Paola Merlo, & James Henderson: Scaling up automatic cross-lingual semantic role annotation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.299-304. [PDF, 161KB]

 (2011) Viet Ha-Thuc & Nicola Cancedda: Confidence-weighted learning of factored discriminitave language models. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.439-444. [PDF,

 (2011) Andreas Zollmann & Stephan Vogel: A word-class approach to labeling PSCFG rules for machine translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1-11. [PDF, 230KB]

(2010) Vamshi Ambati, Stephen Vogel, & Jaime Carbonell: Active learning and crowd-sourcing for machine translation. LREC 2010: proceedings of the seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2169-2174. [PDF, 436KB]

(2010) Wilker Aziz, Marc Dymetman, Shachar Mirkin, Lucia Specia, Nicola Cancedda, & Ido Dagan: Learning an expert from human annotations in statistical machine translation: the case of out-of-vocabulary words. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 585KB]; presentation: 21 slides [PDF, 777KB]

(2010) Kathryn Baker, Michael Bloodgood, Bonnie J.Dorr, Nathaniel W.Filardo, Lori Levin, & Christine Piatko: A modality lexicon and its use in automatic tagging. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1402-1407. [PDF, 375KB]

(2010) Yassine Benajiba & Mona Diab: A web application for dialectal Arabic text annotation. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, Valletta, Malta, 17 May 2010; pp. 91-98. [PDF, 747KB]

(2010) Michael Bloodgood & Chris Callison-Burch: Bucking the trend: large-scale cost-focused active learning for statistical machine translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.854-864. [PDF, 489KB]

(2010) Simon Carter & Christof Monz: Discriminative syntactic reranking for statistical machine translation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 143KB]

(2010) Mona Diab, Nizar Habash, Owen Rambow, Mohamed Altantawy, & Yassine Benajiba: COLABA: Arabic dialect annotation and processing. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, Valletta, Malta, 17 May 2010; pp. 66-74. [PDF, 747KB]

(2010) Kais Dukes & Nizar Habash: Morphological annotation of Quranic Arabic. LREC 2010: proceedings of the seventh international conference on Language Resources and Evaluation, organised by ELRA, 17-23 May 2010, Valletta, Malta; pp.2530-2536. [PDF, 995KB]

(2010) Achille Falaise, David Rouquet, Didier Schwab, Hervé Blanchon, & Christian Boitet: Ontology driven content extraction using interlingual annotation of texts in the OMNIA project. [Coling 2010] Proceedings of the 4th Workshop on Cross Lingual Information Access, Beijing, China, 28 August 2010; pp.52-60. [PDF, 359KB]

(2010) Christian Federmann: Appraise: an open-source toolkit for manual phrase-based evaluation of translations. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1731-1734. [PDF, 368KB]

(2010) Yvette Graham & Josef van Genabith: Factor templates for factored machine translation models. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.275-282. [PDF, 366KB]

(2010) Spence Green & Christopher D.Manning: Better Arabic parsing: baselines, evaluations, and analysis. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.394-402. [PDF, 148KB]

(2010) Fei Huang & Bing Xiang: Feature-rich discriminative phrase rescoring for SMT. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.492-500. [PDF, 260KB]

(2010) Mitesh M.Khapra, Saurabh Sohoney, Anup Kulkarni, & Pushpak Bhattacharyya: Value for money: balancing annotation effort, lexicon building and accuracy for multilingual WSD. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.555-563. [PDF, 237KB]

(2010) Philipp Koehn, Barry Haddow, Philip Williams, & Hieu Hoang: More linguistic annotation for statistical machine translation.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 115-120. [PDF, 101KB]

(2010) Xuansong Li, Niyu Ge, Stephen Grimes, Stephanie M.Strassel, & Kazuaki Maeda: Enriching word alignment with linguistic tags. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2189-2195. [PDF, 509KB]

(2010) Chi-kiu Lo & Dekai Wu: Semantic vs. syntactic vs. n-gram structure for machine translation evaluation. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28  August 2010; pp.52-60. [PDF, 401KB]

(2010) Lieve Macken: An annotation scheme and gold standard for Dutch-English word alignment. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3369-3374. [PDF, 379KB]

(2010) Simon Mille & Leo Wanner: Syntactic dependencies for multilingual and multilevel corpus annotation. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1889-1896. [PDF, 395KB]

(2010) Smruthi Mukund, Debanjan Ghosh, & Rohini K.Srihari: Using cross-lingual projections to generate semantic role labeled corpus for Urdu – a resource poor language. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.797-805. [PDF, 300KB]

(2010) Mohamed Outahajala, Lahbib Zekouar, Paolo Rosso, & M.Antònia Martí: Tagging Amazigh with AnCoraPipe. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, Valletta, Malta, 17 May 2010; pp. 52-56. [PDF, 326KB]

(2010) Sergio Penkale, Rejwanul Haque, Sandipan Dandapat, Pratyush Banerjee, Ankit K.Srivastava, Jinhua Du, Pavel Pecina, Sudip Kumar Naskar, Mikel L.Forcada, & Andy Way: MaTrEx: the DCU MT system for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 143-148. [PDF, 535KB]

(2010) Jan Niehues & Alex Waibel: Domain adaptation in statistical machine translation using factored translation models. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 7pp. [PDF, 581KB]; presentation: 27 slides [PDF, 887KB]

(2010) Narges Sharif Razavian & Stephan Vogel: Fixed length word suffix for factored statistical machine translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.147-150. [PDF, 268KB]

 (2010) Libin Shen, Bing Zhang, Spyros Matsoukas, Jinxi Xu, & Ralph Weischedel: Statistical machine translation with a factorized grammar.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.616-625. [PDF, 147KB]

(2010) Kathrin Spreyer: Notes on the evaluation of dependency parsers obtained through cross-lingual projection. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1176-1184. [PDF, 114KB]

(2010) Lonneke van der Plas, Tanja Samardžić, & Paola Merlo: Cross-lingual validity of PropBank in the manual annotation of French. [ACL-2010] Proceedings of the Fourth Linguistic Annotation Workshop, Uppsala, Sweden, 15-16 July 2010; pp.113-117. [PDF, 102KB]

(2010) Deyi Xiong, Min Zhang, Aiti Aw, & Haizhou Li: Linguistically annotated reordering: evaluation and analysis. Computational Linguistics 36 (3), pp. 535-568 [PDF, 745KB]

(2010) Omar F.Zaidan & Juri Ganitkevitch: An enriched MT grammar for under $100. Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon’s Mechanical Turk, Los Angeles, CA, June 2010; pp.93-98. [PDF, 186KB]

Antonyms

(2011) Yuval Marton, Ahmed El Kholy, & Nizar Habash: Filtering antonymous, trend-contrasting, and polarity-dissimilar distributional paraphrases for improving statistical machine translation. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.237-249. [PDF, 177KB]

Articles

(2011) Sara Stymne: Definite noun phrases in statistical machine translation into Scandinavian languages. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.289-296. [PDF, 336KB]; presentation, 50 slides [PDF]

Artificial intelligence and MT

(2010) Wilker Aziz, Marc Dymetman, Shachar Mirkin, Lucia Specia, Nicola Cancedda, & Ido Dagan: Learning an expert from human annotations in statistical machine translation: the case of out-of-vocabulary words. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 585KB]; presentation: 21 slides [PDF, 777KB]

ASR see Speech recognition

Automatic evaluation see Evaluation measures and metrics

Bayesian methods

(2011) Nan Duan, Mu Li, Ming Zhou, & Lei Cui: Improving phrase extraction via MBR phrase scoring and pruning. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.189-197. [PDF, 302KB]

(2011) Kevin Duh, Katsuhito Sudoh, Tomoharu Iwata, & Hajime Tsukada: Alignment inference and Bayesian adaptation for machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.114-121. [PDF, 117KB]

(2011) Kevin Duh, Katsuhito Sudoh, Xianchao Wu, Hajime Tsukada, & Masaaki Nagata: Generalized minimum Bayes risk system combination. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.1356-1360. [PDF, 162KB]

(2011) Jesús González-Rubio, Alfons Juan, & Francisco Casacuberta: Minimum Bayes-risk system combination. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1268-1277. [PDF, 174KB]

(2011) Jesús González-Rubio & Francisco Casacuberta: The UPV-PRHLT combination system for WMT 2011. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.140-144. [PDF, 143KB]

(2011) Yun Huang, Min Zhang, & Chew Lim Tan: Nonparametric Bayesian machine transliteration with asynchronous adaptor grammars. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.534-539. [PDF, 162KB]

(2011) Zhifei Li, Jason Eisner, Ziyuan Wang, Sanjeev Khudanpur, & Brian Roark: Minimum imputed risk: unsupervised discriminative training for machine translation. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.920-929. [PDF, 236KB]

(2011) Coşkun Mermer & Murat Saraçlar: Bayesian word alignment for statistical machine translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.182-187. [PDF, 140KB]

(2011) Toshiaki Nakazawa & Sadao Kurohashi: Bayesian subtree alignment model based on dependency trees. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.794-802. [PDF, 790KB]

(2011) Baskaran Sankaran, Gholamreza Haffari, & Anoop Sarkar: Bayesian extraction of minimal SCFG rules for hierarchical phrase-based translation. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.533-541. [PDF, 148KB]

(2011) Yinggong Zhao, Shujie Liu, Yangsheng Ji, Jiajun Chen, & Guodong Zhou: Transductive minimum error rate training for statistical machine translation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.641-648. [PDF, 214KB]

(2010) Abhishek Arun, Barry Haddow, & Philipp Koehn: A unified approach to minimum risk training and decoding. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 365-374. [PDF, 225KB]

(2010) Graeme Blackwood, Adrià de Gispert, & William Byrne: Efficient path counting transducers for minimum Bayes-risk decoding of statistical machine translation lattices. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.27-32. [PDF, 288KB]

(2010) Graeme Blackwood, Adrià de Gispert & William Byrne: Fluency constraints for minimum Bayes-risk decoding of statistical machine translation lattices. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.71-79. [PDF, 547KB]

(2010) Boxing Chen, George Foster, & Roland Kuhn: Fast consensus hypothesis regeneration for machine translation. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 11-16. [PDF, 120KB]

(2010) Jinhua Du, Pavel Pecina, & Andy Way: An augmented three-pass system combination framework: DCU combination system for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 290-295. [PDF, 550KB]

(2010) Nan Duan, Mu Li, Dongdong Zhang, & Ming Zhou: Mixture model-based minimum Bayes risk decoding using multiple machine translation systems. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.313-321. [PDF, 372KB]

(2010) Vladimir Eidelman, Chris Dyer, & Philip Resnik: The University of Maryland statistical machine translation system for the Fifth Workshop on Machine Translation.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 72-76. [PDF, 115KB]

(2010) Christian Hardmeier, Arianna Bisazza, & Marcello Federico: FBK at WMT 2010: word lattices for morphological reduction and chunk-based reordering.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 88-92. [PDF, 132KB]

(2010) Sergio Penkale, Rejwanul Haque, Sandipan Dandapat, Pratyush Banerjee, Ankit K.Srivastava, Jinhua Du, Pavel Pecina, Sudip Kumar Naskar, Mikel L.Forcada, & Andy Way: MaTrEx: the DCU MT system for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 143-148. [PDF, 535KB]

(2010) Juan Pino, Gonzalo Iglesias, Adrià de Gispert, Graeme Blackwood, Jamie Brunning, & William Byrne: The CUED HiFST system for the WMT10 translation shared task. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 155-160. [PDF, 97KB]

(2010) Sami Virpioja, Jaako Väyrynen, André Mansikkaniemi, & Mikko Kurimo: Applying morphological decomposition to statistical machine translation. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 195-200. [PDF, 116KB]

Bilingual corpora [see also Example-based methods, Multilingual corpora]

(2011) Vamshi Ambati, Sanjika Hewavitharana, Stephan Vogel, & Jaime Carbonell: Active learning with multiple annotations for comparable data classification task. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.69-77. [PDF, 201KB]

(2011) Alexandra Antonova & Alexey Misyurev: Building a web-based parallel corpus and filtering out machine-translated text. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.136-144. [PDF, 217KB]

(2011) Elizabeth Baran & Nianwen Xue: Singular or plural? Exploiting parallel corpora for Chinese number prediction. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.207-214. [PDF, 187KB]

(2011) Luciano Barbosa, Srinivas Bangalore, & Vivek Kumar Sridhar Rangarajan: Crawling back and forth: using back and out links to locate bilingual sites. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.429-437. [PDF, 192KB]

(2011) Caroline Barrière & Pierre Isabelle: Searching parallel corpora for contextually equivalent terms. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.105-112. [PDF, 149KB]

(2011) Anja Belz & Eric Kow: Unsupervised alignment of comparable data and text resources. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.102-109. [PDF, 143KB]

(2011) Shane Bergsma, David Yarowsky, & Kenneth Church: Using large monolingual and bilingual corpora to improve coordination disambiguation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1346-1355. [PDF, 157KB]

(2011) Bruno Cartoni, Sandrine Zufferey, Thomas Meyer, & Andrei Popescu-Belis: How comparable are parallel corpora? Measuring the distribution of general vocabulary and connectives. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.78-86. [PDF, 153KB]

(2011) Mauro Cettolo, Nicola Bertoldi, & Marcello Federico: Bootstrapping Arabic-Italian SMT through comparable texts and pivot translation. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.249-256. [PDF, 354KB]; presentation, 12 slides [PDF]

(2011) Wenliang Chen, Jun’ichi Kazama, Min Zhang, Yoshimasa Tsuruoka, Yujie Zhang, Yiou Wang, Kentaro Torisawa, & Haizhou Li: SMT helps bitext dependency parsing. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.73-83. [PDF, 583KB]

(2011) Guy De Pauw, Peter Waiganjo Wagacha, & Gilles-Maurice de Schryver: Towards English-Swahili machine translation. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 23-27 January, 2011; 2pp. [PDF, 77KB]

(2011) Ali El-Kahky, Kareem Darwish, Ahmed Saad Aldein, Mohamed Abd El-Wahab, Ahmed Hefny, & Waleed Ammar: Improved transliteration mining using graph reinforcement. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.1384-1393. [PDF, 939KB]

(2011) Darja Fišer, Nikola Ljubešić, Špela Vintar, & Senja Pollak: Building and using comparable corpora for domain-specific bilingual lexicon extraction. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.19-26. [PDF, 287KB]

(2011) Ruiji Fu, Bing Qin, & Ting Liu: Generating Chinese named entity data from a parallel corpus. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.264-272. [PDF, 450KB]

(2011) Souhir Gahbiche-Braham, Hélène Bonneau-Maynard, & François Yvon: Two ways to use a noisy parallel news corpus for improving statistical machine translation. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.44-51. [PDF, 176KB]

(2011) Juri Ganitkevitch, Chris Callison-Burch, Courtney Napoles, & Benjamin Van Durme: Learning sentential paraphrases from bilingual parallel corpora for text-to-text generation. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.1168-1179. [PDF, 311KB]

(2011) Qin Gao & Stephan Vogel: Corpus expansion for statistical machine translation with semantic role label substitution rules. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.294-298. [PDF, 151KB]

(2011) Petter Haugereid & Francis Bond: Extracting transfer rules for multiword expressions from parallel corpora. Proceedings of the Workshop on Multiword Expressions: from Parsing and Generation to the Real World (MWE 2011), Portland, Oregon, USA, 23 June 2011; pp.92-100. [PDF, 228KB]

(2011) Amir Hazem, Emmanuel Morin & Sebastian Peña Saldarriaga: Bilingual lexicon extraction from comparable corpora as metasearch. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.35-43. [PDF, 159KB]

(2011) Carlos A.Henríquez Q., José B.Mariño, & Rafael E.Banchs: Deriving translation units using small additional corpora. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.121-128. [PDF, 438KB]

(2011) Sanjika Hewavitharana & Stephan Vogel: Extracting parallel phrases from comparable data. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.61-68. [PDF, 217KB]

(2011) Masamichi Ideue, Kazahide Yamamoto, Masao Utiyama, & Eiichiro Sumita: A comparison of unsupervised bilingual term extraction methods using phrase tables. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.346-351. [PDF, 160KB]

(2011) Kevin Knight: Putting a value on comparable data [abstract]. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; p.1. [PDF, 92KB]

(2011) Kriste Krstovski & David A.Smith: A minimally supervised approach for detecting and ranking document translation pairs.  [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.207-216. [PDF, 389KB]

(2011) Els Lefever, Véronique Hoste, & Martine De Cock: ParaSense or how to use parallel corpora for word sense disambiguation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.317-322. [PDF, 119KB]

 (2011) Bo Li, Eric Gaussier, & Akiko Aizawa: Clustering comparable corpora for bilingual lexicon extraction. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.473-478. [PDF, 216KB]

(2011) Feifan Liu, Fei Liu, & Yang Liu: Learning from Chinese-English parallel data for Chinese tense prediction. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.1116-1124. [PDF, 556KB]

(2011) Yashar Mehdad, Matteo Negri, & Marcello Federico: Using bilingual parallel corpora for cross-lingual textual entailment. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1336-1345. [PDF, 236KB]

(2011) Emmanuel Morin & Emmanuel Prochasson: Bilingual lexicon extraction from comparable corpora enhanced with parallel corpora. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.27-34. [PDF, 125KB]

(2011) Sara Morrissey: Body at work: using corpora in sign language machine translation. International Workshop on Sign Language Translation and Avatar Technology (SLTAT), 10-11 January 2011, Federal Ministry of Labour and Social Affairs, Berlin, Germany; 20 slides [PDF of PPT, 2584KB]

(2011) Alexandre Patry & Philippe Langlais: Identifying parallel documents from a large bilingual collection of texts: application to parallel article extraction in Wikipedia. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.87-95. [PDF, 147KB]

(2011) Marion Potet, Raphaël Rubino, Benjamin Lecouteux, Stéphane Huet, Hervé Blanchon, Laurent Besacier, & Fabrice Lefèvre: The LIGA (LIG/LIA) machine translation system for WMT 2011. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.440-446. [PDF, 92KB]

(2011) Emmanuel Prochasson & Pascale Fung: Rare word translation extraction from aligned comparable documents. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1327-1335. [PDF, 157KB]

(2011) Spencer Rarrick, Chris Quirk, & Will Lewis: MT detection in web-scraped parallel corpora. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.422-429. [PDF, 323KB]

(2011) Markus Saers & Dekai Wu: Principled induction of phrasal bilexica. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.313-320. [PDF, 373KB]

(2011) Hassan Sajjad, Nadir Durrani, Helmut Schmid, & Alexander Fraser: Comparing two techniques for learning transliteration models using a parallel corpus. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.129-137. [PDF, 605KB]

 (2011) Matthew Snover, Xiang Li, Wen-Pin Lin, Zheng Chen, Suzanne Tamang, Mingmin Ge, Adam Lee, Qi Li, Hao Li, Sam Anzaroot, & Heng Ji: Cross-lingual slot filling from comparable corpora. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.110-119. [PDF, 271KB]

(2011) Kaveh Taghipour, Shahram Khadivi, & Jia Xu: Parallel corpus refinement as an outlier detection algorithm. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.414-421. [PDF, 213KB]

(2011) Ivan Vulić, Wim De Smet, & Marie-Francine Moens: Identifying word translations from comparable corpora using latent topic models. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.479-484. [PDF, 153KB]

(2011) Jia Xu & Weiwei Sun: Generating virtual parallel corpus: a compatibility centric method. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.406-413. [PDF, 859KB]

(2010) Takeshi Abekawa, Masao Utiyama, Eiichiro Sumita, & Kyo Kageura: Community-based construction of draft and final translation corpus through a translation hosting site Minna no Hon’yaku (MNH).  LREC 2010: proceedings of the seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3662-3669. [PDF, 1695KB]

(2010) Lars Ahrenberg: Alignment-based profiling of Europarl data in an English-Swedish parallel corpus. LREC 2010: proceedings of the seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3398-3404. [PDF, 349KB]

(2010) José João Almeida & Alberto Simões: Automatic parallel corpora and bilingual terminology extraction from parallel websites. [LREC 2010] Proceedings of the 3rd Workshop on Building and Using Comparable Corpora, Malta, 22 May 2010; pp.50-55. [PDF, 257KB]

(2010) Vamshi Ambati, Stephen Vogel, & Jaime Carbonell: Active learning and crowd-sourcing for machine translation. LREC 2010: proceedings of the seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2169-2174. [PDF, 436KB]

(2010) Vamshi Ambati & Stephan Vogel: Can crowds build parallel corpora for machine translation systems? Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon’s Mechanical Turk, Los Angeles, CA, June 2010; pp.62-65. [PDF, 89KB]

(2010) Marianna Apidianaki & Yifan He: An algorithm for cross-lingual sense-clustering tested in a MT evaluation setting. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.219-226. [PDF, 474KB]

(2010) Ondřej Bojar, Adam Liška, & Zdeněk Žabokrtský: Evaluating utility of data sources in a large parallel Czech-English corpus CzEng 0.9. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.447-452. [PDF, 359KB]

(2010) Ondřej Bojar, Pavel Straňák, & Daniel Zeman: Data issues in English-to-Hindi machine translation. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1771-1777. [PDF, 557KB]

(2010) Fabienne Braune & Alexander Fraser: Improved unsupervised sentence alignment for symmetrical and asymmetrical parallel corpora. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.81-89. [PDF, 228KB]

(2010) David Burkett, Slav Petrov, John Blitzer, & Dan Klein: Learning better monolingual models with unannotated bilingual text. CoNLL-2010: Fourteenth Conference on Computational Natural Language Learning, Proceedings of the conference, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp.46-54. [PDF, 431KB]

(2010) Mauro Cettolo, Marcello Federico, & Nicola Bertoldi: Mining parallel fragments from comparable texts. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.227-234. [PDF, 393KB]

(2010) Diptesh Chatterjee, Sudeshna Sarkar, & Arpit Mishra: Co-occurrence graph based iterative bilingual lexicon extraction from comparable corpora. [Coling 2010] Proceedings of the 4th Workshop on Cross Lingual Information Access, Beijing, China, 28 August 2010; pp.35-42. [PDF, 618KB]

(2010) Chen Yuncong & Pascale Fung: Unsupervised synthesis of multilingual Wikipedia articles. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.197-205. [PDF, 1040KB]

(2010) Hercules Dalianis, Hao-chun Xing, & Xin Zhang: Creating a reusable English-Chinese parallel corpus for bilingual dictionary construction. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1700-1705. [PDF, 409KB]

(2010) Do Thi Ngoc Diep, Laurent Besacier, & Eric Castelli: A fully unsupervised approach for mining parallel data from comparable corpora. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 856KB]; presentation: 30 slides [PDF, 2621KB]

(2010) Andreas Eisele & Jia Xu: Improving machine translation performance using comparable corpora.  [LREC 2010] Proceedings of the 3rd Workshop on Building and Using Comparable Corpora, Malta, 22 May 2010; pp.35-41. [PDF, 104KB]

(2010) Yanhui Feng, Yu Hong, Zhenxiang Yan, Jianmin Yao, & Qiaoming Zhu: A novel method for bilingual web page acquisition from search engine web records. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.294-302. [PDF, 184KB]

(2010) Mark Fishel & Heiki-Jaan Kaalep: CorporAl: a method and tool for handling overlapping parallel corpora. Fifth Machine Translation Marathon, 13-18 September, University of Le Mans; Prague Bulletin of Mathematical Linguistics, no.94, September 2010; pp.67-76. [PDF, 149KB]; presentation [PDF, 1037KB]

(2010) Pascale Fung, Emmanuel Prochasson, & Simon Shi: Trillions of comparable documents. [LREC 2010] Proceedings of the 3rd Workshop on Building and Using Comparable Corpora, Malta, 22 May 2010; pp.26-34. [PDF, 199KB]

(2010) Pablo Gamallo Otero & Isaac González López: Wikipedia as multilingual source of comparable corpora. [LREC 2010] Proceedings of the 3rd Workshop on Building and Using Comparable Corpora, Malta, 22 May 2010; pp.21-25. [PDF, 180KB]

(2010) J.González-Rubio, J.Civera, A.Juan, & F.Casacuberta: Saturnalia: a Latin-Catalan parallel corpus for statistical MT. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3405-3408. [PDF, 261KB]

(2010) Degen Huang, Lian Zhao, Lishuang Li, & Haitao Yu: Mining large-scale comparable corpora from Chinese-English news collections. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.472-480. [PDF, 255KB]

(2010) Hiroyuki Kaji, Takashi Tsunakawa, & Daisuke Okada: Using comparable corpora to adapt a translation model to domains. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2182-2188. [PDF, 432KB]

(2010) Philipp Koehn & Jean Senellart: Fast approximate string matching with suffix arrays and A* parsing. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 9pp. [PDF, 178KB]

(2010) Audrey Laroche & Philippe Langlais: Revisiting context-based projection methods for term-translation spotting in comparable corpora. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.617-625. [PDF, 224KB]

(2010) Lianhau Lee, Aiti Aw, Min Zhang, & Haizhou Li: EM-based hybrid model for bilingual terminology extraction from comparable corpora. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.639-646. [PDF, 114KB]

(2010) Bo Li & Eric Gaussier: Improving corpus comparability for bilingual lexicon extraction from comparable corpora. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.644-652. [PDF, 276KB]

(2010) Bin Lu, Tao Jiang, Kapo Chow, & Benjamin K. Tsou: Building a large English-Chinese parallel corpus from comparable patents and its experimental application to SMT.  [LREC 2010] Proceedings of the 3rd Workshop on Building and Using Comparable Corpora, Malta, 22 May 2010; pp.42-49. [PDF, 271KB]

(2010) Sara Morrissey, Harold Somers, Robert Smith, Shane Gilchrist & Sandipan Dandapat: Building a sign language corpus for use in machine translation. [LREC 2010] 4th Workshop on the Representation and Processing of Sign Languages: Corpora and Sign Language  Technologies, Malta, May 2010; pp.172-177. [PDF, 588KB]

(2010) Smruthi Mukund, Debanjan Ghosh, & Rohini K.Srihari: Using cross-lingual projections to generate semantic role labeled corpus for Urdu – a resource poor language. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.797-805. [PDF, 300KB]

(2010) Masaki Murata, Tomohiro Ohno, Shigeki Matsubara, & Yasuyoshi Inagaki: Construction of chunk-aligned bilingual lecture corpus for simultaneous machine translation. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1765-1770. [PDF, 398KB]

(2010) Matteo Negri & Yashar Mehdad: Creating bi-lingual entailment corpus through translations with Mechanical Turk: $100 for a 10-day rush. Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon’s Mechanical Turk, Los Angeles, CA, June 2010; pp.212-216. [PDF, 101KB]

 (2010) Mike O’Malley: The challenges of distributed parallel corpora. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; abstract

(2010) Paula Paiva: Corpus representativeness in the selection of medical terms to be used in translation memory tools [abstract]. UCCTS 2010: Using Corpora in Contrastive and Translation Studies, Edge Hill University, UK, 27-29 July 2010; p.15. [PDF, 91KB]

(2010) Jocelyn Phillips, Carol Van Ess-Dykema, Timothy Allison & Laurie Gerber: Parallel corpus development at NVTC. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 7pp. [PDF, 173KB]; abstract

(2010) John C.Platt, Kristina Toutanova, & Wen-tau Yih: Translingual document representations from discriminative projections. [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.251-261. [PDF, 319KB]

(2010) Reinhard Rapp & Michael Zock: Utilizing citations of foreign words in corpus-based dictionary generation. [Coling 2010] Proceedings  of the Second Workshop on NLP Challenges in the Information Explosion Era, Beijing, China, 28 August 2010; pp.50-59. [PDF, 188KB]

(2010) Gudrun Rawoens: Multilingual corpora in cross-lingusitic research: focus on the compilation of a Dutch-Swedish parallel corpus. JADT 2010: 10th International Conference on Statistical Analysis of Textual Data, 9-11 juin 2010, Rome, Italie; pp.1287-1294. [PDF, 417KB]

(2010) Gábor Recski, András Rung, Attila Zséder, & András Kornai: NP alignment in bilingual corpora. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3379-3382. [PDF, 305KB]

(2010) Inguna Skadiņa, Andrejs Vasiļjevs, Raivis Skadiņš, Robert Gaizauskas, Dan Tufiş, & Tatiana Gornostay: Analysis and evaluation of comparable corpora for under resourced areas of machine translation. [LREC 2010] Proceedings of the 3rd Workshop on Building and Using Comparable Corpora, Malta, 22 May 2010; pp.6-14. [PDF, 318KB]

(2010) Jason R.Smith, Chris Quirk, & Kristina Toutanova: Extracting parallel sentences from comparable corpora using document level alignment. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.403-411. [PDF, 313KB]

(2010) Yulia Tsvetkov & Shuly Wintner: Automatic acquisition of parallel corpora from websites with dynamic content.  LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3389-3392. [PDF, 447KB]

(2010) Yulia Tsvetkov & Shuly Wintner: Extraction of multi-word expressions from small parallel corpora. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1256-1264. [PDF, 257KB]

(2010) Jakob Uszkoreit, Jay M.Ponte, Ashok C.Popat, & Moshe Dubiner: Large scale parallel document mining for machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.1101-1109. [PDF, 241KB]

(2010) Tom Vanallemeersch: Belgisch Staatsblad corpus: retrieving French-Dutch sentences from official documents. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3413-3416. [PDF, 273KB]

(2010) Sina Zarrieß, Aoife Cahill, Jonas Kuhn, & Christian Rohrer: Cross-lingual induction of deep broad-coverage syntax: a case study on German participles. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1426-1434. [PDF, 106KB]

Bi-text see Bilingual corpora

Book reviews

(2011) Marcello Federico: [review] Cross-language information retrieval [by] Jian-Yun Nie. Computational Linguistics 37 (2), pp. 411-412 [PDF, 37KB]

(2010) Colin Cherry: Book review: Statistical machine translation, [by] Philipp Koehn. Computational Linguistics 36 (4), pp. 773-776 [PDF, 49KB]

Bracketing transduction grammar see Transduction grammars

Bridge language see Pivot language

Capitalization see Written forms

Case grammar and case frames

(2011) David Mareček, Rudolf Rosa, Petra Galuščáková, & Ondřej Bojar: Two-step translation with grammatical post-processing. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.426-432. [PDF, 119KB]

(2011) Alex Rudnick: A resource-light approach to learning verb valencies. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 26 January, 2011; 2pp. [PDF, 54KB]; presentation: 31 slides [PDF of PPT, 697KB]

(2010) Maria Rosenberg: Lexical representation of agentive nominal compounds in French and Swedish [abstract]. CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 59KB]

Categorial grammar

(2011) Hala Almaghout, Jie Jiang, & Andy Way: CCG contextual labels in hierarchical phrase-based SMT. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.281-288. [PDF, 317KB]; presentation, 25 slides [PDF, 964KB]

(2010) Hala Almaghout, Jie Jiang, & Andy Way: CCG augmented hierarchical phrase-based machine translation. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.211-218. [PDF, 653KB]

(2010) Hala Almaghout, Jie Jiang, & Andy Way: The DCU machine translation systems for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.37-44. [PDF, 621KB]

(2010) Rejwanul Haque, Sudip Kumar Naskar, Antal van den Bosch, & Andy Way: Supertags as source language context in hierarchical phrase-based SMT. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 196KB]

(2010) Richard Moot: Semi-automated extraction of a wide-coverage type-logical grammar for French. TALN 2010. Proceedings of Traitement Automatique du Langage Naturel, 19-23 juillet 2010. Montréal, Canada. 6pp. [PDF, 60KB]

(2010) Richard Moot: Wide-coverage Freench syntax and semantics using Grail. TALN 2010. Proceedings of Traitement Automatique du Langage Naturel, 19-23 juillet 2010. Montréal, Canada. 4pp. [PDF, 121KB]

Character alignment

(2011) Chenhui Chu, Toshiaki Nakazawa, & Sadao Kurohashi: Japanese-Chinese phrase alignment using common Chinese characters information. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.464-471. [PDF, 347KB]

Chart parsing

(2010) Martin Čmejrek & Bowen Zhou: Two methods for extending hierarchical rules from the bilingual chart parsing. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.180-188. [PDF, 200KB]

 (2010) Petr Homola & Vladislav Kuboň: Exploiting charts in the MT between related languages [abstract]. CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 59KB]

(2010) Yang Liu & Liang Huang: Tree-based and forest-based translation. ACL 2010:Tutorial abstracts, Uppsala, Sweden, 11 July, 2010; p.2. [PDF, 58KB]

(2010) Haitao Mi & Qun Liu: Constituency to dependency translation with forests. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.1433-1442. [PDF, 190KB]

(2010) Haitao Mi, Liang Huang, & Qun Liu: Machine translation with lattices and forests. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.837-845. [PDF, 268KB]

(2010) Zhaopeng Tu, Yang Liu, Young-Sook Hwang, Qun Liu, & Shouxun Lin: Dependency forest for statistical machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.1092-1100. [PDF, 215KB]

Cleaning and filtering

(2011) Alexandra Antonova & Alexey Misyurev: Building a web-based parallel corpus and filtering out machine-translated text. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.136-144. [PDF, 217KB]

(2011) Česlav Przywara & Ondřej Bojar: eppex: epochal phrase table extraction for statistical machine translation. Sixth Machine Translation Marathon, 5-10 September 2011, Trento; Prague Bulletin of Mathematical Linguistics, no.96, October 2011; pp.89-98. [PDF, 137KB]; presentation, 24 slides [PDF of PPT, 118KB]

(2011) Spencer Rarrick, Chris Quirk, & Will Lewis: MT detection in web-scraped parallel corpora. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.422-429. [PDF, 323KB]

(2010) Hailong Cao & Eiichiro Sumita: Filtering syntactic constraints for statistical machine translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.17-21. [PDF, 65KB]

 (2010) Jie Jiang, Andy Way, & Julie Carson-Berndsen: Lattice score based data cleaning for phrase-based statistical machine translation.  EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 631KB]

(2011) Sara Stymne: Spell checking techniques for replacment of unknown words and data cleaning for Haitian Creole SMS translation. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.470-477. [PDF, 107KB]

Closely related languages

(2011) Luís Marujo, Nuno Grazina, Tiago Luís, Wang Ling, Luísa Coheur, & Isabel Trancoso: BP2EP – adaptation of Brazilian Portuguese texts to European Portuguese. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.129-136. [PDF, 352KB]

 (2011) Vishal Goyal & Gurpreet Singh Lehal: Hindi to Punjabi machine translation system. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, Portland, Oregon, June 19-24, 2011; pp.1-6. [PDF, 247KB]

(2011) Pim Otte & Francis M.Tyers: Rapid rule-based machine translation between Dutch and Afrikaans. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.153-160. [PDF, 317KB]

(2011) Tihomir Rangelov: Rule-based machine translation between Bulgarian and Macedonian. Proceedings of the Second International Workshop on Free/Open-Source Rule-Based Machine Translation, Barcelona, Spain, January 20-21, 2011, ed. F.Sánchez-Martínez and J.A.Pérez-Ortiz; pp.53-59. [PDF, 164KB]

(2010) Mikel L.Forcada, Boyan Ivanov Bonev, Sergio Ortiz Rojas, Juan Antonio Pérez Ortiz, Gema Ramírez Sánchez, Felipe Sánchez Martínez, Carme Armentano-Oller, Marco A.Montava, & Francis M.Tyers: Documentation of the open-source shallow-transfer machine translation platform Apertium; ed. Mireia Ginestí Rosell. Departament de Llenguatges i Sistemes Informàtics, Universitat d’Alacant, March 10, 2010; 214pp. [PDF, 700KB]

 (2010) Petr Homola & Vladislav Kuboň: Exploiting charts in the MT between related languages [abstract]. CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 59KB]

(2010) Jernej Vičič & Petr Homola: Speeding up the implementation process of a shallow transfer machine translation system. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 624KB]

Code switching see Dialects and registers

Cognates

 (2011) Taylor Berg-Kirkpatrick & Dan Klein: Simple effective decipherment via combinatorial optimization. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.313-321. [PDF, 299KB]

(2011) Mirabela Navlea & Amalia Todiraşcu: Cognate identification for a French-Romanian lexical alignment system: empirical study. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.145-152. [PDF, 271KB]

Cohesion see Reference

Collocations see Multiword expressions

Combination systems (see also Multi-engine systems)

(2011) Eleftherios Avramidis: DFKI system combination with sentence ranking at ML4HMT-2011. [LIHMT] ML4HMT 2011: Shared Task on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid Machine Translation, 19th November, Barcelona Media, Barcelona; pp.99-103. [PDF, 260KB]

(2011) Loïc Barrault: MANY improvements for WMT’11. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.135-139. [PDF, 151KB]

(2011) LoïcBarrault & Patrik Lambert: Machine translation system combination with MANY for ML4HMT. [LIHMT] ML4HMT 2011: Shared Task on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid Machine Translation, 19th November, Barcelona Media, Barcelona; pp.89-92. [PDF, 344KB]

(2011) Jacob Devlin, Antti-Veikko I.Rosti, Shankar Ananthakrishnan, & Spyros Matsoukas: System combination using discriminative cross-adaptation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.667-675. [PDF, 627KB]

(2011) Nan Duan, Mu Li, & Ming Zhou: Hypothesis mixture decoding for statistical machine translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1258-1267. [PDF, 1027KB]

(2011) Kevin Duh, Katsuhito Sudoh, Xianchao Wu, Hajime Tsukada, & Masaaki Nagata: Generalized minimum Bayes risk system combination. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.1356-1360. [PDF, 162KB]

(2011) Sarah Ebling, Andy Way, Martin Volk, & Sundip Kumar Naskar: Combining semantic and syntactic generalization in example-based machine translation. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.209-216. [PDF, 354KB]; presentation, 37 slides [PDF]

(2011) Christian Federmann, Sabine Hunsicker, Yu Chen, & Rui Wang: DFKI system combination using syntactic information at ML4HMT-2011. [LIHMT] ML4HMT 2011: Shared Task on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid Machine Translation, 19th November, Barcelona Media, Barcelona; pp.104-109. [PDF, 294KB]

(2011) Markus Freitag, Gregor Leusch, Joern Wuebker, Stephan Peitz, Hermann Ney, Teresa Herrmann, Jan Niehues, Alex Waibel, Alexandre Allauzen, Gilles Adda, Josep Maria Crego, Bianka Buschbeck, Tonio Wandmacher, & Jean Senellart: Joint WMT submission of the QUAERO project. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.358-364. [PDF, 107KB]

(2011) Jesús González-Rubio, Alfons Juan, & Francisco Casacuberta: Minimum Bayes-risk system combination. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1268-1277. [PDF, 174KB]

(2011) Jesús González-Rubio & Francisco Casacuberta: The UPV-PRHLT combination system for WMT 2011. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.140-144. [PDF, 143KB]

(2011) Kenneth Heafield & Alon Lavie: CMU system combination in WMT 2011. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.145-151. [PDF, 108KB]

(2011) Gregor Leusch, Markus Freitag & Hermann Ney: The RWTH system combination system for WMT 2011. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.152-158. [PDF, 205KB]

(2011) Yupeng Liu, Tiejun Zhao, & Sheng Li: Hypergraph training and decoding of system combination in SMT. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.570-577 [PDF, 235KB]

(2011) Wei-Yun Ma & Kathleen McKeown: System combination for machine translation based on text-to-text generation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.546-553. [PDF,

(2011) Tsuyoshi Okita & Josef van Genabith: DCU confusion network-based system combination for ML4HMT. [LIHMT] ML4HMT 2011: Shared Task on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid Machine Translation, 19th November, Barcelona Media, Barcelona; pp.93-98. [PDF, 377KB]

 (2011) Antti-Veikko I.Rosti, Bing Zhang, Spyros Matsoukas, & Richard Schwartz: Expected BLEU training for graphs: BBN system description for WMT11 system combination task. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.159-165. [PDF, 147KB]

 (2011) Rico Sennrich: The UZH system combination system for WMT 2011. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.166-170.  [PDF, 104KB]

 (2011) Linfeng Song, Haitao Mi, Yajuan Lü, & Qun Liu: Bagging-based system combination for domain adaptation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.293-298. [PDF,

 (2011) Taro Watanabe & Eiichiro Sumita: Machine translation system combination by confusion forest. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1249-1257. [PDF, 451KB]

 (2011) Daguang Xu, Yuan Cao, & Damianos Karakos: Description of the JHU system combination scheme for WMT 2011.  [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.171-176. [PDF, 123KB]

 (2011) Jia Xu, Hans Uszkoreit, Casey Kennington, David Vilar, & Xiaojun Zhang: DFKI hybrid machine translation system for WMT 2011 – on the integration of SMT and RBMT. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.485-489. [PDF, 92KB]

(2010) Loïc Barrault: Open source machine translation system combination. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.147-155. [PDF, 163KB]

(2010) Loïc Barrault: MANY: open source MT system combination at WMT’10. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 271-275. [PDF, 101KB]

(2010) Ergun Biçici & S.Serdar Kozat: Adaptive model weighting and transductive regression for predicting best system combinations. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 276-281. [PDF, 181KB]

(2010) Ergun Biçici & Deniz Yuret: L1 regularized regression for reranking and system combination in machine translation. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 282-289. [PDF, 197KB]

(2010) Josep Maria Crego, Aurélien Max, & François Yvon: Local lexical adaptation in machine translation through triangulation: SMT helping SMT. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.232-240. [PDF, 197KB]

(2010) Lei Cui, Dongdong Zhang, Mu Li, Ming Zhou, & Tiejun Zhao: Hybrid decoding: decoding with partial hypotheses combination over multiple SMT systems. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.214-222. [PDF, 497KB]

(2010) John DeNero, Shankar Kumar, Ciprian Chelba, & Franz Och: Model combination for machine translation. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.975-983. [PDF, 573KB]

(2010) Jinhua Du & Andy Way: Using TERp to augment the system combination for SMT. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 9pp. [PDF, 177KB]

(2010) Jinhua Du, Pavel Pecina, & Andy Way: An augmented three-pass system combination framework: DCU combination system for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 290-295. [PDF, 550KB]

(2010) Nan Duan, Hong Sun, & Ming Zhou: Translation model generalization using probability averaging for machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.304-312. [PDF, 296KB]

(2010) Chooi-Ling Goh, Taro Watanabe, Michael Paul, Andrew Finch, & Eiichiro Sumita: The NICT translation  system for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.139-146. [PDF, 379KB]

(2010) Jesús González-Rubio, Jesús Andrés-Ferrer, Germán Sanchis-Trilles, Guillem Gascó, Pascual Martínez-Gómez, Martha-Alicia Rocha, Joan-Andreu Sánchez & Francisco Casacuberta: The UPV-PRHLT combination system for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 296-300. [PDF, 105KB]

(2010) Kenneth Heafield & Alon Lavie: Voting on n-grams for machine translation system combination. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 9pp. [PDF, 128KB]

(2010) Carmen Heger, Joern Wuebker, David Vilar, & Hermann Ney: A combination of hierarchical systems with forced alignments from phrase-based systems. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.291-297. [PDF, 377KB]

(2010) Almut Silja Hildebrand & Stephan Vogel: CMU system combination via hypothesis selection for WMT’10. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 307-310. [PDF, 90KB]

(2010) Gregor Leusch, Aurélien Max, Josep Maria Crego, & Hermann Ney: Multi-pivot translation by system combination. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.299-306. [PDF, 303KB]

(2010) Gregor Leusch & Hermann Ney: The RWTH system combination system for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 315-320. [PDF, 152KB]

(2010) Chi-Ho Li, Nan Duan, Yinggong Zhao, Shujie Liu, Lei Cui, Mei-yuh Hwang, Amittai Axelrod, Jianfeng Gao, Yaodong Zhang, & Li Deng: The MSRA machine translation system for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.135-138. [PDF, 254KB]; presentation: 17 slides [PDF of PPT, 477KB]

(2010) Saab Mansour, Stephan Peitz, David Vilar, Joern Wuebker, & Hermann Ney: The RWTH Aachen machine translation system for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.163-168. [PDF, 248KB]

(2010) Sushant Narsale: JHU system combination scheme for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 311-314. [PDF, 99KB]

(2010) Sergio Penkale, Rejwanul Haque, Sandipan Dandapat, Pratyush Banerjee, Ankit K.Srivastava, Jinhua Du, Pavel Pecina, Sudip Kumar Naskar, Mikel L.Forcada, & Andy Way: MaTrEx: the DCU MT system for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 143-148. [PDF, 535KB]

(2010) Wade Shen, Tim Anderson, Ray Slyh, & A.Ryan Aminzadeh: The MIT-LL/AFRL IWSLT-2010 MT system. Proceedings of the 7th International Workshop on Spoken L anguage Translation, 2-3 December 2010, Paris, France; pp.127-134. [PDF, 341KB]; presentation: 11 slides [PDF of PPT, 422KB]

(2010) Daniel Stein, Christoph Schmidt, & Hermann Ney: Sign language machine translation overkill. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.337-344. [PDF, 2052KB]

(2010) Antti-Veikko I.Rosti, Bing Zhang, Spyros Matsoukas, & Richard Schwartz: BBN system description for WMT10 system combination task. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 321-326. [PDF, 102KB]

(2010) Tong Xiao, Jingbo Zhu, Muhua Zhu, & Huizhen Wang: Boosting-based system combination for machine translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.739-748. [PDF, 380KB]

Compound nouns see Nouns and noun phrases

Concatenation grammar see Transduction grammars

Concordances

(2010) Alain Désilets: WeBiText: multilingual concordancer built from public high quality web content. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; abstract

Conditional random fields

(2011) Thomas Lavergne, Josep Maria Crego, Alexandre Allauzen, & François Yvon: From n-gram-based to CRF-based translation models. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.542-553. [PDF, 197KB]

(2010) Benjamin Roth, Andrew McCallum, Marc Dymetman & Nicola Cancedda: Machine translation using overlapping alignments and SampleRank. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 7pp. [PDF, 138KB]

(2010) Dong Yang, Paul Dixon, & Sadaoki Furui: Jointly optimizing a two-step conditional random field model for machine transliteration and its fast decoding algorithm. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.275-280. [PDF, 115KB]

Confidence measures

(2011) Elefterios Avramidis, Maja Popovic, David Vilar, & Aljoscha Burchardt: Evaluate with confidence  estimation: machine ranking of translation outputs using grammatical features. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.65-70. [PDF, 107KB]

(2011) Nguyen Bach, Fei Huang, & Yaser Al-Onaizan: Goodness: a method for measuring machine translation confidence. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.211-219. [PDF, 976KB]

(2011) Christian Hardmeier: Improving machine translation quality prediction with syntactic tree kernels. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.233-240. [PDF, 633KB] ; presentation, 18 slides [PDF]

(2011) Yanjun Ma, Yifan He, & Josef van Genabith: From the confidence estimation of machine translation to the integration of MT and translation memory. Tutorial at MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.13-15. [PDF, 75KB]

(2011) Lluís Màrquez & Jesús Giménez: Automatic evaluation in machine translation: towards combined linguistically-motivated measures [abstract]. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 24 January, 2011; presentation: 101 slides [PDF of PPT, 1248KB]

 (2011) Daniele Pighin & Lluís Màrquez: Automatic projection of semantic structures: an application of pairwise translation ranking. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.1-9. [PDF, 142KB]

(2011) Felipe Sánchez-Martínez: Choosing the best machine translation system to translate a sentence by using only source-language information. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.97-104. [PDF, 338KB] ; presentation, 32 slides [PDF]

(2011) Lucia Specia: Exploiting objective annotations for measuring translation post-editing effort. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.73-80. [PDF, 365KB]; presentation: 94 slides [PDF]

(2011) Lucia Specia: Linguistic indicators for quality estimation of machine translation [abstract]. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; p.14. [PDF, 139KB]

 (2011) Viet Ha-Thuc & Nicola Cancedda: Confidence-weighted learning of factored discriminitave language models. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.439-444. [PDF,

(2010) Graeme Blackwood, Adrià de Gispert & William Byrne: Fluency constraints for minimum Bayes-risk decoding of statistical machine translation lattices. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.71-79. [PDF, 547KB]

(2010) Jesús González-Rubio, Daniel Ortiz-Martínez, & Francisco Casacuberta: Balancing user effort and translation error in interactive machine translation via confidence measures. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.173-177. [PDF, 97KB]

(2010) Jesús González-Rubio, Daniel Ortiz-Martínez, & Francisco Casacuberta: On the use of confidence measures within an interactive-predictive machine translation system. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 525KB]

(2010) Yifan He, Yanjun Ma, Josef van Genabith, & Andy Way: Bridging SMT and TM with translation recommendation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.622-630. [PDF, 573KB]

(2010) Radu Soricut & Abdessamad Echihabi: TrustRank: inducing trust in automatic translations via ranking. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.612-621. [PDF, 222KB]

(2010) Lucia Specia & Jesús Giménez: Combining confidence estimation and reference-based metrics for segment-level MT evaluation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 152KB]

(2010) Deyi Xiong, Min Zhang, & Haizhou Li: Error detection for statistical machine translation using linguistic features. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.604-611. [PDF, 520KB]

Confusion networks (see also Combination systems, Lattices and networks, Multi-engine systems)

(2011) Loïc Barrault: MANY improvements for WMT’11. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.135-139. [PDF, 151KB]

(2011) LoïcBarrault & Patrik Lambert: Machine translation system combination with MANY for ML4HMT. [LIHMT] ML4HMT 2011: Shared Task on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid Machine Translation, 19th November, Barcelona Media, Barcelona; pp.89-92. [PDF, 344KB]

(2011) Jie Jiang, Jinhua Du, & Andy Way: Incorporating source-language paraphrases into phrase-based SMT with confusion networks. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.31-40. [PDF, 1325KB]

(2011) Tsuyoshi Okita & Josef van Genabith: DCU confusion network-based system combination for ML4HMT. [LIHMT] ML4HMT 2011: Shared Task on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid Machine Translation, 19th November, Barcelona Media, Barcelona; pp.93-98. [PDF, 377KB]

(2011) Antti-Veikko I.Rosti, Bing Zhang, Spyros Matsoukas, & Richard Schwartz: Expected BLEU training for graphs: BBN system description for WMT11 system combination task. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.159-165. [PDF, 147KB]

(2011) Taro Watanabe & Eiichiro Sumita: Machine translation system combination by confusion forest. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1249-1257. [PDF, 451KB]

(2011) Daguang Xu, Yuan Cao, & Damianos Karakos: Description of the JHU system combination scheme for WMT 2011.  [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.171-176. [PDF, 123KB]

(2010) Loïc Barrault: Open source machine translation system combination. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.147-155. [PDF, 163KB]

(2010) Loïc Barrault: MANY: open source MT system combination at WMT’10. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 271-275. [PDF, 101KB]

(2010) Nicola Bertoldi, Mauro Cettolo, & Marcello Federico: Statistical machine translation of texts with misspelled words. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.412-419. [PDF, 160KB]

(2010) Jinhua Du, Pavel Pecina, & Andy Way: An augmented three-pass system combination framework: DCU combination system for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 290-295. [PDF, 550KB]

(2010) Carmen Heger, Joern Wuebker, Matthias Huck, Gregor Leusch, Saab Mansour, Daniel Stein, & Hermann Ney: The RWTH Aachen machine translation system for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 93-97. [PDF, 98KB]

(2010) Gregor Leusch & Hermann Ney: The RWTH system combination system for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 315-320. [PDF, 152KB]

(2010) Zhifei Li, Ziyuan Wang, Sanjeev Khudanpur, & Jason Eisner: Unsupervised discriminative language model training for machine translation using simulated confusion sets. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.656-664. [PDF, 348KB]

(2010) Sushant Narsale: JHU system combination scheme for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 311-314. [PDF, 99KB]

(2010) Antti-Veikko I.Rosti, Bing Zhang, Spyros Matsoukas, & Richard Schwartz: BBN system description for WMT10 system combination task. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 321-326. [PDF, 102KB]

Conjunctions

(2011) Bruno Cartoni, Sandrine Zufferey, Thomas Meyer, & Andrei Popescu-Belis: How comparable are parallel corpora? Measuring the distribution of general vocabulary and connectives. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.78-86. [PDF, 153KB]

(2011) Thomas Meyer: Disambiguating temporal-contrastive discourse connectives for machine translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Student Session, Portland, Oregon, June 19-24, 2011; pp.46-51. [PDF, 114KB]

Consistency

(2011) Yifan He, Yanjun Ma, Andy Way, & Josef van Genabith: Rich linguistic features for translation memory-inspired consistent translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.456-463. [PDF, 159KB]

 (2011) Yanjun Ma, Yifan He, Andy Way, & Josef van Genabith: Consistent translation using discriminative learning: a translation memory-inspired approach. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1239-1248. [PDF, 193KB]

 (2011) Tong Xiao, Jingbo Zhu, Shujie Yao, & Hao Zhang: Document-level consistency verification in machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.131-138. [PDF, 126KB]

Constraint synchronous grammar see Context-free grammar

Context-based methods

(2011) Daniel Andrade, Takuya Matsuzaki, & Jun’ichi Tsujii: Learning the optimal use of dependency-parsing information for finding translations with comparable corpora. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.10-18. [PDF, 651KB]

(2011) Caroline Barrière & Pierre Isabelle: Searching parallel corpora for contextually equivalent terms. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.105-112. [PDF, 149KB]

(2011) Marta R.Costa-jussà & Rafael E.Banchs: The BM-I2R Haitian-Créole-to-English translation system description for the WMT 2011 evaluation campaign. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.452-456. [PDF,

(2011) Zhengxian Gong, Min Zhang, & Guodong Zhou: Cache-based document-level statistical machine translation. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.909-919. [PDF, 330KB]

(2011) Maxim Khalilov & Khalil Sima’an: Context-sensitive syntactic source-reordering by statistical translation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.38-46. [PDF, 327KB]

(2011) Jan Niehues, Teresa Herrmann, Stephan Vogel, & Alex Waibel: Wider context by using bilingual langauge models in machine translation. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.198-206. [PDF, 570KB]

(2011) Takashi Onishi, Masao Utiyama, & Eiichiro Sumita: Reordering constraint based on document-level context. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.434-438. [PDF, 128KB]

(2011) Alexandre Patry & Philippe Langlais: Going beyond word cooccurrences in global lexicon selection for statistical machine translation using a multilayer perceptron. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.658-666. [PDF, 428KB]

(2011) Avneesh Saluja, Ian Lane, & Ying Zhang: Context-aware language modeling for conversational speech translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.97-104. [PDF, 177KB]

(2011) Tong Xiao, Jingbo Zhu, Shujie Yao, & Hao Zhang: Document-level consistency verification in machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.131-138. [PDF, 126KB]

(2011) Deyi Xiong, Min Zhang, & Haizhou Li: Enhancing language models in statistical machine translation with backward n-grams and mutual information triggers. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1288-1297. [PDF, 620KB]

(2011) Jaako Väyrynen, Timo Honkela, & Marcus Dobrinkat: Context in machine translation: challenge and data set. META-NET Workshop: Context in MT, June 14, 2011; 10 slides [PDF of PPT, 146KB]

(2010) George Foster, Pierre Isabelle, & Roland Kuhn: Translating structured documents. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 8pp. [PDF, 132KB]

(2010) Rejwanul Haque, Sudip Kumar Naskar, Antal van den Bosch, & Andy Way: Supertags as source language context in hierarchical phrase-based SMT. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 196KB]

(2010) Zhongjun He, Yao Meng, & Hao Yu: Maximum entropy based phrase reordering for hierarchical phrase-based translation.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.555-563. [PDF, 144KB]

(2010) Helena de Medeiros Caseli, Bruno Akio Sugiyama, & Junia Coutinho Anacleto: Using common sense to generate culturally contextualized machine translation. Proceedings of the NAACL HLT 2010 Young Investigators Workshop on Computational Approaches to Languages of the Americas, Los Angeles, CA, June 2010; pp.24-31. [PDF, 353KB]

(2010) Sergio Penkale, Rejwanul Haque, Sandipan Dandapat, Pratyush Banerjee, Ankit K.Srivastava, Jinhua Du, Pavel Pecina, Sudip Kumar Naskar, Mikel L.Forcada, & Andy Way: MaTrEx: the DCU MT system for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 143-148. [PDF, 535KB]

(2010) Fangzhong Su & Katja Markert: Word sense subjectivity for cross-lingual lexical substitution. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.357-360. [PDF, 109KB]

(2010) Rabih Zbib, Spyros Matsoukas, Richard Schwartz, & John Makhoul: Decision trees for lexical smoothing in statistical machine translation. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the work shop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 428-437. [PDF, 2766KB]

Context-free grammars

 (2011) Qin Gao & Stephan Vogel: Utilizing target-side semantic role labels to assist hierarchical phrase-based machine translation. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.107-115. [PDF, 352KB]

 (2011) Greg Hanneman & Alon Lavie: CMU syntax-based machine translation at WMT 2011. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.365-371. [PDF, 118KB]

 (2011) Greg Hanneman, Michelle Burroughs, & Alon Lavie: A general-purpose rule extractor for SCFG-based machine translation. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.135-144. [PDF, 267KB]

 (2011) Greg Hanneman & Alon Lavie: Automatic category label coarsening for syntax-based machine translation. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.98-106. [PDF, 264KB]

(2011) Gonzalo Iglesias, Cyril Allauzen, William Byrne, Adrià de Gispert, & Michael Riley: Hierarchical phrase-based translation representations. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.1373-1383. [PDF, 334KB]

(2011) Markos Mylonakis & Khalil Sima’an: Learning hierarchical translation structure with linguistic annotations. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.642-652. [PDF, 165KB]

(2011) Baskaran Sankaran, Gholamreza Haffari, & Anoop Sarkar: Bayesian extraction of minimal SCFG rules for hierarchical phrase-based translation. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.533-541. [PDF, 148KB]

 (2011) Jonathan Weese, Juri Ganitkevitch, Chris Callison-Burch, Matt Post, & Adam Lopez: Joshua 3.0: syntax-based machine translation with the Thrax grammar extractor. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.478-484. [PDF, 164KB]

(2011) Fai Wong, Francisco Oliveira, Sam Chao, & Chi-Wai Tang: Handheld machine translation system based on constraint synchronous grammar. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.439-446. [PDF, 158KB]

 (2011) Andreas Zollmann & Stephan Vogel: A word-class approach to labeling PSCFG rules for machine translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1-11. [PDF, 230KB]

(2010) Chris Dyer & Philip Resnik: Context-free reordering, finite-state translation. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.858-866. [PDF, 272KB]

(2010) Chris Dyer, Adam Lopez, Juri Ganitkevitch, Jonathan Weese, Ferhan Ture, Phil Blunsom, Hendra Setiawan, Vladimir Eidelman, & Philip Resnik: cdec: a decoder, alignment, and learning framework for finite-state and context-free translation models. Proceedings of the ACL 2010 System Demonstrations, Uppsala, Sweden, 13 July 2010; pp.7-12. [PDF, 171KB]

(2010) Marc Dymetman & Nicola Cancedda: Intersecting hierarchical and phrase-based models of translation: formal aspects and algorithms. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28 August 2010; pp.1-9. [PDF, 354KB]

(2010) Zhongqiang Huang, Martin Čmejrek & Bowen Zhou: Soft syntactic constraints for hierarchical phrase-based translation using latent syntactic distributions.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.138-147. [PDF, 506KB]

(2010) Zhifei Li, Ziyuan Wang, Sanjeev Khudanpur, & Jason Eisner: Unsupervised discriminative language model training for machine translation using simulated confusion sets. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.656-664. [PDF, 348KB]

(2010) Yang Liu & Qun Liu: Joint parsing and translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.707-715. [PDF, 211KB]

(2010) Adam Lopez: Context-free translation models. Presentation at Fifth Machine Translation Marathon, 13-18 September 2010, University of Le Mans; 160 slides. [PDF, 3817KB]

(2010) Markos Mylonakis & Khalil Sima’an: Learning probabilistic synchronous CFGs for phrase-based translation. CoNLL-2010: Fourteenth Conference on Computational Natural Language Learning, Proceedings of the conference, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp.117-125. [PDF, 151KB]

(2010) Hwidong Na, Jin-Ji Li, Yeha Lee, & Jong-Hyeok Lee: A synchronous context free grammar using depependency sequence for syntax-based statistical machine translation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 9pp. [PDF, 184KB]

(2010) Francisco Oliveira, Fai Wong, Iok-Sai Hong & Ming-Chui Dong: Parsing extended constraint synchronous grammar in Chinese-Portuguese machine translation. International Conference on Computational Processing of Portuguese Language (PROPOR 2010), Porto Alegre, Brazil, 2010; 6pp. [PDF, 91KB]

(2010) Andreas Zollmann & Stephan Vogel: New parameterizations and features for PSCFG-based machine translation. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28  August 2010; pp.110-117. [PDF, 181KB]

Conversation see Dialogue

Coordination

(2011) Shane Bergsma, David Yarowsky, & Kenneth Church: Using large monolingual and bilingual corpora to improve coordination disambiguation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1346-1355. [PDF, 157KB]

Corpora see Bilingual corpora, Monolingual corpora, Multilingual corpora

Corpus-based MT see Context-based methods, Example-based methods, Statistical MT

Data elicitation

(2011) Sergei Nirenburg & Marjorie McShane: Morphological aspects of computer-driven elicitation of  knowledge about any language [abstract]. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 26 January, 2011; presentation: 47 slides [PDF of PPT, 1907KB]

(2010) Vamshi Ambati, Stephan Vogel & Jaime Carbonell: Active learning-based elicitation for semi-supervised word alignment. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.365-370. [PDF, 182KB]

Decomposition see Segmentation

Dependency relations

 (2011) Daniel Andrade, Takuya Matsuzaki, & Jun’ichi Tsujii: Learning the optimal use of dependency-parsing information for finding translations with comparable corpora. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.10-18. [PDF, 651KB]

 (2011) Giuseppe Attardi, Atanas Chanev, & Antonio Valerio Miceli Barone: A dependency based statistical translation model. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.79-87. [PDF, 174KB]

(2011) Igor Boguslavsky: Processing of Russian by the ETAP-3 linguistic processor. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 26 January, 2011; 2pp. [PDF, 110KB]; presentation: 48 slides [PDF of PPT, 227KB]

 (2011) Wenliang Chen, Jun’ichi Kazama, Min Zhang, Yoshimasa Tsuruoka, Yujie Zhang, Yiou Wang, Kentaro Torisawa, & Haizhou Li: SMT helps bitext dependency parsing. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.73-83. [PDF, 583KB]

(2011) Chenhui Chu, Toshiaki Nakazawa, & Sadao Kurohashi: Japanese-Chinese phrase alignment using common Chinese characters information. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.464-471. [PDF, 347KB]

(2011) Michael Gasser: Towards synchronous extensible dependency grammar. Proceedings of the Second International Workshop on Free/Open-Source Rule-Based Machine Translation, Barcelona, Spain, January 20-21, 2011, ed. F.Sánchez-Martínez and J.A.Pérez-Ortiz; pp.3-10. [PDF, 265KB]

(2011) Kevin Gimpel & Noah A.Smith: Quasi-synchronous phrase dependency grammars for machine translation. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.474-485. [PDF, 360KB]

(2011) Nathan Green: Effects of noun phrase bracketing in dependency parsing and machine translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Student Session, Portland, Oregon, June 19-24, 2011; pp.69-74. [PDF, 288KB]

 (2011) Wenjing Lang, Qiaoli Zhou, Guiping Zhang, & Dongfeng Cai: Multi-stage Chinese dependency parsing based on dependency direction. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.64-71. [PDF, 128KB]

(2011) Ryan McDonald, Slav Petrov, & Keith Hall: Multi-source transfer of delexicalized dependency parsers. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.62-72. [PDF, 284KB]

(2011) David Mareček, Rudolf Rosa, Petra Galuščáková, & Ondřej Bojar: Two-step translation with grammatical post-processing. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.426-432. [PDF, 119KB]

(2011) Hwidong Na & Jong-Hyeok Lee: Multi-word unit dependency forest-based translation rule extraction. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.41-51. [PDF, 466KB]

(2011) Toshiaki Nakazawa & Sadao Kurohashi: Bayesian subtree alignment model based on dependency trees. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.794-802. [PDF, 790KB]

(2011) Martin Popel, David Mareček, Nathan Green, & Zdeněk Žabokrtský: Influence of parser choice on dependency-based MT. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.433-439. [PDF, 113KB]

(2011) Anders Søgaard: Data point selection for cross-language adaptation of dependency parsers. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.682-686. [PDF, 124KB]

(2011) Xianchao Wu, Takuya Matsuzaki, Jun’ichi Tsujii: Effective use of function words for rule generalization in forest-based translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.22-31. [PDF, 857KB]

(2011) Xianchao Wu, Katsuhito Sudoh, Kevin Duh, Hajime Tsukada, & Masaaki Nagata: Extracting pre-ordering rules from chunk-based dependency trees for Japanese-to-English translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.300-307. [PDF, 859KB]

 (2011) Jun Xie, Haitao Mi & Qun Liu: A novel dependency-to-string model for statistical machine translation. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.216-226. [PDF, 992KB]

 (2010) Samuel Brody: Unsupervised dependency parsing via word alignment. [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.1214-1222. [PDF, 344KB]

(2010) Thoudam Doren Singh & Sivaji Bandyopadhyay: Manipuri-English bidirectional statistical machine translation systems using morphology and dependency relations. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28  August 2010; pp.83-91. [PDF, 510KB]

(2010) Thoudam Doren Singh & Sivaji Bandyopadhyay: Statistical machine translation of English-Manipuri using morpho-syntactic and semantic information. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 8pp. [PDF, 680KB]

(2010) Michael Gasser: A dependency grammar for Amharic. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, 17 May 2010, Valletta, Malta; 9pp. [PDF, 179KB]

(2010) Yvette Graham: Sulis: an open source transfer decoder for deep syntactical statistical machine translation. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.17-26. [PDF, 705KB]

(2010) Christian Hardmeier & Marcello Federico: Modelling pronominal anaphora in statistical machine translation. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.283-289. [PDF, 353KB]

(2010) Yifan He, Jinhua Du, Andy Way, & Josef van Genabith: The DCU dependency-based metric in WMT-MetricsMATR 2010. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 349-353. [PDF, 444KB]

 (2010) Elena Irimia & Alexandru Ceausu: Dependency-based translation equivalents for factored machine translation [abstract]. CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 60KB]

(2010) Jin-Ji Li, Jungi Kim, & Jong-Hyeok Lee: Chinese syntactic reordering through contrastive analysis of predicate-predicate patterns in Chinese-to-Korean SMT. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 9pp. [PDF, 817KB]

(2010) Yanjun Ma & Andy Way: HMM word-to-phrase alignment with dependency constraints. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28  August 2010; pp.101-109. [PDF, 191KB]

(2010) David Mareček, Martin Popel, & Zdeněk Žabokrtský: Maximum entropy translation model in dependency-based MT framework. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 201-206. [PDF, 254KB]

(2010) Yuval Marton, Nizar Habash, & Owen Rambow: Improving Arabic dependency parsing with lexical and inflectional morphological features. Proceedings of the NAACL HLT 2010 First Workshop on Statistical Parsing of Morphologically-Rich Languages, pages 13–21, Los Angeles, CA, USA, June 2010; pp.13-21. [PDF, 187KB]

(2010) Haitao Mi & Qun Liu: Constituency to dependency translation with forests. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.1433-1442. [PDF, 190KB]

(2010) Simon Mille & Leo Wanner: Syntactic dependencies for multilingual and multilevel corpus annotation. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1889-1896. [PDF, 395KB]

(2010) Hwidong Na, Jin-Ji Li, Yeha Lee, & Jong-Hyeok Lee: A synchronous context free grammar using depependency sequence for syntax-based statistical machine translation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 9pp. [PDF, 184KB]

(2010) Sylwia Ozdowska & Vincent Claveau: Inferring syntactic rules for word alignment through inductive logic programming. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.749-754. [PDF, 370KB]

(2010) Libin Shen, Jinxi Xu, & Ralph Weischedel: String-to-dependency statistical machine translation. Computational Linguistics 36 (4), pp. 649-671 [PDF, 233KB]

(2010) Drahomira (Johanka) Spoustová & Miroslav Spousta: Dependency parsing as a sequence labeling task. Prague Bulletin of Mathematical Linguistics, no.94, September 2010; pp.7-14. [PDF, 271KB]

(2010) Kathrin Spreyer: Notes on the evaluation of dependency parsers obtained through cross-lingual projection. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1176-1184. [PDF, 114KB]

(2010) Kathrin Spreyer, Lilja Øvrelid, & Jonas Kuhn: Training parsers on partial trees: a cross-language comparison. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1978-1985. [PDF, 285KB]

(2010) Jinsong Su, Yang Liu, Haitao Mi, Hongmei Zhao, Yajuan Lü, & Qun Liu: Dependency-based bracketing transduction grammar for statistical machine translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1185-1193. [PDF, 226KB]

(2010) Zhaopeng Tu, Yang Liu, Young-Sook Hwang, Qun Liu, & Shouxun Lin: Dependency forest for statistical machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.1092-1100. [PDF, 215KB]

(2010) Sriram Venkatapathy, Rajeev Sangal, Aravind Joshi, & Karthik Gali: A discriminative approach to dependency based statistical machine translation. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28  August 2010; pp.66-74. [PDF, 187KB]

(2010) Zhiyang Wang, Yajuan Lü, Qun Liu, & Young-Sook Hwang: Better filtration and augmentation for hierarchical phrase-based translation rules. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.142-146. [PDF, 135KB]

Dialects and registers

(2011) Luís Marujo, Nuno Grazina, Tiago Luís, Wang Ling, Luísa Coheur, & Isabel Trancoso: BP2EP – adaptation of Brazilian Portuguese texts to European Portuguese. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.129-136. [PDF, 352KB]

(2010) Yassine Benajiba & Mona Diab: A web application for dialectal Arabic text annotation. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, Valletta, Malta, 17 May 2010; pp. 91-98. [PDF, 747KB]

(2010) Mona Diab, Nizar Habash, Owen Rambow, Mohamed Altantawy, & Yassine Benajiba: COLABA: Arabic dialect annotation and processing. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, Valletta, Malta, 17 May 2010; pp. 66-74. [PDF, 747KB]

 (2010) Kathleen Egan: Cross lingual Arabic blog alerting (COLABA). AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; abstract

(2010) Hassan Sawaf: Arabic dialect handling in hybrid machine translation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 8pp. [PDF, 146KB]

Dialogue

(2011) Avneesh Saluja, Ian Lane, & Ying Zhang: Context-aware language modeling for conversational speech translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.97-104. [PDF, 177KB]

(2010) Sherri Condon, Dan Parvaz, John Aberdeen, Christy Doran, Andrew Freeman, & Marwan Awad: Evaluation of machine translation errors in English and Iraqi Arabic. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.729-735. [PDF, 644KB]

(2010) Xiangyu Duan, Rafael E.Banchs, Jun Lang, Deyi Xiong, Aiti Aw, Min Zhang, & Haizhou Li: I2R’s machine translation system for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.67-72. [PDF, 391KB]

(2010) Guillem Gascó, Vicent Alabau, Jesús Andrés-Ferrer, Jesús González-Rubio, Martha-Alicia Rocha, Germán Sanchis-Trilles, Francisco Casacuberta, Jorge González, & Joan-Andreu Sánchez: ITI-UPV system description for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.85-92. [PDF, 414KB]; presentation by Germán Sanchis-Trilles: 16 slides [PDF of PPT, 481KB]

(2010) Chooi-Ling Goh, Taro Watanabe, Michael Paul, Andrew Finch, & Eiichiro Sumita: The NICT translation  system for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.139-146. [PDF, 379KB]

(2010) Chi-Ho Li, Nan Duan, Yinggong Zhao, Shujie Liu, Lei Cui, Mei-yuh Hwang, Amittai Axelrod, Jianfeng Gao, Yaodong Zhang, & Li Deng: The MSRA machine translation system for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.135-138. [PDF, 254KB]; presentation: 17 slides [PDF of PPT, 477KB]

(2010) Wang Ling, Tiago Luís, João Graça, Luísa Coheur & Isabel Trancoso: The INESC-ID machine translation system for the IWSLT 2010.  Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.81-84. [PDF, 227KB]

(2010) Michael Paul, Marcello Federico, & Sebastian Stüker: Overview of the IWSLT 2010 evaluation campaign. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.3-27. [PDF, 558KB]

(2010) Manny Rayner, Pierrette Bouillon, Nikos Tsourakis, Johanna Gerlach, Maria Georgescul, Yukie Nakao, & Claudia Baur: A multilingual CALL game based on speech translation. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1531-1538. [PDF, 573KB]

(2010) Anne H.Schneider, Ielka van der Sluis, & Saturnino Luz: Comparing intrinsic and extrinsic evaluation of MT output in a dialogue system. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.329-336. [PDF, 448KB]

(2010) Yang Mei & Katrin Kirchhoff: Contextual modeling for meeting translation using unsupervised word sense disambiguation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.1227-1235. [PDF, 461KB]

(2010) Hao Xiong, Jun Xie, Hui Yu, Kai Liu, Wei Luo, Haitao Mi, Yang Liu, Yajuan Lü, & Qun Liu: The ICT statistical machine translation system for IWSLT 2010.  Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.73-79. [PDF, 510KB]

Dictionaries see Lexical resources

Direct translation approach

(2011) Septina Dian Larasati & Vladislav Kuboň: A study of Indonesian-to-Malaysian MT system. MALINDO 2010: the 4th International MALINDO Workshop, Depok, Indonesia, August 2nd 2010; 7pp. [PDF, 363KB]

Disambiguation (see also Word sense disambiguation)

(2011) Shane Bergsma, David Yarowsky, & Kenneth Church: Using large monolingual and bilingual corpora to improve coordination disambiguation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1346-1355. [PDF, 157KB]

(2011) Shujian Huang, Stephan Vogel, & Jiajun Chen: Dealing with spurious ambiguity in learning ITG-based word alignment. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.379-383. [PDF, 182KB]

(2011) Thomas Meyer: Disambiguating temporal-contrastive discourse connectives for machine translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Student Session, Portland, Oregon, June 19-24, 2011; pp.46-51. [PDF, 114KB]

(2011) Shoichi Yokoyama & Yuichi Takano: Investigation of translation disambiguation of verbs in patent sentences using word grouping. [MT Summit XIII] 4th Workshop on Patent Translation, Shoichi Yokoyama (ed,), Xiamen, China, September 23, 2011; pp.60-63. [PDF, 152KB]

(2010) Benoît Gaillard, Malek Boualem, & Olivier Collin: Query translation using Wikipedia-based resources for analysis and disambiguation. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 579KB]

Discontiguous relations

 (2011) Mohit Bansal, Chris Quirk, & Robert C.Moore: Gappy phrasal alignment by agreement . ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1308-1317. [PDF, 215KB]

(2011) Kevin Gimpel & Noah A.Smith: Generative models of monolingual and bilingual gappy patterns. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.512-522. [PDF, 379KB]

(2011) Wei Wei & Bo Xu: Effective use of discontinuous phrases for hierarchical phrase-based translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.397-404. [PDF, 148KB]

 (2010) Michel Galley & Christopher D.Manning: Accurate non-hierarchical phrase-based translation.  NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.966-974. [PDF, 433KB]

(2010) Shujie Liu, Chi-Ho Li, & Ming Zhou: Improved discriminative ITG alignment using hierarchical phrase pairs and semi-supervised training. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.730-738. [PDF, 1050KB]

(2010) Hwidong Na, Jin-Ji Li, Yeha Lee, & Jong-Hyeok Lee: A synchronous context free grammar using depependency sequence for syntax-based statistical machine translation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 9pp. [PDF, 184KB]

Discourse representation

(2011) Bruno Cartoni, Sandrine Zufferey, Thomas Meyer, & Andrei Popescu-Belis: How comparable are parallel corpora? Measuring the distribution of general vocabulary and connectives. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.78-86. [PDF, 153KB]

(2011) Thomas Meyer: Disambiguating temporal-contrastive discourse connectives for machine translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Student Session, Portland, Oregon, June 19-24, 2011; pp.46-51. [PDF, 114KB]

(2010) George Foster, Pierre Isabelle, & Roland Kuhn: Translating structured documents. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 8pp. [PDF, 132KB]

(2010) Jesús Giménez, Lluís Màrquez, Elisabet Comelles, Irene Castellón, & Victoria Arranz: Document-level automatic MT evaluation based on discourse representations. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 333-338. [PDF, 112KB]

(2010) Yang Mei & Katrin Kirchhoff: Contextual modeling for meeting translation using unsupervised word sense disambiguation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.1227-1235. [PDF, 461KB]

(2010) Richard Moot: Wide-coverage Freench syntax and semantics using Grail. TALN 2010. Proceedings of Traitement Automatique du Langage Naturel, 19-23 juillet 2010. Montréal, Canada. 4pp. [PDF, 121KB]

Dissertations see Theses and dissertations

Divergence see Lexical transfer, Syntactic transfer

Domain identification

(2011) Zhengxian Gong, Min Zhang, & Guodong Zhou: Cache-based document-level statistical machine translation. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.909-919. [PDF, 330KB]

(2011) Zhengxian Gong, Guodong Zhou, & Liangyou Li: Improve SMT with source-side “topic-document” distributions. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.496-501. [PDF, 297KB]

(2011) Bruno Pouliquen, Christophe Mazenc & Aldo Iorio: Tapta: a user-driven translation system for patent documents based on domain-aware statistical machine translation. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.5-12. [PDF, 342KB]; presentation, 15 slides [PDF, 1642KB]

 (2011) Ivan Vulić, Wim De Smet, & Marie-Francine Moens: Identifying word translations from comparable corpora using latent topic models. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.479-484. [PDF, 153KB]

Domain restriction, adaptation and specification

(2011) Pratyush Banerjee, Sudip Kumar Naskar, Johann Roturier, Andy Way, & Josef van Genabith: Domain adaptation in statistical machine translation of user-forum data using component-level mixture modelling. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.285-292. [PDF, 137KB]

(2011) Alexandru Ceauşu, John Tinsley, Jian Zhang, & Andy Way: Experiments on domain adaptation for patent machine translation in the PLuTO project. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.21-28. [PDF, 832KB]; presentation, 16 slides [PDF, 824KB]

(2011) Han-Bin Chen, Hen-Hsen Huang, Jengwei Tjiu, Ching-Ting Tan, & Hsin-His Chen: Identification and translation of significant patterns for cross-domain SMT applications. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.277-284. [PDF, 137KB]

(2011) Hal Daumé & Jagadeesh Jagarlamudi: Domain adaptation for machine translation by mining unseen words. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.407-412. [PDF, 80KB]

(2011) Kevin Duh, Katsuhito Sudoh, Tomoharu Iwata, & Hajime Tsukada: Alignment inference and Bayesian adaptation for machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.114-121. [PDF, 117KB]

(2011) Kevin Duh, Akinori Fujino, & Masaaki Nagata: Is machine translation ripe for cross-lingual sentiment classification?  ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.429-433. [PDF, 95KB]

(2011) Cristina España-Bonet, Ramona Enache, Adam Slaski, Aarne Ranta, Lluís Màrquez, & Meritxell Gonzàlez: Patent translation within the MOLTO project. [MT Summit XIII] 4th Workshop on Patent Translation, Shoichi Yokoyama (ed,), Xiamen, China, September 23, 2011; pp.70-78. [PDF, 217KB]

(2011) Souhir Gahbiche-Braham, Hélène Bonneau-Maynard, & François Yvon: Two ways to use a noisy parallel news corpus for improving statistical machine translation. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.44-51. [PDF, 176KB]

(2011) Monica Gavrila & Natalia Elita: Comparing corpus-based MT approaches using restricted resources. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; pp.43-49. [PDF, 304KB]

(2011) Mitesh M.Khapra, Salil Joshi, & Pushpak Bhattacharyya: It takes two to tango: a bilingual unsupervised approach for estimating sense distributions using expectation maximization. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.695-704. [PDF, 280KB]

(2011) Patrik Lambert, Holger Schwenk, Christophe Servan, & Sadaf Abdul-Rauf: Investigations on translation model adaptation using monolingual data. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.284-293. [PDF, 124KB]

(2011) Abby Levenberg, Miles Osborne, & David Matthews: Multi-stream language models for statistical machine translation.  [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.177-186. [PDF, 232KB]

(2011) John McCrae, Maurizio Espinoza, Elena Monteil-Ponsoda, Guadalupe Aguado-de-Cea, & Philipp Cimiano: Combining statistical and semantic approaches to the translation of ontologies and taxonomies. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.116-125. [PDF, 576KB]

(2011) Emmanuel Morin & Emmanuel Prochasson: Bilingual lexicon extraction from comparable corpora enhanced with parallel corpora. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.27-34. [PDF, 125KB]

(2011) Pavel Pecina, Antonio Toral, Andy Way, Vassilis Papavassiliou, Prokopis Prokopidis, & Maria Giagkou: Towards using web-crawled data for domain adaptation in statistical machine translation. [EAMT 2011]:  proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.297-304. [PDF, 363KB]; presentation, 25 slides [PDF]

(2011) Linfeng Song, Haitao Mi, Yajuan Lü, & Qun Liu: Bagging-based system combination for domain adaptation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.293-298. [PDF,

(2011) George Tambouratzis, Fotini Simistira, Sokratis Sofianopoulos, Nikos Tsimboukakis, & Marina Vassiliou: A resource-light phrase scheme for language-portable MT. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.185-1 92. [PDF, 186KB]

(2010) Pratyush Banerjee, Jinhua Du, Sudip Naskar, Baoli Li, Andy Way, & Josef van Genabith: Combining multi-domain statistical machine translation models using automatic classifiers. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 304KB]

(2010) Josep Maria Crego, Aurélien Max, & François Yvon: Local lexical adaptation in machine translation through triangulation: SMT helping SMT. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.232-240. [PDF, 197KB]

(2010) Kevin Duh, Katsuhito Sudoh, & Hajime Tsukada: Analysis of translation model adaptation in statistical machine translation.  Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.243-250. [PDF, 556KB]

 (2010) George Foster, Cyril Goutte, & Roland Kuhn: Discriminative instance weighting for domain adaptation in statistical machine translation.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.451-459. [PDF, 256KB]

(2010) Hiroyuki Kaji, Takashi Tsunakawa, & Daisuke Okada: Using comparable corpora to adapt a translation model to domains. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2182-2188. [PDF, 432KB]

(2010) Petr Knoth, Trevor Collins, Elsa Sklavounou, & Zdenek Zdrahal: Facilitating cross-language retrieval and machine translation by multilingual domain ontologies. [LREC 2010] Workshop on Supporting eLearning with Language Resources  and Semantic Data, Valletta, Malta, 22 May 2010; 42 slides. [PDF of PPT, 440KB]

(2010) William D.Lewis, Chris Wendt, & David Bullock: Achieving domain specificity in SMT without overt siloing. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2878-2883. [PDF, 328KB]

(2010) Jan Niehues & Alex Waibel: Domain adaptation in statistical machine translation using factored translation models. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 7pp. [PDF, 581KB]; presentation: 27 slides [PDF, 887KB]

(2010) Mohammad Taher Pilevar & Heshaam Faili: PersianSMT: a first attempt to English-Persian statistical machine translation. JADT 2010: 10th International Conference on Statistical Analysis of Textual Data, 9-11 juin 2010, Rome, Italie; pp.1101-1111. [PDF, 945KB]

(2010) Germán Sanchis-Trilles & Mauro Cettolo: Online language model adaptation via n-gram mixtures for statistical machine translation. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 605KB]; presentation: 22 slides [PDF, 241KB]

(2010) Germán Sanchis-Trilles, Jesús Andrés-Ferrer, Guillem Gascó, Jesús González-Rubio, Pascual Martínez-Gómez, Martha-Alicia Rocha, Joan-Andreu Sánchez, & Francisco Casacuberta: UPV-PRHLT English-Spanish system for WMT10. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 172-176. [PDF, 90KB]

(2010) Kashif Shah, Loïc Barrault, & Holger Schwenk: Translation model adaptation by resampling. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 392-399. [PDF, 176KB]

(2010) Jörg Tiedemann: Context adaptation in statistical machine translation using models with exponentially decaying cache. Proceedings of the 2010 Workshop on Domain Adaptation for Natural Language Processing, ACL 2010, Uppsala, Sweden, 15 July 2010; pp.8-15. [PDF, 148KB]

(2010) Jörg Tiedemann: To cache or not to cache? Experiments with adaptive models in statistical machine translation.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 189-194. [PDF, 127KB]

(2010) Bin Wei & Christopher Pal: Cross lingual adaptation: an experiment in sentiment classifications. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.258-262. [PDF, 109KB]

Emotional expression

 (2011) Kevin Duh, Akinori Fujino, & Masaaki Nagata: Is machine translation ripe for cross-lingual sentiment classification?  ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.429-433. [PDF, 95KB]

(2010) Carmen Banea, Rada Mihalcea, & Janyce Wiebe: Multilingual subjectivity: are more languages better? Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.28-36. [PDF, 128KB]

(2010) Christian Scheible: Sentiment translation through lexicon induction. Proceedings of the ACL 2010 Student Research Workshop, Uppsala, Sweden, 13 July 2010; pp.25-30. [PDF, 119KB]

(2010) Christian Scheible, Florian Laws, Lukas Michelbacher, & Hinrich Schütze: Sentiment translation through multi-edge graphs. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1104-1112. [PDF, 236KB]

(2010) Bin Wei & Christopher Pal: Cross lingual adaptation: an experiment in sentiment classifications. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.258-262. [PDF, 109KB]

Entailment (see also Logical aspects)

(2011) Ido Dagan & Shachar Mirkin: Textual entailment inference in machine translation [abstract]. Machine Translation and Morphologically-rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 24 January, 2011; presentation: 39 slides [PDF of PPT, 663KB]

 (2011) Yashar Mehdad, Matteo Negri, & Marcello Federico: Using bilingual parallel corpora for cross-lingual textual entailment. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1336-1345. [PDF, 236KB]

(2011) Matteo Negri, Luisa Bentivogli, Yashar Mehdad, Danilo Giampiccolo, & Alessandro Marchetti: Divide and conquer: crowdsourcing the creation of cross-lingual textual entailment corpora.  [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.670-679. [PDF, 493KB]

(2010) Yashar Mehdad, Matteo Negri, & Marcello Federico: Towards cross-lingual textual entailment.  NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.321-324. [PDF, 230KB]

(2010) Matteo Negri & Yashar Mehdad: Creating bi-lingual entailment corpus through translations with Mechanical Turk: $100 for a 10-day rush. Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon’s Mechanical Turk, Los Angeles, CA, June 2010; pp.212-216. [PDF, 101KB]

Estimation maximization see Learning and training, Statistical analysis

Evaluation measures and metrics

(2011) Takako Aikawa & Spencer Rarrick: Are numbers good enough for you? A linguistically meaningful MT evaluation method. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.332-337. [PDF, 145KB]

(2011) Elefterios Avramidis, Maja Popovic, David Vilar, & Aljoscha Burchardt: Evaluate with confidence  estimation: machine ranking of translation outputs using grammatical features. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.65-70. [PDF, 107KB]

(2011) Rafael E.Banchs & Haizhou Li: AM-FM: a semantic framework for translation quality assessment. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.153-158. [PDF, 83KB]

(2011) Luisa Bentivogli, Marcello Federico, Giovanni Moretti, & Michael Paul: Getting expert quality from the crowd for machine translation evaluation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.521-528. [PDF, 337KB]

(2011) Jan Berka, Martin Černý, & Ondřej Bojar: Quiz-based evaluation of machine translation. Prague Bulletin of Mathematical Linguistics, no.95, April 2011; pp.77-86. [PDF, 190KB]

(2011) Alexandra Birch & Miles Osborne: Reordering metrics for MT. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1027-1035. [PDF, 976KB]

(2011) Ondřej Bojar, Miloš Ercegovčević, Martin Popel, & Omar F.Zaidan: A grain of salt for the WMT manual evaluation. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.1-11. [PDF, 191KB]

(2011) Chris Callison-Burch, Philipp Koehn, Christof Monz, Omar F.Zaidan: Findings of the 2011 workshop on statistical machine translation. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.22-64. [PDF, 292KB]

(2011) Boxing Chen & Roland Kuhn: AMBER: a modified BLEU, enhanced ranking metric. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.71-77. [PDF, 148KB]

(2011) Elisabet Comelles, Irene Castellón, Jordi Atserias, Victoria Arranz, & Olivier Hamon: VERTa: exploring a multidimensional linguistically-motivated metric. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; pp.21-28. [PDF, 309KB]

(2011) Daniel Dahlmeier, Chang Liu, & Hwee Tou Ng: TESLA at WMT 2011: translation evaluation and tunable metric. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.78-84. [PDF, 197KB]

(2011) Michael Denkowski & Alon Lavie: Meteor 1.3: automatic metric for reliable optimization and evaluation of machine translation systems. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.85-91. [PDF, 119KB]

(2011) Federico Gaspari, Antonio Toral & Sudip Kumar Naskar: User-focused task-oriented MT evaluation for wikis: a case study. Proceedings of the Third Joint EM+/CNGL Workshop “Bringing MT to the User: Research Meets Translators” (JEC ’11), Luxembourg, 14 October 2011; pp.13-22. [PDF, 961KB]

(2011) Kimmo Kettunen: Packing it all up in search for a language independent MT quality measure tool [abstract]. In: Zygmunt Vetulani (ed.): Human Language Technology: Challenges for Computer Science and Linguistics: 4th Language and Technology Conference, November 6-8, Poznań, Poland (Lecture Notes in Computer Science, 6562; Heidelberg: Springer Verlag, 2011); pp.567-576. [PDF, 14KB] 

(2011) Gorka Labaka, Arantza Díaz de Ilarraza, Cristina España-Bonet, Kepa Sarsola, & Lluís Màrquez: Deep evaluation of hybrid architectures: simple metrics correlated with human judgments. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; pp.50-57. [PDF, 313KB]

(2011) Alon Lavie: Evaluating the output of  machine translation systems. Tutorial at MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; p.15. [PDF, 75KB]

(2011) Maoxi Li, Chenqing Zong, & Hwee Tou Ng: Automatic evaluation of Chinese translation output: word-level or character-level? ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.159-164. [PDF, 151KB]

(2011) Xianhua Li, Yajuan Lü, Yao Meng, Qun Liu, & Hao Yu: Feedback selecting of manually acquired rules using automatic evaluation. [MT Summit XIII] 4th Workshop on Patent Translation, Shoichi Yokoyama (ed,), Xiamen, China, September 23, 2011; pp.52-59. [PDF, 185KB]

 (2011) Chang Liu, Daniel Dahlmeier & Hwee Tou Ng: Better evaluation metrics lead to better machine translation. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.375-384. [PDF, 264KB]

 (2011) Chi-kiu Lo & Dekai Wu: MEANT: an inexpensive, high-accuracy, semi-automatic metric for evaluating translation utility via semantic frames. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.220-229. [PDF, 935KB]

 (2011) Chi-kiu Lo & Dekai Wu: A radically simple, effective annotation and alignment methodology for semantic frame based SMT and MT evaluation. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; pp.58-65. [PDF, 898KB]

(2011) Chi-kiu Lo & Dekai Wu: Structured vs. flat semantic role representations for machine translation evaluation. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.10-20. [PDF, 758KB]

(2011) Yanjun Ma, Yifan He, & Josef van Genabith: From the confidence estimation of machine translation to the integration of MT and translation memory. Tutorial at MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.13-15. [PDF, 75KB]

(2011) Matouš Macháček & Ondřej Bojar: Approximating a deep-syntactic metric for MT evaluation and tuning. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.92-98. [PDF, 130KB]

(2011) Lluís Màrquez & Jesús Giménez: Automatic evaluation in machine translation: towards combined linguistically-motivated measures [abstract]. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 24 January, 2011; presentation: 101 slides [PDF of PPT, 1248KB]

(2011) Sudip Kumar Naskar, Antonio Toral, Federico Gaspari, & Andy Way: A framework for diagnostic evaluation of MT based on linguistic checkpoints. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.529-536. [PDF, 170KB]

(2011) Kristen Parton, Joel Tetreault, Nitin Madnani, & Martin Chodorow: E-rating machine translation. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.108-115. [PDF, 108KB]

(2011) Michael Paul & Eiichiro Sumita: Translation quality indicators for pivot-based statistical MT. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.811-818. [PDF, 938KB]

(2011) Daniele Pighin & Lluís Màrquez: Automatic projection of semantic structures: an application of pairwise translation ranking. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.1-9. [PDF, 142KB]

 (2011) MajaPopović: Morphemes and POS tags for n-gram based evaluation metrics. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.104-107. [PDF, 68KB]

 (2011) Maja Popović & Aljoscha Burchardt: From human to automatic error classification for machine translation output. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.265-272. [PDF, 183KB]; presentation, 16 slides [PDF, 385KB]

 (2011) Maja Popović, David Vilar, Eleftherios Avramidis, & Aljoscha Burchardt: Evaluation without references: IBM1 scores as evaluation metrics. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.99-103. [PDF, 88KB]

 (2011) Miguel Rios, Wilker Aziz, & Lucia Specia: TINE: a metric to assess MT adequacy. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.116-122. [PDF, 132KB]

(2011) Felipe Sánchez-Martínez: Choosing the best machine translation system to translate a sentence by using only source-language information. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.97-104. [PDF, 338KB] ; presentation, 32 slides [PDF]

(2011) Raivis Skadiņš, Maris Puriņš, Inguna Skadiņa, & Andrejs Vasiļjevs: Evaluation of SMT in localization to under-resourced inflected language. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.35-40. [PDF, 287KB]; presentation, 17 slides [PDF, 796KB]

 (2011) Xingyi Song & Trevor Cohn: Regression and ranking based optimisation for sentence level machine translation evaluation. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.123-129. [PDF, 122KB]

(2011) Lucia Specia: Linguistic indicators for quality estimation of machine translation [abstract]. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; p.14. [PDF, 139KB]

(2011) Lucia Specia, Najeh Hajlaoui, Catalina Hallett, & Wilker Aziz: Predicting machine translation adequacy. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.513-520. [PDF, 193KB]

 (2011) Lucia Specia: Quality estimation for machine translation. Proceedings of the Third Joint EM+/CNGL Workshop “Bringing MT to the User: Research Meets Translators” (JEC ’11), Luxembourg, 14 October 2011; 46 slides [PDF of PPT, 749KB]

(2011) Ankit K.Srivastava, Yanjun Ma, & Andy Way: Oracle-based training for phrase-based statistical machine translation. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.169-176. [PDF, 428KB]

 (2011) Marianne Starlander & Paula Estrella: Looking for the best evaluation method for interlingua-based spoken language translation in the medical domain. Proceedings of the 8th international NLPSC workshop. Special theme: Human-machine interaction in translation, Copenhagen Business School, 20-21 August 2011; ed.Bernadette Sharp, Michael Zock, Michael Carl, Arnt Lykke Jakobsen (Copenhagen Studies in Language 41), Frederiksberg: Samfundslitteratur, 2011; pp.81-92. [PDF, 905KB]

 (2011) Hirokazu Suzuki: Automatic post-editing based on SMT and its selective application by sentence-level automatic quality evaluation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.156-163. [PDF, 725KB]

 (2011) David Talbot, Hideto Kazawa, Hiroshi Ichikawa, Jason Katz-Brown, Masakazu Seno, & Franz J.Och: A lightweight evaluation framework for machine translation reordering. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.12-21. [PDF, 371KB]

 (2011) Billy Wong & Chunyu Kit: Comparative evaluation of term informativeness measures in machine translation evaluation metrics. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.537-544. [PDF, 147KB]

 (2011) Tong Xiao, Jingbo Zhu, Shujie Yao, & Hao Zhang: Document-level consistency verification in machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.131-138. [PDF, 126KB]

 (2011) Omar F.Zaidan: MAISE: a flexible, configurable, extensible open source package for mass AI system evaluation. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.130-134. [PDF, 108KB]

 (2011) Dan Zeman, Mark Fishel, Jan Berka, & Ondřej Bojar: Addicter: what’s wrong with my translations? Sixth Machine Translation Marathon, 5-10 September 2011, Trento; Prague Bulletin of Mathematical Linguistics, no.96, October 2011; pp.79-88. [PDF, 276KB]; presentation, 20 slides [PDF of PPT, 359KB]

(2011) ACCURAT: analysis and evaluation of comparable corpora for under resourced areas of machine translation. (European Machine Translation Projects.) [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; p.323. [PDF, 287KB]

(2010) Milam Aiken & Mina Park: The efficacy of round-trip translation for MT evaluation. Translation Journal 14 (1), January 2010; 10pp. [PDF, 149KB]

(2010) Marianna Apidianaki & Yifan He: An algorithm for cross-lingual sense-clustering tested in a MT evaluation setting. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.219-226. [PDF, 474KB]

(2010) Alexandra Birch & Miles Osborne: LRscore for evaluating lexical and reordering quality in MT.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 327-332. [PDF, 123KB]

(2010) Michael Bloodgood & Chris Callison-Burch: Using Mechanical Turk to build machine translation evaluation sets. Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon’s Mechanical Turk, Los Angeles, CA, June 2010; pp.208-211. [PDF, 89KB]

(2010) Ondřej Bojar, Kamil Kos, & David Mareček: Tackling sparse data issue in machine translation evaluation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.86-91. [PDF, 132KB]

(2010) Chris Callison-Burch, Philipp Koehn, Christof Monz, Kay Peterson, Mark Przybocki, & Omar F.Zaidan: Findings of the 2010 Joint Workshop on Statistical Machine Translation and Metrics for Machine Translation. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 17-53. [PDF, 837KB]

(2010) Marine Carpuat & Mona Diab: Task-based evaluation of multiword expressions: a pilot study in statistical machine translation. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.242-245. [PDF, 92KB]

(2010) Daniel Cer, Christopher D.Manning, & Daniel Jurafsky: The best lexical metric for phrase-based statistical MT system optimization. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.555-563. [PDF, 169KB]

(2010) Michael Denkowski & Alon Lavie: Choosing the right evaluation for machine translation: an examination of annotator and automatic metric performance on human judgment tasks. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 9pp. [PDF, 151KB]

(2010) Michael Denkowski & Alon Lavie: Extending the METEOR machine translation evaluation metric to the phrase level. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.250-253. [PDF, 99KB]

(2010) Michael Denkowski & Alon Lavie: Exploring normalization techniques for human judgments of machine translation adequacy collected using Amazon Mechanical Turk. Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon’s Mechanical Turk, Los Angeles, CA, June 2010; pp.57-61. [PDF, 248KB]

(2010) Michael Denkowski & Alon Lavie: METEOR-NEXT and the METEOR paraphrase tables: improved evaluation support for five target languages. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 339-342. [PDF, 104KB]

(2010) Marcus Dobrinkat, Tero Tapiovaara, Jaakko Väyrynen, & Kimmo Kettunen: Evaluating machine translations using mNCD. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.80-85. [PDF, 145KB]

(2010) Marcus Dobrinkat, Jaako Väyrynen, Tero Tapiovaara, & Kimmo Kettunen: Normalized compression distance based measures for MetricsMATR 2010. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 343-348. [PDF, 118KB]

(2010) Jinhua Du & Andy Way: Using TERp to augment the system combination for SMT. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 9pp. [PDF, 177KB]

(2010) Hiroshi Echizen-ya & Kenji Araki: Automatic evaluation method for machine translation using noun-phrase chunking. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.108-117. [PDF, 209KB]

(2010) Mireia Farrús, Marta R.Costa-jussà, José B.Mariño, & José A.R.Fonollosa: Linguistic-based evaluation criteria to identift statistical machine translation errors. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 7pp. [PDF, 606KB]

(2010) Christian Federmann: Appraise: an open-source toolkit for manual phrase-based evaluation of translations. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1731-1734. [PDF, 368KB]

(2010) Pamela Forner, Danilo Giampiccolo, Bernardo Magnini, Anselmo Peñas, Álvaro Rodrigo, & Richard Sutcliffe: Evaluating multilingual question answering systems at CLEF. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2774-2781. [PDF, 314KB]

(2010) Jesús Giménez & Lluís Màrquez: Asiya: an open toolkit for automatic machine translation (meta-)evaluation. Fifth Machine Translation Marathon, 13-18 September, University of Le Mans; Prague Bulletin of Mathematical Linguistics, no.94, September 2010; pp.77-86. [PDF, 271KB]; presentation [PDF, 299KB]

(2010) Jesús Giménez: Empirical machine translation and its evaluation. Presentation of Best Thesis Award for 2008 at EAMT 2010: 14th Annual conference of the European Association for Machine Translation, 28 May 2010, Saint-Raphaël, France. 39 slides [PDF of PPT presentation, 496KB]

(2010) Jesús Giménez, Lluís Màrquez, Elisabet Comelles, Irene Castellón, & Victoria Arranz: Document-level automatic MT evaluation based on discourse representations. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 333-338. [PDF, 112KB]

(2010) Yvette Graham & Josef van Genabith: Factor templates for factored machine translation models. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.275-282. [PDF, 366KB]

(2010) Olivier Hamon: Is my judge a good one? LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1735-1740. [PDF, 281KB]

(2010) Yifan He, Jinhua Du, Andy Way, & Josef van Genabith: The DCU dependency-based metric in WMT-MetricsMATR 2010. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 349-353. [PDF, 444KB]

 (2010) Hideki Isozaki, Tsutomu Hirao, Kevin Duh, Katsuhito Sudoh, & Hajime Tsukada: Automatic evaluation of translation quality for distant language pairs.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.778-788. [PDF, 226KB]

(2010) Judith L. Klavans: [Panel on] Task-based evaluation methods for machine translation, in practice and theory. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; abstract

(2010) Adrien Lardilleux, Julien Gosme, & Yves Lepage: Bilingual lexicon induction: effortless evaluation of word alignment tools and production of resources for improbable language pairs. [LREC 2010]: Proceedings of the Second Workshop on African Language Technology, AFLAT 2010, Valetta, Malta; pp.252-256. [PDF, 330KB]

(2010) Alon Lavie: Evaluating the output of machine translation systems. Tutorial at AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31, 2010; 86 slides [PDF of PPT, 956KB]

(2010) Chang Liu, Daniel Dahlmeier, & Hwee Tou Ng: TESLA: translation evaluation of sentences with linear-programming-based analysis. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 354-359. [PDF, 181KB]

(2010) Chi-kiu Lo & Dekai Wu: Evaluating machine translation utility via semantic role labels. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2873-2877. [PDF, 417KB]

(2010) Chi-kiu Lo & Dekai Wu: Semantic vs. syntactic vs. n-gram structure for machine translation evaluation. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28  August 2010; pp.52-60. [PDF, 401KB]

(2010) Adam Lopez: Introduction and machine translation evaluation. Presentation at Fifth Machine Translation  Marathon, 13-18 September 2010, University of Le Mans; 329 slides. [PDF, 11,952KB]

(2010) Daniel Marcu, Kathleen Egan, Chuck Simmons, & Ning-Ning Mahlmann: Utilizing automated translation with quality scores to increase productivity. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; abstract

(2010) Aurélien Max, Josep Maria Crego, & François Yvon: Contrastive lexical evaluation of machine translation. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta,