Machine Translation Archive

Subject Index

to methods, techniques, issues and topics

Publications 2000 to 2004

 


 

Adjectives

(2001) Taiichi Hashimoto, Kosuke Nishidate, Kiyoaki Shirai, Takenobu Tokunaga & Hozumi Tanaka: Decision lists for determining adjective dependency in Japanese. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp. 151-156. [PDF, 133KB]

Alignment [see also Character alignment, Statistical analysis, Word alignment]

(2004) Oliver Bender, Richard Zens, Evgeny Matusov & Hermann Ney: Alignment templates: the RWTH SMT system. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2004], September 30 – October 1, 2004, Kyoto, Japan; pp. 79-84 [PDF, 193KB]

(2004) Christophe Chenon: Dynamical visualization of nested correspondences. Translating and the Computer 26: proceedings of the Twenty-sixth International Conference on Translating and the Computer, 18-19 November 2004, London. (London: Aslib, 2004); 17pp. [PDF, 191KB]

(2004) Percy Cheung & Pascale Fung: Sentence alignment in parallel, comparable, and quasi-comparable corpora.  LREC-2004. Workshop, 25th May 2004: The amazing utility of parallel and comparable corpora; pp. 30-33. [PDF, 323KB]

(2004) Thomas C.Chuang, Jian-Cheng Wu, Tracy Lin, Wen-chie Shei & Jason S.Chang: Bilingual sentence alignment based on punctuation statistics and lexicon. First International Joint Conference on Natural Language Processing, Hainan Island, China, March 22-24, 2004; pp.224-232. [abstract]

(2004) Lea Cyrus & Hendrik Feddes: A model for fine-grained alignment of multilingual texts. Coling 2004: Proceedings of the Workshop on Multilingual Linguistic Resources (MLR2004), August 28th 2004, University of Geneva, Switzerland; pp.15-22. [PDF, 141KB]

(2004) Hal Daumé III & Daniel Marcu: A phrase-based HMM approach to document/abstract alignment.  EMNLP-2004: Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, 25-26 July 2004, Barcelona, Spain; 8pp. [PDF, 111KB]

(2004) Pascale Fung & Percy Cheung: Multi-level bootstrapping for extracting parallel sentences from a quasi-comparable corpus.  Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 245KB]

(2004) Pascale Fung & Percy Cheung: Mining very-non-parallel corpora: parallel sentence and lexicon extraction via bootstrapping and EM. EMNLP-2004: Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, 25-26 July 2004, Barcelona, Spain; 8pp. [PDF, 276KB]

(2004) Daniel Gildea:  Dependencies vs constituents for tree-based alignment. EMNLP-2004: Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, 25-26 July 2004, Barcelona, Spain; 8pp. [PDF, 97KB]

(2004) Adrià de Gispert, José B. Mariño & Josep M. Crego: Phrase-based alignment combining corpus cooccurrences and linguistic knowledge. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2004], September 30 – October 1, 2004, Kyoto, Japan; pp. 107-114 [PDF, 178KB]

(2004) Declan Groves, Mary Hearne, & Andy Way: Robust sub-sentential alignment of phrase-structure trees. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 94KB]

(2004) Paul Kingsbury, Nianwen Xue, & Martha Palmer: Propbanking in parallel.  LREC-2004. Workshop, 25th May 2004: The amazing utility of parallel and comparable corpora; pp. 34-37. [PDF, 305KB]

(2004) Patrick Lambert & Núria Castell: Alignment of parallel corpora exploiting asymmetrically aligned phrases.  LREC-2004. Workshop, 25th May 2004: The amazing utility of parallel and comparable corpora; pp. 26-29. [PDF, 239KB]

(2004) Li Weigang, Liu Ting, Wang Zhen, & Li Sheng: Aligning bilingual corpora using sentences location information. [ACL-2004] Proceedings of the Third SIGHAN Workshop on Chinese Language Processing, 25 July 2004, Barcelona; 7pp. [PDF, 100KB]

(2004) Feifan Liu, Qianli Jin, Jun Zhao & Bo Xu: Bilingual chunk alignment based on interactional matching and probabilistic latent semantic indexing.  First International Joint Conference on Natural Language Processing, Hainan Island, China, March 22-24, 2004; pp.416-425. [abstract]

(2004) Chris Pike & I.Dan Melamed: An automatic filter for non-parallel texts. ACL 2004: 42nd annual meeting of the Association for Computational Linguistics: Proceedings of the interactive poster and demonstration sessions, 21-26 July 2004, Barcelona, Spain; 4pp. [PDF, 128KB]

(2004) Kiyotaka Uchimoto, Yujie Zhang, Kiyoshi Sudo, Masaki Murata, Satoshi Sekine, & Hitoshi Isahara: Multilingual aligned parallel treebank corpus reflecting contextual information and its applications. Coling 2004: Proceedings of the Workshop on Multilingual Linguistic Resources (MLR2004), August 28th 2004, University of Geneva, Switzerland; pp.63-70. [PDF, 162KB]

(2004) Jui-Feng Yeh, Chung-Hsien Wu, Ming-Jun Chen, & Liang-Chih Yu: Automated alignment and extraction of bilingual ontology for cross-language domain-specific applications. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 196KB]

(2004) Hao Zhang & Daniel Gildea: Syntax-based alignment: supervised or unsupervised? Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 87KB]

(2003) Eiji Aramaki, Sadao Kurohashi, Hideki Kashioka, & Hideki Tanaka: Word selection for EBMT based on monolingual similarity and translation confidence. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 1565KB]

(2003) Regina Barzilay & Noemie Elhadad: Sentence alignment for monolingual comparable corpora. EMNLP-2003: proceedings of the 2003 conference on Empirical Methods in Natural Language Processing, a meeting of SIGDAT, a special interest group of the ACL, held in conjunction with ACL-03, 11-12 July 2003, Sapporo, Japan; 8pp. [PDF, 94KB]

(2003) Daniel Gildea: Loosely tree-based alignment for machine translation. ACL-2003: 41st Annual meeting of the Association for Computational Linguistics, July 7-12, 2003, Sapporo, Japan. [PDF, 109KB]

(2003) Hideki Kashioka, Takehiko Maruyama, & Hideki Tanaka: Building a parallel corpus for monologues with clause alignment. MT Summit IX, New Orleans, USA, 23-27 September 2003; pp.216-223. [PDF, 379KB]

(2003) Chun-Jen Lee & Jason S. Chang: Acquisition of English-Chinese transliterated word pairs from parallel-aligned texts using a statistical machine transliteration model. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 344KB]

(2003) Joel Martin, Howard Johnson, Benoit Farley, & Anna Maclachlan: Aligning and using an English-Inuktitut parallel corpus. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 30KB]

(2003) I.Dan Melamed: Multitext grammars and synchronous parsers. HLT-NAACL 2003: conference combining Human Language Technology conference series and the North American Chapter of the Association for Computational Linguistics conference series, May 27 – June 1, 2003, Edmonton, Canada; 8pp. [PDF, 1126KB]

(2003) Francisco Nevado, Francisco Casacuberta, & Enrique Vidal: Parallel corpora segmentation using anchor words. 7th EAMT Workshop, "Improving machine translation through other language technology tools", 13 April 2003, Budapest, Hungary; pp. 33-40 [PDF, 382KB]

(2003) Stephen Nightingale & Hideki Tanaka: Comparing the sentence alignment yield from two news corpora using a dictionary-based alignment system. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 142KB]

(2003) Franz Josef Och & Hermann Ney: A systematic comparison of various statistical alignment models. Computational Linguistics 29 (1), pp.19-51 [PDF, 296KB]

(2003) Bo Pang, Kevin Knight, & Daniel Marcu: Syntax-based alignment of multiple translations: extracting paraphrases and generating new sentences. HLT-NAACL 2003: conference combining Human Language Technology conference series and the North American Chapter of the Association for Computational Linguistics conference series, May 27 – June 1, 2003, Edmonton, Canada; 8pp. [PDF, 91KB]

(2003) Emmanuel Planas & Osamu Furuse: Formalizing translation memory. In: Michael Carl & Andy Way (eds.) Recent advances in example-based machine translation (Dordrecht: Kluwer Academic Publishers, 2003), pp. 157-188.

(2003) Lee Schwartz, Takako Aikawa, & Chris Quirk: Disambiguation of English PP attachment using multilingual aligned data. MT Summit IX, New Orleans, USA, 23-27 September 2003; pp.330-337. [PDF, 98KB]

(2003) Michel Simard: Translation spotting for translation memories. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 120KB]

(2003) Michel Simard & Philippe Langlais: Statistical translation alignment with compositionality constraints. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 65KB]

(2003) Ashish Venugopal, Stephan Vogel, & Alex Waibel: Effective phrase translation extraction from aligned models. ACL-2003: 41st Annual meeting of the Association for Computational Linguistics, July 7-12, 2003, Sapporo, Japan. [PDF, 134KB]

(2003) Jian-Cheng Wu, Kevin C.Yeh, Thomas C.Chuang, Wen-Chi Shei, & Jason S.Chang: TotalRecall: a bilingual concordance for computer assisted translation and language learning. ACL-2003: 41st Annual meeting of the Association for Computational Linguistics, July 7-12, 2003, Sapporo, Japan. [PDF, 76KB]

(2003) Bing Zhao, Klaus Zechner, Stephan Vogel, & Alex Waibel: Efficient optimization for bilingual sentence alignment based on linear regression. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 93KB]

(2002) Mosleh H.Al-Adhaileh, Tang Enya Kong, & Zaharin Yusoff: A synchronization structure of SSTC and its applications in machine translation. Coling-2002 workshop "Machine translation in Asia", 1 September 2002, Taipei, Taiwan; 8pp. [PDF, 272KB]

(2002) Michael Barlow: ParaConc: concordance software for multilingual parallel corpora. LREC-2002: Third International Conference on Language Resources and Evaluation. Workshop: Language resources for translation work and research, Las Palmas Canary Islands, 27 May 2002; pp.20-24. [PDF, 97KB]

(2002) Luisa Bentivogli & Emanuele Pianta: Opportunistic semantic tagging. LREC-2002: Third International Conference on Language Resources and Evaluation. Proceedings, Las Palmas de Gran Canaria, Spain, 27 May – 2 June 2002; pp. 1401-1406. [PDF, 61KB]

(2002) Thomas C.Chuang, G.N.You, & Jason Chang: Adaptive bilingual sentence alignment. Machine translation: from research to real users: 5th conference of the Association for Machine Translation in the Americas, AMTA 2002, Tiburon, CA, October 2002; ed. Stephen D. Richardson (Berlin: Springer Verlag, 2002); pp. 21-30. [go to publisher details]

(2002) George Foster, Philippe Langlais, & Guy Lapalme: Text prediction with fuzzy alignment. Machine translation: from research to real users: 5th conference of the Association for Machine Translation in the Americas, AMTA 2002, Tiburon, CA, October 2002; ed. Stephen D. Richardson (Berlin: Springer Verlag, 2002); pp. 44-53. [go to publisher details]

(2002) Ismael García Varea, Franz J. Och, Hermann Ney & Francisco Casacuberta: Improving alignment quality in statistical machine translation using context-dependent maximum entropy models. Coling 2002, Taipei, Taiwan, 26-30 August 2002 [PDF, 208KB]

(2002) Ismael Garcia Varea, Franz J.Och, Hermann Ney, & Francisco Casacuberta: Efficient integration of maximum entropy lexicon models within the training of statistical alignment models. Machine translation: from research to real users: 5th conference of the Association for Machine Translation in the Americas, AMTA 2002, Tiburon, CA, October 2002; ed. Stephen D. Richardson (Berlin: Springer Verlag, 2002); pp. 54-63. [go to publisher details]

(2002) Mathieu Guidère: Towards a corpus-based machine translation for standard Arabic. Translation Journal 6 (1), January 2002; 12pp. [PDF, 173KB]

(2002) Kenji Imamura: Application of translation knowledge acquired by hierarchical phrase alignment for pattern-based MT. TMI-2002 conference, Keihanna, Japan, March 13-17, 2002. [PDF, 142KB]

(2002) Tz-Liang Kueng & Keh-Yih Su: A robust cross-style bilingual sentences alignment model. Coling 2002, Taipei, Taiwan, 26-30 August 2002 [PDF, 290KB]

(2002) Kenji Matsumoto & Hideki Tanaka: Automatic alignment of Japanese and English newspaper articles using an MT system and a bilingual company name dictionary. LREC-2002: Third International Conference on Language Resources and Evaluation. Proceedings, Las Palmas de Gran Canaria, Spain, 27 May – 2 June 2002; pp.480-484. [PDF, 252KB]

(2002) Robert C. Moore: Fast and accurate sentence alignment of bilingual corpora. Machine translation: from research to real users: 5th conference of the Association for Machine Translation in the Americas, AMTA 2002, Tiburon, CA, October 2002; ed. Stephen D. Richardson (Berlin: Springer Verlag, 2002); pp. 135-144. [go to publisher details]

(2002) R.Muñoz, R.Mitkov, M.Palomar, J.Peral, R.Evans, L.Moreno, C.Orasan, M.Saiz-Noeda, A.Ferrández, C.Barbu, P.Martínez-Barco, & A.Suárez: Bilingual alignment of anaphoric expressions. LREC-2002: Third International Conference on Language Resources and Evaluation. Proceedings, Las Palmas de Gran Canaria, Spain, 27 May – 2 June 2002; pp.2088-2093. [PDF, 76KB]

(2002) Noah A.Smith: From words to corpora: recognizing translation. EMNLP-2002: Proceedings of the 2002 conference on Empirical Methods in Natural Language Processing, July 2002, Philadelphia, USA; pp.95-102 [PDF, 303KB]

(2002) Le Sun, Song Xue, Weimin Qu, Xiaofeng Wang, & Yufang Sun: Constructing a large-scale Chinese-English parallel corpus. Coling-2002: Third Workshop on Asian Language resources and International Standardization, 31 August 2002, Taipei, Taiwan; 8pp. [PDF, 359KB]

(2002) Taro Watanabe, Kenji Imamura and Eiichiro Sumita: Statistical machine translation based on hierarchical phrase alignment. TMI-2002 conference, Keihanna, Japan, March 13-17, 2002; pp.188-198. [PDF, 165KB]

(2002) Bing Zhao & Stephan Vogel: Full-text story alignment models for Chinese-English bilingual news corpora. ICSLP 2002, Interspeech 2002: 7th International Conference on Spoken Language Processing, September 16-20, 2002, Denver, Colorado, USA; pp.517-520; abstract [PDF, 45KB]

(2001) Kirsty Macdonald: Improving automatic alignment for translation memory creation. Translating and the Computer 23: papers from the Aslib conference held on 29 & 30 November 2001 (London: Aslib, 2001); 11pp. [PDF, 114KB]

(2001) Arul Menezes & Stephen D. Richardson: A best-first alignment algorithm for automatic extraction of transfer mappings from bilingual corpora. MT Summit VIII, Santiago de Compostela, Spain, 18-22 September 2001. Workshop on Example-Based Machine Translation. [PDF, 72KB]

(2001) Arul Menezes & Stephen D.Richardson: A best-first alignment algorithm for automatic extraction of transfer mappings from bilingual corpora. ACL-EACL 2001 workshop "Data-driven machine translation", July 7, 2001, Toulouse, France; pp.39-46. [PDF, 84KB]

(2001) Philip Resnik: [review of] Parallel text processing: alignment and use of translation corpora [ed. by] Jean Véronis. Computational Linguistics 27 (4), pp.592-595. [PDF, 333KB]

(2001) António Ribeiro, Gaël Dias, Gabriel Lopes & João Mexia: Cognates alignment. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp.287-292. [PDF, 70KB]

(2001) Mark Stevenson: [review of] Jean Véronis: Parallel text processing (Kluwer). In: Machine Translation Review, issue 12: December 2001; pp.75-76.

(2000) Michael Carl: Extracting invertible translations from pre-aligned texts.  LREC-2000: Second International Conference on Language Resources and Evaluation. Workshop on Terminology Resources and Computation, Athens, Greece, 29 May 2000; 6pp. [PDF, 321KB]

(2000) Seonho Kim, Juntae Yoon, & Mansuk Song: Structural feature selection for English-Korean statistical machine translation. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July – 4 August 2000; pp. 439-445 [PDF, 635KB]

(2000) Fang Li & Wilhelm Weisweber: Bilingual lexicon extraction from Internet. LREC-2000: Second International Conference on Language Resources and Evaluation. Workshop on Terminology Resources and Computation, Athens, Greece, 29 May 2000; 7pp. [PDF, 59KB]

(2000) Hiroshi Masuichi, Raymond Flournoy, Stefan Kaufmann, & Stanley Peters: A bootstrapping method for extracting bilingual text pairs. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July – 4 August 2000; pp. 1066-1070 [PDF, 464KB]

(2000) Franz Josef Och & Hermann Ney: Improved statistical alignment models. ACL-2000: 38th Annual meeting of the Association for Computational Linguistics, Hong Kong, October 2000. [PDF, 199KB]

(2000) Franz Josef Och & Hermann Ney: A comparison of alignment models for statistical machine translation. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July – 4 August 2000; pp. 1086-1090 [PDF, 430KB]

(2000) António Ribeiro, Gabriel Lopes, & João Mexia: Using confidence bands for parallel texts alignment. ACL-2000: 38th Annual meeting of the Association for Computational Linguistics, Hong Kong, October 2000. [PDF, 260KB]

(2000) António Ribeiro, Gabriel Lopes, & João Mexia: A self-learning method of parallel texts alignment. Envisioning machine translation in the information future: 4th conference of the Association for Machine Translation in the Americas, AMTA 2000, Cuernavaca, Mexico, October 2000; ed. John S. White (Berlin: Springer Verlag, 2000); pp.30-39. [go to publisher details]

(2000) Sayori Shimohata: An empirical method for identifying and translating technical terminology. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July – 4 August 2000; pp. 782-788 [PDF, 513KB]

(2000) Noah A. Smith & Michael E. Jahr: Cairo: an alignment visualization tool. LREC-2000: Second International Conference on Language Resources and Evaluation. Proceedings, Athens, Greece, 31 May – 2 June 2000; pp. 549-551. [PDF, 45KB]

(2000) Ioannis Triantafyllou, Iason Demiros, Christos Malavazos, & Stelios Piperidis: An alignment architecture for translation memory bootstrapping. MT2000: machine translation and multilingual applications in the new millennium: international conference at the University of Exeter, 20-22 November 2000, organised by the British Computer Society. [London: BCS]; 8pp. [PDF, 1579KB]

(2000) S.Vogel & H.Ney: Construction of a hierarchical translation memory. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July – 4 August 2000; pp. 1131-1135 [PDF, 396KB]

(2000) Kaoru Yamamoto & Yuji Matsumoto: Acquisition of phrase-level bilingual correspondence using dependency structure. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July – 4 August 2000; pp. 933-939 [PDF, 592KB]

Ambiguity preservation [see also Disambiguation]

(2000) Kevin Knight & Irene Langkilde: Preserving ambiguities in generation via automata intersection. 17th National conference of the American Association for Artificial Intelligence (AAAI 2000), July 30 – August 3, 2000, Austin, Texas. [PDF, 149KB]

Analogies and analogical modelling [see also Example-based methods]

(2004) Yves Lepage: Lower and higher estimates of “true analogies” between sentences contained in a large multilingual corpus. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 999KB]

(2000) Christos Malavazos & Stelios Piperidis: Application of analogical modelling to example-based machine translation. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July – 4 August 2000; pp. 516-522 [PDF, 665KB]

Analysis see Parsing; Semantic analysis; Syntactic analysis

Anaphora resolution

(2004) R. Florian, H.Hassan, A. Ittycheriah, H.Jing, K.Kambhatla, X.Luo, N.Nicolov, & S.Roukos: A statistical model for multilingual entity detection and tracking. HLT-NAACL 2004: Human Language Technology conference and North American Chapter of the Association for Computational Linguistics annual meeting, May 2-7, 2004, The Park Plaza Hotel, Boston, USA; pp.1-8. [PDF, 132KB]

(2004) Agnès Tutin, Meriam Haddara, Ruslan Mitkov, & Constantin Orasan: Annotation of anaphoric expressions in an aligned bilingual corpus.  LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.267-270. [PDF, 524KB]

(2002) Teruko Mitamura, Eric Nyberg, Enrique Torrejon, Dave Svoboda, Annelen Brunner and Kathryn Baker: Pronominal anaphora resolution in the KANTOO multilingual machine translation system. TMI-2002 conference, Keihanna, Japan, March 13-17, 2002; pp. 115-129. [PDF, 112KB]

(2002) R.Muñoz, R.Mitkov, M.Palomar, J.Peral, R.Evans, L.Moreno, C.Orasan, M.Saiz-Noeda, A.Ferrández, C.Barbu, P.Martínez-Barco, & A.Suárez: Bilingual alignment of anaphoric expressions. LREC-2002: Third International Conference on Language Resources and Evaluation. Proceedings, Las Palmas de Gran Canaria, Spain, 27 May – 2 June 2002; pp.2088-2093. [PDF, 76KB]

(2001) Teruko Mitamura, Eric Nyberg, Enrique Torrejon, David Svoboda & Kathryn Baker: Pronominal anaphora resolution in KANTOO English-to-Spanish machine translation system. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp.235-239. [PDF, 171KB]

(2001) Shigeko Nariyama: Multiple argument ellipses resolution in Japanese. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp.241-245. [PDF, 167KB]

(2000) Catalina Barbu & Ruslan Mitkov: Evaluation environment for anaphora resolution. MT2000: machine translation and multilingual applications in the new millennium: international conference at the University of Exeter, 20-22 November 2000, organised by the British Computer Society. [London: BCS]; 8pp. [PDF, 1772KB]

(2000) David Farwell & Stephen Helmreich: An interlingual-based approach to reference resolution. NAACL-ANLP 2000 Workshop: Applied Interlinguas: Practical Applications of Interlingual Approaches to NLP, Seattle, May 2000; pp. 1-11 [PDF, 809KB]

(2000) R. Muñoz, M. Saiz-Noeda, A. Suárez, & M. Palomar: Semantic approach to bridging reference resolution. MT2000: machine translation and multilingual applications in the new millennium: international conference at the University of Exeter, 20-22 November 2000, organised by the British Computer Society. [London: BCS]; 8pp. [PDF, 1750KB]

(2000) Jesús Peral & Antonio Ferrández: An application of the interlingua system ISS for Spanish-English pronominal anaphora generation. NAACL-ANLP 2000 Workshop: Applied Interlinguas: Practical Applications of Interlingual Approaches to NLP, Seattle, May 2000; pp. 42-51 [PDF, 733KB]

(2000) Maximiliano Saiz-Noeda, Manuel Palomar, & David Farwell: NLP system oriented to anaphora resolution. MT2000: machine translation and multilingual applications in the new millennium: international conference at the University of Exeter, 20-22 November 2000, organised by the British Computer Society. [London: BCS]; 7pp. [PDF, 1564KB]

(2000) L. Sobha & B.N.Patnaik: VASISTH - an ellipsis resolution algorithm for Indian languages. MT2000: machine translation and multilingual applications in the new millennium: international conference at the University of Exeter, 20-22 November 2000, organised by the British Computer Society. [London: BCS]; 5pp. [PDF, 987KB]

(2000) Hristo Tanev & Ruslan Mitkov: LINGUA – a robust architecture for text processing and anaphora resolution in Bulgarian. MT2000: machine translation and multilingual applications in the new millennium: international conference at the University of Exeter, 20-22 November 2000, organised by the British Computer Society. [London: BCS]; 8pp. [PDF, 1780KB]

Annotation

(2004) Robert S.Belvin, Susanne Riehemann, & Kristin Precoda: A fine-grained evaluation method for speech-to-speech translation using concept annotations. LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.1427-1430. [PDF, 765KB]

(2004) Luisa Bentivogli, Pamela Forner, & Emanuele Pianta: Evaluating cross-language annotation transfer in the MultiSemCor corpus. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 66KB]

(2004) Martin Čmejrek, Jan Cuřín, Jiří Havelka, Jan Hajič, & Vladislav Kuboň: Prague Czech-English dependency treebank: syntactically annotated resources for machine translation. LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.1597-1600. [PDF, 292KB]

(2004) Martin Čmejrek, Jan Cuřín, & Jiří Havelka: Prague Czech-English dependency treebank: any hopes for a common annotation scheme? HLT/NAACL 2004: Frontiers in Corpus Annotation, Proceedings of the Workshop, Boston, Massachusetts, May 6, 2004; 8pp. [PDF, 230KB]

(2004) Stephen Helmreich, David Farwell, Florence Reeder, Keith Miller, Bonnie Dorr, Nizar Habash, Eduard Hovy, Lori Levin, Owen Rambow, & Advaith Siddharthan: Interlingual annotation of multilingual text corpora. HLT/NAACL 2004: Frontiers in Corpus Annotation, Proceedings of the Workshop, Boston, Massachusetts, May 6, 2004; 8pp. [PDF, 126KB]

(2004) Paul Kingsbury, Nianwen Xue, & Martha Palmer: Propbanking in parallel.  LREC-2004. Workshop, 25th May 2004: The amazing utility of parallel and comparable corpora; pp. 34-37. [PDF, 305KB]

(2004) Nadia Mana, Roldano Cattoni, Emanuele Pianta, Franca Rossi, Fabio Pianesi, & Susanne Burger: The Italian NESPOLE! Corpus: a multilingual database with interlingua annotation in tourism and medical domains. LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.1467-1470. [PDF, 753KB]

(2004) Florence Reeder, Bonnie Dorr, David Farwell, Nizar Habash, Stephen Helmreich, Eduard Hovy, Lori Levin, Teruko Mitamura, Keith Miller, Owen Rambow, & Advaith Siddharthan: Interlingual annotation for MT development. Machine translation: from real users to research: 6th conference of the Association for Machine Translation in the Americas, AMTA 2004, Washington, DC, September 28 – October 2, 2004; ed. Robert E.Frederking and Kathryn B.Taylor (Berlin: Springer Verlag, 2004); pp. 236-245. [go to publisher details]

(2004) Catarina Ribeiro, Ricardo Santos, Rui Pedro Chaves, & Palmira Marrafa: Semi-automatic UNL dictionary generation using WordNet.PT.  LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.279-282. [PDF, 344KB]

(2004) Agnès Tutin, Meriam Haddara, Ruslan Mitkov, & Constantin Orasan: Annotation of anaphoric expressions in an aligned bilingual corpus.  LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.267-270. [PDF, 524KB]

(2004) M. Vanni, C.R.Voss, & C. Tate: Ground truth, reference truth & “omniscient truth” – parallel phrases in parallel texts for MT evaluation.  LREC-2004. Workshop, 25th May 2004: The amazing utility of parallel and comparable corpora; pp. 10-13. [PDF, 352KB]

(2004) Fai Wong, Dong Cheng Hu, Yu Hang Mao, Ming Chui Dong: A flexible example annotation schema: translation corresponding tree representation. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 178KB]

(2004) Wong Fai, Hu Dong Cheng, Mao Yu Hang, Tang Chi Wai, & Dong Ming Chui: Application of translation corresponding tree (TCT) annotation schema in example-based machine translation.  LREC-2004. Workshop, 25th May 2004: The amazing utility of parallel and comparable corpora; pp. 42-45. [PDF, 474KB]

(2003) Hiroshi Kanayama & Hideo Watanabe: Multilingual translation via annotated hub language. MT Summit IX, New Orleans, USA, 23-27 September 2003; pp.202-207. [PDF, 167KB]

(2003) Lluís Màrquez, Adrià de Gispert, Xavier Carreras, & Lluís Padró: Low-cost named entity classification for Catalan: exploiting multilingual resources and unlabeled data.  ACL 2003 Workshop on Multilingual and Mixed-language Named Entity Recognition, July 12, 2003, Sapporo, Japan; 8pp. [PDF, 80KB]

(2003) Stephanie Strassel, Alexis Mitchell, & Shudong Huang: Multilingual resources for entity extraction. ACL 2003 Workshop on Multilingual and Mixed-language Named Entity Recognition, July 12, 2003, Sapporo, Japan; 8pp. [PDF, 138KB]

(2002) Mosleh H.Al-Adhaileh, Tang Enya Kong, & Zaharin Yusoff: A synchronization structure of SSTC and its applications in machine translation. Coling-2002 workshop "Machine translation in Asia", 1 September 2002, Taipei, Taiwan; 8pp. [PDF, 272KB]

(2002) Chang Baobao, Zhang Huarui, Yu Shiwen, & Kang Shiyong: Bilingual corpus construction and its management for Chinese-English machine translation. In: Chan Sin-wai (ed.) Translation and Information Technology (Hong Kong: Chinese University Press, 2002); pp.31-41.

(2002) Dien Dinh: Building a training corpus for word sense disambiguation in English-to-Vietnamese machine translation. Coling-2002 workshop "Machine translation in Asia", 1 September 2002, Taipei, Taiwan; 7pp. [PDF, 281KB]

(2002) Hideo Watanabe, Katashi Nagao, Michael C. McCord & Arendse Bernth: An annotation system for enhancing quality of natural language processing. Coling 2002, Taipei, Taiwan, 26-30 August 2002 [PDF, 202KB]

(2001) George Wilson, Inderjeet Mani, Beth Sundheim, & Lisa Ferro: A multilingual approach to annotating and extracting temporal information. ACL 2001 Workshop on Temporal and Spatial Information Processing, July 2001, Toulouse, France; 7pp. [PDF, 49KB]

Applications of MT see index of applications

Artificial languages

(2001) Marcos Franco Sabarís, José Luis Rojas Alonso, C. Dafonte & B. Arcay: Multilingual authoring through an artificial language. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp. 99-102. [PDF, 138KB]

Aspect

(2003) Anna Kupść: Two approaches to aspect assignment in an English-Polish machine translation system. 7th EAMT Workshop, "Improving machine translation through other language technology tools", 13 April 2003, Budapest, Hungary; pp. 17-24 [PDF, 297KB]

(2001) Masaki Murata, Kiyotaka Uchimoto, Qing Ma, & Hitoshi Isahara: Using a support-vector machine for Japanese-to-English translation of tense, aspect, and modality. ACL-EACL 2001 workshop "Data-driven machine translation", July 7, 2001, Toulouse, France; pp.111-119. [PDF, 230KB]

(2001) Cornelia Zelinsky-Wibbelt: Unifying tense, aspect and modality across languages. ACL 2001 Workshop on Temporal and Spatial Information Processing, July 2001, Toulouse, France; 8pp. [PDF, 61KB]

Bilingual corpora [see also Example-based methods, Multilingual corpora]

(2004) proceedings of  Workshop: The amazing utility of parallel and comparable corpora. LREC-2004: Fourth International Conference on Language Resources and Evaluation, Lisbon, Portugal, 25 May 2004. [PDF, 2226KB]

(2004) Michael Barlow: Parallel concordancing and translation. Translating and the Computer 26: proceedings of the Twenty-sixth International Conference on Translating and the Computer, 18-19 November 2004, London. (London: Aslib, 2004); 11pp. [PDF, 84KB]

(2004) Robert S.Belvin, Win May, Shrikanth Narayanan, Panayiotis Georgiou, & Shadi Ganjavi: Creation of a doctor-patient dialogue corpus using standardized patients. LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.187-190. [PDF, 480KB]

(2004) Indrajit Bhattacharya, Lise Getoor, & Yoshua Bengio: Unsupervised sense disambiguation using bilingual probabilistic models.  ACL 2004: 42nd annual meeting of the Association for Computational Linguistics: Proceedings of the conference, 21-26 July 2004, Barcelona, Spain; pp. 287-294. [PDF, 166KB]

(2004) Michael Carl, Ecaterina Rascu, & Johann Haller: Using weighted abduction to align term variant translations in bilingual texts. LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.1973-1976. [PDF, 294KB]

(2004) Chen Benfeng & Pascale Fung: Automatic construction of an English-Chinese bilingual FrameNet.  HLT-NAACL 2004: Human Language Technology conference and North American Chapter of the Association for Computational Linguistics annual meeting, May 2-7, 2004, The Park Plaza Hotel, Boston, USA – Short Papers; pp. 29-32. [PDF, 185KB]

(2004) Luisa Bentivogli, Pamela Forner, & Emanuele Pianta: Evaluating cross-language annotation transfer in the MultiSemCor corpus. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 66KB]

(2004) Thomas C.Chuang, Jian-Cheng Wu, Tracy Lin, Wen-chie Shei & Jason S.Chang: Bilingual sentence alignment based on punctuation statistics and lexicon. First International Joint Conference on Natural Language Processing, Hainan Island, China, March 22-24, 2004; pp.224-232. [abstract]

(2004) Jan Cuřin, Martin Čmejrek, Jiří Havelka & Vladislav Kuboň: Building a parallel bilingual syntactically annotated corpus. First International Joint Conference on Natural Language Processing, Hainan Island, China, March 22-24, 2004; pp.168-176. [abstract]

(2004) Yuan Ding & Martha Palmer: Automatic learning of parallel dependency treelet pairs. First International Joint Conference on Natural Language Processing, Hainan Island, China, March 22-24, 2004; pp.233-243. [abstract]

(2004) Sanae Fujita & Francis Bond: A method of creating new bilingual valency entries using alternations.  Coling 2004: Proceedings of the Workshop on Multilingual Linguistic Resources (MLR2004), August 28th 2004, University of Geneva, Switzerland; pp.47-54. [PDF, 176KB]

(2004) Pascale Fung & Percy Cheung: Multi-level bootstrapping for extracting parallel sentences from a quasi-comparable corpus.  Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 245KB]

(2004) Pascale Fung & Percy Cheung: Mining very-non-parallel corpora: parallel sentence and lexicon extraction via bootstrapping and EM. EMNLP-2004: Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, 25-26 July 2004, Barcelona, Spain; 8pp. [PDF, 276KB]

(2004) E. Gaussier, J.-M.Renders, I.Matveeva, C.Goutte, & H.Déjean: A geometric view on bilingual lexicon extraction from comparable corpora.  ACL 2004: 42nd annual meeting of the Association for Computational Linguistics: Proceedings of the conference, 21-26 July 2004, Barcelona, Spain; pp.526-533. [PDF, 122KB]

(2004) Tamás Grőbler, Gábor Hodász, & Balázs Kis: MetaMorpho TM: a rule-based translation corpus. LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.339-342. [PDF, 390KB]

(2004) Hiroyuki Kaji: Adapted seed lexicon and combined bidirectional similarity measures for translation equivalent extraction from comparable corpora; TMI-2004: proceedings of the Tenth Conference on Theoretical and Methodological Issues in Machine Translation, October 4-6, 2004, Baltimore, Maryland, USA; pp.115-124. [PDF, 378KB]

(2004) Mihoko Kitamura & Yuji Matsumoto: Practical translation pattern acquisition from combined language resources. First International Joint Conference on Natural Language Processing, Hainan Island, China, March 22-24, 2004; pp.244-253. [abstract]

(2004) Michael Kluck: Evaluation of cross-language information retrieval using the domain-specific GIRT data as parallel German-English corpus. LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.1343-1346. [PDF, 1533KB]

(2004) Jonas Kuhn: Experiments in parallel-text based grammar induction. ACL 2004: 42nd annual meeting of the Association for Computational Linguistics: Proceedings of the conference, 21-26 July 2004, Barcelona, Spain; pp.470-477. [PDF, 99KB]

(2004) Tadashi Kumano, Hideki Kashioka, Hideki Tanaka & Takahiro Fukusima: Acquiring bilingual named entity translations from content-aligned corpora. First International Joint Conference on Natural Language Processing, Hainan Island, China, March 22-24, 2004; pp.177-186. [abstract]

(2004) Alon Lavie, Katharina Probst, Erik Peterson, Stephan Vogel, Lori Levin, Ariadna Font-Llitjos, & Jaime Carbonell: A trainable transfer-based MT approach for languages with limited resources. 9th EAMT Workshop, "Broadening horizons of machine translation and its applications", 26-27 April 2004, Malta; pp. 116-123. [PDF, 265KB]

(2004) Hang Li & Cong Li: Word translation disambiguation using bilingual bootstrapping. Computational Linguistics 30 (1), pp. 1-22. [PDF, 2311KB]

(2004) Li Weigang, Liu Ting, Wang Zhen, & Li Sheng: Aligning bilingual corpora using sentences location information. [ACL-2004] Proceedings of the Third SIGHAN Workshop on Chinese Language Learning, 25 July 2004, Barcelona; 7pp. [PDF, 100KB]

(2004) Tracy Lin, Jian-Cheng Wu, & Jason S. Chang: Extraction of name and transliteration in monolingual and parallel corpora. Machine translation: from real users to research: 6th conference of the Association for Machine Translation in the Americas, AMTA 2004, Washington, DC, September 28 – October 2, 2004; ed. Robert E.Frederking and Kathryn B.Taylor (Berlin: Springer Verlag, 2004); pp. 177-186. [go to publisher details]

(2004) Dragos Stefan Munteanu, Alexander Fraser, & Daniel Marcu: Improved machine translation performance via parallel sentence extraction from comparable corpora.  HLT-NAACL 2004: Human Language Technology conference and North American Chapter of the Association for Computational Linguistics annual meeting, May 2-7, 2004, The Park Plaza Hotel, Boston, USA; pp. 265-272. [PDF, 1125KB]

(2004) Francisco Nevado, Francisco Casacuberta, & Josu Landa: Translation memories enrichment by statistical bilingual segmentation. LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.335-338. [PDF, 354KB]

(2004) Sylwia Ozdowska: Identifying correspondences between words: an approach based on a bilingual syntactic analysis of French/English parallel corpora. Coling 2004: Proceedings of the Workshop on Multilingual Linguistic Resources (MLR2004), August 28th 2004, University of Geneva, Switzerland; pp.55-62. [PDF, 213KB]

(2004) Chris Pike & I.Dan Melamed: An automatic filter for non-parallel texts. ACL 2004: 42nd annual meeting of the Association for Computational Linguistics: Proceedings of the interactive poster and demonstration sessions, 21-26 July 2004, Barcelona, Spain; 4pp. [PDF, 128KB]

(2004) Monica Rogati & Yiming Yang: Customizing parallel corpora at the document level. ACL 2004: 42nd annual meeting of the Association for Computational Linguistics: Proceedings of the interactive poster and demonstration sessions, 21-26 July 2004, Barcelona, Spain; 4pp. [PDF, 80KB]

(2004) Li Shao & Hwee Tou Ng: Mining new word translations from comparable corpora. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 305KB]

(2004) Keita Tsuji & Kyo Kageura: Extracting low-frequency translation pairs from Japanese-English bilingual corpora. Coling 2004, CompuTerm 2004: 3rd International Workshop on Computational Technology, Proceedings of the Workshop, 29th August 2004, Geneva, Switzerland; 8pp. [PDF, 178KB]

(2004) Dan Tufiş, Radu Ion, & Nancy Ide: Fine-grained word sense disambiguation based on parallel corpora, word alignment, word clustering and aligned wordnets. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 192KB]

(2004) Agnès Tutin, Meriam Haddara, Ruslan Mitkov, & Constantin Orasan: Annotation of anaphoric expressions in an aligned bilingual corpus.  LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.267-270. [PDF, 524KB]

(2004) Martin Volk & Yvonne Samuelsson: Bootstrapping parallel treebanks. Coling 2004: Proceedings of the 5th International Workshop on Linguistically Interpreted Corpora, August 29th 2004, Geneva, Switzerland; 7pp. [PDF, 192KB]

(2004) Jui-Feng Yeh, Chung-Hsien Wu, Ming-Jun Chen, & Liang-Chih Yu: Automated alignment and extraction of bilingual ontology for cross-language domain-specific applications. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 196KB]

(2003) Eiji Aramaki, Sadao Kurohashi, Hideki Kashioka, & Hideki Tanaka: Word selection for EBMT based on monolingual similarity and translation confidence. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 1565KB]

(2003) Chris Callison-Burch & Miles Osborne: Bootstrapping parallel corpora. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 83KB]

(2003) Katri A. Clodfelder: An LSA implementation against parallel texts in French and English. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 119KB]

(2003) Pernilla Danielsson: Units of meaning in translation – how to make real use of corpus evidence. Translating and the Computer 25: proceedings of the Twenty-fifth International Conference on Translating and the Computer, 20-21 November 2003, London. (London: Aslib, 2003); 15pp. [PDF, 71KB]

(2003) Dinh Dien & Hoang Kiem: POS-tagger for English Vietnamese bilingual corpus. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 202KB]

(2003) Yuan Ding, Daniel Gildea, & Martha Palmer: An algorithm for word-level alignment of parallel dependency trees. MT Summit IX, New Orleans, USA, 23-27 September 2003 [PDF, 242KB]

(2003) Takao Doi, Eiichiro Sumita, & Hirofumi Yamamoto: Adaptation using out-of-domain corpus within EBMT. HLT-NAACL 2003: conference combining Human Language Technology conference series and the North American Chapter of the Association for Computational Linguistics conference series, May 27 – June 1, 2003, Edmonton, Canada; 3pp. [PDF, 33KB]

(2003) Fei Huang, Stephan Vogel, & Alex Waibel: Automatic extraction of named entity translingual equivalence based on multi-feature cost minimization. ACL-2003 Workshop on Multilingual and Mixed-language Named Entity Recognition, July 12, 2003, Sapporo, Japan; 8pp. [PDF, 262KB]

(2003) Kenji Imamura, Eiichiro Sumita, & Yuji Matsumoto: Feedback cleaning of machine translation rules using automatic evaluation. ACL-2003: 41st Annual meeting of the Association for Computational Linguistics, July 7-12, 2003, Sapporo, Japan. [PDF, 62KB]

(2003) Kenji Imamura, Eiichiro Sumita & Yuji Matsumoto: Automatic construction of machine translation knowledge using translation literalness. EACL 2003: 10th Conference of the European Chapter of the Association for Computational Linguistics, April 12-17, 2003, Budapest, Hungary. Proceedings; pp.155-162 [PDF, 397KB]

(2003) Genichiro Kikui, Eiichiro Sumita, Toshiyuki Takezawa, & Seiichi Yamamoto: Creating corpora for speech-to-speech translation.  Eurospeech 2003 - Interspeech 2003 8th European  Conference on  Speech Communication and Technology, Geneva, Switzerland, September 1-4, 2003; pp.381-384; abstract [PDF, 34KB]

(2003) Tadashi Kumano, Hideki Kashioka, Hideki Tanaka, & Takahiro Fukusima: Construction and analysis of Japanese-English broadcast news corpus with named entity tags. ACL-2003 Workshop on Multilingual and Mixed-language Named Entity Recognition, July 12, 2003, Sapporo, Japan; 8pp. [PDF, 56KB]

(2003) Qing Ma, Yujie Zhang, Masaki Murata, & Hitoshi Isahara: Semantic maps for word alignment in bilingual parallel corpora. ACL-2003: Second SIGHAN Workshop on Chinese Language Processing, July 11-12, 2003, Sapporo, Japan; 6pp. [PDF, 199KB]

(2003) Sara Laviosa: Corpora and the translator. In: Harold Somers (ed.) Computers and translation: a translator’s guide (Amsterdam/Philadelphia: John Benjamins Publishing Company, 2003); pp.105-117.

(2003) Elliott Macklovitch: On the pleasure of being bi-textual; or My life in parallel text. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF from PPT, 301KB]

(2003) Lluís Màrquez, Adrià de Gispert, Xavier Carreras, & Lluís Padró: Low-cost named entity classification for Catalan: exploiting multilingual resources and unlabeled data.  ACL 2003 Workshop on Multilingual and Mixed-language Named Entity Recognition, July 12, 2003, Sapporo, Japan; 8pp. [PDF, 80KB]

(2003) Joel Martin, Howard Johnson, Benoit Farley, & Anna Maclachlan: Aligning and using an English-Inuktitut parallel corpus. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 30KB]

(2003) Robert C.Moore: Learning translations of named-entity phrases from parallel corpora. EACL 2003: 10th Conference of the European Chapter of the Association for Computational Linguistics, April 12-17, 2003, Budapest, Hungary. Proceedings; pp.259-266 [PDF, 377KB]

(2003) Francisco Nevado, Francisco Casacuberta, & Enrique Vidal: Parallel corpora segmentation using anchor words. 7th EAMT Workshop, "Improving machine translation through other language technology tools", 13 April 2003, Budapest, Hungary; pp. 33-40 [PDF, 382KB]

(2003) Hwee Tou Ng, Bin Wang, & Yee Seng Chan: Exploiting parallel texts for word sense disambiguation: an empirical study. ACL-2003: 41st Annual meeting of the Association for Computational Linguistics, July 7-12, 2003, Sapporo, Japan. [PDF, 364KB]

(2003) Stephen Nightingale & Hideki Tanaka: Comparing the sentence alignment yield from two news corpora using a dictionary-based alignment system. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 142KB]

(2003) Daniel Ortíz, Ismael García-Varea, Francisco Casacuberta, Antonio Lagarda, & Jorge González: On the use of statistical machine-translation techniques within a memory-based translation system (AMETRA). MT Summit IX, New Orleans, USA, 23-27 September 2003; pp.299-306. [PDF, 88KB]

(2003) Katharina Probst: Using ‘smart’ bilingual projection to feature-tag a monolingual dictionary. HLT-NAACL 2003: proceedings of Seventh Conference on Natural Language Learning, May 27 – June 1, 2003, Edmonton, Canada; 8pp. [PDF, 108KB]

(2003) Philip Resnik & Noah A. Smith: The web as a parallel corpus. Computational Linguistics 29 (3), pp.349-380. [PDF, 8130KB]

(2003) Fatiha Sadat, Masatoshi Yoshikawa, & Shunsuke Uemura: Bilingual terminology acquisition from comparable corpora and phrasal translation to cross-language information retrieval. ACL-2003: 41st Annual meeting of the Association for Computational Linguistics, July 7-12, 2003, Sapporo, Japan. [PDF, 61KB]

(2003) Fatiha Sadat, Masatoshi Yoshikawa, & Shunsuke Uemura: Learning bilingual translations from comparable corpora to cross-language information retrieval: hybrid statistics-based and linguistics-based approach. IRAL 2003: Sixth International Workshop on Information Retrieval with Asian Languages, July 7, 2003, Sapporo, Japan; 8pp. [PDF, 122KB]

(2003) Lee Schwartz, Takako Aikawa, & Chris Quirk: Disambiguation of English PP attachment using multilingual aligned data. MT Summit IX, New Orleans, USA, 23-27 September 2003; pp.330-337. [PDF, 98KB]

(2003) Toshiyuki Takezawa & Genichiro Kikui: Collecting machine-translation-aided bilingual dialogues for corpus-based speech translation. Eurospeech 2003 - Interspeech 2003 8th European  Conference on  Speech Communication and Technology, Geneva, Switzerland, September 1-4, 2003; pp.2757-2760; abstract [PDF, 34KB]

(2003) Takehito Utsuro, Takashi Horiuchi, Kohei Hino, Takeshi Hamamoto & Takeaki Nakayama: Effect of cross-language IR in bilingual lexical acquisition from comparable corpora. EACL 2003: 10th Conference of the European Chapter of the Association for Computational Linguistics, April 12-17, 2003, Budapest, Hungary. Proceedings; pp.355-362 [PDF, 734KB]

(2003) Stephan Vogel: Using noisy bilingual data for statistical machine translation. EACL 2003: 10th Conference of the European Chapter of the Association for Computational Linguistics, April 12-17, 2003, Budapest, Hungary. Proceedings; pp.175-178 [PDF, 194KB]

(2003) Dominic Widdows, Stanley Peters, Scott Cederberg, Chiu-Ki Chan, Diana Steffen, & Paul Buitelaar: Unsupervised monolingual and bilingual word-sense disambiguation of medical documents using UMLS. ACL 2003 Workshop on Natural Language Processing in Biomedicine, July 2003, Sapporo, Japan; 8pp. [PDF, 124KB]

(2003) Hua Wu & Ming Zhou: Optimizing synonym extraction using monolingual and bilingual resources. ACL 2003 International Workshop on Paraphrasing, July 11, 2003, Sapporo, Japan; 8pp. [PDF, 142KB]

(2003) Hua Wu & Ming Zhou: Synonymous collocation extraction using translation information. ACL-2003: 41st Annual meeting of the Association for Computational Linguistics, July 7-12, 2003, Sapporo, Japan. [PDF, 139KB]

(2003) Jian-Cheng Wu, Kevin C.Yeh, Thomas C.Chuang, Wen-Chi Shei, & Jason S.Chang: TotalRecall: a bilingual concordance for computer assisted translation and language learning. ACL-2003: 41st Annual meeting of the Association for Computational Linguistics, July 7-12, 2003, Sapporo, Japan. [PDF, 76KB]

(2003) Kaoru Yamamoto, Taku Kudo, Yuta Tsuboi, & Yuji Matsumoto: Learning sequence-to-sequence correspondences from parallel corpora via sequential pattern mining. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 101KB]

(2003) Bing Zhao & Stephan Vogel: Word alignment based on bilingual bracketing. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 56KB]

(2003) Bing Zhao, Klaus Zechner, Stephan Vogel, & Alex Waibel: Efficient optimization for bilingual sentence alignment based on linear regression. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 93KB]

(2002) proceedings of the Workshop: Language resources for translation work and research, LREC-2002: Third International Conference on Language Resources and Evaluation, Las Palmas Canary Islands, 27 May 2002. [PDF, 653KB]

(2002) Mosleh H.Al-Adhaileh, Tang Enya Kong, & Zaharin Yusoff: A synchronization structure of SSTC and its applications in machine translation; Coling-2002 workshop "Machine translation in Asia", 1 September 2002, Taipei, Taiwan; 8pp. [PDF, 272KB]

(2002) Yaser Al-Onaizan & Kevin Knight: Translating named entities using monolingual and bilingual resources; ACL-2002: 40th Annual meeting of the Association for Computational Linguistics, July 2002, Philadelphia, USA; pp.400-408 [PDF, 186KB]

(2002) Toni Badia, Gemma Boleda, Carme Colominas, Agnès González, Mireia Garmendia, Martí Quixal: BancTrad: a web interface for integrated access to parallel annotated corpora. LREC-2002: Third International Conference on Language Resources and Evaluation. Workshop: Language resources for translation work and research, Las Palmas Canary Islands, 27 May 2002; pp.15-19. [PDF, 106KB]

(2002) Michael Barlow: ParaConc: concordance software for multilingual parallel corpora. LREC-2002: Third International Conference on Language Resources and Evaluation. Workshop: Language resources for translation work and research, Las Palmas Canary Islands, 27 May 2002; pp.20-24. [PDF, 97KB]

(2002) Ralf Brown: Corpus-driven splitting of compound words. TMI-2002 conference, Keihanna, Japan, March 13-17, 2002. [PDF, 95KB]

(2002) Yves Champollion: Automated translation: the next frontier.  Translating and the Computer 24: proceedings from the Aslib conference held on 21-22 November 2002 (London: Aslib, 2002); 7pp. [PDF, 47KB]; presentation:  10pp. [PDF, 44KB]

(2002) Baobao Chang, Pernilla Danielsson, & Wolfgang Teubert: Extraction of translation unit from Chinese-English parallel corpora; Coling-2002: First SIGHAN Workshop on Chinese Language Processing, 1 September 2002, Taipei, Taiwan; 5pp. [PDF, 157KB]

(2002) Chang Baobao, Zhang Huarui, Yu Shiwen, & Kang Shiyong: Bilingual corpus construction and its management for Chinese-English machine translation. In: Chan Sin-wai (ed.) Translation and Information Technology (Hong Kong: Chinese University Press, 2002); pp.31-41.

(2002) Yun-Chuang Chiao & Pierre Zweigenbaum: Looking for candidate translational equivalents in specialized, comparable corpora. Coling 2002, Taipei, Taiwan, 26-30 August 2002 [PDF, 82KB]

(2002) Dien Dinh: Building a training corpus for word sense disambiguation in English-to-Vietnamese machine translation; Coling-2002 workshop "Machine translation in Asia", 1 September 2002, Taipei, Taiwan; 7pp. [PDF, 281KB]

(2002) Mathieu Guidère: Towards a corpus-based machine translation for standard Arabic. Translation Journal 6 (1), January 2002; 12pp. [PDF, 173KB]

(2002) Nancy Ide, Tomaz Erjavec, & Dan Tufis: Sense discrimination with parallel corpora. ACL-2002 SIGLEX/SENSEVAL workshop on Word Sense Disambiguation "Recent successes and future directions", 11 July 2002, Philadelphia, USA; pp. 54-60 [PDF, 387KB]

(2002) Takahiro Ikeda, Shinichi Ando, Kenji Satoh, Akitoshi Okumura, & Takao Watanabe: Automatic interpretation system integrating free-style sentence translation and parallel text based translation; ACL-2002 workshop "Speech-to-speech translation", 11 July 2002, Philadelphia, USA; pp. 85-92 [PDF, 240KB]

(2002) Kenji Imamura & Eiichiro Sumita: Bilingual corpus cleaning focusing on translation literality. ICSLP 2002, Interspeech 2002: 7th International Conference on Spoken Language Processing, September 16-20, 2002, Denver, Colorado, USA; pp.1713-1716; abstract [PDF, 47KB]

(2002) Shankar Kumar & William Byrne: Minimum Bayes-risk word alignments of bilingual texts. EMNLP-2002: Proceedings of the 2002 conference on Empirical Methods in Natural Language Processing, July 2002, Philadelphia, USA; pp.140-147 [PDF, 263KB]

(2002) Shigeki Matsubara, Akira Tagaki, Nobuo Kawaguchi, & Yasuyoshi Inagaki: Bilingual spoken monologue corpus for simultaneous machine interpretation research. LREC-2002: Third International Conference on Language Resources and Evaluation. Proceedings, Las Palmas de Gran Canaria, Spain, 27 May – 2 June 2002; pp.153-159. [PDF, 108KB]

(2002) Robert C. Moore: Fast and accurate sentence alignment of bilingual corpora. Machine translation: from research to real users: 5th conference of the Association for Machine Translation in the Americas, AMTA 2002, Tiburon, CA, October 2002; ed. Stephen D. Richardson (Berlin: Springer Verlag, 2002); pp. 135-144. [go to publisher details]

(2002) Masaki Murata, Masao Utiyama, Kiyotaka Uchimoto, Qing Ma and Hitoshi Isahara: Correction of errors in a modality corpus used for machine translation using machine-learning. TMI-2002 conference, Keihanna, Japan, March 13-17, 2002; pp.125-134. [PDF, 106KB]

(2002) Jessie Pinkham & Martine Smets: Modular MT with a learned bilingual dictionary: rapid deployment of a new language pair. Coling 2002, Taipei, Taiwan, 26-30 August 2002 [PDF, 59KB]

(2002) Andrei Popescu-Belis, Margaret King, & Houcine Benantar: Towards a corpus of corrected human translations. LREC-2002: Third International Conference on Language Resources and Evaluation. Workshop: Machine translation evaluation: human evaluators meet automated metrics, Las Palmas Canary Islands, 27 May 2002; pp.17-21. [PDF, 50KB]

(2002) Magnus Sahlgren, Preben Hansen & Jussi Karlgren: English-Japanese cross-lingual query expansion using random indexing of aligned bilingual text data. NTCIR Workshop 3: Proceedings of the Third NTCIR Workshop on Research in Information Retrieval, Automatic Text Summarization and Question Answering, October 8-10, 2002, Tokyo, Japan; 5pp. [PDF, 82KB]

(2002) Deepak Sharma, K.Vikram, Manav Ratan Mital, Amitabha Mukerjee & Achla M.Raina: An integrated discourse semantic model for bilingual corpora. ICUKL-2002: International Conference on Universal Knowledge and Language, 25th-29th November 2002, Goa, India, organised by UNDL Foundation and Indian Institute of Technology Bombay; 16pp. [PDF, 645KB]

(2002) Noah A.Smith: From words to corpora: recognizing translation. EMNLP-2002: Proceedings of the 2002 conference on Empirical Methods in Natural Language Processing, July 2002, Philadelphia, USA; pp.95-102 [PDF, 303KB]

(2002) Le Sun, Song Xue, Weimin Qu, Xiaofeng Wang, & Yufang Sun: Constructing a large-scale Chinese-English parallel corpus. Coling-2002: Third Workshop on Asian Language resources and International Standardization, 31 August 2002, Taipei, Taiwan; 8pp. [PDF, 359KB]

(2002) Toshiyuki Takezawa, Eiichiro Sumita, Fumiaki Sugaya, Hirofumi Yamamoto, & Seiichi Yamamoto: Toward a broad-coverage bilingual corpus for speech translation of travel conversations in the real world. LREC-2002: Third International Conference on Language Resources and Evaluation. Proceedings, Las Palmas de Gran Canaria, Spain, 27 May – 2 June 2002; pp.147-152. [PDF, 216KB]

(2002) Keita Tsuji, Beatrice Daille, & Kyo Kageura: Extracting French-Japanese word pairs from bilingual corpora based on transliteration rules. LREC-2002: Third International Conference on Language Resources and Evaluation. Proceedings, Las Palmas de Gran Canaria, Spain, 27 May – 2 June 2002; pp.499-502. [PDF, 120KB]

(2002) Taro Watanabe, Mitsuo Shimohata, & Eiichiro Sumita: Statistical machine translation on paraphrased corpora. LREC-2002: Third International Conference on Language Resources and Evaluation. Proceedings, Las Palmas de Gran Canaria, Spain, 27 May – 2 June 2002; pp.1954-1957. [PDF, 133KB]

(2002) Setsuo Yamada, Kenji Imamura and Kazuhide Yamamoto: Corpus-assisted expansion of manual MT knowledge. TMI-2002 conference, Keihanna, Japan, March 13-17, 2002; pp.198-208. [PDF, 111KB]

(2001) Michael Carl: Inducing probabilistic invertible translation grammars from aligned texts.  ACL-EACL 2001 Workshop on Computational Natural Language Learning (CoNLL), July 2001, Toulouse, France; 7pp. [PDF, 160KB]

(2001) Tatsuya Izuha: Machine translation using bilingual term entries extracted from parallel texts. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp. 169-173 [PDF, 200KB]

(2001) F.Jelinek, W.Byrne, S.Khudanpur, B.Hladká, H.Ney, F.J.Och, J.Cuřin, J.Psutka: Robust knowledge discovery from parallel speech and text sources. HLT-2001: Proceedings of the First International Conference on Human Language Technology Research, San Diego, CA, March 18-21, 2001; 3pp. [PDF, 44KB]

(2001) Philippe Langlais, George Foster & Guy Lapalme: Integrating bilingual lexicons in a probabilistic translation assistant. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp.197-202. [PDF, 95KB]

(2001) Benoit Lavoie, Michael White, & Tanya Korelsky: Inducing lexico-structural transfer rules from parsed bi-texts. ACL-EACL 2001 workshop "Data-driven machine translation", July 7, 2001, Toulouse, France; pp.17-24. [PDF, 60KB]

(2001) Gideon S.Mann & David Yarowsky: Multipath translation lexicon induction via bridge languages.  [NAACL-2001] Language Technologies 2001: the Second meeting of the North American Chapter of the Association for Computational Linguistics, Carnegie Mellon University, Pittsburgh, PA, 2-7 June 2001; 8pp. [PDF, 181KB]

(2001) Arul Menezes & Stephen D. Richardson: A best-first alignment algorithm for automatic extraction of transfer mappings from bilingual corpora. MT Summit VIII, Santiago de Compostela, Spain, 18-22 September 2001. Workshop on Example-Based Machine Translation. [PDF, 72KB]

(2001) Arul Menezes & Stephen D.Richardson: A best-first alignment algorithm for automatic extraction of transfer mappings from bilingual corpora. ACL-EACL 2001 workshop "Data-driven machine translation", July 7, 2001, Toulouse, France; pp.39-46. [PDF, 84KB]

(2001) Robert C.Moore: Towards a simple and accurate statistical approach to learning translation relationships among words. ACL-EACL 2001 workshop "Data-driven machine translation", July 7, 2001, Toulouse, France; pp.79-86. [PDF, 55KB]

(2001) Sonja Nießen & Hermann Ney: Toward hierarchical models for statistical machine translation of inflected languages. ACL-EACL 2001 workshop "Data-driven machine translation", July 7, 2001, Toulouse, France; pp.47-54. [PDF, 62KB]

(2001) Jessie Pinkham, Monica Corston-Oliver, Martine Smets & Martine Pettenaro: Rapid assembly of a large-scale French-English MT system. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp.277-281. [PDF, 180KB]

(2001) Philip Resnik: [review of] Parallel text processing: alignment and use of translation corpora [ed. by] Jean Véronis. Computational Linguistics 27 (4), pp.592-595. [PDF, 333KB]

(2001) Jean Senellart, Mirko Plitt, Christophe Bailly & Françoise Cardoso: Resource alignment for machine translation or implicit transfer. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp.317-323. [PDF, 112KB]

(2001) Kaoru Yamamoto, Yuji Matsumoto, & Mihoko Kitamura: A comparative study on translation units for bilingual lexicon extraction. ACL-EACL 2001 workshop "Data-driven machine translation", July 7, 2001, Toulouse, France; pp.87-94. [PDF, 91KB]

(2001) Keiji Yasuda, Fumiaki Sugaya, Toshiyuki Takezawa, Seiichi Yamamoto & Masuzo Yanagida: An automatic evaluation method of translation quality using translation answer candidates queried from a parallel corpus. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp.373-378. [PDF, 193KB]

(2001) Ying Zhang, Ralf Brown, Robert Frederking & Alon Lavie: Pre-processing of bilingual corpora for Mandarin-English EBMT.  MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp.385-390. [PDF, 2374KB]

(2000) Yaser Al-Onaizan, Ulrich Germann, Ulf Hermjakob, Kevin Knight, Philip Koehn, Daniel Marcu, & Kenji Yamada: Translating with scarce resources. 17th National conference of the American Association for Artificial Intelligence (AAAI 2000), July 30 – August 3, 2000, Austin, Texas. [PDF, 166KB]

(2000) Lars Borin: You'll take the high road and I'll take the low road: using a third language to improve bilingual word alignment. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July – 4 August 2000; pp. 97-103 [PDF, 534KB]

(2000) Michael Carl: Extracting invertible translations from pre-aligned texts.  LREC-2000: Second International Conference on Language Resources and Evaluation. Workshop on Terminology Resources and Computation, Athens, Greece, 29 May 2000; 6pp. [PDF, 321KB]

(2000) Arantza Casillas, Joseba Abaitua, & Raquel Martínez: Recycling annotated parallel corpora for bilingual document composition. Envisioning machine translation in the information future: 4th conference of the Association for Machine Translation in the Americas, AMTA 2000, Cuernavaca, Mexico, October 2000; ed. John S. White (Berlin: Springer Verlag, 2000); pp.117-126. [go to publisher details]

(2000) Arantza Casillas, Joseba Abaitua & Raquel Martínez: DTD-driven bilingual document generation. INLG’2000: Proceedings of the First International Conference on Natural Language Generation, Mitzpe Ramon, Israel, 12-16 June 2000; pp.32-38. [PDF, 586KB]

(2000) David Chambers: Automatic bilingual terminology extraction: a practical approach. Translating and the Computer 22: proceedings of the Twenty-second International Conference… 16-17 November 2000 (London: Aslib, 2000); 19pp.  [PDF, 383KB]

(2000) Jiang Chen & Jian-Yun Nie: Automatic construction of parallel English-Chinese corpus for cross-language information retrieval.  ANLP-NAACL-2000: proceedings of the Sixth conference on Applied Natural Language Processing and 1st Meeting of the North American Chapter of the Association for Computational Linguistics, April 29 – May 4, 2000, Seattle, Washington; pp.7-12. [PDF, 753KB]

(2000) Pernilla Danielsson & Katarina Mühlenbock: Small but efficient: the misconception of high-frequency words in Scandinavian translation. Envisioning machine translation in the information future: 4th conference of the Association for Machine Translation in the Americas, AMTA 2000, Cuernavaca, Mexico, October 2000; ed. John S. White (Berlin: Springer Verlag, 2000); pp.158-168. [go to publisher details]

(2000) Tomaž Erjavec: Slovene-English datasets for MT. Fifth EAMT Workshop "Harvesting existing resources", May 11 - 12, 2000, Ljubljana, Slovenia. [PDF, 104KB]

(2000) Zhao-Ming Gao: Automatic acquisition of a high-precision translation lexicon from parallel Chinese-English corpora. In: IAI Working Paper no.36, 2000; 12pp. [PDF, 193KB]

(2000) Jin-Xia Huang & Key-Sun Choi: Chinese-Korean word alignment based on linguistic comparison. ACL-2000: 38th Annual meeting of the Association for Computational Linguistics, Hong Kong, October 2000. [PDF, 208KB]

(2000) Zheng Jie & Mao Yuhang: A word sense disambiguation model using bilingual corpus. LREC-2000: Second International Conference on Language Resources and Evaluation. Proceedings, Athens, Greece, 31 May – 2 June 2000; pp. 517-521. [PDF, 85KB]

(2000) Olivier Kraif: Evaluation of statistical tools for automatic extraction of lexical correspondences between parallel texts. MT2000: machine translation and multilingual applications in the new millennium: international conference at the University of Exeter, 20-22 November 2000, organised by the British Computer Society. [London: BCS]; 8pp. [PDF, 1579KB]

(2000) Sun Le, Jin Youbing, Du Lin, & Sun Yufang: Automatic extraction of English-Chinese term lexicons from noisy bilingual corpora. LREC-2000: Second International Conference on Language Resources and Evaluation. Proceedings, Athens, Greece, 31 May – 2 June 2000; pp. 751-755. [PDF, 128KB]

(2000) Sun Le, Jin Youbing, Du Lin, & Sun Yufang: Word alignment of English-Chinese bilingual corpus based on chunks. EMNLP-2000: Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, 7-8 October 2000, Hong Kong; pp. 110-116 [PDF, 509KB]

(2000) Elliott Macklovitch: Two types of translation memory. Translating and the Computer 22: proceedings of the Twenty-second International Conference… 16-17 November 2000 (London: Aslib, 2000); 15pp.  [PDF, 198KB]

(2000) Bernardo Magnini & Carlo Strapparava: Experiments in word domain disambiguation for parallel texts. ACL 2000 Workshop on Word Senses and Multi-linguality, Hong Kong, October 2000; pp. 27-33 [PDF, 605KB]

(2000) Hiroshi Masuichi, Raymond Flournoy, Stefan Kaufmann, & Stanley Peters: A bootstrapping method for extracting bilingual text pairs. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July – 4 August 2000; pp. 1066-1070 [PDF, 464KB]

(2000) Hiroshi Nakagawa: Disambiguation of lexical translations based on bilingual comparable corpora. LREC-2000: Second International Conference on Language Resources and Evaluation. Workshop on Terminology Resources and Computation, Athens, Greece, 29 May 2000; 6pp. [PDF, 100KB]

(2000) Masumi Narita: A corpus-based English language assistant to Japanese software engineers. MT2000: machine translation and multilingual applications in the new millennium: international conference at the University of Exeter, 20-22 November 2000, organised by the British Computer Society. [London: BCS]; 8pp. [PDF, 1844KB]

(2000) Uwe Reinke: Towards a closer integration of termbases, translation memories and parallel corpora: a translation-oriented view. In: IAI Working Paper no.36, 2000; 14pp. [PDF, 216KB]

(2000) António Ribeiro, Gabriel Lopes, & João Mexia: Using confidence bands for parallel texts alignment. ACL-2000: 38th Annual meeting of the Association for Computational Linguistics, Hong Kong, October 2000. [PDF, 260KB]

(2000) Jörg Tiedemann: Extracting phrasal terms using bitext. LREC-2000: Second International Conference on Language Resources and Evaluation. Workshop on Terminology Resources and Computation, Athens, Greece, 29 May 2000; 7pp. [PDF, 160KB]

(2000) Tamás Váradi: Lexical and translation equivalence in parallel corpora. LREC-2000: Second International Conference on Language Resources and Evaluation. Proceedings, Athens, Greece, 31 May – 2 June 2000; pp. 539-543. [PDF, 756KB]

(2000) Špela Vintar: Extracting terminological collocations from a parallel corpus. Fifth EAMT Workshop "Harvesting existing resources", May 11 - 12, 2000, Ljubljana, Slovenia; pp.21-30. [PDF, 132KB]

(2000) S.Vogel & H.Ney: Construction of a hierarchical translation memory. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July – 4 August 2000; pp. 1131-1135 [PDF, 396KB]

(2000) Hideo Watanabe, Sadao Kurohashi, & Eiji Aramaki: Finding structural correspondences from bilingual parsed corpus for corpus-based translation. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July – 4 August 2000; pp. 906-912 [PDF, 701KB]

(2000) Kaoru Yamamoto & Yuji Matsumoto: Acquisition of phrase-level bilingual correspondence using dependency structure. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July – 4 August 2000; pp. 933-939 [PDF, 592KB]

Bi-text see Bilingual corpora

Book reviews

(2004) Walter Daelemans: [review of] Recent advances in example-based machine translation [eds.] Michael Carl and Andy Way. Computational Linguistics 30 (4), pp. 516-520. [PDF, 87KB]

(2004) Elliott Macklovitch: [review of] Readings in machine translation [eds.] Sergei Nirenburg, Harold Somers, Yorick Wilks. Computational Linguistics 30 (1), pp. 107-109. [PDF, 40KB]

(2002) Maghi King: Book review – Early years in machine translation [ed.] W.John Hutchins. In: MT News International no.30, March 2002. [PDF]

(2002) Warren J.Plath: Book review: Early years in machine translation: memoirs and biographies of pioneers, W.John Hutchins (ed.). Computational Linguistics 28 (4), pp. 554-559 [PDF, 57KB]

(2002) Graham Russell: [review of] Empirical methods for exploiting parallel texts, by I.Dan Melamed. In: MT News International no.31, Winter 2002. [PDF]

(2002) Derek Lewis: [review of] Chan Sin-wai (ed.) Translation and information technology (Chinese University Press, 2002).  In: Machine Translation Review, issue 13: December 2002; pp.40-41.

(2001) Philip Resnik: [review of] Parallel text processing: alignment and use of translation corpora [ed. by] Jean Véronis. Computational Linguistics 27 (4), pp.592-595. [PDF, 333KB]

(2001) Mark Stevenson: [review of] Jean Véronis: Parallel text processing (Kluwer). In: Machine Translation Review, issue 12: December 2001; pp.75-76.

Bridge language see Pivot language

Capitalization see Written forms

Case-based methods

(2003) Bróna Collins & Harold Somers: EBMT seen as case-based reasoning. In: Michael Carl & Andy Way (eds.) Recent advances in example-based machine translation (Dordrecht: Kluwer Academic Publishers, 2003), pp. 115-153.

(2003) Harold Somers & Hermione Lovel: Computer-based support for patients with limited English. 7th EAMT Workshop, "Improving machine translation through other language technology tools", 13 April 2003, Budapest, Hungary; pp. 41-49 [PDF, 198KB]

(2001) Harold Somers: EBMT seen as case-based reasoning. MT Summit VIII, Santiago de Compostela, Spain, 18-22 September 2001. Workshop on Example-Based Machine Translation. [PDF, 186KB]

Case grammar and case frames

(2004) Sanae Fujita & Francis Bond: An automatic method of creating valency entries using plain bilingual dictionaries. TMI-2004: proceedings of the Tenth Conference on Theoretical and Methodological Issues in Machine Translation, October 4-6, 2004, Baltimore, Maryland, USA; pp.55-64. [PDF, 147KB]

(2004) Sanae Fujita & Francis Bond: A method of creating new bilingual valency entries using alternations.  Coling 2004: Proceedings of the Workshop on Multilingual Linguistic Resources (MLR2004), August 28th 2004, University of Geneva, Switzerland; pp.47-54. [PDF, 176KB]

(2003) Ondřej Bojar: Towards automatic extraction of verb frames. Prague Bulletin of Mathematical Linguistics, no.79/80, 2003; 21pp. [PDF, 390KB]

(2003) Francis Bond & Sanae Fujita: Evaluation of a method of creating new valency entries. MT Summit IX, New Orleans, USA, 23-27 September 2003 [PDF, 108KB]

(2002) Timothy Baldwin & Francis Bond: Alternation-based lexicon reconstruction. TMI-2002 conference, Keihanna, Japan, March 13-17, 2002. [PDF, 187KB]

(2002) Hans C. Boas: Bilingual FrameNet dictionaries for machine translation. LREC-2002: Third International Conference on Language Resources and Evaluation. Proceedings, Las Palmas de Gran Canaria, Spain, 27 May – 2 June 2002; pp.1364-1371. [PDF, 123KB]

(2002) Sanae Fujita & Francis Bond: A method of adding new entries to a valency dictionary by exploiting existing lexical resources. TMI-2002 conference, Keihanna, Japan, March 13-17, 2002. [PDF, 179KB]

(2002) Sanae Fujita & Francis Bond: Extending the coverage of a valency dictionary. Coling-2002 workshop "Machine translation in Asia", 1 September 2002, Taipei, Taiwan; 7pp. [PDF, 180KB]

(2001) Nizar Habash & Bonnie Dorr: Large scale language independent generation using thematic hierarchies. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp. 139-144. [PDF, 256KB]

(2000) Uzzi Ornan & Israel Gutter: Machine translation by semantic features. MT2000: machine translation and multilingual applications in the new millennium: international conference at the University of Exeter, 20-22 November 2000, organised by the British Computer Society. [London: BCS]; 8pp. [PDF, 1471KB]

Character alignment

(2000) Byung-Ju Kang & Key-Sun Choi: English-Korean automatic transliteration/back-transliteration system and character alignment. ACL-2000: 38th Annual meeting of the Association for Computational Linguistics, Hong Kong. Demonstration notes, 3-6 October 2000; pp.17-18 [PDF, 117KB]

Closely related languages

(2004) Petr Homola & Vladislav Kuboň: A translation model for languages of accessing countries. 9th EAMT Workshop, "Broadening horizons of machine translation and its applications", 26-27 April 2004, Malta; pp. 90-97. [PDF, 231KB]

(2003) Jan Hajič, Petr Homola, & Vladislav Kuboň: A simple multilingual machine translation system. MT Summit IX, New Orleans, USA, 23-27 September 2003 [PDF, 204KB]

(2002) Kemal Altintas & Ilyas Cicekli: A machine translation system between a pair of closely related languages. Proceedings of the 17th International Symposium on Computer and Information Sciences (ISCIS 2002), Orlando, Florida, 2002 (CRC Press); pp.192-196.  [PDF, 204KB]

(2002) Lukasz Dębowski, Jan Hajič, & Vladislav Kuboň: Testing the limits – adding a new language to an MT system. Prague Bulletin of Mathematical Linguistics, no.78, 2002; 7pp. [PDF, 208KB]

(2001) Michael Paul: Translation knowledge recycling for related languages. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp.265-268. [PDF, 223KB]

(2001) Jesús Tomás & Francisco Casacuberta: Monotone statistical translation using word groups. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp.357-361. [PDF, 56KB]

(2000) Pernilla Danielsson & Katarina Mühlenbock: Small but efficient: the misconception of high-frequency words in Scandinavian translation. Envisioning machine translation in the information future: 4th conference of the Association for Machine Translation in the Americas, AMTA 2000, Cuernavaca, Mexico, October 2000; ed. John S. White (Berlin: Springer Verlag, 2000); pp.158-168. [go to publisher details]

(2000) Jan Hajič, Jan Hric, & Vladislav Kuboň: Machine translation of very close languages.  ANLP-NAACL-2000: proceedings of the Sixth conference on Applied Natural Language Processing and 1st Meeting of the North American Chapter of the Association for Computational Linguistics, April 29 – May 4, 2000, Seattle, Washington; pp.7-12. [PDF, 527KB]

(2000) Jan Hajič, Jan Hric, & Vladislav Kuboň: ČESÍLKO – an MT system for closely related languages. ACL-2000: 38th Annual meeting of the Association for Computational Linguistics, Hong Kong. Demonstration notes, 3-6 October 2000; pp.7-8 [PDF, 187KB]

(2000) Carole Tiberius & Lynne Cahill: Incorporating metaphonemes in a multilingual lexicon. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July – 4 August 2000; pp.1126-1130. [PDF, 405KB]

(2000) Lluís de Yzaguirre, Marta Ribas, Jordi Vivaldi, & M.Teresa Cabré: Some technical aspects about aligning near languages. LREC-2000: Second International Conference on Language Resources and Evaluation. Proceedings, Athens, Greece, 31 May – 2 June 2000; pp. 545-548. [PDF, 2276KB]

Cognates

(2004) Ana-Maria Barbu: Simple linguistic methods for improving a word alignment algorithm. JADT 2004: 7es Journées internationales d’Analyse statistique des Données Textuelles, 10-12 mars 2004, Louvain La Neuve, Belgique; pp.88-98. [PDF, 89KB]

(2003) Grzegorz Kondrak, Daniel Marcu, & Kevin Knight: Cognates can improve statistical translation models. HLT-NAACL 2003: conference combining Human Language Technology conference series and the North American Chapter of the Association for Computational Linguistics conference series, May 27 – June 1, 2003, Edmonton, Canada; 3pp. [PDF, 28KB]

(2003) Gina-Anne Levow: Issues in pre- and post-translation document expansion: untranslatable cognates and missegmented words. IRAL 2003: Sixth International Workshop on Information Retrieval with Asian Languages, July 7, 2003, Sapporo, Japan; 7pp. [PDF, 139KB]

(2001) Gideon S.Mann & David Yarowsky: Multipath translation lexicon induction via bridge languages.  [NAACL-2001] Language Technologies 2001: the Second meeting of the North American Chapter of the Association for Computational Linguistics, Carnegie Mellon University, Pittsburgh, PA, 2-7 June 2001; 8pp. [PDF, 181KB]

(2001) António Ribeiro, Gaël Dias, Gabriel Lopes & João Mexia: Cognates alignment. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp.287-292. [PDF, 70KB]

Collocations see Multiword expressions

Compositionality

(2004) Feifan Liu, Qianli Jin, Jun Zhao & Bo Xu: Bilingual chunk alignment based on interactional matching and probabilistic latent semantic indexing.  First International Joint Conference on Natural Language Processing, Hainan Island, China, March 22-24, 2004; pp.416-425. [abstract]

Compound nouns see Nouns and noun phrases

Concordances

(2004) Michael Barlow: Parallel concordancing and translation. Translating and the Computer 26: proceedings of the Twenty-sixth International Conference on Translating and the Computer, 18-19 November 2004, London. (London: Aslib, 2004); 11pp. [PDF, 84KB]

(2004) Lynne Bowker & Michael Barlow: Bilingual concordancers and translation memories: a comparative evaluation. Coling 2004: Proceedings of the Second International Workshop on Language Resources for Translation Work, Research and Training, 28th August, University of Geneva, Switzerland; pp.70-79. [PDF, 88KB]

(2003) Pernilla Danielsson: Units of meaning in translation – how to make real use of corpus evidence. Translating and the Computer 25: proceedings of the Twenty-fifth International Conference on Translating and the Computer, 20-21 November 2003, London. (London: Aslib, 2003); 15pp. [PDF, 71KB]

(2003) Jian-Cheng Wu, Kevin C.Yeh, Thomas C.Chuang, Wen-Chi Shei, & Jason S.Chang: TotalRecall: a bilingual concordance for computer assisted translation and language learning. ACL-2003: 41st Annual meeting of the Association for Computational Linguistics, July 7-12, 2003, Sapporo, Japan. [PDF, 76KB]

(2002) Michael Barlow: ParaConc: concordance software for multilingual parallel corpora. LREC-2002: Third International Conference on Language Resources and Evaluation. Workshop: Language resources for translation work and research, Las Palmas Canary Islands, 27 May 2002; pp.20-24. [PDF, 97KB]

(2000) Elliott Macklovitch: Two types of translation memory. Translating and the Computer 22: proceedings of the Twenty-second International Conference… 16-17 November 2000 (London: Aslib, 2000); 15pp.  [PDF, 198KB]

Confidence estimation

(2002) Eiichiro Sumita, Yasuhiro Akiba, & Kenji Imamura: Reliability measures for translation quality. ICSLP 2002, Interspeech 2002: 7th International Conference on Spoken Language Processing, September 16-20, 2002, Denver, Colorado, USA; pp.1893-1896; abstract [PDF, 43KB]

Confusion networks see Lattices and networks

Context-free grammars

(2003) Imelda P.de Castro: The Philippine Style Chat – instant messenger translation system. Translating and the Computer 25: proceedings of the Twenty-fifth International Conference on Translating and the Computer, 20-21 November 2003, London. (London: Aslib, 2003); 11pp. [PDF, 177KB]

(2000) Marc Dymetman & Frédéric Tendeau: Context-free grammar rewriting and the transfer of packed linguistic representations. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July – 4 August 2000; pp. 1016-1020 [PDF, 515KB]

Controlled languages see index of applications

Conversation see Dialogue

Corpora see Bilingual corpora, Monolingual corpora, Multilingual corpora

Corpus-based MT see Context-based methods, Example-based methods, Statistical MT

Correlational grammar

(2000) Ernst von Glasersfeld: Silvio Ceccato and the correlational grammar. In: W.John Hutchins (ed.) Early years in machine translation: memoirs and biographies of pioneers (Amsterdam/Philadelphia: John Benjamins Publishing Company, 2000); pp.313-324.

Data elicitation

(2002) Katharina Probst & Lori Levin: Challenges in automated elicitation of a controlled bilingual corpus. TMI-2002 conference, Keihanna, Japan, March 13-17, 2002; pp.157-167. [PDF, 173KB]

(2002) Fumiaki Sugaya, Toshiyuki Takezawa, Genichiro Kikui, & Seiichi Yamamoto: Proposal of a very-large-corpus acquisition method by cell-formed registration. LREC-2002: Third International Conference on Language Resources and Evaluation. Proceedings, Las Palmas de Gran Canaria, Spain, 27 May – 2 June 2002; pp.326-328. [PDF, 32KB]

(2001) Katharina Probst, Ralf Brown, Jaime Carbonell, Alon Lavie, Lori Levin & Erik Peterson: Design and implementation of controlled elicitation for machine translation of low-density languages. MT Summit VIII, Santiago de Compostela, Spain, 18-22 September 2001. Towards a Road Map for MT. [PDF, 90KB]

Data-oriented translation (see also Example-based methods)

(2004) Declan Groves, Mary Hearne, & Andy Way: Robust sub-sentential alignment of phrase-structure trees. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 94KB]

(2003) Mary Hearne & Andy Way: Seeing the wood for the trees: data-oriented translation. MT Summit IX, New Orleans, USA, 23-27 September 2003 [PDF, 88KB]

(2003) Andy Way: Translating with examples: the LFG-DOT models of translation. In: Michael Carl & Andy Way (eds.) Recent advances in example-based machine translation (Dordrecht: Kluwer Academic Publishers, 2003), pp.443-472.

(2001) Andy Way: Solving headswitching translation cases in LFG-DOT. Proceedings of the LFG conference, University of Hong Kong, 25-27 June 2001 [Stanford University, CSLI Publications, 2001]; pp.305-323. [PDF, 142KB]

(2000) Arjen Poutsma: Data-oriented translation. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July – 4 August 2000; pp. 635-641 [PDF, 552KB]

Decomposition see Segmentation

Dependency relations

(2004) Eneko Agirre, Aitziber Atutxa, Koldo Gojenola, & Kepa Sarasola: Exploring portability of syntactic information from English to Basque. LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.47-50. [PDF, 270KB]

(2004) Anthony Aue, Arul Menezes, Bob Moore, Chris Quirk & Eric Ringger: Statistical machine translation using labeled semantic dependency graphs. TMI-2004: proceedings of the Tenth Conference on Theoretical and Methodological Issues in Machine Translation, October 4-6, 2004, Baltimore, Maryland, USA; pp.125-134. [PDF, 127KB]

(2004) Martin Čmejrek, Jan Cuřín, Jiří Havelka, Jan Hajič, & Vladislav Kuboň: Prague Czech-English dependency treebank: syntactically annotated resources for machine translation. LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.1597-1600. [PDF, 292KB]

(2004) Martin Čmejrek, Jan Cuřín, & Jiří Havelka: Prague Czech-English dependency treebank: any hopes for a common annotation scheme? HLT/NAACL 2004: Frontiers in Corpus Annotation, Proceedings of the Workshop, Boston, Massachusetts, May 6, 2004; 8pp. [PDF, 230KB]

(2004) Jan Cuřín, Martin Čmejrek, Jiří Havelka & Vladislav Kuboň: Building a parallel bilingual syntactically annotated corpus. First International Joint Conference on Natural Language Processing, Hainan Island, China, March 22-24, 2004; pp.168-176. [abstract]

(2004) Yuan Ding & Martha Palmer: Synchronous dependency insertion grammars: a grammar formalism for syntax based statistical MT. Coling’04 workshop: Proceedings Recent Advances in Dependency Grammar, August 28, 2004, Geneva, Switzerland; 8pp. [PDF, 135KB]

(2004) Yuan Ding & Martha Palmer: Automatic learning of parallel dependency treelet pairs. First International Joint Conference on Natural Language Processing, Hainan Island, China, March 22-24, 2004; pp.233-243. [abstract]

(2004) Mark Dras & Chung-hye Han: Non-contiguous tree parsing. TMI-2004: proceedings of the Tenth Conference on Theoretical and Methodological Issues in Machine Translation, October 4-6, 2004, Baltimore, Maryland, USA; pp.30-39. [PDF, 82KB]

(2004) Daniel Gildea:  Dependencies vs constituents for tree-based alignment. EMNLP-2004: Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, 25-26 July 2004, Barcelona, Spain; 8pp. [PDF, 97KB]

(2004) Dekang Lin: A path-based transfer model for machine translation. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 6pp. [PDF, 201KB]

(2004) Yajuan Lü & Ming Zhou: Collocation translation acquisition using monolingual corpora. ACL 2004: 42nd annual meeting of the Association for Computational Linguistics: Proceedings of the conference, 21-26 July 2004, Barcelona, Spain; pp.167-174. [PDF, 95KB]

(2004) Sylwia Ozdowska: Identifying correspondences between words: an approach based on a bilingual syntactic analysis of French/English parallel corpora. Coling 2004: Proceedings of the Workshop on Multilingual Linguistic Resources (MLR2004), August 28th 2004, University of Geneva, Switzerland; pp.55-62. [PDF, 213KB]

(2004) Mark Pedersen, Domenyk Eades, Samir K.Amin & Lakshmi Prakash: Relative clauses in Hindi and Arabic: a Paninian dependency grammar analysis. Coling’04 workshop: Proceedings Recent Advances in Dependency Grammar, August 28, 2004, Geneva, Switzerland; 8pp. [PDF, 115KB]

(2004) Wei Wang & Ming Zhou: Improving word alignment models using structured monolingual corpora. EMNLP-2004: Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, 25-26 July 2004, Barcelona, Spain; 8pp. [PDF, 209KB]

(2003) Martin Čmejrek, Jan Cuřín & Jiří Havelka: Czech-English dependency-based machine translation. EACL 2003: 10th Conference of the European Chapter of the Association for Computational Linguistics, April 12-17, 2003, Budapest, Hungary. Proceedings; pp.83-90 [PDF, 404KB]

(2003) Yuan Ding, Daniel Gildea, & Martha Palmer: An algorithm for word-level alignment of parallel dependency trees. MT Summit IX, New Orleans, USA, 23-27 September 2003 [PDF, 242KB]

(2003) Jason Eisner: Learning non-isomorphic tree mappings for machine translation. ACL-2003: 41st Annual meeting of the Association for Computational Linguistics, July 7-12, 2003, Sapporo, Japan. [PDF, 89KB]

(2003) Hideo Watanabe, Sadao Kurohashi & Eiji Aramaki: Finding translation patterns from paired source and target dependency structures. In: Michael Carl & Andy Way (eds.) Recent advances in example-based machine translation (Dordrecht: Kluwer Academic Publishers, 2003), pp. 397-420.

(2002) Hiyan Alshawi & Shona Douglas: Speech translation performance of statistical dependency transduction and semantic similarity transduction. ACL-2002 workshop "Speech-to-speech translation", 11 July 2002, Philadelphia, USA; pp. 31-38 [PDF, 92KB]

(2002) Etienne Blanc, Gilles Sérasset, & WangJu Tsai: Structural and lexical transfer from an UNL graph to an equivalent natural language dependency tree. LREC-2002: Third International Conference on Language Resources and Evaluation. Workshop: First international workshop on UNL, other interlinguas and their applications, Las Palmas Canary Islands, 27 May 2002; pp.14-18. [PDF, 53KB]

(2002) Martin Čmejrek, Jan Cuřín & Jiří Havelka: Czech-English dependency-based machine translation: data preparation for the starting up experiments. Prague Bulletin of Mathematical Linguistics, no.78, 2002; 16pp. [PDF, 289KB]

(2002) Bonnie J.Dorr, Lisa Pearl, Rebecca Hwa, & Nizar Habash: DUSTer: a method for unraveling cross-language divergences for statistical word-level alignment. Machine translation: from research to real users: 5th conference of the Association for Machine Translation in the Americas, AMTA 2002, Tiburon, CA, October 2002; ed. Stephen D. Richardson (Berlin: Springer Verlag, 2002); pp. 31-43. [go to publisher details]

(2002) Hongzhao He & Jianfeng Gao: NTCIR-3 CLIR experiments at MSRA. NTCIR Workshop 3: Proceedings of the Third NTCIR Workshop on Research in Information Retrieval, Automatic Text Summarization and Question Answering, October 8-10, 2002, Tokyo, Japan; 7pp. [PDF, 196KB]

(2001) Taiichi Hashimoto, Kosuke Nishidate, Kiyoaki Shirai, Takenobu Tokunaga & Hozumi Tanaka: Decision lists for determining adjective dependency in Japanese. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp. 151-156. [PDF, 133KB]

(2000) Hiyan Alshawi, Srinivas Bangalore, & Shona Douglas: Learning dependency translation models as collections of finite-state head transducers. Computational Linguistics 26 (1), pp.45-60 [PDF, 978KB]

(2000) Chung-hye Han, Benoit Lavoie, Martha Palmer, Owen Rambow, Richard Kittredge, Tanya Korelsky, Nari Kim, & Myunghee Kim: Handling structural divergences and recovering dropped arguments in a Korean/English machine translation system. Envisioning machine translation in the information future: 4th conference of the Association for Machine Translation in the Americas, AMTA 2000, Cuernavaca, Mexico, October 2000; ed. John S. White (Berlin: Springer Verlag, 2000); pp.40-53. [go to publisher details]

(2000) Benoit Lavoie, Richard Kittredge, Tanya Korelsky, & Owen Rambow: A framework for MT and multilingual NLG systems based on uniform lexico-structural processing. ANLP-NAACL-2000: proceedings of the Sixth conference on Applied Natural Language Processing and 1st Meeting of the North American Chapter of the Association for Computational Linguistics, April 29 – May 4, 2000, Seattle, Washington; pp. 60-67. [PDF, 527KB]

(2000) Hideo Watanabe, Sadao Kurohashi, & Eiji Aramaki: Finding structural correspondences from bilingual parsed corpus for corpus-based translation. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July – 4 August 2000; pp. 906-912 [PDF, 701KB]

(2000) Kaoru Yamamoto & Yuji Matsumoto: Acquisition of phrase-level bilingual correspondence using dependency structure. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July – 4 August 2000; pp. 933-939 [PDF, 592KB]

Dialects and registers

(2004) Pu-Jen Cheng, Yi-Cheng Pan, Wen-Hsiang Lu, & Lee-Feng Chien: Creating multilingual translation lexicons with regional variations using web corpora.  ACL 2004: 42nd annual meeting of the Association for Computational Linguistics: Proceedings of the conference, 21-26 July 2004, Barcelona, Spain; pp. 534-541. [PDF, 283KB]

Dialogue

(2004) Robert S.Belvin, Win May, Shrikanth Narayanan, Panayiotis Georgiou, & Shadi Ganjavi: Creation of a doctor-patient dialogue corpus using standardized patients. LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.187-190. [PDF, 480KB]

(2004) Hervé Blanchon, Christian Boitet & Laurent Besacier: Spoken dialogue translation systems evaluation: results, new trends, problems and proposals. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2004], September 30 – October 1, 2004, Kyoto, Japan; pp. 95-102 [PDF, 360KB]

(2004) Hervé Blanchon: HLT modules scalability within the NESPOLE! Project. Interspeech 2004 – ICSLP 8th International  Conference on  Spoken Language Processing, Jeju Island, Korea, October 4-8, 2004; pp.1641-1644; abstract [PDF, 72KB]

(2004) Matthias Eck, Stephan Vogel, & Alex Waibel: Improving statistical machine translation in the medical domain using the Unified Medical Language System. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 81KB]

(2004) Tanja Schultz, Dorcas Alexander, Alan W.Black, Kay Peterson, Sinaporn Suebvisai, & Alex Waibel: A Thai speech translation system for medical dialogs. HLT-NAACL 2004: Human Language Technology conference and North American Chapter of the Association for Computational Linguistics annual meeting, May 2-7, 2004, The Park Plaza Hotel, Boston, USA; demonstration paper, 2pp. [PDF, 122KB]

(2004) Toshiyuki Takezawa & Genichiro Kikui: A comparative study on human communication behaviors and linguistic characteristics for speech-to-speech translation.  LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.1589-1592. [PDF, 585KB]

(2004) Voice ideas – Automating the Tower of Babel. [On research by Yuqing Gao at IBM]  Speech Technology, November/December 2004.

(2003) Toshiyuki Takezawa & Genichiro Kikui: Collecting machine-translation-aided bilingual dialogues for corpus-based speech translation. Eurospeech 2003 - Interspeech 2003 8th European  Conference on  Speech Communication and Technology, Geneva, Switzerland, September 1-4, 2003; pp.2757-2760; abstract [PDF, 34KB]

(2002) Sherri Condon & Keith Miller: Sharing problems and solutions for machine translation of spoken and written interaction; ACL-2002 workshop "Speech-to-speech translation", 11 July 2002, Philadelphia, USA; pp. 93-100 [PDF, 66KB]

(2002) Erica Costantini, Susanne Burger, & Fabio Pianesi: NESPOLE!’s multilingual and multimodal corpus. LREC-2002: Third International Conference on Language Resources and Evaluation. Proceedings, Las Palmas de Gran Canaria, Spain, 27 May – 2 June 2002; pp.165-170. [PDF, 84KB]

(2002) Takayuki Nakata, Takahiro Ikeda, Shinichi Ando, & Akitoshi Okumura: Topic detection based on dialogue history; ACL-2002 workshop "Speech-to-speech translation", 11 July 2002, Philadelphia, USA; pp.9-14 [PDF, 111KB]

(2002) Akitoshi Okumura, Ken-ichi Iso, Shin-ichi Doi, Kiyoshi Yamabana, Ken Hanazawa, & Takao Watanabe: An automatic speech translation system for travel conversation. HLT 2002: Human Language Technology Conference: proceedings of the second international conference on human language technology research, March 24-27, 2002, San Diego, California; ed. Mitchell Marcus [San Francisco, CA: Morgan Kaufmann for DARPA]; pp. 411-412. [PDF, 127KB]

(2002) Fumiaki Sugaya, Toshiyuki Takezawa, Genichiro Kikui, & Seiichi Yamamoto: Proposal of a very-large-corpus acquisition method by cell-formed registration. LREC-2002: Third International Conference on Language Resources and Evaluation. Proceedings, Las Palmas de Gran Canaria, Spain, 27 May – 2 June 2002; pp.326-328. [PDF, 32KB]

(2002) Chengqing Zong, Bo Xu, & Taiyi Huang: Interactive Chinese to English speech translation based on dialogue management; ACL-2002 workshop "Speech-to-speech translation", 11 July 2002, Philadelphia, USA; pp. 61-68 [PDF, 266KB]

(2001) Susanne Burger, Laurent Besacier, Paolo Coletti, Florian Metze, & Céline Morel: The Nespole! VoIP dialogue database. Eurospeech 2001 Scandinavia: 7th European Conference on Speech Communication and Technology, 2nd Interspeech Event, Aalborg, Denmark, September 3-7, 2001; pp.2043-2046 [PDF, 84KB]; abstract [PDF, 81KB]

(2001) H. Ney, F.J.Och, & S.Vogel: The RWTH system for statistical translation of spoken dialogues. HLT-2001: Proceedings of the First International Conference on Human Language Technology Research, San Diego, CA, March 18-21, 2001; 7pp. [PDF, 166KB]

(2000) Jan Alexandersson, Peter Poller, Michael Kipp, & Ralf Engel: Multilingual summary generation in a speech-to-speech translation system for multilingual dialogues. INLG'2000 Proceedings of the First International Conference on Natural Language Generation, Mitzpe Ramon, Israel, 12-16 June 2000; pp.148-155. [PDF, 670KB]

(2000) Raymond S.Flournoy & Christopher Callison-Burch: Reconciling user expectations and translation technology to create a useful real-world application. Translating and the Computer 22: proceedings of the Twenty-second International Conference… 16-17 November 2000 (London: Aslib, 2000); 8pp.  [PDF, 106KB]

(2000) Lori Levin, Donna Gates, Alon Lavie, Fabio Pianesi, Dorcas Wallace, Taro Watanabe, & Monika Woszczyna: Evaluation of a practical interlingua for task-oriented dialogue. NAACL-ANLP 2000 Workshop: Applied Interlinguas: Practical Applications of Interlingual Approaches to NLP, Seattle, May 2000; pp. 18-23 [PDF, 408KB]

(2000) Hua Wu, Taiyi Huang, Chengqing Zong, & Bo Xu: Chinese generation in a spoken dialogue translation system. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July - 4 August 2000; pp. 1141-1145 [PDF, 398KB]

(2000) Setsuo Yamada, Eiichiro Sumita, & Hideki Kashioka: Translation using information on dialogue participants.  ANLP-NAACL-2000: proceedings of the Sixth conference on Applied Natural Language Processing and 1st Meeting of the North American Chapter of the Association for Computational Linguistics, April 29 – May 4, 2000, Seattle, Washington; pp.37-43. [PDF, 582KB]

Dictionaries see Lexical resources

Direct translation approach

(2004) Haytham Alsharaf, Sylviane Cardey, & Peter Greenfield: French to Arabic machine translation: the specificity of language couples. 9th EAMT Workshop, "Broadening horizons of machine translation and its applications", 26-27 April 2004, Malta; pp.11-17 [PDF, 197KB]

(2003) William A.Whitaker & John F.White: Blitz Latin – a machine translator for Latin to English.  In: Machine Translation Review, issue 14: December 2003; pp.17-28.

(2002) Mikel L. Forcada: Using multilingual content on the web to build fast finite-state direct translation systems. TMI-2002 conference, Keihanna, Japan, March 13-17, 2002: Workshop. [PDF, 155KB]

(2001) Paul R.Bowden: Latin to English machine translation: a direct approach. In: Machine Translation Review, issue 12: December 2001; pp.6-12.

(2001) Juan Antonio Pérez-Ortiz and Mikel L.Forcada: Discovering machine translation strategies beyond word-for-word translation: a laboratory assignment. MT Summit VIII, Santiago de Compostela, Spain, 18-22 September 2001. Workshop on Teaching Machine Translation [PDF, 41KB]

Disambiguation (see also Word sense disambiguation)

(2004) Bogdan Babych & Anthony Hartley: Disambiguating translation strategies in MT using automatic named entity recognition. 9th EAMT Workshop, "Broadening horizons of machine translation and its applications", 26-27 April 2004, Malta; pp. 18-25. [PDF, 243KB]

(2004) Sanae Fujita & Francis Bond: An automatic method of creating valency entries using plain bilingual dictionaries; TMI-2004: proceedings of the Tenth Conference on Theoretical and Methodological Issues in Machine Translation, October 4-6, 2004, Baltimore, Maryland, USA; pp.55-64. [PDF, 147KB]

(2004) Felipe Sánchez-Martínez, Juan Antonio Pérez-Ortiz & Mikel L. Forcada: Cooperative unsupervised training of the part-of-speech taggers in a bidirectional machine translation system; TMI-2004: proceedings of the Tenth Conference on Theoretical and Methodological Issues in Machine Translation, October 4-6, 2004, Baltimore, Maryland, USA; pp.135-144. [PDF, 92KB]

(2004) Lee Schwartz & Takako Aikawa: Multilingual corpus-based approach to the resolution of English –ing.  LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.959-962. [PDF, 301KB]

(2003) Arendse Bernth & Michael C. McCord: A hybrid approach to deriving selectional preferences. MT Summit IX, New Orleans, USA, 23-27 September 2003 [PDF, 61KB]

(2003) Lee Schwartz, Takako Aikawa, & Chris Quirk: Disambiguation of English PP attachment using multilingual aligned data. MT Summit IX, New Orleans, USA, 23-27 September 2003; pp.330-337. [PDF, 98KB]

(2003) Eric Wehrli: Translation of words in context. MT Summit IX, New Orleans, USA, 23-27 September 2003; pp.502-505. [PDF, 203KB]

(2002) Eneko Agirre, Mikel Lersundi, & David Martinez: A multi-lingual approach to disambiguate prepositions and case suffixes; ACL-2002 SIGLEX/SENSEVAL workshop on Word Sense Disambiguation "Recent successes and future directions", July 2002, Philadelphia, USA; pp. 1-8 [PDF, 166KB]

(2002) Paisarn Charoenpornsawat, Virach Sornlertlamvanich & Thatsanee Charoenporn: Improving translation quality of rule-based machine translation; Coling-2002 workshop "Machine translation in Asia", 1 September 2002, Taipei, Taiwan; 6pp. [PDF, 355KB]

(2002) Mathieu Lafourcade & Christian Boitet: UNL lexical selection with conceptual vectors. LREC-2002: Third International Conference on Language Resources and Evaluation. Proceedings, Las Palmas de Gran Canaria, Spain, 27 May – 2 June 2002; pp.1958-1964. [PDF, 82KB]

(2002) Hyo-Kyung Lee: Classification approach to word selection in machine translation. Machine translation: from research to real users: 5th conference of the Association for Machine Translation in the Americas, AMTA 2002, Tiburon, CA, October 2002; ed. Stephen D. Richardson (Berlin: Springer Verlag, 2002); pp. 114-123. [go to publisher details]

(2002) Li Tangqiu, Yang Xiaofeng, Hong Qingyang, & Li Shaozhi: A hybrid method for syntactic and semantic structure disambiguation of Chinese. In: Chan Sin-wai (ed.) Translation and Information Technology (Hong Kong: Chinese University Press, 2002); pp.43-55.

(2002) Arul Menezes: Better contextual translation using machine learning. Machine translation: from research to real users: 5th conference of the Association for Machine Translation in the Americas, AMTA 2002, Tiburon, CA, October 2002; ed. Stephen D. Richardson (Berlin: Springer Verlag, 2002); pp. 124-134. [go to publisher details]

(2001) Ismael García-Varea, Franz J. Och, Hermann Ney, & Francisco Casacuberta: Refined lexicon models for statistical machine translation using a maximum entropy approach. ACL-EACL-2001: 39th Annual meeting [of the Association for Computational Linguistics] and 10th Conference of the European Chapter [of ACL], July 9th - 11th 2001, Toulouse, France; pp.204-211. [PDF, 72KB]

(2001) Keiko Horiguchi: Information-based machine translation. [NAACL-2001] Language Technologies 2001: the Second meeting of the North American Chapter of the Association for Computational Linguistics, Carnegie Mellon University, Pittsburgh, PA, 2-7 June 2001; 8pp. [PDF,  105KB]

(2001) K.L.Kwok: NTCIR-2 Chinese, cross language retrieval experiment using PIRCS. NTCIR Workshop 2: Proceedings of the Second NTCIR Workshop on Research in Chinese & Japanese Text retrieval and Text Summarization, March 7-9, 2001, Tokyo, Japan; 8pp. [PDF, 202KB]

(2000) Michael Schiehlen: Granularity effects in tense translation. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July - 4 August 2000; pp. 712-718 [PDF, 568KB]

Discourse representation

(2003) Kurt Eberle: Coordination, incorporation and dynamic semantic representation in transfer. Controlled language translation, EAMT-CLAW-03, Dublin City University, 15-17 May 2003 [PDF, 176KB]

(2003) Bassam Haddad & Mustafa Yaseen: Towards semantic composition of Arabic: a λ-DRT based approach. MT Summit IX -- workshop: Machine translation for semitic languages, New Orleans, USA, 23 September 2003 [PDF, 162KB]

(2002) Amitabha Mukerjee, M.Chandra Sekhar Reddy, Anand Sukhakar, Rajesh Rajsekhar, & Vikrant Kumar: UNL as intermediary for generating graphical simulations of NL discourse. ICUKL-2002: International Conference on Universal Knowledge and Language, 25th-29th November 2002, Goa, India, organised by UNDL Foundation and Indian Institute of  Technology Bombay; 12pp. [PDF,  660KB]

(2002) Deepak Sharma, K.Vikram, Manav Ratan Mital, Amitabha Mukerjee & Achla M.Raina: An integrated discourse semantic model for bilingual corpora. ICUKL-2002: International Conference on Universal Knowledge and Language, 25th-29th November 2002, Goa, India, organised by UNDL Foundation and Indian Institute of  Technology Bombay; 16pp. [PDF,  645KB]

(2001) Guillermo Barrutieta, Joseba Abaitua & Josuka Díaz: Gross-grained RST through XML metadata for multilingual document generation. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp. 39-42. [PDF, 151KB]

(2001) Kurt Eberle: FUDR-based MT, head switching and the lexicon. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp. 93-98. [PDF, 210KB]

(2000) Rodolfo Delmonte, Dario Bianchi, & Emanuele Pianta: Generating from a discourse model. MT2000: machine translation and multilingual applications in the new millennium: international conference at the University of Exeter, 20-22 November 2000, organised by the British Computer Society. [London: BCS]; 10pp. [PDF, 2145KB]

(2000) Stephen Helmreich & David Farwell: Text meaning representation as a basis for representation of text interpretation. Envisioning machine translation in the information future: 4th conference of the Association for Machine Translation in the Americas, AMTA 2000, Cuernavaca, Mexico, October 2000; ed. John S. White (Berlin: Springer Verlag, 2000); pp.179-188. [go to publisher details]

(2000) Mari Olsen, David Traum, Carol Van Ess-Dykema, Amy Weinberg, & Ron Dolan: Telicity as a cue to temporal and discourse structure in Chinese-English machine translation. NAACL-ANLP 2000 Workshop: Applied Interlinguas: Practical Applications of Interlingual Approaches to NLP, Seattle, May 2000; pp. 34-41 [PDF, 654KB]

Dissertations see Theses and dissertations

Divergence see Lexical transfer, Syntactic transfer

Domain identification

(2000) Bernardo Magnini & Carlo Strapparava: Experiments in word domain disambiguation for parallel texts. ACL 2000 Workshop on Word Senses and Multi-linguality, Hong Kong, October 2000; pp. 27-33 [PDF, 605KB]

Domain restriction, adaptation and specification

(2004) Victoria Arranz, Elisabet Comelles, David Farwell, Climent Nadeu, Jaume Padrell, Albert Febrer, Dorcas Alexander, & Kay Peterson: A speech-to-speech translation system for Catalan, Spanish, and English. Machine translation: from real users to research: 6th conference of the Association for Machine Translation in the Americas, AMTA 2004, Washington, DC, September 28 – October 2, 2004; ed. Robert E.Frederking and Kathryn B.Taylor (Berlin: Springer Verlag, 2004); pp. 7-16. [go to publisher details]

(2004) Necip Fazil Ayan, Bonnie J. Dorr, & Nizar Habash: Multi-Align: combining linguistic and statistical techniques to improve alignments for adaptable MT. Machine translation: from real users to research: 6th conference of the Association for Machine Translation in the Americas, AMTA 2004, Washington, DC, September 28 – October 2, 2004; ed. Robert E.Frederking and Kathryn B.Taylor (Berlin: Springer Verlag, 2004); pp. 17-26. [go to publisher details]

(2004) Luisa Bentivogli, Pamela Forner, Bernardo Magnini, & Emanuele Pianta: Revising the Wordnet Domains hierarchy: semantics, coverage and balancing. Coling 2004: Proceedings of the Workshop on Multilingual Linguistic Resources (MLR2004), August 28th 2004, University of Geneva, Switzerland; pp.101-108. [PDF, 80KB]

(2004) Hervé Blanchon: HLT modules scalability within the NESPOLE! Project. Interspeech 2004 – ICSLP 8th International  Conference on  Spoken Language Processing, Jeju Island, Korea, October 4-8, 2004; pp.1641-1644; abstract [PDF, 72KB]

(2004) Violetta Cavalli-Sforza, Jaime G. Carbonell, & Peter J. Jansen: Developing language resources for a transnational digital government system. LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.945-948. [PDF, 271KB]

(2004) Matthias Eck, Stephan Vogel, & Alex Waibel: Improving statistical machine translation in the medical domain using the Unified Medical Language System. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 81KB]

(2004) Matthias Eck, Stephan Vogel, & Alex Waibel: Language model adaptation for statistical machine translation based on information retrieval. LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.327-330. [PDF, 291KB]

(2004) Michael Kluck: Evaluation of cross-language information retrieval using the domain-specific GIRT data as parallel German-English corpus. LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.1343-1346. [PDF, 1533KB]

(2004) Kristin Precoda, Horacio Franco, Ascander Dost, Michael Frandsen, John Fry, Andreas Kathol, Colleen Ritchie, Susanne Riehemann, Dimitra Vergyri, & Jing Zheng: Limited-domain speech-to-speech translation between English and Pashto. HLT-NAACL 2004: Human Language Technology conference and North American Chapter of the Association for Computational Linguistics annual meeting, May 2-7, 2004, The Park Plaza Hotel, Boston, USA; demonstration paper, 4pp. [PDF, 57KB]

(2004) Manny Rayner, Pierrette Bouillon, Beth Ann Hockey, Nikos Chatzichrisafis & Marianne Starlander: Comparing rule-based and statistical approaches to speech understanding in a limited domain speech translation system; TMI-2004: proceedings of the Tenth Conference on Theoretical and Methodological Issues in Machine Translation, October 4-6, 2004, Baltimore, Maryland, USA; pp.21-29. [PDF, 86KB]

(2004) Stephen D. Richardson: Machine translation of online product support articles using data-driven MT system. Machine translation: from real users to research: 6th conference of the Association for Machine Translation in the Americas, AMTA 2004, Washington, DC, September 28 – October 2, 2004; ed. Robert E.Frederking and Kathryn B.Taylor (Berlin: Springer Verlag, 2004); pp. 246-251. [go to publisher details]

(2004) Tanja Schultz, Dorcas Alexander, Alan W.Black, Kay Peterson, Sinaporn Suebvisai, & Alex Waibel: A Thai speech translation system for medical dialogs. HLT-NAACL 2004: Human Language Technology conference and North American Chapter of the Association for Computational Linguistics annual meeting, May 2-7, 2004, The Park Plaza Hotel, Boston, USA; demonstration paper, 2pp. [PDF, 122KB]

(2004) Mitsuo Shimohata, Eiichiro Sumita & Yuji Matsumoto: Method for retrieving a similar sentence and its application to machine translation; TMI-2004: proceedings of the Tenth Conference on Theoretical and Methodological Issues in Machine Translation, October 4-6, 2004, Baltimore, Maryland, USA; pp.105-114. [PDF, 112KB]

(2004) Toshiyuki Takezawa & Genichiro Kikui: A comparative study on human communication behaviors and linguistic characteristics for speech-to-speech translation.  LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.1589-1592. [PDF, 585KB]

(2004) Hua Wu & Haifeng Wang: Improving domain-specific word alignment with a general bilingual corpus. Machine translation: from real users to research: 6th conference of the Association for Machine Translation in the Americas, AMTA 2004, Washington, DC, September 28 – October 2, 2004; ed. Robert E.Frederking and Kathryn B.Taylor (Berlin: Springer Verlag, 2004); pp. 262-271. [go to publisher details]

(2004) Wu Hua & Wang Haifeng: Improving domain-specific word alignment for computer assisted translation. ACL 2004: 42nd annual meeting of the Association for Computational Linguistics: Proceedings of the interactive poster and demonstration sessions, 21-26 July 2004, Barcelona, Spain; 4pp. [PDF, 221KB]

(2004) Jui-Feng Yeh, Chung-Hsien Wu, Ming-Jun Chen, & Liang-Chih Yu: Automated alignment and extraction of bilingual ontology for cross-language domain-specific applications. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 196KB]

(2003) Necip Fazil Ayan, Bonnie J.Dorr, & Okan Kolak: Evaluation techniques applied to domain tuning of MT lexicons. "Towards systematizing MT evaluation": a workshop on machine translation evaluation at the MT Summit IX, New Orleans, USA, 27 September 2003; pp.3-11. [PDF, 328KB]

(2003) Takao Doi, Eiichiro Sumita, & Hirofumi Yamamoto: Adaptation using out-of-domain corpus within EBMT. HLT-NAACL 2003: conference combining Human Language Technology conference series and the North American Chapter of the Association for Computational Linguistics conference series, May 27 – June 1, 2003, Edmonton, Canada; 3pp. [PDF, 33KB]

(2003) Lori Levin, Chad Langley, Alon Lavie, Donna Gates, Dorcas Wallace, & Kay Peterson: Domain specific speech acts for spoken language translation. Proceedings of 4th SIGdial Workshop on Discourse and Dialogue (SIGDIAL-2003), Sapporo, Japan, July 2003; 10pp. [PDF, 103KB]

(2003) Manny Rayner, Pierrette Bouillon, Vol Van Dalsem III, Hitoshi Isahara, Kyoko Kanzaki, & Beth Ann Hockey: A limited-domain English to Japanese medical speech translator built using REGULUS 2. ACL-2003: 41st Annual meeting of the Association for Computational Linguistics, July 7-12, 2003, Sapporo, Japan. [PDF, 41KB]

(2002) Chan Sin-wai: The making of TransRecipe: a translational approach to the machine translation of Chinese cookbooks. In: Chan Sin-wai (ed.) Translation and Information Technology (Hong Kong: Chinese University Press, 2002); pp.3-22.

(2002) Robert E. Frederking, Alan W. Black, Ralf D. Brown, Alexander Rudnicky, John Moody, & Eric Steinbrecher: Speech translation on a tight budget without enough data; ACL-2002 workshop "Speech-to-speech translation", 11 July 2002, Philadelphia, USA; pp. 77-84 [PDF, 156KB]

(2002) Adria de Gispert & José B.Mariño: Using x-grams for speech-to-speech translation. ICSLP 2002, Interspeech 2002: 7th International Conference on Spoken Language Processing, September 16-20, 2002, Denver, Colorado, USA; pp.1885-1888; abstract [PDF, 65KB]

(2002) Benoit Lavoie, Michael White, & Tanya Korelsky: Learning domain-specific transfer rules: an experiment with Korean to English translation; Coling-2002 workshop "Machine translation in Asia", 1 September 2002, Taipei, Taiwan; 7pp. [PDF, 43KB]

(2002) Solange Rossato, Hervé Blanchon, & Laurent Besacier: Speech-to-speech translation system evaluation: results for French for the NESPOLE! project first showcase. ICSLP 2002, Interspeech 2002: 7th International Conference on Spoken Language Processing, September 16-20, 2002, Denver, Colorado, USA; pp.1905-1908; abstract [PDF, 68KB]

(2002) Nestor Rychtyckyj: An assessment of machine translation for vehicle assembly process planning at Ford Motor Company. Machine translation: from research to real users: 5th conference of the Association for Machine Translation in the Americas, AMTA 2002, Tiburon, CA, October 2002; ed. Stephen D. Richardson (Berlin: Springer Verlag, 2002); pp. 207-215. [go to publisher details]

(2002) David Stallard, Premkumar Natarajan, Mohammed Noamany, Richard Schwartz, & John Makhoul: Design for a speech-to-speech translator for field use. ICSLP 2002, Interspeech 2002: 7th International Conference on Spoken Language Processing, September 16-20, 2002, Denver, Colorado, USA; pp.1705-1708; abstract [PDF, 46KB]

(2001) Srinivas Bangalore & Giuseppe Riccardi: A finite-state approach to machine translation. [NAACL-2001] Language Technologies 2001: the Second meeting of the North American Chapter of the Association for Computational Linguistics, Carnegie Mellon University, Pittsburgh, PA, 2-7 June 2001; 8pp. [PDF, 210KB]

(2001) Alon Lavie, Chad Langley, Alex Waibel, Fabio Pianesi, Gianni Lazzari, Paolo Coletti, Loredana Taddei, & Franco Balducci: Architecture and design considerations in NESPOLE!: a speech translation system for e-commerce applications. HLT-2001: Proceedings of the First International Conference on Human Language Technology Research, San Diego, CA, March 18-21, 2001; 4pp. [PDF, 51KB]

(2001) Alon Lavie, Lori Levin, Tanja Schultz, Chad Langley, Benjamin Han, Alicia Tribble, Donna Gates, Dorcas Wallace, & Kay Peterson: Domain portability in speech-to-speech translation. HLT-2001: Proceedings of the First International Conference on Human Language Technology Research, San Diego, CA, March 18-21, 2001; 5pp. [PDF, 58KB]

(2001) Jessie Pinkham & Monica Corston-Oliver: Adding domain specificity to an MT system. ACL-EACL 2001 workshop "Data-driven machine translation", July 7, 2001, Toulouse, France; pp.103-110. [PDF, 73KB]

(2000) Srinivas Bangalore & Giuseppe Riccardi: Stochastic finite-state models for spoken language machine translation; ANLP/NAACL 2000 workshop: Embedded machine translation systems, May 4, 2000, Seattle, Washington, [USA]; pp.52-59. [PDF, 638KB]

(2000) Alexander Franz, Keiko Horiguchi, Lei Duan, Doris Ecker, Eugene Koontz, & Kazami Uchida: An integrated architecture for example-based machine translation. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July - 4 August 2000; pp. 1031-1035 [PDF, 369KB]

(2000) Estela Saquete & Patricio Martínez-Barco: Grammar specification for the recognition of temporal expressions. MT2000: machine translation and multilingual applications in the new millennium: international conference at the University of Exeter, 20-22 November 2000, organised by the British Computer Society. [London: BCS]; 7pp. [PDF, 1397KB]

(2000) Sayori Shimohata: An empirical method for identifying and translating technical terminology. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July - 4 August 2000; pp. 782-788 [PDF, 513KB]

(2000) John Weisgerber, Jin Yang, & Pete Fisher: Pacific Rim portable translator. Envisioning machine translation in the information future: 4th conference of the Association for Machine Translation in the Americas, AMTA 2000, Cuernavaca, Mexico, October 2000; ed. John S. White (Berlin: Springer Verlag, 2000); pp.196-201. [go to publisher details]

Economics of MT see index of applications

Ellipsis

(2003) Shigeko Nariyama: Resolving 'incognito' ellipsis: treatment for constructions that disguise ellipsis. MT Summit IX, New Orleans, USA, 23-27 September 2003; pp.261-268. [PDF, 39KB]

(2002) Shigeko Nariyama: Grammar for ellipsis resolution in Japanese. TMI-2002 conference, Keihanna, Japan, March 13-17, 2002; pp.135-145. [PDF, 30KB]

(2001) Shigeko Nariyama: Multiple argument ellipses resolution in Japanese. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp.241-245. [PDF, 167KB]

(2000) L. Sobha & B.N.Patnaik: VASISTH - an ellipsis resolution algorithm for Indian languages. MT2000: machine translation and multilingual applications in the new millennium: international conference at the University of Exeter, 20-22 November 2000, organised by the British Computer Society. [London: BCS]; 5pp. [PDF, 987KB]

Embedded MT see index of applications

Emotional expression

(2004) Hiroshi Kanayama, Tetsuya Nasukawa, & Hideo Watanabe: Deeper sentiment analysis using machine translation technology. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 323KB]

Expectation maximization see Learning and training, Statistical analysis

Evaluation measures and metrics

(2004) Yasuhiro Akiba, Eiichiro Sumita, Hiromi Nakaiwa, Seiichi Yamamoto, & Hiroshi G. Okuno: Incremental methods to select test sentences for evaluating translation ability.  LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.2015-2018. [PDF, 271KB]

(2004) Yasuhiro Akiba, Eiichiro Sumita, Hiromi Nakaiwa, Seiichi Yamamoto, & Hiroshi Okuno: Using a mixture of n-best lists from multiple MT systems in rank-sum-based confidence measure for MT outputs. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 152KB]

(2004) Bogdan Babych & Anthony Hartley: Extending the BLEU MT evaluation method with frequency weightings.  ACL 2004: 42nd annual meeting of the Association for Computational Linguistics: Proceedings of the conference, 21-26 July 2004, Barcelona, Spain; pp. 621-628. [PDF, 132KB]

(2004) Bogdan Babych, Debbie Elliott, & Anthony Hartley: Extending MT evaluation tools with translation complexity metrics.  Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 68KB]

(2004) Bogdan Babych, Debbie Elliott, & Anthony Hartley: Calibrating resource-light automatic MT evaluation: a cheap approach to ranking MT systems by the usability of their output. LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.2031-2034. [PDF, 237KB]

(2004) Bogdan Babych & Anthony Hartley: Modelling legitimate translation variation for automatic evaluation of MT quality. LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.833-836. [PDF, 283KB]

(2004) Bogdan Babych: Weighted n-gram model for evaluating machine translation output. Proceedings of the 7th Annual Research Colloquium of the UK Special Interest Group for Computational Linguistics [CLUK-2004], Birmingham, UK, 6-7 January 2004; 8pp. [PDF, 174KB]

(2004) Robert S.Belvin, Susanne Riehemann, & Kristin Precoda: A fine-grained evaluation method for speech-to-speech translation using concept annotations. LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.1427-1430. [PDF, 765KB]

(2004) Hervé Blanchon, Christian Boitet, Francis Brunet-Manquat, Mutsuko Tomokiyo, Agnès Hamon, Vo Trung Hung & Youcef Bey: Towards fairer evaluations of commercial MT systems on basic travel expressions corpora. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2004], September 30 – October 1, 2004, Kyoto, Japan; pp. 21-26 [PDF, 563KB]

(2004) John Blatz, Erin Fitzgerald, George Foster, Simona Gandrabur, Cyril Goutte, Alex Kulesza, Alberto Sanchis, & Nicola Ueffing: Confidence estimation for machine translation. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; pp.315-321. [PDF, 1150KB]

(2004) Ray Clifford, Neil Granoien, Douglas Jones, Wade Shen, & Clifford Weinstein: The effect of text difficulty on machine translation performance: a pilot study with ILR-rated texts in Spanish, Farsi, Arabic, Russian and Korean. LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.343-346. [PDF, 643KB]

(2004) Debbie Elliott, Anthony Hartley, & Eric Atwell: A fluency error categorization scheme to guide automated machine translation evaluation. Machine translation: from real users to research: 6th conference of the Association for Machine Translation in the Americas, AMTA 2004, Washington, DC, September 28 – October 2, 2004; ed. Robert E.Frederking and Kathryn B.Taylor (Berlin: Springer Verlag, 2004); pp. 64-73. [go to publisher details]

(2004) Debbie Elliott, Eric Atwell & Anthony Hartley: Compiling and using a shareable parallel corpus for machine translation evaluation. LREC-2004. Workshop, 25th May 2004: The amazing utility of parallel and comparable corpora; pp. 18-21. [PDF, 286KB]

(2004) Andrew Finch, Yasuhiro Akiba, & Eiichiro Sumita: How does automatic machine translation evaluation correlate with human scoring as the number of reference translations increases? LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.2019-2022. [PDF, 284KB]

(2004) Richard Jelinek: Modern MT systems and the myth of human translation: real world status quo. Translating and the Computer 26: proceedings of the Twenty-sixth International Conference on Translating and the Computer, 18-19 November 2004, London. (London: Aslib, 2004); 15pp. [PDF, 149KB]

(2004) Michael Kluck: Evaluation of cross-language information retrieval using the domain-specific GIRT data as parallel German-English corpus. LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.1343-1346. [PDF, 1533KB]

(2004) Philipp Koehn: Statistical significance tests for machine translation evaluation. EMNLP-2004: Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, 25-26 July 2004, Barcelona, Spain; 8pp. [PDF, 137KB]

(2004) Alex Kulesza & Stuart M. Shieber: A learning approach to improving sentence-level MT evaluation; TMI-2004: proceedings of the Tenth Conference on Theoretical and Methodological Issues in Machine Translation, October 4-6, 2004, Baltimore, Maryland, USA; pp.75-84. [PDF, 108KB]

(2004) Alon Lavie, Kenji Sagae, & Shyamsundar Jayaraman: The significance of recall in automatic metrics for MT evaluation. Machine translation: from real users to research: 6th conference of the Association for Machine Translation in the Americas, AMTA 2004, Washington, DC, September 28 – October 2, 2004; ed. Robert E.Frederking and Kathryn B.Taylor (Berlin: Springer Verlag, 2004); pp. 134-143. [go to publisher details]

(2004) Chin-Yew Lin & Franz Josef Och: ORANGE: a method for evaluating automatic evaluation metrics for machine translation. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 211KB]

(2004) Chin-Yew Lin & Franz Josef Och: Automatic evaluation of machine translation quality using longest common subsequence and skip-bigram statistics.  ACL 2004: 42nd annual meeting of the Association for Computational Linguistics: Proceedings of the conference, 21-26 July 2004, Barcelona, Spain; pp.605-612. [PDF, 167KB]

(2004) Widad Mustafa El Hadi, Marianne Dabbadie, Ismail Timimi, Martin Rajman, Philippe Langlais, Anthony Hartley, & Andrei Popescu-Belis: Work-in-progress project report: CESTA – machine translation evaluation campaign. Coling 2004: Proceedings of the Second International Workshop on Language Resources for Translation Work, Research and Training, 28th August, University of Geneva, Switzerland; pp. 16-25. [PDF, 151KB]

(2004) Franz Josef Och, Daniel Gildea, Sanjeev Khudanpur, Anoop Sarkar, Kenji Yamada, Alex Fraser, Shankar Kumar, Libin Shen, David Smith, Katherine Eng, Viren Jain, Zhen Jin, & Dragomir Radev: A smorgasbord of features for statistical machine translation.  HLT-NAACL 2004: Human Language Technology conference and North American Chapter of the Association for Computational Linguistics annual meeting, May 2-7, 2004, The Park Plaza Hotel, Boston, USA; pp. 161-168. [PDF, 192KB]

(2004) Carol Peters, Martin Braschler, Khalid Choukri, Julio Gonzalo, & Michael Kluck: The future of evaluation for cross-language information retrieval systems. LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.841-844. [PDF, 266KB]

(2004) Christopher B. Quirk: Training a sentence-level machine translation confidence measure.  LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.825-828. [PDF, 243KB]

(2004) Florence Reeder: Investigation of intelligibility judgments. Machine translation: from real users to research: 6th conference of the Association for Machine Translation in the Americas, AMTA 2004, Washington, DC, September 28 – October 2, 2004; ed. Robert E.Frederking and Kathryn B.Taylor (Berlin: Springer Verlag, 2004); pp. 227-235. [go to publisher details]

(2004) Diana Santos, Belinda Mala, & Luís Sarmento: Gathering empirical data to evaluate MT from English to Portuguese.  LREC-2004. Workshop, 25th May 2004: The amazing utility of parallel and comparable corpora; pp. 14-17. [PDF, 317KB]

(2004) Libin Shen, Anoop Sarkar, & Franz Josef Och: Discriminative reranking for machine translation.  HLT-NAACL 2004: Human Language Technology conference and North American Chapter of the Association for Computational Linguistics annual meeting, May 2-7, 2004, The Park Plaza Hotel, Boston, USA; pp.177-184. [PDF, 106KB]

(2004) Radu Soricut & Eric Brill: A unified framework for automatic evaluation using n-gram cooccurrence statistics.  ACL 2004: 42nd annual meeting of the Association for Computational Linguistics: Proceedings of the conference, 21-26 July 2004, Barcelona, Spain; pp.613-620. [PDF, 119KB]

(2004) Aree Teeraparbseree: Qualitative evaluation of automatically calculated acception based MLDB.  Coling 2004: Proceedings of the Workshop on Multilingual Linguistic Resources (MLR2004), August 28th 2004, University of Geneva, Switzerland; pp.23-30. [PDF, 261KB]

(2004) M. Vanni, C.R.Voss, & C. Tate: Ground truth, reference truth & “omniscient truth” – parallel phrases in parallel texts for MT evaluation.  LREC-2004. Workshop, 25th May 2004: The amazing utility of parallel and comparable corpora; pp. 10-13. [PDF, 352KB]

(2004) Keiji Yasuda, Fumiaki Sugaya, Eiichiro Sumita, Toshiyuki Takezawa, Genichiro Kikui, & Seiichi Yamamoto: Automatic measuring of English language proficiency using MT evaluation technology. Coling 2004: Workshop eLearning for Computational Linguistics and Computational Linguistics for eLearning, Geneva, 28 August 2004; 8pp. [PDF, 762KB]

(2004) Ying Zhang & Stephan Vogel: Measuring confidence intervals for the machine translation evaluation metrics; TMI-2004: proceedings of the Tenth Conference on Theoretical and Methodological Issues in Machine Translation, October 4-6, 2004, Baltimore, Maryland, USA; pp.85-94. [PDF, 348KB]

(2004) Ying Zhang, Stephan Vogel & Alex Waibel: Interpreting BLEU/NIST scores: how much improvement do we need to have a better system?  LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.2051-2054. [PDF, 361KB]

(2004) The 2004 NIST machine translation evaluation plan (MT-04). [NIST, 2004]; 3pp. [PDF, 125KB]

(2003) proceedings of the workshop Towards systematizing MT evaluation at the MT Summit IX, New Orleans, USA, 27 September 2003. [HTML]

(2003) Yasuhiro Akiba, Eiichiro Sumita, Hiromi Nakaiwa, Seiichi Yamamoto & Hiroshi G. Okuno: Experimental comparison of MT evaluation methods: RED vs. BLEU. MT Summit IX, New Orleans, USA, 23-27 September 2003 [PDF, 96KB]

(2003) Andreea Calude: Machine translation of various text genres. [Unpublished] Presented at 7th Language and Society Conference of the New Zealand Linguistic Society, November 2002, Hamilton, New Zealand. 12pp. [PDF, 206KB]

(2003) Michael Carl & Sisay Fissaha: Phrase-based evaluation of word-to-word alignments. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 253KB]

(2003) Nelson Correa: A fine-grained evaluation framework for machine translation system development. MT Summit IX, New Orleans, USA, 23-27 September 2003 [PDF, 60KB]

(2003) Deborah Coughlin: Correlating automated and human assessments of machine translation quality. MT Summit IX, New Orleans, USA, 23-27 September 2003 [PDF, 88KB]

(2003) Christopher Culy & Susanne Z. Riehemann: The limits of n-gram translation evaluation metrics. MT Summit IX, New Orleans, USA, 23-27 September 2003 [PDF, 338KB]

(2003) Marcello Federico: Evaluation frameworks for speech translation technologies. Eurospeech 2003 - Interspeech 2003 8th European Conference on Speech Communication and Technology, Geneva, Switzerland, September 1-4, 2003; pp.377-380; abstract [PDF, 33KB]

(2003) Kenji Imamura, Eiichiro Sumita, & Yuji Matsumoto: Feedback cleaning of machine translation rules using automatic evaluation. ACL-2003: 41st Annual meeting of the Association for Computational Linguistics, July 7-12, 2003, Sapporo, Japan. [PDF, 62KB]

(2003) Kanayama Hiroshi: Paraphrasing rules for automatic evaluation of translation into Japanese. ACL 2003 International Workshop on Paraphrasing, July 11, 2003, Sapporo, Japan; 6pp. [PDF, 59KB]

(2003) Margaret King, Andrei Popescu-Belis, & Eduard Hovy: FEMTI: creating and using a framework for MT evaluation. MT Summit IX, New Orleans, USA, 23-27 September 2003; pp.224-231. [PDF, 163KB]

(2003) Gregor Leusch, Nicola Ueffing, & Hermann Ney: A novel string-to-string distance measure with applications to machine translation evaluation. MT Summit IX, New Orleans, USA, 23-27 September 2003; pp.240-247. [PDF, 104KB]

(2003) I.Dan Melamed, Ryan Green, & Joseph P.Turian: Precision and recall of machine translation. HLT-NAACL 2003: conference combining Human Language Technology conference series and the North American Chapter of the Association for Computational Linguistics conference series, May 27 – June 1, 2003, Edmonton, Canada; 3pp. [PDF, 153KB]

(2003) Rada Mihalcea & Ted Pedersen: An evaluation exercise for word alignment. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 131KB]

(2003) Andrei Popescu-Belis: An experiment in comparative evaluation: humans vs. computers. MT Summit IX, New Orleans, USA, 23-27 September 2003; pp.307-314. [PDF, 71KB]

(2003) Jesús Tomás, Josep Àngel Mas, & Francisco Casacuberta: A quantitative method for machine translation evaluation. EACL 2003: Proceedings of the Workshop on Evaluation Initiatives in Natural Language Processing: are evaluation methods, metrics and resources reusable? April 14th 2003, Budapest, Hungary; pp.27-34. [PDF, 6570KB]

(2003) Joseph P. Turian, Luke Shen, & I.Dan Melamed: Evaluation of machine translation and its evaluation. MT Summit IX, New Orleans, USA, 23-27 September 2003; pp.386-393. [PDF, 71KB]

(2003) John S. White: How to evaluate machine translation. In: Harold Somers (ed.) Computers and translation: a translator’s guide (Amsterdam/Philadelphia: John Benjamins Publishing Company, 2003); pp.211-244.

(2003) The 2003 NIST machine translation evaluation plan (MT-03). [NIST, 2003]; 3pp. [PDF, 173KB]

(2002) proceedings of  Workshop: Machine translation evaluation: human evaluators meet automated metrics, LREC-2002: Third International Conference on Language Resources and Evaluation, Las Palmas Canary Islands, 27 May 2002. [PDF, 249KB]

(2002) Richard Campbell, Carmen Lozano, Jessie Pinkham, & Martine Smets: Machine translation as a testbed for multilingual analysis; Coling-2002 workshop "Grammar engineering and evaluation", 1 September 2002, Taipei, Taiwan; 7pp. [PDF, 199KB]

(2002) Marianne Dabbadie, Anthony Hartley, Margaret King, Keith J.Miller, Widad Mustafa El Hadi, Andrei Popescu-Belis, Florence Reeder, & Michelle Vanni: A hands-on study of the reliability and coherence of evaluation metrics. LREC-2002: Third International Conference on Language Resources and Evaluation. Workshop: Machine translation evaluation: human evaluators meet automated metrics, Las Palmas Canary Islands, 27 May 2002; pp.8-16. [PDF, 114KB]

(2002) George Doddington: Automatic evaluation of machine translation quality using n-gram co-occurrence statistics. HLT 2002: Human Language Technology Conference: proceedings of the second international conference on human language technology research, March 24-27, 2002, San Diego, California; ed. Mitchell Marcus [San Francisco, CA: Morgan Kaufmann for DARPA]; pp. 138-145. [PDF, 344KB]

(2002) George Doddington: Automatic evaluation of language translation using n-gram cooccurrence statistics. LREC-2002: Third International Conference on Language Resources and Evaluation. Workshop: Machine translation evaluation: human evaluators meet automated metrics, Las Palmas Canary Islands, 27 May 2002; 9pp. [PDF of PPT presentation, 572KB]

(2002) Kurt Godden: Towards a speech-to-speech machine translation quality metric; ACL-2002 workshop "Speech-to-speech translation", 11 July 2002, Philadelphia, USA; pp. 117-120 [PDF, 194KB]

(2002) Eduard Hovy, Margaret King, & Andrei Popescu-Belis: Computer-aided specification of quality models for machine translation evaluation. LREC-2002: Third International Conference on Language Resources and Evaluation. Proceedings, Las Palmas de Gran Canaria, Spain, 27 May – 2 June 2002; pp.1239-1246. [PDF, 78KB]

(2002) Eduard Hovy, Maghi King, & Andrei Popescu-Belis: An introduction to MT evaluation. LREC-2002: Third International Conference on Language Resources and Evaluation. Workshop: Machine translation evaluation: human evaluators meet automated metrics, Las Palmas Canary Islands, 27 May 2002; pp.1-7. [PDF, 96KB]

(2002) Alon Lavie, Stephan Vogel, Alex Waibel, Ulrich Germann, Kevin Knight, Daniel Marcu, Young-Suk Lee, Kishore Papineni, Salim Roukos, Franz Josef Och, Moussa Bamba, Chris Cieri, Shudong Huang, Florence Reeder, George Doddington: DARPA TIDES MT group meeting, Marina del Rey, Jan 25, 2002; 9pp. [PDF of PPT, 540KB]

(2002) Kishore Papineni: Machine translation evaluation: n-grams to the rescue. LREC-2002: Third International Conference on Language Resources and Evaluation. Proceedings, Las Palmas de Gran Canaria, Spain, 27 May – 2 June 2002; 2pp. [abstract only] [PDF, 52KB]

(2002) Kishore Papineni, Salim Roukos, Todd Ward & Wei-Jing Zhu: BLEU: a method for automatic evaluation of machine translation. ACL-2002: 40th Annual meeting of the Association for Computational Linguistics, Philadelphia, July 2002; pp.311-318. [PDF, 281KB]

(2002) Kishore Papineni, Salim Roukos, Todd Ward, John Henderson, & Florence Reeder: Corpus-based comprehensive and diagnostic MT evaluation: initial Arabic, Chinese, French, and Spanish results. HLT 2002: Human Language Technology Conference: proceedings of the second international conference on human language technology research, March 24-27, 2002, San Diego, California; ed. Mitchell Marcus [San Francisco, CA: Morgan Kaufmann for DARPA]; pp. 132-137. [PDF, 198KB]

(2002) Andrei Popescu-Belis, Margaret King, & Houcine Benantar: Towards a corpus of corrected human translations. LREC-2002: Third International Conference on Language Resources and Evaluation. Workshop: Machine translation evaluation: human evaluators meet automated metrics, Las Palmas Canary Islands, 27 May 2002; pp.17-21. [PDF, 50KB]

(2002) Martin Rajman & Anthony Hartley: Automatic ranking of MT systems. LREC-2002: Third International Conference on Language Resources and Evaluation. Proceedings, Las Palmas de Gran Canaria, Spain, 27 May – 2 June 2002; pp.1247-1253. [PDF, 48KB]

(2002) Véronique Sauron: Tearing out the terms: evaluating terms extractors. Translating and the Computer 24: proceedings from the Aslib conference held on 21-22 November 2002 (London: Aslib, 2002); 18pp. [PDF, 142KB]

(2002) Eiichiro Sumita, Yasuhiro Akiba, & Kenji Imamura: Reliability measures for translation quality.  ICSLP 2002, Interspeech 2002: 7th International Conference on Spoken Language Processing, September 16-20, 2002, Denver, Colorado, USA; pp.1893-1896; abstract [PDF, 43KB]

(2002) Michelle Vanni & Keith Miller: Scaling the ISLE framework: use of existing corpus resources for validation of MT evaluation metrics across languages. LREC-2002: Third International Conference on Language Resources and Evaluation. Proceedings, Las Palmas de Gran Canaria, Spain, 27 May – 2 June 2002; pp.1254-1262. [PDF, 91KB]

(2002) Jianmin Yao, Ming Zhou, Tiejun Zhao, Hao Yu & Sheng Li: An automatic evaluation method for localization oriented lexicalised EBMT system. Coling 2002, Taipei, Taiwan, 26-30 August 2002 [PDF, 320KB]

(2002) The 2002 NIST machine translation evaluation plan (MT-02). [NIST, 2002]; 2pp. [PDF, 42KB]

(2001) proceedings of the Workshop on MT evaluation, MT Summit VIII, Santiago de Compostela, Spain, 21 September 2001.

(2001) proceedings of MT Eval Workshop, Geneva, 19-24 April 2001.

(2001) Timothy Baldwin: Low cost, high-performance translation retrieval: dumber is better. ACL-EACL-2001: 39th Annual meeting [of the Association for Computational Linguistics] and 10th Conference of the European Chapter [of ACL], July 9th - 11th 2001, Toulouse, France; pp.18-25. [PDF, 283KB]

(2001) Martin Braschler & Carol Peters: The CLEF campaign. NTCIR Workshop 2: Proceedings of the Second NTCIR Workshop on Research in Chinese & Japanese Text retrieval and Text Summarization, March 7-9, 2001, Tokyo, Japan; 6pp. [PDF, 163KB]

(2001) Chris Callison-Burch & Raymond S. Flournoy: A program for automatically selecting the best output from multiple machine translation engines. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp. 63-66. [PDF, 155KB]

(2001) Simon Corston-Oliver, Michael Gamon, & Chris Brockett: A machine learning approach to the automatic evaluation of machine translation. ACL-EACL-2001: 39th Annual meeting [of the Association for Computational Linguistics] and 10th Conference of the European Chapter [of ACL], July 9th - 11th 2001, Toulouse, France; pp.140-147. [PDF, 87KB]

(2001) M. Fuji, N. Hatanaka, E. Ito, S. Kamei, H. Kumai, T. Sukehiro, T. Yoshimi & H. Isahara: Evaluation method for determining groups of users who find MT "useful". MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp. 103-108. [PDF, 303KB]

(2001) Michael Gamon, Hisami Suzuki & Simon Corston-Oliver: Using machine learning for system-internal evaluation of transferred linguistic representations. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp. 109-114. [PDF, 85KB]

(2001) A. Guessoum & R. Zantout: Semi-automatic evaluation of the grammatical coverage of machine translation systems. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp. 133-138. [PDF, 55KB]

(2001) Kevin Knight, Lori Levin, Young-Suk Lee, Salim Roukos, & Alex Waibel: Machine translation in TIDES. Planning Committee report, [2001]. 13 slides [PDF from PPT, 20KB]

(2001) Keith J. Miller & Michelle Vanni: Scaling the ISLE taxonomy: development of metrics for the multi-dimensional characterisation of machine translation quality. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp.229-238. [PDF, 134KB]

(2001) Hermann Ney: Stochastic modelling: from pattern classification to language translation. ACL-EACL 2001 workshop "Data-driven machine translation", July 7, 2001, Toulouse, France; pp.33-37. [PDF, 68KB]

(2001) Kishore Papineni, Salim Roukos, Todd Ward, & Wei-Jing Zhu: Bleu: a method for automatic evaluation of machine translation. IBM Research Report, RC22176, September 17, 2001. 10pp. [PDF, 379KB]

(2001) Florence Reeder: Is that your final answer?  HLT-2001: Proceedings of the First International Conference on Human Language Technology Research, San Diego, CA, March 18-21, 2001; 4pp. [PDF, 55KB]

(2001) Celia Rico: Reproducible models for CAT tools evaluation: a user-oriented perspective. Translating and the Computer 23: papers from the Aslib conference held on 29 & 30 November 2001 (London: Aslib, 2001); 12pp. [PDF, 76KB]

(2001) Fumiaki Sugaya, Keiji Yasuda, Toshiyuki Takezawa & Seiichi Yamamoto: Precise measurement method of a speech translation system's capability with a paired comparison method between the system and humans. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp.345-350. [PDF, 78KB]

(2001) Michelle Vanni & Keith J. Miller: Scaling the ISLE framework: validating tests of machine translation quality for multi-dimensional measurement. MT Summit VIII, Santiago de Compostela, Spain, 18-22 September 2001. Workshop on MT Evaluation; pp.21-27. [PDF, 121KB]

(2001) John White: Predicting intelligibility from fidelity in MT evaluation. MT Summit VIII, Santiago de Compostela, Spain, 18-22 September 2001. Workshop on MT Evaluation; pp.35-37. [PDF, 235KB]

(2001) Keiji Yasuda, Fumiaki Sugaya, Toshiyuki Takezawa, Seiichi Yamamoto & Masuzo Yanagida: An automatic evaluation method of translation quality using translation answer candidates queried from a parallel corpus. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp.373-378. [PDF, 193KB]

(2001) Shoichi Yokoyama, Hideki Kashioka, Akira Kumano, Masaki Matsudaira, Yoshiko Shirokizawa, Shuji Kodama, Terumasa Ehara, Shinichiro Miyazawa & Yuzo Murata: An automatic evaluation method for machine translation using two-way MT. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp.379-384. [PDF, 176KB]

(2001) [NIST]. Automatic evaluation of language translation using n-gram co-occurrence statistics. [NIST, 2001]; 8pp. [PDF of PPT, 40KB]

(2000) Lars Ahrenberg, Magnus Merkel, Anna Sågvall Hein, & Jörg Tiedemann: Evaluation of word alignment systems. LREC-2000: Second International Conference on Language Resources and Evaluation. Proceedings, Athens, Greece, 31 May – 2 June 2000; pp. 1255-1261. [PDF, 124KB]

(2000) Lars Ahrenberg & Magnus Merkel: Correspondence measures for MT evaluation. LREC-2000: Second International Conference on Language Resources and Evaluation. Workshop proceedings: Evaluation of machine translation, Athens, Greece, 29 May 2000; pp. 41-45. [PDF, 219KB]

(2000) Catalina Barbu & Ruslan Mitkov: Evaluation environment for anaphora resolution. MT2000: machine translation and multilingual applications in the new millennium: international conference at the University of Exeter, 20-22 November 2000, organised by the British Computer Society. [London: BCS]; 8pp. [PDF, 1772KB]

(2000) Niamh Bohan, Elisabeth Breidt, & Martin Volk: Evaluating translation quality as input to product development. LREC-2000: Second International Conference on Language Resources and Evaluation. Proceedings, Athens, Greece, 31 May – 2 June 2000; pp. 33-37. [PDF, 57KB]

(2000) Martin Braschler, Donna Harman, Michael Hess, Michael Kluck, Carol Peters, & Peter Schäuble: The evaluation of systems for cross-language information retrieval. LREC-2000: Second International Conference on Language Resources and Evaluation. Proceedings, Athens, Greece, 31 May – 2 June 2000; pp. 1469-1474. [PDF, 60KB]

(2000) Michael Carl: A model of competence for corpus-based machine translation. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July – 4 August 2000; pp. 997-1001 [PDF, 473KB]

(2000) Lynette Hirschman, Florence Reeder, John D.Burger, & Keith Miller: Name translation as a machine translation evaluation task. LREC-2000: Second International Conference on Language Resources and Evaluation. Workshop proceedings: Evaluation of machine translation, Athens, Greece, 29 May 2000; pp. 21-28?. [PDF, 189KB]

(2000) Susanne J. Jekat & Lorenzo Tessiore: End-to-end evaluation of machine interpretation systems: a graphical evaluation tool. LREC-2000: Second International Conference on Language Resources and Evaluation. Proceedings, Athens, Greece, 31 May – 2 June 2000; pp. 1583-1588. [PDF, 69KB]

(2000) Douglas A.Jones & Gregory M.Rusk: Toward a scoring function for quality-driven machine translation. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July – 4 August 2000; pp. 376-382 [PDF, 644KB]

(2000) Sonja Nießen, Franz Josef Och, Gregor Leusch, & Hermann Ney: An evaluation tool for machine translation: fast evaluation for MT research. LREC-2000: Second International Conference on Language Resources and Evaluation. Proceedings, Athens, Greece, 31 May – 2 June 2000; pp. 39-45. [PDF, 1048KB]

(2000) Uwe Reinke: Evaluating the linguistic performance of translation memory systems. [abstract] In: IAI Working Paper no.36, 2000; 5pp. [PDF, 76KB]

(2000) Harold Somers & Elizabeth Wild: Evaluating machine translation: the cloze procedure revisited. Translating and the Computer 22: proceedings of the Twenty-second International Conference… 16-17 November 2000 (London: Aslib, 2000); 11pp.  [PDF, 85KB]

(2000) Michelle Vanni & Florence Reeder: How are you doing? A look at MT evaluation. Envisioning machine translation in the information future: 4th conference of the Association for Machine Translation in the Americas, AMTA 2000, Cuernavaca, Mexico, October 2000; ed. John S. White (Berlin: Springer Verlag, 2000); pp.109-116. [go to publisher details]

(2000) Stephan Vogel, Sonja Nießen, & Hermann Ney: Automatic extrapolation of human assessment of translation quality. LREC-2000: Second International Conference on Language Resources and Evaluation. Workshop proceedings: Evaluation of machine translation, Athens, Greece, 29 May 2000; pp. 35-39. [PDF, 636KB]

(2000) John S. White: Contemplating automatic MT evaluation. Envisioning machine translation in the information future: 4th conference of the Association for Machine Translation in the Americas, AMTA 2000, Cuernavaca, Mexico, October 2000; ed. John S. White (Berlin: Springer Verlag, 2000); pp.100-108. [go to publisher details]

(2000) John White: Toward an automated, task-based MT evaluation strategy. LREC-2000: Second International Conference on Language Resources and Evaluation. Workshop proceedings: Evaluation of machine translation, Athens, Greece, 29 May 2000; p. 11. [abstract only] [PDF, 72KB]

(2000) John S.White, Jennifer B. Doyon, & Susan W. Talbott: Task tolerance of MT output in integrated text processes; ANLP/NAACL 2000 workshop: Embedded machine translation systems, May 4, 2000, Seattle, Washington, [USA]; pp.9-16. [PDF, 786KB]

(2000) John White, Jennifer Doyon, & Susan Talbott: Determining the tolerance of text-handling tasks for MT output. LREC-2000: Second International Conference on Language Resources and Evaluation. Proceedings, Athens, Greece, 31 May – 2 June 2000; pp. 29-32. [PDF, 34KB]

Evaluations of systems (see also User experiences in index of applications)

(2004) Heather Fulford & Joaquin Granell-Zafra: The freelance translator's workstation: an empirical investigation. 9th EAMT Workshop, "Broadening horizons of machine translation and its applications", 26-27 April 2004, Malta; pp.53-61. [PDF, 165KB]

(2004) Federico Gaspari: Online MT services and real users' needs: an empirical usability evaluation. Machine translation: from real users to research: 6th conference of the Association for Machine Translation in the Americas, AMTA 2004, Washington, DC, September 28 – October 2, 2004; ed. Robert E.Frederking and Kathryn B.Taylor (Berlin: Springer Verlag, 2004); pp. 74-85. [go to publisher details]

(2004) Federico Gaspari: Integrating on-line MT services into monolingual web-sites for dissemination purposes: an evaluation perspective. 9th EAMT Workshop, "Broadening horizons of machine translation and its applications", 26-27 April 2004, Malta; pp.62-72. [PDF, 186KB]

(2004) Nano Gough & Andy Way: Robust large-scale EBMT with marker-based segmentation; TMI-2004: proceedings of the Tenth Conference on Theoretical and Methodological Issues in Machine Translation, October 4-6, 2004, Baltimore, Maryland, USA; pp.95-104. [PDF, 94KB]

(2004) Richard Jelinek: Modern MT systems and the myth of human translation: real world status quo. Translating and the Computer 26: proceedings of the Twenty-sixth International Conference on Translating and the Computer, 18-19 November 2004, London. (London: Aslib, 2004); 15pp. [PDF, 149KB]

(2004) J. Laoudi, C.Tate, & Clare R.Voss: Towards an automated evaluation of an embedded MT system. 9th EAMT Workshop, "Broadening horizons of machine translation and its applications", 26-27 April 2004, Malta; pp. 106-115. [PDF, 250KB]

(2004) Johann Roturier: Assessing the set of controlled language rules: can they improve the performance of commercial machine translation systems? Translating and the Computer 26: proceedings of the Twenty-sixth International Conference on Translating and the Computer, 18-19 November 2004, London. (London: Aslib, 2004); 14pp. [PDF, 178KB]

(2004) Diana Santos, Belinda Mala, & Luís Sarmento: Gathering empirical data to evaluate MT from English to Portuguese.  LREC-2004. Workshop, 25th May 2004: The amazing utility of parallel and comparable corpora; pp. 14-17. [PDF, 317KB]

(2004) Kiyoshi Sudo, Satoshi Sekine, & Ralph Grishman: Cross-lingual information extraction system evaluation. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 186KB]

(2004) Gregor Thurmair: Comparing rule-based and statistical MT output. LREC-2004. Workshop, 25th May 2004: The amazing utility of parallel and comparable corpora; pp. 5-9. [PDF, 225KB]

(2004) M. Vanni, C.R.Voss, & C. Tate: Ground truth, reference truth & “omniscient truth” – parallel phrases in parallel texts for MT evaluation.  LREC-2004. Workshop, 25th May 2004: The amazing utility of parallel and comparable corpora; pp. 10-13. [PDF, 352KB]

(2004) Per Weijnitz, Eva Forsbom, Ebba Gustavii, Eva Pettersson, & Jörg Tiedemann: MT goes farming: comparing two machine translation approaches on a new domain.  LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.2043-2046. [PDF, 296KB]

(2003) Katri A. Clodfelder: An LSA implementation against parallel texts in French and English. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 119KB]

(2003) John Hutchins: Has machine translation improved? some historical comparisons. MT Summit IX, New Orleans, USA, 23-27 September 2003; pp.181-188. [PDF, 190KB]

(2003) Philipp Koehn, Franz Josef Och, & Daniel Marcu: Statistical phrase-based translation. HLT-NAACL 2003: conference combining Human Language Technology conference series and the North American Chapter of the Association for Computational Linguistics conference series, May 27 – June 1, 2003, Edmonton, Canada; pp.48-54 [PDF, 100KB]

(2003) Rob Koeling, Adam Kilgarriff, David Tugwell, & Roger Evans: An evaluation of a lexicographer's workbench: building lexicons for machine translation. 7th EAMT Workshop, "Improving machine translation through other language technology tools", 13 April 2003, Budapest, Hungary; pp. 9-16 [PDF, 250KB]

(2003) Elisabeth Maier & Anthony Clarke: Scalability in MT systems. MT Summit IX, New Orleans, USA, 23-27 September 2003; pp.248-253. [PDF, 414KB]

(2003) Harold Somers & Yuri Sugita: Evaluating commercial spoken language translation software. MT Summit IX, New Orleans, USA, 23-27 September 2003; pp. 370-377. [PDF, 174KB]

(2003) David Stallard, John Makhoul, Frederick Choi, Ehry Macrostie, Premkumar Natarajan, Richard Schwartz, & Bushra Zawaydeh: Design and evaluation of a limited two-way speech translator. Eurospeech 2003 - Interspeech 2003 8th European  Conference on  Speech Communication and Technology, Geneva, Switzerland, September 1-4, 2003; pp.2221-2224; abstract [PDF, 34KB]

(2003) Eiichiro Sumita, Yasuhiro Akiba, Takao Doi, Andrew Finch, Kenji Imamura, Michael Paul, Mitsuo Shimohata & Taro Watanabe: A corpus-centred approach to spoken language translation. EACL 2003: 10th Conference of the European Chapter of the Association for Computational Linguistics, April 12-17, 2003, Budapest, Hungary. Proceedings; pp.171-174 [PDF, 243KB]

(2003) Marina Vassiliou, Stella Markantonatou, Yanis Maistros & Vangelis Karkaletsis: Evaluating specifications for controlled Greek. Controlled language translation, EAMT-CLAW-03, Dublin City University, 15-17 May 2003; pp.185-193. [PDF, 278KB]

(2003) Ashish Venugopal, Stephan Vogel, & Alex Waibel: Effective phrase translation extraction from alignment models. ACL-2003: 41st Annual meeting of the Association for Computational Linguistics, July 7-12, 2003, Sapporo, Japan. [PDF, 134KB]

(2003) Keiji Yasuda, Fumiaki Sugaya, Toshiyuki Takezawa, Seiichi Yamamoto & Masuzo Yanagida: Applications of automatic evaluation methods to measuring a capability of speech translation system. EACL 2003: 10th Conference of the European Chapter of the Association for Computational Linguistics, April 12-17, 2003, Budapest, Hungary. Proceedings; pp.371-378 [PDF, 1659KB]

(2002) R.Cattoni, G.Lazzari, N.Mana, F.Pianesi, E.Pianta, S.Burger, D.Gates, A.Lavie, L.Levin, C.Langley, K.Peterson, T.Schultz, A.Waibel, D.Wallace, F.Metze, J.McDonough, H.Soltau, L.Besacier, H.Blanchon, D.Vaufreydaz, E.Costantini, & L.Taddei: Not only translation quality: evaluating the NESPOLE! speech-to-speech translation system along other viewpoints; ACL-2002 workshop "Speech-to-speech translation", 11 July 2002, Philadelphia, USA; 9pp. [PDF, 148KB]

(2002) Marianne Dabbadie, Widad Mustafa El Hadi, & Ismail Timimi: Terminological enrichment for non-interactive MT evaluation. LREC-2002: Third International Conference on Language Resources and Evaluation. Proceedings, Las Palmas de Gran Canaria, Spain, 27 May – 2 June 2002; pp.1878-1884. [PDF, 63KB]

(2002) Robert E.Frederking, Alan W.Black, Ralf D.Brown, John Moody, & Eric Steinbrecher: Field testing the Tongues speech-to-speech machine translation system. LREC-2002: Third International Conference on Language Resources and Evaluation. Proceedings, Las Palmas de Gran Canaria, Spain, 27 May – 2 June 2002; pp.160-164. [PDF, 130KB]

(2002) Bowen Hui: Measuring user acceptability of machine translations to diagnose system errors: an experience report; Coling-2002 workshop "Machine translation in Asia", 1 September 2002, Taipei, Taiwan; 7pp. [PDF, 254KB]

(2002) Yu-Seop Kim, Jeong-Ho Chang, & Byoung-Tak Zhang: A comparative evaluation of data-driven models in translation selection of machine translation. Coling 2002, Taipei, Taiwan, 26-30 August 2002 [PDF, 170KB]

(2002) Philippe Langlais, Marie Loranger, & Guy Lapalme: Translators at work with TransType: resource and evaluation. LREC-2002: Third International Conference on Language Resources and Evaluation. Proceedings, Las Palmas de Gran Canaria, Spain, 27 May – 2 June 2002; pp.2128-2135. [PDF, 253KB]

(2002) Alon Lavie, Florian Metze, Roldano Cattoni, & Erica Costantini: A multi-perspective evaluation of the NESPOLE! speech-to-speech translation system; ACL-2002 workshop "Speech-to-speech translation",11 July 2002, Philadelphia, USA; pp. 121-128 [PDF, 144KB]

(2002) Douglas W.Oard, Frederic C. Gey, & Bonnie J. Dorr: Evaluating Arabic retrieval from English or French queries: the TREC-2001 cross-language information retrieval track. LREC-2002: Third International Conference on Language Resources and Evaluation. Workshop Arabic language resources and evaluation: status and prospects, Las Palmas de Gran Canaria, Spain, 1 June 2002; 6pp. [PDF, 183KB]

(2002) Martin Rajman & Anthony Hartley: Automatic ranking of MT systems. LREC-2002: Third International Conference on Language Resources and Evaluation. Proceedings, Las Palmas de Gran Canaria, Spain, 27 May – 2 June 2002; pp.1247-1253. [PDF, 48KB]

(2002) Solange Rossato, Hervé Blanchon, & Laurent Besacier: Speech-to-speech translation system evaluation: results for French for the NESPOLE! project first showcase.  ICSLP 2002, Interspeech 2002:7th International Conference on  Spoken Language Processing, September 16-20, 2002, Denver, Colorado, USA; pp.1905-1908; abstract [PDF, 68KB]

(2002) Véronique Sauron: Tearing out the terms: evaluating terms extractors. Translating and the Computer 24: proceedings from the Aslib conference held on 21-22 November 2002 (London: Aslib, 2002); 18pp. [PDF, 142KB]

(2002) Fumiaki Sugaya, Keiji Yasuda, Toshiyuki Takezawa, & Seiichi Yamamoto: Quality-sensitive test set selection for a speech translation system; ACL-2002 workshop "Speech-to-speech translation",11 July 2002, Philadelphia, USA; pp. 109-116 [PDF, 297KB]

(2002) Keiji Yasuda, Fumiaki Sugaya, Toshiyuki Takezawa, Seiichi Yamamoto, & Masuzo Yanagida: Automatic machine translation selection scheme to output the best result. LREC-2002: Third International Conference on Language Resources and Evaluation. Proceedings, Las Palmas de Gran Canaria, Spain, 27 May – 2 June 2002; pp.525-528. [PDF, 113KB]

(2002) Angelika Zerfass: Evaluating translation memory systems. LREC-2002: Third International Conference on Language Resources and Evaluation. Workshop: Language resources for translation work and research, Las Palmas, Canary Islands, 27 May 2002; pp.49-52. [PDF, 29KB]

(2002) Machine translation and the Virtual Museum of Canada. [2002]. 188pp. [PDF, 903KB]

(2001) Chris Callison-Burch: Upping the ante for “best of breed” machine translation providers.  Translating and the Computer 23: papers from the Aslib conference held on 29 & 30 November 2001 (London: Aslib, 2001); 9pp. [PDF, 52KB]

(2001) Maki Darwin: Trial and error: an evaluation project on Japanese <> English MT output quality. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp. 77-82. [PDF, 248KB]

(2001) Sungryong Koh, Jinee Maeng, Ji-Young Lee, Young-Sook Chae & Key-Sun Choi: A test suite for evaluation of English-to-Korean machine translation systems. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp.191-195. [PDF, 140KB]

(2001) Derek Lewis: PC-based machine translation: an illustration of capabilities in response to submitted test sentences.  In: Machine Translation Review, issue 12: December 2001; pp.36-57.

(2001) Elisabeth Maier, Anthony Clarke & Hans-Udo Stadler: Evaluation of machine translation systems at CLS Corporate Language Services AG. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp.223-228. [PDF, 187KB]

(2001) Sébastien Sauvé, Philippe Langlais, & Guy Lapalme: User interface aspects of a translation typing system.  Advances in Artificial Intelligence: 14th Biennial Conference of the Canadian Society for Computational Studies of Intelligence, AI 2001, Ottawa, Canada, June 7-9, 2001, Proceedings; pp. 246-256 [PDF, 267KB]

(2001) Benjamin K. Tsou & Oi Yee Kwong: Evaluating Chinese-English translation systems for personal name coverage. MT Summit VIII, Santiago de Compostela, Spain, 18-22 September 2001. Towards a Road Map for MT [PDF, 197KB]

(2000) Arendse Bernth & Michael C. McCord: The effect of source analysis on translation confidence. Envisioning machine translation in the information future: 4th conference of the Association for Machine Translation in the Americas, AMTA 2000, Cuernavaca, Mexico, October 2000; ed. John S. White (Berlin: Springer Verlag, 2000); pp.89-99. [go to publisher details]

(2000) Damir Ćavar, Uwe Küssner, & Dan Tidhar: From human evaluation to automatic selection of good translations. LREC-2000: Second International Conference on Language Resources and Evaluation. Workshop proceedings: Evaluation of machine translation, Athens, Greece, 29 May 2000; pp. 29-33. [PDF, 158KB]

(2000) A.Fourla, O.Yannoutsou, I.Tsakou, S.Stamou, & A.Petrits: The contribution of a user group to the evaluation and improvement of an MT system. Translating and the Computer 22: proceedings of the Twenty-second International Conference… 16-17 November 2000 (London: Aslib, 2000); 13pp.  [PDF, 132KB]

(2000) M.Holland, C. Schlesinger, & C. Tate: Evaluating embedded machine translation in military field exercises. Envisioning machine translation in the information future: 4th conference of the Association for Machine Translation in the Americas, AMTA 2000, Cuernavaca, Mexico, October 2000; ed. John S. White (Berlin: Springer Verlag, 2000); pp.239-247. [go to publisher details]

(2000) Olivier Kraif: Evaluation of statistical tools for automatic extraction of lexical correspondences between parallel texts. MT2000: machine translation and multilingual applications in the new millennium: international conference at the University of Exeter, 20-22 November 2000, organised by the British Computer Society. [London: BCS]; 8pp. [PDF, 1579KB]

(2000) Philippe Langlais, Sébastien Sauvé, George Foster, Elliott Macklovitch, & Guy Lapalme: Evaluation of TransType, a computer-aided translation typing system: a comparison of a theoretical- and a user-oriented evaluation procedures. LREC-2000: Second International Conference on Language Resources and Evaluation. Proceedings, Athens, Greece, 31 May – 2 June 2000; pp. 641-648. [PDF, 1302KB]

(2000) Lori Levin, Boris Bartlog, Ariadna Font Llitjos, Donna Gates, Alon Lavie, Dorcas Wallace, Taro Watanabe, & Monika Woszczyna: Lessons learned from a task-based evaluation of speech-to-speech machine translation. LREC-2000: Second International Conference on Language Resources and Evaluation. Proceedings, Athens, Greece, 31 May – 2 June 2000; pp. 721-724. [PDF, 460KB]

(2000) Rita Nübel & Jörg Schütz: Evaluation as a language technology deployment trigger. Fifth EAMT Workshop "Harvesting existing resources", May 11 - 12, 2000, Ljubljana, Slovenia; pp.69-75. [PDF, 55KB]

(2000) Harold Somers & Elizabeth Wild: Evaluating machine translation: the cloze procedure revisited. Translating and the Computer 22: proceedings of the Twenty-second International Conference… 16-17 November 2000 (London: Aslib, 2000); 11pp. [PDF, 85KB]

(2000) Clare R.Voss & Carol Van Ess-Dykema: When is an embedded MT system “good enough” for filtering? ANLP/NAACL 2000 workshop: Embedded machine translation systems, May 4, 2000, Seattle, Washington, [USA]; pp.1-8. [PDF, 649KB]

(2000) Rémi Zajac, Steve Helmreich, & Karine Megerdoomian: Black-box/glass-box evaluation in Shiraz. LREC-2000: Second International Conference on Language Resources and Evaluation. Workshop proceedings: Evaluation of machine translation, Athens, Greece, 29 May 2000; pp. 13-20. [PDF, 693KB]

Example-based methods

(2004) Eiji Aramaki & Sadao Kurohashi: Example-based machine translation using structural translation examples. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2004], September 30 – October 1, 2004, Kyoto, Japan; pp. 91-94 [PDF, 203KB]

(2004) Eiji Aramaki, Sadao Kurohashi, Hideki Kashioka & Hideki Tanaka: Example-based machine translation without saying inferable predicate. First International Joint Conference on Natural Language Processing, Hainan Island, China, March 22-24, 2004; pp.206-215. [abstract]

(2004) Ralf D. Brown: A modified Burrows-Wheeler transform for highly scalable example-based translation. Machine translation: from real users to research: 6th conference of the Association for Machine Translation in the Americas, AMTA 2004, Washington, DC, September 28 – October 2, 2004; ed. Robert E.Frederking and Kathryn B.Taylor (Berlin: Springer Verlag, 2004); pp. 27-36. [go to publisher details]

(2004) Violetta Cavalli-Sforza, Jaime G. Carbonell, & Peter J. Jansen: Developing language resources for a transnational digital government system. LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.945-948. [PDF, 271KB]

(2004) Violetta Cavalli-Sforza, Ralf D.Brown, Jaime G.Carbonell, Peter J. Jansen, & Jae Dong Kim: Challenges in using an example-based MT system for a transnational digital government project. 9th EAMT Workshop, "Broadening horizons of machine translation and its applications", 26-27 April 2004, Malta; pp.33-42. [PDF, 216KB]

(2004) Walter Daelemans: [review of] Recent advances in example-based machine translation [eds.] Michael Carl and Andy Way. Computational Linguistics 30 (4), pp. 516-520. [PDF, 87KB]

(2004) Takao Doi & Eiichiro Sumita: Splitting input sentences for machine translation using language model with sentence similarity.  Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 61KB]

(2004) Nano Gough & Andy Way: Example-based controlled translation. 9th EAMT Workshop, "Broadening horizons of machine translation and its applications", 26-27 April 2004, Malta; pp.73-81. [PDF, 155KB]

(2004) Nano Gough & Andy Way: Robust large-scale EBMT with marker-based segmentation; TMI-2004: proceedings of the Tenth Conference on Theoretical and Methodological Issues in Machine Translation, October 4-6, 2004, Baltimore, Maryland, USA; pp.95-104. [PDF, 94KB]

(2004) Gábor Hodász, Tamás Gröbler, & Balázs Kis: Translation memory as a robust example-based translation system. 9th EAMT Workshop, "Broadening horizons of machine translation and its applications", 26-27 April 2004, Malta; pp.82-89. [PDF, 227KB]

(2004) Hou Hongxu, Deng Dan, Zou Gang, Yu Hongkui, Liu Yang, Xiong Deyi & Liu Qun: An EBMT system based on word alignment. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2004], September 30 – October 1, 2004, Kyoto, Japan; pp. 47-49 [PDF, 132KB]

(2004) Kenji Imamura, Hideo Okuma, Taro Watanabe, & Eiichiro Sumita: Example-based machine translation based on syntactic transfer with statistical models. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 56KB]

(2004) Hiroyuki Kaji: Adapted seed lexicon and combined bidirectional similarity measures for translation equivalent extraction from comparable corpora; TMI-2004: proceedings of the Tenth Conference on Theoretical and Methodological Issues in Machine Translation, October 4-6, 2004, Baltimore, Maryland, USA; pp.115-124. [PDF, 378KB]

(2004) Lambros Kranias & Anna Samiotou: Automatic translation memory fuzzy match post-editing: a step beyond traditional TM/MT integration. LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.331-334. [PDF, 346KB]

(2004) Michael Paul, Eiichiro Sumita, & Seiichi Yamamoto: Example-based rescoring of statistical machine translation output.  HLT-NAACL 2004: Human Language Technology conference and North American Chapter of the Association for Computational Linguistics annual meeting, May 2-7, 2004, The Park Plaza Hotel, Boston, USA – Short Papers; pp.9-12 [PDF, 89KB]

(2004) Suhaimi Ab. Rahman & Normaziah Abdul Aziz: Improving word alignment in an English-Malay parallel corpus for machine translation.  LREC-2004. Workshop, 25th May 2004: The amazing utility of parallel and comparable corpora; pp. 22-25. [PDF, 414KB]

(2004) Mitsuo Shimohata, Eiichiro Sumita & Yuji Matsumoto: Method for retrieving a similar sentence and its application to machine translation; TMI-2004: proceedings of the Tenth Conference on Theoretical and Methodological Issues in Machine Translation, October 4-6, 2004, Baltimore, Maryland, USA; pp.105-114. [PDF, 112KB]

(2004) Eiichiro Sumita, Yasuhiro Akiba, Takao Doi, Andrew Finch, Kenji Imamura, Hideo Okuma, Michael Paul, Mitsuo Shimohata & Taro Watanabe: EBMT, SMT, hybrid and more: ATR spoken language translation system. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2004], September 30 – October 1, 2004, Kyoto, Japan; pp.13-20 [PDF, 170KB]

(2004) Cristina Vertan: Language resources for the Semantic Web – perspectives for machine translation. Coling 2004: Proceedings of the Second International Workshop on Language Resources for Translation Work, Research and Training, 28th August, University of Geneva, Switzerland; pp. 37-41. [PDF, 232KB]

(2004) Wong Fai, Tang Chi Wai, Dong Ming Chui, Mao Yu Hang, & Li Yi Ping: Example-based machine translation based on translation corresponding tree representation. Proceedings of The First International Joint Conference on Natural Language Processing (IJCNLP-04), Asian Symposium on Natural Language Processing to Overcome Language Barriers, Hainan Island, China; pp.31-38. [PDF, 334KB]

(2004) Fai Wong, Dong Cheng Hu, Yu Hang Mao, & Ming Chui Dong: A flexible example annotation schema: translation corresponding tree representation. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 178KB]

(2004) Wong Fai, Hu Dong Cheng, Mao Yu Hang, Tang Chi Wai, & Dong Ming Chui: Application of translation corresponding tree (TCT) annotation schema in example-based machine translation.  LREC-2004. Workshop, 25th May 2004: The amazing utility of parallel and comparable corpora; pp. 42-45. [PDF, 474KB]

(2004) Xin Zhao, Fuji Ren, & Stefan Voß: A super-function based Japanese-Chinese machine translation system for business users. Machine translation: from real users to research: 6th conference of the Association for Machine Translation in the Americas, AMTA 2004, Washington, DC, September 28 – October 2, 2004; ed. Robert E.Frederking and Kathryn B.Taylor (Berlin: Springer Verlag, 2004); pp. 272-281. [go to publisher details]

(2004) Yang Muyun, Zhao Tiejun, Liu Haijie, Shi Xiasheng & Jiang Hongfei: Auto word alignment based Chinese-English EBMT. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2004], September 30 – October 1, 2004, Kyoto, Japan; pp. 27-29 [PDF, 232KB]

(2003) Michael Carl & Andy Way (eds.) Recent advances in example-based machine translation. Dordrecht/Boston/London: Kluwer Academic Publishers, 2003. (Text, Speech and Language Technology, vol.21)

(2003) Eiji Aramaki, Sadao Kurohashi, Hideki Kashioka, & Hideki Tanaka: Word selection for EBMT based on monolingual similarity and translation confidence. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 1565KB]

(2003) Ralf D. Brown, Rebecca Hutchinson, Paul N. Bennett, Jaime G. Carbonell, & Peter Jansen: Reducing boundary friction using translation-fragment overlap. MT Summit IX, New Orleans, USA, 23-27 September 2003 [PDF, 117KB]

(2003) Michael Carl: Data-assisted controlled translation. Controlled language translation, EAMT-CLAW-03, Dublin City University, 15-17 May 2003 [PDF, 172KB]

(2003) Michael Carl & Philippe Langlais: Tuning general translation knowledge to a sublanguage. Controlled language translation, EAMT-CLAW-03, Dublin City University, 15-17 May 2003 [PDF, 229KB]

(2003) Takao Doi & Eiichiro Sumita: Input sentence splitting and translating. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 201KB]

(2003) Takao Doi, Eiichiro Sumita, & Hirofumi Yamamoto: Adaptation using out-of-domain corpus within EBMT. HLT-NAACL 2003: conference combining Human Language Technology conference series and the North American Chapter of the Association for Computational Linguistics conference series, May 27 – June 1, 2003, Edmonton, Canada; 3pp. [PDF, 33KB]

(2003) Nano Gough & Andy Way: Controlled generation in example-based machine translation. MT Summit IX, New Orleans, USA, 23-27 September 2003 [PDF, 45KB]

(2003) Deepa Gupta & Niladri Chatterjee: Identification of divergence for English to Hindi EBMT. MT Summit IX, New Orleans, USA, 23-27 September 2003 [PDF, 181KB]

(2003) Kenji Imamura, Eiichiro Sumita & Yuji Matsumoto: Automatic construction of machine translation knowledge using translation literalness. EACL 2003: 10th Conference of the European Chapter of the Association for Computational Linguistics, April 12-17, 2003, Budapest, Hungary. Proceedings; pp.155-162 [PDF, 397KB]

(2003) Mihoko Kitamura & Toshiki Murata: Practical machine translation system allowing complex patterns. MT Summit IX, New Orleans, USA, 23-27 September 2003; pp.232-239. [PDF, 109KB]

(2003) Yoon-Hyung Roh, Munpyo Hong, Sung-Kwon Choi, Ki-Young Lee, & Sang-Kyu Park: For the proper treatment of long sentences in a sentence pattern-based English-Korean MT system. MT Summit IX, New Orleans, USA, 23-27 September 2003; pp.323-329. [PDF, 158KB]

(2003) Mitsuo Shimohata, Eiichiro Sumita, & Yuji Matsumoto: Example-based rough translation for speech-to-speech translation. MT Summit IX, New Orleans, USA, 23-27 September 2003; pp.354-361. [PDF, 53KB]

(2003) Mitsuo Shimohata, Eiichiro Sumita, & Yuji Matsumoto: Retrieving meaning-equivalent sentences for example-based rough translation. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 43KB]

(2003) K.C.Siu, Helen M.Meng, & C.C.Wong: Example-based bi-directional Chinese-English machine translation with semi-automatically induced grammars. Eurospeech 2003 - Interspeech 2003 8th European  Conference on  Speech Communication and Technology, Geneva, Switzerland, September 1-4, 2003; pp.2801-2804; abstract [PDF, 34KB]

(2003) Harold Somers & Hermione Lovel: Computer-based support for patients with limited English. 7th EAMT Workshop, "Improving machine translation through other language technology tools", 13 April 2003, Budapest, Hungary; pp. 41-49 [PDF, 198KB]

(2003) Eiichiro Sumita, Yasuhiro Akiba, Takao Doi, Andrew Finch, Kenji Imamura, Michael Paul, Mitsuo Shimohata & Taro Watanabe: A corpus-centred approach to spoken language translation. EACL 2003: 10th Conference of the European Chapter of the Association for Computational Linguistics, April 12-17, 2003, Budapest, Hungary. Proceedings; pp.171-174 [PDF, 243KB]

(2003) Taro Watanabe & Eiichiro Sumita: Example-based decoding for statistical machine translation. MT Summit IX, New Orleans, USA, 23-27 September 2003; pp. 410-417. [PDF, 101KB]

(2003) Andy Way & Nano Gough: wEBMT: developing and validating an example-based machine translation system using the World Wide Web. Computational Linguistics 29 (3), pp.421-457. [PDF, 169KB]

(2003) Andy Way & Nano Gough: Teaching and assessing empirical approaches to machine translation. MT Summit IX, New Orleans, USA, Workshop on Teaching Translation Technologies and Tools, 27 September 2003. [PDF, 34KB]

(2003) Kaoru Yamamoto, Taku Kudo, Yuta Tsuboi, & Yuji Matsumoto: Learning sequence-to-sequence correspondences from parallel corpora via sequential pattern mining. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 101KB]

(2003) How should we define data-driven MT? In: MT News International no.33, Autumn 2003. [PDF]

(2002) Hiyan Alshawi & Shona Douglas: Speech translation performance of statistical dependency transduction and semantic similarity transduction; ACL-2002 workshop "Speech-to-speech translation",11 July 2002, Philadelphia, USA; pp. 31-38 [PDF, 92KB]

(2002) Chris Brockett, Takako Aikawa, Anthony Aue, Arul Menezes, Chris Quirk, & Hisami Suzuki: English-Japanese example-based machine translation using abstract linguistic representation; Coling-2002 workshop "Machine translation in Asia", 1 September 2002, Taipei, Taiwan; 7pp. [PDF, 127KB]

(2002) Ralf Brown: Example-based machine translation.  Tutorial notes, at AMTA 2002: Fifth biennial conference of the Association for Machine Translation in the Americas, October 9, 2002, Tiburon, California, USA [PDF of PPT presentation, 514KB]

(2002) Mirosław Gajer: The pattern-based French-to-Polish machine translation system.  In: Machine Translation Review, issue 13: December 2002; pp.6-34.

(2002) Nano Gough, Andy Way, & Mary Hearne: Example-based machine translation via the Web. Machine translation: from research to real users: 5th conference of the Association for Machine Translation in the Americas, AMTA 2002, Tiburon, CA, October 2002; ed. Stephen D. Richardson (Berlin: Springer Verlag, 2002); pp. 74-83. [go to publisher details]

(2002) Takahiro Ikeda, Shinichi Ando, Kenji Satoh, Akitoshi Okumura, & Takao Watanabe: Automatic interpretation system integrating free-style sentence translation and parallel text based translation; ACL-2002 workshop "Speech-to-speech translation",11 July 2002, Philadelphia, USA; pp. 85-92 [PDF, 240KB]

(2002) Kenji Imamura: Application of translation knowledge acquired by hierarchical phrase alignment for pattern-based MT. TMI-2002 conference, Keihanna, Japan, March 13-17, 2002. [PDF, 142KB]

(2002) Changhyun Kim, Munpyo Hong, Yinxia Huang, Young Kil Kim, Sung Il Yang, Young Ae Seo, & Sung-Kwon Choi: Korean-Chinese machine translation based on verb patterns. Machine translation: from research to real users: 5th conference of the Association for Machine Translation in the Americas, AMTA 2002, Tiburon, CA, October 2002; ed. Stephen D. Richardson (Berlin: Springer Verlag, 2002); pp. 94-103. [go to publisher details]

(2002) Kit Chunyu, Pan Haihua, & Jonathan J.Webster: Example-based machine translation: a new paradigm. In: Chan Sin-wai (ed.) Translation and Information Technology (Hong Kong: Chinese University Press, 2002); pp.57-78.

(2002) Philippe Langlais & Michel Simard: Merging example-based and statistical machine translation: an experiment. Machine translation: from research to real users: 5th conference of the Association for Machine Translation in the Americas, AMTA 2002, Tiburon, CA, October 2002; ed. Stephen D. Richardson (Berlin: Springer Verlag, 2002); pp.104-113. [go to publisher details]

(2002) Arul Menezes: Better contextual translation using machine learning. Machine translation: from research to real users: 5th conference of the Association for Machine Translation in the Americas, AMTA 2002, Tiburon, CA, October 2002; ed. Stephen D. Richardson (Berlin: Springer Verlag, 2002); pp. 124-134. [go to publisher details]

(2002) Reinhard Rapp: A part-of-speech-based search algorithm for translation memories. LREC-2002: Third International Conference on Language Resources and Evaluation. Proceedings, Las Palmas de Gran Canaria, Spain, 27 May – 2 June 2002; pp.466-472. [PDF, 102KB]

(2002) Mitsuo Shimohata & Eiichiro Sumita: Identifying synonymous expressions from a bilingual corpus for example-based machine translation; Coling-2002 workshop "Machine translation in Asia", 1 September 2002, Taipei, Taiwan; 6pp. [PDF, 1324KB]

(2002) Eiichiro Sumita: Corpus-centered computation; ACL-2002 workshop "Speech-to-speech translation",11 July 2002, Philadelphia, USA; pp.1-8 [PDF, 189KB]

(2002) Koichi Takeda: Sentence generation for pattern-based machine translation. TMI-2002 conference, Keihanna, Japan, March 13-17, 2002; pp.168-177. [PDF, 224KB]

(2002) Hideki Tanaka, Stephen Nightingale, Hideki Kashioka, Kenji Matsumoto, Masamichi Nishiwaki, Tadashi Kumano, & Takehiko Maruyama: Speech to speech translation system for monologues – data driven approach. ICSLP 2002, Interspeech 2002: 7th International Conference on Spoken Language Processing, September 16-20, 2002, Denver, Colorado, USA; pp.1717-1720; abstract [PDF, 50KB]

(2002) Jonathan J.Webster, Sin King Kui, & Hu Qinan: The application of Semantic Web technology for example-based machine translation (EBMT).  In: Chan Sin-wai (ed.) Translation and Information Technology (Hong Kong: Chinese University Press, 2002); pp.79-91.

(2002) Jianmin Yao, Ming Zhou, Tiejun Zhao, Hao Yu & Sheng Li: An automatic evaluation method for localization oriented lexicalised EBMT system. Coling 2002, Taipei, Taiwan, 26-30 August 2002 [PDF, 320KB]

(2002) Keiji Yasuda, Fumiaki Sugaya, Toshiyuki Takezawa, Seiichi Yamamoto, & Masuzo Yanagida: Automatic machine translation selection scheme to output the best result. LREC-2002: Third International Conference on Language Resources and Evaluation. Proceedings, Las Palmas de Gran Canaria, Spain, 27 May – 2 June 2002; pp.525-528. [PDF, 113KB]

(2001) Proceedings of the Workshop on Example-Based Machine Translation, MT Summit VIII, Santiago de Compostela, Spain, 18 September 2001. [HTML]

(2001) Eiji Aramaki, Sadao Kurohashi, Satoshi Sato & Hideo Watanabe: Finding translation correspondences from parallel parsed corpus for example-based translation. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp. 27-32. [PDF, 356KB]

(2001) Sivaji Bandyopadhyay: An example based MT systems in news items domain from English to Indian languages.  In: Machine Translation Review, issue 12: December 2001; pp.19-23.

(2001) Michael Carl: Inducing probabilistic invertible translation grammars from aligned texts. ACL-EACL 2001 Workshop on Computational Natural Language Learning (CoNLL), July 2001, Toulouse, France; 7pp. [PDF, 160KB]

(2001) Ilyas Cicekli: A specific least general generalization of strings and its application to example-based machine translation. Proceedings of the 10th Turkish Symposium on Artificial Intelligence and Neural Networks (TAINN 2001), North Cyprus, June 2001; pp.228-237.  [PDF, 138KB]

(2001) Ilyas Cicekli & H.Altay Güvenir: Learning translation templates from bilingual translation examples. Applied Intelligence, vol.15, no.1; pp.57-76. [PDF, 251KB]

(2001) Mirosław Gajer: Towards a fully-automatic high-quality machine translation for unrestricted text.  In: Machine Translation Review, issue 12: December 2001; pp.24-35.

(2001) Keiko Horiguchi: Information-based machine translation. [NAACL-2001] Language Technologies 2001: the Second meeting of the North American Chapter of the Association for Computational Linguistics, Carnegie Mellon University, Pittsburgh, PA, 2-7 June 2001; 8pp. [PDF, 105KB]

(2001) Daniel Marcu: Towards a unified approach to memory- and statistical-based machine translation. ACL-EACL-2001: 39th Annual meeting [of the Association for Computational Linguistics] and 10th Conference of the European Chapter [of ACL], July 9th - 11th 2001, Toulouse, France; pp.378-385. [PDF, 67KB]

(2001) Mark-Jan Nederhof: Approximating context-free by rational transduction for example-based MT. ACL-EACL 2001 workshop "Data-driven machine translation", July 7, 2001, Toulouse, France; pp.25-32. [PDF, 96KB]

(2001) Stephen Richardson, William Dolan, Arul Menezes, & Jessie Pinkham: Achieving commercial-quality translation with example-based methods. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp.293-298. [PDF, 187KB]

(2001) Stephen D.Richardson, William B.Dolan, Arul Menezes, & Monica Corston-Oliver: Overcoming the customization bottleneck using example-based MT. ACL-EACL 2001 workshop "Data-driven machine translation", July 7, 2001, Toulouse, France; pp.9-16. [PDF, 86KB]

(2001) Young-Ae Seo, Yoon-Hyung Roh, Ki-Young Lee & Sang-Kyu Park: CaptionEye/EK: a English-to-Korean caption translation system using the sentence pattern. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp.325-329. [PDF, 279KB]

(2001) Jie Yang, Jiang Gao, Ying Zhang, & Alex Waibel: Towards automatic sign translation. HLT-2001: Proceedings of the First International Conference on Human Language Technology Research, San Diego, CA, March 18-21, 2001; 4pp. [PDF, 206KB]

(2001) Ying Zhang, Ralf Brown, Robert Frederking & Alon Lavie: Pre-processing of bilingual corpora for Mandarin-English EBMT.  MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp.385-390. [PDF, 2374KB]

(2001) Ying Zhang, Ralf D.Brown, & Robert E.Frederking: Adapting an example-based translation system to Chinese. HLT-2001: Proceedings of the First International Conference on Human Language Technology Research, San Diego, CA, March 18-21, 2001; 4pp. [PDF, 50KB]

(2001) Eiichiro Sumita: Example-based machine translation using DP-matching between word sequences. ACL-EACL 2001 workshop "Data-driven machine translation", July 7, 2001, Toulouse, France; pp.1-8. [PDF, 79KB]

(2000) Sivaji Bandyopadhyay: An example-based MT system in news items domain from English to Indian languages. MT2000: machine translation and multilingual applications in the new millennium: international conference at the University of Exeter, 20-22 November 2000, organised by the British Computer Society. [London: BCS]; 5pp. [PDF, 1026KB]

(2000) Ralf D.Brown: Automated generalization of translation examples. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July - 4 August 2000; pp. 125-131 [PDF, 650KB]

(2000) Michael Carl: Combining invertible example-based machine translation with translation memory technology. Envisioning machine translation in the information future: 4th conference of the Association for Machine Translation in the Americas, AMTA 2000, Cuernavaca, Mexico, October 2000; ed. John S. White (Berlin: Springer Verlag, 2000); pp.127-136. [go to publisher details]

(2000) Michael Carl: A model of competence for corpus-based machine translation. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July - 4 August 2000; pp. 997-1001 [PDF, 473KB]

(2000) Alexander Franz, Keiko Horiguchi, Lei Duan, Doris Ecker, Eugene Koontz, & Kazami Uchida: An integrated architecture for example-based machine translation. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July - 4 August 2000; pp. 1031-1035 [PDF, 369KB]

(2000) Jin-Xia Huang, Hyi-Jeong Song, Yeong-Mi Song, Ji-Hyoun Kim, Won-Sek Kang, Chung-Won Seo, Young-Souk Chae, & Key-Sun Choi: MATES/CK: a Chinese-Korean machine translation. ACL-2000: 38th Annual meeting of the Association for Computational Linguistics, Hong Kong. Demonstration notes, 3-6 October 2000; pp.32-33 [PDF, 154KB]

(2000) Jung-Jae Kim, Key-Sun Choi, & Young-Soog Chae: Phrase-pattern-based Korean to English machine translation using two level translation pattern selection. ACL-2000: 38th Annual meeting of the Association for Computational Linguistics, Hong Kong, October 2000. [PDF, 185KB]

(2000) Christos Malavazos & Stelios Piperidis: Application of analogical modelling to example-based machine translation. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July - 4 August 2000; pp. 516-522 [PDF, 665KB]

(2000) Christos Malavazos, Stelios Piperidis, & George Carayannis: Towards memory and template-based translation synthesis. MT2000: machine translation and multilingual applications in the new millennium: international conference at the University of Exeter, 20-22 November 2000, organised by the British Computer Society. [London: BCS]; 8pp. [PDF, 1929KB]

(2000) Emmanuel Planas & Osamu Furuse: Multi-level similar segment matching algorithm for translation memories and example-based machine translation. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July - 4 August 2000; pp. 621-627 [PDF, 564KB]

(2000) Sayori Shimohata: An empirical method for identifying and translating technical terminology. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July - 4 August 2000; pp. 782-788 [PDF, 513KB]

(2000) Hideo Watanabe, Sadao Kurohashi, & Eiji Aramaki: Finding structural correspondences from bilingual parsed corpus for corpus-based translation. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July - 4 August 2000; pp. 906-912 [PDF, 701KB]

(2000) Kaoru Yamamoto & Yuji Matsumoto: Acquisition of phrase-level bilingual correspondence using dependency structure. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July - 4 August 2000; pp. 933-939 [PDF, 592KB]

Failures and limitations (see also Unknown words)

(2003) Doug Arnold: Why translation is difficult for computers. In: Harold Somers (ed.) Computers and translation: a translator’s guide (Amsterdam/Philadelphia: John Benjamins Publishing Company, 2003); pp.119-142.

(2003) Nico Weber: Automatic translation: background, problems and perspectives. Presented at International Colloquium on Multilingualism and Electronic Information Management, University of the Free State, 22 September 2003. 12pp. [PDF, 255KB]

(2002) Evangeline S.P.Almberg: Shall we dance, when the smart machines take over, virtually?  In: Chan Sin-wai (ed.) Translation and Information Technology (Hong Kong: Chinese University Press, 2002); pp.185-193.

(2002) Yves Champollion: Automated translation: the next frontier. Translating and the Computer 24: proceedings from the Aslib conference held on 21-22 November 2002 (London: Aslib, 2002); 7pp. [PDF, 47KB]; presentation: 10pp. [PDF, 44KB]

(2002) Daniel Gervais: The full-text multilingual corpus: breaking the translation memory bottleneck. Translating and the Computer 24: proceedings from the Aslib conference held on 21-22 November 2002 (London: Aslib, 2002); 12pp. [PDF, 160KB]; presentation, 22 slides [PDF of PPT, 2753KB]

(2001) Yves Champollion: Machine translation (MT), and the future of the translation industry. Translation Journal 5 (1), January 2001; 3pp. [PDF, 72KB]

(2001) Alan K. Melby: Should I use machine translation? Translation Research Group. [HTML files]

(2000) Nancy Brigham: Machine translation: its past, the potential and the problems (Global communications and the global language divide, 1). University of Michigan School of Information, February 21, 2000. 23pp. [PDF, 443KB]

(2000) Steve Vlasta Vitek: Reflections of a human translator on machine translation. Translation Journal 4 (3), July 2000; 8pp. [PDF, 93KB]

(2000) Paul F. Wood: What do translators do? And what machines can not. The Linguist, October 2000; [PDF, 133KB]

Feedback see Learning and training

Finite-state methods (see also Statistical analysis)

(2004) Francisco Casacuberta & Enrique Vidal: Machine translation with inferred stochastic finite-state transducers. Computational Linguistics 30 (2), pp. 205-224. [PDF, 501KB]

(2004) Jorge Civera, Elsa Cubel, Antonio L.Lagarda, David Picó, Jorge González, Enrique Vidal, Francisco Casacuberta, Juan M.Vilar, & Sergio Barrachina: From machine translation to computer-assisted translation using finite-state models.  EMNLP-2004: Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, 25-26 July 2004, Barcelona, Spain; 8pp. [PDF, 77KB]

(2004) Josep M.Crego, José B.Mariño, & Adrià de Gispert: Finite-state-based and phrase-based statistical machine translation. Interspeech 2004 – ICSLP 8th International Conference on Spoken Language Processing, Jeju Island, Korea, October 4-8, 2004; pp.37-40; abstract [PDF, 73KB]

(2004) Adrià de Gispert & José B. Mariño: TALP: Xgram-based spoken language translation system. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2004], September 30 – October 1, 2004, Kyoto, Japan; pp. 85-90 [PDF, 175KB]

(2004) Hajime Tsukada & Masaaki Nagata: Efficient decoding for statistical machine translation with a fully expanded WFST model. EMNLP-2004: Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, 25-26 July 2004, Barcelona, Spain; 7pp. [PDF, 75KB]

(2003) Elsa Cubel, Jorge González, Antonio Lagarda, Francisco Casacuberta, Alfons Juan & Enrique Vidal: Adapting finite-state translation to the TransType2 project. Controlled language translation, EAMT-CLAW-03, Dublin City University, 15-17 May 2003 [PDF, 71KB]

(2003) Shankar Kumar & William Byrne: A weighted finite state transducer implementation of the alignment template model for statistical machine translation. HLT-NAACL 2003: conference combining Human Language Technology conference series and the North American Chapter of the Association for Computational Linguistics conference series, May 27 – June 1, 2003, Edmonton, Canada; pp.63-70 [PDF, 141KB]

(2003) Shuly Wintner: Finite state technology and its applications to machine translation.  Tutorial at MT Summit IX, New Orleans, USA, September 2003 [PDF of presentation, 952KB]

(2002) Kemal Altintas & Ilyas Cicekli: A machine translation system between a pair of closely related languages. Proceedings of the 17th International Symposium on Computer and Information Sciences (ISCIS 2002), Orlando, Florida, 2002 (CRC Press); pp.192-196.  [PDF, 204KB]

(2002) Rafael C.Carrasco & Mikel L.Forcada: Incremental construction and maintenance of minimal finite-state automata. Computational Linguistics 28 (2), pp. 207-216 [PDF, 103KB]

(2002) Paul C. Davis: Stone soup translation: the linked automata model. PhD thesis, Ohio State University, 2002. 323pp. [PDF, 1078KB]

(2002) Mikel L. Forcada: Using multilingual content on the web to build fast finite-state direct translation systems. TMI-2002 conference, Keihanna, Japan, March 13-17, 2002: Workshop. [PDF, 155KB]

(2002) Adrià de Gispert & José B.Mariño: Using x-grams for speech-to-speech translation. ICSLP 2002, Interspeech 2002: 7th International Conference on Spoken Language Processing, September 16-20, 2002, Denver, Colorado, USA; pp.1885-1888; abstract [PDF, 65KB]