ACL-IJCNLP 2009
NEWS 2009
2009 Named Entities Workshop: Shared Task on Transliteration
Proceedings of the Workshop
7 August 2009
Table of Contents
Report of NEWS 2009 Machine Transliteration Shared Task
Haizhou Li, A Kumaran, Vladimir Pervouchine and Min Zhang......................................... 1
Whitepaper of NEWS 2009 Machine Transliteration Shared Task
Haizhou Li, A Kumaran, Min Zhang and Vladimir Pervouchine....................................... 19
Automata for Transliteration and Machine Translation
Kevin Knight..................................................................................................................... 27
DirecTL: a Language Independent Approach to Transliteration
Sittichai Jiampojamarn, Aditya Bhargava, Qing Dou, Kenneth Dwyer and Grzegorz Kondrak..... 28
Named Entity Transcription with Pair n-Gram Models
Martin Jansche and Richard Sproat.................................................................................... 32
Jong-Hoon Oh, Kiyotaka Uchimoto and Kentaro Torisawa............................................... 36
A Language-Independent Transliteration Schema Using Character Aligned Models at NEWS 2009
Praneeth Shishtla, Surya Ganesh V, Sethuramalingam Subramaniam and Vasudeva Varma..... 40
Experiences with English-Hindi, English-Tamil and English-Kannada Transliteration Tasks at NEWS 2009
Manoj Kumar Chinnakotla and
Testing and Performance Evaluation of Machine Transliteration System for Tamil Language
Kommaluri Vijayanand, Inampudi Ramesh Babu and Poonguzhali Sandiran.................... 48
Transliteration by Bidirectional Statistical Machine Translation
Andrew Finch and Eiichiro Sumita.................................................................................... 52
Transliteration of Name Entity via Improved Statistical Translation on Character Sequences
Yan Song, Chunyu Kit and Xiao Chen.............................................................................. 57
Learning Multi Character Alignment Rules and Classification of Training Data for Transliteration
Dipankar Bose and Sudeshna Sarkar................................................................................. 61
Fast Decoding and Easy Implementation: Transliteration as Sequential Labeling
Eiji Aramaki and Takeshi Abekawa................................................................................... 65
Colin Cherry and Hisami Suzuki....................................................................................... 69
Dong Yang, Paul Dixon, Yi-Cheng Pan, Tasuku Oonishi, Masanobu Nakamura and Sadaoki Furui..... 72
Oi Yee Kwong................................................................................................................... 76
English to Hindi Machine Transliteration System at NEWS 2009
Amitava Das, Asif Ekbal, Tapabrata Mondal and Sivaji Bandyopadhyay.......................... 80
Improving Transliteration Accuracy Using Word-Origin Detection and Lexicon Lookup
Mitesh Khapra and Pushpak Bhattacharyya....................................................................... 84
A Noisy Channel Model for Grapheme-based Machine Transliteration
Jia Yuxiang, Zhu Danqing and Yu Shiwen........................................................................ 88
Substring-based Transliteration with Conditional Random Fields
Sravana Reddy and Sonjia Waxmonsky............................................................................ 92
A Syllable-based Name Transliteration System
Xue Jiang, Le Sun and Dakun Zhang................................................................................ 96
Transliteration System Using Pair HMM with Weighted FSTs
Peter Nabende.................................................................................................................. 100
English-Hindi Transliteration Using Context-Informed PB-SMT: the DCU System for NEWS 2009
Rejwanul Haque, Sandipan Dandapat, Ankit Kumar Srivastava, Sudip Kumar Naskar and Way
A Hybrid Approach to English-Korean Name Transliteration
Gumwon Hong, Min-Jeong Kim, Do-Gil Lee and Hae-Chang Rim................................. 108
Language Independent Transliteration System Using Phrase-based SMT Approach on Substrings
Sara Noeman.................................................................................................................... 112
Combining MDL Transliteration Training with Discriminative Modeling
Dmitry Zelenko............................................................................................................... 116
ε-extension Hidden Markov Models and Weighted Transducers for Machine Transliteration
Balakrishnan Vardarajan and Delip Rao.......................................................................... 120
Modeling Machine Transliteration as a Phrase Based Statistical Machine Translation Problem
Taraka Rama and Karthik Gali......................................................................................... 124
Maximum n-Gram HMM-based Name Transliteration: Experiment in NEWS 2009 on English-Chinese Corpus
Yilu Zhou......................................................................................................................... 128
Name Transliteration with Bidirectional Perceptron Edit Models
Dayne Freitag and Zhiqiang Wang................................................................................... 132
Bridging Languages by SuperSense Entity Tagging
Davide Picca, Alfio Massimiliano Gliozzo and Simone Campora.................................... 136
Chinese-English Organization Name Translation Based on Correlative Expansion
Feiliang Ren, Muhua Zhu, Huizhen Wang and Jingbo Zhu.............................................. 143
Name Matching between Roman and Chinese Scripts: Machine Complements Human
Ken Samuel, Alan Rubenstein, Sherri Condon and Alex Yeh.......................................... 152
Analysis and Robust Extraction of Changing Named Entities
Masatoshi Tsuchiya, Shoko Endo and Seiichi Nakagawa................................................. 161
Tag Confidence Measure for Semi-Automatically Updating Named Entity Recognition
Kuniko Saito and Kenji Imamura..................................................................................... 168
A Hybrid Model for Urdu Hindi Transliteration
Abbas Malik, Laurent Besacier, Christian Boitet and Pushpak Bhattacharyya................. 177
Graphemic Approximation of Phonological Context for English-Chinese Transliteration
Oi Yee Kwong................................................................................................................. 186
Czech Named Entity Corpus and SVM-based Recognizer
Jana Kravalová and Zdeněk Žabokrtský.......................................................................... 194
Voted NER System using Appropriate Unlabeled Data
Asif Ekbal and Sivaji Bandyopadhyay............................................................................ 202