Statistical Machine TranslationCambridge University Press, 2010 - 433页 The dream of automatic language translation is now closer thanks to recent advances in the techniques that underpin statistical machine translation. This class-tested textbook from an active researcher in the field, provides a clear and careful introduction to the latest methods and explains how to build machine translation systems for any two languages. It introduces the subject's building blocks from linguistics and probability, then covers the major models for machine translation: word-based, phrase-based, and tree-based, as well as machine translation evaluation, language modeling, discriminative training and advanced methods to integrate linguistic annotation. The book also reports the latest research, presents the major outstanding challenges, and enables novices as well as experienced researchers to make novel contributions to this exciting area. Ideal for students at undergraduate and graduate level, or for anyone interested in the latest developments in machine translation. |
其他版本 - 查看全部
常见术语和短语
algorithm alignment points Annual Meeting ofthe Association for Computational back-off best translation bigrams BLEU score candidate translations Chapter chart entries Computational Linguistics confidence counts decoding define efficient English words Europarl Europarl corpus example extracted Figure find finite first foreign word function words grammar rules Haus human translators hypothesis IBM Model input sentence input words instance IWSLT Koehn language model lexical translation log-linear model machine learning machine translation systems mapping match Meeting ofthe Association methods morphological n-best list n-gram Natural Language Processing node nonterminal order n-gram output words parallel corpora parallel corpus parameter parse parse tree part-of-speech tags phrase pairs phrase-based models possible probability distribution problem Proceedings ofthe pruning reference translation reordering model sentence pair sequence span specific Spoken Language Translation stack statistical machine translation structure tion training data translation options translation probability transliteration tree verb word alignment word error rate Workshop on Spoken Workshop on Statistical