Improving adversarial neural machine translation for morphologically rich language

Chenggang Mi, Lei Xie, Yanning Zhang

科研成果: 期刊稿件文章同行评审

15 引用 (Scopus)

摘要

Generative adversarial networks (GAN) have great successes on natural language processing (NLP) and neural machine translation (NMT). However, the existing discriminator in GAN for NMT only combines two words as one query to train the translation models, which restrict the discriminator to be more meaningful and fail to apply rich monolingual information. Recent studies only consider one single reference translation during model training, this limit the GAN model to learn sufficient information about the representation of source sentence. These situations are even worse when languages are morphologically rich. In this article, an extended version of GAN model for neural machine translation is proposed to optimize the performance of morphologically rich language translation. In particular, we use the morphological word embedding instead of word embedding as input in GAN model to enrich the representation of words and overcome the data sparsity problem during model training. Moreover, multiple references are integrated into discriminator to make the model consider more context information and adapt to the diversity of different languages. Experimental results on German\leftrightarrowEnglish, French\leftrightarrowEnglish, Czech\leftrightarrowEnglish, Finnish\leftrightarrowEnglish, Turkish\leftrightarrowEnglish, Chinese\leftrightarrowEnglish, Finnish\leftrightarrowTurkish and Turkish\leftrightarrowCzech translation tasks demonstrate that our method achieves significant improvements over baseline systems.

源语言英语
文章编号9099374
页(从-至)417-426
页数10
期刊IEEE Transactions on Emerging Topics in Computational Intelligence
4
4
DOI
出版状态已出版 - 8月 2020

指纹

探究 'Improving adversarial neural machine translation for morphologically rich language' 的科研主题。它们共同构成独一无二的指纹。

引用此