TY - JOUR
T1 - Improving adversarial neural machine translation for morphologically rich language
AU - Mi, Chenggang
AU - Xie, Lei
AU - Zhang, Yanning
N1 - Publisher Copyright:
© 2017 IEEE.
PY - 2020/8
Y1 - 2020/8
N2 - Generative adversarial networks (GAN) have great successes on natural language processing (NLP) and neural machine translation (NMT). However, the existing discriminator in GAN for NMT only combines two words as one query to train the translation models, which restrict the discriminator to be more meaningful and fail to apply rich monolingual information. Recent studies only consider one single reference translation during model training, this limit the GAN model to learn sufficient information about the representation of source sentence. These situations are even worse when languages are morphologically rich. In this article, an extended version of GAN model for neural machine translation is proposed to optimize the performance of morphologically rich language translation. In particular, we use the morphological word embedding instead of word embedding as input in GAN model to enrich the representation of words and overcome the data sparsity problem during model training. Moreover, multiple references are integrated into discriminator to make the model consider more context information and adapt to the diversity of different languages. Experimental results on German\leftrightarrowEnglish, French\leftrightarrowEnglish, Czech\leftrightarrowEnglish, Finnish\leftrightarrowEnglish, Turkish\leftrightarrowEnglish, Chinese\leftrightarrowEnglish, Finnish\leftrightarrowTurkish and Turkish\leftrightarrowCzech translation tasks demonstrate that our method achieves significant improvements over baseline systems.
AB - Generative adversarial networks (GAN) have great successes on natural language processing (NLP) and neural machine translation (NMT). However, the existing discriminator in GAN for NMT only combines two words as one query to train the translation models, which restrict the discriminator to be more meaningful and fail to apply rich monolingual information. Recent studies only consider one single reference translation during model training, this limit the GAN model to learn sufficient information about the representation of source sentence. These situations are even worse when languages are morphologically rich. In this article, an extended version of GAN model for neural machine translation is proposed to optimize the performance of morphologically rich language translation. In particular, we use the morphological word embedding instead of word embedding as input in GAN model to enrich the representation of words and overcome the data sparsity problem during model training. Moreover, multiple references are integrated into discriminator to make the model consider more context information and adapt to the diversity of different languages. Experimental results on German\leftrightarrowEnglish, French\leftrightarrowEnglish, Czech\leftrightarrowEnglish, Finnish\leftrightarrowEnglish, Turkish\leftrightarrowEnglish, Chinese\leftrightarrowEnglish, Finnish\leftrightarrowTurkish and Turkish\leftrightarrowCzech translation tasks demonstrate that our method achieves significant improvements over baseline systems.
KW - adversarial training
KW - morp-hologically rich language
KW - morphological word embedding
KW - multiple references
KW - Neural machine translation (NMT)
UR - http://www.scopus.com/inward/record.url?scp=85085750287&partnerID=8YFLogxK
U2 - 10.1109/TETCI.2019.2960546
DO - 10.1109/TETCI.2019.2960546
M3 - 文章
AN - SCOPUS:85085750287
SN - 2471-285X
VL - 4
SP - 417
EP - 426
JO - IEEE Transactions on Emerging Topics in Computational Intelligence
JF - IEEE Transactions on Emerging Topics in Computational Intelligence
IS - 4
M1 - 9099374
ER -