Improving adversarial neural machine translation for morphologically rich language

Chenggang Mi; Lei Xie; Yanning Zhang

doi:10.1109/TETCI.2019.2960546

Improving adversarial neural machine translation for morphologically rich language

Chenggang Mi, Lei Xie, Yanning Zhang

计算机学院

Northwestern Polytechnical University Xian

科研成果: 期刊稿件 › 文章 › 同行评审

15 引用（Scopus）

摘要

Generative adversarial networks (GAN) have great successes on natural language processing (NLP) and neural machine translation (NMT). However, the existing discriminator in GAN for NMT only combines two words as one query to train the translation models, which restrict the discriminator to be more meaningful and fail to apply rich monolingual information. Recent studies only consider one single reference translation during model training, this limit the GAN model to learn sufficient information about the representation of source sentence. These situations are even worse when languages are morphologically rich. In this article, an extended version of GAN model for neural machine translation is proposed to optimize the performance of morphologically rich language translation. In particular, we use the morphological word embedding instead of word embedding as input in GAN model to enrich the representation of words and overcome the data sparsity problem during model training. Moreover, multiple references are integrated into discriminator to make the model consider more context information and adapt to the diversity of different languages. Experimental results on German\leftrightarrowEnglish, French\leftrightarrowEnglish, Czech\leftrightarrowEnglish, Finnish\leftrightarrowEnglish, Turkish\leftrightarrowEnglish, Chinese\leftrightarrowEnglish, Finnish\leftrightarrowTurkish and Turkish\leftrightarrowCzech translation tasks demonstrate that our method achieves significant improvements over baseline systems.

源语言	英语
文章编号	9099374
页（从-至）	417-426
页数	10
期刊	IEEE Transactions on Emerging Topics in Computational Intelligence
卷	4
期	4
DOI	https://doi.org/10.1109/TETCI.2019.2960546
出版状态	已出版 - 8月 2020

访问文件

10.1109/TETCI.2019.2960546

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{57ef6f8388c44f1fbc3fa33fb28df160,

title = "Improving adversarial neural machine translation for morphologically rich language",

abstract = "Generative adversarial networks (GAN) have great successes on natural language processing (NLP) and neural machine translation (NMT). However, the existing discriminator in GAN for NMT only combines two words as one query to train the translation models, which restrict the discriminator to be more meaningful and fail to apply rich monolingual information. Recent studies only consider one single reference translation during model training, this limit the GAN model to learn sufficient information about the representation of source sentence. These situations are even worse when languages are morphologically rich. In this article, an extended version of GAN model for neural machine translation is proposed to optimize the performance of morphologically rich language translation. In particular, we use the morphological word embedding instead of word embedding as input in GAN model to enrich the representation of words and overcome the data sparsity problem during model training. Moreover, multiple references are integrated into discriminator to make the model consider more context information and adapt to the diversity of different languages. Experimental results on German\leftrightarrowEnglish, French\leftrightarrowEnglish, Czech\leftrightarrowEnglish, Finnish\leftrightarrowEnglish, Turkish\leftrightarrowEnglish, Chinese\leftrightarrowEnglish, Finnish\leftrightarrowTurkish and Turkish\leftrightarrowCzech translation tasks demonstrate that our method achieves significant improvements over baseline systems.",

keywords = "adversarial training, morp-hologically rich language, morphological word embedding, multiple references, Neural machine translation (NMT)",

author = "Chenggang Mi and Lei Xie and Yanning Zhang",

note = "Publisher Copyright: {\textcopyright} 2017 IEEE.",

year = "2020",

month = aug,

doi = "10.1109/TETCI.2019.2960546",

language = "英语",

volume = "4",

pages = "417--426",

journal = "IEEE Transactions on Emerging Topics in Computational Intelligence",

issn = "2471-285X",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "4",

}

TY - JOUR

T1 - Improving adversarial neural machine translation for morphologically rich language

AU - Mi, Chenggang

AU - Xie, Lei

AU - Zhang, Yanning

PY - 2020/8

Y1 - 2020/8

N2 - Generative adversarial networks (GAN) have great successes on natural language processing (NLP) and neural machine translation (NMT). However, the existing discriminator in GAN for NMT only combines two words as one query to train the translation models, which restrict the discriminator to be more meaningful and fail to apply rich monolingual information. Recent studies only consider one single reference translation during model training, this limit the GAN model to learn sufficient information about the representation of source sentence. These situations are even worse when languages are morphologically rich. In this article, an extended version of GAN model for neural machine translation is proposed to optimize the performance of morphologically rich language translation. In particular, we use the morphological word embedding instead of word embedding as input in GAN model to enrich the representation of words and overcome the data sparsity problem during model training. Moreover, multiple references are integrated into discriminator to make the model consider more context information and adapt to the diversity of different languages. Experimental results on German\leftrightarrowEnglish, French\leftrightarrowEnglish, Czech\leftrightarrowEnglish, Finnish\leftrightarrowEnglish, Turkish\leftrightarrowEnglish, Chinese\leftrightarrowEnglish, Finnish\leftrightarrowTurkish and Turkish\leftrightarrowCzech translation tasks demonstrate that our method achieves significant improvements over baseline systems.

AB - Generative adversarial networks (GAN) have great successes on natural language processing (NLP) and neural machine translation (NMT). However, the existing discriminator in GAN for NMT only combines two words as one query to train the translation models, which restrict the discriminator to be more meaningful and fail to apply rich monolingual information. Recent studies only consider one single reference translation during model training, this limit the GAN model to learn sufficient information about the representation of source sentence. These situations are even worse when languages are morphologically rich. In this article, an extended version of GAN model for neural machine translation is proposed to optimize the performance of morphologically rich language translation. In particular, we use the morphological word embedding instead of word embedding as input in GAN model to enrich the representation of words and overcome the data sparsity problem during model training. Moreover, multiple references are integrated into discriminator to make the model consider more context information and adapt to the diversity of different languages. Experimental results on German\leftrightarrowEnglish, French\leftrightarrowEnglish, Czech\leftrightarrowEnglish, Finnish\leftrightarrowEnglish, Turkish\leftrightarrowEnglish, Chinese\leftrightarrowEnglish, Finnish\leftrightarrowTurkish and Turkish\leftrightarrowCzech translation tasks demonstrate that our method achieves significant improvements over baseline systems.

KW - adversarial training

KW - morp-hologically rich language

KW - morphological word embedding

KW - multiple references

KW - Neural machine translation (NMT)

UR - http://www.scopus.com/inward/record.url?scp=85085750287&partnerID=8YFLogxK

U2 - 10.1109/TETCI.2019.2960546

DO - 10.1109/TETCI.2019.2960546

M3 - 文章

AN - SCOPUS:85085750287

SN - 2471-285X

VL - 4

SP - 417

EP - 426

JO - IEEE Transactions on Emerging Topics in Computational Intelligence

JF - IEEE Transactions on Emerging Topics in Computational Intelligence

IS - 4

M1 - 9099374

ER -

Improving adversarial neural machine translation for morphologically rich language

摘要

访问文件

其它文件与链接

指纹

引用此