LLMs-based machine translation for E-commerce

Dehong Gao; Kaidi Chen; Ben Chen; Huangyu Dai; Linbo Jin; Wen Jiang; Wei Ning; Shanqing Yu; Qi Xuan; Xiaoyan Cai; Libin Yang; Zhen Wang

doi:10.1016/j.eswa.2024.125087

School of Cybersecurity

Research output: Contribution to journal › Article › peer-review

12 Scopus citations

Abstract

Large language models (LLMs) have shown promising performance on various downstream tasks, especially machine translation. However, because LLMs and Specialized Translation Models (STMs) are designed to handle general translation needs, they are not well suited to domains with specialized terms and writing styles, such as e-commerce, law, and medicine. In the e-commerce domain, text often contains many domain-specific terms and keyword-stacked structures, leading to poor translation quality with existing NMT methods. To tackle these problems, we have collected two resources specifically for the e-commerce domain: aligned Chinese-English bilingual terms and a parallel corpus from real e-commerce scenarios for model fine-tuning. We propose an LLMs-based E-commerce machine translation approach (LEMT), which includes LLMs utilization, e-commerce resources collection, and tokenizer optimization. We conduct two-stage fine-tuning and self-contrastive enhancement based on general LLMs to enable the model to learn translation features in the e-commerce domain. Through comprehensive evaluations on real e-commerce titles, our LEMT methodology demonstrates superior translation quality and robustness, outperforming leading NMT models such as NLLB, LLaMA, and even GPT-4.

Original language	English
Article number	125087
Journal	Expert Systems with Applications
Volume	258
DOIs	https://doi.org/10.1016/j.eswa.2024.125087
State	Published - 15 Dec 2024

Keywords

E-commerce domain
Fine-tuning
Large language models
Neural machine translation
Self-contrastive

Cite this

@article{0beedf31818747869edc755df6ccdc3c,

title = "LLMs-based machine translation for E-commerce",

abstract = "Large language models (LLMs) have shown promising performance on various downstream tasks, especially machine translation. However, because LLMs and Specialized Translation Models (STMs) are designed to handle general translation needs, they are not well suited to domains with specialized terms and writing styles, such as e-commerce, law, and medicine. In the e-commerce domain, text often contains many domain-specific terms and keyword-stacked structures, leading to poor translation quality with existing NMT methods. To tackle these problems, we have collected two resources specifically for the e-commerce domain: aligned Chinese-English bilingual terms and a parallel corpus from real e-commerce scenarios for model fine-tuning. We propose an LLMs-based E-commerce machine translation approach (LEMT), which includes LLMs utilization, e-commerce resources collection, and tokenizer optimization. We conduct two-stage fine-tuning and self-contrastive enhancement based on general LLMs to enable the model to learn translation features in the e-commerce domain. Through comprehensive evaluations on real e-commerce titles, our LEMT methodology demonstrates superior translation quality and robustness, outperforming leading NMT models such as NLLB, LLaMA, and even GPT-4.",

keywords = "E-commerce domain, Fine-tuning, Large language models, Neural machine translation, Self-contrastive",

author = "Dehong Gao and Kaidi Chen and Ben Chen and Huangyu Dai and Linbo Jin and Wen Jiang and Wei Ning and Shanqing Yu and Qi Xuan and Xiaoyan Cai and Libin Yang and Zhen Wang",

note = "Publisher Copyright: {\textcopyright} 2024",

year = "2024",

month = dec,

day = "15",

doi = "10.1016/j.eswa.2024.125087",

language = "English",

volume = "258",

journal = "Expert Systems with Applications",

issn = "0957-4174",

publisher = "Elsevier Ltd",

}

TY - JOUR

T1 - LLMs-based machine translation for E-commerce

AU - Gao, Dehong

AU - Chen, Kaidi

AU - Chen, Ben

AU - Dai, Huangyu

AU - Jin, Linbo

AU - Jiang, Wen

AU - Ning, Wei

AU - Yu, Shanqing

AU - Xuan, Qi

AU - Cai, Xiaoyan

AU - Yang, Libin

AU - Wang, Zhen

PY - 2024/12/15

Y1 - 2024/12/15

N2 - Large language models (LLMs) have shown promising performance on various downstream tasks, especially machine translation. However, because LLMs and Specialized Translation Models (STMs) are designed to handle general translation needs, they are not well suited to domains with specialized terms and writing styles, such as e-commerce, law, and medicine. In the e-commerce domain, text often contains many domain-specific terms and keyword-stacked structures, leading to poor translation quality with existing NMT methods. To tackle these problems, we have collected two resources specifically for the e-commerce domain: aligned Chinese-English bilingual terms and a parallel corpus from real e-commerce scenarios for model fine-tuning. We propose an LLMs-based E-commerce machine translation approach (LEMT), which includes LLMs utilization, e-commerce resources collection, and tokenizer optimization. We conduct two-stage fine-tuning and self-contrastive enhancement based on general LLMs to enable the model to learn translation features in the e-commerce domain. Through comprehensive evaluations on real e-commerce titles, our LEMT methodology demonstrates superior translation quality and robustness, outperforming leading NMT models such as NLLB, LLaMA, and even GPT-4.

AB - Large language models (LLMs) have shown promising performance on various downstream tasks, especially machine translation. However, because LLMs and Specialized Translation Models (STMs) are designed to handle general translation needs, they are not well suited to domains with specialized terms and writing styles, such as e-commerce, law, and medicine. In the e-commerce domain, text often contains many domain-specific terms and keyword-stacked structures, leading to poor translation quality with existing NMT methods. To tackle these problems, we have collected two resources specifically for the e-commerce domain: aligned Chinese-English bilingual terms and a parallel corpus from real e-commerce scenarios for model fine-tuning. We propose an LLMs-based E-commerce machine translation approach (LEMT), which includes LLMs utilization, e-commerce resources collection, and tokenizer optimization. We conduct two-stage fine-tuning and self-contrastive enhancement based on general LLMs to enable the model to learn translation features in the e-commerce domain. Through comprehensive evaluations on real e-commerce titles, our LEMT methodology demonstrates superior translation quality and robustness, outperforming leading NMT models such as NLLB, LLaMA, and even GPT-4.

KW - E-commerce domain

KW - Fine-tuning

KW - Large language models

KW - Neural machine translation

KW - Self-contrastive

UR - http://www.scopus.com/inward/record.url?scp=85201777124&partnerID=8YFLogxK

U2 - 10.1016/j.eswa.2024.125087

DO - 10.1016/j.eswa.2024.125087

M3 - Article

AN - SCOPUS:85201777124

SN - 0957-4174

VL - 258

JO - Expert Systems with Applications

JF - Expert Systems with Applications

M1 - 125087

ER -
