TY - JOUR
T1 - LLMs-based machine translation for E-commerce
AU - Gao, Dehong
AU - Chen, Kaidi
AU - Chen, Ben
AU - Dai, Huangyu
AU - Jin, Linbo
AU - Jiang, Wen
AU - Ning, Wei
AU - Yu, Shanqing
AU - Xuan, Qi
AU - Cai, Xiaoyan
AU - Yang, Libin
AU - Wang, Zhen
N1 - Publisher Copyright:
© 2024
PY - 2024/12/15
Y1 - 2024/12/15
N2 - Large language models(LLMs) have shown promising performance for various downstream tasks, especially machine translation. However, LLMs and Specialized Translation Models (STMs) are designed to handle general translation needs, they are not well-suited for domains with specialized terms and writing styles, such as e-commerce, legal, and medicine. In the e-commerce domain, the text often contains many domain-specific terms and keyword-stacked structures, leading to poor translation quality with existing NMT methods. To tackle these problems, we have collected two resources specifically for the e-commerce domain, including aligned Chinese-English bilingual terms and parallel corpus from real e-commerce scenarios for model fine-tuning. We propose an LLMs-based E-commerce machine translation approach(LEMT) which includes LLMs utilization, e-commerce resources collection, and tokenizer optimization. We conduct two-stage fine-tuning and self-contrastive enhancement based on general LLMs to enable the model to learn translation features in the e-commerce domain. Through comprehensive evaluations on real e-commerce titles, our LEMT methodology demonstrates superior translation quality and robustness, outperforming leading NMT models such as NLLB, LLaMA, and even GPT-4.
AB - Large language models(LLMs) have shown promising performance for various downstream tasks, especially machine translation. However, LLMs and Specialized Translation Models (STMs) are designed to handle general translation needs, they are not well-suited for domains with specialized terms and writing styles, such as e-commerce, legal, and medicine. In the e-commerce domain, the text often contains many domain-specific terms and keyword-stacked structures, leading to poor translation quality with existing NMT methods. To tackle these problems, we have collected two resources specifically for the e-commerce domain, including aligned Chinese-English bilingual terms and parallel corpus from real e-commerce scenarios for model fine-tuning. We propose an LLMs-based E-commerce machine translation approach(LEMT) which includes LLMs utilization, e-commerce resources collection, and tokenizer optimization. We conduct two-stage fine-tuning and self-contrastive enhancement based on general LLMs to enable the model to learn translation features in the e-commerce domain. Through comprehensive evaluations on real e-commerce titles, our LEMT methodology demonstrates superior translation quality and robustness, outperforming leading NMT models such as NLLB, LLaMA, and even GPT-4.
KW - E-commerce domain
KW - Fine-tuning
KW - Large language models
KW - Neural machine translation
KW - Self-contrastive
UR - http://www.scopus.com/inward/record.url?scp=85201777124&partnerID=8YFLogxK
U2 - 10.1016/j.eswa.2024.125087
DO - 10.1016/j.eswa.2024.125087
M3 - 文章
AN - SCOPUS:85201777124
SN - 0957-4174
VL - 258
JO - Expert Systems with Applications
JF - Expert Systems with Applications
M1 - 125087
ER -