ADELIE: Aligning Large Language Models on Information Extraction

Yunjia Qi, Hao Peng, Xiaozhi Wang, Bin Xu, Lei Hou, Juanzi Li

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Large language models (LLMs) usually fall short on information extraction (IE) tasks and struggle to follow the complex instructions of IE tasks. This primarily arises from LLMs not being aligned with humans, as mainstream alignment datasets typically do not include IE data. In this paper, we introduce ADELIE (Aligning large language moDELs on Information Extraction), an aligned LLM that effectively solves various IE tasks, including closed IE, open IE, and on-demand IE. We first collect and construct a high-quality alignment corpus IEInstruct for IE. Then we train ADELIESFT using instruction tuning on IEInstruct. We further train ADELIESFT with direct preference optimization (DPO) objective, resulting in ADELIEDPO. Extensive experiments on various held-out IE datasets demonstrate that our models (ADELIESFT and ADELIEDPO) achieve state-of-the-art (SoTA) performance among open-source models. We further explore the general capabilities of ADELIE, and experimental results reveal that their general capabilities do not exhibit a noticeable decline. We have released the code, data, and models to facilitate further research.

源语言英语
主期刊名EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference
编辑Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen
出版商Association for Computational Linguistics (ACL)
7371-7387
页数17
ISBN(电子版)9798891761643
出版状态已出版 - 2024
已对外发布
活动2024 Conference on Empirical Methods in Natural Language Processing, EMNLP 2024 - Hybrid, Miami, 美国
期限: 12 11月 202416 11月 2024

出版系列

姓名EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference

会议

会议2024 Conference on Empirical Methods in Natural Language Processing, EMNLP 2024
国家/地区美国
Hybrid, Miami
时期12/11/2416/11/24

指纹

探究 'ADELIE: Aligning Large Language Models on Information Extraction' 的科研主题。它们共同构成独一无二的指纹。

引用此