Curriculum Learning for Vision-and-Language Navigation

Jiwen Zhang, Zhongyu Wei, Jianqing Fan, Jiajie Peng

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-reviewed

12 Citations (Scopus)

Abstract

Vision-and-Language Navigation (VLN) is a task in which an agent navigates an embodied indoor environment by following human instructions. Previous works ignore the distribution of sample difficulty, and we argue that this potentially degrades agent performance. To tackle this issue, we propose a novel curriculum-based training paradigm for VLN tasks that balances human prior knowledge and the agent's learning progress on training samples. We develop the principle of curriculum design and re-arrange the benchmark Room-to-Room (R2R) dataset to make it suitable for curriculum training. Experiments show that our method is model-agnostic and can significantly improve the performance, generalizability, and training efficiency of current state-of-the-art navigation agents without increasing model complexity.

Original language: English
Title of host publication: Advances in Neural Information Processing Systems 34 - 35th Conference on Neural Information Processing Systems, NeurIPS 2021
Editors: Marc'Aurelio Ranzato, Alina Beygelzimer, Yann Dauphin, Percy S. Liang, Jenn Wortman Vaughan
Publisher: Neural Information Processing Systems Foundation
Pages: 13328-13339
Number of pages: 12
ISBN (electronic): 9781713845393
Publication status: Published - 2021
Externally published
Event: 35th Conference on Neural Information Processing Systems, NeurIPS 2021 - Virtual, Online
Duration: 6 Dec 2021 to 14 Dec 2021

Publication series

Name: Advances in Neural Information Processing Systems
Volume: 16
ISSN (print): 1049-5258

Conference

Conference: 35th Conference on Neural Information Processing Systems, NeurIPS 2021
Virtual, Online
Period: 6/12/21 to 14/12/21
