Curriculum Learning for Vision-and-Language Navigation

Jiwen Zhang; Zhongyu Wei; Jianqing Fan; Jiajie Peng

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

12 citations (Scopus)

Abstract

Vision-and-Language Navigation (VLN) is a task where an agent navigates in an embodied indoor environment under human instructions. Previous works ignore the distribution of sample difficulty, and we argue that this potentially degrades agent performance. To tackle this issue, we propose a novel curriculum-based training paradigm for VLN tasks that can balance human prior knowledge and agent learning progress about training samples. We develop the principle of curriculum design and re-arrange the benchmark Room-to-Room (R2R) dataset to make it suitable for curriculum training. Experiments show that our method is model-agnostic and can significantly improve the performance, the generalizability, and the training efficiency of current state-of-the-art navigation agents without increasing model complexity.

Original language	English
Title of host publication	Advances in Neural Information Processing Systems 34 - 35th Conference on Neural Information Processing Systems, NeurIPS 2021
Editors	Marc'Aurelio Ranzato, Alina Beygelzimer, Yann Dauphin, Percy S. Liang, Jenn Wortman Vaughan
Publisher	Neural information processing systems foundation
Pages	13328-13339
Number of pages	12
ISBN (Electronic)	9781713845393
Publication status	Published - 2021
Externally published	Yes
Event	35th Conference on Neural Information Processing Systems, NeurIPS 2021 - Virtual, Online Duration: 6 Dec 2021 → 14 Dec 2021

Publication series

Name	Advances in Neural Information Processing Systems
Volume	16
ISSN (Print)	1049-5258

Conference

Conference	35th Conference on Neural Information Processing Systems, NeurIPS 2021
City	Virtual, Online
Period	6/12/21 → 14/12/21

Cite this

Zhang, J., Wei, Z., Fan, J., & Peng, J. (2021). Curriculum Learning for Vision-and-Language Navigation. In MA. Ranzato, A. Beygelzimer, Y. Dauphin, P. S. Liang, & J. Wortman Vaughan (Eds.), Advances in Neural Information Processing Systems 34 - 35th Conference on Neural Information Processing Systems, NeurIPS 2021 (pp. 13328-13339). (Advances in Neural Information Processing Systems; Vol. 16). Neural information processing systems foundation.

Zhang, Jiwen ; Wei, Zhongyu ; Fan, Jianqing et al. / Curriculum Learning for Vision-and-Language Navigation. Advances in Neural Information Processing Systems 34 - 35th Conference on Neural Information Processing Systems, NeurIPS 2021. editor / Marc'Aurelio Ranzato ; Alina Beygelzimer ; Yann Dauphin ; Percy S. Liang ; Jenn Wortman Vaughan. Neural information processing systems foundation, 2021. pp. 13328-13339 (Advances in Neural Information Processing Systems).

@inproceedings{610acbca6f114be6a904bab9422863a5,

title = "Curriculum Learning for Vision-and-Language Navigation",

abstract = "Vision-and-Language Navigation (VLN) is a task where an agent navigates in an embodied indoor environment under human instructions. Previous works ignore the distribution of sample difficulty, and we argue that this potentially degrades agent performance. To tackle this issue, we propose a novel curriculum-based training paradigm for VLN tasks that can balance human prior knowledge and agent learning progress about training samples. We develop the principle of curriculum design and re-arrange the benchmark Room-to-Room (R2R) dataset to make it suitable for curriculum training. Experiments show that our method is model-agnostic and can significantly improve the performance, the generalizability, and the training efficiency of current state-of-the-art navigation agents without increasing model complexity.",

author = "Jiwen Zhang and Zhongyu Wei and Jianqing Fan and Jiajie Peng",

note = "Publisher Copyright: {\textcopyright} 2021 Neural information processing systems foundation. All rights reserved.; 35th Conference on Neural Information Processing Systems, NeurIPS 2021 ; Conference date: 06-12-2021 Through 14-12-2021",

year = "2021",

language = "English",

series = "Advances in Neural Information Processing Systems",

publisher = "Neural information processing systems foundation",

pages = "13328--13339",

editor = "Marc'Aurelio Ranzato and Alina Beygelzimer and Yann Dauphin and Liang, {Percy S.} and {Wortman Vaughan}, Jenn",

booktitle = "Advances in Neural Information Processing Systems 34 - 35th Conference on Neural Information Processing Systems, NeurIPS 2021",

}

Zhang, J, Wei, Z, Fan, J & Peng, J 2021, Curriculum Learning for Vision-and-Language Navigation. in MA Ranzato, A Beygelzimer, Y Dauphin, PS Liang & J Wortman Vaughan (eds), Advances in Neural Information Processing Systems 34 - 35th Conference on Neural Information Processing Systems, NeurIPS 2021. Advances in Neural Information Processing Systems, vol. 16, Neural information processing systems foundation, pp. 13328-13339, 35th Conference on Neural Information Processing Systems, NeurIPS 2021, Virtual, Online, 6/12/21.

Curriculum Learning for Vision-and-Language Navigation. / Zhang, Jiwen; Wei, Zhongyu; Fan, Jianqing et al.
Advances in Neural Information Processing Systems 34 - 35th Conference on Neural Information Processing Systems, NeurIPS 2021. editor / Marc'Aurelio Ranzato; Alina Beygelzimer; Yann Dauphin; Percy S. Liang; Jenn Wortman Vaughan. Neural information processing systems foundation, 2021. pp. 13328-13339 (Advances in Neural Information Processing Systems; Vol. 16).

TY - GEN

T1 - Curriculum Learning for Vision-and-Language Navigation

AU - Zhang, Jiwen

AU - Wei, Zhongyu

AU - Fan, Jianqing

AU - Peng, Jiajie

PY - 2021

Y1 - 2021

AB - Vision-and-Language Navigation (VLN) is a task where an agent navigates in an embodied indoor environment under human instructions. Previous works ignore the distribution of sample difficulty, and we argue that this potentially degrades agent performance. To tackle this issue, we propose a novel curriculum-based training paradigm for VLN tasks that can balance human prior knowledge and agent learning progress about training samples. We develop the principle of curriculum design and re-arrange the benchmark Room-to-Room (R2R) dataset to make it suitable for curriculum training. Experiments show that our method is model-agnostic and can significantly improve the performance, the generalizability, and the training efficiency of current state-of-the-art navigation agents without increasing model complexity.

UR - http://www.scopus.com/inward/record.url?scp=85127926395&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:85127926395

T3 - Advances in Neural Information Processing Systems

SP - 13328

EP - 13339

BT - Advances in Neural Information Processing Systems 34 - 35th Conference on Neural Information Processing Systems, NeurIPS 2021

A2 - Ranzato, Marc'Aurelio

A2 - Beygelzimer, Alina

A2 - Dauphin, Yann

A2 - Liang, Percy S.

A2 - Wortman Vaughan, Jenn

PB - Neural information processing systems foundation

T2 - 35th Conference on Neural Information Processing Systems, NeurIPS 2021

Y2 - 6 December 2021 through 14 December 2021

ER -

Zhang J, Wei Z, Fan J, Peng J. Curriculum Learning for Vision-and-Language Navigation. In Ranzato MA, Beygelzimer A, Dauphin Y, Liang PS, Wortman Vaughan J, editors, Advances in Neural Information Processing Systems 34 - 35th Conference on Neural Information Processing Systems, NeurIPS 2021. Neural information processing systems foundation. 2021. p. 13328-13339. (Advances in Neural Information Processing Systems).
