A Robust Dual-debiasing VQA Model based on Counterfactual Causal Effect

Lingyun Song; Chengkun Yang; Xuanyu Li; Xuequn Shang

doi:10.18653/v1/2024.findings-emnlp.245

School of Computer Science

Northwestern Polytechnical University Xian

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Traditional VQA models are inherently vulnerable to language bias, resulting in a significant performance drop when encountering out-of-distribution datasets. The conventional VQA models suffer from language bias that indicates a spurious correlation between textual questions and answers. Given the outstanding effectiveness of counterfactual causal inference in eliminating bias, we propose a model-agnostic dual-debiasing framework based on Counterfactual Causal Effect (DCCE), which explicitly models two types of language bias (i.e., shortcut and distribution bias) by separate branches under the counterfactual inference framework. The effects of both types of bias on answer prediction can be effectively mitigated by subtracting direct effect of textual questions on answers from total effect of visual questions on answers. Experimental results demonstrate that our proposed DCCE framework significantly reduces language bias and achieves state-of-the-art performance on the benchmark datasets without requiring additional augmented data. Our code is available in https://github.com/sxycyck/dcce.
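
The subtraction described in the abstract can be illustrated with a toy sketch. This is a minimal illustration under stated assumptions, not the authors' implementation (their repository linked above is the reference for that). The function names, the additive fusion of branch logits, and the use of a zero vector as the counterfactual input are all hypothetical choices made here for illustration only.

```python
# Toy illustration of "total effect minus direct (language-only) effect".
# Assumptions (not taken from the paper): branch logits are fused by simple
# addition, and the counterfactual replaces the vision+question branch with
# a constant zero vector.

def fuse(vq_logits, shortcut_logits, distribution_logits):
    """Hypothetical fusion: element-wise sum of the three branch logits."""
    return [v + s + d for v, s, d in zip(vq_logits, shortcut_logits, distribution_logits)]

def debiased_scores(vq_logits, shortcut_logits, distribution_logits):
    """Subtract the question-only (direct) effect from the total effect."""
    blank = [0.0] * len(vq_logits)  # counterfactual: no multimodal evidence
    total_effect = fuse(vq_logits, shortcut_logits, distribution_logits)
    direct_effect = fuse(blank, shortcut_logits, distribution_logits)
    return [t - d for t, d in zip(total_effect, direct_effect)]

def argmax(xs):
    """Index of the largest element."""
    return max(range(len(xs)), key=lambda i: xs[i])

answers = ["yes", "no"]
vq = [0.2, 1.0]            # image+question evidence favours "no"
shortcut = [2.0, 0.1]      # question-only shortcut favours "yes"
distribution = [1.0, 0.2]  # training answer distribution favours "yes"

biased = fuse(vq, shortcut, distribution)
debiased = debiased_scores(vq, shortcut, distribution)
biased_answer = answers[argmax(biased)]
debiased_answer = answers[argmax(debiased)]
```

In this toy case the two language-side branches push the fused prediction toward "yes", while the debiased scores follow the multimodal evidence and select "no". With purely additive fusion the subtraction simply recovers the multimodal logits; a nonlinear fusion would make the difference less trivial.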

Original language	English
Title of host publication	EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Findings of EMNLP 2024
Editors	Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen
Publisher	Association for Computational Linguistics (ACL)
Pages	4242-4252
Number of pages	11
ISBN (Electronic)	9798891761681
DOIs	https://doi.org/10.18653/v1/2024.findings-emnlp.245
State	Published - 2024
Event	2024 Findings of the Association for Computational Linguistics, EMNLP 2024 - Hybrid, Miami, United States. Duration: 12 Nov 2024 → 16 Nov 2024

Publication series

Name	EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Findings of EMNLP 2024

Conference

Conference	2024 Findings of the Association for Computational Linguistics, EMNLP 2024
Country/Territory	United States
City	Hybrid, Miami
Period	12 Nov 2024 → 16 Nov 2024

Access to Document

10.18653/v1/2024.findings-emnlp.245

Cite this

Song, L., Yang, C., Li, X., & Shang, X. (2024). A Robust Dual-debiasing VQA Model based on Counterfactual Causal Effect. In Y. Al-Onaizan, M. Bansal, & Y.-N. Chen (Eds.), EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Findings of EMNLP 2024 (pp. 4242-4252). (EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Findings of EMNLP 2024). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2024.findings-emnlp.245

Song, Lingyun ; Yang, Chengkun ; Li, Xuanyu et al. / A Robust Dual-debiasing VQA Model based on Counterfactual Causal Effect. EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Findings of EMNLP 2024. editor / Yaser Al-Onaizan ; Mohit Bansal ; Yun-Nung Chen. Association for Computational Linguistics (ACL), 2024. pp. 4242-4252 (EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Findings of EMNLP 2024).

@inproceedings{5bd5825ecd474280b26687af8142ba64,

title = "A Robust Dual-debiasing VQA Model based on Counterfactual Causal Effect",

abstract = "Traditional VQA models are inherently vulnerable to language bias, resulting in a significant performance drop when encountering out-of-distribution datasets. The conventional VQA models suffer from language bias that indicates a spurious correlation between textual questions and answers. Given the outstanding effectiveness of counterfactual causal inference in eliminating bias, we propose a model-agnostic dual-debiasing framework based on Counterfactual Causal Effect (DCCE), which explicitly models two types of language bias (i.e., shortcut and distribution bias) by separate branches under the counterfactual inference framework. The effects of both types of bias on answer prediction can be effectively mitigated by subtracting direct effect of textual questions on answers from total effect of visual questions on answers. Experimental results demonstrate that our proposed DCCE framework significantly reduces language bias and achieves state-of-the-art performance on the benchmark datasets without requiring additional augmented data. Our code is available in https://github.com/sxycyck/dcce.",

author = "Lingyun Song and Chengkun Yang and Xuanyu Li and Xuequn Shang",

note = "Publisher Copyright: {\textcopyright} 2024 Association for Computational Linguistics.; 2024 Findings of the Association for Computational Linguistics, EMNLP 2024 ; Conference date: 12-11-2024 Through 16-11-2024",

year = "2024",

doi = "10.18653/v1/2024.findings-emnlp.245",

language = "English",

series = "EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Findings of EMNLP 2024",

publisher = "Association for Computational Linguistics (ACL)",

pages = "4242--4252",

editor = "Yaser Al-Onaizan and Mohit Bansal and Yun-Nung Chen",

booktitle = "EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Findings of EMNLP 2024",

}

Song, L, Yang, C, Li, X & Shang, X 2024, A Robust Dual-debiasing VQA Model based on Counterfactual Causal Effect. in Y Al-Onaizan, M Bansal & Y-N Chen (eds), EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Findings of EMNLP 2024. EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Findings of EMNLP 2024, Association for Computational Linguistics (ACL), pp. 4242-4252, 2024 Findings of the Association for Computational Linguistics, EMNLP 2024, Hybrid, Miami, United States, 12/11/24. https://doi.org/10.18653/v1/2024.findings-emnlp.245

A Robust Dual-debiasing VQA Model based on Counterfactual Causal Effect. / Song, Lingyun; Yang, Chengkun; Li, Xuanyu et al.
EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Findings of EMNLP 2024. ed. / Yaser Al-Onaizan; Mohit Bansal; Yun-Nung Chen. Association for Computational Linguistics (ACL), 2024. p. 4242-4252 (EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Findings of EMNLP 2024).

TY - GEN

T1 - A Robust Dual-debiasing VQA Model based on Counterfactual Causal Effect

AU - Song, Lingyun

AU - Yang, Chengkun

AU - Li, Xuanyu

AU - Shang, Xuequn

PY - 2024

Y1 - 2024

AB - Traditional VQA models are inherently vulnerable to language bias, resulting in a significant performance drop when encountering out-of-distribution datasets. The conventional VQA models suffer from language bias that indicates a spurious correlation between textual questions and answers. Given the outstanding effectiveness of counterfactual causal inference in eliminating bias, we propose a model-agnostic dual-debiasing framework based on Counterfactual Causal Effect (DCCE), which explicitly models two types of language bias (i.e., shortcut and distribution bias) by separate branches under the counterfactual inference framework. The effects of both types of bias on answer prediction can be effectively mitigated by subtracting direct effect of textual questions on answers from total effect of visual questions on answers. Experimental results demonstrate that our proposed DCCE framework significantly reduces language bias and achieves state-of-the-art performance on the benchmark datasets without requiring additional augmented data. Our code is available in https://github.com/sxycyck/dcce.

UR - http://www.scopus.com/inward/record.url?scp=85217622235&partnerID=8YFLogxK

U2 - 10.18653/v1/2024.findings-emnlp.245

DO - 10.18653/v1/2024.findings-emnlp.245

M3 - Conference contribution

AN - SCOPUS:85217622235

T3 - EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Findings of EMNLP 2024

SP - 4242

EP - 4252

BT - EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Findings of EMNLP 2024

A2 - Al-Onaizan, Yaser

A2 - Bansal, Mohit

A2 - Chen, Yun-Nung

PB - Association for Computational Linguistics (ACL)

T2 - 2024 Findings of the Association for Computational Linguistics, EMNLP 2024

Y2 - 12 November 2024 through 16 November 2024

ER -

Song L, Yang C, Li X, Shang X. A Robust Dual-debiasing VQA Model based on Counterfactual Causal Effect. In Al-Onaizan Y, Bansal M, Chen YN, editors, EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Findings of EMNLP 2024. Association for Computational Linguistics (ACL). 2024. p. 4242-4252. (EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Findings of EMNLP 2024). doi: 10.18653/v1/2024.findings-emnlp.245
