GGBDCA:Scene graph generation based on Global Gradient Balanced Distribution and Compound Attention

Jiajia Liu, Linan Zhao, Guoqing Zhang, Linna Zhang, Yigang Cen

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Scene Graph Generation (SGG) is a key task in computer vision, aimed at automatically extracting objects and their relationships from images. Despite significant advances in SGG, the challenge of long-tail distribution continues to impede accurate prediction of rare relationships. To address this, we propose a novel SGG method based on Global Gradient Balanced Distribution and Compound Attention (GGBDCA). First, we introduce a Transformer-based framework that leverages a compound attention mechanism to extract detailed features. To tackle the long-tail identification problem, we propose a Global Gradient Balanced Distribution (GGBD) algorithm, which converts the long-tail issue into a multi-objective optimization problem. Through Gradient Balance Grouping (GBG), similar categories are clustered to enhance attention on rare classes. Then, the Multi-Gradient Descent Algorithm (MGDA) is employed to solve the multi-objective optimization, while the Adaptive Calibration Function (ACF) dynamically adjusts classification scores to improve the model's generalization. These three core modules - GBG, MGDA, and ACF - work together to balance learning between head and tail classes, focusing more on rare categories and boosting overall performance.

Original languageEnglish
Title of host publicationICSP 2024 - 2024 IEEE 17th International Conference on Signal Processing, Proceedings
EditorsYuan Baozong, Ruan Qiuqi, Wei Shikui, An Gaoyun
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages526-531
Number of pages6
ISBN (Electronic)9798350387384
DOIs
StatePublished - 2024
Externally publishedYes
Event17th IEEE International Conference on Signal Processing, ICSP 2024 - Suzhou, China
Duration: 28 Oct 202431 Oct 2024

Publication series

NameInternational Conference on Signal Processing Proceedings, ICSP
ISSN (Print)2164-5221
ISSN (Electronic)2164-523X

Conference

Conference17th IEEE International Conference on Signal Processing, ICSP 2024
Country/TerritoryChina
CitySuzhou
Period28/10/2431/10/24

Keywords

  • Scene graph generation
  • global gradient balance distribution
  • long-tail distribution

Fingerprint

Dive into the research topics of 'GGBDCA:Scene graph generation based on Global Gradient Balanced Distribution and Compound Attention'. Together they form a unique fingerprint.

Cite this