TY - JOUR
T1 - Domain-Adaptive Crowd Counting via High-Quality Image Translation and Density Reconstruction
AU - Gao, Junyu
AU - Han, Tao
AU - Yuan, Yuan
AU - Wang, Qi
N1 - Publisher Copyright: © 2012 IEEE.
PY - 2023/8/1
Y1 - 2023/8/1
N2 - Recently, crowd counting using supervised learning has achieved remarkable improvement. Nevertheless, most counters rely on a large amount of manually labeled data. With the release of synthetic crowd data, a potential alternative is to transfer knowledge from synthetic data to real data without any manual labels. However, no existing method effectively suppresses domain gaps and outputs elaborate density maps during the transfer. To remedy these problems, this article proposes a domain-adaptive crowd counting (DACC) framework, which consists of high-quality image translation and density map reconstruction. Specifically, the former translates synthetic data into realistic images and promotes translation quality by separating domain-shared and domain-independent features and by designing a content-aware consistency loss. The latter generates pseudo labels on real scenes to improve prediction quality. A final counter is then retrained using these pseudo labels. Adaptation experiments on six real-world datasets demonstrate that the proposed method outperforms state-of-the-art methods.
KW - Crowd counting
KW - domain adaptation
KW - image translation
UR - http://www.scopus.com/inward/record.url?scp=85119428819&partnerID=8YFLogxK
U2 - 10.1109/TNNLS.2021.3124272
DO - 10.1109/TNNLS.2021.3124272
M3 - Article
C2 - 34767512
AN - SCOPUS:85119428819
SN - 2162-237X
VL - 34
SP - 4803
EP - 4815
JO - IEEE Transactions on Neural Networks and Learning Systems
JF - IEEE Transactions on Neural Networks and Learning Systems
IS - 8
ER -