TY - JOUR
T1 - Mining High-Quality Pseudoinstance Soft Labels for Weakly Supervised Object Detection in Remote Sensing Images
AU - Qian, Xiaoliang
AU - Huo, Yu
AU - Cheng, Gong
AU - Gao, Chenyang
AU - Yao, Xiwen
AU - Wang, Wei
N1 - Publisher Copyright:
© 1980-2012 IEEE.
PY - 2023
Y1 - 2023
N2 - Weakly supervised object detection in remote sensing images (RSI) is still a challenge because of the lack of instance-level labels, and many existing methods have two problems. First, most of the existing methods usually mine the pseudoground-truth (PGT) instances solely relying on proposal class scores (PCSs). Actually, the reliability of PCS is not enough because of the bird's eye view imaging and large-scale chaotic background of RSIs, and the instances with high PCS incline to cover the discriminative region, rather than the whole object. Second, the existing methods assign a one-hot label to each instance, and the label of the PGT instance is copied to its neighbor instances, which induces the misclassification problem to some extent. Actually, the probability that the neighbor instances contain the object with the same category is smaller than the PGT instance. For the first problem, the proposal quality score (PQS) is proposed for mining high-quality PGT instances, which contain PCS and dual-context projection score (DCPS). The DCPS is calculated through semantic segmentation and is employed to measure the completeness that each proposal covers an object. For the second problem, a pseudosoft label assignment (PSLA) strategy is proposed to assign a more precise soft label for each instance, where the soft label is determined by the spatial distance between each instance and its nearest PGT instance. The ablation study validates the effectiveness of the PQS and PSLA. The comprehensive comparisons with other WSOD methods on three popular benchmarks show the excellent performance of our method.
AB - Weakly supervised object detection in remote sensing images (RSI) is still a challenge because of the lack of instance-level labels, and many existing methods have two problems. First, most of the existing methods usually mine the pseudoground-truth (PGT) instances solely relying on proposal class scores (PCSs). Actually, the reliability of PCS is not enough because of the bird's eye view imaging and large-scale chaotic background of RSIs, and the instances with high PCS incline to cover the discriminative region, rather than the whole object. Second, the existing methods assign a one-hot label to each instance, and the label of the PGT instance is copied to its neighbor instances, which induces the misclassification problem to some extent. Actually, the probability that the neighbor instances contain the object with the same category is smaller than the PGT instance. For the first problem, the proposal quality score (PQS) is proposed for mining high-quality PGT instances, which contain PCS and dual-context projection score (DCPS). The DCPS is calculated through semantic segmentation and is employed to measure the completeness that each proposal covers an object. For the second problem, a pseudosoft label assignment (PSLA) strategy is proposed to assign a more precise soft label for each instance, where the soft label is determined by the spatial distance between each instance and its nearest PGT instance. The ablation study validates the effectiveness of the PQS and PSLA. The comprehensive comparisons with other WSOD methods on three popular benchmarks show the excellent performance of our method.
KW - Dual-context projection score (DCPS)
KW - proposal quality score (PQS)
KW - pseudosoft label
KW - remote sensing image (RSI)
KW - weakly supervised object detection (WSOD)
UR - http://www.scopus.com/inward/record.url?scp=85153389963&partnerID=8YFLogxK
U2 - 10.1109/TGRS.2023.3266838
DO - 10.1109/TGRS.2023.3266838
M3 - 文章
AN - SCOPUS:85153389963
SN - 0196-2892
VL - 61
JO - IEEE Transactions on Geoscience and Remote Sensing
JF - IEEE Transactions on Geoscience and Remote Sensing
M1 - 5607615
ER -