TY - GEN
T1 - Attribute-Based Out-of-Distribution Detection Using LLaVA
AU - Zhao, Daojie
AU - Hou, Chao
AU - Nie, Yongwei
AU - Zhu, Peican
AU - Tang, Keke
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.
PY - 2025
Y1 - 2025
N2 - Deep neural networks (DNNs) often exhibit overconfidence when encountering out-of-distribution (OOD) samples, which poses significant challenges in real-world applications. To address this issue, large multimodal models (LMMs) have been employed, showing considerable promise. Existing approaches attempt to explore CLIP's textual capabilities by generating extensive (OOD) categories. Recognizing that distinctive attributes of various image categories are essential for differentiating between in-distribution (ID) and OOD samples, this paper introduces an attribute-based method for OOD detection. This approach utilizes the LLaVA to extract image attributes, which are then compared with a reference attribute set established for each ID category to estimate the likelihood of an image being ID or OOD. Furthermore, to comprehensively represent each category, we introduce an attribute selection strategy that considers both the commonality and diversity of attributes, significantly improving OOD detection performance. Enhancing OOD detection performance. Extensive experiments conducted across various ID/OOD settings demonstrate the effectiveness of our method and its superiority over state-of-the-art approaches.
AB - Deep neural networks (DNNs) often exhibit overconfidence when encountering out-of-distribution (OOD) samples, which poses significant challenges in real-world applications. To address this issue, large multimodal models (LMMs) have been employed, showing considerable promise. Existing approaches attempt to explore CLIP's textual capabilities by generating extensive (OOD) categories. Recognizing that distinctive attributes of various image categories are essential for differentiating between in-distribution (ID) and OOD samples, this paper introduces an attribute-based method for OOD detection. This approach utilizes the LLaVA to extract image attributes, which are then compared with a reference attribute set established for each ID category to estimate the likelihood of an image being ID or OOD. Furthermore, to comprehensively represent each category, we introduce an attribute selection strategy that considers both the commonality and diversity of attributes, significantly improving OOD detection performance. Enhancing OOD detection performance. Extensive experiments conducted across various ID/OOD settings demonstrate the effectiveness of our method and its superiority over state-of-the-art approaches.
KW - Attribute-based Methods
KW - LLaVA
KW - Large Multimodal Models
KW - Out-of-distribution Detection
UR - https://www.scopus.com/pages/publications/105012358449
U2 - 10.1007/978-981-96-9815-8_10
DO - 10.1007/978-981-96-9815-8_10
M3 - 会议稿件
AN - SCOPUS:105012358449
SN - 9789819698141
T3 - Lecture Notes in Computer Science
SP - 112
EP - 122
BT - Advanced Intelligent Computing Technology and Applications - 21st International Conference, ICIC 2025, Proceedings
A2 - Huang, De-Shuang
A2 - Pan, Yijie
A2 - Chen, Wei
A2 - Li, Bo
PB - Springer Science and Business Media Deutschland GmbH
T2 - 21st International Conference on Intelligent Computing, ICIC 2025
Y2 - 26 July 2025 through 29 July 2025
ER -