TY - JOUR
T1 - Few-shot decision tree for diagnosis of ultrasound breast tumor using BI-RADS features
AU - Huang, Qinghua
AU - Zhang, Fan
AU - Li, Xuelong
N1 - Publisher Copyright:
© 2018, Springer Science+Business Media, LLC, part of Springer Nature.
PY - 2018/11/1
Y1 - 2018/11/1
N2 - This paper proposes an ultrasound breast tumor CAD system based on BI-RADS features scoring and decision tree algorithm. Because of the difficulty of biopsy label collection, the proposed system adopts a few-shot learning method. The SVM classifier is employed to preliminarily mark the unlabeled cases firstly. Then these unlabeled cases with the pseudo labels are combined with the few real-labeled cases to train the decision tree. To test the performance of the proposed method, 1208 ultrasound breast images were collected, and three well-experienced clinicians and three interns evaluated these images according to the BI-RADS scoring scheme. All of the images are transformed into vectors such that the algorithm can process. The experimental results show that the system performance improves significantly with the help of pseudo-labeled data. Compared to the decision tree trained by the real-labeled cases only, when the number of real-labeled cases was 40, the accuracy, specificity, sensitivity of the proposed system were increased by 2.05%, 2.47% and 1.81%, respectively; the positive predictive value (PPV) and the negative predictive value (NVP) were increased by 1.29% and 3.05%, respectively. Meanwhile, the performance of the proposed method was the same as the method using sufficient samples. When the number of the labeled cases reached 100, the accuracy, specificity, sensitivity, PPV and NVP of the proposed method were 90.03%, 87.02%, 91.68%, 93.07%, and 85.03%, respectively. The results demonstrate that our method can efficiently distinguish the breast tumor although the labeled data is not sufficient.
AB - This paper proposes an ultrasound breast tumor CAD system based on BI-RADS features scoring and decision tree algorithm. Because of the difficulty of biopsy label collection, the proposed system adopts a few-shot learning method. The SVM classifier is employed to preliminarily mark the unlabeled cases firstly. Then these unlabeled cases with the pseudo labels are combined with the few real-labeled cases to train the decision tree. To test the performance of the proposed method, 1208 ultrasound breast images were collected, and three well-experienced clinicians and three interns evaluated these images according to the BI-RADS scoring scheme. All of the images are transformed into vectors such that the algorithm can process. The experimental results show that the system performance improves significantly with the help of pseudo-labeled data. Compared to the decision tree trained by the real-labeled cases only, when the number of real-labeled cases was 40, the accuracy, specificity, sensitivity of the proposed system were increased by 2.05%, 2.47% and 1.81%, respectively; the positive predictive value (PPV) and the negative predictive value (NVP) were increased by 1.29% and 3.05%, respectively. Meanwhile, the performance of the proposed method was the same as the method using sufficient samples. When the number of the labeled cases reached 100, the accuracy, specificity, sensitivity, PPV and NVP of the proposed method were 90.03%, 87.02%, 91.68%, 93.07%, and 85.03%, respectively. The results demonstrate that our method can efficiently distinguish the breast tumor although the labeled data is not sufficient.
KW - BI-RADS
KW - Breast tumors CAD system
KW - Decision tree
KW - Few-shot learning
UR - http://www.scopus.com/inward/record.url?scp=85045958017&partnerID=8YFLogxK
U2 - 10.1007/s11042-018-6026-1
DO - 10.1007/s11042-018-6026-1
M3 - 文章
AN - SCOPUS:85045958017
SN - 1380-7501
VL - 77
SP - 29905
EP - 29918
JO - Multimedia Tools and Applications
JF - Multimedia Tools and Applications
IS - 22
ER -