TY - GEN
T1 - Transfer Learning and On-Fly Data Augmentation for Scene UnderstandingUsing InceptionResNet
AU - Nachipyangu, Michael
AU - Zheng, Jiangbin
AU - Mawagali, Palme
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Deep learning models require large amounts of data to achieve good results. However, most datasets consist of images taken from similar angles, brightness levels, and orientations, which do not reflect the diverse reality of scenes. To address this issue, data augmentation techniques are employed to generate images that mimic actual scenarios, thereby increasing the training data for the model. In this paper, we propose an on-The-fly data augmentation approach that enhances the dataset while minimizing the need for additional storage by not saving augmented images to disk. We evaluate different pretrained and trained-from-scratch Convolutional Neural Network (CNN) models on benchmark scene datasets (Scene15 and MIT67), and our results demonstrate that fine-Tuning the InceptionResNetV2 model achieves competitive performance compared to state-of-The-Art methods on these datasets with accuracy of 95% and 86% respectively. This research contributes to creating more realistic scene representations through data augmentation while optimizing disk space usage. Furthermore, we highlight the effectiveness of data augmentation as a regularization technique by reducing loss. The findings presented in this paper provide valuable insights for scene understanding tasks and have implications for various applications such as education, healthcare systems, autonomous vehicles, and domestic robot navigation.
AB - Deep learning models require large amounts of data to achieve good results. However, most datasets consist of images taken from similar angles, brightness levels, and orientations, which do not reflect the diverse reality of scenes. To address this issue, data augmentation techniques are employed to generate images that mimic actual scenarios, thereby increasing the training data for the model. In this paper, we propose an on-The-fly data augmentation approach that enhances the dataset while minimizing the need for additional storage by not saving augmented images to disk. We evaluate different pretrained and trained-from-scratch Convolutional Neural Network (CNN) models on benchmark scene datasets (Scene15 and MIT67), and our results demonstrate that fine-Tuning the InceptionResNetV2 model achieves competitive performance compared to state-of-The-Art methods on these datasets with accuracy of 95% and 86% respectively. This research contributes to creating more realistic scene representations through data augmentation while optimizing disk space usage. Furthermore, we highlight the effectiveness of data augmentation as a regularization technique by reducing loss. The findings presented in this paper provide valuable insights for scene understanding tasks and have implications for various applications such as education, healthcare systems, autonomous vehicles, and domestic robot navigation.
KW - Data Augmentation
KW - Deep learning
KW - Scene Understanding
KW - Transfer learning
UR - http://www.scopus.com/inward/record.url?scp=85190067059&partnerID=8YFLogxK
U2 - 10.1109/ISRITI60336.2023.10467503
DO - 10.1109/ISRITI60336.2023.10467503
M3 - 会议稿件
AN - SCOPUS:85190067059
T3 - 6th International Seminar on Research of Information Technology and Intelligent Systems, ISRITI 2023 - Proceeding
SP - 496
EP - 500
BT - 6th International Seminar on Research of Information Technology and Intelligent Systems, ISRITI 2023 - Proceeding
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 6th International Seminar on Research of Information Technology and Intelligent Systems, ISRITI 2023
Y2 - 11 December 2023
ER -