Transfer Learning and On-Fly Data Augmentation for Scene UnderstandingUsing InceptionResNet

Michael Nachipyangu; Jiangbin Zheng; Palme Mawagali

doi:10.1109/ISRITI60336.2023.10467503

Transfer Learning and On-Fly Data Augmentation for Scene UnderstandingUsing InceptionResNet

Michael Nachipyangu, Jiangbin Zheng, Palme Mawagali

School of Software

Northwestern Polytechnical University Xian

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Deep learning models require large amounts of data to achieve good results. However, most datasets consist of images taken from similar angles, brightness levels, and orientations, which do not reflect the diverse reality of scenes. To address this issue, data augmentation techniques are employed to generate images that mimic actual scenarios, thereby increasing the training data for the model. In this paper, we propose an on-The-fly data augmentation approach that enhances the dataset while minimizing the need for additional storage by not saving augmented images to disk. We evaluate different pretrained and trained-from-scratch Convolutional Neural Network (CNN) models on benchmark scene datasets (Scene15 and MIT67), and our results demonstrate that fine-Tuning the InceptionResNetV2 model achieves competitive performance compared to state-of-The-Art methods on these datasets with accuracy of 95% and 86% respectively. This research contributes to creating more realistic scene representations through data augmentation while optimizing disk space usage. Furthermore, we highlight the effectiveness of data augmentation as a regularization technique by reducing loss. The findings presented in this paper provide valuable insights for scene understanding tasks and have implications for various applications such as education, healthcare systems, autonomous vehicles, and domestic robot navigation.

Original language	English
Title of host publication	6th International Seminar on Research of Information Technology and Intelligent Systems, ISRITI 2023 - Proceeding
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	496-500
Number of pages	5
ISBN (Electronic)	9798350358346
DOIs	https://doi.org/10.1109/ISRITI60336.2023.10467503
State	Published - 2023
Event	6th International Seminar on Research of Information Technology and Intelligent Systems, ISRITI 2023 - Batam, Indonesia Duration: 11 Dec 2023 → …

Publication series

Name	6th International Seminar on Research of Information Technology and Intelligent Systems, ISRITI 2023 - Proceeding

Conference

Conference	6th International Seminar on Research of Information Technology and Intelligent Systems, ISRITI 2023
Country/Territory	Indonesia
City	Batam
Period	11/12/23 → …

Keywords

Data Augmentation
Deep learning
Scene Understanding
Transfer learning

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

Access to Document

10.1109/ISRITI60336.2023.10467503

Cite this

Nachipyangu, M., Zheng, J., & Mawagali, P. (2023). Transfer Learning and On-Fly Data Augmentation for Scene UnderstandingUsing InceptionResNet. In 6th International Seminar on Research of Information Technology and Intelligent Systems, ISRITI 2023 - Proceeding (pp. 496-500). (6th International Seminar on Research of Information Technology and Intelligent Systems, ISRITI 2023 - Proceeding). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ISRITI60336.2023.10467503

Nachipyangu, Michael ; Zheng, Jiangbin ; Mawagali, Palme. / Transfer Learning and On-Fly Data Augmentation for Scene UnderstandingUsing InceptionResNet. 6th International Seminar on Research of Information Technology and Intelligent Systems, ISRITI 2023 - Proceeding. Institute of Electrical and Electronics Engineers Inc., 2023. pp. 496-500 (6th International Seminar on Research of Information Technology and Intelligent Systems, ISRITI 2023 - Proceeding).

@inproceedings{44edb324599b4148ae613d5c4c7e50e8,

title = "Transfer Learning and On-Fly Data Augmentation for Scene UnderstandingUsing InceptionResNet",

abstract = "Deep learning models require large amounts of data to achieve good results. However, most datasets consist of images taken from similar angles, brightness levels, and orientations, which do not reflect the diverse reality of scenes. To address this issue, data augmentation techniques are employed to generate images that mimic actual scenarios, thereby increasing the training data for the model. In this paper, we propose an on-The-fly data augmentation approach that enhances the dataset while minimizing the need for additional storage by not saving augmented images to disk. We evaluate different pretrained and trained-from-scratch Convolutional Neural Network (CNN) models on benchmark scene datasets (Scene15 and MIT67), and our results demonstrate that fine-Tuning the InceptionResNetV2 model achieves competitive performance compared to state-of-The-Art methods on these datasets with accuracy of 95% and 86% respectively. This research contributes to creating more realistic scene representations through data augmentation while optimizing disk space usage. Furthermore, we highlight the effectiveness of data augmentation as a regularization technique by reducing loss. The findings presented in this paper provide valuable insights for scene understanding tasks and have implications for various applications such as education, healthcare systems, autonomous vehicles, and domestic robot navigation.",

keywords = "Data Augmentation, Deep learning, Scene Understanding, Transfer learning",

author = "Michael Nachipyangu and Jiangbin Zheng and Palme Mawagali",

note = "Publisher Copyright: {\textcopyright} 2023 IEEE.; 6th International Seminar on Research of Information Technology and Intelligent Systems, ISRITI 2023 ; Conference date: 11-12-2023",

year = "2023",

doi = "10.1109/ISRITI60336.2023.10467503",

language = "英语",

series = "6th International Seminar on Research of Information Technology and Intelligent Systems, ISRITI 2023 - Proceeding",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "496--500",

booktitle = "6th International Seminar on Research of Information Technology and Intelligent Systems, ISRITI 2023 - Proceeding",

}

Nachipyangu, M, Zheng, J & Mawagali, P 2023, Transfer Learning and On-Fly Data Augmentation for Scene UnderstandingUsing InceptionResNet. in 6th International Seminar on Research of Information Technology and Intelligent Systems, ISRITI 2023 - Proceeding. 6th International Seminar on Research of Information Technology and Intelligent Systems, ISRITI 2023 - Proceeding, Institute of Electrical and Electronics Engineers Inc., pp. 496-500, 6th International Seminar on Research of Information Technology and Intelligent Systems, ISRITI 2023, Batam, Indonesia, 11/12/23. https://doi.org/10.1109/ISRITI60336.2023.10467503

Transfer Learning and On-Fly Data Augmentation for Scene UnderstandingUsing InceptionResNet. / Nachipyangu, Michael; Zheng, Jiangbin; Mawagali, Palme.
6th International Seminar on Research of Information Technology and Intelligent Systems, ISRITI 2023 - Proceeding. Institute of Electrical and Electronics Engineers Inc., 2023. p. 496-500 (6th International Seminar on Research of Information Technology and Intelligent Systems, ISRITI 2023 - Proceeding).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Transfer Learning and On-Fly Data Augmentation for Scene UnderstandingUsing InceptionResNet

AU - Nachipyangu, Michael

AU - Zheng, Jiangbin

AU - Mawagali, Palme

PY - 2023

Y1 - 2023

N2 - Deep learning models require large amounts of data to achieve good results. However, most datasets consist of images taken from similar angles, brightness levels, and orientations, which do not reflect the diverse reality of scenes. To address this issue, data augmentation techniques are employed to generate images that mimic actual scenarios, thereby increasing the training data for the model. In this paper, we propose an on-The-fly data augmentation approach that enhances the dataset while minimizing the need for additional storage by not saving augmented images to disk. We evaluate different pretrained and trained-from-scratch Convolutional Neural Network (CNN) models on benchmark scene datasets (Scene15 and MIT67), and our results demonstrate that fine-Tuning the InceptionResNetV2 model achieves competitive performance compared to state-of-The-Art methods on these datasets with accuracy of 95% and 86% respectively. This research contributes to creating more realistic scene representations through data augmentation while optimizing disk space usage. Furthermore, we highlight the effectiveness of data augmentation as a regularization technique by reducing loss. The findings presented in this paper provide valuable insights for scene understanding tasks and have implications for various applications such as education, healthcare systems, autonomous vehicles, and domestic robot navigation.

AB - Deep learning models require large amounts of data to achieve good results. However, most datasets consist of images taken from similar angles, brightness levels, and orientations, which do not reflect the diverse reality of scenes. To address this issue, data augmentation techniques are employed to generate images that mimic actual scenarios, thereby increasing the training data for the model. In this paper, we propose an on-The-fly data augmentation approach that enhances the dataset while minimizing the need for additional storage by not saving augmented images to disk. We evaluate different pretrained and trained-from-scratch Convolutional Neural Network (CNN) models on benchmark scene datasets (Scene15 and MIT67), and our results demonstrate that fine-Tuning the InceptionResNetV2 model achieves competitive performance compared to state-of-The-Art methods on these datasets with accuracy of 95% and 86% respectively. This research contributes to creating more realistic scene representations through data augmentation while optimizing disk space usage. Furthermore, we highlight the effectiveness of data augmentation as a regularization technique by reducing loss. The findings presented in this paper provide valuable insights for scene understanding tasks and have implications for various applications such as education, healthcare systems, autonomous vehicles, and domestic robot navigation.

KW - Data Augmentation

KW - Deep learning

KW - Scene Understanding

KW - Transfer learning

UR - http://www.scopus.com/inward/record.url?scp=85190067059&partnerID=8YFLogxK

U2 - 10.1109/ISRITI60336.2023.10467503

DO - 10.1109/ISRITI60336.2023.10467503

M3 - 会议稿件

AN - SCOPUS:85190067059

T3 - 6th International Seminar on Research of Information Technology and Intelligent Systems, ISRITI 2023 - Proceeding

SP - 496

EP - 500

BT - 6th International Seminar on Research of Information Technology and Intelligent Systems, ISRITI 2023 - Proceeding

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 6th International Seminar on Research of Information Technology and Intelligent Systems, ISRITI 2023

Y2 - 11 December 2023

ER -

Nachipyangu M, Zheng J, Mawagali P. Transfer Learning and On-Fly Data Augmentation for Scene UnderstandingUsing InceptionResNet. In 6th International Seminar on Research of Information Technology and Intelligent Systems, ISRITI 2023 - Proceeding. Institute of Electrical and Electronics Engineers Inc. 2023. p. 496-500. (6th International Seminar on Research of Information Technology and Intelligent Systems, ISRITI 2023 - Proceeding). doi: 10.1109/ISRITI60336.2023.10467503

Transfer Learning and On-Fly Data Augmentation for Scene UnderstandingUsing InceptionResNet

Abstract

Publication series

Conference

Keywords

UN SDGs

Access to Document

Other files and links

Fingerprint

Cite this