Skip to main navigation Skip to search Skip to main content

AugMine: Boosting Coal Mine Accident News Classification with Text Data Augmentation

  • He Hu
  • , Yixu Feng
  • , Chaoqun Wang
  • , Zhaohe Wang
  • , Xiaowen Ma
  • , Peng Wu
  • , Wei Dong
  • , Qingsen Yan
  • Northwestern Polytechnical University Xian
  • Xi'an University of Science and Technology
  • Jining Polytechnic
  • Xi'an University of Architecture and Technology

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

In coal mine safety research, the precise classification of accident reports is paramount. This process facilitates a rapid comprehension of accident causes, enabling the formulation of effective preventive measures. The advent of Natural Language Processing (NLP), particularly through the emergence of models like BERT and its variants, has revolutionized our capacity for accurate report classification. Yet, the challenge of scarce coal mine safety labeled data and the prohibitive costs of labeling persists, impeding the optimal utilization of pre-trained models. Data augmentation proves to be a powerful method for addressing these challenges. However, traditional text data augmentation techniques face limitations due to their potential to generate homogenous data and the risk of distorting essential information. To overcome these constraints, we introduce ”AugMine,” an innovative text data augmentation strategy. AugMine capitalizes on ChatGPT’s prowess in generating high-quality text from limited datasets, thereby broadening the training pool and bolstering the model’s proficiency in identifying critical components within coal mine accident reports. Furthermore, we incorporate adversarial training techniques to further enhance classification performance. In this study, we leveraged BERT and its derivatives for feature extraction and assessed multiple data augmentation strategies. The experimental results demonstrate that the “AugMine” approach, which we introduced, notably enhances the precision of classifying coal mine accident reports, outperforming established text data augmentation techniques.

Original languageEnglish
Title of host publicationProceedings of International Conference on Image, Vision and Intelligent Systems, ICIVIS 2024 - Volume I
EditorsPeng You, Yuhui Zheng
PublisherSpringer Science and Business Media Deutschland GmbH
Pages1-22
Number of pages22
ISBN (Print)9789819624317
DOIs
StatePublished - 2025
EventInternational Conference on Image, Vision and Intelligent Systems, ICIVIS 2024 - Xining, China
Duration: 16 Jun 202417 Jun 2024

Publication series

NameLecture Notes in Electrical Engineering
Volume1359
ISSN (Print)1876-1100
ISSN (Electronic)1876-1119

Conference

ConferenceInternational Conference on Image, Vision and Intelligent Systems, ICIVIS 2024
Country/TerritoryChina
CityXining
Period16/06/2417/06/24

Keywords

  • Coal mine accident
  • Data augmentation
  • Pre-trained language models
  • Text classification

Fingerprint

Dive into the research topics of 'AugMine: Boosting Coal Mine Accident News Classification with Text Data Augmentation'. Together they form a unique fingerprint.

Cite this