
Research on civil aircraft verification test data retrieval technology based on word segmentation

  • Feng Yunwen
  • Liu Wanyi
  • Lu Cheng
  • Chen Haibo
  • Yin Xiaolong
  • Luo Yuhang
  • Northwestern Polytechnical University Xian
  • Ltd.
  • Ltd.

Research output: Contribution to journal, Article, peer-reviewed

Abstract

Civil aircraft validation tests generate a massive amount of unstructured data, which makes it difficult to retrieve and apply data based on file content. Traditional word segmentation retrieval methods cannot meet the segmentation retrieval requirements of the validation test field. A segmentation retrieval method based on statistics and terminology dictionaries is therefore proposed for this field. First, a conditional random field (CRF) model performs the initial segmentation of the text. Then, a terminology dictionary is constructed from the domain files, and the reverse maximum matching (RMM) algorithm is applied to the initially segmented text using this dictionary to achieve professional segmentation of the text. Finally, based on the professional segmentation results, the unstructured files are segmented and indexed to support data retrieval based on file content. A case study segments the text of a test outline and compares the result with traditional statistical segmentation methods, namely CRF, N-gram, and the hidden Markov model (HMM). The results show that the proposed method achieves the best accuracy in professional segmentation, and that it can be used to build a civil aircraft validation test database and to rapidly retrieve unstructured files in that database.
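The second and third steps of the abstract (dictionary-based RMM over an initial segmentation, then indexing) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names `rmm_merge` and `build_index`, the `max_len` window, the sample tokens, and the two dictionary terms are all assumptions made for illustration, and the initial tokens are written by hand rather than produced by a CRF model.

```python
from collections import defaultdict


def rmm_merge(tokens, terms, max_len=4):
    """Merge adjacent initial tokens into dictionary terms, scanning right to left.

    tokens  : initial segmentation (here hand-written, standing in for CRF output)
    terms   : set of domain terms (hypothetical terminology dictionary)
    max_len : largest number of adjacent tokens tried as one term (assumed value)
    """
    result = []
    end = len(tokens)
    while end > 0:
        # Try the longest window ending at `end` first, then shrink it.
        for size in range(min(max_len, end), 0, -1):
            candidate = "".join(tokens[end - size:end])
            # A single token is always accepted as the fallback.
            if size == 1 or candidate in terms:
                result.append(candidate)
                end -= size
                break
    result.reverse()
    return result


def build_index(docs, terms):
    """Build an inverted index: segmented word -> set of document ids."""
    index = defaultdict(set)
    for doc_id, tokens in docs.items():
        for word in rmm_merge(tokens, terms):
            index[word].add(doc_id)
    return index


# Made-up example data: "起落架" (landing gear), "收放试验" (retraction test),
# "大纲" (outline), "载荷" (load).
terms = {"起落架", "收放试验"}
docs = {
    "outline_A": ["起落", "架", "收放", "试验", "大纲"],
    "report_B": ["起落", "架", "载荷"],
}

merged = rmm_merge(docs["outline_A"], terms)
index = build_index(docs, terms)
print(merged)
print(sorted(index["起落架"]))
```

In this sketch the dictionary only ever merges neighbouring initial tokens, so a weak initial segmentation can still be corrected for known domain terms, while words outside the dictionary keep their initial boundaries.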

Original language: English
Pages (from-to): 155-163
Number of pages: 9
Journal: Advances in Aeronautical Science and Engineering
Volume: 16
Issue number: 5
DOI
Publication status: Published - Jan 2025
