Machine learning based sentiment text classification for evaluating treatment quality of discharge summary

Samer Abdulateef Waheeb, Naseer Ahmed Khan, Bolin Chen, Xuequn Shang

科研成果: 期刊稿件文章同行评审

23 引用 (Scopus)

摘要

Patients' discharge summaries (documents) are health sensors that are used for measuring the quality of treatment in medical centers. However, extracting information automatically from discharge summaries with unstructured natural language is considered challenging. These kinds of documents include various aspects of patient information that could be used to test the treatment quality for improving medical-related decisions. One of the significant techniques in literature for discharge summaries classification is feature extraction techniques from the domain of natural language processing on text data. We propose a novel sentiment analysis method for discharge summaries classification that relies on vector space models, statistical methods, association rule, and extreme learning machine autoencoder (ELM-AE). Our novel hybrid model is based on statistical methods that build the lexicon in a domain related to health and medical records. Meanwhile, our method examines treatment quality based on an idea inspired by sentiment analysis. Experiments prove that our proposed method obtains a higher F1 value of 0.89 with good TPR (True Positive Rate) and FPR (False Positive Rate) values compared with various well-known state-of-the-art methods with different size of training and testing datasets. The results also prove that our method provides a flexible and effective technique to examine treatment quality based on positive, negative, and neutral terms for sentence-level in each discharge summary.

源语言英语
文章编号281
期刊Information (Switzerland)
11
5
DOI
出版状态已出版 - 1 6月 2020

指纹

探究 'Machine learning based sentiment text classification for evaluating treatment quality of discharge summary' 的科研主题。它们共同构成独一无二的指纹。

引用此