Phonetic-enriched text representation for Chinese sentiment analysis with reinforcement learning

Haiyun Peng, Yukun Ma, Soujanya Poria, Yang Li, Erik Cambria

Research output: Contribution to journalArticlepeer-review

47 Scopus citations

Abstract

The Chinese pronunciation system offers two characteristics that distinguish it from other languages: deep phonemic orthography and intonation variations. In this paper, we hypothesize that these two important properties can play a major role in Chinese sentiment analysis. In particular, we propose two effective features to encode phonetic information and, hence, fuse it with textual information. With this hypothesis, we propose Disambiguate Intonation for Sentiment Analysis (DISA), a network that we develop based on the principles of reinforcement learning. DISA disambiguates intonations for each Chinese character (pinyin) and, hence, learns precise phonetic representations. We also fuse phonetic features with textual and visual features to further improve performance. Experimental results on five different Chinese sentiment analysis datasets show that the inclusion of phonetic features significantly and consistently improves the performance of textual and visual representations and surpasses the state-of-the-art Chinese character-level representations.

Original languageEnglish
Pages (from-to)88-99
Number of pages12
JournalInformation Fusion
Volume70
DOIs
StatePublished - Jun 2021

Keywords

  • Chinese phonetics
  • Deep phonemic orthography
  • Multilingual sentiment analysis
  • Sentiment analysis

Fingerprint

Dive into the research topics of 'Phonetic-enriched text representation for Chinese sentiment analysis with reinforcement learning'. Together they form a unique fingerprint.

Cite this