ISSN 0253-2778

CN 34-1054/N

Open AccessOpen Access JUSTC

Improving emotion expression extraction in Chinese microblogs via new words detection

Cite this:
https://doi.org/10.3969/j.issn.0253-2778.2017.01.009
  • Received Date: 01 March 2016
  • Rev Recd Date: 17 September 2016
  • Publish Date: 31 January 2017
  • Emotion expression extraction is one of the important tasks of fine-grained sentiment mining. Existing methods lack efficiency in dealing with this task in Chinese microblogs because there are many new words and non-standard words in them. It’s found in this paper that a large number of new words are distributed in emotional expressions of the text in Chinese microblogs. A combined extraction model based on CRF is proposed, which incorporates new word detection into the task to improve the original work. The experimental results show that new word detection has good correlation with emotion expression extraction from Chinese microblogs, and that F1 value increases more than 2% on both the data sets of the movie field and the open field in Chinese microblogs.
    Emotion expression extraction is one of the important tasks of fine-grained sentiment mining. Existing methods lack efficiency in dealing with this task in Chinese microblogs because there are many new words and non-standard words in them. It’s found in this paper that a large number of new words are distributed in emotional expressions of the text in Chinese microblogs. A combined extraction model based on CRF is proposed, which incorporates new word detection into the task to improve the original work. The experimental results show that new word detection has good correlation with emotion expression extraction from Chinese microblogs, and that F1 value increases more than 2% on both the data sets of the movie field and the open field in Chinese microblogs.
  • loading
  • 加载中

Catalog

    Article Metrics

    Article views (232) PDF downloads(142)
    Proportional views

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return