Language and Domain Specificity: A Chinese Financial Sentiment Dictionary
认领作者 认领管理权限
发布日期:2021年12月27日 上次修订日期:2022年03月09日


We use supervised machine learning to develop a Chinese language financial sentiment dictionary from 3.1 million financial news articles. Our dictionary maps semantically similar words to a subset of human-expert generated financial sentiment words. In article-level validation tests, our dictionary scores the sentiment of articles consistently with a human reading of full articles. In return validation tests, our dictionary outperforms and subsumes previous Chinese financial sentiment dictionaries such as direct translations of Loughran and McDonald’s (2011) financial words. We also generate a list of politically-related positive words that is unique to China; this list has a weaker association with returns than does the list of otherwise positive words. We demonstrate that state media exhibits a sentiment bias by using more politically-related positive and fewer negative words, and this bias renders state media’s sentiment less return-informative. Our findings demonstrate that dictionary-based sentiment analysis exhibits strong language and domain specificity.


  • 浏览次数:

  • 下载次数:



Zijia Du ; Alan Guoming Huang ; Russ Wermers ; Wenfeng Wu ; Language and Domain Specificity: A Chinese Financial Sentiment Dictionary (2021年12月27日)

*邮 箱