Authors
Clayton Hutto, Eric Gilbert
Publication date
2014/5/16
Journal
Proceedings of the international AAAI conference on web and social media
Volume
8
Issue
1
Pages
216-225
Description
The inherent nature of social media content poses serious challenges to practical applications of sentiment analysis. We present VADER, a simple rule-based model for general sentiment analysis, and compare its effectiveness to eleven typical state-of-practice benchmarks including LIWC, ANEW, the General Inquirer, SentiWordNet, and machine learning oriented techniques relying on Naive Bayes, Maximum Entropy, and Support Vector Machine (SVM) algorithms. Using a combination of qualitative and quantitative methods, we first construct and empirically validate a gold-standard list of lexical features (along with their associated sentiment intensity measures) which are specifically attuned to sentiment in microblog-like contexts. We then combine these lexical features with consideration for five general rules that embody grammatical and syntactical conventions for expressing and emphasizing sentiment intensity. Interestingly, using our parsimonious rule-based model to assess the sentiment of tweets, we find that VADER outperforms individual human raters (F1 Classification Accuracy= 0.96 and 0.84, respectively), and generalizes more favorably across contexts than any of our benchmarks.
Total citations
201520162017201820192020202120222023202438118207360507745105612171383489
Scholar articles
C Hutto, E Gilbert - Proceedings of the international AAAI conference on …, 2014