DOI: 10.1162/coli_a_00049 ISSN:

Lexicon-Based Methods for Sentiment Analysis

Maite Taboada, Julian Brooke, Milan Tofiloski, Kimberly Voll, Manfred Stede
  • Artificial Intelligence
  • Computer Science Applications
  • Linguistics and Language
  • Language and Linguistics

We present a lexicon-based approach to extracting sentiment from text. The Semantic Orientation CALculator (SO-CAL) uses dictionaries of words annotated with their semantic orientation (polarity and strength), and incorporates intensification and negation. SO-CAL is applied to the polarity classification task, the process of assigning a positive or negative label to a text that captures the text's opinion towards its main subject matter. We show that SO-CAL's performance is consistent across domains and in completely unseen data. Additionally, we describe the process of dictionary creation, and our use of Mechanical Turk to check dictionaries for consistency and reliability.