The objective of this exercise is to perform sentiment analysis on tweet data downloaded from Twitter.
We'll analyze the sentiment of tweets about the movie "Iron Man 3" using Hive and visualize the results using Tableau.
The dataset containing tweets about the movie "Iron Man 3" is located at the location below in HDFS.
Create Hive tables for calculating and storing the sentiment of each tweet. The corresponding hive.sql file is located at the location below in HDFS.
Connect Tableau to Hive to visualize the sentiment across various countries.
We'll calculate sentiment using a rudimentary technique. The polarity of common words is stored in the dictionary file below in HDFS.
Based on the polarity of the words it contains, we will calculate the sentiment of each tweet. You can follow exactly the same steps or use a different strategy altogether to calculate the sentiment.
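To make the word-polarity idea concrete, here is a minimal sketch in Python rather than Hive. It assumes the dictionary maps each word to +1 (positive) or -1 (negative), and that a tweet's sentiment is the sign of the summed polarities. The POLARITY entries and the tweet_sentiment function are hypothetical illustrations; the real dictionary file and the logic in hive.sql may differ.

```python
import re

# Hypothetical polarity dictionary (assumption): the real one is the
# dictionary file in HDFS, and its format may differ from this.
POLARITY = {
    "good": 1, "great": 1, "love": 1,
    "bad": -1, "boring": -1, "hate": -1,
}

def tweet_sentiment(text, polarity=POLARITY):
    """Label a tweet by summing the polarity of its known words."""
    # Lowercase the tweet and split it into simple word tokens.
    words = re.findall(r"[a-z']+", text.lower())
    # Words missing from the dictionary contribute 0 to the score.
    score = sum(polarity.get(word, 0) for word in words)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"

if __name__ == "__main__":
    for tweet in ["I love Iron Man 3, great movie",
                  "Boring and bad",
                  "Watched Iron Man 3 today"]:
        print(tweet, "->", tweet_sentiment(tweet))
```

In Hive, the same idea would typically be expressed by splitting each tweet into words, joining the words against the dictionary table, and aggregating the polarity per tweet.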
Several variations are possible, for example: