Supervised Learning Based Approach to Aspect Based Sentiment Analysis

H.A.T. Kumara – 2011CS006
Supervisor: Mr. Viraj Welgama
Co-Supervisor: Dr. A. R. Weerasinghe

Outline
• Proposal Wrap-up
• Background
• Existing Approaches
• Research Aims
• Scope & Limitations
• Design & Methodology
• Current Progress
• Evaluation

PROPOSAL WRAP-UP

Introduction “What do people think?”
• “Which Laptop should I buy?”
• “Which Restaurant should I go to?”
• “Which Food do I need to order?”
• “Which Service do I need to use?”

Introduction
Opinion Mining
Every day, a large number of opinion-related documents are posted on the Internet.
People post:
• Product Reviews
• Political Views
• Feelings

Introduction
Opinion Mining
Opinion mining, or sentiment analysis, aims to determine the attitude of a speaker with respect to some topic, or the overall contextual polarity of a document.

Introduction
In aspect-based sentiment analysis (ABSA) the
aim is to identify the aspects of entities and the
sentiment expressed for each aspect.

• Aspect Category Extraction
The Shrimp was awesome, but over-priced.
{Entity#Attribute} -> {Food#Quality, Food#Prices}
• Sentiment Polarity
The Shrimp was awesome, but over-priced.
{Entity#Attribute, Polarity} -> {Food#Quality, Positive}, {Food#Prices, Negative}
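
One way to picture the expected output is as a list of (category, polarity) pairs per sentence. The sketch below is only an illustration: the `Opinion` class and the `summarize` helper are hypothetical names introduced here, and the annotation is written by hand rather than produced by a classifier.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Opinion:
    """One aspect-level opinion: an Entity#Attribute category and its polarity."""
    category: str   # e.g. "Food#Quality"
    polarity: str   # "Positive" or "Negative"


def summarize(opinions):
    """Count positive and negative opinions per aspect category."""
    counts = {}
    for op in opinions:
        counts.setdefault(op.category, Counter())[op.polarity] += 1
    return counts


# Hand-written annotation for the example sentence (not classifier output).
sentence = "The Shrimp was awesome, but over-priced."
opinions = [
    Opinion("Food#Quality", "Positive"),
    Opinion("Food#Prices", "Negative"),
]
summary = summarize(opinions)
```

Grouping by category in this way is also the starting point for reporting an average sentiment per aspect over many reviews.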

EXISTING APPROACHES

Existing Approaches
Aspect Based Sentiment Analysis
• Sentiment Classification
• Aspect Extraction


Aspect Extraction
• Topic Model Based Approaches
• Frequency Based Approaches
• Supervised Learning Based Approaches

Sentiment Classification
System             Technique     Model                     Features
Wagner J. et al.   Supervised    SVM                       SentiWordNet, General Inquirer, Bing Liu (2004); normalized lexicon scores
Sentinue           Supervised    MaxEnt                    Lexical features; lexicon features; domain-specific features
B. Pang study      Supervised    SVM, Naïve Bayes, MaxEnt  Unigrams, bigrams, adjectives, position of words
Harb et al. study  Unsupervised  Association rule          Adjectives and adverbs

Aspect Extraction
System       Technique     Model   Features
NRC Canada   Supervised    SVM     MPQA, General Inquirer, Bing Liu, NRC Hashtag lexicon
NLANGP       Supervised    SVM     Word clusters, POS tags, head words
Sentinue     Supervised    MaxEnt  Text words and lemmas
Hu and Liu   Unsupervised  -       Noun frequency, association rule mining

RESEARCH AIMS

Research Objectives
• Discover a novel approach to Aspect Based Sentiment Analysis for reviews.
• Apply a supervised learning based approach to extract aspect categories and to determine sentiment polarity.
• The following objectives were devised to achieve the main targets of the project:
– An approach to extract the aspect category towards which an opinion is expressed in a given text or review.
– An approach to estimate the sentiment, and the average sentiment, of the texts per aspect.

ASSUMPTIONS

Design Assumptions
• Input sentences are assumed to be grammatically correct and in English.
• Subjectivity detection is not addressed; all sentences are assumed to be opinionated, either positive or negative.
• Each input sentence is assumed to belong to exactly one of the pre-identified domains.

Design Assumptions Cont.
• The standpoint of the author and the reader is not addressed, so all input sentences are assumed to be independent observations.
• Sarcasm is not addressed; the dataset is assumed to contain no sarcastic sentences.

DESIGN AND METHODOLOGY

Design Overview
[Pipeline diagram: Input, Preprocessing, Aspect Category Extraction (output: Aspect Category, {Entity#Attribute}), Sentiment Analyzer (output: Polarity, Positive or Negative)]

Preprocessing Module
Sample input sentences:
• The staff is unbelievably friendly, and I dream about their fajitas...so good.
• (Great for a romantic evening, but over-priced.
• The backlit keys are wonderful :-)
• The atmosphere isn't the greatest, I won’t so to this place again for sure.
• Yes, Great display "Mac .
Issues to handle:
• White space and punctuation
• Unexpected symbols/tokens
• Emoticons
• Informal, playful words
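
The cleaning steps listed for this module could be sketched as follows. This is a minimal illustration under stated assumptions, not the module's actual implementation: the `EMOTICONS` table, the `EMO_POS`/`EMO_NEG` placeholder tokens, and the `preprocess` function are hypothetical, and handling of informal words is left out.

```python
import re

# Hypothetical emoticon table; a real system would need a fuller list.
EMOTICONS = {":-)": "EMO_POS", ":)": "EMO_POS", ":-(": "EMO_NEG", ":(": "EMO_NEG"}


def preprocess(sentence):
    """Clean one raw review sentence and return a list of tokens."""
    text = sentence
    # Replace emoticons with placeholder tokens before stripping symbols.
    for emoticon, token in EMOTICONS.items():
        text = text.replace(emoticon, f" {token} ")
    # Normalize the typographic apostrophe so "won’t" and "won't" match.
    text = text.replace("’", "'")
    # Turn runs of dots ("fajitas...so") into a single space.
    text = re.sub(r"\.{2,}", " ", text)
    # Drop everything except letters, digits, apostrophes, underscores, hyphens.
    text = re.sub(r"[^A-Za-z0-9'_\-\s]", " ", text)
    # Collapse white space, split into tokens, lowercase all but placeholders.
    tokens = text.split()
    return [t if t.startswith("EMO_") else t.lower() for t in tokens]


cleaned = preprocess("The backlit keys are wonderful :-)")
```

Keeping the emoticon as a placeholder token, rather than deleting it, preserves a sentiment cue that a later classifier may use; whether that helps is an empirical question.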

Aspect Category Extractor
[Diagram: Lexical Features, Sentiment Lexicon, In-Domain Sentiment Lexicon, Classifier; output: Aspect Category, {Entity#Attribute}]

Lexicon Generation
Unlabeled Corpora -> In-Domain Sentiment Lexicon
A sentiment score is computed for each term w in the corpus using PMI, where PMI stands for pointwise mutual information.
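
A common way to turn PMI into a sentiment score, assumed here for illustration only, is score(w) = PMI(w, positive) - PMI(w, negative), with PMI(w, c) = log2(freq(w, c) * N / (freq(w) * freq(c))). The sketch below follows that formulation. It further assumes that each document carries a weak polarity label (for example, derived from a star rating) and uses add-one smoothing; both choices, and the function names, are assumptions of this sketch.

```python
import math
from collections import Counter


def pmi(joint, term_total, class_total, n_tokens):
    """Pointwise mutual information between a term and a class, in bits."""
    return math.log2((joint * n_tokens) / (term_total * class_total))


def build_lexicon(docs, smoothing=1.0):
    """Map each term to PMI(term, pos) - PMI(term, neg).

    `docs` is a list of (tokens, label) pairs with label "pos" or "neg".
    The weak labels and the add-one smoothing are assumptions of this sketch.
    """
    counts = {"pos": Counter(), "neg": Counter()}
    for tokens, label in docs:
        counts[label].update(tokens)
    vocab = set(counts["pos"]) | set(counts["neg"])
    # Smoothed counts keep every frequency strictly positive.
    joint = {c: {w: counts[c][w] + smoothing for w in vocab} for c in counts}
    class_total = {c: sum(joint[c].values()) for c in counts}
    n_tokens = class_total["pos"] + class_total["neg"]
    lexicon = {}
    for w in vocab:
        term_total = joint["pos"][w] + joint["neg"][w]
        lexicon[w] = (
            pmi(joint["pos"][w], term_total, class_total["pos"], n_tokens)
            - pmi(joint["neg"][w], term_total, class_total["neg"], n_tokens)
        )
    return lexicon


# Toy corpus, invented for the example.
docs = [
    (["great", "food", "friendly", "staff"], "pos"),
    (["great", "display"], "pos"),
    (["rude", "staff", "awful", "food"], "neg"),
]
lexicon = build_lexicon(docs)
```

Terms that occur mostly in positive documents receive a positive score and terms that occur mostly in negative documents receive a negative one; terms spread evenly across both stay near zero.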

Aspect Category Extractor
• Class labels are already known and limited
• Supervised Learning
• One classifier for each aspect category
• One-vs-all binary classifier
• Classification models available
• SVM, Maximum Entropy (according to the literature)
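
The one-classifier-per-category setup can be sketched as below. To stay self-contained, the example swaps in a tiny bag-of-words perceptron where the slides name SVM or Maximum Entropy; the class and function names and the toy training data are all hypothetical.

```python
class BinaryPerceptron:
    """Tiny bag-of-words perceptron, a stand-in for SVM or MaxEnt."""

    def __init__(self):
        self.weights = {}
        self.bias = 0.0

    def score(self, tokens):
        return self.bias + sum(self.weights.get(t, 0.0) for t in tokens)

    def fit(self, samples, epochs=10):
        # samples: list of (tokens, is_member) pairs, is_member a bool.
        for _ in range(epochs):
            for tokens, is_member in samples:
                predicted = self.score(tokens) > 0
                if predicted != is_member:
                    step = 1.0 if is_member else -1.0
                    self.bias += step
                    for t in tokens:
                        self.weights[t] = self.weights.get(t, 0.0) + step


def train_one_vs_all(data, categories):
    """Train one binary classifier per aspect category."""
    models = {}
    for category in categories:
        # Positive examples: sentences labeled with this category; the rest are negative.
        samples = [(tokens, category in labels) for tokens, labels in data]
        model = BinaryPerceptron()
        model.fit(samples)
        models[category] = model
    return models


def predict(models, tokens):
    """Return every category whose classifier fires for the sentence."""
    return {c for c, m in models.items() if m.score(tokens) > 0}


# Toy training data, invented for the example.
data = [
    (["shrimp", "was", "awesome"], {"Food#Quality"}),
    (["pasta", "was", "tasty"], {"Food#Quality"}),
    (["bill", "was", "over-priced"], {"Food#Prices"}),
    (["too", "expensive"], {"Food#Prices"}),
    (["shrimp", "was", "awesome", "but", "over-priced"], {"Food#Quality", "Food#Prices"}),
]
categories = ["Food#Quality", "Food#Prices"]
models = train_one_vs_all(data, categories)
```

Because each category has its own binary decision, a sentence can receive several categories at once, or none, which suits sentences that mention more than one aspect.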

Sentiment Analyzer
• This is a binary classification problem
• Classification models available
– SVM, MaxEnt, Naïve Bayes (according to the literature)
• Classification features
– Domain-specific features: features from the in-domain sentiment lexicon
– Part-of-speech features: number of adjectives, adverbs, and nouns in the sentence
– Negation feature: a single binary feature determined by whether there is any negation in the sentence
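
The three feature groups could be assembled into one feature dictionary per sentence, as in the following sketch. It assumes the sentence has already been tagged with Penn Treebank part-of-speech tags by an external tagger; the `NEGATION_WORDS` list, the feature names, and the toy lexicon values are hypothetical.

```python
# Hypothetical negation cue list; a real system would use a fuller one.
NEGATION_WORDS = {"not", "no", "never", "n't", "isn't", "won't", "cannot"}


def extract_features(tagged_tokens, lexicon):
    """Build a feature dict for one sentence.

    `tagged_tokens` is a list of (token, tag) pairs with Penn Treebank tags,
    assumed to come from an external POS tagger. `lexicon` maps terms to
    sentiment scores from the in-domain sentiment lexicon.
    """
    tokens = [tok.lower() for tok, _ in tagged_tokens]
    scores = [lexicon[t] for t in tokens if t in lexicon]
    return {
        # Features from the in-domain sentiment lexicon.
        "lex_sum": sum(scores),
        "lex_max": max(scores, default=0.0),
        "lex_min": min(scores, default=0.0),
        # Part-of-speech counts (JJ* adjectives, RB* adverbs, NN* nouns).
        "n_adjectives": sum(tag.startswith("JJ") for _, tag in tagged_tokens),
        "n_adverbs": sum(tag.startswith("RB") for _, tag in tagged_tokens),
        "n_nouns": sum(tag.startswith("NN") for _, tag in tagged_tokens),
        # Single binary negation feature.
        "has_negation": int(any(t in NEGATION_WORDS for t in tokens)),
    }


# Toy input, invented for the example.
tagged = [("The", "DT"), ("atmosphere", "NN"), ("is", "VBZ"),
          ("n't", "RB"), ("the", "DT"), ("greatest", "JJS")]
toy_lexicon = {"greatest": 1.5, "atmosphere": 0.25}
features = extract_features(tagged, toy_lexicon)
```

A dictionary of named features like this can be passed to any of the listed classifiers after vectorization.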

CURRENT PROGRESS

Datasets
• Laptop Reviews Dataset, from Amazon.com
• Restaurants Dataset, from the Ganu et al. study
Annotation Process
• 3 annotators involved

Initial Data Analysis
Restaurants Data Set (Train) – RapidMiner

Initial Data Analysis
Aspect Category Frequency Distribution – Restaurants Domain

Initial Data Analysis
Laptop Data Set (Train) – RapidMiner

Initial Data Analysis
Aspect Category Frequency Distribution – Laptops Domain

Evaluation
• Aspect Category Extraction
– Precision and Recall
– F-Score
• Sentiment Polarity
– Cross Validation (k-fold validation)
– Precision and Recall (compare with two algorithms)
– F-Score
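
The listed metrics can be computed as in the sketch below. It assumes micro-averaging over per-sentence label sets and contiguous, unshuffled folds; both are choices made for this illustration, and the function names and the `Service#General` category are hypothetical.

```python
def precision_recall_f1(gold, predicted):
    """Micro-averaged precision, recall and F1 over per-sentence label sets."""
    tp = sum(len(g & p) for g, p in zip(gold, predicted))
    n_pred = sum(len(p) for p in predicted)
    n_gold = sum(len(g) for g in gold)
    precision = tp / n_pred if n_pred else 0.0
    recall = tp / n_gold if n_gold else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


def k_fold_indices(n_samples, k):
    """Yield (train_indices, test_indices) for k contiguous folds."""
    # The first n_samples % k folds get one extra sample.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        test = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n_samples))
        yield train, test
        start += size


# Toy gold and predicted labels, invented for the example.
gold = [{"Food#Quality", "Food#Prices"}, {"Service#General"}]
pred = [{"Food#Quality"}, {"Service#General", "Food#Prices"}]
p, r, f = precision_recall_f1(gold, pred)
folds = list(k_fold_indices(5, 2))
```

In practice the data would usually be shuffled before splitting, so that each fold has a similar distribution of aspect categories.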

Progress Overview
Completed
• Literature survey
• Design
• Dataset Understanding
• Existing System
• Preprocessing Module
To-do
• Implementation of modules
• Test and Evaluation
• Completing the Thesis
