Deep learning based Domain-specific text generation for online harassment detection

Deep learning based Domain-specific text
generation for online harassment detection
Master’s Thesis Defense
Abhishek Nalamothu
Kno.e.sis Center
Department of Computer Science and
Engineering
Wright State University, Dayton, Ohio
June 25th, 2019
Advisor:
Dr. Amit P. Sheth
Committee Members:
Dr. KeKe Chen,
Dr. Valerie Shalin
Mentor:
Dr. Shreyansh Bhatt

Outline
• Online harassment
• Data related problems to detect harassment
• Ways to solve
- Text generation
• Problems with state of the art text generation models
• Solution
• Evaluation and results
• Conclusion and future work

7%
4-in-10 62%
Young folks use social networking
sites as a tool
Unfiltered Anonymous
- 200576% - 2017
Online Harassment[1]

Automatic online Harassment detection

In our examination…
1. Abusive and hate speech tweets [2]
43 K tweets
3.5 K labeled as harassment (8.1%)
39.5 K labeled as Normal (91.9%)
2. Scarcity of positive labeled data
Harassment detection
Machine learning
Well balanced training data is important

1. Collecting more positive labeled data.
2. Active learning
3. Data augmentation by generating self-diverse and different synthetic samples of
minority class
Ways to address …
Text AugmentationText Generation Data Balance

Text Generation
Text generation is one of the most attractive problems in NLP community
Text generation = the conditional probability of the next word in a sequence
P(wn|w1,w2,w3,w4,..wn-1)
Example: He, is, not, that, retard
Deep learning models become state-of-the-art

State-of-the-art Text Generation Models
Recurrent neural networks (RNNs)[3,4]
problem: Long term dependency learning
Example : I grew up in “France”,…….. I speak “French” fluently.
RNNs with LSTM [5,6,7]
Uses memory cell
Problem: Exposure Bias
The objective is to maximize the
likelihood of true token in the training
sequence
Auto Regressive Models

Generative Adversarial Nets
Generator : Learns to confuse
Discriminator by generating high quality
data
Discriminator : Learns to distinguish
whether a given data instance is real or not
State-of-the-art Text Generation Models
Sequence GANs [8,9,10]
Rewarder : High rewards to training data
Low rewards to generated data
Generator : Aims to get high rewards
Rank GANs [11,12,13]
Ranker : High rank to training data
Low rank to generated data
Generator : Aims to get high rank

Sparse Rewards
Mode Collapse
I’m not sexist but it is female face I’m face not sexist
Example :
SeqGAN & Rank GAN problems
Potential to generate meaningful sentence
Not meaningful
Doesn’t help the model to learn until meaningful part of the
sentence
Generator generates a limited diversity of samples, or
even the same sample, regardless of the input.

Inverse Reinforcement learning
Based GANs [14]
Solution To Reward Sparsity : Rewards at token level
Solution to mode collapse : Entropy
Rewarder objective (similar to previous implementations)
High rewards to training data
Low rewards to generated data

The objective of text generator is to maximize the expected reward plus an entropy.
Where,
τ : Generated sentence.
qθ(τ ) : generator
R∅(τ ) : rewarder
Maximize the expected rewards
of the generated texts
Meaning
Maximize Entropy
Diversity
Generator Objective
Based GANs

• After certain iterations, generated sentence starts loses it’s meaning
Where,
τ : Generated sentence.
qθ(τ ) : generator
R∅(τ ) : rewarder
Maximize the expected rewards
of the generated texts
Meaning
Maximize Entropy
Diversity
Generator Objective problem
Based GANs

Incorporated a term which uses domain specific knowledge to the preserve meaning.
• The objective of text generator is to maximize the following:
The expected reward
The sentence level cosine similarity between train data and generated data
An entropy
Maximize the expected
rewards
to the generated texts
Meaning
Maximize Entropy
Diversity
Uses domain-specific
knowledge to preserve
meaning
IRL With Domain-Specific Knowledge

Generator objective plot
Relatively…
Better meaningful sentences
Improves..
Balance between diversity and meaning

Rewarder objective plot
Relatively…
Better rewards

Dataset - 43000 tweets
1. RT @Rambobiggs: Holy hell these people are disgusting https://t.co/or6RE4DRPd
2. can someone sum this up before i call this guy retarded https://t.co/yuQVEUc
3. RT @ashllyd: SICK OF BITCHES ON THE INTERNET 🐍🙅👉https://t.co/BkyqCFx64G
@UKBloggers1 @FemaleBloggerRT @TheGirlGangHQ #fbloggers #fblchat
4. RT @Stafaa__: I hate them hoe ass braids 😂🙌🏾 https://t.co/fr5gMyp4rJ
5. RT @ayevonnn: bruh i fucking hate people like this 😤 https://t.co/dceEXQhnhq
6. @SenWarren @SenJeffMerkley Yes because u idiots never run out of nonsense…
7. RT @notwaving: I hate it when I'm trying to board a bus and there's already an asshole
on it. https://t.co/Qps29bAaoA
Training
Data
3150
(90%) Harassment
(Positive
Labeled)
Tweets
3500
Test
Data
350
(10%)
1. @discordapp bro i cant fucking wait for the Video Screensharing!!! 😭😂😭😂
2. Forgetting to pack a sports bra for the gym is the fucking worst 👿
3. @zezrie @Jezebel It's like it's a fucking crime to have a vagina in this country now! 😭😡👎🏾👎🏾
4. the lyrics is perfect, the melody is so fucking sexy and the guitar OMFG congrats @ShawnMendes
#JFCShawnMendes
5. RT @IIXXIV_: My face be oily as hell when I wake up n I hate it.
6. BLOODY HELL YES I MISS HIM 😭😭 https://t.co/k2v8u3bPDq
7. Sprint is killing its best 50 percent off deal https://t.co/8GvX6DHF5n https://t.co/wWcCgx1fBe ……
https://t.co/YnihEdvIOO https://t.co/Yry1YK0P0d
Normal
(Negative
Labeled)
Tweets
39600

Tweet-Preprocessor
1. RT @Rambobiggs: Holy hell these people are disgusting https://t.co/or6RE4DRPd
2. can someone sum this up before i call this guy retarded https://t.co/yuQVEUc
3. RT @ashllyd: SICK OF BITCHES ON THE INTERNET 🐍🙅👉https://t.co/BkyqCFx64G @UKBloggers1
@FemaleBloggerRT @TheGirlGangHQ #fbloggers #fblchat
4. @discordapp bro i cant fucking wait for the Video Screensharing!!! 😭😂😭😂
5. BLOODY HELL YES I MISS HIM 😭😭 https://t.co/k2v8u3bPDq
URL, Mention, Hashtag, Reserved Words, Emoji
1. holy hell these people are disgusting
2. can someone sum this up before i call this guy retarded
3. sick of bitches on the internet
4. bro i cant fucking wait for the video screensharing
5. bloody hell yes i miss him

Domain Specific Data Examples:
ugh crippling self doubt is the worst
the number 1 angel shows are gonna be fucking wiiiiild
bro i cant fucking wait for the video screensharing
Andy carroll at the vodafone stadium tonight n cannot fucking wait
yesssssssssss screams oh my bloody hell that would be my two fav youtuber collabing
Domain- Specific data
Domain-specific
data
(Tweets)
Skip-gram Model Word embedding
Abusive and hate speech tweets [1] as domain specific data
Size: 43 K tweets

Tweets generation
Inverse Reinforcement
Learning based GANs
with Domain-Specific
Knowledge
(Our Model)
Harassment Tweets
(Positive Labeled)
Training data : 3150
Vocabulary : ~ 6200
Domain-Specific data
(closely related) 43000
Embeddings
Generated
Tweets
600
(~ 20% of
training data)

Generated Tweets
@ user trump is one ugly hoe
@ user sick i hate the metro
@ user fuck yes i would have slapped the fuck out her
@ user islamic state says us being run by an idiot is getting disgusting
@ user he was just a puppet idiot on a string
@ user yo momma had them ugly ass heels not me
@ user this idiot called it devoutness instead of wormers
@ user islamic state says us being run by an idiot is fucked up
@ user pissed off because im sick and i cannot sleep because of it
@ user you had this worst fucking part and chest like signboard

Diversity
Tweet from Train data: akademiks is fucking ugly lmfaooooo
Generated tweet: akademiks is fucking disgusting
Tweets from Train data:
fuck sake do not let this slip you retard
i hate when a manly looking bitch say she want a nigga like shorty you is a nigga
do not you hate when a bitch say you cannot fight like come here bitch lemme see where yoo hands
i hate bitches that do not know how to mind they business
i hate bitches that say this
i hate bitches that always wanna argue
i hate bitches that wear heels but cannot walk in em
Generated tweet: let this bitch say i hate bitches
Example 1
Example 2

Diversity
i can not wait to be done with these disgusting bitches
i be on my ugly ass niggah line
Generated tweet: done with these ugly ass niggah
two beautiful and sexy best woman hot body sensual hot scene ********* ******* ********
bad bitches do not take days off
some ugly bitches on tonights comedine with me show
yessss people underestimate this they dont wanna gag and shit but do it slurp on that dick
bitch do your fucking homework
you are out of your fucking mind your disgusting lying reckless fucking mind
roman beating undertaker is one thing but that was his fucking retirement match
Generated tweet: two beautiful bitches and shit in your fucking retirement world
Example 3
Example 4

Harassment detection pipeline setup
Augmentation improves detection task by 10%

• Originally BLEU [15] is a metric to evaluate the quality of machine-translated text.
• BLEU is to compute the quality generated set
Example
Calculation:
The Higher the BLEU score is, the better quality the generator gets.
Problem :
Objective of our model is to generate data which is diverse from train data set.
BiLingual Evaluation Understudy (BLEU)
Candidate the the cat
Reference 1 the cat is on the mat

Evaluation
metric
Measures.. Reference set Evaluated set Desired
direction
BLEUdiv
Train-diversity Train data Generated data Lower
Self-BLEU Self-diversity Generated sentence Remaining generated
sentences
Lower
Normalized
perplexity
Quality Domain specific data Generated data Higher
Cosine Similarity Quality Train data Generated data Higher
Evaluation metrics

Normalized perplexity:
Perplexity

Future work
Better knowledge embedding
Style based generation

References
[1] Perrin, Andrew. "Social media usage: 2005–2015. 2015." URL http://www. pewinternet.
org/2015/10/08/social-networking-usage-2005-2015 (2017).
[2] Founta, Antigoni Maria, et al. "Large scale crowdsourcing and characterization of twitter abusive
behavior." Twelfth International AAAI Conference on Web and Social Media. 2018.
[3] Mikolov, Tomáš, et al. "Recurrent neural network based language model." Eleventh annual conference of the
international speech communication association. 2010.
[4] Mikolov, Tomáš, et al. "Extensions of recurrent neural network language model." 2011 IEEE International
Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2011.
[5] Hochreiter, Sepp, and Jürgen Schmidhuber. "Long short-term memory." Neural computation 9.8 (1997):
1735-1780.
[6] Graves, Alex. "Generating sequences with recurrent neural networks." arXiv preprint arXiv:1308.0850
(2013).
[7] Sutskever, Ilya, Oriol Vinyals, and Quoc V. Le. "Sequence to sequence learning with neural networks."
Advances in neural information processing systems. 2014.
[8]Yu, Lantao, et al. "Seqgan: Sequence generative adversarial nets with policy gradient." Thirty-First AAAI
Conference on Artificial Intelligence. 2017.

References
[9]Shrivastava, Ashish, et al. "Learning from simulated and unsupervised images through adversarial
training." Proceedings of the IEEE conference on computer vision and pattern recognition. 2017.
[10] Li, Jiwei, et al. "Adversarial learning for neural dialogue generation." arXiv preprint
arXiv:1701.06547 (2017).
[11]Lin, Kevin, et al. "Adversarial ranking for language generation." Advances in Neural Information
Processing Systems. 2017.
[12]Guo, Jiaxian, et al. "Long text generation via adversarial training with leaked
information." Thirty-Second AAAI Conference on Artificial Intelligence. 2018.
[13]Yang, Zichao, et al. "Unsupervised text style transfer using language models as
discriminators." Advances in Neural Information Processing Systems. 2018.
[14]Z. Shi, X. Chen, X. Qiu, and X. Huang, “Towards diverse text generation with inverse
reinforcement learning,” arXiv preprint arXiv:1804.11258, 2018.
[15] Kishore Papineni, Salim Roukos, Todd Ward, and WeiJing Zhu. 2002. Bleu: a method for
automatic evaluation of machine translation. In Proceedings of the 40th Annual Meeting of the
Association for Computational Linguistics, July 6-12, 2002, Philadelphia, PA, USA., pages 311–318.
ACL.

Acknowledgement
Committee members:
Dr. Amit Sheth
(Advisor)
Dr. Valerie ShalinDr. Keke Chen
Dr. Shreyansh Bhatt
Dr. Krishnaprasad Thirunarayan
co- author:

Deep learning based Domain-specific text generation for online harassment detection

Recommended

Recommended

More Related Content

Similar to Deep learning based Domain-specific text generation for online harassment detection

Similar to Deep learning based Domain-specific text generation for online harassment detection (20)

Recently uploaded

Recently uploaded (20)

Deep learning based Domain-specific text generation for online harassment detection

Editor's Notes