SlideShare ist ein Scribd-Unternehmen logo
1 von 24
Downloaden Sie, um offline zu lesen
ENHANCING TWITTER
SPAM DETECTION USING
CROSS ACCOUNT
PATTERN MATCHING.
By Ambarish Pande
Contents
▸ Introduction
▸ Motivation
▸ Proposed Algorithm
▸ Implementation Details
▸ Advantages and Drawbacks
▸ Conclusion and Future work
Introduction
▸ Emerging Social Networks.
▹ Popularity of Facebook and Twitter
▹ 1550 Million active FB users.
▹ 320 Million active Twitter Users.
▹ Global Reach
▹ Multi-platform
▸ Social Network’s Revenue Model
▹ Advertising
▹ 85% of Twitter’s Revenue comes
from advertising
Motivation
▸ The Problem
▹ Social networks like twitter provide
a legal way of publicizing content.
▹ Some companies go for illegal
methods like Spam Accounts.
▹ Huge Revenue Loss to Twitter
10,000,00 $ /YrMillions of Dollars per year. That’s a lot of money!
Motivation
▸ Existing Solution
▹ Twitter’s spam detection algorithm
focuses on criteria such as:
▹ harmful links
▹ aggressive following behavior
▹ posting to trending topics,
▹ posting duplicated tweets
▹ Low profile activity
▸ Drawbacks
▹ Spammers have evolved.
▹ Now Twitter cannot detect spam
based on existing algorithm
Proposed Algorithm
▸ Emphasis on interaction between
accounts and not on individual
accounts.
▸ Finding pattern with existing spam
tweets.
▸ Detecting spam accounts based on
tweets and spam tweets based on
accounts.
FLOW
CHART
TO
DETECT
SPAM
Identify Tweets with Malicious Links
Mining Spam Patterns
Spam Likelihood Estimation
Proposed Algorithm
Stage 1 :Identify Tweets with Malicious Links.
1. Collect tweets and user info.
2. Follow links in the Tweet
3. Check whether it is flagged by Twitter or any
other URL Shortening services (goo.gl or
bit.ly)
4. If yes Mark as Spam Else no
Leverage Twitter’s Database of Malicious
links.
Proposed Algorithm
Stage 2: Mining Spam Patterns. .
1. Strip off all URLS, @user mentions and
#hashtags.
2. Strip off all non alphanumeric characters
such as digits 0-9 or characters like *,!,@,#.
3. Create a hash for each stripped off tweet.
4. Compare the hash with hashes of other
tweets.
Find Pattern
Proposed Algorithm
Stage 3: Spam Likelihood Estimation.
1. Iterate through users and assign spam
scores based on the user’s tweets.
2. Iterate through tweets and assign spam
score based on the users of tweet.
Calculate Spam Score
Proposed Algorithm
Stage 3: Spam Likelihood Estimation.
Here comes the MATH
Proposed Algorithm
Implementation Details
▸ Data Collection
▹ Twitter java API - Twitter4j
▹ Registering App with twitter.
Implementation Details
▸ Data Storage
▹ MySQL database.
3,79,867tweets
3,129users
Implementation Details
▸ Twitter API has Rate Limits to Number of
Requests.
▸ 180 Request / 15 min
Implementation Details
▸ Stage 1 Implementation
▹ JSoup - Web Crawler for Java
● t.co - Warning: this link may
be unsafe
● Goo.gl - The site ahead
contains malware
● Bit.ly - STOP - there might be
a problem with the
requested link
Implementation Details
▸ Stage 1 Stats
▹ After implementing the first stage of the
algorithm
Implementation Details
▸ Stage 2 Implementation
▹ Regular Expressions to Strip Off
#hashtags, @user mentions, URLs,
special characters and numbers
▹ Used MD5 Algorithm to generate unique
hashes.
▹ Tweets with same hash values were
marked as spam.
Implementation Details
▸ Stage 2 stats
▹ 13015 duplicate hashes were found
▹ It covered 70,728 tweets
Implementation Details
▸ Stage 3 Stats
▹ Spam tweets which were not initially
labelled by first two stages were found
out.
▹ Users which tweet more spam were
assigned high Spam Score.
▹ And tweets which are tweeted by such
accounts are also assigned higher Spam
Score
Drawbacks
▸ Not good enough in detecting human
controlled spam accounts.
Advantages
▸ Detects bot controlled spam accounts.
▸ Easily detect Spam Campaigns.
▸ Spam tweets with different user mentions
and links are also detected.
▸ Excessive ReTweets to unrelated topics are
also treated as Spam.
Conclusion and Future Work
▸ Cross Account pattern matching method is
highly effective.
▸ Old Methods do not work nowadays.
▸ For Future Work
▹ Clustering of tweets to understand
topics which spammers use the most
▹ Providing a real time spam discovery
solution by implementing Machine
Learning.
Refrences
[1] Publication
http://dl.ifip.org/db/conf/im/im2015m/1374
46.pdf
THANKS!
Any questions?

Weitere ähnliche Inhalte

Was ist angesagt?

Document from Sidra Saghir Asim.pptx
Document from Sidra Saghir Asim.pptxDocument from Sidra Saghir Asim.pptx
Document from Sidra Saghir Asim.pptx
sidrasagheer1
 

Was ist angesagt? (20)

Facebook Password Sniper
Facebook Password SniperFacebook Password Sniper
Facebook Password Sniper
 
Account Fraud Situation and Prevention in Rakuten
Account Fraud Situation and Prevention in RakutenAccount Fraud Situation and Prevention in Rakuten
Account Fraud Situation and Prevention in Rakuten
 
3 Steps to Stopping Social Media Account Hacks
3 Steps to Stopping Social Media Account Hacks3 Steps to Stopping Social Media Account Hacks
3 Steps to Stopping Social Media Account Hacks
 
Yet another talk on bug bounty
Yet another talk on bug bountyYet another talk on bug bounty
Yet another talk on bug bounty
 
Classifying fake news articles using natural language processing to identify ...
Classifying fake news articles using natural language processing to identify ...Classifying fake news articles using natural language processing to identify ...
Classifying fake news articles using natural language processing to identify ...
 
The Human Side of Security
The Human Side of SecurityThe Human Side of Security
The Human Side of Security
 
Bug bounty hunting
Bug bounty huntingBug bounty hunting
Bug bounty hunting
 
Document from Sidra Saghir Asim.pptx
Document from Sidra Saghir Asim.pptxDocument from Sidra Saghir Asim.pptx
Document from Sidra Saghir Asim.pptx
 
What I learned from protecting over 1 million crypto traders and investors fr...
What I learned from protecting over 1 million crypto traders and investors fr...What I learned from protecting over 1 million crypto traders and investors fr...
What I learned from protecting over 1 million crypto traders and investors fr...
 
WCBham Beginner WordPress Security
WCBham Beginner WordPress SecurityWCBham Beginner WordPress Security
WCBham Beginner WordPress Security
 
Computer SOS
Computer SOSComputer SOS
Computer SOS
 
Irm 13-phishing
Irm 13-phishingIrm 13-phishing
Irm 13-phishing
 
Backup-File Artifacts - OWASP Khartoum InfoSec Sessions 2016 - Mazin Ahmed
Backup-File Artifacts - OWASP Khartoum InfoSec Sessions 2016 - Mazin AhmedBackup-File Artifacts - OWASP Khartoum InfoSec Sessions 2016 - Mazin Ahmed
Backup-File Artifacts - OWASP Khartoum InfoSec Sessions 2016 - Mazin Ahmed
 
Nerino Petro - Protecting your Digital ASSets: What we can learn from recent ...
Nerino Petro - Protecting your Digital ASSets: What we can learn from recent ...Nerino Petro - Protecting your Digital ASSets: What we can learn from recent ...
Nerino Petro - Protecting your Digital ASSets: What we can learn from recent ...
 
Sarah frye ceo
Sarah frye ceoSarah frye ceo
Sarah frye ceo
 
Hiroshima University Information Security & Compliance 2017
Hiroshima University Information Security & Compliance 2017Hiroshima University Information Security & Compliance 2017
Hiroshima University Information Security & Compliance 2017
 
Ways to protect your data from hacking
Ways to protect your data from hackingWays to protect your data from hacking
Ways to protect your data from hacking
 
Hackers secrets
Hackers secretsHackers secrets
Hackers secrets
 
WordPress Security
WordPress SecurityWordPress Security
WordPress Security
 
The internet and safety
The internet and safetyThe internet and safety
The internet and safety
 

Andere mochten auch

12 ways trending twitter topics and hashtags may not be working for you
12 ways trending twitter topics and hashtags may not be working for you12 ways trending twitter topics and hashtags may not be working for you
12 ways trending twitter topics and hashtags may not be working for you
Online Promotion Success, Inc.
 
E Mail & Spam Presentation
E Mail & Spam PresentationE Mail & Spam Presentation
E Mail & Spam Presentation
newsan2001
 
Get started with dropbox
Get started with dropboxGet started with dropbox
Get started with dropbox
nick1331
 
sellsheetfinalsail
sellsheetfinalsailsellsheetfinalsail
sellsheetfinalsail
Gary Bush
 

Andere mochten auch (19)

Twitter Spam
Twitter SpamTwitter Spam
Twitter Spam
 
Spam, security
Spam, securitySpam, security
Spam, security
 
Microposts2015 - Social Spam Detection on Twitter
Microposts2015 - Social Spam Detection on TwitterMicroposts2015 - Social Spam Detection on Twitter
Microposts2015 - Social Spam Detection on Twitter
 
Twitter Content-based Spam Filtering - CISIS 2013
Twitter Content-based Spam Filtering - CISIS 2013Twitter Content-based Spam Filtering - CISIS 2013
Twitter Content-based Spam Filtering - CISIS 2013
 
12 ways trending twitter topics and hashtags may not be working for you
12 ways trending twitter topics and hashtags may not be working for you12 ways trending twitter topics and hashtags may not be working for you
12 ways trending twitter topics and hashtags may not be working for you
 
Graph-based KNN Algorithm for Spam SMS Detection
Graph-based KNN Algorithm for Spam SMS DetectionGraph-based KNN Algorithm for Spam SMS Detection
Graph-based KNN Algorithm for Spam SMS Detection
 
Bulk sms
Bulk smsBulk sms
Bulk sms
 
Spamming and Spam Filtering
Spamming and Spam FilteringSpamming and Spam Filtering
Spamming and Spam Filtering
 
E Mail & Spam Presentation
E Mail & Spam PresentationE Mail & Spam Presentation
E Mail & Spam Presentation
 
Spam Filtering
Spam FilteringSpam Filtering
Spam Filtering
 
Spam
SpamSpam
Spam
 
Sebastián salas hernández
Sebastián salas hernándezSebastián salas hernández
Sebastián salas hernández
 
CV June 2015 - Martins
CV June 2015 - MartinsCV June 2015 - Martins
CV June 2015 - Martins
 
Ms word
Ms wordMs word
Ms word
 
Get started with dropbox
Get started with dropboxGet started with dropbox
Get started with dropbox
 
Omega T
Omega TOmega T
Omega T
 
Draftexp2
Draftexp2Draftexp2
Draftexp2
 
Emily t
Emily tEmily t
Emily t
 
sellsheetfinalsail
sellsheetfinalsailsellsheetfinalsail
sellsheetfinalsail
 

Ähnlich wie Enhancing Twitter spam discovery using cross account pattern matching.

PhishAri: Automatic Realtime Phishing Detection on Twitter
PhishAri: Automatic Realtime Phishing Detection on TwitterPhishAri: Automatic Realtime Phishing Detection on Twitter
PhishAri: Automatic Realtime Phishing Detection on Twitter
Anupama Aggarwal
 

Ähnlich wie Enhancing Twitter spam discovery using cross account pattern matching. (20)

Classification Methods for Spam Detection in Online Social Network
Classification Methods for Spam Detection in Online Social NetworkClassification Methods for Spam Detection in Online Social Network
Classification Methods for Spam Detection in Online Social Network
 
Proguard: detecting malicious accounts in social-network-based online promotions
Proguard: detecting malicious accounts in social-network-based online promotionsProguard: detecting malicious accounts in social-network-based online promotions
Proguard: detecting malicious accounts in social-network-based online promotions
 
The Proper Tool is Everything: Using and Evaluating Affordable Social Media T...
The Proper Tool is Everything: Using and Evaluating Affordable Social Media T...The Proper Tool is Everything: Using and Evaluating Affordable Social Media T...
The Proper Tool is Everything: Using and Evaluating Affordable Social Media T...
 
IRJET- Twitter Spammer Detection
IRJET- Twitter Spammer DetectionIRJET- Twitter Spammer Detection
IRJET- Twitter Spammer Detection
 
Real Time Experiment Analytics at Pinterest with Apache Flink - Ben Liu & Par...
Real Time Experiment Analytics at Pinterest with Apache Flink - Ben Liu & Par...Real Time Experiment Analytics at Pinterest with Apache Flink - Ben Liu & Par...
Real Time Experiment Analytics at Pinterest with Apache Flink - Ben Liu & Par...
 
PhishAri: Automatic Realtime Phishing Detection on Twitter
PhishAri: Automatic Realtime Phishing Detection on TwitterPhishAri: Automatic Realtime Phishing Detection on Twitter
PhishAri: Automatic Realtime Phishing Detection on Twitter
 
The Nonprofit Guide To Google Ad Grants 2023 (1).pptx
The Nonprofit Guide To Google Ad Grants 2023 (1).pptxThe Nonprofit Guide To Google Ad Grants 2023 (1).pptx
The Nonprofit Guide To Google Ad Grants 2023 (1).pptx
 
Setting Up a Machine Learning Platform
Setting Up a Machine Learning PlatformSetting Up a Machine Learning Platform
Setting Up a Machine Learning Platform
 
Setting up a Machine Learning Platform - Monitoring social media the “smart” way
Setting up a Machine Learning Platform - Monitoring social media the “smart” waySetting up a Machine Learning Platform - Monitoring social media the “smart” way
Setting up a Machine Learning Platform - Monitoring social media the “smart” way
 
Google Analytics Workshop 2013
Google Analytics Workshop 2013Google Analytics Workshop 2013
Google Analytics Workshop 2013
 
Learn How To Drive Tonnes Of Traffic From Twitter On Autopilot!
Learn How To Drive Tonnes Of Traffic From Twitter On Autopilot!Learn How To Drive Tonnes Of Traffic From Twitter On Autopilot!
Learn How To Drive Tonnes Of Traffic From Twitter On Autopilot!
 
Unlocking the True Value of Social Media
Unlocking the True Value of Social MediaUnlocking the True Value of Social Media
Unlocking the True Value of Social Media
 
Setting up a Machine Learning Platform
Setting up a Machine Learning PlatformSetting up a Machine Learning Platform
Setting up a Machine Learning Platform
 
The Ultimate Easy-Peasy Guide to Tracking Inbound & Internal Marketing Campai...
The Ultimate Easy-Peasy Guide to Tracking Inbound & Internal Marketing Campai...The Ultimate Easy-Peasy Guide to Tracking Inbound & Internal Marketing Campai...
The Ultimate Easy-Peasy Guide to Tracking Inbound & Internal Marketing Campai...
 
Twitter Applications
Twitter ApplicationsTwitter Applications
Twitter Applications
 
Leveraging Twitter Tools for Maximum Use
Leveraging Twitter Tools for Maximum UseLeveraging Twitter Tools for Maximum Use
Leveraging Twitter Tools for Maximum Use
 
Twitter
TwitterTwitter
Twitter
 
Growth Hacking Fundamentals @ Echelon Jakarta (by Growth Hacking Asia)
Growth Hacking Fundamentals @ Echelon Jakarta (by Growth Hacking Asia)Growth Hacking Fundamentals @ Echelon Jakarta (by Growth Hacking Asia)
Growth Hacking Fundamentals @ Echelon Jakarta (by Growth Hacking Asia)
 
A DATA MINING APPROACH FOR FILTERING OUT SOCIAL SPAMMERS IN LARGE-SCALE TWITT...
A DATA MINING APPROACH FOR FILTERING OUT SOCIAL SPAMMERS IN LARGE-SCALE TWITT...A DATA MINING APPROACH FOR FILTERING OUT SOCIAL SPAMMERS IN LARGE-SCALE TWITT...
A DATA MINING APPROACH FOR FILTERING OUT SOCIAL SPAMMERS IN LARGE-SCALE TWITT...
 
DETECTION OF MALICIOUS SOCIAL BOTS USING ML TECHNIQUE IN TWITTER NETWORK
DETECTION OF MALICIOUS SOCIAL BOTS USING ML TECHNIQUE IN TWITTER NETWORKDETECTION OF MALICIOUS SOCIAL BOTS USING ML TECHNIQUE IN TWITTER NETWORK
DETECTION OF MALICIOUS SOCIAL BOTS USING ML TECHNIQUE IN TWITTER NETWORK
 

Kürzlich hochgeladen

Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
JoseMangaJr1
 
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men 🔝Thrissur🔝 Escor...
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men  🔝Thrissur🔝   Escor...➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men  🔝Thrissur🔝   Escor...
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men 🔝Thrissur🔝 Escor...
amitlee9823
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
amitlee9823
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
amitlee9823
 
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
karishmasinghjnh
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
amitlee9823
 
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
amitlee9823
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
amitlee9823
 
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
amitlee9823
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
only4webmaster01
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
amitlee9823
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 

Kürzlich hochgeladen (20)

Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
 
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men 🔝Thrissur🔝 Escor...
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men  🔝Thrissur🔝   Escor...➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men  🔝Thrissur🔝   Escor...
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men 🔝Thrissur🔝 Escor...
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
 
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
 
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
 
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 

Enhancing Twitter spam discovery using cross account pattern matching.

  • 1. ENHANCING TWITTER SPAM DETECTION USING CROSS ACCOUNT PATTERN MATCHING. By Ambarish Pande
  • 2. Contents ▸ Introduction ▸ Motivation ▸ Proposed Algorithm ▸ Implementation Details ▸ Advantages and Drawbacks ▸ Conclusion and Future work
  • 3. Introduction ▸ Emerging Social Networks. ▹ Popularity of Facebook and Twitter ▹ 1550 Million active FB users. ▹ 320 Million active Twitter Users. ▹ Global Reach ▹ Multi-platform ▸ Social Network’s Revenue Model ▹ Advertising ▹ 85% of Twitter’s Revenue comes from advertising
  • 4. Motivation ▸ The Problem ▹ Social networks like twitter provide a legal way of publicizing content. ▹ Some companies go for illegal methods like Spam Accounts. ▹ Huge Revenue Loss to Twitter 10,000,00 $ /YrMillions of Dollars per year. That’s a lot of money!
  • 5. Motivation ▸ Existing Solution ▹ Twitter’s spam detection algorithm focuses on criteria such as: ▹ harmful links ▹ aggressive following behavior ▹ posting to trending topics, ▹ posting duplicated tweets ▹ Low profile activity ▸ Drawbacks ▹ Spammers have evolved. ▹ Now Twitter cannot detect spam based on existing algorithm
  • 6. Proposed Algorithm ▸ Emphasis on interaction between accounts and not on individual accounts. ▸ Finding pattern with existing spam tweets. ▸ Detecting spam accounts based on tweets and spam tweets based on accounts.
  • 7. FLOW CHART TO DETECT SPAM Identify Tweets with Malicious Links Mining Spam Patterns Spam Likelihood Estimation
  • 8. Proposed Algorithm Stage 1 :Identify Tweets with Malicious Links. 1. Collect tweets and user info. 2. Follow links in the Tweet 3. Check whether it is flagged by Twitter or any other URL Shortening services (goo.gl or bit.ly) 4. If yes Mark as Spam Else no Leverage Twitter’s Database of Malicious links.
  • 9. Proposed Algorithm Stage 2: Mining Spam Patterns. . 1. Strip off all URLS, @user mentions and #hashtags. 2. Strip off all non alphanumeric characters such as digits 0-9 or characters like *,!,@,#. 3. Create a hash for each stripped off tweet. 4. Compare the hash with hashes of other tweets. Find Pattern
  • 10. Proposed Algorithm Stage 3: Spam Likelihood Estimation. 1. Iterate through users and assign spam scores based on the user’s tweets. 2. Iterate through tweets and assign spam score based on the users of tweet. Calculate Spam Score
  • 11. Proposed Algorithm Stage 3: Spam Likelihood Estimation. Here comes the MATH
  • 13. Implementation Details ▸ Data Collection ▹ Twitter java API - Twitter4j ▹ Registering App with twitter.
  • 14. Implementation Details ▸ Data Storage ▹ MySQL database.
  • 15. 3,79,867tweets 3,129users Implementation Details ▸ Twitter API has Rate Limits to Number of Requests. ▸ 180 Request / 15 min
  • 16. Implementation Details ▸ Stage 1 Implementation ▹ JSoup - Web Crawler for Java ● t.co - Warning: this link may be unsafe ● Goo.gl - The site ahead contains malware ● Bit.ly - STOP - there might be a problem with the requested link
  • 17. Implementation Details ▸ Stage 1 Stats ▹ After implementing the first stage of the algorithm
  • 18. Implementation Details ▸ Stage 2 Implementation ▹ Regular Expressions to Strip Off #hashtags, @user mentions, URLs, special characters and numbers ▹ Used MD5 Algorithm to generate unique hashes. ▹ Tweets with same hash values were marked as spam.
  • 19. Implementation Details ▸ Stage 2 stats ▹ 13015 duplicate hashes were found ▹ It covered 70,728 tweets
  • 20. Implementation Details ▸ Stage 3 Stats ▹ Spam tweets which were not initially labelled by first two stages were found out. ▹ Users which tweet more spam were assigned high Spam Score. ▹ And tweets which are tweeted by such accounts are also assigned higher Spam Score
  • 21. Drawbacks ▸ Not good enough in detecting human controlled spam accounts. Advantages ▸ Detects bot controlled spam accounts. ▸ Easily detect Spam Campaigns. ▸ Spam tweets with different user mentions and links are also detected. ▸ Excessive ReTweets to unrelated topics are also treated as Spam.
  • 22. Conclusion and Future Work ▸ Cross Account pattern matching method is highly effective. ▸ Old Methods do not work nowadays. ▸ For Future Work ▹ Clustering of tweets to understand topics which spammers use the most ▹ Providing a real time spam discovery solution by implementing Machine Learning.