SlideShare ist ein Scribd-Unternehmen logo
1 von 38
GRAMPA, WHAT'S A
DELETED TWEET?
Mohammed Nauman Siddique
Web Archiving Forensics (CS 895)
Spring, 2019
Web Science and Digital Libraries Group
Old Dominion University
Norfolk, Virginia, USA
@WebSciDL
Presidential tweets are now government records
@m_nsiddique, @WebSciDL 2
Source: https://web.archive.org/web/20170121171210/http:/twitter.com/realDonaldTrump/status/822853741040771072
News Article: https://theconversation.com/donald-trumps-tweets-are-now-presidential-records-71973
11% of the social media resources are lost in their first year
@m_nsiddique, @WebSciDL 3
Source: SalahEldeen H.M., Nelson M.L. (2012) Losing My Revolution: How Many Resources Shared on Social Media
Have Been Lost?. TPDL 2012. Springer, Berlin, Heidelberg
Blog Link: http://ws-dl.blogspot.com/2012/02/2012-02-11-losing-my-revolution-year.html
Politwoops: Tracks deleted tweets by public officials
@m_nsiddique, @WebSciDL 4
Source: https://projects.propublica.org/politwoops/
The best way to find a typo is to hit send
@m_nsiddique, @WebSciDL 5
Source: https://projects.propublica.org/politwoops/tweet/1056626382548156416
Fixing typos only introduces more typos
@m_nsiddique, @WebSciDL 6
Source: https://twitter.com/RepDannyDavis/status/1056627582148530177
Unretweeted after a year!!!
@m_nsiddique, @WebSciDL 7
Source: https://projects.propublica.org/politwoops/tweet/910352940749254657
Twitter revoked access to Politwoops
@m_nsiddique, @WebSciDL 8
Source: https://www.businessinsider.com/twitter-bans-politwoops-diplotwoops-global-api-access-revoked-open-state-foundation-
2015-8/
Politwoops resumes after 6 months
@m_nsiddique, @WebSciDL 9
Tweet is deleted
Source: https://blog.twitter.com/official/en_us/a/2015/holding-public-officials-accountable-with-twitter-and-politwoops.html
Flight handle is gone
@m_nsiddique, @WebSciDL 10
Source: https://twitter.com/Flight/status/656882929923059713
No worries web archives come to the rescue
@m_nsiddique, @WebSciDL 11
Source: https://web.archive.org/web/20160205000405/https://twitter.com/Flight/status/656882929923059713
Web archives include social media too
@m_nsiddique, @WebSciDL 12
Source: https://web.archive.org/web/20180929210711/https:/twitter.com/RepDannyDavis
Nauman, you are not archived
@m_nsiddique, @WebSciDL 13
Source: https://web.archive.org/web/*/https://twitter.com/m_nsiddique
@BreitbartNews is well archived
@m_nsiddique, @WebSciDL 14
Source: https://web.archive.org/web/*/https://twitter.com/BreitbartNews
@realDonaldTrump is very heavily archived
@m_nsiddique, @WebSciDL 15
Source: https://web.archive.org/web/*/https://twitter.com/realDonaldTrump
Archival captures for top level pages have approximately 20 tweets
@m_nsiddique, @WebSciDL 16
Source: https://web.archive.org/web/20190202074656/https:/twitter.com/realDonaldTrump
Tweet Ids are just a single tweet
@m_nsiddique, @WebSciDL 17
Source: https://web.archive.org/web/20190202054351/https://twitter.com/realdonaldtrump/status/1091427927475085312
Not enough to take screenshots
@m_nsiddique, @WebSciDL 18
Source: https://twitter.com/CasMudde/status/960546130684768256
News Article: https://www.huffingtonpost.com/entry/
breitbart-anti-muslim-tweet_us_5a78b426e4b0164659c70e15
Monday Morning Quarterbacking
@m_nsiddique, @WebSciDL 19
Source: https://twitter.com/BreitbartNews/status/960565890336149504
Penalty: Unsportsmanlike conduct
@m_nsiddique, @WebSciDL 20
Source: https://web.archive.org/web/20180205041213/http:/twitter.com/BreitbartNews/status/960353573581283329
How did we find the deleted tweets?
• Used Twitter API to fetch recent 3200 tweets
• Tweets spanned from Oct 22, 2017 to Feb 18, 2018
• Used Memgator, memento aggregator service to fetch
mementos
@m_nsiddique, @WebSciDL 21
Code to fetch recent tweets using Python-TwitterAPI
import twitter
api = twitter.Api(consumer_key='xxxxxx',
consumer_secret='xxxxxx',
access_token_key='xxxxxx',
access_token_secret='xxxxxx',
sleep_on_rate_limit=True)
twitter_response = api.GetUserTimeline(screen_name=screen_name,
count=200, include_rts=True)
@m_nsiddique, @WebSciDL 22
Run MemGator locally
$ memgator --contimeout=10s --agent=XXXXXX server
MemGator 1.0-rc7
_____ _______ __
/  _____ _____ / _____/______/ |___________
/ Y Y / __ / /  _____  _/ _ _ _ 
/ | |  ___/ Y Y  _ / __ | | |_| | | /
__/_____/______|_|__/_______/_____|__|___/|__|
TimeMap : http://localhost:1208/timemap/{FORMAT}/{URI-R}
TimeGate : http://localhost:1208/timegate/{URI-R} [Accept-
Datetime]
Memento :
http://localhost:1208/memento[/{FORMAT}|proxy]/{DATETIME}/{UR
I-R}
# FORMAT => link|json|cdxj
# DATETIME => YYYY[MM[DD[hh[mm[ss]]]]]
# Accept-Datetime => Header in RFC1123 format
@m_nsiddique, @WebSciDL 23
Source: https://github.com/oduwsdl/MemGator
TimeGate
@m_nsiddique, @WebSciDL 24
RFC: https://tools.ietf.org/html/rfc7089
Source: http://mementoweb.org/guide/quick-intro/
TimeMap
msiddique@atria:~$ curl -i
https://memgator.cs.odu.edu/timemap/link/http://example.org/index.html
HTTP/1.1 200 OK
Content-Type: application/link-format
Date: Wed, 06 Feb 2019 18:46:16 GMT
X-Generator: MemGator:1.0-rc7
X-Memento-Count: 93
Transfer-Encoding: chunked
<http://example.org/index.html>; rel="original",
<https://memgator.cs.odu.edu/timemap/link/http://example.org/index.html>;
rel="self"; type="application/link-format",
<http://web.archive.org/web/20021016101337/http://www.example.org:80/index.html>;
rel="first memento"; datetime="Wed, 16 Oct 2002 10:13:37 GMT",
<http://web.archive.org/web/20031207031049/http://www.example.org:80/index.html>;
rel="memento"; datetime="Sun, 07 Dec 2003 03:10:49 GMT",
….Deletia…
<http://web.archive.org/web/20190131051056/http://www.example.org/index.html>;
rel="last memento"; datetime="Thu, 31 Jan 2019 05:10:56 GMT",
@m_nsiddique, @WebSciDL 25
Play with TimeMap and TimeGate
@m_nsiddique, @WebSciDL 26
Source: http://memgator.cs.odu.edu/api.html
Code to fetch TimeMap for any Twitter handle
url = "http://localhost:1208/timemap/"
data_format = "cdxj"
command = url + data_format +
"/http://twitter.com/<screen-name>" +
response = requests.get(command)
@m_nsiddique, @WebSciDL 27
Code to parse tweet-related information
import bs4
soup = bs4.BeautifulSoup(open(<HTML representation of
Memento>),"html.parser")
match_tweet_div_tag = soup.select('div.js-stream-tweet')
for tag in match_tweet_div_tag:
if tag.has_attr("data-tweet-id"):
# Get Tweet id
...........
# Parse tweets
match_timeline_tweets = tag.select('p.js-tweet-
text.tweet-text')
...........
# Parse tweet timestamps
match_tweet_timestamp = tag.find("span", {"class":
"js-short-timestamp"})
...........
@m_nsiddique, @WebSciDL 28
Analysis of Breitbart News Deleted Tweets
• Of the 22 deleted tweets, 20 were of the form where
Breitbart News retweeted someone's tweet but the
original tweet was lost.
• Of those 20 tweets, 18 were from two affiliates of Breitbart
News, @NolteNC and @carney. Therefore, we decided to
have a look at both the accounts to determine the reason
for their deleted tweets.
@m_nsiddique, @WebSciDL 29
Unretweeted tweet by Breitbart News
@m_nsiddique, @WebSciDL 30
Source: https://web.archive.org/web/20180111162729/https:/twitter.com/BreitbartNews
Original Live Tweet
@m_nsiddique, @WebSciDL 31
Source: https://twitter.com/LibertarianBlue/status/951465189161029632
Breitbart News retweets another tweet
@m_nsiddique, @WebSciDL 32
Source: https://twitter.com/LibertarianBlue/status/951578248986710016
Analysis on @carney and @NolteNC
• Mementos fetched between Nov 3, 2017 and Feb 17,
2018
• Low number of mementos for @carney
• @NolteNC had 169 live tweets and 3569 deleted tweets
• Fetched live tweets using Twitter API for both accounts for
over two weeks
@m_nsiddique, @WebSciDL 33
Tweets older than a week on Tuesday and Saturday are deleted
@m_nsiddique, @WebSciDL 34
Tweets older than a week on Wednesday and Saturday are deleted
@m_nsiddique, @WebSciDL 35
• With 1000s of deleted tweets, it seemed unlikely that he
was manually deleting tweets.
• We have all the reasons to believe that @carney and
@NolteNC deleted tweets automatically using some tweet
deletion service.
@m_nsiddique, @WebSciDL 36
Deletion Behavior
Take Away
• It is not enough to make screen shots of controversial
tweets, rather we need to push it to the web archives for
longer retention capability than our personal archives.
• For finding deleted tweets, web archives work effectively
for popular accounts but for less popular accounts this
approach might not work.
• For finding deleted tweets, top level page works better
than individual Tweet Id URLs.
• Most deletions for Breitbart News come from automatic
deletion of tweets by some of its correspondents.
@m_nsiddique, @WebSciDL 37
You can read more on the blog
http://ws-dl.blogspot.com/2018/04/2018-04-23-grampa-
whats-deleted-tweet.html
@m_nsiddique, @WebSciDL 38

Weitere ähnliche Inhalte

Ähnlich wie Grampa, What's a deleted tweet?

Challenges in Replaying Archived Twitter Pages
Challenges in Replaying Archived Twitter PagesChallenges in Replaying Archived Twitter Pages
Challenges in Replaying Archived Twitter PagesKritika Garg
 
Web Archives for Verifying Attribution in Twitter Screenshots
Web Archives for Verifying Attribution in Twitter ScreenshotsWeb Archives for Verifying Attribution in Twitter Screenshots
Web Archives for Verifying Attribution in Twitter ScreenshotsTarannum Zaki
 
Weaponized Web Archives: Provenance Laundering of Short Order Evidence
Weaponized Web Archives: Provenance Laundering of Short Order Evidence Weaponized Web Archives: Provenance Laundering of Short Order Evidence
Weaponized Web Archives: Provenance Laundering of Short Order Evidence Michael Nelson
 
MINING OPINIONS ABOUT TRAFFIC STATUS USING TWITTER MESSAGES
MINING OPINIONS ABOUT TRAFFIC STATUS USING TWITTER MESSAGESMINING OPINIONS ABOUT TRAFFIC STATUS USING TWITTER MESSAGES
MINING OPINIONS ABOUT TRAFFIC STATUS USING TWITTER MESSAGESIAEME Publication
 
DataKind SG sharing of our first DataDive
DataKind SG sharing of our first DataDiveDataKind SG sharing of our first DataDive
DataKind SG sharing of our first DataDiveEugene Yan Ziyou
 
OSINT using Twitter & Python
OSINT using Twitter & PythonOSINT using Twitter & Python
OSINT using Twitter & Python37point2
 
Uncertainty in replaying archived Twitter pages
Uncertainty in replaying archived Twitter pagesUncertainty in replaying archived Twitter pages
Uncertainty in replaying archived Twitter pagesMichael Nelson
 
Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...
Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...
Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...Farida Vis
 
GIJC19 - NodeXL Tutorial - Session 1
GIJC19 - NodeXL Tutorial - Session 1GIJC19 - NodeXL Tutorial - Session 1
GIJC19 - NodeXL Tutorial - Session 1Harald Meier
 
From Chirps to Whistles - Discovering Event-specific Informative Content from...
From Chirps to Whistles - Discovering Event-specific Informative Content from...From Chirps to Whistles - Discovering Event-specific Informative Content from...
From Chirps to Whistles - Discovering Event-specific Informative Content from...Debanjan Mahata
 
Social Cards Probably Provide For Better Understanding Of Web Archive Collect...
Social Cards Probably Provide For Better Understanding Of Web Archive Collect...Social Cards Probably Provide For Better Understanding Of Web Archive Collect...
Social Cards Probably Provide For Better Understanding Of Web Archive Collect...Shawn Jones
 
All You Need To Know About LETTER WRITING
All You Need To Know About LETTER WRITINGAll You Need To Know About LETTER WRITING
All You Need To Know About LETTER WRITINGHeather Lee
 
Weaponized Web Archives: Provenance Laundering of Short Order Evidence
Weaponized Web Archives: Provenance Laundering of Short Order Evidence Weaponized Web Archives: Provenance Laundering of Short Order Evidence
Weaponized Web Archives: Provenance Laundering of Short Order Evidence Michael Nelson
 
Social-media reporting tools - Sona Patel - Fresno NewsTrain - 4.22-23.22
Social-media reporting tools - Sona Patel - Fresno NewsTrain - 4.22-23.22Social-media reporting tools - Sona Patel - Fresno NewsTrain - 4.22-23.22
Social-media reporting tools - Sona Patel - Fresno NewsTrain - 4.22-23.22News Leaders Association's NewsTrain
 
Spider Tag: #JVMChat 3/31/15
Spider Tag: #JVMChat 3/31/15Spider Tag: #JVMChat 3/31/15
Spider Tag: #JVMChat 3/31/15spiderQube
 
CML's Presentation at FengChia University
CML's Presentation at FengChia UniversityCML's Presentation at FengChia University
CML's Presentation at FengChia UniversityTunghai University
 
Extracting Information from Twitter Screenshots
Extracting Information from Twitter ScreenshotsExtracting Information from Twitter Screenshots
Extracting Information from Twitter ScreenshotsTarannum Zaki
 

Ähnlich wie Grampa, What's a deleted tweet? (20)

Challenges in Replaying Archived Twitter Pages
Challenges in Replaying Archived Twitter PagesChallenges in Replaying Archived Twitter Pages
Challenges in Replaying Archived Twitter Pages
 
Web Archives for Verifying Attribution in Twitter Screenshots
Web Archives for Verifying Attribution in Twitter ScreenshotsWeb Archives for Verifying Attribution in Twitter Screenshots
Web Archives for Verifying Attribution in Twitter Screenshots
 
Weaponized Web Archives: Provenance Laundering of Short Order Evidence
Weaponized Web Archives: Provenance Laundering of Short Order Evidence Weaponized Web Archives: Provenance Laundering of Short Order Evidence
Weaponized Web Archives: Provenance Laundering of Short Order Evidence
 
Social Media Mining and Analytics
Social Media Mining and AnalyticsSocial Media Mining and Analytics
Social Media Mining and Analytics
 
Ijciet 08 02_024
Ijciet 08 02_024Ijciet 08 02_024
Ijciet 08 02_024
 
MINING OPINIONS ABOUT TRAFFIC STATUS USING TWITTER MESSAGES
MINING OPINIONS ABOUT TRAFFIC STATUS USING TWITTER MESSAGESMINING OPINIONS ABOUT TRAFFIC STATUS USING TWITTER MESSAGES
MINING OPINIONS ABOUT TRAFFIC STATUS USING TWITTER MESSAGES
 
DataKind SG sharing of our first DataDive
DataKind SG sharing of our first DataDiveDataKind SG sharing of our first DataDive
DataKind SG sharing of our first DataDive
 
OSINT using Twitter & Python
OSINT using Twitter & PythonOSINT using Twitter & Python
OSINT using Twitter & Python
 
Congressional Deleted Tweets
Congressional Deleted TweetsCongressional Deleted Tweets
Congressional Deleted Tweets
 
Uncertainty in replaying archived Twitter pages
Uncertainty in replaying archived Twitter pagesUncertainty in replaying archived Twitter pages
Uncertainty in replaying archived Twitter pages
 
Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...
Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...
Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...
 
GIJC19 - NodeXL Tutorial - Session 1
GIJC19 - NodeXL Tutorial - Session 1GIJC19 - NodeXL Tutorial - Session 1
GIJC19 - NodeXL Tutorial - Session 1
 
From Chirps to Whistles - Discovering Event-specific Informative Content from...
From Chirps to Whistles - Discovering Event-specific Informative Content from...From Chirps to Whistles - Discovering Event-specific Informative Content from...
From Chirps to Whistles - Discovering Event-specific Informative Content from...
 
Social Cards Probably Provide For Better Understanding Of Web Archive Collect...
Social Cards Probably Provide For Better Understanding Of Web Archive Collect...Social Cards Probably Provide For Better Understanding Of Web Archive Collect...
Social Cards Probably Provide For Better Understanding Of Web Archive Collect...
 
All You Need To Know About LETTER WRITING
All You Need To Know About LETTER WRITINGAll You Need To Know About LETTER WRITING
All You Need To Know About LETTER WRITING
 
Weaponized Web Archives: Provenance Laundering of Short Order Evidence
Weaponized Web Archives: Provenance Laundering of Short Order Evidence Weaponized Web Archives: Provenance Laundering of Short Order Evidence
Weaponized Web Archives: Provenance Laundering of Short Order Evidence
 
Social-media reporting tools - Sona Patel - Fresno NewsTrain - 4.22-23.22
Social-media reporting tools - Sona Patel - Fresno NewsTrain - 4.22-23.22Social-media reporting tools - Sona Patel - Fresno NewsTrain - 4.22-23.22
Social-media reporting tools - Sona Patel - Fresno NewsTrain - 4.22-23.22
 
Spider Tag: #JVMChat 3/31/15
Spider Tag: #JVMChat 3/31/15Spider Tag: #JVMChat 3/31/15
Spider Tag: #JVMChat 3/31/15
 
CML's Presentation at FengChia University
CML's Presentation at FengChia UniversityCML's Presentation at FengChia University
CML's Presentation at FengChia University
 
Extracting Information from Twitter Screenshots
Extracting Information from Twitter ScreenshotsExtracting Information from Twitter Screenshots
Extracting Information from Twitter Screenshots
 

Kürzlich hochgeladen

Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170
Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170
Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170Sonam Pathan
 
NSX-T and Service Interfaces presentation
NSX-T and Service Interfaces presentationNSX-T and Service Interfaces presentation
NSX-T and Service Interfaces presentationMarko4394
 
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一Fs
 
Q4-1-Illustrating-Hypothesis-Testing.pptx
Q4-1-Illustrating-Hypothesis-Testing.pptxQ4-1-Illustrating-Hypothesis-Testing.pptx
Q4-1-Illustrating-Hypothesis-Testing.pptxeditsforyah
 
Call Girls Near The Suryaa Hotel New Delhi 9873777170
Call Girls Near The Suryaa Hotel New Delhi 9873777170Call Girls Near The Suryaa Hotel New Delhi 9873777170
Call Girls Near The Suryaa Hotel New Delhi 9873777170Sonam Pathan
 
A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)
A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)
A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)Christopher H Felton
 
Film cover research (1).pptxsdasdasdasdasdasa
Film cover research (1).pptxsdasdasdasdasdasaFilm cover research (1).pptxsdasdasdasdasdasa
Film cover research (1).pptxsdasdasdasdasdasa494f574xmv
 
Top 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptxTop 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptxDyna Gilbert
 
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作ys8omjxb
 
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书rnrncn29
 
Blepharitis inflammation of eyelid symptoms cause everything included along w...
Blepharitis inflammation of eyelid symptoms cause everything included along w...Blepharitis inflammation of eyelid symptoms cause everything included along w...
Blepharitis inflammation of eyelid symptoms cause everything included along w...Excelmac1
 
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一Fs
 
办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书
办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书
办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书zdzoqco
 
SCM Symposium PPT Format Customer loyalty is predi
SCM Symposium PPT Format Customer loyalty is prediSCM Symposium PPT Format Customer loyalty is predi
SCM Symposium PPT Format Customer loyalty is predieusebiomeyer
 
Contact Rya Baby for Call Girls New Delhi
Contact Rya Baby for Call Girls New DelhiContact Rya Baby for Call Girls New Delhi
Contact Rya Baby for Call Girls New Delhimiss dipika
 
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一Fs
 
Magic exist by Marta Loveguard - presentation.pptx
Magic exist by Marta Loveguard - presentation.pptxMagic exist by Marta Loveguard - presentation.pptx
Magic exist by Marta Loveguard - presentation.pptxMartaLoveguard
 
Elevate Your Business with Our IT Expertise in New Orleans
Elevate Your Business with Our IT Expertise in New OrleansElevate Your Business with Our IT Expertise in New Orleans
Elevate Your Business with Our IT Expertise in New Orleanscorenetworkseo
 
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书rnrncn29
 

Kürzlich hochgeladen (20)

Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170
Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170
Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170
 
NSX-T and Service Interfaces presentation
NSX-T and Service Interfaces presentationNSX-T and Service Interfaces presentation
NSX-T and Service Interfaces presentation
 
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一
 
young call girls in Uttam Nagar🔝 9953056974 🔝 Delhi escort Service
young call girls in Uttam Nagar🔝 9953056974 🔝 Delhi escort Serviceyoung call girls in Uttam Nagar🔝 9953056974 🔝 Delhi escort Service
young call girls in Uttam Nagar🔝 9953056974 🔝 Delhi escort Service
 
Q4-1-Illustrating-Hypothesis-Testing.pptx
Q4-1-Illustrating-Hypothesis-Testing.pptxQ4-1-Illustrating-Hypothesis-Testing.pptx
Q4-1-Illustrating-Hypothesis-Testing.pptx
 
Call Girls Near The Suryaa Hotel New Delhi 9873777170
Call Girls Near The Suryaa Hotel New Delhi 9873777170Call Girls Near The Suryaa Hotel New Delhi 9873777170
Call Girls Near The Suryaa Hotel New Delhi 9873777170
 
A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)
A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)
A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)
 
Film cover research (1).pptxsdasdasdasdasdasa
Film cover research (1).pptxsdasdasdasdasdasaFilm cover research (1).pptxsdasdasdasdasdasa
Film cover research (1).pptxsdasdasdasdasdasa
 
Top 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptxTop 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptx
 
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作
 
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
 
Blepharitis inflammation of eyelid symptoms cause everything included along w...
Blepharitis inflammation of eyelid symptoms cause everything included along w...Blepharitis inflammation of eyelid symptoms cause everything included along w...
Blepharitis inflammation of eyelid symptoms cause everything included along w...
 
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一
 
办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书
办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书
办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书
 
SCM Symposium PPT Format Customer loyalty is predi
SCM Symposium PPT Format Customer loyalty is prediSCM Symposium PPT Format Customer loyalty is predi
SCM Symposium PPT Format Customer loyalty is predi
 
Contact Rya Baby for Call Girls New Delhi
Contact Rya Baby for Call Girls New DelhiContact Rya Baby for Call Girls New Delhi
Contact Rya Baby for Call Girls New Delhi
 
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
 
Magic exist by Marta Loveguard - presentation.pptx
Magic exist by Marta Loveguard - presentation.pptxMagic exist by Marta Loveguard - presentation.pptx
Magic exist by Marta Loveguard - presentation.pptx
 
Elevate Your Business with Our IT Expertise in New Orleans
Elevate Your Business with Our IT Expertise in New OrleansElevate Your Business with Our IT Expertise in New Orleans
Elevate Your Business with Our IT Expertise in New Orleans
 
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
 

Grampa, What's a deleted tweet?

  • 1. GRAMPA, WHAT'S A DELETED TWEET? Mohammed Nauman Siddique Web Archiving Forensics (CS 895) Spring, 2019 Web Science and Digital Libraries Group Old Dominion University Norfolk, Virginia, USA @WebSciDL
  • 2. Presidential tweets are now government records @m_nsiddique, @WebSciDL 2 Source: https://web.archive.org/web/20170121171210/http:/twitter.com/realDonaldTrump/status/822853741040771072 News Article: https://theconversation.com/donald-trumps-tweets-are-now-presidential-records-71973
  • 3. 11% of the social media resources are lost in their first year @m_nsiddique, @WebSciDL 3 Source: SalahEldeen H.M., Nelson M.L. (2012) Losing My Revolution: How Many Resources Shared on Social Media Have Been Lost?. TPDL 2012. Springer, Berlin, Heidelberg Blog Link: http://ws-dl.blogspot.com/2012/02/2012-02-11-losing-my-revolution-year.html
  • 4. Politwoops: Tracks deleted tweets by public officials @m_nsiddique, @WebSciDL 4 Source: https://projects.propublica.org/politwoops/
  • 5. The best way to find a typo is to hit send @m_nsiddique, @WebSciDL 5 Source: https://projects.propublica.org/politwoops/tweet/1056626382548156416
  • 6. Fixing typos only introduces more typos @m_nsiddique, @WebSciDL 6 Source: https://twitter.com/RepDannyDavis/status/1056627582148530177
  • 7. Unretweeted after a year!!! @m_nsiddique, @WebSciDL 7 Source: https://projects.propublica.org/politwoops/tweet/910352940749254657
  • 8. Twitter revoked access to Politwoops @m_nsiddique, @WebSciDL 8 Source: https://www.businessinsider.com/twitter-bans-politwoops-diplotwoops-global-api-access-revoked-open-state-foundation- 2015-8/
  • 9. Politwoops resumes after 6 months @m_nsiddique, @WebSciDL 9 Tweet is deleted Source: https://blog.twitter.com/official/en_us/a/2015/holding-public-officials-accountable-with-twitter-and-politwoops.html
  • 10. Flight handle is gone @m_nsiddique, @WebSciDL 10 Source: https://twitter.com/Flight/status/656882929923059713
  • 11. No worries web archives come to the rescue @m_nsiddique, @WebSciDL 11 Source: https://web.archive.org/web/20160205000405/https://twitter.com/Flight/status/656882929923059713
  • 12. Web archives include social media too @m_nsiddique, @WebSciDL 12 Source: https://web.archive.org/web/20180929210711/https:/twitter.com/RepDannyDavis
  • 13. Nauman, you are not archived @m_nsiddique, @WebSciDL 13 Source: https://web.archive.org/web/*/https://twitter.com/m_nsiddique
  • 14. @BreitbartNews is well archived @m_nsiddique, @WebSciDL 14 Source: https://web.archive.org/web/*/https://twitter.com/BreitbartNews
  • 15. @realDonaldTrump is very heavily archived @m_nsiddique, @WebSciDL 15 Source: https://web.archive.org/web/*/https://twitter.com/realDonaldTrump
  • 16. Archival captures for top level pages have approximately 20 tweets @m_nsiddique, @WebSciDL 16 Source: https://web.archive.org/web/20190202074656/https:/twitter.com/realDonaldTrump
  • 17. Tweet Ids are just a single tweet @m_nsiddique, @WebSciDL 17 Source: https://web.archive.org/web/20190202054351/https://twitter.com/realdonaldtrump/status/1091427927475085312
  • 18. Not enough to take screenshots @m_nsiddique, @WebSciDL 18 Source: https://twitter.com/CasMudde/status/960546130684768256 News Article: https://www.huffingtonpost.com/entry/ breitbart-anti-muslim-tweet_us_5a78b426e4b0164659c70e15
  • 19. Monday Morning Quarterbacking @m_nsiddique, @WebSciDL 19 Source: https://twitter.com/BreitbartNews/status/960565890336149504
  • 20. Penalty: Unsportsmanlike conduct @m_nsiddique, @WebSciDL 20 Source: https://web.archive.org/web/20180205041213/http:/twitter.com/BreitbartNews/status/960353573581283329
  • 21. How did we find the deleted tweets? • Used Twitter API to fetch recent 3200 tweets • Tweets spanned from Oct 22, 2017 to Feb 18, 2018 • Used Memgator, memento aggregator service to fetch mementos @m_nsiddique, @WebSciDL 21
  • 22. Code to fetch recent tweets using Python-TwitterAPI import twitter api = twitter.Api(consumer_key='xxxxxx', consumer_secret='xxxxxx', access_token_key='xxxxxx', access_token_secret='xxxxxx', sleep_on_rate_limit=True) twitter_response = api.GetUserTimeline(screen_name=screen_name, count=200, include_rts=True) @m_nsiddique, @WebSciDL 22
  • 23. Run MemGator locally $ memgator --contimeout=10s --agent=XXXXXX server MemGator 1.0-rc7 _____ _______ __ / _____ _____ / _____/______/ |___________ / Y Y / __ / / _____ _/ _ _ _ / | | ___/ Y Y _ / __ | | |_| | | / __/_____/______|_|__/_______/_____|__|___/|__| TimeMap : http://localhost:1208/timemap/{FORMAT}/{URI-R} TimeGate : http://localhost:1208/timegate/{URI-R} [Accept- Datetime] Memento : http://localhost:1208/memento[/{FORMAT}|proxy]/{DATETIME}/{UR I-R} # FORMAT => link|json|cdxj # DATETIME => YYYY[MM[DD[hh[mm[ss]]]]] # Accept-Datetime => Header in RFC1123 format @m_nsiddique, @WebSciDL 23 Source: https://github.com/oduwsdl/MemGator
  • 24. TimeGate @m_nsiddique, @WebSciDL 24 RFC: https://tools.ietf.org/html/rfc7089 Source: http://mementoweb.org/guide/quick-intro/
  • 25. TimeMap msiddique@atria:~$ curl -i https://memgator.cs.odu.edu/timemap/link/http://example.org/index.html HTTP/1.1 200 OK Content-Type: application/link-format Date: Wed, 06 Feb 2019 18:46:16 GMT X-Generator: MemGator:1.0-rc7 X-Memento-Count: 93 Transfer-Encoding: chunked <http://example.org/index.html>; rel="original", <https://memgator.cs.odu.edu/timemap/link/http://example.org/index.html>; rel="self"; type="application/link-format", <http://web.archive.org/web/20021016101337/http://www.example.org:80/index.html>; rel="first memento"; datetime="Wed, 16 Oct 2002 10:13:37 GMT", <http://web.archive.org/web/20031207031049/http://www.example.org:80/index.html>; rel="memento"; datetime="Sun, 07 Dec 2003 03:10:49 GMT", ….Deletia… <http://web.archive.org/web/20190131051056/http://www.example.org/index.html>; rel="last memento"; datetime="Thu, 31 Jan 2019 05:10:56 GMT", @m_nsiddique, @WebSciDL 25
  • 26. Play with TimeMap and TimeGate @m_nsiddique, @WebSciDL 26 Source: http://memgator.cs.odu.edu/api.html
  • 27. Code to fetch TimeMap for any Twitter handle url = "http://localhost:1208/timemap/" data_format = "cdxj" command = url + data_format + "/http://twitter.com/<screen-name>" + response = requests.get(command) @m_nsiddique, @WebSciDL 27
  • 28. Code to parse tweet-related information import bs4 soup = bs4.BeautifulSoup(open(<HTML representation of Memento>),"html.parser") match_tweet_div_tag = soup.select('div.js-stream-tweet') for tag in match_tweet_div_tag: if tag.has_attr("data-tweet-id"): # Get Tweet id ........... # Parse tweets match_timeline_tweets = tag.select('p.js-tweet- text.tweet-text') ........... # Parse tweet timestamps match_tweet_timestamp = tag.find("span", {"class": "js-short-timestamp"}) ........... @m_nsiddique, @WebSciDL 28
  • 29. Analysis of Breitbart News Deleted Tweets • Of the 22 deleted tweets, 20 were of the form where Breitbart News retweeted someone's tweet but the original tweet was lost. • Of those 20 tweets, 18 were from two affiliates of Breitbart News, @NolteNC and @carney. Therefore, we decided to have a look at both the accounts to determine the reason for their deleted tweets. @m_nsiddique, @WebSciDL 29
  • 30. Unretweeted tweet by Breitbart News @m_nsiddique, @WebSciDL 30 Source: https://web.archive.org/web/20180111162729/https:/twitter.com/BreitbartNews
  • 31. Original Live Tweet @m_nsiddique, @WebSciDL 31 Source: https://twitter.com/LibertarianBlue/status/951465189161029632
  • 32. Breitbart News retweets another tweet @m_nsiddique, @WebSciDL 32 Source: https://twitter.com/LibertarianBlue/status/951578248986710016
  • 33. Analysis on @carney and @NolteNC • Mementos fetched between Nov 3, 2017 and Feb 17, 2018 • Low number of mementos for @carney • @NolteNC had 169 live tweets and 3569 deleted tweets • Fetched live tweets using Twitter API for both accounts for over two weeks @m_nsiddique, @WebSciDL 33
  • 34. Tweets older than a week on Tuesday and Saturday are deleted @m_nsiddique, @WebSciDL 34
  • 35. Tweets older than a week on Wednesday and Saturday are deleted @m_nsiddique, @WebSciDL 35
  • 36. • With 1000s of deleted tweets, it seemed unlikely that he was manually deleting tweets. • We have all the reasons to believe that @carney and @NolteNC deleted tweets automatically using some tweet deletion service. @m_nsiddique, @WebSciDL 36 Deletion Behavior
  • 37. Take Away • It is not enough to make screen shots of controversial tweets, rather we need to push it to the web archives for longer retention capability than our personal archives. • For finding deleted tweets, web archives work effectively for popular accounts but for less popular accounts this approach might not work. • For finding deleted tweets, top level page works better than individual Tweet Id URLs. • Most deletions for Breitbart News come from automatic deletion of tweets by some of its correspondents. @m_nsiddique, @WebSciDL 37
  • 38. You can read more on the blog http://ws-dl.blogspot.com/2018/04/2018-04-23-grampa- whats-deleted-tweet.html @m_nsiddique, @WebSciDL 38