Hamdan Azhar
hamdan@prismoji.com
@hamdanazhar
// November 3, 2017
From data with ❤ //
An introduction to Emoji Data Science
🏭 📈📊%
→
& 📈📊%
Industry data science -> Indie data science
While emojis might seem fun and
trivial, they are at their core vessels
for conveying human emotion.
Emojis are also data.
And data science can help us
understand them.
The birth of
My reaction to this article,
in emoji
https://motherboard.vice.com/en_us/article/the-emojis-of-great-brexit
How do people use emojis to react to a major
political event in real time?
Getting the data
■ Use Twitter API to sample 100,000 tweets for five hashtags related to Britain’s
EU Referendum
§ Hashtags: #NotMyVote, #VoteRemain, #EURef, #Brexit, #VoteLeave
§ Data pulled for June 24, 2016 (the day after the referendum)
§ English language tweets only
§ After removing retweets, we’re left with 23,989 unique tweets, i.e. the “Brexit
dataset”
§ Of these, 1,505 tweets (6.3%) contain at least one emoji
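The retweet filtering and the 6.3% figure above can be sketched in a few lines of Python. This is a minimal illustration, not the code used for the talk: the `drop_retweets` helper and the sample tweets are hypothetical, and the "RT @" prefix check is an assumption about how retweets might be detected.

```python
def drop_retweets(tweets):
    """Return tweets that are not retweets, keeping the first copy of each text."""
    seen = set()
    unique = []
    for text in tweets:
        # Assumption: a leading "RT @" marks a retweet.
        if text.startswith("RT @"):
            continue
        key = text.strip().lower()
        if key not in seen:
            seen.add(key)
            unique.append(text)
    return unique


# Hypothetical sample, for illustration only.
sample = [
    "Can't believe it #Brexit",
    "RT @someone: Can't believe it #Brexit",
    "Can't believe it #Brexit",
    "Big day #EURef",
]
print(drop_retweets(sample))

# Share of the Brexit dataset with at least one emoji, using the counts on the slide.
emoji_share = 100 * 1505 / 23989
print(round(emoji_share, 1))  # 6.3
```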
Analyzing the data
§ Use regular expressions in R, along with Unicode emoji dictionaries, to
extract emojis from tweets
§ Compute emoji counts in the Brexit dataset
§ Compare with counts for all >10B emoji tweets on Twitter since 2013
(from emojitracker.com)
§ Extract hashtags from tweets and compute hashtag profiles for various
emojis
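The talk did this step in R with Unicode emoji dictionaries. As a rough Python sketch of the same idea, the block below extracts emojis and hashtags with regular expressions and counts them. The pattern is an assumption: it covers only a few common Unicode ranges rather than a full emoji dictionary, and the function names and sample tweets are hypothetical.

```python
import re
from collections import Counter

# Approximate emoji pattern (assumption: a few Unicode blocks, not a full dictionary).
EMOJI_PATTERN = re.compile(
    "[\U0001F1E6-\U0001F1FF]{2}"                      # flags: two regional indicators
    "|[\U0001F300-\U0001F3FA\U0001F400-\U0001FAFF]"   # pictographs, skipping skin tone modifiers
    "|[\u2600-\u27BF]"                                # miscellaneous symbols and dingbats
)
HASHTAG_PATTERN = re.compile(r"#\w+")


def extract_emojis(text):
    """Return every emoji match in the text, in order of appearance."""
    return EMOJI_PATTERN.findall(text)


def extract_hashtags(text):
    """Return lowercased hashtags so that #Brexit and #brexit are counted together."""
    return [tag.lower() for tag in HASHTAG_PATTERN.findall(text)]


# Hypothetical sample tweets, for illustration only.
tweets = [
    "Well that happened \U0001F602\U0001F602 #Brexit",
    "Proud today \U0001F1EC\U0001F1E7 #VoteLeave",
    "No emoji here #EURef",
]

emoji_counts = Counter(e for t in tweets for e in extract_emojis(t))
print(emoji_counts.most_common())
print([extract_hashtags(t) for t in tweets])
```

A production version would more likely load the emoji list from the Unicode data files, since fixed ranges miss newer emoji and multi-character sequences.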
Brexit chatter includes a wide variety of emojis
emoji | emoji name | brexit rank | brexit count | brexit density (per 1,000)
😂 | face with tears of joy | 1 | 285 | 11.9
🇬🇧 | flag of united kingdom | 2 | 249 | 10.4
👍 | thumbs up sign | 3 | 75 | 3.1
👏 | clapping hands sign | 4 | 70 | 2.9
❤ | heavy black heart | 5 | 60 | 2.5
😭 | loudly crying face | 6 | 48 | 2.0
😔 | pensive face | 7 | 40 | 1.7
😩 | weary face | 8 | 38 | 1.6
😢 | crying face | 9 | 34 | 1.4
🙈 | see-no-evil monkey | 10 | 34 | 1.4
Of every 1,000 tweets in our dataset, 11.9 of them contain 😂
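The density column follows from the counts and the size of the Brexit dataset (23,989 tweets). A small sketch of that arithmetic, with a hypothetical helper name:

```python
TOTAL_TWEETS = 23_989  # size of the Brexit dataset after removing retweets


def density_per_1000(count, total=TOTAL_TWEETS):
    """Emoji count per 1,000 tweets, rounded to one decimal as in the table."""
    return round(1000 * count / total, 1)


# Counts taken from the table above.
counts = {
    "face with tears of joy": 285,
    "flag of united kingdom": 249,
    "thumbs up sign": 75,
}
for name, count in counts.items():
    print(name, density_per_1000(count))
```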
Can we use hashtag data to understand what
these emojis mean?
Finding the “hashtag signature” of a given emoji
§ We know the distribution of
hashtags in our entire dataset
§ We can pick a given emoji and
compute the distribution of
hashtags for tweets that use that
emoji
§ By comparing these two
distributions, we can estimate
which hashtags an emoji is most
likely to be used with
[Chart: five-way percentage breakdown (15%, 17%, 20%, 29%, 19%); category labels not recoverable]
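One way to sketch the "hashtag signature" idea in Python is shown below. The slides do not say how the two distributions are compared, so the ratio of shares used here is an assumption, as are the function names and the toy data. Each record is a pair of (hashtags, emojis) for one tweet.

```python
from collections import Counter


def hashtag_distribution(tweets):
    """Share of hashtag mentions per hashtag across (hashtags, emojis) records."""
    counts = Counter(tag for tags, _ in tweets for tag in tags)
    total = sum(counts.values())
    return {tag: n / total for tag, n in counts.items()}


def hashtag_signature(tweets, emoji):
    """Ratio of each hashtag's share among tweets using the emoji to its overall share.

    Assumption: a simple ratio; values above 1 mean the hashtag is over-represented.
    """
    baseline = hashtag_distribution(tweets)
    with_emoji = hashtag_distribution([t for t in tweets if emoji in t[1]])
    return {tag: with_emoji.get(tag, 0.0) / share for tag, share in baseline.items()}


# Toy data, for illustration only.
FLAG = "\U0001F1EC\U0001F1E7"
CRY = "\U0001F62D"
tweets = [
    (["#voteleave"], [FLAG]),
    (["#voteleave"], [FLAG]),
    (["#voteremain"], [CRY]),
    (["#voteremain"], [FLAG]),
    (["#brexit"], []),
    (["#brexit"], [CRY]),
]
print(hashtag_signature(tweets, FLAG))
```

In this toy data the flag emoji appears with #voteleave twice as often as the overall hashtag mix would suggest, and never with #brexit.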
https://motherboard.vice.com/en_us/article/a-data-scientists-emoji-guide-to-kanye-west-and-taylor-swift
Hearts vs. Snakes:
The emoji battle underlying the epic Taylor Swift – Kanye West feud
Source: Analysis of 100,000 public tweets mentioning @taylorswift13 and @kanyewest from Aug. 1-4, 2016. (PRISMOJI)
Taylor Swift is winning hearts (and minds)
Source: Analysis of 100,000 public tweets mentioning @taylorswift13 and @kanyewest from Aug. 1-4, 2016. (PRISMOJI)
[Chart legend: higher association with @taylorswift13 / equal / higher association with @kanyewest]
Our common emoji language of #fanlove
Source: Analysis of 250,000 public tweets mentioning @beyonce, @justinbieber, @djkhaled, @drake, and @rihanna from Aug. 1-4, 2016. (PRISMOJI)
Sometimes love hurts
Examples of in tweets involving #fanlove
Source: Analysis of 250,000 public tweets mentioning @beyonce, @justinbieber, @djkhaled, @drake, and @rihanna from Aug. 1-4, 2016. (PRISMOJI)
Now let’s turn to something more serious
Emoji reactions to Election Night results
Source: Analysis of sampled Election Night tweets, from Nov. 9, 2016 12:00am – 3:30am EST. (PRISMOJI)
More common in tweets mentioning Trump
More common in tweets mentioning Clinton
https://prismoji.com/2016/11/26/the-top-emojis-of-election-day-2016/
Election Day emojis in tweets mentioning Hillary Clinton
More common AFTER midnight
More common BEFORE midnight
Source: Analysis of sampled Election Day tweets from Nov. 8, 2016 6:00am to Nov. 9, 2016 3:30am. (PRISMOJI)
Source: Analysis of sampled Election Day tweets from Nov. 8, 2016. (PRISMOJI)
The divided emoji states of America
Source: Analysis of sampled Election Night tweets, from Nov. 9, 2016 12:00am – 3:30am EST. Word clouds shown for tweets mentioning Hillary Clinton or Donald Trump as well as the given emoji above. Randomly selected tweet shown for each emoji.
http://www.huffingtonpost.com/entry/raised-fist-emoji-resistance_us_5899e39fe4b09bd304bd8ac9
Emojis of the resistance
#NoBanNoWall
#TheResistance
#NotMyPresident
#WomensMarch
Source: Analysis of sampled English-language tweets from Jan. 28-29, 2017 (Emojipedia)
The media doesn’t always appreciate emojis
Latest work
The Top Emojis of #MeToo
Source: Analysis of 28,629 English-language tweets sampled on Oct. 16, 2017 using
the Twitter Search API. Top 10 emojis shown in descending order of frequency.
The Top Emojis of #MeToo
Source: Analysis of 28,629 English-language tweets sampled on Oct. 16, 2017 using
the Twitter Search API. Top 20 emojis shown, grouped by emoji type, in descending
order of frequency.
❤💔💜💕💖💙💛💗
😔😢😡😍😞😕😥
👇✊💪🙏👏
The Top Emojis of #MeToo: Skin Tone Distributions
Source: Analysis of 28,629 English-language tweets sampled on Oct. 16, 2017 using the Twitter Search API.
emoji | Color 1 | Color 2 | Color 3 | Color 4 | Color 5 | Mean skin tone
✊ | 9% | 37% | 29% | 14% | 11% | 2.8
👇 | 29% | 29% | 18% | 12% | 12% | 2.5
🙏 | 20% | 50% | 15% | 15% | 0% | 2.2
💪 | 38% | 28% | 25% | 9% | 0% | 2.0
👊 | 50% | 25% | 25% | 0% | 0% | 1.8
🖕 | 50% | 36% | 14% | 0% | 0% | 1.6
For emojis that are most frequently used with skin tone modifiers, we look at the distribution of skin tone usage for each emoji. The yellow band indicates the “mean skin tone” for a given emoji.
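The slides do not define the "mean skin tone" explicitly. A plausible reading, assumed here, is a weighted average in which Color 1 through Color 5 are scored 1 through 5 and weighted by their share of usage. That reading reproduces the 2.8 reported for ✊. The helper name is hypothetical.

```python
def mean_skin_tone(shares):
    """Weighted average tone, scoring Color 1..5 as 1..5 (assumed definition).

    `shares` holds the usage share of each tone, in percent or as fractions.
    """
    total = sum(shares)
    return sum(tone * share for tone, share in enumerate(shares, start=1)) / total


# Raised fist row from the table: 9%, 37%, 29%, 14%, 11%.
raised_fist = [9, 37, 29, 14, 11]
print(round(mean_skin_tone(raised_fist), 1))  # 2.8
```

Because the percentages on the slide are rounded, recomputed means for other rows can differ from the reported values in the last digit.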
While emojis might seem fun and
trivial, they are at their core vessels
for conveying human emotion.
Emojis are also data.
And data science can help us
understand them.
You too can become an emoji data scientist!
https://prismoji.com/2017/02/06/emoji-data-science-in-r-tutorial/
Mailchimp emoji analysis
https://blog.mailchimp.com/mailchimps-most-popular-subject-line-emojis/
Spotify emoji analysis
https://insights.spotify.com/us/2017/05/02/spotify-emoji-music/
Next steps
■ Journalism vs. Academic & Industry
■ Study emoji usage across countries & cultures
■ Study private emoji usage vs. public emoji usage
■ Use this as a starting point to dive headfirst into data
journalism and storytelling about unexplored questions in
society and culture
Thank you!
Send me questions and tips! Or reach out if you’d like to
collaborate 🙂
On Twitter: @hamdanazhar
Or by email: hamdan@prismoji.com
Check out more emoji data science at prismoji.com

Introduction to Emoji Data Science (Open Data Science Conference, 2017)
