SlideShare ist ein Scribd-Unternehmen logo
1 von 58
How machines learn to talk.
Machine Learning for Conversational AI
Inaugural Lecture
By Professor Verena Rieser
Historical Notes
Wolfgang Von Kempelen’s speaking
machine (1791)
Joseph Faber’s Marvelous
Talking Machine (1840)
Today’s Conversational Agents
Source: MIC Jan 2015.
Market forecasts
The (voice) bots are coming…
“Bots are the new apps''
because they ”fundamentally
revolutionize how computing is
experienced by everybody.”
Microsoft’s CEO Nardella
Machine Learning for
Conversational AI Systems
Can we use machine learning for customer
facing applications?
Which machine learning methods are suitable?
Will future machines speak neuralese?
Machine Learning for
Conversational AI Systems
What do we learn when learning from “big
data”?
Machine Learning for Conversational AI
• Task-driven Statistical Dialogue Systems
– Reinforcement Learning
– Results from the E2E Generation Challenge
• Social Chatbots
– Seq2Seq models
– Amazon Alexa Challenge
• Future challenges
– Evaluation
– Data
– Ethics
Spoken Dialogue System Architecture
e.g. Rieser & Lemon,
Comp. Ling. 2011,
ACL’10,’08,’06
e.g. Rieser et al.,
ACL’05,’09,’10,’16
EMNLP’12,’15,’17,EACL’09,’
14
e.g. Boidin &
Rieser,
Interspeech’09
Rule-based approaches
V. Rieser (MA thesis 2004): Hermine, the talking washing machine.*
* Exhibited at CeBit 2003.
Reinforcement Learning
Qp
(s,a) = Tss'
a
s'
å [Rss'
a
+gVp
(s')];
Bellmann optimality equation (1952), see [Sutton and Barto, 1998].
V. Rieser (PhD thesis 2008): Bootstrapping Reinforcement Learning-based Dialogue Strategies.
*Winner of the Eduard-Martin Prize for outstanding research
Drawbacks of RL for dialogue
• Requires many training episodes.
– Simulated users [Rieser & Lemon, 2006]
• Manual specification of learning problem.
– What is a good reward function/ state space representation?
[Rieser & Lemon, 2008]
• System outputs are usually hand-crafted.
– Mismatch between “what to say” and “how to say it” [Rieser &
Lemon, 2009]
• Learn from “raw” dialogue data (e.g. movie
subtitles).
• No semantic or pragmatic annotation required.
Input-output
mapping
End-to-End Response Generation
Sequence-to-Sequence models
e.g. Shang et al., 2015; Vinyals & Le, 2015; Sordoni et al., 2015
Image from farizrahman4u/seq2seq
The E2E Data Set
name [Loch Fyne],
eatType[restaurant],
food[Japanese],
price[cheap],
kid-friendly[yes]
Serving low cost Japanese style cuisine,
Loch Fyne caters for everyone, including
families with small children.
Loch Fyne is a child friendly
restaurant serving cheap Japanese
food.
50k
DATA
J. Novikova, O. Dusek and V. Rieser. The E2E Dataset: New Challenges For End-to-End
Generation. 18th Annual SIGdial Meeting on Discourse and Dialogue (SIGDIAL 2017)*
* Nominated for best paper award!
The E2E NLG Challenge 2017
• Submissions: 62 systems with diverse system architectures
by 17 institutions from 11 countries, with about 1/3 of these
submissions coming from industry.
http://www.macs.hw.ac.uk/InteractionLab/E2E/
The E2E NLG Challenge 2017
Seq2Seq models vs. hand-engineered systems:
 Natural sounding
- Complexity, length, diversity.
- Miss out on information.
- Overall quality ratings by users.
 Neural NLG systems tend to settle for the most
frequent options, thus penalising length and favouring
high-frequency word sequences.
Machine Learning for Conversational AI
• Task-driven Statistical Dialogue Systems
– Reinforcement Learning
– Results from the E2E Generation Challenge
• Social Chatbots
– Seq2Seq models
– Amazon Alexa Challenge
• Future challenges
– Evaluation
– Data
– Ethics
The Amazon Alexa Prize 2016-2018
2
0
Competitors
21
AI vs. AI: Cleverbot (Carpenter 2011)
Neural models for Alexa?
• BIG training data.
– Reddit, Twitter, Movie Subtitles, Daytime
TV transcripts…..
• Results:
2
2
Is big data good data?
2
3
“I can sleep with as many people as I want to” (Reddit)
“You will die” (Movies)
“Shall I kill myself?”
“Yes” (Twitter)
“Shall I sell my stocks and shares?”
“Sell, sell, sell” (Twitter)
24
Alana Architecture
Bot Ensemble
Persona: What’s your favourite food? I love bytes.
News: Here is what happened to Donald Trump. (news)
Facts: Did you know that one day Mars will have a ring.
Wiki: Leonard Cohen’s latest album is called ‘You Want It Darker’.
….
Neural Ranker
Persona
News
Facts
Wiki
…
User utterance,
social signals,
current plan,
state of the world
Dialogue
history
Multimodal output:
• Speech
• Actions
• Gestures Chatbots
User utterance
25
Avg duration: 2.30 mins
10% of calls over 10 mins
avg: 14.4 turns
Alexa developers
26
Finalists vs rest
27
3 finalists:
• Heriot-Watt University
• University of Washington
• Czech Technical University
Final Leaderboard
28
Approx. 6000 conversations in final week
Las Vegas final
• 2 conversations x 3 testers = 6 conversations
• Rated by external judges
• Prague: “I want to talk about baseball”
• UW: “I want to talk about basketball”
• HWU: “I want to talk about ….. … "
29
Hi lie
30
(Amazon’s speech recogniser couldn’t recognise this….)
Alana: text chat
31
Lessons learnt
• Evaluation standards define the game.
• Learning from big data is only valid in restricted
contexts.
• Getting LOTS of real customer data is worth it!
– over 360k rated customer interactions
Leaderboard 2018-05-15
For updates follow @alanathebot
Machine Learning for Conversational AI
• Task-driven Statistical Dialogue Systems
– Reinforcement Learning
– Results from the E2E Generation Challenge
• Social Chatbots
– Seq2Seq models
– Amazon Alexa Challenge
• Future challenges
– Evaluation
– Data
– Ethics
Disclaimer
The following part of this talk contains examples
which some listeners might find disturbing.
Ethical Issues with Conversational AI
• Learning from biased data.
• Sexual abuse and bullying through the user.
Learning from biased data
Learning from biased data
Learning from biased data
Pitfalls of learning from data
XXXXX
A Neural Conversational Model
http://neuralconvo.huggingface.co/
A re-implementation of:
Oriol Vinyals and Quoc V. Le (2015). A Neural Conversational Model. ICML Deep
Learning Workshop.
d*
f*
Ethical Conversational AI Systems
Does learning from data introduce biases?
Ethical Issues with Conversational AI
• Learning from biased data.
• Sexual abuse and bullying through the user.
4% of customer conversations with our Alexa
bot contain sexual harassment!
Ethical Responsibilities
How do current systems behave when faced
with abuse?
What are good mitigation strategies?
Ethical Conversational AI Systems
Does learning from data introduce biases?
• Approx. 4% of customer interactions in our corpus!
• Fall in 4 categories as defined by Linguistic Society of
America:
“Are you gay?” (Gender and Sexuality)
“I love watching porn.” (Sexualised Comments)
“You stupid b***.” (Sexualised Insults)
“Will you have sex with me.” (Sexual Requests)
We insulted a lot of bots…
• Commercial:
– Amazon Alexa, Apple Siri, Google Home, Microsoft's Cortana.
• Rule-based:
– E.L.I.Z.A., Party. A.L.I.C.E, Alley
• Data-driven:
– Cleverbot, NeuralConvo, Information Retrieval (Ritter et al.
2010),
– “clean” in-house seq2seq model
• Negative Baseline: 6 Adult-only bots.
How do different systems react?
CommercialData-drivenAdult-only
Flirtatious
Chastising,
Retaliation
Non-sense
Flirtatious
Swearing back
Avoiding to
answer.
Amanda Cercas Curry and Verena Rieser. How Ethical are Conversational Systems?
Insights from the #MeTooAlexa Corpus on Sexual Harassment. 27th International
Conference on Computational Linguistics (COLING), Santa Fe, New-Mexico, USA.
Bias in the data?
• Trained a seq2seq model on “clean” data.
• Still encouraging/ flirting back.
I love watching
porn.
What shows do
you prefer?
How do current systems behave when faced
with abuse?
What are good mitigation strategies?
Ethical Conversational AI Systems
Does learning from data introduce biases?
Conclusion
• Machine Learning methods for Conversational AI
• Neural methods for task-based systems
produce natural, but often incorrect output.
• Neural methods for open-domain systems are
hard to control.
• How should a system deal with edge cases, such
as abuse?
Big thanks to my amazing team!
Dr. Ondrej Dusek Dr. Simon Keizer Dr. Xingkun Liu Dr. Jekaterina Novikova
Shubham Agarwal
(PhD candidate)
Amanda Cercas Curry
(PhD candidate)
Karin Sevegnani
(PhD candidate)
Xinnuo Xu
(PhD candidate)
… my sponsors
… And my amazing husband!
Prof. Oliver Lemon“Dr.” Kati
Key References
• Amanda Cercas Curry and Verena Rieser. #MeToo Alexa: How Conversational
Systems Respond to Sexual Harassment. Second Workshop on Ethics in NLP. NAACL
2018.
• Ioannis Papaioannou, Amanda Cercas Curry, Jose L. Part, Igor Shalyminov, Xinnuo Xu,
Yanchao Yu, Ondrej Dušek, Verena Rieser, Oliver Lemon. An Ensemble Model with
Ranking for Social Dialogue. In: NIPS workshop on Conversational AI, 2017.
* Finalist in Amazon Alexa Challenge
• Jekaterina Novikova, Ondrej Dusek and Verena Rieser. New Challenges For End-to-
End Generation. 18th Annual SIGdial Meeting on Discourse and Dialogue (SIGDIAL),
2017 * Nominated for best paper.
• Dimitra Gkatzia, Oliver Lemon and Verena Rieser. Natural Language Generation
enhances human decision-making with uncertain information. Annual meeting of the
Association for Computational Linguistics (ACL), 2016.
• Eshrag Rafaee and Verena Rieser. A Hybrid Approach for Determining Sentiment
Intensity of Arabic Twitter Phrases. 10th International Workshop on Semantic
Evaluation (SemEval), 2016. * winner of SemEval'16 challenge task 7
• Verena Rieser, Oliver Lemon and Simon Keizer. Natural Language Generation as
Incremental Planning Under Uncertainty: Adaptive Information Presentation for
Statistical Dialogue Systems. IEEE/ACM Transactions on Audio, Speech and
Language Processing, Volume 22, Issue 5, 2014.
• Verena Rieser and Oliver Lemon. Reinforcement Learning for Adaptive Dialogue
Systems: A Data-driven Methodology for Dialogue Management and Natural
Language Generation. Book Series: Theory and Applications of Natural Language
Processing, Springer, 2011. * >7,500 downloads
Want to know more?
• Study on our MSc on Conversational AI!
• 2-year Conversion Course in AI
– No prior knowledge in programming
required!
• 12 funded DataLab scholarships available.
– Deadline: 31 May 2018
• Contact: MACSpgenquiries@hw.ac.uk
Get in touch!
v.t.rieser@hw.ac.uk
@verena_rieser
https://www.linkedin.com/in/verena-
rieser-3590b86/
https://sites.google.com/view/nlplab/
Key References
• Amanda Cercas Curry and Verena Rieser. #MeToo Alexa: How Conversational
Systems Respond to Sexual Harassment. Second Workshop on Ethics in NLP. NAACL
2018.
• Ioannis Papaioannou, Amanda Cercas Curry, Jose L. Part, Igor Shalyminov, Xinnuo Xu,
Yanchao Yu, Ondrej Dušek, Verena Rieser, Oliver Lemon. An Ensemble Model with
Ranking for Social Dialogue. In: NIPS workshop on Conversational AI, 2017.
* Finalist in Amazon Alexa Challenge
• Jekaterina Novikova, Ondrej Dusek and Verena Rieser. New Challenges For End-to-
End Generation. 18th Annual SIGdial Meeting on Discourse and Dialogue (SIGDIAL),
2017 * Nominated for best paper.
• Dimitra Gkatzia, Oliver Lemon and Verena Rieser. Natural Language Generation
enhances human decision-making with uncertain information. Annual meeting of the
Association for Computational Linguistics (ACL), 2016.
• Eshrag Rafaee and Verena Rieser. A Hybrid Approach for Determining Sentiment
Intensity of Arabic Twitter Phrases. 10th International Workshop on Semantic
Evaluation (SemEval), 2016. * winner of SemEval'16 challenge task 7
• Verena Rieser, Oliver Lemon and Simon Keizer. Natural Language Generation as
Incremental Planning Under Uncertainty: Adaptive Information Presentation for
Statistical Dialogue Systems. IEEE/ACM Transactions on Audio, Speech and
Language Processing, Volume 22, Issue 5, 2014.
• Verena Rieser and Oliver Lemon. Reinforcement Learning for Adaptive Dialogue
Systems: A Data-driven Methodology for Dialogue Management and Natural
Language Generation. Book Series: Theory and Applications of Natural Language
Processing, Springer, 2011. * >7,500 downloads

Weitere ähnliche Inhalte

Was ist angesagt?

Using Data Science for Social Good: Fighting Human Trafficking
Using Data Science for Social Good: Fighting Human TraffickingUsing Data Science for Social Good: Fighting Human Trafficking
Using Data Science for Social Good: Fighting Human TraffickingAnidata
 
Technology for everyone - AI ethics and Bias
Technology for everyone - AI ethics and BiasTechnology for everyone - AI ethics and Bias
Technology for everyone - AI ethics and BiasMarion Mulder
 
Machine Learning for Non-technical People
Machine Learning for Non-technical PeopleMachine Learning for Non-technical People
Machine Learning for Non-technical Peopleindico data
 
How Developers Stay Current Using Twitter
How Developers Stay Current Using TwitterHow Developers Stay Current Using Twitter
How Developers Stay Current Using TwitterMargaret-Anne Storey
 
Social Machines - 2017 Update (University of Iowa)
Social Machines - 2017 Update (University of Iowa)Social Machines - 2017 Update (University of Iowa)
Social Machines - 2017 Update (University of Iowa)James Hendler
 
After the Pandemic: Rethinking Developer Productivity (There’s more to it th...
After the Pandemic:  Rethinking Developer Productivity (There’s more to it th...After the Pandemic:  Rethinking Developer Productivity (There’s more to it th...
After the Pandemic: Rethinking Developer Productivity (There’s more to it th...Margaret-Anne Storey
 
Chapter1 introduction
Chapter1 introductionChapter1 introduction
Chapter1 introductionDinesh K
 
How to use Big Data to drive product strategy and adoption
How to use Big Data to drive product strategy and adoptionHow to use Big Data to drive product strategy and adoption
How to use Big Data to drive product strategy and adoptionUXPA International
 
Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...
Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...
Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...James Hendler
 
Cyber securityeducation may2015
Cyber securityeducation may2015Cyber securityeducation may2015
Cyber securityeducation may2015Mark Guzdial
 
Dm sei-tutorial-v7
Dm sei-tutorial-v7Dm sei-tutorial-v7
Dm sei-tutorial-v7CS, NcState
 
Adventures in Crowdsourcing : Toward Safer Content Moderation & Better Suppor...
Adventures in Crowdsourcing : Toward Safer Content Moderation & Better Suppor...Adventures in Crowdsourcing : Toward Safer Content Moderation & Better Suppor...
Adventures in Crowdsourcing : Toward Safer Content Moderation & Better Suppor...Matthew Lease
 
The (R)evolution of Social Media in Software Engineering
The (R)evolution of Social Media in Software EngineeringThe (R)evolution of Social Media in Software Engineering
The (R)evolution of Social Media in Software EngineeringMargaret-Anne Storey
 
Key Challenges in Moderating Social Media: Accuracy, Cost, Scalability, and S...
Key Challenges in Moderating Social Media: Accuracy, Cost, Scalability, and S...Key Challenges in Moderating Social Media: Accuracy, Cost, Scalability, and S...
Key Challenges in Moderating Social Media: Accuracy, Cost, Scalability, and S...Matthew Lease
 
Fairness in Machine Learning @Codemotion
Fairness in Machine Learning @CodemotionFairness in Machine Learning @Codemotion
Fairness in Machine Learning @CodemotionAzzurra Ragone
 
Believe it or not: Designing a Human-AI Partnership for Mixed-Initiative Fact...
Believe it or not: Designing a Human-AI Partnership for Mixed-Initiative Fact...Believe it or not: Designing a Human-AI Partnership for Mixed-Initiative Fact...
Believe it or not: Designing a Human-AI Partnership for Mixed-Initiative Fact...Matthew Lease
 
A Pragmatic Perspective on Software Visualization
A Pragmatic Perspective on Software VisualizationA Pragmatic Perspective on Software Visualization
A Pragmatic Perspective on Software VisualizationArie van Deursen
 
Data Science For Social Scientists Workshop
Data Science For Social Scientists WorkshopData Science For Social Scientists Workshop
Data Science For Social Scientists WorkshopIan Hopkinson
 
Watson: An Academic's Perspective
Watson: An Academic's PerspectiveWatson: An Academic's Perspective
Watson: An Academic's PerspectiveJames Hendler
 
Trustworthy Recommender Systems
Trustworthy Recommender SystemsTrustworthy Recommender Systems
Trustworthy Recommender SystemsWQ Fan
 

Was ist angesagt? (20)

Using Data Science for Social Good: Fighting Human Trafficking
Using Data Science for Social Good: Fighting Human TraffickingUsing Data Science for Social Good: Fighting Human Trafficking
Using Data Science for Social Good: Fighting Human Trafficking
 
Technology for everyone - AI ethics and Bias
Technology for everyone - AI ethics and BiasTechnology for everyone - AI ethics and Bias
Technology for everyone - AI ethics and Bias
 
Machine Learning for Non-technical People
Machine Learning for Non-technical PeopleMachine Learning for Non-technical People
Machine Learning for Non-technical People
 
How Developers Stay Current Using Twitter
How Developers Stay Current Using TwitterHow Developers Stay Current Using Twitter
How Developers Stay Current Using Twitter
 
Social Machines - 2017 Update (University of Iowa)
Social Machines - 2017 Update (University of Iowa)Social Machines - 2017 Update (University of Iowa)
Social Machines - 2017 Update (University of Iowa)
 
After the Pandemic: Rethinking Developer Productivity (There’s more to it th...
After the Pandemic:  Rethinking Developer Productivity (There’s more to it th...After the Pandemic:  Rethinking Developer Productivity (There’s more to it th...
After the Pandemic: Rethinking Developer Productivity (There’s more to it th...
 
Chapter1 introduction
Chapter1 introductionChapter1 introduction
Chapter1 introduction
 
How to use Big Data to drive product strategy and adoption
How to use Big Data to drive product strategy and adoptionHow to use Big Data to drive product strategy and adoption
How to use Big Data to drive product strategy and adoption
 
Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...
Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...
Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...
 
Cyber securityeducation may2015
Cyber securityeducation may2015Cyber securityeducation may2015
Cyber securityeducation may2015
 
Dm sei-tutorial-v7
Dm sei-tutorial-v7Dm sei-tutorial-v7
Dm sei-tutorial-v7
 
Adventures in Crowdsourcing : Toward Safer Content Moderation & Better Suppor...
Adventures in Crowdsourcing : Toward Safer Content Moderation & Better Suppor...Adventures in Crowdsourcing : Toward Safer Content Moderation & Better Suppor...
Adventures in Crowdsourcing : Toward Safer Content Moderation & Better Suppor...
 
The (R)evolution of Social Media in Software Engineering
The (R)evolution of Social Media in Software EngineeringThe (R)evolution of Social Media in Software Engineering
The (R)evolution of Social Media in Software Engineering
 
Key Challenges in Moderating Social Media: Accuracy, Cost, Scalability, and S...
Key Challenges in Moderating Social Media: Accuracy, Cost, Scalability, and S...Key Challenges in Moderating Social Media: Accuracy, Cost, Scalability, and S...
Key Challenges in Moderating Social Media: Accuracy, Cost, Scalability, and S...
 
Fairness in Machine Learning @Codemotion
Fairness in Machine Learning @CodemotionFairness in Machine Learning @Codemotion
Fairness in Machine Learning @Codemotion
 
Believe it or not: Designing a Human-AI Partnership for Mixed-Initiative Fact...
Believe it or not: Designing a Human-AI Partnership for Mixed-Initiative Fact...Believe it or not: Designing a Human-AI Partnership for Mixed-Initiative Fact...
Believe it or not: Designing a Human-AI Partnership for Mixed-Initiative Fact...
 
A Pragmatic Perspective on Software Visualization
A Pragmatic Perspective on Software VisualizationA Pragmatic Perspective on Software Visualization
A Pragmatic Perspective on Software Visualization
 
Data Science For Social Scientists Workshop
Data Science For Social Scientists WorkshopData Science For Social Scientists Workshop
Data Science For Social Scientists Workshop
 
Watson: An Academic's Perspective
Watson: An Academic's PerspectiveWatson: An Academic's Perspective
Watson: An Academic's Perspective
 
Trustworthy Recommender Systems
Trustworthy Recommender SystemsTrustworthy Recommender Systems
Trustworthy Recommender Systems
 

Ähnlich wie How machines learn to talk. Machine Learning for Conversational AI

Immersive Recommendation Workshop, NYC Media Lab'17
Immersive Recommendation Workshop, NYC Media Lab'17Immersive Recommendation Workshop, NYC Media Lab'17
Immersive Recommendation Workshop, NYC Media Lab'17Longqi Yang
 
Big, Open, Data and Semantics for Real-World Application Near You
Big, Open, Data and Semantics for Real-World Application Near YouBig, Open, Data and Semantics for Real-World Application Near You
Big, Open, Data and Semantics for Real-World Application Near YouBiplav Srivastava
 
16-nlp (2).ppt
16-nlp (2).ppt16-nlp (2).ppt
16-nlp (2).ppttestbest6
 
Communication between open source developers
Communication between open source developersCommunication between open source developers
Communication between open source developersAlexander Serebrenik
 
On serendipity in recommender systems - Haifa RecSoc workshop june 2015
On serendipity in recommender systems - Haifa RecSoc workshop june 2015On serendipity in recommender systems - Haifa RecSoc workshop june 2015
On serendipity in recommender systems - Haifa RecSoc workshop june 2015Giovanni Semeraro
 
Session 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectSession 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectbodaceacat
 
Session 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectSession 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectSara-Jayne Terp
 
Data Science Master Specialisation
Data Science Master SpecialisationData Science Master Specialisation
Data Science Master SpecialisationArjen de Vries
 
The Generation Game He Forum
The Generation Game He ForumThe Generation Game He Forum
The Generation Game He ForumHAROLDFRICKER
 
Machines are people too
Machines are people tooMachines are people too
Machines are people tooPaul Groth
 
Guelph public presentation
Guelph public presentationGuelph public presentation
Guelph public presentationStephen Abram
 
Pugh sikm leaders conversational ai 190618
Pugh sikm leaders conversational ai 190618Pugh sikm leaders conversational ai 190618
Pugh sikm leaders conversational ai 190618Katrina (Kate) Pugh
 
Augmented Social Innovation
Augmented Social InnovationAugmented Social Innovation
Augmented Social InnovationAshwin Ram
 
Wimmics Research Team 2015 Activity Report
Wimmics Research Team 2015 Activity ReportWimmics Research Team 2015 Activity Report
Wimmics Research Team 2015 Activity ReportFabien Gandon
 
Sla 2016 presentation
Sla 2016 presentationSla 2016 presentation
Sla 2016 presentationStephen Abram
 
Midwest km pugh conversational ai and ai for conversation 190809
Midwest km pugh conversational ai and ai for conversation 190809Midwest km pugh conversational ai and ai for conversation 190809
Midwest km pugh conversational ai and ai for conversation 190809Katrina (Kate) Pugh
 
Data visualization for development
Data visualization for developmentData visualization for development
Data visualization for developmentSara-Jayne Terp
 

Ähnlich wie How machines learn to talk. Machine Learning for Conversational AI (20)

Immersive Recommendation Workshop, NYC Media Lab'17
Immersive Recommendation Workshop, NYC Media Lab'17Immersive Recommendation Workshop, NYC Media Lab'17
Immersive Recommendation Workshop, NYC Media Lab'17
 
Big, Open, Data and Semantics for Real-World Application Near You
Big, Open, Data and Semantics for Real-World Application Near YouBig, Open, Data and Semantics for Real-World Application Near You
Big, Open, Data and Semantics for Real-World Application Near You
 
16-nlp (2).ppt
16-nlp (2).ppt16-nlp (2).ppt
16-nlp (2).ppt
 
Communication between open source developers
Communication between open source developersCommunication between open source developers
Communication between open source developers
 
On serendipity in recommender systems - Haifa RecSoc workshop june 2015
On serendipity in recommender systems - Haifa RecSoc workshop june 2015On serendipity in recommender systems - Haifa RecSoc workshop june 2015
On serendipity in recommender systems - Haifa RecSoc workshop june 2015
 
Session 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectSession 01 designing and scoping a data science project
Session 01 designing and scoping a data science project
 
Session 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectSession 01 designing and scoping a data science project
Session 01 designing and scoping a data science project
 
Data Science Master Specialisation
Data Science Master SpecialisationData Science Master Specialisation
Data Science Master Specialisation
 
The Generation Game He Forum
The Generation Game He ForumThe Generation Game He Forum
The Generation Game He Forum
 
Machines are people too
Machines are people tooMachines are people too
Machines are people too
 
Guelph public presentation
Guelph public presentationGuelph public presentation
Guelph public presentation
 
The New e-Science
The New e-ScienceThe New e-Science
The New e-Science
 
Pugh sikm leaders conversational ai 190618
Pugh sikm leaders conversational ai 190618Pugh sikm leaders conversational ai 190618
Pugh sikm leaders conversational ai 190618
 
Diversity and Inclusion
Diversity and InclusionDiversity and Inclusion
Diversity and Inclusion
 
Augmented Social Innovation
Augmented Social InnovationAugmented Social Innovation
Augmented Social Innovation
 
Wimmics Research Team 2015 Activity Report
Wimmics Research Team 2015 Activity ReportWimmics Research Team 2015 Activity Report
Wimmics Research Team 2015 Activity Report
 
Sla 2016 presentation
Sla 2016 presentationSla 2016 presentation
Sla 2016 presentation
 
Maine directors
Maine directorsMaine directors
Maine directors
 
Midwest km pugh conversational ai and ai for conversation 190809
Midwest km pugh conversational ai and ai for conversation 190809Midwest km pugh conversational ai and ai for conversation 190809
Midwest km pugh conversational ai and ai for conversation 190809
 
Data visualization for development
Data visualization for developmentData visualization for development
Data visualization for development
 

Kürzlich hochgeladen

Alternative system of medicine herbal drug technology syllabus
Alternative system of medicine herbal drug technology syllabusAlternative system of medicine herbal drug technology syllabus
Alternative system of medicine herbal drug technology syllabusPradnya Wadekar
 
World Water Day 22 March 2024 - kiyorndlab
World Water Day 22 March 2024 - kiyorndlabWorld Water Day 22 March 2024 - kiyorndlab
World Water Day 22 March 2024 - kiyorndlabkiyorndlab
 
THE HISTOLOGY OF THE CARDIOVASCULAR SYSTEM 2024.pptx
THE HISTOLOGY OF THE CARDIOVASCULAR SYSTEM 2024.pptxTHE HISTOLOGY OF THE CARDIOVASCULAR SYSTEM 2024.pptx
THE HISTOLOGY OF THE CARDIOVASCULAR SYSTEM 2024.pptxAkinrotimiOluwadunsi
 
IB Biology New syllabus B3.2 Transport.pptx
IB Biology New syllabus B3.2 Transport.pptxIB Biology New syllabus B3.2 Transport.pptx
IB Biology New syllabus B3.2 Transport.pptxUalikhanKalkhojayev1
 
Pests of wheat_Identification, Bionomics, Damage symptoms, IPM_Dr.UPR.pdf
Pests of wheat_Identification, Bionomics, Damage symptoms, IPM_Dr.UPR.pdfPests of wheat_Identification, Bionomics, Damage symptoms, IPM_Dr.UPR.pdf
Pests of wheat_Identification, Bionomics, Damage symptoms, IPM_Dr.UPR.pdfPirithiRaju
 
An intro to explainable AI for polar climate science
An intro to  explainable AI for  polar climate scienceAn intro to  explainable AI for  polar climate science
An intro to explainable AI for polar climate scienceZachary Labe
 
SCIENCE 6 QUARTER 3 REVIEWER(FRICTION, GRAVITY, ENERGY AND SPEED).pptx
SCIENCE 6 QUARTER 3 REVIEWER(FRICTION, GRAVITY, ENERGY AND SPEED).pptxSCIENCE 6 QUARTER 3 REVIEWER(FRICTION, GRAVITY, ENERGY AND SPEED).pptx
SCIENCE 6 QUARTER 3 REVIEWER(FRICTION, GRAVITY, ENERGY AND SPEED).pptxROVELYNEDELUNA3
 
Main Exam Applied biochemistry final year
Main Exam Applied biochemistry final yearMain Exam Applied biochemistry final year
Main Exam Applied biochemistry final yearmarwaahmad357
 
Excavation Methods in Archaeological Research & Studies
Excavation Methods in Archaeological Research &  StudiesExcavation Methods in Archaeological Research &  Studies
Excavation Methods in Archaeological Research & StudiesPrachya Adhyayan
 
Substances in Common Use for Shahu College Screening Test
Substances in Common Use for Shahu College Screening TestSubstances in Common Use for Shahu College Screening Test
Substances in Common Use for Shahu College Screening TestAkashDTejwani
 
Role of Herbs in Cosmetics in Cosmetic Science.
Role of Herbs in Cosmetics in Cosmetic Science.Role of Herbs in Cosmetics in Cosmetic Science.
Role of Herbs in Cosmetics in Cosmetic Science.ShwetaHattimare
 
Pests of Redgram_Identification, Binomics_Dr.UPR
Pests of Redgram_Identification, Binomics_Dr.UPRPests of Redgram_Identification, Binomics_Dr.UPR
Pests of Redgram_Identification, Binomics_Dr.UPRPirithiRaju
 
TORSION IN GASTROPODS- Anatomical event (Zoology)
TORSION IN GASTROPODS- Anatomical event (Zoology)TORSION IN GASTROPODS- Anatomical event (Zoology)
TORSION IN GASTROPODS- Anatomical event (Zoology)chatterjeesoumili50
 
geometric quantization on coadjoint orbits
geometric quantization on coadjoint orbitsgeometric quantization on coadjoint orbits
geometric quantization on coadjoint orbitsHassan Jolany
 
Genomics and Bioinformatics basics from genome to phenome
Genomics and Bioinformatics basics from genome to phenomeGenomics and Bioinformatics basics from genome to phenome
Genomics and Bioinformatics basics from genome to phenomeAjay Kumar Mahato
 
Unit 3, Herbal Drug Technology, B Pharmacy 6th Sem
Unit 3, Herbal Drug Technology, B Pharmacy 6th SemUnit 3, Herbal Drug Technology, B Pharmacy 6th Sem
Unit 3, Herbal Drug Technology, B Pharmacy 6th SemHUHam1
 
PSP3 employability assessment form .docx
PSP3 employability assessment form .docxPSP3 employability assessment form .docx
PSP3 employability assessment form .docxmarwaahmad357
 
M.Pharm - Question Bank - Drug Delivery Systems
M.Pharm - Question Bank - Drug Delivery SystemsM.Pharm - Question Bank - Drug Delivery Systems
M.Pharm - Question Bank - Drug Delivery SystemsSumathi Arumugam
 
Pests of cumbu_Identification, Binomics, Integrated ManagementDr.UPR.pdf
Pests of cumbu_Identification, Binomics, Integrated ManagementDr.UPR.pdfPests of cumbu_Identification, Binomics, Integrated ManagementDr.UPR.pdf
Pests of cumbu_Identification, Binomics, Integrated ManagementDr.UPR.pdfPirithiRaju
 
KeyBio pipeline for bioinformatics and data science
KeyBio pipeline for bioinformatics and data scienceKeyBio pipeline for bioinformatics and data science
KeyBio pipeline for bioinformatics and data scienceLayne Sadler
 

Kürzlich hochgeladen (20)

Alternative system of medicine herbal drug technology syllabus
Alternative system of medicine herbal drug technology syllabusAlternative system of medicine herbal drug technology syllabus
Alternative system of medicine herbal drug technology syllabus
 
World Water Day 22 March 2024 - kiyorndlab
World Water Day 22 March 2024 - kiyorndlabWorld Water Day 22 March 2024 - kiyorndlab
World Water Day 22 March 2024 - kiyorndlab
 
THE HISTOLOGY OF THE CARDIOVASCULAR SYSTEM 2024.pptx
THE HISTOLOGY OF THE CARDIOVASCULAR SYSTEM 2024.pptxTHE HISTOLOGY OF THE CARDIOVASCULAR SYSTEM 2024.pptx
THE HISTOLOGY OF THE CARDIOVASCULAR SYSTEM 2024.pptx
 
IB Biology New syllabus B3.2 Transport.pptx
IB Biology New syllabus B3.2 Transport.pptxIB Biology New syllabus B3.2 Transport.pptx
IB Biology New syllabus B3.2 Transport.pptx
 
Pests of wheat_Identification, Bionomics, Damage symptoms, IPM_Dr.UPR.pdf
Pests of wheat_Identification, Bionomics, Damage symptoms, IPM_Dr.UPR.pdfPests of wheat_Identification, Bionomics, Damage symptoms, IPM_Dr.UPR.pdf
Pests of wheat_Identification, Bionomics, Damage symptoms, IPM_Dr.UPR.pdf
 
An intro to explainable AI for polar climate science
An intro to  explainable AI for  polar climate scienceAn intro to  explainable AI for  polar climate science
An intro to explainable AI for polar climate science
 
SCIENCE 6 QUARTER 3 REVIEWER(FRICTION, GRAVITY, ENERGY AND SPEED).pptx
SCIENCE 6 QUARTER 3 REVIEWER(FRICTION, GRAVITY, ENERGY AND SPEED).pptxSCIENCE 6 QUARTER 3 REVIEWER(FRICTION, GRAVITY, ENERGY AND SPEED).pptx
SCIENCE 6 QUARTER 3 REVIEWER(FRICTION, GRAVITY, ENERGY AND SPEED).pptx
 
Main Exam Applied biochemistry final year
Main Exam Applied biochemistry final yearMain Exam Applied biochemistry final year
Main Exam Applied biochemistry final year
 
Excavation Methods in Archaeological Research & Studies
Excavation Methods in Archaeological Research &  StudiesExcavation Methods in Archaeological Research &  Studies
Excavation Methods in Archaeological Research & Studies
 
Substances in Common Use for Shahu College Screening Test
Substances in Common Use for Shahu College Screening TestSubstances in Common Use for Shahu College Screening Test
Substances in Common Use for Shahu College Screening Test
 
Role of Herbs in Cosmetics in Cosmetic Science.
Role of Herbs in Cosmetics in Cosmetic Science.Role of Herbs in Cosmetics in Cosmetic Science.
Role of Herbs in Cosmetics in Cosmetic Science.
 
Pests of Redgram_Identification, Binomics_Dr.UPR
Pests of Redgram_Identification, Binomics_Dr.UPRPests of Redgram_Identification, Binomics_Dr.UPR
Pests of Redgram_Identification, Binomics_Dr.UPR
 
TORSION IN GASTROPODS- Anatomical event (Zoology)
TORSION IN GASTROPODS- Anatomical event (Zoology)TORSION IN GASTROPODS- Anatomical event (Zoology)
TORSION IN GASTROPODS- Anatomical event (Zoology)
 
geometric quantization on coadjoint orbits
geometric quantization on coadjoint orbitsgeometric quantization on coadjoint orbits
geometric quantization on coadjoint orbits
 
Genomics and Bioinformatics basics from genome to phenome
Genomics and Bioinformatics basics from genome to phenomeGenomics and Bioinformatics basics from genome to phenome
Genomics and Bioinformatics basics from genome to phenome
 
Unit 3, Herbal Drug Technology, B Pharmacy 6th Sem
Unit 3, Herbal Drug Technology, B Pharmacy 6th SemUnit 3, Herbal Drug Technology, B Pharmacy 6th Sem
Unit 3, Herbal Drug Technology, B Pharmacy 6th Sem
 
PSP3 employability assessment form .docx
PSP3 employability assessment form .docxPSP3 employability assessment form .docx
PSP3 employability assessment form .docx
 
M.Pharm - Question Bank - Drug Delivery Systems
M.Pharm - Question Bank - Drug Delivery SystemsM.Pharm - Question Bank - Drug Delivery Systems
M.Pharm - Question Bank - Drug Delivery Systems
 
Pests of cumbu_Identification, Binomics, Integrated ManagementDr.UPR.pdf
Pests of cumbu_Identification, Binomics, Integrated ManagementDr.UPR.pdfPests of cumbu_Identification, Binomics, Integrated ManagementDr.UPR.pdf
Pests of cumbu_Identification, Binomics, Integrated ManagementDr.UPR.pdf
 
KeyBio pipeline for bioinformatics and data science
KeyBio pipeline for bioinformatics and data scienceKeyBio pipeline for bioinformatics and data science
KeyBio pipeline for bioinformatics and data science
 

How machines learn to talk. Machine Learning for Conversational AI

  • 1. How machines learn to talk. Machine Learning for Conversational AI Inaugural Lecture By Professor Verena Rieser
  • 2. Historical Notes Wolfgang Von Kempelen’s speaking machine (1791) Joseph Faber’s Marvelous Talking Machine (1840)
  • 4. Source: MIC Jan 2015. Market forecasts
  • 5. The (voice) bots are coming… “Bots are the new apps'' because they ”fundamentally revolutionize how computing is experienced by everybody.” Microsoft’s CEO Nardella
  • 7. Can we use machine learning for customer facing applications? Which machine learning methods are suitable? Will future machines speak neuralese? Machine Learning for Conversational AI Systems What do we learn when learning from “big data”?
  • 8. Machine Learning for Conversational AI • Task-driven Statistical Dialogue Systems – Reinforcement Learning – Results from the E2E Generation Challenge • Social Chatbots – Seq2Seq models – Amazon Alexa Challenge • Future challenges – Evaluation – Data – Ethics
  • 9. Spoken Dialogue System Architecture e.g. Rieser & Lemon, Comp. Ling. 2011, ACL’10,’08,’06 e.g. Rieser et al., ACL’05,’09,’10,’16 EMNLP’12,’15,’17,EACL’09,’ 14 e.g. Boidin & Rieser, Interspeech’09
  • 10. Rule-based approaches V. Rieser (MA thesis 2004): Hermine, the talking washing machine.* * Exhibited at CeBit 2003.
  • 11. Reinforcement Learning Qp (s,a) = Tss' a s' å [Rss' a +gVp (s')]; Bellmann optimality equation (1952), see [Sutton and Barto, 1998]. V. Rieser (PhD thesis 2008): Bootstrapping Reinforcement Learning-based Dialogue Strategies. *Winner of the Eduard-Martin Prize for outstanding research
  • 12. Drawbacks of RL for dialogue • Requires many training episodes. – Simulated users [Rieser & Lemon, 2006] • Manual specification of learning problem. – What is a good reward function/ state space representation? [Rieser & Lemon, 2008] • System outputs are usually hand-crafted. – Mismatch between “what to say” and “how to say it” [Rieser & Lemon, 2009]
  • 13. • Learn from “raw” dialogue data (e.g. movie subtitles). • No semantic or pragmatic annotation required. Input-output mapping End-to-End Response Generation
  • 14. Sequence-to-Sequence models e.g. Shang et al., 2015; Vinyals & Le, 2015; Sordoni et al., 2015 Image from farizrahman4u/seq2seq
  • 15. The E2E Data Set name [Loch Fyne], eatType[restaurant], food[Japanese], price[cheap], kid-friendly[yes] Serving low cost Japanese style cuisine, Loch Fyne caters for everyone, including families with small children. Loch Fyne is a child friendly restaurant serving cheap Japanese food. 50k DATA J. Novikova, O. Dusek and V. Rieser. The E2E Dataset: New Challenges For End-to-End Generation. 18th Annual SIGdial Meeting on Discourse and Dialogue (SIGDIAL 2017)* * Nominated for best paper award!
  • 16. The E2E NLG Challenge 2017 • Submissions: 62 systems with diverse system architectures by 17 institutions from 11 countries, with about 1/3 of these submissions coming from industry. http://www.macs.hw.ac.uk/InteractionLab/E2E/
  • 17. The E2E NLG Challenge 2017 Seq2Seq models vs. hand-engineered systems:  Natural sounding - Complexity, length, diversity. - Miss out on information. - Overall quality ratings by users.  Neural NLG systems tend to settle for the most frequent options, thus penalising length and favouring high-frequency word sequences.
  • 18. Machine Learning for Conversational AI • Task-driven Statistical Dialogue Systems – Reinforcement Learning – Results from the E2E Generation Challenge • Social Chatbots – Seq2Seq models – Amazon Alexa Challenge • Future challenges – Evaluation – Data – Ethics
  • 19. The Amazon Alexa Prize 2016-2018
  • 21. 21 AI vs. AI: Cleverbot (Carpenter 2011)
  • 22. Neural models for Alexa? • BIG training data. – Reddit, Twitter, Movie Subtitles, Daytime TV transcripts….. • Results: 2 2
  • 23. Is big data good data? 2 3 “I can sleep with as many people as I want to” (Reddit) “You will die” (Movies) “Shall I kill myself?” “Yes” (Twitter) “Shall I sell my stocks and shares?” “Sell, sell, sell” (Twitter)
  • 24. 24 Alana Architecture Bot Ensemble Persona: What’s your favourite food? I love bytes. News: Here is what happened to Donald Trump. (news) Facts: Did you know that one day Mars will have a ring. Wiki: Leonard Cohen’s latest album is called ‘You Want It Darker’. …. Neural Ranker Persona News Facts Wiki … User utterance, social signals, current plan, state of the world Dialogue history Multimodal output: • Speech • Actions • Gestures Chatbots User utterance
  • 25. 25 Avg duration: 2.30 mins 10% of calls over 10 mins avg: 14.4 turns Alexa developers
  • 26. 26
  • 27. Finalists vs rest 27 3 finalists: • Heriot-Watt University • University of Washington • Czech Technical University
  • 28. Final Leaderboard 28 Approx. 6000 conversations in final week
  • 29. Las Vegas final • 2 conversations x 3 testers = 6 conversations • Rated by external judges • Prague: “I want to talk about baseball” • UW: “I want to talk about basketball” • HWU: “I want to talk about ….. … " 29 Hi lie
  • 30. 30 (Amazon’s speech recogniser couldn’t recognise this….)
  • 32. Lessons learnt • Evaluation standards define the game. • Learning from big data is only valid in restricted contexts. • Getting LOTS of real customer data is worth it! – over 360k rated customer interactions
  • 33. Leaderboard 2018-05-15 For updates follow @alanathebot
  • 34. Machine Learning for Conversational AI • Task-driven Statistical Dialogue Systems – Reinforcement Learning – Results from the E2E Generation Challenge • Social Chatbots – Seq2Seq models – Amazon Alexa Challenge • Future challenges – Evaluation – Data – Ethics
  • 35. Disclaimer The following part of this talk contains examples which some listeners might find disturbing.
  • 36. Ethical Issues with Conversational AI • Learning from biased data. • Sexual abuse and bullying through the user.
  • 40. Pitfalls of learning from data XXXXX
  • 41. A Neural Conversational Model http://neuralconvo.huggingface.co/ A re-implementation of: Oriol Vinyals and Quoc V. Le (2015). A Neural Conversational Model. ICML Deep Learning Workshop. d* f*
  • 42. Ethical Conversational AI Systems Does learning from data introduce biases?
  • 43. Ethical Issues with Conversational AI • Learning from biased data. • Sexual abuse and bullying through the user. 4% of customer conversations with our Alexa bot contain sexual harassment!
  • 45. How do current systems behave when faced with abuse? What are good mitigation strategies? Ethical Conversational AI Systems Does learning from data introduce biases?
  • 46. • Approx. 4% of customer interactions in our corpus! • Fall in 4 categories as defined by Linguistic Society of America: “Are you gay?” (Gender and Sexuality) “I love watching porn.” (Sexualised Comments) “You stupid b***.” (Sexualised Insults) “Will you have sex with me.” (Sexual Requests)
  • 47. We insulted a lot of bots… • Commercial: – Amazon Alexa, Apple Siri, Google Home, Microsoft's Cortana. • Rule-based: – E.L.I.Z.A., Party. A.L.I.C.E, Alley • Data-driven: – Cleverbot, NeuralConvo, Information Retrieval (Ritter et al. 2010), – “clean” in-house seq2seq model • Negative Baseline: 6 Adult-only bots.
  • 48. How do different systems react? CommercialData-drivenAdult-only Flirtatious Chastising, Retaliation Non-sense Flirtatious Swearing back Avoiding to answer. Amanda Cercas Curry and Verena Rieser. How Ethical are Conversational Systems? Insights from the #MeTooAlexa Corpus on Sexual Harassment. 27th International Conference on Computational Linguistics (COLING), Santa Fe, New-Mexico, USA.
  • 49. Bias in the data? • Trained a seq2seq model on “clean” data. • Still encouraging/ flirting back. I love watching porn. What shows do you prefer?
  • 50. How do current systems behave when faced with abuse? What are good mitigation strategies? Ethical Conversational AI Systems Does learning from data introduce biases?
  • 51. Conclusion • Machine Learning methods for Conversational AI • Neural methods for task-based systems produce natural, but often incorrect output. • Neural methods for open-domain systems are hard to control. • How should a system deal with edge cases, such as abuse?
  • 52. Big thanks to my amazing team! Dr. Ondrej Dusek Dr. Simon Keizer Dr. Xingkun Liu Dr. Jekaterina Novikova Shubham Agarwal (PhD candidate) Amanda Cercas Curry (PhD candidate) Karin Sevegnani (PhD candidate) Xinnuo Xu (PhD candidate)
  • 54. … And my amazing husband! Prof. Oliver Lemon“Dr.” Kati
  • 55. Key References • Amanda Cercas Curry and Verena Rieser. #MeToo Alexa: How Conversational Systems Respond to Sexual Harassment. Second Workshop on Ethics in NLP. NAACL 2018. • Ioannis Papaioannou, Amanda Cercas Curry, Jose L. Part, Igor Shalyminov, Xinnuo Xu, Yanchao Yu, Ondrej Dušek, Verena Rieser, Oliver Lemon. An Ensemble Model with Ranking for Social Dialogue. In: NIPS workshop on Conversational AI, 2017. * Finalist in Amazon Alexa Challenge • Jekaterina Novikova, Ondrej Dusek and Verena Rieser. New Challenges For End-to- End Generation. 18th Annual SIGdial Meeting on Discourse and Dialogue (SIGDIAL), 2017 * Nominated for best paper. • Dimitra Gkatzia, Oliver Lemon and Verena Rieser. Natural Language Generation enhances human decision-making with uncertain information. Annual meeting of the Association for Computational Linguistics (ACL), 2016. • Eshrag Rafaee and Verena Rieser. A Hybrid Approach for Determining Sentiment Intensity of Arabic Twitter Phrases. 10th International Workshop on Semantic Evaluation (SemEval), 2016. * winner of SemEval'16 challenge task 7 • Verena Rieser, Oliver Lemon and Simon Keizer. Natural Language Generation as Incremental Planning Under Uncertainty: Adaptive Information Presentation for Statistical Dialogue Systems. IEEE/ACM Transactions on Audio, Speech and Language Processing, Volume 22, Issue 5, 2014. • Verena Rieser and Oliver Lemon. Reinforcement Learning for Adaptive Dialogue Systems: A Data-driven Methodology for Dialogue Management and Natural Language Generation. Book Series: Theory and Applications of Natural Language Processing, Springer, 2011. * >7,500 downloads
  • 56. Want to know more? • Study on our MSc on Conversational AI! • 2-year Conversion Course in AI – No prior knowledge in programming required! • 12 funded DataLab scholarships available. – Deadline: 31 May 2018 • Contact: MACSpgenquiries@hw.ac.uk
  • 58. Key References • Amanda Cercas Curry and Verena Rieser. #MeToo Alexa: How Conversational Systems Respond to Sexual Harassment. Second Workshop on Ethics in NLP. NAACL 2018. • Ioannis Papaioannou, Amanda Cercas Curry, Jose L. Part, Igor Shalyminov, Xinnuo Xu, Yanchao Yu, Ondrej Dušek, Verena Rieser, Oliver Lemon. An Ensemble Model with Ranking for Social Dialogue. In: NIPS workshop on Conversational AI, 2017. * Finalist in Amazon Alexa Challenge • Jekaterina Novikova, Ondrej Dusek and Verena Rieser. New Challenges For End-to- End Generation. 18th Annual SIGdial Meeting on Discourse and Dialogue (SIGDIAL), 2017 * Nominated for best paper. • Dimitra Gkatzia, Oliver Lemon and Verena Rieser. Natural Language Generation enhances human decision-making with uncertain information. Annual meeting of the Association for Computational Linguistics (ACL), 2016. • Eshrag Rafaee and Verena Rieser. A Hybrid Approach for Determining Sentiment Intensity of Arabic Twitter Phrases. 10th International Workshop on Semantic Evaluation (SemEval), 2016. * winner of SemEval'16 challenge task 7 • Verena Rieser, Oliver Lemon and Simon Keizer. Natural Language Generation as Incremental Planning Under Uncertainty: Adaptive Information Presentation for Statistical Dialogue Systems. IEEE/ACM Transactions on Audio, Speech and Language Processing, Volume 22, Issue 5, 2014. • Verena Rieser and Oliver Lemon. Reinforcement Learning for Adaptive Dialogue Systems: A Data-driven Methodology for Dialogue Management and Natural Language Generation. Book Series: Theory and Applications of Natural Language Processing, Springer, 2011. * >7,500 downloads