Practical Natural Language Processing
From Theory to Industrial Applications
Jaganadh G
http://jaganadhg.in
jaganadhg@gmail.com
IIIT-MK
Thiruvananthapuram
6th April 2013
Jaganadh G Practical Natural Language Processing
About me
Working as a Data Scientist at a Fortune 500 company
Working in Natural Language Processing, Machine
Learning, Data Mining, etc.
Passionate about Free and Open Source software :-)
In free time, teaches Python, speaks about FOSS,
and blogs at
http://jaganadhg.in
I am a computational linguist, linguist, and Indologist,
Book reviewer
Software Engineer by profession
Past to Future
Question ??
Have you ever used any Natural Language Processing based
tools/services?
What is Natural Language Processing (NLP) ?
Aim: To build intelligent systems that can interact with
human beings the way human beings do
A sub-field of Artificial Intelligence (AI)
Inter-disciplinary subject (Language + Linguistics +
Statistics + Computer Science + ...)
Natural Language
Refers to the language spoken by people, e.g.
English, Japanese, Tamil, Malayalam, as opposed to artificial
languages like C++, Java, etc.
Definition
Natural Language Processing
Natural Language Processing is a theoretically motivated range
of computational techniques for analyzing and representing
naturally occurring texts/speech at one or more levels of
linguistic analysis for the purpose of achieving human-like
language processing for a range of tasks or applications.
Some 10 to 20 years ago, NLP was considered a purely
academic discipline.
Now concepts from NLP are applied in a variety of
computing platforms and services
Practical NLP ?
Problem
Before going into theory, can we try solving some fun
practical problems?
Picture Courtesy: http://twitpic.com/1y21qm/full
Practical NLP
Problem
Tweet-a-Toddy receives thousands of tweets per day
Tweets requesting home delivery
Tweets about quality of products
Tweets related to enquiries
They require the following things to be automated
Identify tweet category
Process home-delivery request
Evaluate quality-related tweets
How?
How to find a solution for Tweet-a-Toddy?
Solution
Any solutions?
Some thoughts
Text Classification
Entity Identification
Information Extraction
Sentiment Analysis
Parsing, grammar ...
Regex (Regular Expressions)
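As a rough illustration of two of the ideas listed above (text classification and regex), incoming tweets could be routed to the three categories with keyword patterns. This is only a minimal sketch: the category names, the keyword lists, and the `categorize` function are assumptions made up for illustration and are not from the talk. A real system would more likely train a statistical classifier on labelled tweets.

```python
import re

# Hypothetical keyword patterns per tweet category (illustrative only).
CATEGORY_PATTERNS = {
    "home_delivery": re.compile(r"\b(deliver(y|ed)?|send|order)\b", re.IGNORECASE),
    "quality": re.compile(r"\b(taste[sd]?|quality|fresh|stale|sour)\b", re.IGNORECASE),
    "enquiry": re.compile(r"\b(price|cost|open|timings?|available)\b|\?", re.IGNORECASE),
}


def categorize(tweet: str) -> str:
    """Return the first category whose pattern matches the tweet, else 'other'."""
    for category, pattern in CATEGORY_PATTERNS.items():
        if pattern.search(tweet):
            return category
    return "other"


if __name__ == "__main__":
    samples = [
        "Please deliver two bottles to Kowdiar",
        "The toddy was stale yesterday",
        "What is the price per litre?",
    ]
    for tweet in samples:
        print(categorize(tweet), "<-", tweet)
```

Patterns are tried in dictionary order, so a tweet that mentions both delivery and quality lands in the first matching category; handling such overlaps is one reason a learned classifier tends to be preferred.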
Another Practical Question
Everybody might have used the spell checker available in word
processing systems like OpenOffice.org or Microsoft Word. Any
guess on how to develop a spell checker system?
Solutions
Word List
Structure of words
Dynamic Programming (Edit Distance)
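The word list and edit distance ideas above can be combined in a few lines. The sketch below computes Levenshtein distance with dynamic programming and uses it to rank suggestions from a word list. It assumes unit cost for insertion, deletion, and substitution; the `suggest` function, the `max_distance` threshold, and the tiny word list are hypothetical choices for illustration.

```python
def edit_distance(source: str, target: str) -> int:
    """Levenshtein distance: insert, delete, and substitute each cost 1."""
    rows, cols = len(source) + 1, len(target) + 1
    dist = [[0] * cols for _ in range(rows)]
    for i in range(rows):
        dist[i][0] = i  # delete all characters of source[:i]
    for j in range(cols):
        dist[0][j] = j  # insert all characters of target[:j]
    for i in range(1, rows):
        for j in range(1, cols):
            substitution = 0 if source[i - 1] == target[j - 1] else 1
            dist[i][j] = min(
                dist[i - 1][j] + 1,                 # deletion
                dist[i][j - 1] + 1,                 # insertion
                dist[i - 1][j - 1] + substitution,  # match or substitution
            )
    return dist[-1][-1]


def suggest(word, word_list, max_distance=2):
    """Return word-list entries within max_distance, closest first."""
    if word in word_list:
        return []  # the word is spelled correctly, nothing to suggest
    scored = sorted((edit_distance(word, candidate), candidate) for candidate in word_list)
    return [candidate for distance, candidate in scored if distance <= max_distance]


if __name__ == "__main__":
    # Tiny illustrative word list; a real checker would load a full dictionary.
    words = ["language", "linguistics", "processing", "natural", "speech"]
    print(suggest("langage", words))
```

Comparing against every word in the list is fine for a demonstration but slow for a full dictionary, where candidate generation or indexing would be needed.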
Another Practical Question ...
Context Sensitive Spell-checking
Identifying misspelled words and suggesting corrections based on context
How ??
Solutions
Statistical Models
Word category based suggestions
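One simple statistical model for this is a bigram count table: for a word that belongs to a set of easily confused words, pick the member that co-occurs most often with its neighbours. The sketch below is an assumption-laden toy, not the method from the talk: the corpus, the confusion set, and the function names are all invented for illustration.

```python
from collections import Counter

# Toy training corpus (hypothetical); real systems count over large corpora.
CORPUS = (
    "i want a piece of cake . we hope for peace in the world . "
    "a piece of paper . peace and quiet ."
).split()

# Sets of real words that are easily confused with each other (hypothetical).
CONFUSION_SETS = [{"piece", "peace"}]

# Count of each adjacent word pair in the corpus.
BIGRAMS = Counter(zip(CORPUS, CORPUS[1:]))


def best_in_context(prev_word, word, next_word):
    """Keep the word unless a confusable alternative fits the context strictly better."""
    def score(candidate):
        return BIGRAMS[(prev_word, candidate)] + BIGRAMS[(candidate, next_word)]

    best, best_score = word, score(word)
    for confusion_set in CONFUSION_SETS:
        if word in confusion_set:
            for candidate in sorted(confusion_set):
                if score(candidate) > best_score:
                    best, best_score = candidate, score(candidate)
    return best


def correct(sentence):
    """Correct each token using its left and right neighbours."""
    tokens = sentence.lower().split()
    padded = ["<s>"] + tokens + ["</s>"]  # sentence boundary markers
    return " ".join(
        best_in_context(padded[i - 1], padded[i], padded[i + 1])
        for i in range(1, len(padded) - 1)
    )


if __name__ == "__main__":
    print(correct("a peace of cake"))
    print(correct("we hope for piece in the world"))
```

Keeping the original word on ties avoids "correcting" text the model has never seen, at the cost of missing errors in unseen contexts.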
Can Machines Translate ??
Answer !!!
Why NLP ?
Because "Information is Power!"
Every day a vast amount of text and speech data is being
produced
Internet == at least 40 million pages
Picture Courtesy: http://soundsgood.in/wikipediafat print book/
History
Second World War !!!
Machine Translation
Now :
Most promising imperfect technology
Moves from Lab to Industry to Layman
NLP Really Hard to Achieve?
NLP deals with human languages
Human Language is dynamic and mysterious !!!
Communication in Human Language
NLP Really Hard to Achieve?
Levels of Knowledge encoding in Language Data
Tasks in NLP
Broad Areas
Text Processing
Speech Processing
Major tasks in Text Processing
Word Level Analysis
Morphological Synthesis
Part of Speech Tagging
Stemming
Lemmatization
Sentence Level Analysis - Syntactical Parsing
Discourse Analysis - Semantic Processing
Morphology
The branch of linguistics that studies word structures.
To a computer program a word is : ???
Morphological analysis can be explained as the process of
analyzing a word to identify its constituents
Computational Analysis of Morphology
Morphological Analysis
Morphological Generation
Stemming
Lemmatization
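The difference between stemming and lemmatization can be shown with a toy example. The sketch below is a deliberately crude suffix stripper plus a lookup table for irregular forms. It is not the Porter stemmer or any library's lemmatizer; the suffix list, the minimum stem length, and the irregular-form table are assumptions chosen for illustration.

```python
# Suffixes tried in this order; purely illustrative, English only.
SUFFIXES = ("ing", "ed", "es", "s")

# Hypothetical table of irregular forms and their dictionary forms (lemmas).
IRREGULAR_LEMMAS = {"went": "go", "better": "good", "mice": "mouse"}


def stem(word: str) -> str:
    """Strip the first matching suffix; the result may not be a real word."""
    for suffix in SUFFIXES:
        # Require at least three characters to remain after stripping.
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def lemmatize(word: str) -> str:
    """Look up irregular forms first, then fall back to the crude stemmer."""
    return IRREGULAR_LEMMAS.get(word, stem(word))


if __name__ == "__main__":
    for word in ["processing", "studies", "went", "mice"]:
        print(word, "-> stem:", stem(word), "| lemma:", lemmatize(word))
```

Stemming "studies" gives the non-word "studi", while the lemmatizer maps "went" to "go" only because the form is in its table. A real lemmatizer relies on a dictionary and morphological rules rather than a fallback to stemming.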
Practical Question from Morphology
Approximate number of word forms that can be derived from
the word "maram"?
Parts of Speech Tagging
POS tagging is the process of marking up the words in a text
(corpus) as corresponding to a particular part of speech, based
on both their definitions and their context.
Ram goes to school.
Ram/NNP goes/VBZ to/TO school/NN ./.
Words are ambiguous !!!!
e.g. book, cricket, bank
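To make the ambiguity concrete, here is a minimal sketch of a lexicon-based tagger that uses the previous tag to choose between a word's possible tags. The lexicon, the two context rules, and the function name `tag` are assumptions for illustration only. Practical taggers learn such preferences from annotated corpora.

```python
# Hypothetical lexicon: each word lists its possible Penn Treebank tags,
# with the default (most likely) tag first.
LEXICON = {
    "ram": ["NNP"], "goes": ["VBZ"], "to": ["TO"], "school": ["NN"], ".": ["."],
    "i": ["PRP"], "the": ["DT"], "a": ["DT"], "want": ["VBP"], "read": ["VBP"],
    "book": ["NN", "VB"], "ticket": ["NN"],
}

# Hypothetical context rules: tag preferred for an ambiguous word,
# given the tag of the previous word.
CONTEXT_PREFERENCE = {"DT": "NN", "TO": "VB"}


def tag(sentence: str):
    """Tag whitespace-separated tokens using the lexicon and the previous tag."""
    tagged = []
    previous_tag = None
    for token in sentence.split():
        candidates = LEXICON.get(token.lower(), ["NN"])  # unknown words default to noun
        preferred = CONTEXT_PREFERENCE.get(previous_tag)
        chosen = preferred if preferred in candidates else candidates[0]
        tagged.append((token, chosen))
        previous_tag = chosen
    return tagged


if __name__ == "__main__":
    for sentence in ["Ram goes to school .", "I want to book a ticket .", "I read the book ."]:
        print(" ".join(f"{word}/{pos}" for word, pos in tag(sentence)))
```

The word "book" is tagged as a verb after "to" and as a noun after "the", which is the kind of context dependence the definition above refers to.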
Syntactical Parsing
Parsing
In computer science and linguistics, parsing, or, more formally,
syntactic analysis, is the process of analyzing a text, made of a
sequence of tokens (for example, words), to determine its
grammatical structure with respect to a given (more or less)
formal grammar.
Sentences are ambiguous !!!!
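Sentence-level ambiguity can be demonstrated by counting how many parse trees a grammar assigns to a sentence. The sketch below uses the CYK algorithm over a small grammar in Chomsky normal form. The grammar, the lexicon, and the function name `count_parses` are assumptions invented for this example, not material from the talk.

```python
from collections import defaultdict

# Hypothetical toy grammar in Chomsky normal form: (parent, left child, right child).
BINARY_RULES = [
    ("S", "NP", "VP"),
    ("VP", "V", "NP"),
    ("VP", "VP", "PP"),   # prepositional phrase attaches to the verb phrase
    ("NP", "NP", "PP"),   # prepositional phrase attaches to the noun phrase
    ("NP", "Det", "N"),
    ("PP", "P", "NP"),
]

# Hypothetical lexicon mapping words to their categories.
LEXICAL_RULES = {
    "i": ["NP"], "saw": ["V"], "the": ["Det"],
    "man": ["N"], "telescope": ["N"], "with": ["P"],
}


def count_parses(sentence: str, start: str = "S") -> int:
    """Count distinct parse trees for the sentence using CYK."""
    tokens = sentence.lower().split()
    n = len(tokens)
    # chart[(i, j)][symbol] = number of trees deriving tokens[i:j] from symbol
    chart = defaultdict(lambda: defaultdict(int))
    for i, token in enumerate(tokens):
        for symbol in LEXICAL_RULES.get(token, []):
            chart[(i, i + 1)][symbol] += 1
    for width in range(2, n + 1):           # span length
        for i in range(0, n - width + 1):   # span start
            j = i + width
            for k in range(i + 1, j):       # split point
                for parent, left, right in BINARY_RULES:
                    chart[(i, j)][parent] += chart[(i, k)][left] * chart[(k, j)][right]
    return chart[(0, n)][start]


if __name__ == "__main__":
    print(count_parses("I saw the man"))
    print(count_parses("I saw the man with the telescope"))
```

With this grammar the second sentence gets two trees: one where "with the telescope" modifies the seeing, and one where it modifies the man.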
Semantics
Study of meaning and its structure
Word meaning is ambiguous !!!!
E.g. marriage
Where can I apply these techniques?
Machine Translation Systems
Search Engine
Spell-checker
Grammar Checker
..........
Other Interesting Tasks
Named Entity Identification
Information Extraction
Information Retrieval
Text Classification and Clustering
Speech Processing
Two Major Areas
Text to Speech
Speech Recognition
Practical Applications
IVR
Technology for Visually Challenged People
Mobile Phones
Speech Enabled Web
Vehicle Mounted GPS Navigator
Commercial NLP Applications
What Industry Looks For
Components of Word Processors
Machine Translation Systems
Custom Search Systems
Information Extraction
Entity Identification
Text Summarization
Speech Systems
Question Answering Systems
Future of NLP
Future!!!
Semantics oriented technologies
NLP in other domains
Bio-Medical
Legal
Forensic Science
Advertisement
Education
Politics
E-governance
Business Development
Marketing
and wherever we use language!
Natural Language Processing in India
Academic Institutions
IIT Kanpur, Kharagpur, Bombay
IIIT Hyderabad
IISc Bangalore
AU-KBC Chennai
Amrita University, Ettimadai, Coimbatore
IIIT-MK, Trivandrum
Central University, Hyderabad
JNU, Delhi
Tamil University, Thanjavur
Natural Language Processing in India
Industry
Microsoft
Yahoo!
AOL
365Media Pvt. Ltd.
Inside View
Thaazza
AIAIO Labs
Questions ??
References
Daniel Jurafsky and James H. Martin, Speech and Language Processing, 2nd Edition.
U.S. Tiwary and Tanveer Siddiqui, Natural Language Processing and Information Retrieval.
Finally
Jaganadh G Practical Natural Language Processing
Questions ??
Jaganadh G Practical Natural Language Processing
References
Daniel Jurafsky,James H. Martin, SPEECH and
LANGUAGE PROCESSING, 2nd
Edition.
U.S. Tiwary, Tanveer Siddiqui , Natural Language
Processing and Information Retrieval
Jaganadh G Practical Natural Language Processing
Finally
Jaganadh G Practical Natural Language Processing

Weitere ähnliche Inhalte

Was ist angesagt?

Can Social Media Analysis Improve Collective Awareness of Climate Change?
Can Social Media Analysis Improve Collective Awareness of Climate Change?Can Social Media Analysis Improve Collective Awareness of Climate Change?
Can Social Media Analysis Improve Collective Awareness of Climate Change?Diana Maynard
 
Tools for (Almost) Real-Time Social Media Analysis
Tools for (Almost) Real-Time Social Media AnalysisTools for (Almost) Real-Time Social Media Analysis
Tools for (Almost) Real-Time Social Media AnalysisDiana Maynard
 
Scaling Quality on Quora Using Machine Learning
Scaling Quality on Quora Using Machine LearningScaling Quality on Quora Using Machine Learning
Scaling Quality on Quora Using Machine LearningVo Viet Anh
 
Opinion Mining Tutorial (Sentiment Analysis)
Opinion Mining Tutorial (Sentiment Analysis)Opinion Mining Tutorial (Sentiment Analysis)
Opinion Mining Tutorial (Sentiment Analysis)Kavita Ganesan
 
Social media analytics as a service: tools from GATE
Social media analytics as a service: tools from GATESocial media analytics as a service: tools from GATE
Social media analytics as a service: tools from GATEDiana Maynard
 
TwitIE: An Open-Source Information Extraction Pipeline for Microblog Text
TwitIE: An Open-Source Information Extraction Pipeline for Microblog TextTwitIE: An Open-Source Information Extraction Pipeline for Microblog Text
TwitIE: An Open-Source Information Extraction Pipeline for Microblog TextLeon Derczynski
 

Was ist angesagt? (7)

Cls8 decarbonet
Cls8 decarbonetCls8 decarbonet
Cls8 decarbonet
 
Can Social Media Analysis Improve Collective Awareness of Climate Change?
Can Social Media Analysis Improve Collective Awareness of Climate Change?Can Social Media Analysis Improve Collective Awareness of Climate Change?
Can Social Media Analysis Improve Collective Awareness of Climate Change?
 
Tools for (Almost) Real-Time Social Media Analysis
Tools for (Almost) Real-Time Social Media AnalysisTools for (Almost) Real-Time Social Media Analysis
Tools for (Almost) Real-Time Social Media Analysis
 
Scaling Quality on Quora Using Machine Learning
Scaling Quality on Quora Using Machine LearningScaling Quality on Quora Using Machine Learning
Scaling Quality on Quora Using Machine Learning
 
Opinion Mining Tutorial (Sentiment Analysis)
Opinion Mining Tutorial (Sentiment Analysis)Opinion Mining Tutorial (Sentiment Analysis)
Opinion Mining Tutorial (Sentiment Analysis)
 
Social media analytics as a service: tools from GATE
Social media analytics as a service: tools from GATESocial media analytics as a service: tools from GATE
Social media analytics as a service: tools from GATE
 
TwitIE: An Open-Source Information Extraction Pipeline for Microblog Text
TwitIE: An Open-Source Information Extraction Pipeline for Microblog TextTwitIE: An Open-Source Information Extraction Pipeline for Microblog Text
TwitIE: An Open-Source Information Extraction Pipeline for Microblog Text
 

Ähnlich wie Practical Natural Language Processing

Natural Language Processing in Artificial Intelligence - Codeup #5 - PayU
Natural Language Processing in Artificial Intelligence  - Codeup #5 - PayU Natural Language Processing in Artificial Intelligence  - Codeup #5 - PayU
Natural Language Processing in Artificial Intelligence - Codeup #5 - PayU Artivatic.ai
 
ChatGPT in academic settings H2.de
ChatGPT in academic settings H2.deChatGPT in academic settings H2.de
ChatGPT in academic settings H2.deDavid Döring
 
Pycon India 2018 Natural Language Processing Workshop
Pycon India 2018   Natural Language Processing WorkshopPycon India 2018   Natural Language Processing Workshop
Pycon India 2018 Natural Language Processing WorkshopLakshya Sivaramakrishnan
 
AI生成工具的新衝擊 - MS Bing & Google Bard 能否挑戰ChatGPT-4領導地位
AI生成工具的新衝擊 - MS Bing & Google Bard 能否挑戰ChatGPT-4領導地位AI生成工具的新衝擊 - MS Bing & Google Bard 能否挑戰ChatGPT-4領導地位
AI生成工具的新衝擊 - MS Bing & Google Bard 能否挑戰ChatGPT-4領導地位eLearning Consortium 電子學習聯盟
 
Assistive Technology Reading and Writing Tools
Assistive Technology Reading and Writing ToolsAssistive Technology Reading and Writing Tools
Assistive Technology Reading and Writing ToolsATLA
 
Using AI for Learning.pptx
Using AI for Learning.pptxUsing AI for Learning.pptx
Using AI for Learning.pptxGDSCUOWMKDUPG
 
Scientists meet Entrepreneurs - AI & Machine Learning, Mark Fishel, Institute...
Scientists meet Entrepreneurs - AI & Machine Learning, Mark Fishel, Institute...Scientists meet Entrepreneurs - AI & Machine Learning, Mark Fishel, Institute...
Scientists meet Entrepreneurs - AI & Machine Learning, Mark Fishel, Institute...MobileMonday Estonia
 
Ontology development in protégé-آنتولوژی در پروتوغه
Ontology development in protégé-آنتولوژی در پروتوغهOntology development in protégé-آنتولوژی در پروتوغه
Ontology development in protégé-آنتولوژی در پروتوغهsadegh salehi
 
SpeakGlobal: The language software company
SpeakGlobal: The language software companySpeakGlobal: The language software company
SpeakGlobal: The language software companySpeakGlobal, Ltd.
 
Cole Napper: Are you ready for generative AI in people analytics?
Cole Napper: Are you ready for generative AI in people analytics?Cole Napper: Are you ready for generative AI in people analytics?
Cole Napper: Are you ready for generative AI in people analytics?Edunomica
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligenceiarthur
 
APznzaalselifJKjGQdTCA51cF7bldYdFMvDcshM8opKFZ_ZaIV-dqkiLoIKIfhz2tS6Fw5UBk25u...
APznzaalselifJKjGQdTCA51cF7bldYdFMvDcshM8opKFZ_ZaIV-dqkiLoIKIfhz2tS6Fw5UBk25u...APznzaalselifJKjGQdTCA51cF7bldYdFMvDcshM8opKFZ_ZaIV-dqkiLoIKIfhz2tS6Fw5UBk25u...
APznzaalselifJKjGQdTCA51cF7bldYdFMvDcshM8opKFZ_ZaIV-dqkiLoIKIfhz2tS6Fw5UBk25u...AishwaryaChemate
 
Natural Language Processing: L01 introduction
Natural Language Processing: L01 introductionNatural Language Processing: L01 introduction
Natural Language Processing: L01 introductionananth
 
Introduction to Sentiment Analysis
Introduction to Sentiment AnalysisIntroduction to Sentiment Analysis
Introduction to Sentiment AnalysisJaganadh Gopinadhan
 
Supporting Literacy For Free
Supporting Literacy For FreeSupporting Literacy For Free
Supporting Literacy For FreeJennifer Kraft
 
CSCE181 Big ideas in NLP
CSCE181 Big ideas in NLPCSCE181 Big ideas in NLP
CSCE181 Big ideas in NLPInsoo Chung
 
NOVA Data Science Meetup 1/19/2017 - Presentation 2
NOVA Data Science Meetup 1/19/2017 - Presentation 2NOVA Data Science Meetup 1/19/2017 - Presentation 2
NOVA Data Science Meetup 1/19/2017 - Presentation 2NOVA DATASCIENCE
 

Ähnlich wie Practical Natural Language Processing (20)

Natural Language Processing in Artificial Intelligence - Codeup #5 - PayU
Natural Language Processing in Artificial Intelligence  - Codeup #5 - PayU Natural Language Processing in Artificial Intelligence  - Codeup #5 - PayU
Natural Language Processing in Artificial Intelligence - Codeup #5 - PayU
 
Practical Machine Learning
Practical Machine Learning Practical Machine Learning
Practical Machine Learning
 
ChatGPT in academic settings H2.de
ChatGPT in academic settings H2.deChatGPT in academic settings H2.de
ChatGPT in academic settings H2.de
 
Pycon India 2018 Natural Language Processing Workshop
Pycon India 2018   Natural Language Processing WorkshopPycon India 2018   Natural Language Processing Workshop
Pycon India 2018 Natural Language Processing Workshop
 
AI生成工具的新衝擊 - MS Bing & Google Bard 能否挑戰ChatGPT-4領導地位
AI生成工具的新衝擊 - MS Bing & Google Bard 能否挑戰ChatGPT-4領導地位AI生成工具的新衝擊 - MS Bing & Google Bard 能否挑戰ChatGPT-4領導地位
AI生成工具的新衝擊 - MS Bing & Google Bard 能否挑戰ChatGPT-4領導地位
 
Assistive Technology Reading and Writing Tools
Assistive Technology Reading and Writing ToolsAssistive Technology Reading and Writing Tools
Assistive Technology Reading and Writing Tools
 
Using AI for Learning.pptx
Using AI for Learning.pptxUsing AI for Learning.pptx
Using AI for Learning.pptx
 
Scientists meet Entrepreneurs - AI & Machine Learning, Mark Fishel, Institute...
Scientists meet Entrepreneurs - AI & Machine Learning, Mark Fishel, Institute...Scientists meet Entrepreneurs - AI & Machine Learning, Mark Fishel, Institute...
Scientists meet Entrepreneurs - AI & Machine Learning, Mark Fishel, Institute...
 
Action on Google
Action on GoogleAction on Google
Action on Google
 
Ontology development in protégé-آنتولوژی در پروتوغه
Ontology development in protégé-آنتولوژی در پروتوغهOntology development in protégé-آنتولوژی در پروتوغه
Ontology development in protégé-آنتولوژی در پروتوغه
 
SpeakGlobal: The language software company
SpeakGlobal: The language software companySpeakGlobal: The language software company
SpeakGlobal: The language software company
 
Cole Napper: Are you ready for generative AI in people analytics?
Cole Napper: Are you ready for generative AI in people analytics?Cole Napper: Are you ready for generative AI in people analytics?
Cole Napper: Are you ready for generative AI in people analytics?
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligence
 
Introduction to NLTK
Introduction to NLTKIntroduction to NLTK
Introduction to NLTK
 
APznzaalselifJKjGQdTCA51cF7bldYdFMvDcshM8opKFZ_ZaIV-dqkiLoIKIfhz2tS6Fw5UBk25u...
APznzaalselifJKjGQdTCA51cF7bldYdFMvDcshM8opKFZ_ZaIV-dqkiLoIKIfhz2tS6Fw5UBk25u...APznzaalselifJKjGQdTCA51cF7bldYdFMvDcshM8opKFZ_ZaIV-dqkiLoIKIfhz2tS6Fw5UBk25u...
APznzaalselifJKjGQdTCA51cF7bldYdFMvDcshM8opKFZ_ZaIV-dqkiLoIKIfhz2tS6Fw5UBk25u...
 
Natural Language Processing: L01 introduction
Natural Language Processing: L01 introductionNatural Language Processing: L01 introduction
Natural Language Processing: L01 introduction
 
Introduction to Sentiment Analysis
Introduction to Sentiment AnalysisIntroduction to Sentiment Analysis
Introduction to Sentiment Analysis
 
Supporting Literacy For Free
Supporting Literacy For FreeSupporting Literacy For Free
Supporting Literacy For Free
 
CSCE181 Big ideas in NLP
CSCE181 Big ideas in NLPCSCE181 Big ideas in NLP
CSCE181 Big ideas in NLP
 
NOVA Data Science Meetup 1/19/2017 - Presentation 2
NOVA Data Science Meetup 1/19/2017 - Presentation 2NOVA Data Science Meetup 1/19/2017 - Presentation 2
NOVA Data Science Meetup 1/19/2017 - Presentation 2
 

Practical Natural Language Processing

  • 1. Practical Natural Language Processing From Theory to Industrial Applications Jaganadh G http://jaganadhg.in jaganadhg@gmail.com IIIT-MK Thiruvananthapuram 6th April 2013 Jaganadh G Practical Natural Language Processing
  • 2. About me: Working as a Data Scientist for a Fortune 500 company. Works in Natural Language Processing, Machine Learning, Data Mining, etc. Passionate about Free and Open Source :-) In free time, teaches Python, speaks about FOSS, and blogs at http://jaganadhg.in. Computational linguist, linguist and Indologist; book reviewer. Software engineer by profession.
  • 3. Past to Future
  • 4-6. Question: Have you ever used any Natural Language Processing based tools or services?
  • 7-10. What is Natural Language Processing (NLP)? Aim: to build intelligent systems that can interact with human beings the way human beings do. A sub-field of Artificial Intelligence (AI). An interdisciplinary subject (Language + Linguistics + Statistics + Computer Science + ...). Natural language refers to the language spoken by people, e.g. English, Japanese, Tamil, Malayalam, as opposed to artificial languages like C++, Java, etc.
  • 11. Definition: Natural Language Processing is a theoretically motivated range of computational techniques for analyzing and representing naturally occurring texts/speech at one or more levels of linguistic analysis for the purpose of achieving human-like language processing for a range of tasks or applications. Some 10 to 20 years ago NLP was considered an academic discipline. Now concepts from NLP are applied in a variety of computing platforms and services.
  • 12-14. Practical NLP? Problem: Before going into some theory, can we have some fun practical problems to solve? Picture courtesy: http://twitpic.com/1y21qm/full
  • 15-24. Practical NLP Problem: Tweet-a-Toddy receives thousands of tweets per day: tweets requesting home delivery, tweets about the quality of products, and tweets related to enquiries. They require the following to be automated: identify the tweet category, process home-delivery requests, and evaluate quality-related tweets. How do we find a solution for Tweet-a-Toddy?
  • 25-32. Solution? Any solutions? Some thoughts: Text Classification; Entity Identification; Information Extraction; Sentiment Analysis; Parsing, grammar, ...; Regex (Regular Expressions).
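As a rough illustration of the regex idea for the Tweet-a-Toddy problem, the sketch below assigns a tweet to one of the three categories by keyword patterns. The category names, the keyword lists, and the `categorize` helper are all assumptions made for this example, not part of the original talk; a real system would more likely train a text classifier on labelled tweets.

```python
import re

# Hypothetical keyword patterns for the three tweet categories on the slide.
CATEGORY_PATTERNS = {
    "home_delivery": re.compile(r"\b(deliver(y|ed)?|send|order)\b", re.IGNORECASE),
    "quality": re.compile(r"\b(taste[sd]?|quality|fresh|stale|sour)\b", re.IGNORECASE),
    "enquiry": re.compile(r"\b(price|cost|open|timings?|available)\b|\?", re.IGNORECASE),
}


def categorize(tweet):
    """Return the first category whose pattern matches, else 'other'."""
    for label, pattern in CATEGORY_PATTERNS.items():
        if pattern.search(tweet):
            return label
    return "other"


for tweet in ("Please deliver two bottles to my home",
              "The toddy was stale today",
              "What is the price of one litre?"):
    print(categorize(tweet), "<-", tweet)
```

Because the first matching pattern wins, the order of the dictionary entries acts as a priority list; that is a design shortcut of this sketch, and overlapping tweets would need a proper classifier.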
  • 33-36. Another Practical Question: Everybody might have used the spell checker available in word processing systems like OpenOffice.org or Microsoft Word. Any guess on how to develop a spell checker system? Solutions: Word List; Structure of words; Dynamic Programming (Edit Distance).
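The word list and edit distance ideas can be combined in a few lines. The following is a minimal sketch, assuming plain Levenshtein distance (insert, delete, substitute, each at cost 1) and a tiny made-up word list; the function names `edit_distance` and `suggest` are illustrative only.

```python
def edit_distance(source, target):
    """Levenshtein distance via dynamic programming, computed row by row."""
    previous = list(range(len(target) + 1))  # distance from "" to each prefix
    for i, s_char in enumerate(source, start=1):
        current = [i]  # distance from source[:i] to ""
        for j, t_char in enumerate(target, start=1):
            substitution = previous[j - 1] + (s_char != t_char)
            insertion = current[j - 1] + 1
            deletion = previous[j] + 1
            current.append(min(substitution, insertion, deletion))
        previous = current
    return previous[-1]


def suggest(word, word_list, max_distance=2):
    """Return words within max_distance edits, closest first."""
    scored = [(edit_distance(word, w), w) for w in word_list]
    return [w for d, w in sorted(scored) if d <= max_distance]


print(edit_distance("kitten", "sitting"))  # 3
print(suggest("speling", ["spelling", "language", "selling"], max_distance=1))
```

Scanning the whole word list is fine for a demonstration; production spell checkers typically also use knowledge of word structure to limit the candidates.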
  • 37-40. Another Practical Question ... Context-sensitive spell-checking: identifying and suggesting the spelling of words based on context. How? Solutions: Statistical Models; Word-category-based suggestions.
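One simple statistical model for context-sensitive suggestions is a bigram count: among candidate spellings, prefer the one seen most often after the preceding word. The sketch below assumes a four-sentence toy corpus invented for this example; `train_bigrams` and `pick_by_context` are hypothetical helper names.

```python
from collections import Counter


def train_bigrams(sentences):
    """Count adjacent word pairs in a list of tokenized sentences."""
    counts = Counter()
    for tokens in sentences:
        counts.update(zip(tokens, tokens[1:]))
    return counts


def pick_by_context(previous_word, candidates, bigram_counts):
    """Choose the candidate seen most often after previous_word."""
    return max(candidates, key=lambda w: bigram_counts[(previous_word, w)])


# Toy corpus, made up for illustration only.
corpus = [
    "a piece of cake".split(),
    "a piece of paper".split(),
    "war and peace".split(),
    "peace of mind".split(),
]
counts = train_bigrams(corpus)

print(pick_by_context("a", ["peace", "piece"], counts))
print(pick_by_context("and", ["peace", "piece"], counts))
```

With real data the counts would need smoothing, since most word pairs never occur in any training corpus.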
  • 41. Can Machines Translate? Answer!
  • 42-46. Why NLP? Because "Information is Power!" Every day a vast amount of text and speech data is being produced. Internet == at least 40 million pages. Picture courtesy: http://soundsgood.in/wikipediafat print book/
  • 47-52. History: Second World War; Machine Translation. Now: the most promising imperfect technology; moves from lab to industry to layman.
  • 53-54. NLP: Really Hard to Achieve? NLP deals with human languages. Human language is dynamic and mysterious! Communication in human language.
  • 55. NLP: Really Hard to Achieve? Levels of knowledge encoding in language data.
  • 56-58. Tasks in NLP: Broad areas: Text Processing; Speech Processing.
  • 59-66. Major tasks in Text Processing: Word Level Analysis; Morphological Synthesis; Part of Speech Tagging; Stemming; Lemmatization; Sentence Level Analysis - Syntactical Parsing; Discourse Analysis - Semantic Processing.
  • 67-74. Morphology: The branch of linguistics that studies word structures. To a computer program a word is: ??? Morphological analysis can be explained as the process of analyzing words to identify their constituents. Computational analysis of morphology: Morphological Analysis; Morphological Generation; Stemming; Lemmatization.
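To make the stemming item concrete, here is a deliberately naive suffix-stripping sketch. The suffix list and the three-character minimum are assumptions chosen for this example and cover only a few English endings; it is not the Porter stemmer or any library's algorithm.

```python
SUFFIXES = ("ing", "ed", "es", "s")  # hypothetical, English-only suffix list


def naive_stem(word):
    """Strip the first matching suffix, keeping at least three characters."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


for word in ("walking", "walked", "books", "boxes", "sing"):
    print(word, "->", naive_stem(word))
```

A stemmer like this only chops characters, so irregular forms are handled badly; lemmatization instead maps a word to its dictionary form, which usually needs a lexicon and part-of-speech information.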
  • 75. Practical Question from Morphology: the approximate number of word forms that can be derived from the word "maram".
  • 76-77. Parts of Speech Tagging: POS tagging is the process of marking up the words in a text (corpus) as corresponding to a particular part of speech, based on both their definition and their context. Example: "Ram goes to school." becomes Ram/NNP goes/VBZ to/TO school/NN ./. Words are ambiguous, e.g. book, cricket, bank.
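The word/TAG notation used in the example is easy to read programmatically, and doing so makes the ambiguity visible. The sketch below parses tagged text into pairs and collects every tag observed for each word; the two sentences containing "book" are invented for this example, and the helper names are hypothetical.

```python
from collections import defaultdict


def parse_tagged(text):
    """Split 'word/TAG' tokens into (word, tag) pairs."""
    return [tuple(token.rsplit("/", 1)) for token in text.split()]


def tags_per_word(tagged_sentences):
    """Collect the set of tags observed for each lower-cased word."""
    lexicon = defaultdict(set)
    for sentence in tagged_sentences:
        for word, tag in parse_tagged(sentence):
            lexicon[word.lower()].add(tag)
    return lexicon


print(parse_tagged("Ram/NNP goes/VBZ to/TO school/NN ./."))

# Two made-up sentences in which "book" is first a verb, then a noun.
lexicon = tags_per_word([
    "I/PRP book/VBP a/DT ticket/NN ./.",
    "The/DT book/NN is/VBZ new/JJ ./.",
])
print(sorted(lexicon["book"]))
```

A word that maps to more than one tag, as "book" does here, is exactly the case where a tagger has to look at context rather than at the word alone.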
  • 78-79. Syntactical Parsing: In computer science and linguistics, parsing, or, more formally, syntactic analysis, is the process of analyzing a text, made of a sequence of tokens (for example, words), to determine its grammatical structure with respect to a given (more or less) formal grammar. Sentences are ambiguous!
  • 80-81. Semantics: Study of meaning and its structure. Word meaning is ambiguous, e.g. marriage.
  • 82-86. Where can I apply these techniques? Machine Translation Systems; Search Engine; Spell-checker; Grammar Checker; ...
  • 87-90. Other Interesting Tasks: Named Entity Identification; Information Extraction; Information Retrieval; Text Classification and Clustering.
  • 92. Speech Processing. Two major areas: text to speech and speech recognition. Practical applications: IVR, technology for visually challenged people, mobile phones, speech-enabled web, vehicle-mounted GPS navigators.
  • 101. Commercial NLP Applications: What Industry Looks For. Components of word processors, machine translation systems, custom search systems, information extraction, entity identification, text summarization, speech systems, question answering systems.
  • 102. Future of NLP: semantics-oriented technologies.
  • 103. NLP in Other Domains: bio-medical, legal, forensic science, advertisement, education, politics, e-governance, business development, marketing, and wherever we use language.
  • 104. Natural Language Processing in India: Academic Institutions. IIT Kanpur, IIT Kharagpur, and IIT Bombay; IIIT Hyderabad; IISc Bangalore; AU-KBC, Chennai; Amrita University, Ettimadai, Coimbatore; IIITMK, Trivandrum; Central University, Hyderabad; JNU, Delhi; Tamil University, Thanjavur.
  • 105. Natural Language Processing in India: Industry. Microsoft, Yahoo!, AOL, 365Media Pvt. Ltd., Inside View, Thaazza, AIAIO Labs.
  • 106. Questions?
  • 107. References: Daniel Jurafsky and James H. Martin, Speech and Language Processing, 2nd Edition. U. S. Tiwary and Tanveer Siddiqui, Natural Language Processing and Information Retrieval.
  • 108. Finally