Lecture 1
Lecture 1
Some content in these slides was adapted from Prof. Rada Mihalcea
Natural?
Natural Language?
language spoken by people, e.g. English, Japanese, Swahili, as opposed to
artificial languages, like C++, Java, etc.
Natural Language Processing
Applications that deal with natural language in a way or another
Computational Linguistics
Doing linguistics on computers, more on the linguistic side than NLP, but closely
related
Why Natural Language Processing?
★ “ … language is what made us human” (Guy Deutscher)
★ Through language humans:
○ Pass on knowledge
○ Create new thoughts and ideas
○ Express deep (and not so deep) reflections
★ Practical value:
○ Companies want to know what consumers are saying
○ Intelligence communities want to know what persons of interest are planning
○ New products that use language as the interface with humans
★ Scientific value:
○ Gain a deeper understanding of how the human brain is able to process language
Why Natural Language Processing?
● We are constantly generating data
● Computer programs that can process this data require NLP expertise:
○ Classify text into categories
○ Index and search large texts
○ Automatic translation
○ Speech understanding (understand phone conversations)
○ Information extraction (extract useful information from resumes)
○ Automatic summarization (Condense 1 book into 1 page)
○ Question answering
○ Knowledge acquisition
○ Text generation / dialogues
Why Natural Language Processing?
do doable
like likeable
cool uncool
stable unstable
zip unzip
dress undress
Option 2:
Preposition attachment:
The storm destroyed at least 65 homes and damaged at least 460 more,
authorities said. Firefighters are continuing their painstaking work of
combing through the debris with heavy equipment and hand tools, aware more
bodies are likely buried beneath. At least four people remain missing.
How many deaths? 20? 65? 460? Where? California’s coastal road
http://web.stanford.edu/~jurafsky/slp3/
Please be sure to read the corresponding chapter before class!
Tentative schedule
Week ADMINISTRIVIA, INTRODUCTION Reading
2 Language models & Classification J&M Ch. 4th, 6th & 7th 3rd ed.
3 HMMs and POS tagging J&M Ch. 9 and 10th 3rd ed.
4 Vector Semantics and word embeddings J&M Ch. 15th and 16th 3rd ed.
5 Word Senses and HMMs J&M 18th and 8th 3rd ed.
6 Formal Grammars and Syntactic Parsing J&M Ch. 12. and 13th
Tentative schedule
Week ADMINISTRIVIA, INTRODUCTION Reading
7 Statistical Parsing and Dependency Parsing J&M Ch. 13th and 14th 3rd. Ed.