+ All Categories
Home > Documents > Linguistics Computational - University of Washington · What is computational linguistics? Using...

Linguistics Computational - University of Washington · What is computational linguistics? Using...

Date post: 22-Sep-2020
Category:
Upload: others
View: 2 times
Download: 0 times
Share this document with a friend
69
Computational Linguistics by Caitlin Harding with material from Jurafsky and Martin
Transcript
Page 1: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Computational Linguistics

by Caitlin Hardingwith material from Jurafsky and Martin

Page 2: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

credit: xkcd, http://xkcd.com/114

Page 3: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

What is computational linguistics?

Page 4: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

What is computational linguistics?

� Using computers/computational methods to model and/or process natural language

Page 5: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

What is computational linguistics?

� Using computers/computational methods to model and/or process natural language

� Interdisciplinary� linguistics, computer science, artificial

intelligence, mathematics, logic, philosophy, cognitive science, psychology, neurology, etc.

Page 6: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

What does this look like in the real world?

Page 7: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

What does this look like in the real world?

SIRI

Page 8: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

What does this look like in the real world?

SIRI

chat bots (ELIZA)

Page 9: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

What does this look like in the real world?

SIRI Watsonchat bots (ELIZA)

Page 10: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

What does this look like in the real world?

SIRI Watson

Google translate

chat bots (ELIZA)

Page 11: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

What does this look like in the real world?

SIRI Watson

Google translate

chat bots (ELIZA)

Vocaloid

Page 12: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

What does this look like in the real world?

SIRI Watson

Google translate

chat bots (ELIZA)

Vocaloid

language analysis

Page 13: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

What does this look like in the real world?

SIRI Watson

Google translate

chat bots (ELIZA)

spam detection

Vocaloid

language analysis

Page 14: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

What does this look like in the real world?

SIRI Watson

Google translate

chat bots (ELIZA)

spam detection

Vocaloid

voice automated services

language analysis

Page 15: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

What does this look like in the real world?

SIRI Watson

Google translate

chat bots (ELIZA)

spam detection

Vocaloid

voice automated services

language analysis

robots

Page 16: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Let's break this down!

Page 17: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Let's break this down!

� Words

Page 18: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Let's break this down!

� Words� regex, automata, n-grams, part-of-speech (POS)

tagging

Page 19: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Let's break this down!

� Words� regex, automata, n-grams, part-of-speech (POS)

tagging

� Speech

Page 20: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Let's break this down!

� Words� regex, automata, n-grams, part-of-speech (POS)

tagging

� Speech� speech synthesis, voice recognition

Page 21: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Let's break this down!

� Words� regex, automata, n-grams, part-of-speech (POS)

tagging

� Speech� speech synthesis, voice recognition

� Syntax

Page 22: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Let's break this down!

� Words� regex, automata, n-grams, part-of-speech (POS)

tagging

� Speech� speech synthesis, voice recognition

� Syntax� grammars, syntactic parsing, statistical parsing

Page 23: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Let's break this down!

� Words� regex, automata, n-grams, part-of-speech (POS)

tagging

� Speech� speech synthesis, voice recognition

� Syntax� grammars, syntactic parsing, statistical parsing

� Semantics

Page 24: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Let's break this down!

� Words� regex, automata, n-grams, part-of-speech (POS)

tagging

� Speech� speech synthesis, voice recognition

� Syntax� grammars, syntactic parsing, statistical parsing

� Semantics� logic, word-sense disambiguation, word

relations, semantic role labeling

Page 25: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Words

Page 26: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Words� regex

Page 27: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Words� regex

� short for "regular expressions"

Page 28: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Words� regex

� short for "regular expressions"� used by a lot of search engines

Page 29: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Words� regex

� short for "regular expressions"� used by a lot of search engines

� automata

Page 30: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Words� regex

� short for "regular expressions"� used by a lot of search engines

� automata� finite state automata, finite state machines

Page 31: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Words� regex

� short for "regular expressions"� used by a lot of search engines

� automata� finite state automata, finite state machines� often used for modelling morphological processes

Page 32: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Words� regex

� short for "regular expressions"� used by a lot of search engines

� automata� finite state automata, finite state machines� often used for modelling morphological processes

� n-grams

Page 33: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Words� regex

� short for "regular expressions"� used by a lot of search engines

� automata� finite state automata, finite state machines� often used for modelling morphological processes

� n-grams� probability of a given word/character given the word(s)

/character(s) preceding it

Page 34: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Words� regex

� short for "regular expressions"� used by a lot of search engines

� automata� finite state automata, finite state machines� often used for modelling morphological processes

� n-grams� probability of a given word/character given the word(s)

/character(s) preceding it� unigram, bigram, trigram, etc.

Page 35: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Words� regex

� short for "regular expressions"� used by a lot of search engines

� automata� finite state automata, finite state machines� often used for modelling morphological processes

� n-grams� probability of a given word/character given the word(s)

/character(s) preceding it� unigram, bigram, trigram, etc.� spelling correction, language identification, etc.

Page 36: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Words� regex

� short for "regular expressions"� used by a lot of search engines

� automata� finite state automata, finite state machines� often used for modelling morphological processes

� n-grams� probability of a given word/character given the word(s)

/character(s) preceding it� unigram, bigram, trigram, etc.� spelling correction, language identification, etc.� Claude Shannon

Page 37: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Words� regex

� short for "regular expressions"� used by a lot of search engines

� automata� finite state automata, finite state machines� often used for modelling morphological processes

� n-grams� probability of a given word/character given the word(s)

/character(s) preceding it� unigram, bigram, trigram, etc.� spelling correction, language identification, etc.� Claude Shannon

� POS tagging

Page 38: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Words� regex

� short for "regular expressions"� used by a lot of search engines

� automata� finite state automata, finite state machines� often used for modelling morphological processes

� n-grams� probability of a given word/character given the word(s)

/character(s) preceding it� unigram, bigram, trigram, etc.� spelling correction, language identification, etc.� Claude Shannon

� POS tagging� rule-based or stochastic (statistical)

Page 39: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Speech

Page 40: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Speech

� Closer to hard sciences/lab work

Page 41: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Speech

� Closer to hard sciences/lab work� speech synthesis and text-to-speech

Page 42: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Speech

� Closer to hard sciences/lab work� speech synthesis and text-to-speech

� many stages

Page 43: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Speech

� Closer to hard sciences/lab work� speech synthesis and text-to-speech

� many stages� text -> segmented text -> phones (sounds) -> syllable

boundaries -> intonation, pitch, and length -> F0 (fundamental frequency)

Page 44: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Speech

� Closer to hard sciences/lab work� speech synthesis and text-to-speech

� many stages� text -> segmented text -> phones (sounds) -> syllable

boundaries -> intonation, pitch, and length -> F0 (fundamental frequency)

� Vocaloid

Page 45: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Speech

� Closer to hard sciences/lab work� speech synthesis and text-to-speech

� many stages� text -> segmented text -> phones (sounds) -> syllable

boundaries -> intonation, pitch, and length -> F0 (fundamental frequency)

� Vocaloid

� speech recognition

Page 46: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Syntax

Page 47: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Syntax

� grammar

Page 48: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Syntax

� grammar� model for a given language that is to be used by

a computer for language processing and/or generation

Page 49: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Syntax

� grammar� model for a given language that is to be used by

a computer for language processing and/or generation

� syntactic and statistical parsing

Page 50: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Syntax

� grammar� model for a given language that is to be used by

a computer for language processing and/or generation

� syntactic and statistical parsing� attempt to clarify structural ambiguities ("Mary

saw the man with binoculars.")

Page 51: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Syntax

� grammar� model for a given language that is to be used by

a computer for language processing and/or generation

� syntactic and statistical parsing� attempt to clarify structural ambiguities ("Mary

saw the man with binoculars.")� label roles, POS

Page 52: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Semantics

Page 53: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Semantics

� "Colorless green ideas sleep furiously."� syntactically correct� semantically incorrect - how can we determine that computationally?

Page 54: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Semantics

� "Colorless green ideas sleep furiously."� syntactically correct� semantically incorrect - how can we determine that computationally?

� logic� language used to talk about language

Page 55: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Semantics

� "Colorless green ideas sleep furiously."� syntactically correct� semantically incorrect - how can we determine that computationally?

� logic� language used to talk about language

� word-sense disambiguation� "I went fishing for some bass." vs "The bass in this song is awesome."

Page 56: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Semantics

� "Colorless green ideas sleep furiously."� syntactically correct� semantically incorrect - how can we determine that computationally?

� logic� language used to talk about language

� word-sense disambiguation� "I went fishing for some bass." vs "The bass in this song is awesome."

� word relations� synonyms, antonyms, etc.

Page 57: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Semantics

� "Colorless green ideas sleep furiously."� syntactically correct� semantically incorrect - how can we determine that computationally?

� logic� language used to talk about language

� word-sense disambiguation� "I went fishing for some bass." vs "The bass in this song is awesome."

� word relations� synonyms, antonyms, etc.

� semantic role labeling� "Mary gave the book to John." (Mary = agent, book = theme, John =

recipient)

Page 58: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Semantics

� "Colorless green ideas sleep furiously."� syntactically correct� semantically incorrect - how can we determine that computationally?

� logic� language used to talk about language

� word-sense disambiguation� "I went fishing for some bass." vs "The bass in this song is awesome."

� word relations� synonyms, antonyms, etc.

� semantic role labeling� "Mary gave the book to John." (Mary = agent, book = theme, John =

recipient)� General meaning

� sometimes becomes a philosophical question - what really is a table?� emotion - how do you determine if something is sad, happy, etc.?

Page 59: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

NLP

Page 60: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

NLP

� natural language processing

Page 61: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

NLP

� natural language processing� concerned with the interactions between

computers and natural human languages, specifically in extracting meaningful information from natural language or producing meaningful natural language

Page 62: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

NLP

� natural language processing� concerned with the interactions between

computers and natural human languages, specifically in extracting meaningful information from natural language or producing meaningful natural language

� related to artificial intelligence, machine learning, machine translation, etc.

Page 63: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

NLP

� natural language processing� concerned with the interactions between

computers and natural human languages, specifically in extracting meaningful information from natural language or producing meaningful natural language

� related to artificial intelligence, machine learning, machine translation, etc.

� deals with many open problems

Page 64: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Growing popularity

Page 65: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Growing popularity

� many open problems (problems that have not been completely solved or have room for improvements)

Page 66: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Growing popularity

� many open problems (problems that have not been completely solved or have room for improvements)

� a lot of interest in solving these problems, both academically and commercially - we live in the age of "big data"

Page 67: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Growing popularity

� many open problems (problems that have not been completely solved or have room for improvements)

� a lot of interest in solving these problems, both academically and commercially - we live in the age of "big data"

� UW CS department just hired four of the biggest names in machine learning and "big data"

Page 68: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

Growing popularity

� many open problems (problems that have not been completely solved or have room for improvements)

� a lot of interest in solving these problems, both academically and commercially - we live in the age of "big data"

� UW CS department just hired four of the biggest names in machine learning and "big data"

� need hand annotated data

Page 69: Linguistics Computational - University of Washington · What is computational linguistics? Using computers/computational methods to model and/or process natural language Interdisciplinary

I'm sold! Now what?

� CSE472/LING472 - Intro to Computational Linguistics

� UW Professional Master's in Computational Linguistics

� Coursera - Stanford's NLP Course� CS Department Website (research, talks, etc.)� Advisors


Recommended