0% found this document useful (0 votes)

13 views41 pages

Probabilistic Language Modeling Basics

The document discusses probabilistic language models, focusing on assigning probabilities to sentences for applications like machine translation, spell correction, and speech recognition. It explains how to compute these probabilities using the Chain Rule and introduces the Markov Assumption to simplify calculations. Additionally, it covers estimating N-gram probabilities, specifically bigrams and trigrams, with examples of calculating sentence probabilities.

Uploaded by

spv12344321

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

13 views41 pages

Probabilistic Language Modeling Basics

Uploaded by

spv12344321

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Language Model

Probabilistic Language Models

• Today’s goal: assign a probability to a sentence
• Machine Translation:
• P(high winds tonite) > P(large winds tonite)
• Spell Correction
Why? • The office is about fifteen minuets from my house
• P(about fifteen minutes from) > P(about fifteen minuets from)
• Speech Recognition
• P(I saw a van) >> P(eyes awe of an)
• + Summarization, question-answering, etc., etc.!!
Probabilistic Language Modeling
• Goal: compute the probability of a sentence or sequence
of words:
P(W) = P(w1,w2,w3,w4,w5…wn)

• Related task: probability of an upcoming word:

P(w5|w1,w2,w3,w4)
• A model that computes either of these:
P(W) or P(wn|w1,w2…wn-1) is called a language model.
• Better: the grammar But language model or LM is standard
How to compute P(W)
• How to compute this joint probability:

• P(its, water, is, so, transparent, that)

• Intuition: let’s rely on the Chain Rule of Probability
Reminder: The Chain Rule
• Recall the definition of conditional probabilities
p(B|A) = P(A,B)/P(A) Rewriting: P(A,B) = P(A)P(B|A)

P(w1w2 … wn ) = Õ P(wi | w1w2 … wi-1 )

P(“its water is so transparent”) =

P(its) × P(water|its) × P(is|its water)
× P(so|its water is) × P(transparent|its water is
so)
How to estimate these probabilities
• Could we just count and divide?

P(the | its water is so transparent that) =

Count(its water is so transparent that the)
Count(its water is so transparent that)
• No! Too many possible sentences!
• We’ll never see enough data for estimating these
Markov Assumption

•Simplifying assumption:
Andrei Markov

P(the | its water is so transparent that) » P(the | that)

•Or maybe

P(the | its water is so transparent that) » P(the | transparent that)

Markov Assumption

P(w1w2 … wn ) » Õ P(wi | wi-k … wi-1 )

i
•In other words, we approximate each
component in the product

P(wi | w1w2 … wi-1) » P(wi | wi-k … wi-1)

Language
Modeling
Introduction to N-grams
Language
Modeling
Estimating N-gram
Probabilities
Estimating bigram probabilities
• The Maximum Likelihood Estimate

count(wi-1,wi )
P(wi | w i-1) =
count(w i-1 )

c(wi-1,wi )
P(wi | w i-1 ) =
c(wi-1)
Build the Bigram Model:
Assume our training corpus contains the following
sentences:

"I am happy"
"I am sad"
"I am excited"
"You are happy"
Bigrams: Unigrams:

("I", "am"): 3 "I": 3

("am", "happy"): 2 "am": 3
("am", "sad"): 1 "happy": 2
("am", "excited"): 1 "sad": 1
("You", "are"): 1 "excited": 1
("are", "happy"): 1 "You": 1
"are": 1
Calculate Bigram Probabilities:
Calculate the Probability of the Full Sentence:
When some type the words I am, the probability of next word
is Happy or Sad or Excited
Given the following trigram probabilities derived from a training corpus,
calculate the probability of the sentence "She is feeling very good."

Trigrams:
Unigrams:

("She", "is", "feeling"): 2 "She": 2

("is", "feeling", "very"): 1 "is": 2
("feeling", "very", "good"): 2 "feeling": 2
"very": 2
Bigrams: "good": 2
•("She", "is"): 2
•("is", "feeling"): 1
•("feeling", "very"): 1
•("very", "good"): 2
P("She is feeling very good")=P("feeling"∣"She is")×P("very"∣"is feeling")
×P("good"∣"feeling very")

=1.0×1.0×2.0 =2.0

The probability of the sentence "She is feeling very good" using the
trigram model is 2.0.
Problem Statement

Given the partial sentence "She is feeling", predict the most likely next word using the trigram model.
Trigram Model Data

From the training corpus, we have the following trigrams and their counts:

Trigrams:
("She", "is", "feeling"): 2
("is", "feeling", "very"): 1
("is", "feeling", "good"): 1
("feeling", "very", "good"): 2
("feeling", "good", "happy"): 1

Bigrams:
("She", "is"): 2
("is", "feeling"): 1
("feeling", "very"): 1
("feeling", "good"): 1
("good", "happy"): 1
𝐶𝑜𝑢𝑛𝑡(“𝑖𝑠 𝑓𝑒𝑒𝑙𝑖𝑛𝑔 𝑔𝑜𝑜𝑑”)
P(“good”|”is feeling”)= =1
𝐶𝑜𝑢𝑛𝑡(“𝑖𝑠 𝑓𝑒𝑒𝑙𝑖𝑛𝑔 ”)
Example Sentence: The Cat

Understanding Probabilistic Language Models
No ratings yet
Understanding Probabilistic Language Models
41 pages
N-gram Language Models in NLP
No ratings yet
N-gram Language Models in NLP
49 pages
N-gram Language Models Explained
No ratings yet
N-gram Language Models Explained
15 pages
NLP 5th Unit
No ratings yet
NLP 5th Unit
23 pages
NLP Module 3
No ratings yet
NLP Module 3
36 pages
Lec 04 22AIE315 NLP
No ratings yet
Lec 04 22AIE315 NLP
20 pages
N-gram Language Modeling Overview
No ratings yet
N-gram Language Modeling Overview
65 pages
Lecture 3 Language Model
No ratings yet
Lecture 3 Language Model
125 pages
Understanding N-grams in Language Modeling
No ratings yet
Understanding N-grams in Language Modeling
35 pages
UNIT 5 - N Gram Models - Complete
No ratings yet
UNIT 5 - N Gram Models - Complete
15 pages
N-Gram Models and Markov Assumption
No ratings yet
N-Gram Models and Markov Assumption
33 pages
Understanding Language Models in NLP
No ratings yet
Understanding Language Models in NLP
36 pages
N-grams in Statistical Language Models
No ratings yet
N-grams in Statistical Language Models
87 pages
N-Gram Models in NLP
No ratings yet
N-Gram Models in NLP
23 pages
N-Gram Models in NLP Explained
100% (1)
N-Gram Models in NLP Explained
4 pages
N-gram Models in Natural Language Processing
No ratings yet
N-gram Models in Natural Language Processing
37 pages
Understanding Language Models and HMMs
No ratings yet
Understanding Language Models and HMMs
34 pages
N-Gram Language Modeling Techniques
No ratings yet
N-Gram Language Modeling Techniques
54 pages
N-gram Language Modeling Overview
No ratings yet
N-gram Language Modeling Overview
84 pages
NLP Lecture 4 PDF
No ratings yet
NLP Lecture 4 PDF
26 pages
Introduction to N-grams in Language Models
No ratings yet
Introduction to N-grams in Language Models
13 pages
Lecture Recap: Language Models & N-Grams
No ratings yet
Lecture Recap: Language Models & N-Grams
41 pages
Understanding Statistical Language Models
No ratings yet
Understanding Statistical Language Models
41 pages
N-grams in Language Modeling Explained
No ratings yet
N-grams in Language Modeling Explained
70 pages
Understanding Language Models and N-Grams
No ratings yet
Understanding Language Models and N-Grams
26 pages
Understanding Language Modeling Basics
No ratings yet
Understanding Language Modeling Basics
4 pages
N-gram Language Model Overview
No ratings yet
N-gram Language Model Overview
75 pages
Understanding Language Models in NLP
No ratings yet
Understanding Language Models in NLP
59 pages
N-gram Language Modeling Overview
No ratings yet
N-gram Language Modeling Overview
75 pages
N-gram and HMM Language Models Explained
No ratings yet
N-gram and HMM Language Models Explained
60 pages
Understanding N-Grams and Probabilities
No ratings yet
Understanding N-Grams and Probabilities
13 pages
Understanding N-Gram Language Models
No ratings yet
Understanding N-Gram Language Models
79 pages
N-Gram Language Models Explained
No ratings yet
N-Gram Language Models Explained
21 pages
Language Modeling with N-grams and Smoothing
No ratings yet
Language Modeling with N-grams and Smoothing
8 pages
N-Gram Language Models: Random Sentence Generated From A Jane Austen Trigram Model
No ratings yet
N-Gram Language Models: Random Sentence Generated From A Jane Austen Trigram Model
28 pages
N-gram Language Model Overview
No ratings yet
N-gram Language Model Overview
52 pages
N-Gram Language Modeling Overview
No ratings yet
N-Gram Language Modeling Overview
27 pages
Understanding Language Models & N-Grams
No ratings yet
Understanding Language Models & N-Grams
48 pages
Introduction to N-grams in Language Modeling
No ratings yet
Introduction to N-grams in Language Modeling
77 pages
Language Modeling Techniques Overview
No ratings yet
Language Modeling Techniques Overview
63 pages
Language Modeling Techniques Overview
No ratings yet
Language Modeling Techniques Overview
35 pages
Understanding N-gram Language Models
No ratings yet
Understanding N-gram Language Models
33 pages
N-gram Language Modeling Explained
No ratings yet
N-gram Language Modeling Explained
82 pages
N-Gram Language Models
No ratings yet
N-Gram Language Models
86 pages
Understanding Discourse and Language Models
No ratings yet
Understanding Discourse and Language Models
16 pages
Understanding N-grams in Language Modeling
No ratings yet
Understanding N-grams in Language Modeling
69 pages
N-Gram Language Models: Random Sentence Generated From A Jane Austen Trigram Model
No ratings yet
N-Gram Language Models: Random Sentence Generated From A Jane Austen Trigram Model
28 pages
N-Gram Language Models: Random Sentence Generated From A Jane Austen Trigram Model
No ratings yet
N-Gram Language Models: Random Sentence Generated From A Jane Austen Trigram Model
28 pages
N-grams in Language Modeling Explained
No ratings yet
N-grams in Language Modeling Explained
27 pages
Understanding Statistical Language Models
No ratings yet
Understanding Statistical Language Models
56 pages
Understanding N-gram Language Models
No ratings yet
Understanding N-gram Language Models
3 pages
N-gram Language Model Overview
No ratings yet
N-gram Language Model Overview
75 pages
Introduction to N-gram Models
No ratings yet
Introduction to N-gram Models
76 pages
Lecture 5 N Gram Language Models
No ratings yet
Lecture 5 N Gram Language Models
44 pages
Unit 3 NLP
No ratings yet
Unit 3 NLP
93 pages
NLP Cat 2
No ratings yet
NLP Cat 2
78 pages
WINSEM2025-26 CSE3015 ETH AP2025264000647 2026-01-08 Reference-Material-I
No ratings yet
WINSEM2025-26 CSE3015 ETH AP2025264000647 2026-01-08 Reference-Material-I
51 pages
Beat the Competition: Quiz Insights
No ratings yet
Beat the Competition: Quiz Insights
116 pages
Download The Teacher App Now
No ratings yet
Download The Teacher App Now
30 pages
Comprehensive Exam Preparation Guide
No ratings yet
Comprehensive Exam Preparation Guide
11 pages
Semantic Networks vs. Frames in AI
No ratings yet
Semantic Networks vs. Frames in AI
22 pages
Problem Solving Techniques Using Search
No ratings yet
Problem Solving Techniques Using Search
102 pages
Knowledge-Based Agents in AI
No ratings yet
Knowledge-Based Agents in AI
17 pages
Ant Colony Optimization Overview
No ratings yet
Ant Colony Optimization Overview
27 pages
Project Consent Request Letter Template
No ratings yet
Project Consent Request Letter Template
1 page
Translation Studies and Colonialism Insights
No ratings yet
Translation Studies and Colonialism Insights
9 pages
p5 English Comprehensive Assessment Term II 2023-2024
No ratings yet
p5 English Comprehensive Assessment Term II 2023-2024
5 pages
Professional Profile of Jayamini Ruwanthi
No ratings yet
Professional Profile of Jayamini Ruwanthi
1 page
Understanding Types of Trusts
No ratings yet
Understanding Types of Trusts
31 pages
Indian Folk Dances and Cultural Heritage
No ratings yet
Indian Folk Dances and Cultural Heritage
10 pages
Understanding Modal Verbs in English
No ratings yet
Understanding Modal Verbs in English
7 pages
Didactics of Teaching English as a Foreign Language
No ratings yet
Didactics of Teaching English as a Foreign Language
101 pages
Laryngeal
No ratings yet
Laryngeal
29 pages
Present Perfect vs. Continuous Tense Guide
No ratings yet
Present Perfect vs. Continuous Tense Guide
10 pages
Myanmar Language and Script Overview
No ratings yet
Myanmar Language and Script Overview
13 pages
Skill and Performance Crossword Puzzle
No ratings yet
Skill and Performance Crossword Puzzle
1 page
Understanding Verbal Behavior Analysis
No ratings yet
Understanding Verbal Behavior Analysis
10 pages
Understanding Causative Verbs in English
No ratings yet
Understanding Causative Verbs in English
8 pages
Car Ownership Trends in Britain (1971-2007)
No ratings yet
Car Ownership Trends in Britain (1971-2007)
2 pages
Teaching Simple Present Grammar
No ratings yet
Teaching Simple Present Grammar
20 pages
Understanding Present Perfect Tenses
No ratings yet
Understanding Present Perfect Tenses
27 pages
Jayalalithaa: A Journey of Resilience
No ratings yet
Jayalalithaa: A Journey of Resilience
2 pages
Language A: Literature SL Course Overview
No ratings yet
Language A: Literature SL Course Overview
2 pages
Annie Griffiths: A Photographer's Journey
No ratings yet
Annie Griffiths: A Photographer's Journey
34 pages
One Word Substitutions Explained
No ratings yet
One Word Substitutions Explained
14 pages
Kiny S5 LKK Notes Final22
No ratings yet
Kiny S5 LKK Notes Final22
98 pages
The Application of Functional Linguistic Models Fo
No ratings yet
The Application of Functional Linguistic Models Fo
31 pages
Year 11 English Course Outline
No ratings yet
Year 11 English Course Outline
4 pages
Understanding Concord in English Grammar
No ratings yet
Understanding Concord in English Grammar
36 pages
Building a StatusStrip in Windows Forms
No ratings yet
Building a StatusStrip in Windows Forms
29 pages
HPGD2203 Educational Management Assignment
No ratings yet
HPGD2203 Educational Management Assignment
7 pages
Psellus' Dialogue on Daemons Explained
100% (2)
Psellus' Dialogue on Daemons Explained
51 pages
Latest IELTS General Writing Topics 2024
No ratings yet
Latest IELTS General Writing Topics 2024
1 page
Single Fillers for SBI Clerk Prelims
No ratings yet
Single Fillers for SBI Clerk Prelims
8 pages
Enhancing Speaking Skills with AV Material
No ratings yet
Enhancing Speaking Skills with AV Material
9 pages

Probabilistic Language Modeling Basics

Uploaded by

Probabilistic Language Modeling Basics

Uploaded by

Language Model

Probabilistic Language Models

• Related task: probability of an upcoming word:

• P(its, water, is, so, transparent, that)

P(w1w2 … wn ) = Õ P(wi | w1w2 … wi-1 )

P(“its water is so transparent”) =

P(the | its water is so transparent that) =

P(the | its water is so transparent that) » P(the | that)

P(the | its water is so transparent that) » P(the | transparent that)

P(w1w2 … wn ) » Õ P(wi | wi-k … wi-1 )

P(wi | w1w2 … wi-1) » P(wi | wi-k … wi-1)

("I", "am"): 3 "I": 3

("She", "is", "feeling"): 2 "She": 2

You might also like