Dynamic Programming Parsing in NLP

Dynamic programming parsing is an efficient method in NLP and compiler design that avoids redundant computations by storing intermediate results, allowing for polynomial-time parsing of context-free grammars. Shallow parsing identifies main constituents without building complete parse trees, while probabilistic context-free grammars (PCFGs) assign probabilities to parse rules, aiding in ambiguity resolution. Unification of feature structures combines linguistic constraints, ensuring agreement and consistency in grammar frameworks.

Uploaded by

thrimurthimasa

Dynamic programming parsing :

• Dynamic Programming (DP) parsing is a method used in natural language processing
(NLP) and compiler design to efficiently parse sentences or strings according to a
grammar, usually a context-free grammar (CFG). Instead of recomputing the parse of
every substring multiple times, DP parsing stores intermediate results and reuses
them, making it much more efficient than naïve recursive parsing.
• Naïve recursive approaches may repeatedly recompute whether the same substring can
be generated from a nonterminal. Dynamic programming avoids this redundancy by
breaking the input into smaller substrings, solving each subproblem once, and storing
results in a table.
• It avoids redundant computations (unlike naïve recursion).
• Handles ambiguity by storing multiple parses for the same span.
• Enables probabilistic parsing by attaching probabilities to entries in the table.
• Dynamic programming parsers (such as CKY or Earley) build up possible parses of
substrings in a table and reuse them to assemble parses of longer spans. This makes
context-free parsing polynomial-time (O(n^3) in the sentence length n for a fixed
grammar) instead of exponential.
• These notes go on to cover two related topics:
• 1) Shallow parsing
• 2) Probabilistic CFG
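
To make "solving each subproblem once" concrete, here is a minimal sketch of a memoized recognizer. The toy grammar, the names `BINARY`, `LEXICAL` and `recognize`, and the use of `functools.lru_cache` as the table are all assumptions made for illustration; they are not taken from the notes.

```python
from functools import lru_cache

# Toy grammar in Chomsky Normal Form (hypothetical, for illustration only).
BINARY = {            # A -> B C
    "S": [("NP", "VP")],
    "NP": [("Det", "N")],
    "VP": [("V", "NP")],
}
LEXICAL = {           # A -> word
    "Det": {"the"},
    "N": {"dog", "cat"},
    "V": {"chased"},
}

def recognize(words, start="S"):
    """Return True if `start` can derive the whole word sequence."""
    words = tuple(words)

    @lru_cache(maxsize=None)           # each (symbol, i, j) is solved only once
    def derives(symbol, i, j):
        if j - i == 1:                 # single word: check the lexical rules
            return words[i] in LEXICAL.get(symbol, set())
        for left, right in BINARY.get(symbol, []):
            for k in range(i + 1, j):  # try every split point of the span
                if derives(left, i, k) and derives(right, k, j):
                    return True
        return False

    return len(words) > 0 and derives(start, 0, len(words))

print(recognize("the dog chased the cat".split()))
```

Without the cache, the same (symbol, span) question can be asked many times; with it, the number of distinct subproblems is bounded by the number of symbols times the number of spans.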

Shallow parsing :
• Shallow parsing is a lightweight form of syntactic analysis that identifies the main
constituents of a sentence (like noun phrases and verb phrases) but does not build a
complete parse tree.
• Instead of analyzing the full hierarchical structure of the sentence (like deep parsing
does), shallow parsing just "chunks" words into meaningful groups.
Example
Sentence:
"The quick brown fox jumps over the lazy dog."
Deep Parsing → full syntax tree:
S
├── NP (The quick brown fox)
└── VP (jumps over the lazy dog)
    └── PP (over the lazy dog)
        └── NP (the lazy dog)

Shallow Parsing → chunks:

[NP The quick brown fox] [VP jumps] [PP over] [NP the lazy dog]
So shallow parsing doesn’t care about the full hierarchical relations, just about grouping
words into chunks.
Shallow parsing usually combines:
1. POS Tagging → assign part-of-speech tags to each word.
o "The/DT quick/JJ brown/JJ fox/NN jumps/VBZ over/IN the/DT lazy/JJ dog/NN"
2. Chunking Rules / Models → group tokens into chunks:
o Determiners + adjectives + nouns → NP
o Verbs (possibly with auxiliaries) → VP
o Prepositions → PP
These can be rule-based (using regular expressions over POS tags) or statistical (using
models such as HMMs, CRFs, or neural networks).
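
The rule-based variant can be sketched as regular expressions applied to the sequence of POS tags. This is a simplified illustration, not a production chunker: the three patterns, the "O" label for unchunked tokens, and the helper names `chunk` and `parse_tagged` are assumptions.

```python
import re

# Chunk patterns over a string of POS tags such as "<DT><JJ><NN>".
# These three rules are a simplified version of the rules listed above.
CHUNK_RULES = [
    ("NP", re.compile(r"(<DT>)?(<JJ>)*(<NN[A-Z]*>)+")),
    ("VP", re.compile(r"(<MD>)*(<VB[A-Z]*>)+")),
    ("PP", re.compile(r"<IN>")),
]

def parse_tagged(text):
    """Split 'word/TAG word/TAG ...' into (word, tag) pairs."""
    return [tuple(tok.rsplit("/", 1)) for tok in text.split()]

def chunk(tagged):
    """Group (word, tag) pairs into labelled chunks, scanning left to right."""
    tags = "".join(f"<{tag}>" for _, tag in tagged)
    chunks, pos, i = [], 0, 0              # pos: offset in `tags`, i: token index
    while i < len(tagged):
        for label, pattern in CHUNK_RULES:
            m = pattern.match(tags, pos)
            if m and m.end() > pos:
                n = m.group(0).count("<")  # number of tokens the rule consumed
                chunks.append((label, [w for w, _ in tagged[i:i + n]]))
                pos, i = m.end(), i + n
                break
        else:                              # no rule matched: token stays outside chunks
            chunks.append(("O", [tagged[i][0]]))
            pos += len(tagged[i][1]) + 2   # skip "<TAG>"
            i += 1
    return chunks

sentence = "The/DT quick/JJ brown/JJ fox/NN jumps/VBZ over/IN the/DT lazy/JJ dog/NN"
for label, words in chunk(parse_tagged(sentence)):
    print(f"[{label} {' '.join(words)}]")
```

On the tagged sentence above this yields the same four chunks as the shallow-parsing example: NP, VP, PP, NP.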

Probabilistic CFG :
• A Probabilistic Context-Free Grammar (PCFG) is just like a regular context-free
grammar (CFG), but each production rule has a probability attached.
• This makes the grammar stochastic: instead of just telling us whether a sentence is
grammatical, it lets us assign probabilities to different possible parses.

Formal Definition
A PCFG is a 5-tuple:
G=(N,Σ,R,S,P)
• N: Set of nonterminals (e.g., S, NP, VP)
• Σ: Set of terminals (words/tokens, e.g., "dog", "runs")
• R: Set of production rules (e.g., S → NP VP)
• S: Start symbol (e.g., S)
• P: Probability distribution over rules (for each nonterminal, the probabilities of all rules that expand it sum to 1)

Example
Sentence: "the dog chased the cat"
Grammar:

S → NP VP [1.0]
NP → Det N [0.6]
NP → "the dog" [0.4]
VP → V NP [1.0]
Det → "the" [1.0]
N → "dog" [0.5]
N → "cat" [0.5]
V → "chased" [1.0]
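
Under a PCFG, the probability of a parse tree is the product of the probabilities of the rules used to build it. The sketch below applies this to the grammar above, which allows two parses of the sentence. The tuple encoding of trees and the name `tree_prob` are assumptions for illustration, and the multi-word string "the dog" is treated as a single terminal because the grammar lists it that way.

```python
from math import prod

# Rule probabilities copied from the example grammar above.
RULE_PROB = {
    ("S", ("NP", "VP")): 1.0,
    ("NP", ("Det", "N")): 0.6,
    ("NP", ("the dog",)): 0.4,
    ("VP", ("V", "NP")): 1.0,
    ("Det", ("the",)): 1.0,
    ("N", ("dog",)): 0.5,
    ("N", ("cat",)): 0.5,
    ("V", ("chased",)): 1.0,
}

def tree_prob(tree):
    """Probability of a parse tree = product of the probabilities of its rules.

    A tree is (label, child, child, ...); a leaf child is a plain string.
    """
    label, *children = tree
    rhs = tuple(c if isinstance(c, str) else c[0] for c in children)
    p = RULE_PROB[(label, rhs)]
    return p * prod(tree_prob(c) for c in children if not isinstance(c, str))

object_np = ("NP", ("Det", "the"), ("N", "cat"))
# Parse A: subject built with NP -> Det N
parse_a = ("S", ("NP", ("Det", "the"), ("N", "dog")), ("VP", ("V", "chased"), object_np))
# Parse B: subject built with NP -> "the dog"
parse_b = ("S", ("NP", "the dog"), ("VP", ("V", "chased"), object_np))

print(tree_prob(parse_a))   # 0.6 * 0.5 * 0.6 * 0.5, about 0.09
print(tree_prob(parse_b))   # 0.4 * 0.6 * 0.5, about 0.12
```

With these numbers, the parse that uses NP → "the dog" scores higher (about 0.12 versus about 0.09), so it is the one a most-probable-parse criterion would select.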

Uses of PCFG
• Ambiguity resolution: If a sentence has multiple parses, PCFG chooses the most
probable one.
• Data-driven: Rule probabilities can be estimated from a treebank (corpus of parsed
sentences).
• Foundation of probabilistic parsing: CKY, Earley, and other DP parsers can be
extended to PCFGs.

Limitations of PCFGs
• Assume that rules are chosen independently of one another (not always true in real language).
• Struggle with long-distance dependencies (e.g., subject–verb agreement).
• For these reasons, PCFGs are often extended with lexicalization (probabilities conditioned on words).

Probabilistic CYK Algorithm :

The CYK algorithm (also written CKY) is a classic dynamic programming parser for
context-free grammars in Chomsky Normal Form (CNF).
The probabilistic version (for PCFGs) extends CYK by storing not just whether a substring
can be derived from a nonterminal, but also the probability of the best derivation.

Algorithm Outline
Input:
• A sentence of length n.
• A PCFG in CNF (rules of the form A → B C or A → a).
Output:
• A parse table storing the maximum probability of each nonterminal spanning
substring [i, j].
• The most probable parse tree (if desired).

Example
Grammar (PCFG in CNF):
S → NP VP [1.0]
NP → Det N [0.6]
NP → "dogs" [0.4]
VP → V NP [1.0]
Det → "the" [1.0]
N → "dogs" [0.5]
N → "bones" [0.5]
V → "chase" [1.0]
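
A minimal sketch of the table-filling procedure for this grammar follows. The notes do not give an input sentence for the example, so "dogs chase the bones" is an assumed input; the names `pcky` and `build_tree` and the half-open span convention (words[i:j]) are likewise choices made for illustration.

```python
from collections import defaultdict

# Grammar copied from the example above.
BINARY = [            # (A, B, C, prob) for A -> B C
    ("S", "NP", "VP", 1.0),
    ("NP", "Det", "N", 0.6),
    ("VP", "V", "NP", 1.0),
]
LEXICAL = [           # (A, word, prob) for A -> word
    ("NP", "dogs", 0.4),
    ("Det", "the", 1.0),
    ("N", "dogs", 0.5),
    ("N", "bones", 0.5),
    ("V", "chase", 1.0),
]

def pcky(words):
    """Fill best[(i, j)][A] with the max probability of A deriving words[i:j]."""
    n = len(words)
    best = defaultdict(dict)
    back = {}                                   # backpointers for tree recovery
    for i, w in enumerate(words):               # spans of length 1
        for a, word, p in LEXICAL:
            if word == w and p > best[(i, i + 1)].get(a, 0.0):
                best[(i, i + 1)][a] = p
                back[(i, i + 1, a)] = w
    for length in range(2, n + 1):              # longer spans, shortest first
        for i in range(n - length + 1):
            j = i + length
            for k in range(i + 1, j):           # split point
                for a, b, c, p in BINARY:
                    pb = best[(i, k)].get(b)
                    pc = best[(k, j)].get(c)
                    if pb is None or pc is None:
                        continue
                    cand = p * pb * pc
                    if cand > best[(i, j)].get(a, 0.0):
                        best[(i, j)][a] = cand
                        back[(i, j, a)] = (k, b, c)
    return best, back

def build_tree(back, i, j, a):
    """Follow backpointers to recover the best tree for A over words[i:j]."""
    entry = back[(i, j, a)]
    if isinstance(entry, str):
        return (a, entry)
    k, b, c = entry
    return (a, build_tree(back, i, k, b), build_tree(back, k, j, c))

words = "dogs chase the bones".split()          # assumed example sentence
best, back = pcky(words)
print(best[(0, len(words))]["S"])               # 1.0 * 0.4 * (1.0 * 1.0 * 0.3), about 0.12
print(build_tree(back, 0, len(words), "S"))
```

Each cell keeps only the best probability per nonterminal, which is what lets the table stay polynomial in size even when the number of parses grows quickly.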

Probabilistic Lexicalized CFGs :
Probabilistic Lexicalized CFGs are an extension of PCFGs where every phrase is annotated
with a head word, and rule probabilities are conditioned on these heads. This makes the
grammar more sensitive to actual word choices, improving disambiguation and parsing
accuracy.
A lexicalized context-free grammar is a CFG where each phrasal category (like NP, VP,
PP, …) is associated with a head word (the word that determines the phrase’s syntactic and
semantic properties).
Example:
• NP → the big dog (head = "dog")
• VP → chased the cat (head = "chased")

Why Lexicalization?
Standard PCFGs assume rules are chosen independently of the actual words.
• Example: A PCFG might say PP → P NP has probability 1.0, no matter what the
preposition is.
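• But in real language, the likely attachment of the PP depends on the words:
o "eat pizza with fork" (the PP modifies the verb: an instrument)
o "eat pizza with anchovies" (the PP modifies the noun: a topping)
o "eat pizza with telescope" (less likely under either reading)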

Lexicalization lets probabilities depend on specific words, capturing stronger
syntactic/semantic preferences.

Probabilistic Lexicalized CFG (PLCFG)

A PLCFG is a CFG in which:
1. Every constituent has a head word.
2. Rules expand head-annotated categories.
3. Rule probabilities are conditioned on the head word.

Example
Sentence: "the dog chased the cat"
CFG (lexicalized version):
S(chased) → NP(dog) VP(chased)
VP(chased) → V(chased) NP(cat)
NP(dog) → Det(the) N(dog)
NP(cat) → Det(the) N(cat)
Probabilities:
• P(S(chased) → NP(dog) VP(chased)) = 1.0
• P(VP(chased) → V(chased) NP(cat)) = 0.9
• P(NP(dog) → Det(the) N(dog)) = 0.7
• P(NP(cat) → Det(the) N(cat)) = 0.6
Now probabilities depend not just on the category (NP, VP) but also on the lexical head
("dog", "chased", "cat").
This allows more nuanced probabilities, e.g.:
• VP(eat) → V(eat) NP(pizza) might be very probable.
• VP(eat) → V(eat) NP(idea) might be very improbable.
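
As a small illustration of what "conditioned on the head word" means in practice, the sketch below looks up each rule by its category and head together and multiplies the four probabilities given for "the dog chased the cat". The key layout and the name `derivation_prob` are assumptions; the probabilities of the word-level rules (such as Det(the) → "the") are not given in the notes and are left out.

```python
from math import prod

# P(rule | parent category, head word); values copied from the example above.
# Keys are (category, head, right-hand side).
LEX_RULE_PROB = {
    ("S", "chased", ("NP(dog)", "VP(chased)")): 1.0,
    ("VP", "chased", ("V(chased)", "NP(cat)")): 0.9,
    ("NP", "dog", ("Det(the)", "N(dog)")): 0.7,
    ("NP", "cat", ("Det(the)", "N(cat)")): 0.6,
}

def derivation_prob(rules):
    """Multiply the head-conditioned probabilities of the rules in a derivation."""
    return prod(LEX_RULE_PROB[rule] for rule in rules)

derivation = list(LEX_RULE_PROB)      # this parse uses each listed rule exactly once
score = derivation_prob(derivation)   # 1.0 * 0.9 * 0.7 * 0.6, about 0.378
print(score)
```

Note that the two NP rules have the same shape (NP → Det N) but different probabilities, because the lookup key includes the head word.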

Unification of feature structures :

Unification of feature structures is the process of combining two sets of linguistic constraints
into one consistent set. If constraints clash, unification fails. This mechanism lets grammar
frameworks enforce agreement and other linguistic conditions in a principled way.

What are Feature Structures?

A feature structure is a set of attribute–value pairs used to represent linguistic information
(syntax, semantics, morphology, agreement, etc.).
Example (NP “the dogs”):
[ CAT: NP
  NUM: plural
  PERS: 3rd ]
Here:
• CAT = syntactic category
• NUM = number (singular/plural)
• PERS = person

What is Unification?
Unification is the operation of merging two feature structures into one, provided they are
compatible.
• If both have the same feature with the same value → keep it.
• If one has a feature the other lacks → include it.
• If both specify a feature with different values → conflict → unification fails.

Think of it as “combining constraints”: the result must satisfy both.

Example 1: Successful Unification

FS1:
[ CAT: NP
  NUM: plural ]
FS2:
[ PERS: 3rd
  NUM: plural ]
Unification result:
[ CAT: NP
  NUM: plural
  PERS: 3rd ]

Works, since both agree on NUM = plural.

Example 2: Failed Unification

FS1:
[ CAT: NP
  NUM: singular ]
FS2:
[ NUM: plural
  PERS: 3rd ]

Conflict: NUM = singular vs NUM = plural.

Unification fails.
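
The three rules above translate almost directly into code. The following is a minimal sketch that represents feature structures as Python dictionaries and returns None when unification fails; that representation and the name `unify` are assumptions, and the sketch does not model shared (reentrant) values, which full unification grammars also support.

```python
def unify(fs1, fs2):
    """Unify two feature structures given as dicts; return None on a clash."""
    result = dict(fs1)
    for feature, value in fs2.items():
        if feature not in result:
            result[feature] = value              # only one side has it: include it
        elif isinstance(result[feature], dict) and isinstance(value, dict):
            sub = unify(result[feature], value)  # nested structures: unify recursively
            if sub is None:
                return None
            result[feature] = sub
        elif result[feature] != value:
            return None                          # same feature, different values: fail
    return result

# Example 1: compatible structures
print(unify({"CAT": "NP", "NUM": "plural"}, {"PERS": "3rd", "NUM": "plural"}))

# Example 2: NUM clashes, so the result is None
print(unify({"CAT": "NP", "NUM": "singular"}, {"NUM": "plural", "PERS": "3rd"}))
```

Neither input is modified, so the same structures can be reused in further unification attempts.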

Why is Unification Useful?

In grammars, rules often carry constraints. Unification enforces these:
• Subject–verb agreement:
o Subject NP: [NUM: plural]
o Verb: [NUM: plural]
o Unification succeeds → the sentence is accepted as grammatical.
o If the values differ (e.g., plural subject, singular verb) → unification fails → the sentence is rejected.
• Case, gender, tense, semantic roles can all be enforced through unification.

Applications
• Unification grammars (HPSG, LFG, PATR-II)
• Feature-based parsing in NLP
• Constraint solving (agreement, selection restrictions)
• Computational morphology (inflection constraints)

Common questions

Why is lexicalization considered an important extension in probabilistic context-free grammars, particularly for parsing?

Lexicalization is important because it conditions grammar rules on specific head words rather than just categories like NP or VP. This adjustment makes the grammar more sensitive to actual word choices, capturing stronger syntactic and semantic preferences, and thus significantly improving disambiguation and parsing accuracy. Lexicalized grammars are especially effective in capturing language patterns that are not easily modeled with standard PCFGs, because those patterns depend on specific lexical context.

What are the limitations of Probabilistic Context-Free Grammars (PCFGs), and how do lexicalized CFGs address some of these issues?

PCFGs assume rule independence and struggle with long-distance dependencies, which can be critical in natural language. Lexicalized CFGs address these issues by associating each rule with head words, allowing probabilities to be conditioned on specific words, which captures stronger syntactic and semantic preferences. This enables better disambiguation and parsing accuracy, particularly in sentences with nuanced structures.

How does the concept of shallow parsing support probabilistic parsing through statistical models like HMMs or CRFs?

Shallow parsing supports probabilistic parsing by chunking sentences into meaningful parts using statistical models like Hidden Markov Models (HMMs) or Conditional Random Fields (CRFs). These models can learn from annotated corpora to predict the most likely structure of a sentence, thus providing a statistical basis for assigning probabilities to different parses, which enhances accuracy and efficiency in syntactic analysis.

Explain how feature-based parsing in NLP utilizes the unification of feature structures to solve agreement and selection restrictions.

Feature-based parsing in NLP utilizes the unification of feature structures to solve agreement and selection restrictions by representing linguistic information as sets of attributes and values. During parsing, these feature structures are unified to ensure that all constraints (such as agreement in number, gender, and case, and specific selection restrictions) are consistently applied. If any constraints clash, unification fails, flagging an inconsistency in the parse tree, which ensures that only grammatically correct sentences are accepted.

What are the key components of a probabilistic context-free grammar (PCFG) and how do they contribute to its probabilistic nature?

A PCFG consists of nonterminals, terminals, production rules, a start symbol, and a probability distribution over the rules. These components contribute to its probabilistic nature by assigning probabilities to production rules, allowing the grammar to not only determine whether a sentence is grammatical but also to assign probabilities to different possible parses. This stochastic nature aids in resolving ambiguities by selecting the most probable parse tree based on these probabilities.

In what ways do dynamic programming parsers handle ambiguity in natural language processing?

Dynamic programming parsers handle ambiguity by storing multiple possible parses for the same spans in a table. This means they can efficiently manage cases where substrings of a sentence can be parsed in different ways, allowing for probabilistic parsing where the parser later determines the most likely interpretation by combining the stored parses into complete parse trees.

How does dynamic programming parsing improve efficiency compared to naïve recursive parsing methods?

Dynamic programming parsing improves efficiency by storing intermediate results in a table and reusing them, instead of recomputing the parse of every substring multiple times like naïve recursive parsing. This avoids redundant computations and makes parsing polynomial-time, which is efficient compared to the potential exponential time complexity of naïve recursive methods.

What role do feature structures and unification play in enforcing linguistic constraints within unification grammars?

Feature structures in unification grammars represent linguistic information as attribute–value pairs. Unification is the process of merging two feature structures, ensuring that all constraints are satisfied. It plays a critical role in enforcing linguistic constraints such as subject-verb agreement and case, by ensuring consistency across feature structures, and rejecting parse trees where constraints clash.

In what ways does shallow parsing differ from deep parsing, and what are the implications of these differences for syntactic analysis?

Shallow parsing differs from deep parsing as it identifies the main constituents of a sentence without constructing a full hierarchical parse tree. It groups words into chunks (e.g., noun phrases, verb phrases) rather than analyzing their full syntactic relationships. This simplicity means shallow parsing is faster and less resource-intensive but may miss more complex syntactic details that deep parsing captures, such as nested structures and detailed syntactic roles.

How does probabilistic parsing using the CYK algorithm differ from standard CYK parsing, and what advantages does this offer?

Probabilistic parsing using the CYK algorithm extends standard CYK parsing by storing not only whether a substring can be derived from a nonterminal, but also the probability of the best derivation for each substring. This allows the parser to choose the most probable parse tree, effectively handling ambiguity in parse structures and enhancing parsing accuracy based on probabilistic models.
