0% found this document useful (0 votes)

71 views13 pages

Semantic Parsing in NLP: PAS Overview

Unit IV of the document focuses on Semantic Parsing II, specifically on Predicate-Argument Structure (PAS) and Meaning Representation Systems in Natural Language Processing (NLP). It discusses the significance of PAS in identifying semantic roles and its applications in Information Extraction, along with resources like FrameNet and PropBank that aid in training machine learning models for Semantic Role Labeling (SRL). The document also outlines various systems and software used for semantic role labeling, emphasizing the transition from rule-based to data-driven approaches in NLP.

Uploaded by

Kollu Spoorthy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

71 views13 pages

Semantic Parsing in NLP: PAS Overview

Uploaded by

Kollu Spoorthy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

SBIT –AUTONOMOUS NLP

NATURAL LANGUAGE PROCESSING (NLP)

UNIT IV

SEMANTIC PARSING II

UNIT - IV Semantic Parsing II: 1. Predicate-Argument Structure,

2. Meaning Representation 1Systems

1. PREDICATE-ARGUMENT STRUCTURE
Topics :
1. Resources

2. Systems

3. Software

Q1: Explain Predicate-Argument Structure. (Resources, Systems, Software)

OR Q: Write short notes on Predicate-Argument Structure found in NLP.

Q2: What are the Syntactic and Semantic Constraints on Predicate-Argument Structure?

Introduction

Predicate-Argument Structure (PAS), also known as Semantic Role Labeling (SRL),

is the process of identifying the semantic roles of various arguments associated with a
predicate (such as a verb, noun, or adjective) in a sentence. The goal is to determine
"who did what to whom, when, where, and how." For a given predicate, the system
identifies all constituents in the sentence that act as arguments and assigns them
specific semantic labels.

The knowledge of PAS has significant real-world applications, particularly in the area
of Information Extraction. By understanding the semantic relationships in a text,
systems can extract structured information from unstructured sources, enabling more
advanced question answering, text summarization, and machine translation.

RESOURCES

In Natural Language Processing, the analysis of Predicate-Argument Structure has

transitioned from rule-based methods to data-oriented approaches. This shift was
enabled by the development of large, semantically annotated corpora. These resources

III CSE(AI/ML) 1 Mrs. N Savitha [Link].,(Ph.D)

SBIT –AUTONOMOUS NLP

provide the foundation for training and evaluating machine learning models for Semantic
Role Labeling (SRL). The two most influential resources are FrameNet and PropBank.

1. FrameNet

FrameNet is a resource based on the theory of frame semantics. This theory suggests
that the meaning of a word (a predicate) is understood by evoking a conceptual structure
or scenario, known as a semantic frame.

 Core Concepts:
o Semantic Frame: Represents a specific situation or event. For example,
the AWARENESS frame describes a scenario where a conscious entity
(Cognizer) has a certain piece of information (Content).

o Frame Elements (FEs): These are the semantic roles specific to a frame.
They are the participants and props in the scenario described by the frame
(e.g., Cognizer, Content, Topic.

o Lexical Unit (LU): This is the pairing of a predicate (a word) with the specific
frame it evokes. A single word can be part of multiple LUs if it has multiple
meanings (is polysemous). For example, the verb break can evoke:

 The COMPLIANCE frame when it means "to fail to observe an agreement."

 The CAUSE_TO_FRAGMENT frame when it means "to cause to separate

into pieces."

 Annotation Process:
1. A semantic frame is defined.
2. A set of frame-specific roles (Frame Elements) is created for that frame.
3. Predicates that can evoke this frame are identified.
4. Sentences containing these predicates are annotated by identifying the
arguments and labeling them with the corresponding Frame Elements.

 Example:The AWARENESS Frame

the below diagram illustrates the AWARENESS frame, its associated Frame
Elements, and a sample of predicates (verbs and nouns) that can evoke it.

III CSE(AI/ML) 2 Mrs. N Savitha [Link].,(Ph.D)

SBIT –AUTONOMOUS NLP

Diagram : FrameNet example

The following sentences show this frame in action:

1. Cognizer We] [Predicate:verb believe] [Content it is a fair and
generous price]

2. No doubts existed as to [Cognizer our] [Predicate:noun comprehension]

[Content of it]

FrameNet contains annotations for a wide variety of predicates, including verbs, nouns,
adjectives, and prepositions, using sentences from the British National Corpus (BNC).

2. PropBank (Proposition Bank)

PropBank offers a different, more "linguistically neutral" approach. It is built upon the
syntactic structures of the Penn Treebank and primarily focuses on annotating the
arguments of verbs.

 Core Concepts:
o PropBank restricts argument boundaries to the exact syntactic constituents found
in the Penn Treebank parse trees.

o It defines two types of arguments: core and adjunctive.

III CSE(AI/ML) 3 Mrs. N Savitha [Link].,(Ph.D)

SBIT –AUTONOMOUS NLP

 Argument Types:
 Core Arguments (ARGN): These are arguments whose semantic role is
dependent on the specific predicate. They are labeled numerically
from ARG0 to ARG5. While there are general tendencies (e.g., ARG0 is often
the agent, ARG1 the patient), their precise meaning is defined for each
predicate in a corresponding frames file.

 Adjunctive Arguments (ARGM-X): These are modifier arguments whose

meaning is consistent across all predicates. They represent general notions
like time (ARGM-TMP), location (ARGM-LOC), manner (ARGM-MNR), etc.

 The below Table shows how the meaning of core arguments changes with the
predicate. For operate.01, ARG1 is the "Thing operated," while
for author.01, ARG1 is the "Text authored."

Table : Argument labels for operate.01 and author.01

III CSE(AI/ML) 4 Mrs. N Savitha [Link].,(Ph.D)

SBIT –AUTONOMOUS NLP

The below Table lists some common adjunctive arguments, which maintain their
meaning regardless of the predicate.

Table : List of adjunctive arguments in PropBank—ARGMS

The below example extracted from the ProbBank corpus along with its syntax tree
representation and argument labels shown in the below diagram:

Other Resources

The methodologies of FrameNet and PropBank have inspired the creation of other
resources and have been adapted for numerous languages.

 NomBank: Inspired by PropBank, NomBank focuses on identifying and tagging

the arguments of nominal predicates (nouns).

III CSE(AI/ML) 5 Mrs. N Savitha [Link].,(Ph.D)

SBIT –AUTONOMOUS NLP

 VerbNet: This resource provides a richer representation by linking PropBank

frames with predicate-independent thematic roles and Levin verb classes.
 Cross-lingual Adaptation:
o FrameNet has been adapted for languages like Japanese, Spanish, and
Swedish, as its frame-semantic basis is largely independent of a specific
language's grammar.
o PropBank has inspired similar corpora in Chinese, Arabic, Korean,
Spanish, and Hindi. Unlike FrameNet, creating a new PropBank requires
a new set of frame files for each language.

Thus, the success of these projects has inspired others, such as NomBank for noun
predicates and VerbNet, which links PropBank frames to more general thematic roles.
These philosophies have also been extended to many other languages, including
Chinese, Japanese, Arabic, and Spanish, demonstrating the cross-lingual applicability of
predicate-argument analysis.

SYSTEMS
Introduction to Semantic Role Labeling (SRL) Systems

The introduction of large, semantically annotated corpora like FrameNet and PropBank
in the late 1990s marked a major shift in approaching predicate-argument structure
recognition. Research moved from traditional, rule-based heuristic systems to more
robust, data-driven machine learning frameworks. This modern approach, popularly
known as Semantic Role Labeling (SRL), treats the problem as a supervised
classification task.

The seminal work by Gildea and Jurafsky was the first to formalize SRL in this manner.
They proposed that the arguments of a predicate could be identified and labeled by
mapping them to nodes in a sentence's syntactic parse tree. This formulation led to the
establishment of three standard evaluation tasks that have become central to the field:

1. Argument Identification: The task of identifying which constituents (nodes) in a

parse tree represent a valid semantic argument for a given predicate. This is a binary
classification task for each node (argument vs. not an argument).

2. Argument Classification: Given the correct (gold-standard) argument constituents,

the task is to assign the correct semantic role label (e.g., ARG0, ARG1, ARGM-LOC)
to each one.

3. Argument Identification and Classification: A combination of the first two tasks,

where the system must both identify the argument boundaries and assign the correct
label. This is the complete and most challenging SRL task.

III CSE(AI/ML) 6 Mrs. N Savitha [Link].,(Ph.D)

SBIT –AUTONOMOUS NLP

The pseudocode for a generic Semantic Role Labeling(SRL) algorithm is shown in the
below Algorithm:

The Semantic Role Lableing (SRL) Algorithm:

Procedure: SRL(Sentence) returns best Semantic Role Labeling

Input: Syntactic Parse of the Sentence

1. generate a full syntatuc parse of the sentence.

2. identify all the predicates

3. for all predicate sentence do

4. extract a set of features for each node in the tree relative to the predicate.

5. classify each feature vector using the model created in training.

6. select the class of highest scoring classifier

7. return best semantic role labelling

8. end for

for each predicate in a sentence:

1. Consider every node in the syntactic parse tree as a potential argument.
2. Extract a feature vector for each node relative to the predicate.
3. Use a trained classification model to predict a label for each node
(including a "NULL" label for non-arguments).
4. Select the best-scoring label for each node to produce the final semantic
role annotation.
Syntactic Representations
The following are various types of sentence representations:
1. Phrase Structure Grammar (PSG)

Since PropBank annotations are layered directly onto the phrase structure trees of the
Penn Treebank, using a Phrase Structure Grammar (PSG) parse is the most natural
and common approach. Early and influential systems extracted a rich set of features
from these trees.

III CSE(AI/ML) 7 Mrs. N Savitha [Link].,(Ph.D)

SBIT –AUTONOMOUS NLP

Key Features from PSG:

 Path: The syntactic path from the argument constituent to the predicate through
the parse tree. It is a highly informative feature, represented as a string of node
labels with arrows indicating upward (↑) or downward (↓) traversal
(e.g., NP↑S↓VP↓VBZ).
 Predicate: The lemma of the predicate verb itself is a crucial feature, as
argument structures are predicate-dependent.
 Phrase Type: The syntactic category of the constituent being classified (e.g.,
NP, PP, SBAR).
 Position: A binary feature indicating whether the constituent appears before or
after the predicate.
 Voice: A binary feature indicating whether the predicate is in the active or
passive voice, which is critical as it often affects the position of the Agent and
Patient roles.
 Head Word: The syntactic head word of the constituent. For example, in the
phrase "the big red car," the head word is "car."
 Subcategorization: The phrase structure rule that expands the predicate's
parent node (e.g., VP → VBZ NP PP). This captures the local syntactic frame of
the predicate.
 Verb Clustering: To handle data sparsity and unseen verbs, predicates are
grouped into semantic classes based on their co-occurrence with direct objects.
This allows the model to generalize across verbs with similar meanings
(e.g., eat, devour, savor).
 Named Entities (NE): Identifying if a constituent contains a named entity (e.g.,
PERSON, LOCATION, TIME). This is particularly useful for identifying adjunctive
arguments like ARGM-LOC and ARGM-TMP.
 Verb Sense Information: The specific frameset ID of a predicate in PropBank
(e.g., talk.01 vs. talk.02). Disambiguating the verb sense is critical because
different senses have different argument structures.
 Path Generalizations: Due to the data-sparse nature of the full path feature,
various generalization techniques are used, such as replacing non-clause nodes
with a wildcard, decomposing the path into n-grams, or using only the partial path
to the lowest common ancestor.

2. Combinatory Categorial Grammar (CCG)

While PSG paths are informative, they can be long and sparse, making them difficult to
generalize. A Combinatory Categorial Grammar (CCG) offers a lexicalized
representation that can produce shorter, more direct dependency paths between a
predicate and its arguments. Features from CCG are often used to augment a primary
PSG-based system.

III CSE(AI/ML) 8 Mrs. N Savitha [Link].,(Ph.D)

SBIT –AUTONOMOUS NLP

Key Features from CCG:

 Phrase Type: In CCG, this is the category of the maximal projection between the
predicate and the dependent word.
 Categorial Path: A concise path feature formed by concatenating three values:
(i) the category of the dependent word, (ii) the direction of dependency, and
(iii) the argument slot filled by the dependent. For example, the path
between denied and plans might be (S[dcl]\NP)/NP.2.
 Tree Path: The CCG analogue of the PSG path feature, tracing the path
between the predicate and the argument through the binary CCG parse tree.

3. Tree-Adjoining Grammar (TAG)

A Tree-Adjoining Grammar (TAG) is another formalism used in SRL, primarily for its
ability to effectively model long-distance dependencies in text. Systems using TAG
extract features from its unique structural representations.

Key Features from TAG:

 Supertag Path: A path feature derived from a TAG structure, analogous to the
PSG path.
 Supertag: The elementary tree frame associated with either the predicate or the
argument, which provides rich lexical and structural information.
 Surface and Deep Syntactic Roles: TAG analysis can provide both a surface
role and a deep syntactic role for an argument (e.g., subject, direct object), which
helps normalize across syntactic variations like passivization.
 Surface and Deep Subcategorization: These features capture the argument
frame of a predicate at both the surface level and a more abstract, deep-structure
level (e.g., NP0_NP1 for a transitive verb).

 Semantic Subcategorization: An extension of the subcategorization frame that

includes semantic role information, providing a tighter link between syntax and
semantics.

SOFTWARE
Following is a list of software packages available for semantic role labeling.

 ASSERT (Automatic Statistical Semantic Role Tagger)

[http:://www/[Link]/[Link]]

III CSE(AI/ML) 9 Mrs. N Savitha [Link].,(Ph.D)

SBIT –AUTONOMOUS NLP

A semantic role labeler trained on the English PropBank data.

 C-ASSERT [[Link]
An extension of ASSERT for the Chinese Language.

 SwiRL [[Link]
Another semantic role labeler trained on PropBank data.

 Shalmaneser ( A Shallow Semantic Parser)

[[Link]
A toolchain for shallow semantic parsing based on the FrameNet data.

*****

2. MEANING REPRESENTATION
Topics :
1. Resources

2. Systems

3. Software

Write short notes on Meaning Representation.

Introduction

Meaning Representation is a deep level of semantic interpretation in Natural Language

Processing whose primary objective is to transform a natural language input (like a
sentence or query) into a formal, unambiguous, and canonical representation that a
machine can directly act upon or execute. This process is often referred to as deep
semantic parsing.

An effective analogy is the relationship between high-level programming languages and

low-level machine code. While natural language is comprehensible to humans, it is
ambiguous and context-dependent. A meaning representation, much like machine code,
is structured, precise, and directly executable by a computer, making it comprehensible
to machines but often incomprehensible to humans. The core challenge is to bridge this
gap by developing techniques to interpret and encode the context and world knowledge
inherent in human language.

III CSE(AI/ML) 10 Mrs. N Savitha [Link].,(Ph.D)

SBIT –AUTONOMOUS NLP

Resources

A number of projects have created representations and resources that have promoted
experimentation in this area. These resources typically provide a corpus of natural
language inputs paired with their corresponding formal meaning representations, which
are essential for training and evaluating systems.

 ATIS (Air Travel Information System): Considered one of the first major efforts,
ATIS focused on transforming spoken user queries about flight information into a
representation that could be compiled into a SQL query to interact with a flight
database. The resource provides thousands of user utterances annotated with
intermediate hierarchical frame representations. The diagram below illustrates
this process:

Diagram : Sample user query and its frame representation in the ATIS
program

Generated code

┌────────────────────┐
│ FRAME │ SHOW:
│ Representation │ FLIGHTS:
│ │ TIME:
│ │ PART-OF-DAY: morning
│ │ ORIGIN:
│ │ CITY: Boston
│ │ DEST:
│ │ CITY: San Francisco
│ │ DATE:
│ │ DAY-OF-WEEK: Tuesday
└────────────────────┘
▲
|
| Semantic Parsing
|
┌────────────────────┐
│ Natural Language │ Please show me morning flights from
Boston to
│ Representation │ San Francisco on Tuesday
└────────────────────┘
The diagram shows how a natural language query is mapped to a structured frame. The
frame captures the key semantic roles (like ORIGIN, DEST, TIME) and fills them with
specific values (Boston, San Francisco, morning) extracted from the sentence.

III CSE(AI/ML) 11 Mrs. N Savitha [Link].,(Ph.D)

SBIT –AUTONOMOUS NLP

This unambiguous representation can then be used by a system to perform an action,

such as querying a database.
 Communicator: This program was a follow-on to ATIS that involved more
complex mixed-initiative dialogs, where both the user and the machine could
lead the conversation. The resource consists of thousands of collected dialogs
related to travel planning.
 GeoQuery: This resource provides a natural language interface to a geographic
database called Geobase. The corpus contains natural language questions
paired with their formal representations as Prolog queries.
 For example, "What are the major cities in Kansas?" is mapped to answer(C,
(major(C), city(C), loc(C, S), equal(S, stateid(kansas)))).
 RoboCup: CLang: In the domain of robotic soccer, this project uses a special
formal language called CLang to encode advice from a team coach. The
representation is expressed as if-then rules. For instance, "If the ball is in our
penalty area..." is mapped to ((bpos (penalty-area our)) ...).

Systems

Various systems have been developed to tackle the problem of mapping natural
language to a meaning representation. These systems can be broadly categorized into
rule-based and supervised approaches.

1. Rule-Based Systems

These systems use a handcrafted semantic grammar to parse meaning units directly
from the input. The philosophy is that underlying semantic information is less complex
than a full syntactic explanation. This approach is robust against speech recognition
errors and ungrammatical input.

 Example System: The Phoenix system used recursive transition networks

(RTNs) and a handcrafted grammar to extract a hierarchical frame structure for
the ATIS and Communicator projects.
[Link] Systems

These systems use statistical models trained on hand-annotated data. They learn a
mapping from natural language sentences to their formal meaning representations
automatically.

 Example Systems:
o CHILL (Constructive Heuristics Induction for Language Learning): Learns
to map sentences into Prolog programs.

III CSE(AI/ML) 12 Mrs. N Savitha [Link].,(Ph.D)

SBIT –AUTONOMOUS NLP

o SCISSOR (Semantic Composition that Integrates Syntax and

Semantics...): Uses a statistical parser to create a semantically augmented
parse tree (SAPT) to compositionally build the meaning representation.
o KRISP (Kernel-based Robust Interpretation for Semantic Parsing): Uses
string kernels and Support Vector Machines (SVMs).
o WASP (Word Alignment-based Semantic Parsing): Treats semantic parsing
as a machine translation problem, "translating" natural language into its
meaning representation.


Software

While older rule-based systems are not widely available, several software programs for
supervised semantic parsing systems are available for download and research, which
are shown below:

 WASP: [[Link]
 KRISPER: [[Link]
 CHILL: [[Link]

*****

IMPORTANT QUESTIONS:

1. Explain Predicate – Argument Structure.

2. Explain Meaning Representation.

III CSE(AI/ML) 13 Mrs. N Savitha [Link].,(Ph.D)

Common questions

The effectiveness of Semantic Role Labeling (SRL) systems in identifying predicate-argument structures lies in their ability to transition from heuristic-based to data-driven approaches. Initially, SRL relied on rule-based systems, which though effective in specific contexts, couldn't generalize broadly . The shift to machine learning frameworks, formalized by Gildea and Jurafsky, allowed SRL to treat argument identification and classification as supervised classification tasks . These systems now can automatically learn from large corpora, offering higher robustness and accuracy. However, challenges still exist in terms of data sparsity, language adaptation, and accurately capturing complex syntactic structures . Advances in evaluation tasks like Argument Classification and Identification have improved SRL capabilities, but effective generalization across unseen predicates and syntactic variations remains a significant challenge .

Creating cross-lingual semantic role labeling resources based on PropBank involves several challenges. Firstly, the inherent need for language-specific frame files for each new language adaptation means substantial initial groundwork for every target language, differing from FrameNet's more language-independent approach . PropBank's reliance on syntactic constituency also requires corresponding lexical and syntactic corpora in each language, complicating adaptation to syntactically diverse languages . The requirement to maintain linguistic neutrality while capturing language-specific nuances poses additional complexity. Furthermore, ensuring consistency and accuracy in argument annotation across languages with distinct syntactic and semantic structures is a challenge due to varied interpretations and syntactic constructions, increasing the resource and validation demands . These factors combine to make cross-linguistic expansion of PropBank a resource-intensive and technically demanding task .

Using a Combinatory Categorial Grammar (CCG) in Semantic Role Labeling offers several advantages, especially in handling language ambiguity and syntactic variability. CCG provides a more lexicalized representation that produces shorter, more direct dependency paths between predicates and arguments, assisting in better generalization across varied syntactic structures . Unlike Phrase Structure Grammar (PSG), CCG paths are concise, and its categorial path feature helps in capturing the direction and slot of dependencies, making it robust to certain complexities like long-distance dependencies . These features enhance SRL's ability to process complex sentences where traditional PSG paths might become too long and sparse to effectively generalize .

FrameNet and PropBank differ primarily in their approach to semantic role annotations. FrameNet uses a language-specific frame-semantic approach, identifying and labeling arguments as Frame Elements related to specific frames, which can apply to verbs, nouns, and adjectives . PropBank, on the other hand, uses a more 'linguistically neutral' approach by annotating the arguments of verbs within the syntactic structure of sentences as found in the Penn Treebank. It defines arguments as core and adjunctive, labeled numerically or with general types (e.g., ARGM-LOC for location). While FrameNet requires extensive language-specific annotations, PropBank generalizes across languages but necessitates creating a new set of frame files for each language adaptation .

NomBank and VerbNet represent arguments in different ways, aligned with their specific focuses in semantic role labeling. NomBank, inspired by PropBank, is centered around annotating the arguments of nominal predicates (nouns) and assigns labels similar to PropBank's structure . It extends the idea of argument annotation beyond verbs to nouns but doesn't provide a thematic role framework. In contrast, VerbNet links the argument structures of verbs with broader thematic roles and Levin verb classes, offering a richer and more generalized representation . This linkage allows VerbNet to capture higher-level semantic relationships across verbs sharing similar meanings or syntactical patterns, aiding in thematic role and sense disambiguation .

Tree-Adjoining Grammar (TAG) offers several key features beneficial for semantic role labeling by effectively handling long-distance dependencies and capturing syntax-semantics interface. The 'Supertag Path' feature in TAG is similar to the PSG path but leverages TAG's ability to encapsulate complex syntactic information concisely . 'Supertag' provides rich lexical and structural insights into each elementary tree, offering depth in semantic parsing . Additionally, the concept of 'Surface and Deep Syntactic Roles' from TAG helps normalize across syntactic variations by differentiating between evident surface roles and more abstract deep syntactic roles . TAG also supports 'Surface and Deep Subcategorization' frames, distinguishing based on immediate and underlying grammatical structures . These features collectively aid SRL in managing syntactic variability effectively, enriching the semantic layer with nuanced syntactic understanding .

Phrase Structure Grammars (PSG) play a foundational role in semantic role labeling systems as they provide a natural framework for overlaying semantic annotations due to their alignment with the syntactic representation of sentences in the Penn Treebank . Key features derived from PSG, such as the path from an argument to a predicate, phrase type, and voice, are critical for feature extraction in SRL . PSG paths offer detailed descriptions of syntactic relations, benefiting from already extensive, informative lexical and syntactic data. However, issues arise when paths become long and sparse, challenging generalization across diverse syntactic variations . Despite this, PSG remains integral in many SRL systems due to its alignment with existing large corpora like the Penn Treebank .

Meaning representation offers significant benefits to Natural Language Processing (NLP) systems by transforming ambiguous and context-dependent natural language inputs into formal, unambiguous, and canonical representations that machines can execute directly . This transformation enables systems to effectively interpret and act upon human language. By creating structured representations akin to machine code, meaning representation facilitates the precise execution of tasks such as querying databases or executing commands, improving accuracy in goal-oriented tasks . Moreover, it bridges the gap between human-centric natural language and machine-centric execution frameworks, enhancing the machine's ability to understand the nuanced context and world knowledge inherent in human communication .

Semantic role labeling systems face several challenges in generalizing across unseen verbs due to data sparsity and variability in verb usage. Key challenges include the limited availability of annotated examples for less common verbs, which hampers the ability of models to learn comprehensive patterns for these verbs . Systems often rely on observed patterns and verb roles, which can result in inaccuracies when encountering verbs with unique or rare argument structures not previously encountered in training data . Verb clustering attempts to mitigate this by grouping similar verbs to facilitate generalization based on semantic classes, but this can still fall short with truly novel verbs . Additionally, the need for disambiguation of verb senses adds to the complexity, as different senses of a verb can entail different argument structures, necessitating more sophisticated feature extraction and reasoning capabilities .

Rule-based and supervised systems take different approaches to mapping natural language to meaning representations. Rule-based systems rely on handcrafted semantic grammars to parse meaning directly from input, leveraging human-designed rules and structures to capture semantic information . This approach can be robust against inputs with recognition errors or non-standard syntax since it operates on established patterns of language. However, it is inflexible and limited by the predefined rules . Supervised systems, in contrast, use statistical models trained on annotated datasets to learn mappings automatically . These systems, such as CHILL and SCISSOR, adapt through exposure to vast data, covering a wider range of linguistic variability and providing better generalization across different language uses . Although more flexible, supervised systems rely heavily on the quality and quantity of training data and require significant computational resources for model training .

Understanding Predicate-Argument Structure
No ratings yet
Understanding Predicate-Argument Structure
20 pages
Understanding Predicate-Argument Structure
No ratings yet
Understanding Predicate-Argument Structure
26 pages
NLP Unit 4: Predicate-Argument Structure
100% (1)
NLP Unit 4: Predicate-Argument Structure
8 pages
R22 JNTUH CSE Syllabus Overview
No ratings yet
R22 JNTUH CSE Syllabus Overview
3 pages
NLP Unit 3
No ratings yet
NLP Unit 3
20 pages
Natural Language Semantics in AI Knowledge
No ratings yet
Natural Language Semantics in AI Knowledge
4 pages
R22 JNTUH CSE Language Model Syllabus
100% (2)
R22 JNTUH CSE Language Model Syllabus
16 pages
Morphological Analysis in NLP
No ratings yet
Morphological Analysis in NLP
24 pages
Ambiguity Resolution Models in NLP Parsing
No ratings yet
Ambiguity Resolution Models in NLP Parsing
7 pages
Understanding Semantic Parsing Techniques
No ratings yet
Understanding Semantic Parsing Techniques
19 pages
Syntactic Parsing in Natural Language Processing
No ratings yet
Syntactic Parsing in Natural Language Processing
42 pages
Evaluating Language Models in NLP
No ratings yet
Evaluating Language Models in NLP
21 pages
Language Modeling Overview for Students
No ratings yet
Language Modeling Overview for Students
72 pages
Object Segmentation in Data Analytics
No ratings yet
Object Segmentation in Data Analytics
23 pages
Understanding Language Modeling Techniques
No ratings yet
Understanding Language Modeling Techniques
15 pages
Sentence and Topic Boundary Detection
No ratings yet
Sentence and Topic Boundary Detection
17 pages
Understanding Processes and Events
No ratings yet
Understanding Processes and Events
49 pages
Fiot Unit-4
No ratings yet
Fiot Unit-4
10 pages
KRR 5th Unit Part-1
No ratings yet
KRR 5th Unit Part-1
14 pages
Ambiguity Resolution Models in Parsing
100% (1)
Ambiguity Resolution Models in Parsing
7 pages
Transaction Flow Testing in Software
No ratings yet
Transaction Flow Testing in Software
15 pages
Bayesian Estimation in NLP Models
No ratings yet
Bayesian Estimation in NLP Models
2 pages
STM UNIT 3 Part 1
No ratings yet
STM UNIT 3 Part 1
30 pages
Meaning Representation in NLP Explained
No ratings yet
Meaning Representation in NLP Explained
27 pages
NLP Challenges: Irregularity, Ambiguity, Productivity
No ratings yet
NLP Challenges: Irregularity, Ambiguity, Productivity
9 pages
Knowledge Representation and Reasoning Notes
No ratings yet
Knowledge Representation and Reasoning Notes
28 pages
Understanding Word Structure in NLP
No ratings yet
Understanding Word Structure in NLP
12 pages
Language Modeling in NLP Explained
No ratings yet
Language Modeling in NLP Explained
28 pages
Multilingual vs Cross-Lingual NLP
No ratings yet
Multilingual vs Cross-Lingual NLP
2 pages
Syntax of Context in KRR
No ratings yet
Syntax of Context in KRR
67 pages
AI Processes and Procedures in KRR
No ratings yet
AI Processes and Procedures in KRR
25 pages
Knowledge Representation in AI
No ratings yet
Knowledge Representation in AI
16 pages
Data Analytics Unit 1 Notes
100% (1)
Data Analytics Unit 1 Notes
23 pages
Sensor Cloud in Cloud Computing
No ratings yet
Sensor Cloud in Cloud Computing
30 pages
Understanding Word Structure in NLP
No ratings yet
Understanding Word Structure in NLP
162 pages
Understanding Software Testing Basics
No ratings yet
Understanding Software Testing Basics
15 pages
Logic's Role in Knowledge Representation
No ratings yet
Logic's Role in Knowledge Representation
31 pages
Ambiguity Resolution in Parsing Models
No ratings yet
Ambiguity Resolution in Parsing Models
15 pages
NLP Syntax Analysis and Parsing Techniques
No ratings yet
NLP Syntax Analysis and Parsing Techniques
15 pages
Overview of Semantic Parsing Techniques
No ratings yet
Overview of Semantic Parsing Techniques
11 pages
Types of Meaning Representation Systems
No ratings yet
Types of Meaning Representation Systems
3 pages
Viterbi Decoding for POS Tagging in Python
No ratings yet
Viterbi Decoding for POS Tagging in Python
26 pages
Meaning Representation in NLP Systems
No ratings yet
Meaning Representation in NLP Systems
9 pages
Language Model Adaptation Techniques
100% (1)
Language Model Adaptation Techniques
10 pages
Characteristics of Sensing in FIOT
No ratings yet
Characteristics of Sensing in FIOT
18 pages
Data Handling and Analytics
No ratings yet
Data Handling and Analytics
6 pages
Lexemes in Natural Language Processing
No ratings yet
Lexemes in Natural Language Processing
101 pages
Understanding Semantic Interpretation in NLP
No ratings yet
Understanding Semantic Interpretation in NLP
17 pages
Good vs Bad State Graphs in Testing
No ratings yet
Good vs Bad State Graphs in Testing
31 pages
Document Structure in NLP Explained
No ratings yet
Document Structure in NLP Explained
16 pages
Knowledge Engineering in KRR Explained
No ratings yet
Knowledge Engineering in KRR Explained
2 pages
KRR Unit 4: Processes and Classification
0% (1)
KRR Unit 4: Processes and Classification
9 pages
Data Visualization Techniques Overview
100% (1)
Data Visualization Techniques Overview
19 pages
Bayesian Parameter Estimation in NLP
No ratings yet
Bayesian Parameter Estimation in NLP
62 pages
Unit-1 Notes
No ratings yet
Unit-1 Notes
32 pages
Non-Speech Audio Retrieval Techniques
No ratings yet
Non-Speech Audio Retrieval Techniques
19 pages
IDS Unit 2
No ratings yet
IDS Unit 2
49 pages
Understanding Big Data Analytics
No ratings yet
Understanding Big Data Analytics
23 pages
IDS Unit-1-Handwritten
No ratings yet
IDS Unit-1-Handwritten
39 pages
NLP Unit 4
No ratings yet
NLP Unit 4
14 pages
NTFSInfo Utility Analysis for CS 2301
No ratings yet
NTFSInfo Utility Analysis for CS 2301
4 pages
M.E. Advanced Digital System Design Exam
No ratings yet
M.E. Advanced Digital System Design Exam
2 pages
Iso Iec 2382 2015
No ratings yet
Iso Iec 2382 2015
6 pages
Custom AI Solutions for Financial Services
No ratings yet
Custom AI Solutions for Financial Services
6 pages
Vaibhav Sharma: Python Developer Profile
No ratings yet
Vaibhav Sharma: Python Developer Profile
2 pages
Real-Time Daylight Analysis with ML
No ratings yet
Real-Time Daylight Analysis with ML
19 pages
S7MPI Pro User Manual for PLC Communication
No ratings yet
S7MPI Pro User Manual for PLC Communication
12 pages
Basics of Set Theory Explained
No ratings yet
Basics of Set Theory Explained
27 pages
Mobile App Development Exam Questions
No ratings yet
Mobile App Development Exam Questions
1 page
Microsoft Operations Management Suite Overview
No ratings yet
Microsoft Operations Management Suite Overview
40 pages
Polynomial and Piece-wise Functions
No ratings yet
Polynomial and Piece-wise Functions
7 pages
Smart Campus Surveillance System
No ratings yet
Smart Campus Surveillance System
6 pages
Step 1: Raw Data To SDTM: Clinical R Programming Workflow SDTM Adam TLF
No ratings yet
Step 1: Raw Data To SDTM: Clinical R Programming Workflow SDTM Adam TLF
19 pages
Steve Wozniak and Business Vocabulary
No ratings yet
Steve Wozniak and Business Vocabulary
5 pages
Unsur Cuaca dan Hujan Kilat
No ratings yet
Unsur Cuaca dan Hujan Kilat
18 pages
Geometric Properties of Roots in Complex Equations
No ratings yet
Geometric Properties of Roots in Complex Equations
11 pages
Quantum Computing Concepts Explained
No ratings yet
Quantum Computing Concepts Explained
23 pages
CVMC Purchase Request for Equipment
No ratings yet
CVMC Purchase Request for Equipment
2 pages
Comparative E-Banking: HDFC vs ICICI
No ratings yet
Comparative E-Banking: HDFC vs ICICI
97 pages
Abu Dhabi BIM Code of Practice 2020
No ratings yet
Abu Dhabi BIM Code of Practice 2020
27 pages
Evolution of EPA SWMM Model
No ratings yet
Evolution of EPA SWMM Model
35 pages
Configure PO Approval Email Notifications
No ratings yet
Configure PO Approval Email Notifications
11 pages
SEO Analysis for Sarkari Yojana 2025
No ratings yet
SEO Analysis for Sarkari Yojana 2025
22 pages
DSP8062 Wall Mount Speaker Overview
No ratings yet
DSP8062 Wall Mount Speaker Overview
4 pages
Dynamic Graph Model for Info Diffusion
No ratings yet
Dynamic Graph Model for Info Diffusion
17 pages
Understanding Website Cookies
No ratings yet
Understanding Website Cookies
68 pages
Project Management with MS Project
No ratings yet
Project Management with MS Project
28 pages
8086 Bus Configuration and Timing Details
No ratings yet
8086 Bus Configuration and Timing Details
104 pages
Fire and Gas Detection Philosophy Document
No ratings yet
Fire and Gas Detection Philosophy Document
12 pages
HDL Laboratory Manual for ECE Students
No ratings yet
HDL Laboratory Manual for ECE Students
49 pages

Semantic Parsing in NLP: PAS Overview

Uploaded by

Semantic Parsing in NLP: PAS Overview

Uploaded by

SBIT –AUTONOMOUS NLP

NATURAL LANGUAGE PROCESSING (NLP)

UNIT - IV Semantic Parsing II: 1. Predicate-Argument Structure,

Q1: Explain Predicate-Argument Structure. (Resources, Systems, Software)

OR Q: Write short notes on Predicate-Argument Structure found in NLP.

Predicate-Argument Structure (PAS), also known as Semantic Role Labeling (SRL),

In Natural Language Processing, the analysis of Predicate-Argument Structure has

III CSE(AI/ML) 1 Mrs. N Savitha [Link].,(Ph.D)

 The COMPLIANCE frame when it means "to fail to observe an agreement."

 The CAUSE_TO_FRAGMENT frame when it means "to cause to separate

 Example:The AWARENESS Frame

III CSE(AI/ML) 2 Mrs. N Savitha [Link].,(Ph.D)

Diagram : FrameNet example

The following sentences show this frame in action:

2. No doubts existed as to [Cognizer our] [Predicate:noun comprehension]

2. PropBank (Proposition Bank)

o It defines two types of arguments: core and adjunctive.

III CSE(AI/ML) 3 Mrs. N Savitha [Link].,(Ph.D)

 Adjunctive Arguments (ARGM-X): These are modifier arguments whose

Table : Argument labels for operate.01 and author.01

III CSE(AI/ML) 4 Mrs. N Savitha [Link].,(Ph.D)

Table : List of adjunctive arguments in PropBank—ARGMS

 NomBank: Inspired by PropBank, NomBank focuses on identifying and tagging

III CSE(AI/ML) 5 Mrs. N Savitha [Link].,(Ph.D)

 VerbNet: This resource provides a richer representation by linking PropBank

1. Argument Identification: The task of identifying which constituents (nodes) in a

2. Argument Classification: Given the correct (gold-standard) argument constituents,

3. Argument Identification and Classification: A combination of the first two tasks,

III CSE(AI/ML) 6 Mrs. N Savitha [Link].,(Ph.D)

The Semantic Role Lableing (SRL) Algorithm:

Procedure: SRL(Sentence) returns best Semantic Role Labeling

Input: Syntactic Parse of the Sentence

1. generate a full syntatuc parse of the sentence.

2. identify all the predicates

3. for all predicate sentence do

5. classify each feature vector using the model created in training.

6. select the class of highest scoring classifier

7. return best semantic role labelling

for each predicate in a sentence:

III CSE(AI/ML) 7 Mrs. N Savitha [Link].,(Ph.D)

Key Features from PSG:

2. Combinatory Categorial Grammar (CCG)

III CSE(AI/ML) 8 Mrs. N Savitha [Link].,(Ph.D)

Key Features from CCG:

3. Tree-Adjoining Grammar (TAG)

Key Features from TAG:

 Semantic Subcategorization: An extension of the subcategorization frame that

 ASSERT (Automatic Statistical Semantic Role Tagger)

III CSE(AI/ML) 9 Mrs. N Savitha [Link].,(Ph.D)

A semantic role labeler trained on the English PropBank data.

 Shalmaneser ( A Shallow Semantic Parser)

Write short notes on Meaning Representation.

Meaning Representation is a deep level of semantic interpretation in Natural Language

An effective analogy is the relationship between high-level programming languages and

III CSE(AI/ML) 10 Mrs. N Savitha [Link].,(Ph.D)

III CSE(AI/ML) 11 Mrs. N Savitha [Link].,(Ph.D)

This unambiguous representation can then be used by a system to perform an action,

 Example System: The Phoenix system used recursive transition networks

III CSE(AI/ML) 12 Mrs. N Savitha [Link].,(Ph.D)

o SCISSOR (Semantic Composition that Integrates Syntax and

1. Explain Predicate – Argument Structure.

2. Explain Meaning Representation.

III CSE(AI/ML) 13 Mrs. N Savitha [Link].,(Ph.D)

Common questions

Evaluate the effectiveness of Semantic Role Labeling (SRL) systems in identifying predicate-argument structures.

Evaluate the effectiveness of Semantic Role Labeling (SRL) systems in identifying predicate-argument structures.

Discuss the challenges of creating cross-lingual semantic role labeling resources based on PropBank.

Discuss the challenges of creating cross-lingual semantic role labeling resources based on PropBank.

What are the advantages of using a Combinatory Categorial Grammar (CCG) in Semantic Role Labeling?

What are the advantages of using a Combinatory Categorial Grammar (CCG) in Semantic Role Labeling?

How does FrameNet differ from PropBank in their approach to semantic role annotations?

How does FrameNet differ from PropBank in their approach to semantic role annotations?

Compare and contrast the representation of arguments in NomBank and VerbNet.

Compare and contrast the representation of arguments in NomBank and VerbNet.

Identify and explain the key features derived from Tree-Adjoining Grammar (TAG) useful in semantic role labeling.

Identify and explain the key features derived from Tree-Adjoining Grammar (TAG) useful in semantic role labeling.