













































Studia grazie alle numerose risorse presenti su Docsity
Guadagna punti aiutando altri studenti oppure acquistali con un piano Premium
Prepara i tuoi esami
Studia grazie alle numerose risorse presenti su Docsity
Prepara i tuoi esami con i documenti condivisi da studenti come te su Docsity
Trova i documenti specifici per gli esami della tua università
Preparati con lezioni e prove svolte basate sui programmi universitari!
Rispondi a reali domande d’esame e scopri la tua preparazione
Riassumi i tuoi documenti, fagli domande, convertili in quiz e mappe concettuali
Studia con prove svolte, tesine e consigli utili
Togliti ogni dubbio leggendo le risposte alle domande fatte da altri studenti come te
Esplora i documenti più scaricati per gli argomenti di studio più popolari
Ottieni i punti per scaricare
Guadagna punti aiutando altri studenti oppure acquistali con un piano Premium
Appunti e slide di tutte le lezioni del secondo semestre del corso Semantic Fundamentals for Natural Language Processing
Tipologia: Appunti
1 / 53
Questa pagina non è visibile nell’anteprima
Non perderti parti importanti!














































In order to have a sentence, 2 components are needed: nouns and verbs. If there’re sentences without nouns or verbs, they’re elliptical sentences, since nouns and verbs are implicit. We have this structure because:
subject, but it doesn’t refer to any object. This can be questioned, but here we’ll assume it for simplicity.
objects to truth-values. This will be clarified in the following. VALENCY The number of the arguments of a verb is called valency.
etc.
The stative/eventive distinction is independent of the valency of the verb:
interpretable as predicating such properties. However, an analysis of these verbs in such terms can be advanced.
By contrast, stative bivalent and trivalent verbs can be seen as predicating a certain relation between two or among three objects.
This supports the interpretation of states as the possession of properties by an object or as the existence of a relation among some objects. VALENCY HIGHER THAN 3?
valency 4. To sell something there must be:
John and the sun are NPs and refer to individuals. However, they’re not arguments of
The NPs (and the Prepositional Phrases - PP) that are not arguments are often called adjuncts. The main idea is that they’re “ optional ” and “ non-obligatory ” phrases. CORE AND ADJUNCTS Sentences can be divided into 2 parts: core and adjuncts:
It’s possible to propose some criteria to distinguish arguments from adjuncts. One might argue that the arguments of a verb must be referred in a sentence that contains that verb. Adjuncts need not. This is a grammatical criterion : sentences must have a certain structure. However, this criterion doesn’t work. Arguments are not
Other criteria were proposed by other linguists and philosophers. The schema is this one, we have eventualities that can be distinguished into:
However, in these cases, the modifier doesn’t measure the time needed to reach the aim of the state (there’s no aim in being tired). Rather, the modifier measures the time needed to reach the state of being tired. The sentence means that it took two minutes to John for being tired, i.e., that the process that led John to be tired lasted two minutes. This process has an aim (the state of being tired itself). PROCESSES
The processes are events , i.e. a kind of change. However, they’re not telic. Not having an aim means: the aim is not part of the meaning of the word, there’s no aim embedded in the meaning of the process verb.
there’s an aim.
In principle, the change they denote could go on indefinitely. In theory, one could go on talking indefinitely. Compare with reading a book. This doesn’t mean that at a certain point one doesn’t stop talking. Of course, one does. But no precise aim is
Processes are usually composed of a series of small cyclic changes. For instance, walking is composed of: put a leg in front, put the other leg in front, etc. This explains why in principle they can go indefinitely: we can always add another cycle. The number of cycles is not defined in advance. Because they last over time, process verbs like state verbs accept the modifier: for
leg in front. Because they denote changes that last a certain amount of time, process verbs accept the progressive form.
reach the aim of the process (there’s no aim in walking). Rather, the modifier measures the time needed to reach the start of a walking process. For example, Ann was paralyzed and she took two hours to walk again. Process can be agentive, but they don’t need to be. ACCOMPLISHMENTS
Unlike processes, accomplishments are telic. They’re changes that last an amount of time and that have an aim. This explains why they’re usually denoted by a VP (V + NP or V + PP). The NP or PP denotes an objects that gives an aim to the change and “measures” it. Building is a process, but building a house is an accomplishment: the house gives an aim to the process of building and measures this process. The state of the object or the relationship between the subject and the object tells us how close the accomplishment is to its aim. We can say how close the builder is to his/her aim by observing the state of the house. If we add a NP or a PP to a process verb, we often obtain an accomplishment verb:
However, this is not always the case. Sometimes, an operation on an object can be
However, while:
The arguments of verbs are not alike.
The different roles played by the arguments are usually called thematic roles. A thematic role is the role played by an object in the state or event denoted by the verb. We can consider thematic roles as the basis under which an object participates in a state or event. Alternatively, we can understand thematic roles as a relation between the object and the eventuality in which it participates. The thematic grid (or θ-grid = theta-gird) is a grid related to a verb that lists the thematic roles that the arguments play in the event or state denoted by that verb:
In eating something, Ann is playing the classical role of an agent. However, we cannot say the same of the subjects of the other sentences. In loving someone, we don’t properly perform an action. Maybe we wouldn’t like to love someone. It’s not voluntary, it’s something that happens to us. Is John performing an action? Is loving an action? For the moment, there’s a difference between the two sentences: Ann performs an action in willing do that action, while the case of John is not voluntary. The same remarks apply to the rock in 3. The rock does something, but the rock has not the will to destroy the car and it’s not voluntary. So, we wouldn’t say that the rock performs an action.
Difference between 2 and 3: John has a mind, a feeling, while the rock has no feeling. Again, if someone stumbles, s/he doesn’t do this at purpose. Stumbling is not a voluntary action. Since they don’t do actions, John, the rock and Ann are not agents in 2 and 4. However, there’re differences among the roles that these individuals fulfill:
doesn’t cause anything, it’s not intentional and it’s not sentient. Its existence is dependent on that of John and doesn’t move independently of him.
rule 2):
In this case, only 2 arguments of the verb are mentioned. Nemo is not mentioned. However, the rules are the same. We apply rule 2, which states that the argument having the greatest number of features of the agent is lexicalized as the subject and the argument having the greatest number of features of the patient is lexicalized as the object.
In this case, only one argument is mentioned. Nemo and the torpedo are not mentioned. However, the rules are the same. We apply rule 1, which says that the sole argument of the verb is lexicalized as the subject.
Sometimes the arguments of a bivalent verb have both some features of the agent and some features of the patient. In these cases, which arguments is projected onto the subject and which onto the object can depend on the verb and on the language.
Thunder —> cause, movement (but not intentional or feeling) John —> feeling (but he’s causally affected by the other participant)
The 2 arguments behave in different ways. English and Italian project the two arguments differently (the Italian amare, on the contrary, behaves as the English to love or to like). I —> feeling (but I’m casually affected by the other participant) Chocolate ice-cream —> cause (but not intentional, not feeling)
psychological subject, while fearing is usually unwelcome. ATTEMPTS OF SOLUTIONS: TANYA REINHART Tanya Reinhart distinguishes more thematic roles, 4 roles, it’s a more fine grained distinction and it’s more rigid since it’s not prototypical. Tanya Reinhart takes a different approach: she believes that thematic roles are characterized by 2 fundamental traits: mental and causal (m and c):
With respect to Dowty:
Therefore, Reinhart’s classification is more rigid than Dowty’s. It’s a classification based on traits , not a prototypical classification.
Both these sentences are grammatical. Since Paul is a sentient and the letter is an instrument, potentially they can both be lexicalized as subjects. Sometimes, the structure depends on the verb:
Probably, both the arguments are [-m, -c]. The water doesn’t directly cause the filling because the water is passive and directed to the swimming pool by a pump. In this
arguments can be subjects. The treatment of reduction is similar to Dowty’s. When less arguments of the verb are mentioned, then it’s as if the verb has less arguments and the subsequent rules apply. For instance:
strictly related to the knowledge of the world. This is one of the central tenets of cognitive semantics. A frame is a structure of concepts that grasps a certain portion of the world. One of Fillmore’s preferred examples is the frame that grasps a commercial transaction. The frame outlines the basic elements of the transaction and their relationships. For instance, in a commercial transaction at least 4 elements are involved:
A frame describes the situation, the objects involved, the roles they play and what happens. Of course, the roles are connected to the thematic roles but are more specific. A frame can be interpreted as a conceptual representation of the scene in which something happens. The scene includes what happens: the eventuality and the individuals that participate to that eventuality. In the scene, the relationships among the individuals that participate to the eventuality are clarified and displayed. A frame can be seen as a structure of connected concepts that describe the scene in which the eventuality occurs. Therefore, the frame is a sketched representation of a portion of the world. Words, in particular verbs, evoke frames. Without knowing the frame evoked by a verb, a speaker cannot know the meaning of the verb.
the same frame: the frame that outlines a commercial transaction. However, these verbs highlight some aspects of the frame and overshadow others. Of course, the role of the buyer has something in common with thematic roles in general and with the role of the agent in particular. However, it’s clearly more specific. The role of the agent concerns a very high number of verbs. By contrast, the role of the buyer concerns specific verbs: the verbs that evoke the frame of commercial transactions. Likewise, the elements of a frame and their roles have something in common with thematic grids. Again, however, the grid of a frame is much more specific than its thematic grid. The concept of frame is intended to describe the complete meaning of a verb. By contrast, the concept of thematic grid hasn’t this aim. Another example of frame is that schematizes the “judging scene”. The elements of this frame are:
FrameNet is a computational resource based on the concept of frame. The project is based at the International Computer Science Institute (ICSI) in California and it was founded by Charles Fillmore in 1997 and directed by him until his death in
An entry of FrameNet consists of various elements:
describing basic commercial transactions involving a buyer and a seller exchanging money and goods, taking the perspective of the seller. The words vary individually in the patterns of frame element realization they allow. For example, the typical patterns for SELL: SELLER sells GOODS to BUYER for MONEY EXAMPLE The second element of an entry of FrameNet is an example sentence that contains the words referring to the individuals mentioned in the definition. These words are colored differently for a better reference. For instance, the example sentence of sell is: The verb is the unique word in capital letters. The sentence refers to the seller, to the good, to the buyer and to the money, which are the individuals mentioned in the definition. The seller is marked in red, the buyer in blue, the good in green, etc. FRAME ELEMENTS Frame elements are what we have called arguments and adjuncts:
the frame of commercial transactions. These words foreground some elements of the frame and background other elements. An entry of FrameNet lists the lexical units that evoke a certain frame, and gives some information on their status (when they were created, the ID of the annotator who created them, the date of creation, etc) SEMANTIC ROLE LABELLING FrameNet is an example of semantic role labelling. Texts can be annotated at various levels: morphological, syntactic, semantic:
Logic studies correct reasoning. A reasoning is correct when preserves the truth (if the premises of the reasoning - i.e. the argument - are true, the conclusion is also true). Formal logic studies the procedures by which the verification of the correctness of reasoning can be rendered automatic. It regiments the sentences that constitute the premises and conclusions of our reasoning by organizing them into fixed schemas. It then identifies the valid schema of inference. In this way, formal logic has constituted the first form of the automatization of thought. For this reason, those who developed programming languages created them based on the formal languages of logic. It’s therefore useful for computational linguists to have some knowledge of logic and formal semantics. COMPOSITIONAL SEMANTICS In order to sketch a semantics of sentences, we must assign meanings to the words of the sentence and then compose these meanings into more and more complex meanings. Natural languages are compositional : the meanings of larger constituents depend on the meanings of smaller constituents. This composition follows the syntactic structure of sentences. When two constituents syntactically merge, their meanings compose with each other.
For instance, it’s because a noun and adjective syntactically merge that we must compose the meanings of the noun and of the adjective. Meanings must be assigned to the leaves (words) of the syntactic tree. Any time two constituents of the tree merge and a larger constituent is created, the meanings of the two constituents must be combined into a more complex meaning that depends on the meanings of the two constituents. The semantic structure mirrors the syntactic structure. This means that there’s a morphism between the syntactic structure and the semantic structure. This morphism is at least a homomorphism : that is, if M is the function that maps a string onto its meaning, R is the syntactic composition of two strings s1 and s2, S is their semantic composition, we have: M(s 1 R s 2 ) = M(s 1 ) S M(s 2 ) The homomorphism presupposes that a given semantic type is assigned to any syntactic constituent of a certain kind. It’s because nouns and adjectives have a certain kind of meaning that they can be combined. In other words, the meanings of nouns and adjectives must be such that their composition is possible. In principle, we should always be able to combine the meanings of nouns and adjectives. Therefore, in principle, the meanings of nouns and adjectives must be combinable. This is so because the type of meaning of nouns is combinable with the type of meaning of adjectives. Homomorphism implies that the same semantic operation is always associated to a given syntactic operation. Adjectives have a certain kind of meaning. Likewise, nouns have a certain kind of meaning. Therefore, when A and N merge, the semantic operation of combination of their meanings is always of the same kind. It’s the combination between a certain kind of meaning with another kind of meaning. Therefore, it’s the same kind of combination. If the semiotic triangle is a valid schema, the task of compositional semantic is:
The set of cats is a subset of the set of animals, the set of regular polygons is a subset of the set of polygons, etc. The set of cats is an improper subset of the set of cats and a proper subset of the set of animals. We can define some operations on sets. An operation is a function that takes one or more sets as input and gives a set as output. Union is an operation that takes two (or more) sets as input and gives as a result the set that contains all the elements of these sets.
Intersection is another operation, that takes two (or more) sets as input and gives as output the set of all elements that are contained in all these sets. The intersection of two sets can be empty.
Complement is an operation that takes a set as input and gives a result all the elements that don’t belong that set.
Mathematicians have introduced two types of special sets: singletons and the empty set :
singleton.
mathematical entity
singletons as the elements of U, but there’s only one empty set because sets are
defined by their elements. The empty set is indicated by ∅. It’s included in any other set. Therefore, for any A,
When a set has many or infinite members (for instance, the set of all men or the set of natural numbers), listing all its members can become inconvenient or impossible. So, it’s useful to have another method to represent a set. The members of the set are
x such that x is a man, that is the set of all things that are men, that is the set of all men. DEFINITIONS OF SET We have 2 methods for representing a set: the enumerative and the “semantic” method. The enumerative one is an extensional definition, the second one is an intensional definition.
The definition characterizes a set as a selection from a larger set (U), by specifying a condition, that is a property that discriminates the elements of the subsets from the elements of the absolute complement of the subset.
Another important mathematical notion used by formal semanticists is that of function. This notion is more complicate than that of set, but it’s useful because the composition of the meanings of two syntactic nodes is interpreted through it. The basic idea is that every time two syntactic nodes combine into a new node, from the semantic point of view, a function takes an argument and gives a value. In other words, when two nodes combine, one of them can be interpreted as an operator and the other one can be interpreted as an operand. Since the result of the function is unique, it can be seen as the result of the composition of meanings. ORDERED SETS A set is ordered if it’s a set and there’s an order among the elements of the set. An ordered set is defined by its elements and their position within the order.