Part of Speech Tagging & Sense Disambiguation in Speech Synthesis: ECE 598 Deep Dive - Pro, Study Guides, Projects, Research of Electrical and Electronics Engineering

University of Illinois - Urbana-Champaign Electrical and Electronics Engineering

Prof. Yi Ma

An in-depth exploration of part-of-speech tagging and sense disambiguation in speech synthesis, as covered in the ECE 598: Speech Synthesis course at the University of Illinois at Urbana-Champaign. It discusses the challenges of part-of-speech tagging, particularly in languages with minimal morphology, and the importance of tagging for speech synthesis. It also covers sense disambiguation, using French as an example, and the decision-list approach to disambiguation. The document also touches on related topics such as word pronunciation, abbreviation expansion, and language modeling.

Typology: Study Guides, Projects, Research

Pre 2010

Uploaded on 03/16/2009


ECE 598: Speech Synthesis

Linguistic Analysis

Richard Sproat

http://www.linguistics.uiuc.edu/rws/

URL for this course:

http://catarina.ai.uiuc.edu/ECE598/


Synopsis

Problems

- Part of speech tagging
- Word-sense disambiguation
- Word pronunciation
- Preprocessing: abbreviation expansion, etc.

Multilingual issues

- Word segmentation in Asian languages
- Architectures for multilingual linguistic analysis

ECE 598: Linguistic Analysis

Part of Speech Tags

Part of speech (POS) tagging is simply the problem of placing words into equivalence classes.
The notion of part of speech tags can be attributed to Dionysius Thrax, a 1st Century BC Greek grammarian who classified Greek words into eight classes: noun, verb, pronoun, preposition, adverb, conjunction, participle and article.
Tagging is arguably easiest in languages with rich (inflectional) morphology (e.g. Spanish) for two reasons:

- It’s more obvious what the basic set of tags should be since words fall into [...]
- The morphology gives important cues to what the part of speech is: cantaremos is highly likely to be a verb given the ending -ar-emos.

It’s arguably hardest in languages with minimal (inflectional) morphology:

- There are fewer cues in English than there are in Spanish.
- For some languages like Chinese, cues are almost completely absent, and linguists can’t even agree on whether (e.g.) Chinese distinguishes verbs from adjectives.

Part of Speech Tags

Linguists typically distinguish a relatively small set of basic categories (like Dionysius Thrax)—sometimes just 4 in the case of Chomsky’s [±N,±V] proposal.

But usually these analyses assume an additional set of morphosyntactic features.

Computational models of tagging usually involve a larger set, which in many cases can be thought of as the linguists’ small set, plus the features squished into one term:

eat/VB, eat/VBP, eats/VBZ, ate/VBD, eaten/VBN
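As a rough illustration of "small set plus features squished into one term", the sketch below unpacks the five verb tags above into a coarse category and a feature bundle. The feature names (`form`, `tense`, `agr`) and the helper `decompose` are assumptions made for this example; they are not taken from the Penn tagset documentation.

```python
# Hypothetical decomposition of Penn Treebank verb tags into a coarse
# category ("V") plus morphosyntactic features. Feature names are illustrative.
PENN_VERB_TAGS = {
    "VB":  ("V", {"form": "base"}),
    "VBP": ("V", {"form": "finite", "tense": "present", "agr": "non-3sg"}),
    "VBZ": ("V", {"form": "finite", "tense": "present", "agr": "3sg"}),
    "VBD": ("V", {"form": "finite", "tense": "past"}),
    "VBN": ("V", {"form": "past-participle"}),
}


def decompose(tagged_word):
    """Split 'word/TAG' and look up the category and features of TAG."""
    word, tag = tagged_word.rsplit("/", 1)
    category, features = PENN_VERB_TAGS[tag]
    return word, category, features


examples = ["eat/VB", "eat/VBP", "eats/VBZ", "ate/VBD", "eaten/VBN"]
decomposed = [decompose(x) for x in examples]
for word, category, features in decomposed:
    print(word, category, features)
```

All five forms share the category V; only the feature bundle differs, which is the information a larger computational tagset folds into the tag name.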

Tagset size has a clear effect on the performance of taggers.

“the Penn Treebank project collapsed many tags compared to the original Brown tagset, and got better results.” (http://www.ilc.cnr.it/EAGLES96/morphsyn/node18.html)

But choosing the right size tagset depends upon the intended application.

As far as I know, there is no demonstration of what is the “optimal” tagset.

http://www.scs.leeds.ac.uk/ccalas/tagsets/brown.html
Motivations for the Penn tagset modifications

- “the Penn Treebank tagset is based on that of the Brown Corpus. However the stochastic orientation of the Penn Treebank and the resulting concern with sparse data led us to modify the Brown tagset by paring it down considerably” (Marcus, Santorini and Marcinkiewicz, 1993).
- Eliminated distinctions that were lexically recoverable: thus no separate tags for be, do, have.
- Also eliminated distinctions that were syntactically recoverable (e.g. the distinction between subject and object pronouns).

Problematic Cases

Even with a well-designed tagset, there are cases on which even experts find it difficult to agree.

adjective or participle?
    a seen event, a rarely seen event, an unseen event

a child seat, *a very child seat, *this seat is child
    but: that’s a very MIT paper, she’s sooooooo California

preposition or particle?
    he threw out the garbage
    he threw the garbage out
    he threw the garbage out the door
    *he threw the garbage the door out

How Hard is Tagging?

Many words are unambiguous. From the Brown corpus:

tags    # types with that many tags
1       35,[...]
2       3,[...]
3       264
4       61
5       12
6       2
7       1 (“still”)

The baseline for English (Penn tagset), obtained by assigning each word its most likely tag, is something like 91%.
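The two measurements on this slide (how many word types have 1, 2, 3, ... tags, and how well a most-frequent-tag baseline does) can be sketched as follows. The corpus below is a tiny hand-made toy, not Brown or Penn data, and the baseline is scored on the same tokens it was counted from, so the numbers are only illustrative.

```python
from collections import Counter, defaultdict

# Tiny hand-made tagged corpus (illustrative only, not Brown or Penn data).
tagged = [
    ("the", "DT"), ("race", "NN"), ("is", "VBZ"), ("still", "RB"), ("on", "IN"),
    ("they", "PRP"), ("race", "VBP"), ("on", "IN"), ("the", "DT"),
    ("still", "JJ"), ("lake", "NN"),
    ("the", "DT"), ("race", "NN"), ("on", "IN"), ("the", "DT"), ("lake", "NN"),
]

# For every word type, count how often it occurs with each tag.
tag_counts = defaultdict(Counter)
for word, tag in tagged:
    tag_counts[word][tag] += 1

# Histogram: number of distinct tags -> number of word types with that many tags.
ambiguity = Counter(len(counts) for counts in tag_counts.values())

# Baseline tagger: always choose the tag seen most often with the word.
most_frequent = {w: c.most_common(1)[0][0] for w, c in tag_counts.items()}
correct = sum(most_frequent[w] == t for w, t in tagged)
accuracy = correct / len(tagged)

print(dict(ambiguity))
print(f"baseline accuracy: {accuracy:.3f}")
```

On real data the histogram would be computed over the whole corpus vocabulary and the baseline evaluated on held-out text.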

Approaches to Automatic Tagging: Hand-Written Rules

ENGTWOL (Voutilainen, 1995) is an FST-based rule-based system for English tagging.

Example rule:

Adverbial that rule:
Given input “that”:
if
    (+1 A/ADV/QUANT)    /* next word is adj, adv. or quant */
    (+2 SENT-LIM)       /* following is sentence boundary */
    (NOT -1 SVOC/A)     /* prev word not adj comp verb */
then eliminate non-ADV tags
else eliminate ADV tag
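The rule above can be sketched in Python as a function that prunes the candidate tags of "that". This is not ENGTWOL's actual FST implementation: the data layout (a list of `(word, set_of_tags)` pairs), the function name `adverbial_that_rule`, and the choice to treat positions past the end of the list as a sentence boundary are all assumptions made for illustration.

```python
# Tag classes named in the rule; the real ENGTWOL inventory is richer.
A_ADV_QUANT = {"A", "ADV", "QUANT"}


def adverbial_that_rule(sentence, i):
    """Return the pruned tag set for position i of a list of (word, tags) pairs."""
    word, tags = sentence[i]
    if word.lower() != "that":
        return tags

    def tags_at(k):
        # Assumption: anything outside the sentence counts as a boundary.
        if 0 <= k < len(sentence):
            return sentence[k][1]
        return {"SENT-LIM"}

    condition = (
        bool(tags_at(i + 1) & A_ADV_QUANT)     # (+1 A/ADV/QUANT)
        and "SENT-LIM" in tags_at(i + 2)       # (+2 SENT-LIM)
        and "SVOC/A" not in tags_at(i - 1)     # (NOT -1 SVOC/A)
    )
    kept = tags & {"ADV"} if condition else tags - {"ADV"}
    # Design choice for this sketch: never delete the last remaining reading.
    return kept or tags


that_tags = {"ADV", "DET", "PRON", "CS"}
s1 = [("it", {"PRON"}), ("isn't", {"V"}), ("that", that_tags), ("odd", {"A"})]
s2 = [("I", {"PRON"}), ("consider", {"V", "SVOC/A"}), ("that", that_tags), ("odd", {"A"})]
print(adverbial_that_rule(s1, 2))  # only the ADV reading survives
print(adverbial_that_rule(s2, 2))  # the ADV reading is eliminated
```

The rule only removes readings; it never adds one, which is why a rule-based tagger of this kind starts from a dictionary that lists every possible tag for each word.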

Approaches to Automatic Tagging: Source-Channel Model

Basic problem: uncover the underlying signal of POS tags as modified by the noisy channel that produces observable words from tags.
For a bigram tagger this would give you the formula for the ith tag:

$$t_i = \arg\max_{j} P(t_j \mid t_{i-1})\, P(w_i \mid t_j)$$

For the whole sentence then we want to maximize:

$$\prod_{j} P(t_j \mid t_{j-1})\, P(w_j \mid t_j)$$

Note that this can also be derived via Bayes’ formula for a tag sequence T and word sequence W. We want to maximize

$$P(T \mid W)$$

which is given by

$$P(T \mid W) = \frac{P(T)\, P(W \mid T)}{P(W)}$$

Since the word sequence W is known, P(W) is the same for every candidate tag sequence, so we can drop it and just maximize

$$P(T)\, P(W \mid T)$$
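Maximizing the product over the whole sentence is usually done with the Viterbi dynamic program, which the slides do not name at this point. The sketch below assumes dictionaries `trans[(prev_tag, tag)]` and `emit[(tag, word)]` of probabilities and a start symbol `<s>`; these names and all the numbers are made up for illustration and are not estimated from any corpus.

```python
import math


def viterbi(words, tags, trans, emit, start="<s>"):
    """Return the tag sequence maximizing prod_j P(t_j | t_{j-1}) P(w_j | t_j)."""
    def logp(table, key):
        p = table.get(key, 0.0)
        return math.log(p) if p > 0 else float("-inf")

    # best[j][t] = (log-probability of the best path ending in t at j, backpointer)
    best = [{t: (logp(trans, (start, t)) + logp(emit, (t, words[0])), None)
             for t in tags}]
    for j in range(1, len(words)):
        column = {}
        for t in tags:
            score, prev = max(
                (best[j - 1][p][0] + logp(trans, (p, t)), p) for p in tags
            )
            column[t] = (score + logp(emit, (t, words[j])), prev)
        best.append(column)

    # Follow the backpointers from the best final tag.
    last = max(tags, key=lambda t: best[-1][t][0])
    path = [last]
    for j in range(len(words) - 1, 0, -1):
        path.append(best[j][path[-1]][1])
    return path[::-1]


# Toy, incomplete probability tables (assumed values).
tags = ["VBD", "TO", "NN", "VB"]
trans = {("<s>", "VBD"): 0.5, ("<s>", "NN"): 0.5,
         ("VBD", "TO"): 0.6, ("VBD", "NN"): 0.4,
         ("TO", "VB"): 0.8, ("TO", "NN"): 0.2}
emit = {("VBD", "expected"): 0.1, ("TO", "to"): 1.0,
        ("NN", "race"): 0.02, ("VB", "race"): 0.01}

result = viterbi(["expected", "to", "race"], tags, trans, emit)
print(result)
```

In this toy model "race" is more likely as NN than as VB when taken in isolation, but the transition from TO favors VB strongly enough that the best whole-sentence path tags it VB.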

What do you do if you don’t have tagged data?

You can assume an initial distribution of tags over the corpus (given a dictionary and perhaps some linguistically based guesses) and then use an algorithm such as expectation maximization (EM).

Approaches to Automatic Tagging: Transformation-Based Learning

TBL was proposed by Eric Brill in his U Penn dissertation (1993) and his 1995 Computational Linguistics article. It is a “weakly statistical” method.

The system starts with a set of initial assignments based on each word’s most likely tag. (Recall that this will be right about 9 times out of 10.)
Then the system proceeds to learn rules of the form: “change X into Y if preceded/followed by Z”.

Thus: Change NN to VB when the previous tag is TO

So: expected/VBD to/TO race/NN → expected/VBD to/TO race/VB

The rulespace is searched for the rule that gives the most improvement given the corpus.
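One step of that search can be sketched as follows. The sketch assumes a single rule template ("change X to Y when the previous tag is Z"), whereas a full TBL system uses several templates and repeats the search until no rule helps; the function names and the five-word example are illustrative.

```python
def apply_rule(tags, rule):
    """rule = (from_tag, to_tag, prev_tag): retag from_tag as to_tag after prev_tag."""
    from_tag, to_tag, prev_tag = rule
    out = list(tags)
    for i in range(1, len(tags)):
        # Conditions are tested on the tags as they were before this pass.
        if tags[i] == from_tag and tags[i - 1] == prev_tag:
            out[i] = to_tag
    return out


def errors(tags, gold):
    """Number of positions where the current tags disagree with the gold tags."""
    return sum(t != g for t, g in zip(tags, gold))


def best_rule(current, gold, tagset):
    """Search the rule space for the rule that leaves the fewest errors."""
    candidates = [(a, b, z) for a in tagset for b in tagset for z in tagset
                  if a != b]
    return min(candidates, key=lambda r: errors(apply_rule(current, r), gold))


# "expected to race the horse": initial most-likely tags vs. gold tags.
tagset = ["DT", "NN", "TO", "VB", "VBD"]
current = ["VBD", "TO", "NN", "DT", "NN"]
gold = ["VBD", "TO", "VB", "DT", "NN"]

rule = best_rule(current, gold, tagset)
print(rule)
print(apply_rule(current, rule))
```

Here the search recovers the rule from the slide, "change NN to VB when the previous tag is TO", because it fixes "race" without touching "horse", whose previous tag is DT.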
