Docsity
Docsity

Prepara i tuoi esami
Prepara i tuoi esami

Studia grazie alle numerose risorse presenti su Docsity


Ottieni i punti per scaricare
Ottieni i punti per scaricare

Guadagna punti aiutando altri studenti oppure acquistali con un piano Premium


Guide e consigli
Guide e consigli


Discourse Analysis: Exploring Language, Context, and Identity, Schemi e mappe concettuali di Lingua Inglese

A comprehensive overview of discourse analysis, exploring its key concepts, methodologies, and applications. It delves into the relationship between language and identity, politeness strategies, and the role of context in shaping meaning. The document also examines corpus linguistics and its use in analyzing large datasets of language, highlighting the importance of studying language in its natural environment. It further discusses the functions of language, register variation, and the impact of the internet on language use.

Tipologia: Schemi e mappe concettuali

2023/2024

Caricato il 09/04/2025

benedetta-bufo-2
benedetta-bufo-2 🇮🇹

3 documenti

1 / 42

Toggle sidebar

Questa pagina non è visibile nell’anteprima

Non perderti parti importanti!

bg1
DISCOURSE ANALYSIS
Discourse analysis is an approach to the analysis of language that examines:
The patterns (modello / schema ricorrente) of language within and across the texts.
The patterns are typical / fixed ways of using language in specific situations.
They allow us to categorize the kinds of texts and the ways in which a text needs to be produced in order to
be recognized by others as valid.
Discourse Analysis compares various texts of the same kind to see if the patterns are repeated.
If the patterns are repeated, they are called conventions.
The social and the cultural contexts in which the texts occur / is produced / is used.
Discourse analysis is therefore concerned in what happens when people draw on (attingere) the knowledge they have
about language and about the world to do things in the real world.
It is therefore concerned in the practical (not theoretical) analyses of the language in use in both spoken and written
interactions, in order to provide a deeper understanding of the texts (of how they are realized and of how they convey
a message → what people mean by what they say, how they make sense of what others mean and how language
presents different views of the world and understandings) and of how they become meaningful to their users (of how
people present themselves to others and achieve their goals through the use of language.)
BRANCHES OF DISCOURSE ANALYSIS
Discourse Analysis is divided in two main branches:
A textually oriented one, which focuses on the linguistic features of the text.
A socially oriented one, which situates / collocates the text in the social context in which it occurs to see:
If and how the text relates with the social context.
If and how the text changes the social context.
Every single participant brings, in fact, something, contributing to an exchange.
The exchange is, therefore, constantly updating and refining the text and the social context.
It allows us to shape / to construct a certain vision of the real world.
The relation between language and identity / gender.
For many years, studies on the relation between language and identity / gender have a lot to do with
the idea of deficit.
But, in more recent years, a new idea has been taken, and it is the idea of identity / gender as performance.
Everyone chooses, in fact, to adopt an identity / a gender.
According to the gender-fluid theory, we do not have to be caged into an identity / gender that does not
always fit us (sometimes we are something and sometimes we are something else.)
The notion of intertextuality.
Every text relies on other texts.
ORIGIN OF DISCOURSE ANALYSIS
The term “discourse analyses” was first introduced by Zellig Harris as a way of analyzing connected speech
and writing by:
Considering language above (al di sopra) word / sentence level.
It means analyzing the whole discourse and not the single words / sentences, in order to define how the linguistic
features are distributed within and across texts and how they are combined in particular kinds of texts.
Discourse analysis compares, in fact, various texts of the same kind to see if the patterns are repeated.
If the patterns are repeated, they are called discourses.
The discourses share particular meanings and have characteristic linguistic features associated with them.
Considering the relationship between linguistic behavior (language / what is said or is written) and non-linguistic
behavior (context / what is meant.) *
It means situating / collocating / contextualize the text in the real world, in order to interpret it based on
the situation and on the culture in which it is produced.
The same text can, in fact, take a particular or a different meanings in different situations and cultures.
NEXT CONTRIBUTION TO DISCOURSE ANALYSIS
Next contribution to discourse analysis was provided by Van Dijk.
According to him, context is not objective / physical, but it is a subjective / mental construct. *
It isn’t, in fact, the context that influences the text, but it is the way in which the participants define the context
in which the text occurs.
The perception of the context is, therefore, constantly updated by the participants in their interactions / dynamic.
As a result, the link between language and context is indirect.
Ex. if I say “there’s a football match at the stadium” it seems to be an objective context.
but, in reality, the perception of the context depends on the participant.
the players will have a perception of the event, the audience in the stadium will have another, the audience on tv
will have another, the security staff will have another, the people who live close to the stadium will have another…
pf3
pf4
pf5
pf8
pf9
pfa
pfd
pfe
pff
pf12
pf13
pf14
pf15
pf16
pf17
pf18
pf19
pf1a
pf1b
pf1c
pf1d
pf1e
pf1f
pf20
pf21
pf22
pf23
pf24
pf25
pf26
pf27
pf28
pf29
pf2a

Anteprima parziale del testo

Scarica Discourse Analysis: Exploring Language, Context, and Identity e più Schemi e mappe concettuali in PDF di Lingua Inglese solo su Docsity!

DISCOURSE ANALYSIS

Discourse analysis is an approach to the analysis of language that examines:

  • The patterns (modello / schema ricorrente) of language within and across the texts. The patterns are typical / fixed ways of using language in specific situations. They allow us to categorize the kinds of texts and the ways in which a text needs to be produced in order to be recognized by others as valid. Discourse Analysis compares various texts of the same kind to see if the patterns are repeated. If the patterns are repeated, they are called conventions.
  • The social and the cultural contexts in which the texts occur / is produced / is used. Discourse analysis is therefore concerned in what happens when people draw on (attingere) the knowledge they have about language and about the world to do things in the real world. It is therefore concerned in the practical (not theoretical) analyses of the language in use in both spoken and written interactions, in order to provide a deeper understanding of the texts (of how they are realized and of how they convey a message → what people mean by what they say, how they make sense of what others mean and how language presents different views of the world and understandings) and of how they become meaningful to their users (of how people present themselves to others and achieve their goals through the use of language.) BRANCHES OF DISCOURSE ANALYSIS Discourse Analysis is divided in two main branches:
  • A textually oriented one, which focuses on the linguistic features of the text.
  • A socially oriented one, which situates / collocates the text in the social context in which it occurs to see: − If and how the text relates with the social context. − If and how the text changes the social context. Every single participant brings, in fact, something, contributing to an exchange. The exchange is, therefore, constantly updating and refining the text and the social context. It allows us to shape / to construct a certain vision of the real world. − The relation between language and identity / gender. For many years, studies on the relation between language and identity / gender have a lot to do with the idea of deficit. But, in more recent years, a new idea has been taken, and it is the idea of identity / gender as performance. Everyone chooses, in fact, to adopt an identity / a gender. According to the gender-fluid theory, we do not have to be caged into an identity / gender that does not always fit us (sometimes we are something and sometimes we are something else.) − The notion of intertextuality. Every text relies on other texts. ORIGIN OF DISCOURSE ANALYSIS The term “discourse analyses” was first introduced by Zellig Harris as a way of analyzing connected speech and writing by:
  • Considering language above (al di sopra) word / sentence level. It means analyzing the whole discourse and not the single words / sentences, in order to define how the linguistic features are distributed within and across texts and how they are combined in particular kinds of texts. Discourse analysis compares, in fact, various texts of the same kind to see if the patterns are repeated. If the patterns are repeated, they are called discourses. The discourses share particular meanings and have characteristic linguistic features associated with them.
  • Considering the relationship between linguistic behavior (language / what is said or is written) and non-linguistic behavior (context / what is meant.) ***** It means situating / collocating / contextualize the text in the real world, in order to interpret it based on the situation and on the culture in which it is produced. The same text can, in fact, take a particular or a different meanings in different situations and cultures. NEXT CONTRIBUTION TO DISCOURSE ANALYSIS Next contribution to discourse analysis was provided by Van Dijk. According to him, context is not objective / physical, but it is a subjective / mental construct. ***** It isn’t, in fact, the context that influences the text, but it is the way in which the participants define the context in which the text occurs. The perception of the context is, therefore, constantly updated by the participants in their interactions / dynamic. As a result, the link between language and context is indirect. Ex. if I say “there’s a football match at the stadium” it seems to be an objective context. but, in reality, the perception of the context depends on the participant. the players will have a perception of the event, the audience in the stadium will have another, the audience on tv will have another, the security staff will have another, the people who live close to the stadium will have another…

OTHER CONTRIBUTIONS TO DISCOURSE ANALYSIS

Other contributions to discourse analysis were provided by Firth, Halliday and Sinclair. *****

  • Firth distinguished the context of situation and the context of culture. According to him, one has to use for analysis the authentic language used in reality and not invented examples (ex. translation and foreign language teaching → it is not just about translating and teaching words, but it is about translating and teaching situations in a specific culture.)
  • Halliday theorized that a speaker / writer has a number of options available to communicate a message and that he / she will select them on the basis of the context (ex. register → formality / informality and relations of power.)
  • Sinclair theorized that language has to be studied in naturally occurring contexts (recurring and not sporadic use of language) and that the analysis of how meaning is conveyed has to be the main focus (ex. corpora → digital collections of actual (reale) texts, which allows to see what occurs authentically and repeatedly in language.) DISCOURSE STRUCTURE OF TEXTS The structure is the organization of a text (what comes first, next, and so on.) Mitchell studied the selling and buying interactions, identifying the stages / the steps that the participants go through while carrying out the interaction and understanding that such stages function because the participants carry out a co-operative action (the intention of the buyer is to buy, while the intention of the seller is to sell and, through the interaction, they both satisfy their goal.) Hasan and Ventola studied service encounters / negotiations / transactions, which is a sub-genre of the macro-genre “shopping”, distinguishing between obligatory and optional stages / steps. Ex. obligatory: optional: have you got…? hi! could you show me…? how are you? could I have…? nice / bad weather today… I’ll have… Such stages / steps vary depending on the age of the people involved, where the service encounter / negotiation / translation takes place and whether it is face-to-face or on the phone. Other scholars carried out the conversation analyses, which studies the opening / the closing of interactions, the turn taking, the overlaps, the intonation, the hesitations and the pauses. Different cultures determine, however, a different use of the language and a different way of doing the same things. Ex. japanese e-mails open with a comment regarding the weather (ritual requirement.) DIFFERENT VIEWS OF DISCOURSE ANALYSES Fairclough distinguished:
  • Textually oriented discourse analyses → looks at the language.
  • Socially oriented discourse analyses → situates / collocates the text in a tradition. He stated that such views are not incompatible / mutually exclusive, which means that one does not exclude the other. The language studied under a textually oriented view is, in fact, socially situated and needs to be interpreted in terms of their social meaning and functions. Ex. Obama’s victory speech when he won the US presidential election: “if there is anyone out there who still doubts that America is a place where all things are possible, who still wonders if the dream of our founders is alive in out time, who still questions the power of our democracy, tonight is your answer. it’s the answer spoken by young and old, rich and poor, democrat and republican, black, white, hispanic, asian, native american, straight, gay, disabled and not disabled.” Crystal made a textually oriented analyses of such speech, identifying the following features:parallelism → repeated structure.rule of three → 3 repetitions.list of pairs. Higgins made a socially oriented analyses of such speech, investing where that language comes from and finding out that:the tricolon (series of three) is taken from the Roman and the Greek tradition of rhetoric, in that it was one of Julius Caesar and Cicero’s rhetorical techniques → western tradition. Oratory was, in fact, the supreme political skill, on whose mastery power depended.the slogan “yes, we can” recalls the call-and-response preaching (predicazione) of the American Church and the power that preachers have on people → religious tradition. Williams made a socially oriented analyses of such speech, highlighting / pointing out the optimism manifested through the repetition of the slogan “yes, we can.” Moreover, Obama referenced to previous leaders (the rule of three was previously adopted by Winston Churchill), relying on the knowledge he shares with his audience.

SPEECH ACTS AND DISCOURSE

  • Austin – “How to do Things with Words.”
  • Searle – “Speech Acts.” They were working in a period when logical positivism was the dominant view. According to this view, language is always used to talk about the state of things or about facts and, in order for a statement to be meaningful, it must satisfy the truth or false condition (it must be defined as true or false.) This is, however, a very reductionist view of language. According to Austin and Searle, there are in fact many other things that can be said through language, which have nothing to do with the truth or false condition and which go beyond their literal meaning, producing effects in the real world (performing acts.) Ex. make a request, give orders, give warnings, give advice… The sum of the various words of a sentence is called propositional content, which coincides with the literal meaning. There are, however, other levels of meaning:
  • Locutionary act → act of speaking, literal meaning.
  • Illocutionary act / force → speaker’s intention.
  • Perlocutionary act / effect → effect on the listener’s thoughts or actions. Ex. It’s cold in here. Locutory act → act of saying it. Illocutionary act / force → let the receiver know that the temperature in the room is making me uncomfortable and push him / her to do something (close the window, turn on the heating, give me a hug…) From that moment the sender has, however, no more control, in that it relies on the receiver. A statement may have more than a single illocutionary act / force. Ex. What are you doing tonight? (question, invitation.) I’m busy (answer, rejection.) Nothing. What are you doing? (answer, acceptation, invitation.) DIRECT AND INDIRECT SPEECH ACTS We can be more or less direct in the formulation of a speech act. A speech act is:
  • Direct → if it shows the intention of the speaker (if the speaker means exactly what he / she says.)
  • Indirect → if it doesn’t show the intention of the speaker (if the speaker intends something that is different from the literal meaning of what he / she says.) Ex. bring a plate (food.) FELICITY CONDITIONS The felicity conditions are conditions that must be met in order for a speech act to succeed. To succeed a speech act must in fact:
  • Be a generally accepted procedure in the community in which it is produced. Ex. inviting someone to a wedding through a formal written invitation rather than through an informal message.
  • Be in the appropriate circumstances (place and time.) Ex. someone must be getting married.
  • Be produced by the appropriate person in the context. Ex. the couple getting married.
  • The receiver must be able to understand and carry out the request. If it is not the right person and not the right place the speech act will misfire. If it is not the right time the speech act will be an abused procedure. RULES VS PRINCIPLES
  • Searle stated that felicity conditions are rules.
  • Thomas stated that felicity conditions are principles. According to him there are 5 differences between rules and principles: − Rules tends to be “all or nothing” (either one follows the rule or one does not follow the rule), but reality is more complex than that. Principles tends to be “more or less” (one can follow the principle for some things and cannot follow the principle for other things.) − Rules tends to be “exclusive” (if one follows a rule, he / she does not follow another rule, one rule precludes another rule.) Principles can “co-occur” (one can follow more than one single principle.) − Rules are prescriptive, in that they define what one must do and how one must do it. Principles are descriptive, in that they describe what one often does or what one is more likely to do. − Rules tends to be definite (rigid.) Principles tends to be probabilistic (flexible), in that they describe the behavior. − Rules are arbitrary, in that they have been imposed by someone. Principles are motivated, in that one follows them for a reason / a purpose.

PRESUPPOSITION AND DISCOURSE

The presupposition is the common ground that is assumed to exist between the language users (shared knowledge about the language and about the world.) It determines the way of speaking, in that a speaker say something based on his / her assumption of what the listener is likely to know and of what the listener will infer. As a result, the speaker does not need to give all the information. We can distinguish between two kinds of presupposition:

  • Conventional presupposition → the assumption is less connected with the context and more connected to the language. Ex. would you like some tea? it does not matter when, where or who. the presupposition between the speaker and the listener is that there is some tea which is ready or that there is some tea which can be made.
  • Pragmatic presupposition → the assumption arises (sorgere) from the use of an utterance in a particular context. Ex. number 2! (bingo, competition, supermarket, hospital…) CO-OPERATIVE PRINCIPLES AND DISCOURSE The speaker may not observe the maxims:
  • Flouting. When the speaker deliberately decides to not observe the maxims and wants the listener to be aware of it.
  • Violating. When the speaker deliberately decides to not observe the maxims to deceive (ingannare) the listener. It may also involves lies.
  • Opting out. When the speaker deliberately decides to not observe the maxims for politeness, ethical or legal reasons.
  • Infringing. When the speaker fails to observe the maxims. METAPRAGMATICS AND DISCOURSE Metapragmatics are the ways in which a speaker shows his / her awareness of the meaning and of the effect of what he / she is saying. Ex. only joking (the speaker realizes that what has been said has revealed too much or has offended / hurt the listener.) CROSS-CULTURAL / INTERCULTURAL PRAGMATICS AND DISCOURSE
  • Cross-cultural pragmatics. It studies the speech acts, the cooperative principles and the expectations with their meaning and consequences which change across two different cultures. Ex. a japanese speaker may say “sorry” when he / she means “thank you.” the act of thanking is, in fact, an expression of indebtedness, but in japanese a debt not yet repaid calls for (richiedere) an apology from the debtor.
  • Intercultural pragmatics (between more cultures.) It studies the linguistic production and comprehension across more than two different cultures which use a common language (lingua franca) for communication. None of the people involved is, therefore, a native speaker. CONVERSATIONAL IMPLICATURE AND DISCOURSE The implicature is the inference (deduzione) that the listener makes about the speaker’s intended meaning. It is, therefore, a logical process, based on background knowledge and evidence (within the language or within the context / the real world), that allows the listener to make sense of and to interpret the message. The listener may, however, interpret the message in a way which is not the same as what the speaker intends (misunderstanding.) Ex. “there’s nothing on the tv” does not mean that there’s nothing at all, but rather that there’s nothing that interest the speaker. CONVENTIONAL AND PARTICULARIZED CONVERSATIONAL IMPLICATURE AND SCALAR IMPLICATURE
  • Conventional implicature. No particular context, but only the knowledge of the language, is required to derive the meaning of the implicature. Some words or expressions mean, in fact, the same in all the contexts. Ex. are you going to the party? yes, and pigs may fly.
  • Particularized conversational implicature. A particular context is required to derive the meaning of the implicature. Some words or expressions change, in fact, their meaning depending on the context. Ex. are you going to the party? my boyfriend is coming back from America.
  • Scalar implicature. The speaker uses words from a set of words that expresses a scale. He / she may choose one word from such scale and then correct it with another word of such scale. Ex. always (say something.) well, sometimes (rectify what has been said.) alright, never (deny what has been said.)

DISCOURSE AND GRAMMAR

There is a difference in grammar depending on whether we look at it from a sentence-based perspective or from a text-based / a discourse-based perspective.

  • Sentence-based perspective: − Prescriptive attitude → studies the language’s rules, in order to define what is right and what is wrong and in order to speak and to write correctly. − Abstract attitude → it is not based on a context.
  • Text-based / Discourse-based perspective: − Descriptive attitude → studies the mechanisms of how language works, by finds connections between the form that the grammatical item takes, the function it carries out and the relationship with the context. Traditional explanations of grammar do not adequately capture grammatical selection in long real-world texts, but rather in isolated invented sentences. Halliday and Hasan focused on patterns of grammar items and vocabulary items that combine to tie the text and the meanings in the text together and to connect them to the real world. Some linguistic items may, in fact, take a different function when they are considered at text level / at discourse level (rather than at sentence level.) Ex. in traditional explanations of grammar, the pronoun “it” replaces an inanimate object, but, at text level / at discourse level, it may also refer to a previously developed or to an ongoing topic. in traditional explanations of grammar, the demonstrative “this” indicates the position in space of an object, but, at text level / at discourse level, it may also refer to the raise of a new topic or it may focus the attention on a certain aspect of the current topic (in order to make something visible / relevant.) in traditional explanations of grammar, the demonstrative “that” indicates the position in space of an object, but, at text level / at discourse level, it may also assume a distancing function (in order to make something invisible / irrelevant.) in traditional explanations of grammar, the past perfect is used to narrate two actions that occur together in the past, of which one precedes the other (ex. I turned off the computer after I had saved the document), but, in long texts, it may also be used to introduce information that happened earlier than the narration time and then continue to develop such information using the past simple (ex. the man had already been arrested because he robbed a bank.) in traditional explanations of grammar, be going to is used to narrate intentions, but, in long texts, it may also be used to introduce the intention and then continue to develop it using will (traditionally used to narrate predictions of which we are not sure) or the present simple (traditionally used to narrate immediate actions.) TEXTURE OF A TEXT The texture is how the text holds together Patterns of organization Semantic / Meaning relations between lexical items Unity of structure Unity of texture Information structure Cohesion (Conceptual planning) Focus Flow (theme, rheme and thematic progression) UNITY OF STRUCTURE The information flow is what the speaker or the writer says in the text (content) and how he or she organizes it (structure), in order to make the text go on and to make the text work for its purpose.
  • Theme → starting point of a clause (what the clause is about.) − Topical theme → subject. − Textual theme → conjunction that joins two clauses together. − Interpersonal theme → formulas which express positions / points of view. Ex. probability (perhaps), usuality (sometimes), typicality (generally), obviousness (surely.) persuasion (believe me), opinion (according to me), admission (honestly), prediction (as expected.) desirability (hopefully), entreaty (kindly.)
  • Rheme → remainder of the clause (what the clause has to say about the theme.) If a text has patterns of themes and rheme, it has a certain thematic development.

THEMATIC PROGRESSION

The thematic progression refers to the way in which the theme of a clause may pick up a meaning from a preceding theme or rheme / how information flow is created in a text.

  • Constant / Reiteration theme → the theme of one clause is picked up and repeated at the beginning of the next / the following clause. Ex.
  • Linear / Zigzag theme the rheme of one clause is picked up and repeated in the theme of the next / of the following clause. Ex.
  • Multiple theme → a single clause can include multiple themes. Ex.
  • Split theme → a rheme may include several pieces of information, some of which may be picked up (riprendere) as theme in subsequent clauses. Ex.

COHESION AND DISCOURSE

Cohesion is a network of grammatical, lexical and other relations that connects together the parts of a text. The cohesive devises are:

  • Reference → the identity of an item can be retrieved (recuperare) from either within or outside the text. − Anaphoric reference → an item refers back to another item used earlier in the text. Ex. It seems everyone’s read that self-help book: “He’s Just Not That Into You.” First in the USA, then all over the world, women became converts to the book’s message. They sang its praises, calling it “The Bible.” − Cataphoric reference → an item refers forward to another item used later in the text. Ex. It seems everyone’s read that self-help book: “He’s Just Not That Into You.” − Exophoric reference → the identity of the item being referred to is found in the situation (not inside the speech or the written page) in which the text occurs. Ex. “What kind of book would you say this is?” “I suppose it’s a biography, because she tells her life in it .” − Homophoric reference → the identity of the item can be retrieved from the listener’s or the reader’s cultural knowledge. Ex. First in the USA, then all over the world, women became converts to the book’s message. (The → we don’t need to be told which USA or which world are being referred to.) − Comparative reference → the identity of the item can be retrieved not because it has already been mentioned or will be mentioned in the text, but because an item with which it has been compared has been mentioned in the text. Ex. The book assumes that all men are confident or that, if they really like a girl, they’ll overcome their shyness. The opposite is true (if men really like a girl, they’ll become shy.) − Bridging reference → an item refers to something that has to be inferred from the text or from the situation. It has therefore to do with the listener’s or the reader’s ability of guessing by drawing conclusions and by being imaginative. Ex. I’m not one of those blockes that find approaching women easy. (Those → there are men who find approaching women easy.)
  • Lexical cohesion → relationships in meaning between lexical items (content words and not functional words) within a text. − Repetition → an item that is repeated in the text (including items inflected for genre, number and tense and items derived.) − Synonymy → items that have the same meaning. − Antonymy → items that have an opposite meaning. − Hyponymy → items that have a superordination / subordination, general / specific, a kind of relationship. − Meronymy → items that have a composition, whole / part, a part of relationship. − Collocation → items that tend to co-occur together, in that one item calls for the others. − Expectancy relations → predictable syntactical or semantical / logical association between items. Ex. combinations of noun + verb, event + location.
  • Conjunctions → items that join paragraphs, phrases and sentences by expressing semantic / logic relations between them. They constitute a point of orientation for the listener or for the reader, in that, through them, the speaker or the writer organizes the discourse, indicating how to connect the pieces of information. Halliday and Hasan classify them in: − Additive. − Adversative. − Causal. − Temporal. − Continuatives. Martin and Rose classify them in: − Additive → addition in both a positive and a contrastive sense. Ex. and, moreover, besides, or, alternatively. − Comparative → comparison in both a positive (similarity) and a negative (contrast) sense. Ex. likewise, similarly, but, whereas, on the other hand. − Temporal → succession (successione.) Ex. before, when, meanwhile, after, first, then, next, subsequently, finally. − Consequential → means, condition, cause. Ex. by, thus, in this way. if, provided that (purchè, sempre che), unless. because, since, so, therefore.
  • Substitution → an item replaces another item or clause. Ex. one, do / did / done, so, the same.
  • Ellipses → omission of an item (noun or noun phrase, verb or verb phrase, clause) that can be recovered by referring to a previous element in the text or to the situation where the text occurs. Ex. what do you want to say to Mary? [I want to say] that I’m very sorry for the fight we had yesterday. what was it over? [it was over] something silly actually. they usually are, aren’t they? yeah [they usually are silly.] CORPUS LINGUISTICS Corpus linguistics is an approach to the study of language that involves collecting large quantities of naturally occurring language in electronic format and using a software to obtain information about frequencies and co- occurrences, in order to move from observation to generalization of what will be found in which contexts and to categorize the language features. Such language can be spoken, written or signed, in one variety or more and in one register or more. And the quantity can vary from a few hundred thousand words to billions, though the corpus usually contains more texts that could reasonably be read and remembered by an individual. What distinguishes a corpus from a collection of digitalized texts is that it is formatted in such a way that enables the software to identify patterns that would otherwise be missed by the conventional forms of reading. The output from the software can be a list of items (words or phrases) or a set of numbers visualized as tables, plots (diagramma) and graphs. CORPUS TYPES
  • Comparable corpus. − Monolingual comparable corpus. It is made up of two sub-corpora in the same language. One consists of non-translated texts and the other one consists of translated texts. − Bilingual and multilingual comparable corpus. It is made up of two sub-corpora in two or more languages. One consists of non-translated texts and the other one consists of translated texts. The monolingual, bilingual and multilingual comparable corpora’s texts are assembled on the basis o similar criteria (ex. the author and the readership, the date and the place of publication, the genre, the topic, the communicative situation…)
  • Bilingual parallel corpus. − Unidirectional bilingual parallel corpus. It is made up of two sub-corpora: one containing the original texts in language A and the other one containing their translations in language B. − Bidirectional bilingual parallel corpus. It is made up of four sub-corpora: the original texts in language A, their translations in language B, the original texts in language B and their translations in language A. CORPORA CLASSIFICATION Corpora are classified according to six contrastive parameters:
  • Sample or Monitor. − A sample corpus is of finite size and contains texts that have been collected to represent a language or a language variety. − A monitor corpus keeps increasing in size, in that it is open and constantly integrated with new texts.
  • Synchronic or diachronic. − A synchronic corpus consists of texts produced at one particular time. − A diachronic corpus consists of texts produced over a long period of time.
  • General or specialized. − A general corpus is thought to represent a language for everyday purposes. − A specialized corpus is thought to represent a language for special purposes.
  • Monolingual, bilingual or multilingual. − A monolingual corpus contains texts produced in a single language. − Bilingual and multilingual corpora contain texts produced in two or more languages respectively.
  • Written, spoken, mixed and multi-modal. − A written corpus contains written texts. − A spoken corpus contains recorded texts or texts that are written to be spoken. − A mixed corpus contains written and spoken texts. − A multi-modal corpus contains texts producd by using a combination of language, image and sound.
  • Annotated or non-annotated. − An annotated corpus contains textual or contextual information and an interpretative linguistic analysis (phonological, morphological, lexical, syntactic, semantic, pragmatic, stylistic level.) − A non-annotated corpus contains texts that have not been analyzed in any way.

Both range of distribution and dispersion involve dividing the corpus into segments. Moreover, a calculation of log-likelihood is used to identify the differences between the text type frequencies. A type may, in fact, occur more often in and are more distinctive of conversational speech, task-oriented, imaginative writing or informative writing. KEYWORDS The term “keywords” refers to words (mainly lexical words, in that grammatical words appear at the top of any frequency list) that are important because they occur more frequently in a corpus. They therefore indicate what the corpus is about. Keywords are often grouped in semantic categories, based on their meaning. And they can be tracked (tracciare) to identify where they occur in the corpus. They can, in fact, not be distributed / concentrated evenly in the corpus, but only in some parts of it. In keyword studies, a specialized corpus is compared with a much larger and general corpus. There are three alternatives:

  • Compare it with a very general corpus. It allows the researchers to investigate how the target corpus differs from the language in general.
  • Compare it with a corpus which is more specifically comparable to the target corpus. Ex. compare “Romeo and Juliet” with all of Shakespeare’s plays.
  • Compare it with another target corpus. It allows the researchers to investigate the differences between the two corpora. The most different may, however, be characterized as typical and confirm harmful stereotypes. For this reason, comparative frequencies can also provide an explanation of where stereotypes come from / arise. Ex. compare broadsheet newspapers and tabloid newspapers reporting the same news. compare manifesto documents from two different political parties. MEASURING COLLOCATION One word can be defined in terms of the words that accompany it. Collocation studies analyze how often two words are found together (frequency.) In order to study the collocational significance, the co-occurences are compared with the overall frequency of each of the two words (strength.) Collocation softwares often group collocates in grammatical categories, based on their grammatical relationship to the node word. Ex. noun + preposition, noun + verb, verb + noun, adjective + noun, adverb + adjective, adverb + verb. LEXICAL BUNDLES The collocation between two words may be the consequence of the presence of a longer phrase. Both fixed and variable phrases have been studied under various names. Some of them emphasize the phrase as meaning (ex. unit of meaning / idiom) or the phrase as structure (ex. sentences stem.) Others emphasize the fixed nature of some units (ex. multi-word unit.) In the cases where phrases are variable, it is not easy to identify them with automatic corpus searches, even though concgram searches can be used. In the cases where phrases are fixed, it is easy to identify them automatically. Lexical bundles are sequences of a predetermined number of words (ngrams) that recur in a corpus. There are three main classes of lexical bundles, each of which has its sub-classes:
  • Stance expressions. − Epistemic. Ex. I don’t know what, is / are more likely to… − Attitudinal / Modality. Ex. to be able to, to want / don’t want to, it is important to…
  • Discourse organizers. − Topic introduction / focus. Ex. to want to talk about, if you look at, one of the things, that’s one of the... − Topic elaboration / classification. Ex. it has to do with, as well as…
  • Referential expressions. − Imprecision bundles. Ex. and things like that… − Attribute specifying. Ex. in terms of the, the nature of the, to have a lot of, the size of the… − Time and place. Ex. in this year, as shown in figure… Each discipline use more one kind of lexical bundle.

MULTIDIMENTIONAL ANALYSIS

Multidimensional analysis is an approach to the study of variation between types of text. It is in fact used to compare several sub-corpora, each one representative of a text type. Once the sub-corpora have been tagged or annotated and the frequency of each linguistic feature in each sub-corpus has been calculated, the analyses focuses on which language features are most likely to appear together in a sub- corpus and which ones are most unlikely to appear together in a sub-corpus. This shows which language features attract each other and which ones repel each other. It differs from approaches such as keywords or lexical bundles in three ways:

  • It is based on language features rather than on individual words.
  • It is based on the difference in frequency of such language features across sub-corpora rather than on the overall frequency of any language feature.
  • Requires a corpus that has been tagged or annotated for such language features. The language features include word classes (ex. pronouns), clause types (ex. wh questions) and grammatical features (ex. past tense.) SEMANTIC AND OTHER ANNOTATION The process of corpus annotation or tagging consists in assigning labels to the items in a corpus so that all the items with a particular label can be retrieved. The term “annotation” is generally used to identify the process of adding labels to the items in a corpus, while the term “tag” is generally used to identify the label itself. There are two kinds of annotation or tagging tool available to researchers:
  • Annotate or tag a corpus automatically for some features.
  • Develop a custom-made annotation or tagging systems and apply them semi-manually to a corpus. Automatic annotation or tagging can be carried out with less effort than the non-automatic one, but it uses only tags predetermined by the program developer and it is more prone to error. The words annotated or tagged can be grouped in semantic categories, which can be identified following the same principle as identifying the keywords, that is, by comparing the frequency of the semantic tags in the target corpus with those in a reference corpus.

COMPUTER-AIDED TRANSLATION

The computer-aided translation (CAT) refers to the use of a computer software to assist to various degree a human translator in the translation process. The translation is therefore carried out principally by a person, but it also involves the use of a computer software that facilitates some aspects of it. This contrasts with Machine Translation (MT), which refers to a translation that is carried out principally by a computer software, but that may also involve some human intervention (pre- or post-editing.) CAT TOOLS CAT tools are computer software designed specifically with the translation task. The most popular CAT tool is the Translation Environment Tool (TEnT), which is an integrated suite of tools. It is constructed around a translation memory (TM), which often functions in association with a terminology management system. TRANSLATION MEMORY TOOLS A TM is a tool that allows users to store previously translated text and then consult them for reuse. To do this, the source and the target texts are stored in the TM database as bitexts and an aligned bitext is created by dividing the texts into segments and by linking each segment to its corresponding segment in the translation. When a translator has a new text to translate, the TM divides it into segments and compares each segment with the contents of the TM database, in order to identify whether any portion of the new text has been previously translated as part of a text stored in it. The matches can be accepted, modified or rejected by the translator. TERMINOLOGY TOOLS A terminology management system (TMS) is a tool used to store terminological information in and to retrieve it from a termbase. Translators can customize the term records with various fields (term, equivalent, definition, context, source…) Termbases can also be integrated with TM databases. By doing so, TMS can scan a new text, compare its contents against a specified termbase and identify the matches between them. OTHER TEnT COMPONENTS

  • Machine Translation systems. They generate a machine translation of a segment that has no match in the TM database.
  • Quality Control module. It includes spelling, grammar, completeness or language-compliance (conformità) checkers.

IMPACT ON TRANSLATION

The TM’s benefits are an increased productivity and an improved quality. Once matches are found, being able to automatically copy an paste items from the TM database or termbase into the target text saves translators typing time and reduces typographic errors. Nevertheless, the segment-by-segment processing approach underlying the TM means that the notion of text may be lost. Moreover, CAT tools affect the translators’ professional status and intellectual property rights. Some clients may, in fact, attribute less value to the work of translators who use them, in that it is easier and faster than human translation, or may be more demanding (esigente) if they use their own TM to pre-translate a text before sending it to the translator. Yet, even exact matches do not equate to zero time spent for the translator. He / she has in fact to evaluate the suggested sentences and make adjustments. Finally, legal questions surround the ownership and the sharing of CAT data. Translators may wish to exchange or to sell a TM, but its source texts and translations are ownership of the client and he / she may demand confidentiality. POST-EDITING The term “post-editing” defines the activity of revising a text that has been translated automatically by a Machine Translation (MT.) To do so, the editor compares the source text with the raw (grezzo) translation produced by the Machine Translation and identifies and fixes the errors. At the time, the most common process involves sending the text to the Machine Translation and then editing it post Machine Translation, hence the affix “post” in “post-editing.” We can distinguish between:

  • Light post-editing. Where only grammar and meaning errors are fixed, in order to produce a translation that can be understood.
  • Full post-editing. Where also stylistic errors are fixed, in order to produce a translation that is more or less equivalent to what a translator might produce. ISSUES
  • Productivity. It refers to how quickly a translation can be produced, which means, how many words a translator can edit in one day. With Machine Translation (MT) doing some of the work, expectations of productivity have increased unrealistically. Productivity depends, however, on a number of factors including the language pair, the content type, the MT system and the translator.
  • Quality and cost. The quality produced through MT is not necessarily inferior to that produced in more traditional ways. Nevertheless, even if the raw MT output can be read fluently, it can still have mistakes. Such mistakes are, therefore, difficult to identify. Moreover, there is discussion about how to pay for the work, whether based on the number of words or hours worked.
  • Training. There is general agreement that teaching about MT necessarily includes teaching about post-editing. There is, however, no agreement on where this is best placed in the program. Some programs have courses dedicated to post-editing, while others incorporate it into the translation technology course or into the revision skills course. FUTURE OUTLOOK Significant progress has been made recently within the field of artificial intelligence that increases the speed of technological and translation improvement. There is concern about what impact this might have on the profession. It will not make translators redundant, in that, due to the complexity and evolutionary natura of language, it is not expected that MT will produce error-free output. Maybe translators will only post-edit in the future. Moreover, while MT will serve the need for informative content, humans will serve the need for emotional and creative content that addresses to a specific user.

WHY DO WE NEED A GLOBAL LANGUAGE?

Translation and interpretation have played a central role in human interaction for thousands of years. But they have limits. In fact, the more a community is linguistically mixed, the less it can rely on individuals to ensure the communication between the different groups. In communities where only few languages are in contact, bilingualism can be a solution (children can acquire more than one language.) But in communities where many languages are in contact, such solution can not be applied. Such problem has traditionally been solved by finding a language to act as a lingua franca (common language.) The different communities can, in fact, adopt:

  • A pidgin (simplified language which combines the elements of their different languages) as a lingua franca.
  • The indigenous language of the most powerful community as a lingua franca. As a result, the other communities learn it and become bilingual.
  • A foreign language (from outside the community) as a lingua franca, because of the political, military, economic, technologic and cultural power of a foreign community. The prospect that a lingua franca might be needed for the whole world is something which has emerged only in the 20 th^ century. Since 1950s developed in fact:
  • International bodies.
  • International business communities.
  • International academic communities.
  • Technology of communication and transportation. Such developments affected every country in the world. There are no precedents for this kind of linguistic growth at this speed. There has never been a time when so many countries were needed to talk to each other so much and have been represented in single meeting-places. There has never been a time where so many people have become so mobile (physically and virtually.) As a result, multi-way translation and interpretation would be a more complicated and expensive solution than make use of the same common language. DANGERS OF A GLOBAL LANGUAGE Many people believe that a world with just one language in it would be a very good thing. Nevertheless, the use of a single language by a community is no guarantee of harmonic / peaceful cohesion, mutual understanding and cooperativeness. Moreover, there are a number of effects that globalization may have on the global diversity:
  • Linguistic power. The emergence of a global language may lead to the emergence of an elite monolingual linguistic class. Those who speak a global language as mother tongue may, in fact, be in a position of power compared with those who have to learn it as an official or as a foreign language. They may, in fact, be more able to understand, assimilate, think and produce quickly in it and to manipulate it to their advantage and at the expense of the others. Nevertheless, if a global language is taught early / as children, it will develop a bilingualism indistinguishable from that of a native speaker who have been exposed to and have assimilated it since birth.
  • Linguistic complacency. The emergence of a global language may eliminate the opportunities or the motivation to learn the other languages by making them unnecessary and a waste of time. English speaking people assume, in fact, that everyone speaks English and that, if someone does not, it is his / her fault. Nevertheless, even though English is the world’s language, it is not the world’s only language. Since English monolingual companies are increasingly encountering language difficulties and are losing opportunities, UK and USA are now paying more attention to the other languages.
  • Linguistic triumphalism and linguistic death. The emergence of a global language may lead to the disappearance of the minority languages. Nevertheless, the processes of language dominance and loss exist independently of the emergence of a global language. It is caused by a community coming to be conquered and assimilated within a more dominant community and adopting its language. When a language dies, so much is lost. As a result, the emergence of a global language has stimulated the emergence of movements for language rights, whose aim is language preservation and revitalization. MUTUAL / GLOBAL INTELLIGIBILITY VS LANGUAGE IDENTITY Language is the main means to show where we belong and to distinguish one community from another. The need for language identity is often seen as being opposed to the need for mutual / global intelligibility. Nevertheless, it is possible to develop a situation in which they co-exist to achieve different goals (bilingualism.) It is, in fact, possible for linguistic diversity to exist in a world united by a common language so that the global language provides access to the world community and the national / regional language provides access to the local community. Nevertheless, the emergence of a global language may influence the structure and the lexicon of the other languages. Such influence may be welcomed as an enrichment or may be opposed as a contamination and a damage. But the idea that borrow words leads to a language’s decline is absurd, given that English has borrowed itself many words.

COULD ANYTHING STOP A GLOBAL LANGUAGE?

  • If language dominance is a matter of political and economic influence, then a revolution in the world order and in the global power may affect the choice of the global language.
  • Machine translation … WHY ENGLISH?
  • Political expansion. After the First World War, the League of Nations assigned the former German colonies in Middle East, Africa, Asia and the Pacific to the victors. English influence grew in the areas which came to be assigned to Britain or to other English-speaking nations (such as Australia, New Zealand and South Africa.)
  • International relations. The League of Nations (later replaced by the United Nations) was the first of many international alliances / associations / organizations to give an official status to English by writing documents in this language. It is, in fact, used as lingua franca among its numerous members. In meetings, however, the nations have the right to participate each with their own language. Each pair of languages requires, therefore, translation and interpreting services. If there is no translator and interpreter available for a language pair, English is used as an interlingua (one person translate the speech from language 1 to English and another person translate the speech from language 2 to English.)
  • International travel. The reasons for travelling abroad are various: community rallies (raduno) / manifestations, academic conferences, business meetings, military interventions (army of occupation or of invasion), sport competitions, religious pilgrimages, holidays… The leading tourism spender and earner is the US. As a result, in the tourist spots of the world, the transportation and the accommodation services tend to have a parallel version in English. Away from the tourist routes, however, English is not used.
  • International safety. As world travel has grown, more goods and people are being transported more quickly and simultaneously to more places than ever before. Traffic controllers need, therefore, to make their speech clear / unambiguous in the sending and in the receiving of messages. Projects were, therefore, set up to produce a restricted / an essential English for international maritime (referred to as “seaspeak”), aircraft (referred to as “airspeak”), fire, police and ambulance services. Such English recommends standardized / fixed vocabulary, grammar and structure to express all possible situations. A main influence has come from the language of air traffic control. After the Second World War, the allied leaders organized a conference in which they laid the basis / foundations for the post-war air transport system and in which they agreed that English should have been the international language of aviation. That’s because the leaders of the allies where English speakers, the major aircraft manufacturers were English speakers and most of the pilots (ex-military personnel) were English speakers. Moreover, it is safer that all pilots understand all conversations. They listen, in fact, not only to messages addressed to themselves, but also to messages addressed to other pilots in their proximity. In this way, they can learn about weather and traffic conditions and, if they hear an error in someone else’s conversation, they can correct it.
  • Media. − Press. In the 17th^ century, European countries began to publish newspapers, but taxation, wars, censorship and other restrictions allowed little growth. The growth was much bigger and faster in Britain and in America. The 19th^ century was the period of greatest growth, thanks to the abolition of the Stamp Tax, the emergence of the major news agencies (agenzia di stampa) , the introduction / the development of new techniques of news gathering (raccoglimento) , new printing technology and new methods of mass production and transportation. For this reason, Britain and US acquired more territorial news monopolies than any of their competitors. As a result, the majority of the information being transmitted worldwide was published in English-status countries and was, therefore, written in English. Moreover, the top 5 most influential newspapers on a global scale are all in English (“The New York Times”, “The Washington Post”, “The Wall Street Journal”, “The Times” and “The Sunday Times.”) There are also English-language newspapers intended for a global readership (“International Guardian.”) This can also be told in relation to all kinds of publication, such as literary / academic / technical journals (rivista) , comics, pamphlets (opuscolo, brochure, volantino)