USING CORPORA IN DISCOURSE ANALYSIS

Paul Baker

1. INTRODUCTION

This book is about using corpora and corpus processes to uncover linguistic patterns which can enable us to make sense of the ways that language is used in the construction of discourses. Some people may know a lot about discourse analysis but not about corpus linguistics; for others the opposite may be the case; for others still, both areas might be equally opaque. We will begin by giving a description of corpus linguistics and of discourse.

Corpus linguistics

Corpus linguistics is the study of language based on examples of real-life language use. Corpora are generally large (consisting of thousands or even millions of words), representative samples of a particular type of naturally occurring language, so they can be used as a standard reference against which claims about language can be measured. Electronic corpora are often annotated with additional linguistic information. Other types of information can also be encoded within corpora: for example, in spoken corpora (containing transcripts of dialogue), attributes such as sex, age, socio-economic group and region can be encoded for each participant. This allows comparisons to be made between the language of different types of speakers. Up until the 1970s only a small number of studies used corpus-based approaches; it was in the 1980s that corpus linguistics became popular as a methodology. Between 1976 and 1991 corpus linguistics was employed in a number of areas of linguistics, including dictionary creation, the interpretation of literary texts, forensic linguistics, language description, language variation studies and language teaching materials.
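
As an illustration of how speaker attributes encoded in a spoken corpus make group comparisons possible, the following is a minimal sketch in Python. The utterances, attribute names and function names are all invented for this example and do not come from any real corpus or corpus tool; a real study would use a far larger, properly sampled corpus.

```python
from collections import Counter, defaultdict

# Hypothetical toy corpus: every utterance carries its speaker's attributes.
UTTERANCES = [
    {"sex": "F", "age": 34, "region": "North", "text": "well I think it is lovely"},
    {"sex": "M", "age": 52, "region": "South", "text": "I think it is fine"},
    {"sex": "F", "age": 27, "region": "South", "text": "it is lovely really lovely"},
    {"sex": "M", "age": 41, "region": "North", "text": "well it is fine I suppose"},
]


def frequencies_by(attribute, utterances):
    """Count lowercased word tokens separately for each value of a speaker attribute."""
    counts = defaultdict(Counter)
    for utterance in utterances:
        tokens = utterance["text"].lower().split()
        counts[utterance[attribute]].update(tokens)
    return counts


def relative_frequency(word, counter):
    """Share of all tokens in a group that are the given word."""
    total = sum(counter.values())
    return counter[word] / total if total else 0.0


by_sex = frequencies_by("sex", UTTERANCES)
for group in sorted(by_sex):
    print(group, round(relative_frequency("lovely", by_sex[group]), 3))
```

Relative rather than raw frequencies are used so that groups contributing different amounts of speech can still be compared. Passing "region" instead of "sex" to frequencies_by would group the same utterances by region.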

Discourse

The term discourse is used in social and linguistic research in a number of inter-related yet different ways. In traditional linguistics it is defined as language above the sentence or above the clause. The term is also sometimes applied to different types of language use or topic: for example, we can talk about political discourse, colonial discourse, media discourse and environmental discourse. A number of researchers have used corpora to examine the discourse styles of people who are learners of English. Discourse can also be defined as practices which systematically form the objects of which they speak. To expand on this, a discourse is a system of statements which constructs an object: a set of meanings, metaphors, representations, images, stories, statements and so on that in some way together produce a particular version of events. Discourses are therefore not valid descriptions of people's beliefs or opinions, and they cannot be taken as representing an inner aspect of identity such as personality or attitude. They are connected to practices and structures that are lived out in society from day to day. Discourses can therefore be difficult to pin down or describe: they are constantly changing, interacting with each other, breaking off and merging. One way that discourses are constructed is via language. Language is not the same as discourse, but we can carry out analyses of language in texts in order to uncover traces of discourses.
