dc.creator Sardinha, Tony Berber
dc.date 2002
dc.date.accessioned 2013-05-30T12:54:19Z
dc.date.available 2013-05-30T12:54:19Z
dc.date.issued 2013-05-30
dc.identifier http://www.scielo.br/scielo.php?script=sci_arttext&pid=S0102-44502002000200004
dc.identifier http://www.doaj.org/doaj?func=openurl&genre=article&issn=01024450&date=2002&volume=18&issue=2&spage=273
dc.identifier.uri http://koha.mediu.edu.my:8181/jspui/handle/123456789/5481
dc.description This paper reports on a corpus-based method for discourse analysis built on the notion of segmentation, that is, the division of texts into cohesive portions. For the purposes of this investigation, a segment is defined as a contiguous portion of written text consisting of at least two sentences. The segmentation procedure developed for the study, called LSM (link set median), is based on the identification of lexical repetition in text. The data analysed were three corpora of 100 texts each, with each corpus representing one genre: research articles, annual business reports, and encyclopaedia entries. Together the three corpora totalled 1,262,710 words. The segments inserted in the texts by the LSM procedure were compared to the internal section divisions in the texts. The results of the LSM procedure were then compared to segmentation carried out at random. The LSM procedure performed better than random segmentation, suggesting that lexical repetition accounts in part for the way texts are segmented into sections.
dc.publisher Pontifícia Universidade Católica de São Paulo - PUC-SP
dc.source DELTA: Documentação de Estudos em Lingüística Teórica e Aplicada
dc.subject Corpus linguistics
dc.subject Discourse analysis
dc.subject Segmentation
dc.subject Lexical cohesion
dc.subject Repetition
dc.title Segmenting corpora of texts