Lexical frames in academic prose and conversation

Bethany Gray, Douglas E Biber

Research output: Contribution to journalArticle

24 Citations (Scopus)

Abstract

While lexical bundles research identifies continuous sequences (e.g. the end of the, I don't know if), researchers have also been interested in discontinuous sequences in which words form a 'frame' surrounding a variable slot (e.g. I don't * to, it is * to). To date, most research has focused on a few intuitively-selected frames, or has begun with frequent continuous sequences and then analyzed those to identify associated frames. Few previous studies have attempted to directly identify the full set of discontinuous sequences in a corpus. In the present study, we work towards that goal, using a corpus-driven approach to identify the set of recurrent four-word continuous and discontinuous patterns in corpora of conversation and academic writing. This direct computational analysis of the corpora reveals a more complete set of frames than alternative approaches, resulting in the documentation of highly frequent frames that have not been identified in previous research.

Original languageEnglish (US)
Pages (from-to)109-136
Number of pages28
JournalInternational Journal of Corpus Linguistics
Volume18
Issue number1
DOIs
StatePublished - 2013

Fingerprint

conversation
documentation
Prose

Keywords

  • Collocational framework
  • Corpus-driven
  • Database tools
  • Formulaic language
  • Lexical bundles

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language

Cite this

Lexical frames in academic prose and conversation. / Gray, Bethany; Biber, Douglas E.

In: International Journal of Corpus Linguistics, Vol. 18, No. 1, 2013, p. 109-136.

Research output: Contribution to journalArticle

@article{7c5f9ac618be414ea2d14583a5a7f9fe,
title = "Lexical frames in academic prose and conversation",
abstract = "While lexical bundles research identifies continuous sequences (e.g. the end of the, I don't know if), researchers have also been interested in discontinuous sequences in which words form a 'frame' surrounding a variable slot (e.g. I don't * to, it is * to). To date, most research has focused on a few intuitively-selected frames, or has begun with frequent continuous sequences and then analyzed those to identify associated frames. Few previous studies have attempted to directly identify the full set of discontinuous sequences in a corpus. In the present study, we work towards that goal, using a corpus-driven approach to identify the set of recurrent four-word continuous and discontinuous patterns in corpora of conversation and academic writing. This direct computational analysis of the corpora reveals a more complete set of frames than alternative approaches, resulting in the documentation of highly frequent frames that have not been identified in previous research.",
keywords = "Collocational framework, Corpus-driven, Database tools, Formulaic language, Lexical bundles",
author = "Bethany Gray and Biber, {Douglas E}",
year = "2013",
doi = "10.1075/ijcl.18.1.08gra",
language = "English (US)",
volume = "18",
pages = "109--136",
journal = "International Journal of Corpus Linguistics",
issn = "1384-6655",
publisher = "John Benjamins Publishing Company",
number = "1",

}

TY - JOUR

T1 - Lexical frames in academic prose and conversation

AU - Gray, Bethany

AU - Biber, Douglas E

PY - 2013

Y1 - 2013

N2 - While lexical bundles research identifies continuous sequences (e.g. the end of the, I don't know if), researchers have also been interested in discontinuous sequences in which words form a 'frame' surrounding a variable slot (e.g. I don't * to, it is * to). To date, most research has focused on a few intuitively-selected frames, or has begun with frequent continuous sequences and then analyzed those to identify associated frames. Few previous studies have attempted to directly identify the full set of discontinuous sequences in a corpus. In the present study, we work towards that goal, using a corpus-driven approach to identify the set of recurrent four-word continuous and discontinuous patterns in corpora of conversation and academic writing. This direct computational analysis of the corpora reveals a more complete set of frames than alternative approaches, resulting in the documentation of highly frequent frames that have not been identified in previous research.

AB - While lexical bundles research identifies continuous sequences (e.g. the end of the, I don't know if), researchers have also been interested in discontinuous sequences in which words form a 'frame' surrounding a variable slot (e.g. I don't * to, it is * to). To date, most research has focused on a few intuitively-selected frames, or has begun with frequent continuous sequences and then analyzed those to identify associated frames. Few previous studies have attempted to directly identify the full set of discontinuous sequences in a corpus. In the present study, we work towards that goal, using a corpus-driven approach to identify the set of recurrent four-word continuous and discontinuous patterns in corpora of conversation and academic writing. This direct computational analysis of the corpora reveals a more complete set of frames than alternative approaches, resulting in the documentation of highly frequent frames that have not been identified in previous research.

KW - Collocational framework

KW - Corpus-driven

KW - Database tools

KW - Formulaic language

KW - Lexical bundles

UR - http://www.scopus.com/inward/record.url?scp=84878101396&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84878101396&partnerID=8YFLogxK

U2 - 10.1075/ijcl.18.1.08gra

DO - 10.1075/ijcl.18.1.08gra

M3 - Article

VL - 18

SP - 109

EP - 136

JO - International Journal of Corpus Linguistics

JF - International Journal of Corpus Linguistics

SN - 1384-6655

IS - 1

ER -