Register Variation on the Searchable Web

A Multi-Dimensional Analysis

Douglas E Biber, Jesse Egbert

Research output: Contribution to journal › Article

15 Citations (Scopus)

Abstract

Most previous linguistic investigations of the web have focused on special linguistic features associated with Internet language (e.g., the use of emoticons, abbreviations, contractions, and acronyms) and the “new” Internet registers that are especially salient to observers (e.g., blogs, Internet forums, instant messages, tweets). Multi-Dimensional (MD) analysis has also been used to analyze Internet registers, focusing on core grammatical features (e.g., nouns, verbs, prepositional phrases). MD research differs theoretically and methodologically from most other research approaches in linguistics in that it is built on the notion of linguistic co-occurrence, with the claim that register differences are best described in terms of sets of co-occurring linguistic features that have a functional underpinning. At the same time, though, most previous MD studies are similar to other previous research in their focus on new Internet registers, such as blogs, Facebook/Twitter posts, and email messages. These are the registers that we immediately think of in association with the Internet, and thus it makes sense that they should be the focus of most previous research. However, that emphasis means that we know surprisingly little at present about the full range of registers found on the web and the patterns of linguistic variation among those registers. The present study addresses that gap. Rather than beginning with a focus on new registers that are assumed to be interesting, we analyze a representative sample of the entire searchable web. End-users coded the situational and communicative characteristics of each document in our corpus, leading to a much wider range of register categories than that used in any previous linguistic study: eight general categories; several hybrid register categories; and twenty-seven specific register categories. This approach thus leads to a much more inclusive and diverse sample of web registers than that found in any previous study of English Internet language. The goal of the present study is to document the patterns of linguistic variation among those registers. Using MD analysis, we explore the dimensions of linguistic variation on the searchable web, and the similarities and differences among web registers with respect to those dimensions.

Original language: English (US)
Pages (from-to): 95-137
Number of pages: 43
Journal: Journal of English Linguistics
Volume: 44
Issue number: 2
DOI: 10.1177/0075424216628955
State: Published - 2016

Fingerprint

dimensional analysis
linguistics
Internet
weblog
Multidimensional Analysis
Register Variation
World Wide Web
twitter
facebook
language
research approach

Keywords

  • hybrid registers
  • Internet language
  • Multi-Dimensional analysis
  • register variation
  • web registers

ASJC Scopus subject areas

  • Linguistics and Language
  • Language and Linguistics

Cite this

Register Variation on the Searchable Web: A Multi-Dimensional Analysis / Biber, Douglas E; Egbert, Jesse.

In: Journal of English Linguistics, Vol. 44, No. 2, 2016, p. 95-137.

@article{6f5150a2dd9641bda75cfacf1112a83a,
title = "Register Variation on the Searchable Web: A Multi-Dimensional Analysis",
keywords = "hybrid registers, Internet language, Multi-Dimensional analysis, register variation, web registers",
author = "Biber, {Douglas E} and Egbert, Jesse",
year = "2016",
doi = "10.1177/0075424216628955",
language = "English (US)",
volume = "44",
pages = "95--137",
journal = "Journal of English Linguistics",
issn = "0075-4242",
publisher = "SAGE Publications Ltd",
number = "2",
}

TY - JOUR

T1 - Register Variation on the Searchable Web

T2 - A Multi-Dimensional Analysis

AU - Biber, Douglas E

AU - Egbert, Jesse

PY - 2016

Y1 - 2016

KW - hybrid registers

KW - Internet language

KW - Multi-Dimensional analysis

KW - register variation

KW - web registers

UR - http://www.scopus.com/inward/record.url?scp=84966570500&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84966570500&partnerID=8YFLogxK

U2 - 10.1177/0075424216628955

DO - 10.1177/0075424216628955

M3 - Article

VL - 44

SP - 95

EP - 137

JO - Journal of English Linguistics

JF - Journal of English Linguistics

SN - 0075-4242

IS - 2

ER -