Neutral evolution and turnover over centuries of English word popularity

D. Ruck, R.A. Bentley, A. Acerbi, P. Garnett, D.J. Hruschka

Research output: Book/Report > Report > Academic

1 Citation (Scopus)

Abstract

Here we test Neutral models against the evolution of English word frequency and vocabulary at the population scale, as recorded in annual word frequencies from three centuries of English-language books. Against these data, we test both static and dynamic predictions of two neutral models, including the relation between corpus size and vocabulary size, frequency distributions, and turnover within those frequency distributions. Although a commonly used Neutral model fails to replicate all these emergent properties at once, we find that a modified two-stage Neutral model does replicate the static and dynamic properties of the corpus data. This two-stage model is meant to represent a relatively small corpus (population) of English books, analogous to a 'canon', sampled by an exponentially increasing corpus of books in the wider population of authors. More broadly, this model -- a smaller neutral model within a larger neutral model -- could represent those situations where mass attention is focused on a small subset of the cultural variants.
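As a rough illustration of the kind of two-stage neutral process the abstract describes, the sketch below evolves a small fixed-size "canon" by random copying with occasional innovation, then draws an exponentially growing corpus from that canon at each step. This is only one possible reading, not the authors' implementation: the function names, the parameters (`canon_size`, `corpus_start`, `growth`, `mu`, `y`), their default values, and the choice to build the corpus purely by sampling from the canon are all assumptions made for this example.

```python
import random
from collections import Counter
from itertools import count


def neutral_step(population, mu, new_ids, rng):
    """One neutral generation: each slot copies a random token from the
    previous generation with probability 1 - mu, or (with probability mu)
    introduces a brand-new word taken from the new_ids iterator."""
    return [
        next(new_ids) if rng.random() < mu else rng.choice(population)
        for _ in range(len(population))
    ]


def top_words(tokens, y):
    """Set of the y most frequent words in a list of tokens."""
    return {word for word, _ in Counter(tokens).most_common(y)}


def turnover(prev_tokens, curr_tokens, y):
    """Number of words in the current top-y list that were absent from
    the previous top-y list."""
    return len(top_words(curr_tokens, y) - top_words(prev_tokens, y))


def simulate_two_stage(canon_size=500, corpus_start=1000, growth=1.02,
                       mu=0.01, steps=50, y=20, seed=0):
    """Hypothetical two-stage sketch (all parameter values are assumptions).

    Stage 1: a canon of fixed size evolves by neutral copying.
    Stage 2: a corpus whose size grows by a factor `growth` per step is
             drawn by sampling tokens at random from the current canon.
    Returns a list of (step, corpus_size, vocabulary_size, turnover)."""
    rng = random.Random(seed)
    new_ids = count()                      # source of never-seen-before words
    canon = [next(new_ids) for _ in range(canon_size)]
    corpus_size = float(corpus_start)
    prev_corpus = None
    history = []
    for t in range(steps):
        canon = neutral_step(canon, mu, new_ids, rng)
        n = int(corpus_size)
        corpus = [rng.choice(canon) for _ in range(n)]
        vocab = len(set(corpus))           # distinct words in this step's corpus
        z = turnover(prev_corpus, corpus, y) if prev_corpus is not None else 0
        history.append((t, n, vocab, z))
        prev_corpus = corpus
        corpus_size *= growth              # exponential corpus growth
    return history
```

Calling `simulate_two_stage()` returns one tuple per step, so the corpus size can be compared with the vocabulary size (a static property) and the top-`y` turnover (a dynamic property) tracked over time. In this sketch the vocabulary of the corpus is bounded by the canon size no matter how large the corpus grows.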
Language: English
Number of pages: 12
State: Published - 30 Mar 2017

Fingerprint

turnover
dynamic property
model test
prediction
book
distribution
vocabulary

Bibliographical note

12 pages, 5 figures, 1 table

Keywords

  • cs.CL
  • physics.soc-ph

Cite this

Ruck, D., Bentley, R. A., Acerbi, A., Garnett, P., & Hruschka, D. J. (2017). Neutral evolution and turnover over centuries of English word popularity. arXiv
Ruck, D.; Bentley, R.A.; Acerbi, A.; Garnett, P.; Hruschka, D.J. / Neutral evolution and turnover over centuries of English word popularity. 2017. 12 p. (arXiv).
@book{4c32e8c1a5db4ec2af6bdaad85501fcd,
title = "Neutral evolution and turnover over centuries of English word popularity",
abstract = "Here we test Neutral models against the evolution of English word frequency and vocabulary at the population scale, as recorded in annual word frequencies from three centuries of English-language books. Against these data, we test both static and dynamic predictions of two neutral models, including the relation between corpus size and vocabulary size, frequency distributions, and turnover within those frequency distributions. Although a commonly used Neutral model fails to replicate all these emergent properties at once, we find that a modified two-stage Neutral model does replicate the static and dynamic properties of the corpus data. This two-stage model is meant to represent a relatively small corpus (population) of English books, analogous to a `canon', sampled by an exponentially increasing corpus of books in the wider population of authors. More broadly, this model -- a smaller neutral model within a larger neutral model -- could represent those situations where mass attention is focused on a small subset of the cultural variants.",
keywords = "cs.CL, physics.soc-ph",
author = "D. Ruck and R.A. Bentley and A. Acerbi and P. Garnett and D.J. Hruschka",
note = "12 pages, 5 figures, 1 table",
year = "2017",
month = "3",
day = "30",
language = "English",

}

Ruck, D, Bentley, RA, Acerbi, A, Garnett, P & Hruschka, DJ 2017, Neutral evolution and turnover over centuries of English word popularity. arXiv.

TY - BOOK
T1 - Neutral evolution and turnover over centuries of English word popularity
AU - Ruck, D.
AU - Bentley, R.A.
AU - Acerbi, A.
AU - Garnett, P.
AU - Hruschka, D.J.
N1 - 12 pages, 5 figures, 1 table
PY - 2017/3/30
Y1 - 2017/3/30
AB - Here we test Neutral models against the evolution of English word frequency and vocabulary at the population scale, as recorded in annual word frequencies from three centuries of English-language books. Against these data, we test both static and dynamic predictions of two neutral models, including the relation between corpus size and vocabulary size, frequency distributions, and turnover within those frequency distributions. Although a commonly used Neutral model fails to replicate all these emergent properties at once, we find that a modified two-stage Neutral model does replicate the static and dynamic properties of the corpus data. This two-stage model is meant to represent a relatively small corpus (population) of English books, analogous to a 'canon', sampled by an exponentially increasing corpus of books in the wider population of authors. More broadly, this model -- a smaller neutral model within a larger neutral model -- could represent those situations where mass attention is focused on a small subset of the cultural variants.
KW - cs.CL
KW - physics.soc-ph
M3 - Report
BT - Neutral evolution and turnover over centuries of English word popularity
ER -

Ruck D, Bentley RA, Acerbi A, Garnett P, Hruschka DJ. Neutral evolution and turnover over centuries of English word popularity. 2017. 12 p. (arXiv).