ScholarGate
Assistant

Historical Corpora and Attested Records

The written records and assembled corpora that provide the primary attested evidence for the history of languages, and the methods for using them responsibly.

Definition

Historical corpora and attested records are the bodies of surviving written evidence for earlier stages of a language, including inscriptions, manuscripts, and structured electronic collections, used as primary data for studying language history.

Scope

This topic covers the documentary basis of historical linguistics: inscriptions, manuscripts, printed texts, and the structured electronic corpora built from them. It addresses the dating and interpretation of attestations, the gaps and biases of the surviving record, the distinction between attested and reconstructed evidence, and the role of corpus methods in tracking change quantitatively.

Core questions

  • What counts as attested evidence for an earlier stage of a language?
  • How are written records dated and localized?
  • What biases and gaps affect the surviving documentary record?
  • How do structured historical corpora enable quantitative study of change?
  • How is attested evidence weighed against reconstructed evidence?

Key theories

Corpus-based study of language change
Structured, annotated historical corpora allow change to be tracked quantitatively across time, genre, and region, while requiring careful attention to the representativeness and biases of the surviving record.

History

Historical linguistics has always depended on attested records, from ancient inscriptions to medieval manuscripts. The development of electronic historical corpora from the late twentieth century, such as the Helsinki Corpus of English Texts, transformed the field by enabling systematic quantitative study of change, complementing traditional philological reading.

Debates

Representativeness of the surviving record
Because surviving texts are biased toward certain genres, regions, and educated registers, there is debate over how far corpus findings reflect the spoken language and ordinary usage of the past.

Key figures

  • Merja Kyto
  • Paivi Pahta
  • Lyle Campbell

Related topics

Seminal works

  • kytoPahta2016
  • campbell2013

Frequently asked questions

What is the difference between attested and reconstructed forms?
Attested forms are actually recorded in surviving texts or inscriptions, while reconstructed forms are inferred from comparison and marked with an asterisk; attested evidence is generally given priority where it exists.
Why are historical corpora useful?
They let researchers measure how features change in frequency over time and across genres and regions, turning scattered textual evidence into systematic, quantifiable data on language change.

Methods for this concept

Related concepts