Historical Corpora and Attested Records
The written records and assembled corpora that provide the primary attested evidence for the history of languages, and the methods for using them responsibly.
Definition
Historical corpora and attested records are the bodies of surviving written evidence for earlier stages of a language, including inscriptions, manuscripts, and structured electronic collections, used as primary data for studying language history.
Scope
This topic covers the documentary basis of historical linguistics: inscriptions, manuscripts, printed texts, and the structured electronic corpora built from them. It addresses the dating and interpretation of attestations, the gaps and biases of the surviving record, the distinction between attested and reconstructed evidence, and the role of corpus methods in tracking change quantitatively.
Core questions
- What counts as attested evidence for an earlier stage of a language?
- How are written records dated and localized?
- What biases and gaps affect the surviving documentary record?
- How do structured historical corpora enable quantitative study of change?
- How is attested evidence weighed against reconstructed evidence?
Key theories
- Corpus-based study of language change
- Structured, annotated historical corpora allow change to be tracked quantitatively across time, genre, and region, while requiring careful attention to the representativeness and biases of the surviving record.
History
Historical linguistics has always depended on attested records, from ancient inscriptions to medieval manuscripts. The development of electronic historical corpora from the late twentieth century, such as the Helsinki Corpus of English Texts, transformed the field by enabling systematic quantitative study of change, complementing traditional philological reading.
Debates
- Representativeness of the surviving record
- Because surviving texts are biased toward certain genres, regions, and educated registers, there is debate over how far corpus findings reflect the spoken language and ordinary usage of the past.
Key figures
- Merja Kyto
- Paivi Pahta
- Lyle Campbell
Related topics
Seminal works
- kytoPahta2016
- campbell2013
Frequently asked questions
- What is the difference between attested and reconstructed forms?
- Attested forms are actually recorded in surviving texts or inscriptions, while reconstructed forms are inferred from comparison and marked with an asterisk; attested evidence is generally given priority where it exists.
- Why are historical corpora useful?
- They let researchers measure how features change in frequency over time and across genres and regions, turning scattered textual evidence into systematic, quantifiable data on language change.