Sign Language Corpus Analysis
Sign language corpus analysis is the methodology for building and studying machine-readable, multimedia collections of signed languages, the natural visual-spatial languages of deaf communities. Because signed languages have no widely used written form, a corpus cannot be a body of text; it must be a structured collection of video recordings of deaf signers, layered with time-aligned annotation that makes the language searchable and analyzable. Trevor Johnston's 2010 account of moving from archive to corpus set out the central methodological principles, most notably ID-glossing, in which every instance of a sign is annotated with a single stable identifier linked to a lexical database so that all tokens of the same sign can be found and counted. The pipeline records signing on video, segments and ID-glosses it, links those glosses to a lexicon, aligns multiple annotation tiers to the video timeline, and then supports quantitative, corpus-based analysis of frequency, variation, and grammar. The result turns a fragile collection of recordings into reusable empirical evidence about how a signed language is actually used.
Read the full method
Sign in with a free account to read this section.
Method map
The neighbourhood of related methods — select a node to explore.
Sources
- Johnston, T. (2010). From archive to corpus: transcription and annotation in the creation of signed language corpora. International Journal of Corpus Linguistics, 15(1), 106-131. DOI: 10.1075/ijcl.15.1.05joh ↗
How to cite this page
ScholarGate. (2026, June 23). Sign Language Corpus Construction and Analysis. ScholarGate. https://scholargate.app/en/disability-studies/sign-language-corpus
Which method?
Set this method beside its closest kin and read them side by side — the library lays the books on the table; the choice is yours.
- Augmentative and Alternative Communication AssessmentDisability Studies↔ compare