Document and Archival Analysis
Using existing texts and records as data
Document and archival analysis is a research method in which existing materials — such as official reports, meeting minutes, letters, media content, and archival records — serve as primary data rather than newly collected information. The technique is unobtrusive and well suited to historical and longitudinal inquiry. However, the researcher must critically assess each material's authenticity, representativeness, and the purpose for which it was originally produced.
Definition and Scope
Document and archival analysis is a systematic research technique in which the researcher examines pre-existing written, visual, or digital materials without intervening in the setting. These materials may include government archives, institutional records, newspaper and journal articles, official correspondence, meeting minutes, personal diaries, and reports. The method is applicable within qualitative, quantitative, or mixed-methods designs; it can function as a stand-alone analytic tool or as a complement to fieldwork, interviews, or survey research.
How It Works: Types and Key Steps
The process typically involves: (1) identifying document types suited to the research questions; (2) distinguishing between primary (original records) and secondary (interpreted records) documents; (3) critically evaluating materials on dimensions of authenticity, credibility, representativeness, and meaning; and (4) extracting themes, patterns, or evidence through a systematic analytic framework such as content analysis or discourse analysis. Documents may be classified by purpose as primary (contracts, minutes), secondary (news reports, commentaries), or tertiary (indexes, compilations).
A Concrete Application Example
Suppose a researcher wants to examine how university gender-equity policies evolved over time. She assembles senate meeting minutes, equal-opportunity reports, and relevant legislation from 1990 to 2020 through archival access. She evaluates each document by considering who produced it, in what context, and for what purpose, then traces shifts in language and policy priorities using discourse analysis. This approach allows retrospective interpretation of social change and provides access to evidence unmediated by participants' memory-based biases.
Common Pitfalls and Recommendations for Good Practice
One of the most common errors is treating documents at face value without applying a critical filter. Every record was produced for a specific purpose and in a specific context; it should be read as a constructed account rather than a transparent reflection of reality. Researchers must also interrogate the representativeness of accessible materials: examining only surviving or declassified documents can introduce systematic bias. Best practice requires transparent reporting of which documents were accessed and why, detailed documentation of collection and analysis procedures, and sustained attention to how political or institutional power relations may have shaped content.
Key terms
- Primary Document
- An original record created at the time of or by direct participants in an event.
- Authenticity Assessment
- The process of verifying that a document originates from its stated source and has not been altered.
- Representativeness
- The degree to which accessed documents reflect the broader universe of relevant records.
- Unobtrusiveness
- A condition where the researcher's presence does not influence data; a key advantage of documentary analysis.
- Survivorship Bias
- Systematic bias arising from analyzing only preserved records while missing those that were lost or destroyed.