Distant Reading and Computational Comparison
If the world literary archive is too vast for any reader to close-read, comparison may have to proceed at a distance — through quantitative analysis of large corpora, the mapping of genres and forms, and computational models of literary change. Distant reading reframes comparison as a problem of scale.
Definition
An approach to literary comparison that substitutes quantitative, large-scale analysis of many texts for the close reading of a few, treating units such as genres, devices, and statistical trends as the objects of comparative knowledge.
Scope
Covers the theory and practice of analyzing literature at scales beyond the individual text: Moretti's program of distant reading and his use of graphs, maps, and trees; Jockers's macroanalysis; and Underwood's modeling of long-run literary change. Includes debates over what quantitative evidence can and cannot show about literary history and over the relation between distant and close reading.
Core questions
- What can be learned by analyzing thousands of texts that cannot be learned by reading a handful closely?
- Are abstractions like graphs, maps, and trees legitimate objects of literary knowledge?
- How do computational models of literary change relate to traditional period and genre concepts?
- What is lost when interpretation is delegated to corpora and algorithms, and can distant and close reading be combined?
Key theories
- Distant reading
- Moretti proposed that understanding the world literary system requires stepping back from individual texts to study large patterns through abstractions, since the canon read closely represents a vanishingly small fraction of what was written.
- Graphs, maps, trees
- Moretti imported quantitative, geographic, and evolutionary models from other disciplines to visualize literary history, treating genres as the units of an abstract literary morphology.
- Macroanalysis
- Jockers argued that digital methods let literary history move from the close study of exemplary texts to the statistical analysis of entire literary corpora, revealing macro-scale influence and stylistic patterns.
- Modeling literary change
- Underwood used predictive modeling of large text collections to argue that some categories such as genre and prestige change gradually and continuously rather than in the sharp breaks emphasized by conventional literary history.
History
Moretti's 2000 essay 'Conjectures on World Literature' introduced distant reading as a response to the scale of the world literary archive; his 2005 Graphs, Maps, Trees and the 2013 collection Distant Reading developed the program. Jockers's 2013 Macroanalysis and Underwood's 2019 Distant Horizons consolidated computational literary history within the wider digital humanities, while sustained criticism questioned the interpretive value and political assumptions of large-scale quantification.
Debates
- Quantitative evidence versus interpretive reading
- Whether statistical analysis of corpora yields genuine literary knowledge or strips away the attention to language and meaning that defines literary study.
- Continuity versus rupture in literary change
- Computational modeling has suggested gradual, continuous change in categories such as genre, complicating literary history's traditional emphasis on movements and decisive breaks.
Key figures
- Franco Moretti
- Matthew L. Jockers
- Ted Underwood
Related topics
Seminal works
- moretti2013
- jockers2013
- underwood2019
- moretti2000
Frequently asked questions
- Does distant reading replace close reading?
- Most practitioners present it as a complement rather than a replacement: distant reading reveals large-scale patterns across many texts, while close reading remains indispensable for interpreting how individual works make meaning.