Handwritten Text Recognition for Archives
Handwritten text recognition (HTR) for archives converts digital images of manuscript pages into searchable, machine-readable text, opening up the vast holdings of handwritten material that optical character recognition (OCR), designed for print, cannot read reliably. Modern HTR, exemplified by platforms such as Transkribus, developed in the READ project, uses deep neural networks trained on transcribed examples to recognize the highly variable scripts of letters, registers, charters, and notebooks across centuries and languages. The pipeline first analyzes page layout and segments the image into text regions and lines. A recurrent or transformer-based recognizer then decodes each line image into a character sequence, typically trained with connectionist temporal classification (CTC), which aligns the recognizer's frame-by-frame outputs with the transcription without requiring character-level segmentation. Crucially, recognition models are trained and improved on ground-truth transcriptions supplied by scholars, so accuracy generally rises as more material is annotated. By making manuscripts machine-readable at scale, HTR serves as the gateway technology of digital archival history, feeding full-text search, named-entity recognition, and large-corpus text mining of sources that were previously legible only page by page.
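To make the decoding step concrete, the following is a minimal sketch of greedy (best-path) CTC decoding for a single text line. It is an illustration only, not the implementation used by Transkribus or any other platform: the names `ctc_greedy_decode`, `ALPHABET`, `FRAMES`, and the `_` blank symbol are hypothetical, and the per-frame scores are made-up values standing in for a recognizer's output. Production systems often use beam search, sometimes with a language model, instead of this simple rule.

```python
from itertools import groupby

# Hypothetical toy alphabet; index 0 is the CTC "blank" class.
BLANK = "_"
ALPHABET = [BLANK, "a", "n", "o"]

# Made-up per-frame scores (one row per horizontal step across the line
# image, one column per symbol in ALPHABET). A real recognizer would
# produce these from the line image.
FRAMES = [
    [0.10, 0.80, 0.05, 0.05],  # a
    [0.20, 0.70, 0.05, 0.05],  # a (repeat, merged below)
    [0.90, 0.05, 0.03, 0.02],  # blank
    [0.10, 0.10, 0.70, 0.10],  # n
    [0.10, 0.10, 0.70, 0.10],  # n (repeat, merged below)
    [0.80, 0.05, 0.10, 0.05],  # blank separates the two n's
    [0.10, 0.10, 0.60, 0.20],  # n
    [0.10, 0.10, 0.10, 0.70],  # o
]


def ctc_greedy_decode(frame_scores, alphabet, blank=BLANK):
    """Best-path CTC decoding: argmax per frame, merge repeats, drop blanks."""
    # 1. Pick the highest-scoring symbol at every frame.
    best_path = [
        alphabet[max(range(len(row)), key=row.__getitem__)]
        for row in frame_scores
    ]
    # 2. Merge runs of the same symbol into one occurrence.
    collapsed = [symbol for symbol, _ in groupby(best_path)]
    # 3. Remove blanks, which only mark boundaries between characters.
    return "".join(symbol for symbol in collapsed if symbol != blank)


decoded = ctc_greedy_decode(FRAMES, ALPHABET)
print(decoded)
```

The blank class is what lets the recognizer emit a doubled letter: the two runs of `n` survive as separate characters only because a blank frame sits between them, and no frame ever has to be assigned to an individually segmented character.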
Sources
- Muehlberger, G., Seaward, L., Terras, M., et al. (2019). Transforming scholarship in the archives through handwritten text recognition: Transkribus as a case study. Journal of Documentation, 75(5), 954-976. DOI: 10.1108/JD-07-2018-0114
- Moretti, F. (2013). Distant Reading. Verso. ISBN: 9781781680841
How to cite this page
ScholarGate. (2026, June 23). Handwritten Text Recognition for Archival Manuscripts. ScholarGate. https://scholargate.app/ko/digital-history/handwritten-text-recognition-archives
Related methods
- Historical Corpus Text Mining (Digital History)
- Historical GIS (Historical Geography)
- Historical Named-Entity Recognition (Digital History)