ScholarGate
Assistant

Metadata and Controlled Vocabularies

Metadata is data about data: the structured descriptions that let digital objects be found, understood, managed, and connected. Controlled vocabularies and authority files give those descriptions a shared, disambiguated language so that resources across collections can be linked.

Find Topic with PaperMindSoonFind papers & topics
Tools & resources
Download slides
Learn & explore
VideoSoon

Definition

Structured descriptions of digital resources, together with the standardized vocabularies and authority systems used to make those descriptions consistent, interoperable, and linkable across collections.

Scope

Covers descriptive, structural, and administrative metadata in the humanities; standards such as Dublin Core; authority files and controlled vocabularies; and the Semantic Web and linked open data as a way of connecting cultural resources. Includes how metadata supports discovery, interoperability, and long-term stewardship.

Core questions

  • What kinds of metadata does a cultural resource need to be found and understood?
  • How do controlled vocabularies reduce ambiguity and enable linking?
  • How do standards such as Dublin Core support interoperability?
  • What does it mean to publish humanities data as linked open data?

Key concepts

  • Descriptive metadata
  • Dublin Core
  • Authority file
  • Controlled vocabulary
  • Linked open data
  • Persistent identifier

Key theories

Types and functions of metadata
Metadata is conventionally divided into descriptive, structural, and administrative categories, each serving discovery, navigation, or management of digital objects.
Interoperability through standards
Shared schemas such as Dublin Core provide a common minimal vocabulary so that resources from different systems can be aggregated and searched together.
Linked data and the Semantic Web
Expressing metadata as machine-readable statements with stable identifiers lets cultural resources be connected into a web of data rather than isolated records.

History

Library cataloguing traditions established controlled vocabularies and authority control long before computing. The Dublin Core initiative of the mid-1990s provided a simple cross-domain metadata standard; the 2001 articulation of the Semantic Web and subsequent linked-data practice extended these ideas toward interconnected cultural heritage data.

Debates

Simplicity versus expressiveness
Minimal standards like Dublin Core maximize interoperability but may be too coarse for rich scholarly description, while detailed schemas risk fragmenting the shared vocabulary.

Key figures

  • Anne J. Gilliland
  • Murtha Baca
  • Tim Berners-Lee

Related topics

Seminal works

  • gilliland2016
  • weibel1998
  • berners2001

Frequently asked questions

Why use a controlled vocabulary instead of free-text tags?
Free-text terms vary in spelling, language, and specificity, which scatters related items. A controlled vocabulary or authority file fixes preferred terms and links variants, so searches retrieve everything about a concept or person regardless of how it was originally phrased.

Methods for this concept

Related concepts