Data Governance

Controlled Vocabulary

A curated, standardized set of terms used to index, catalog, and retrieve information consistently.

Definition

A controlled vocabulary is a curated, standardized list of terms used to describe and organize information consistently and unambiguously. Unlike free-text tagging, where anyone can use any wording, a controlled vocabulary restricts indexing and categorization to approved terms with defined meanings. Controlled vocabularies range from simple flat lists (such as a list of approved product categories) to hierarchical thesauri (such as the Library of Congress Subject Headings) to full ontologies.

Why it matters in 2026

Controlled vocabularies are a foundation of AI-ready data governance. When AI systems search, classify, or analyze enterprise data, they depend on consistent terminology: a concept recorded under several different labels looks like several different concepts to a retrieval or classification step. Organizations that have invested in controlled vocabularies for their product catalogs, customer segments, and business processes are finding that their AI deployments are markedly more accurate than those relying on free-text fields. A semantic layer plays much the same role for business metrics and entities, fixing one name and one definition for each.

How it works

Controlled vocabularies are maintained in terminology management systems and enforced through data entry validation, auto-suggestion, and mapping of variant terms to approved ones. The types differ in how much structure they carry: the simplest controlled vocabularies are flat lists; taxonomies add hierarchical classification; thesauri add broader, narrower, and related term relationships; ontologies add logical axioms that support reasoning. SKOS (Simple Knowledge Organization System) is the W3C standard for publishing controlled vocabularies as Linked Data.
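
The three enforcement mechanisms named above (validation, auto-suggestion, and mapping) can be sketched in a few lines of Python. This is a minimal illustration, not how any particular terminology management system works: the `ControlledVocabulary` class, its method names, and the sample terms are all hypothetical, and suggestions use simple string similarity from the standard library.

```python
from dataclasses import dataclass, field
from difflib import get_close_matches


@dataclass
class ControlledVocabulary:
    """Hypothetical in-memory vocabulary: approved terms, variants, broader links."""

    preferred: set = field(default_factory=set)
    # lower-cased variant (non-preferred) label -> preferred term
    variants: dict = field(default_factory=dict)
    # narrower term -> broader term (thesaurus-style hierarchy)
    broader: dict = field(default_factory=dict)

    def add_term(self, term, variants=(), broader=None):
        self.preferred.add(term)
        for variant in variants:
            self.variants[variant.lower()] = term
        if broader is not None:
            self.broader[term] = broader

    def validate(self, value):
        """Data entry validation: accept only approved terms, exactly as listed."""
        return value in self.preferred

    def map_to_preferred(self, value):
        """Mapping: resolve a known variant to its preferred term, else None."""
        if value in self.preferred:
            return value
        return self.variants.get(value.lower())

    def suggest(self, value, n=3):
        """Auto-suggestion: closest approved terms by string similarity."""
        return get_close_matches(value, sorted(self.preferred), n=n, cutoff=0.6)

    def ancestors(self, term):
        """Follow broader links from a term up to the top of the hierarchy."""
        chain = []
        while term in self.broader:
            term = self.broader[term]
            chain.append(term)
        return chain


# Sample terms are illustrative only.
vocab = ControlledVocabulary()
vocab.add_term("Fasteners")
vocab.add_term("Bolts", broader="Fasteners")
vocab.add_term(
    "Stainless steel bolts",
    variants=["SS bolts", "stainless bolts"],
    broader="Bolts",
)
```

With this setup, `vocab.validate("bolts")` is rejected because only the exact approved form passes, `vocab.map_to_preferred("ss bolts")` resolves to the preferred term, and `vocab.ancestors("Stainless steel bolts")` walks the broader chain. A production system would likely store the same relationships in SKOS terms (`skos:prefLabel`, `skos:altLabel`, `skos:broader`) rather than in Python dictionaries.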

Real-world example

A global manufacturing company maintains a controlled vocabulary of 50,000 approved part names and descriptions. When engineers enter new parts into the system, they must select from the controlled vocabulary rather than typing free text. An AI agent searching for 'stainless steel fasteners M8' returns precise results because all relevant parts are indexed under the same controlled terms, rather than scattered across 'SS bolts M8,' 'stainless M8 screws,' and 'metric fasteners stainless.'
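
The retrieval difference in this example can be illustrated with a toy index. The sketch below is an assumption-laden simplification: the part IDs are invented, the three free-text labels are treated as describing the same kind of part, and lookup is exact-match rather than the ranking a real search or AI agent would use.

```python
from collections import defaultdict

CONTROLLED_TERM = "stainless steel fasteners M8"

# Hypothetical records: the label each engineer might have typed as free text.
free_text_entries = {
    "P-001": "SS bolts M8",
    "P-002": "stainless M8 screws",
    "P-003": "metric fasteners stainless",
}

# With a controlled vocabulary, every one of these parts carries the same term.
controlled_entries = {part_id: CONTROLLED_TERM for part_id in free_text_entries}


def build_index(entries):
    """Map each lower-cased label to the set of part IDs indexed under it."""
    index = defaultdict(set)
    for part_id, label in entries.items():
        index[label.lower()].add(part_id)
    return index


def search(index, query):
    """Exact-match lookup on the lower-cased query; returns sorted part IDs."""
    return sorted(index.get(query.lower(), set()))


free_hits = search(build_index(free_text_entries), CONTROLLED_TERM)
controlled_hits = search(build_index(controlled_entries), CONTROLLED_TERM)

print(free_hits)        # no part was labeled with the query wording
print(controlled_hits)  # all three parts share the controlled term
```

Fuzzy or embedding-based search would narrow the gap on the free-text side, but it would still have to guess that the three labels mean the same thing. The controlled term records that decision once, at data entry.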
