VFJ Indexing & Word Services

View Original

Database Indexing: Key Principles

Database (aka Open System) Indexing

From the Encyclopaedia Britannica:

Database (computer science): also called electronic database, any collection of data, or information, that is specially organized for rapid search and retrieval by a computer. Databases are structured to facilitate the storage, retrieval, modification, and deletion of data in conjunction with various data-processing operations. A database management system (DBMS) extracts information from the database in response to queries.

Unlike back-of-book indexes which are generally stand-alone works with static, closed vocabularies reflecting the jargon used in the book, database indexes typically link many different kinds of publications on a variety of topics, by different authors, and the number of included resources are typically expanding over time.

For example, the platform of ProQuest provides many databases that encompass just about every academic field imaginable. New journal articles, dissertations and theses, and documents in general are constantly being added to those databases.

What makes all that content searchable is the indexing.

The Importance of Being Consistent

The most important thing for database indexing is consistency through time and across documents, so a user searching for a specific concept within a database will retrieve all the relevant results.

A good thesaurus is needed to guide the indexer in maintaining consistency over time and across subject fields in using the most accurate and appropriate term for a concept. This helps users know what to expect when searching and also to find what they’re looking for more efficiently.

Principles & Practices of Database Indexing

ANSI/NISO Z39.4-2021 Criteria for Indexes

“This standard provides guidelines for the content, organization, and presentation of indexes used for the retrieval of documents and parts of documents. It deals with the principles of indexing regardless of the type of material indexed, the indexing method used, the medium of the index, or the method of presentation for searching. It emphasizes three processes essential for all indexes: comprehensive design, vocabulary management, and syntax.”

From page 11, the Summary of Key Considerations section states, “The key consideration for databases and other continuing indexes is continuity in indexing practices, policies, and terminology.” [Emphasis added]

The Value of Controlled Vocabularies

They allow for:

  • consistency across time;

  • consistency among document sets that use very different vocabulary or have very different treatment of similar topics.

    • One example of the need for consistency among document sets with different vocabularies, is this database.

More on Thesauruses & Thesaurus Building

More on Database Indexing