Skip to Main Content

University Library

LibGuides

Metadata Services: Best Practices

Acquisitions & Cataloging Services

Technical Metadata

Technical metadata is used for recording the technical attributes of digital objects that include their production or creation information on the digital capture process, for example, the hardware and software used to acquire the digital object, file formats for master and derivatives, resolutions, color profiles and so on, that enable the reproduction of digital resources in the future. For this reason technical metadata is categorized as administrative and preservation metadata.

Technical metadata can be included in preservation metadata (PREMIS) or structural metadata (METS). The decision regarding where to add the technical metadata can be made based on the types of technical metadata that should be preserved, since METS and PREMIS have different ways of incorporating technical metadata in its schema, as mentioned in Preservation Metadata and Structural Metadata Sections.

Minimum Requirements

Requirements for technical metadata will differ for various media formats.  Regardless of media formats, all technical metadata should include the following information.

  • File Format: Recommended best practice is to select a value from a controlled vocabulary, such as the list of Internet Media Types that defines computer media formats (also known as MIME types).
  • File Size: For most formats, the recommended best practice is to record file size as bytes (e.g., 3,000,000 bytes) and not as kilobytes (KB), megabytes (MB), etc., because it is the most specific measurement of file size. It is also best practice to include duration time for multi-media. For example: 4,200,000 bytes; 5 minutes, 34 seconds.

Object Types and Related Information

Textual Documents

TextMD that is an XML Schema that details technical metadata for text-based digital objects. It can be added into both METS (as an extension) and PREMIS. It can be also used as a standalone document.

When using the textMD schema, the technical metadata for text-based digital objects may include:

  • encoding information (quality, platform, software, agent)
  • character information (character set and size, byte order and size, line terminators)
  • languages
  • fonts
  • markup information
  • processing and textual notes
  • technical requirements for printing and viewing
  • page ordering and sequencing

Still Images

For digital still images, it is recommended to use the NISO Data Dictionary - Technical Metadata for Digital Still Images.

This document includes metadata elements important to the management of image files that could be captured automatically from scanner or digital camera software. The elements are based on MIX.

Technical metadata for digital still images may include:

  • file format
  • file resolution (pixels per inch)
  • dimensions (image dimension or size in inches or centimeters)
  • bit-depth (e.g., 8-bit, 16-bit, 24-bit, etc.)
  • color mode (e.g., RGB, CMYK, or grayscale)
  • scanner or digital camera brand, name, and model number
  • software used to manipulate or compress the image, including the software name and version

Digital Audio Files

For digital audio files (sound only) the technical metadata for the image quality and the image capture process may include:

  • file format, including whether the format is lossless or lossy
  • sample rate
  • resolution
  • number of channels
  • software used to manipulate or compress the audio file, including the software name and version
  • brand, name, and model number of the recording equipment used

Digital Multi-Media Files

For digital multi-media files (including video, sound, and animation) the technical metadata for the image quality and capture process may include:

  • file format
  • bitrate
  • software used to manipulate or compress the multimedia file, including the software name and version
  • brand, name, and model number of the equipment used

Tools that Capture Technical Metadata

Currently most technical metadata can be captured automatically by digital camera or scanner. Following are the most commonly used tools and standards used for extracting from or embedding metadata to the digital objects.

  • JHOVE

JHOVE provides functions to perform format-specific identification, validation, and characterization of digital objects. JHOVE generates technical metadata about files, usually by extracting information from the files themselves.

  • XMP

Adobe's Extensible Metadata Platform (XMP) is a labeling technology that allows users to embed metadata into the file itself.  XMP is used for both descriptive (based on Dublin Core) and administrative metadata.

  • Exif

Exchangeable image file format (Exif) is a specification for the image file format used by digital cameras. Exif data is embedded within the image file itself and can be editable. The metadata tags defined in the Exif standard include date and time information, camera settings such as the camera model and make, and information that varies with each image such as orientation (rotation), aperture, shutter speed, focal length, metering mode, and ISO speed information. A thumbnail for previewing the picture on the camera's LCD screen, in file managers, or in photo manipulation software, and descriptions and copyright information can be also captured or added later.