Expanded Benchmarks
This documentation provides more specific information about how an organization might apply the benchmarks or determine the level of quality for records and collections of metadata.
Benchmark: criterion that must be met
Metric(s): mechanism or measurement to determine if a record/value meets the benchmark standard; these may depend on local guidelines and field usage
Examples and Notes: non-exhaustive list of additional clarifications and/or examples of values that may or may not meet a metric
In this model, the benchmark and metrics set the standard (i.e., the criteria that must be met to qualify for that quality level) and the examples show some ways that the standard might be applied for different local circumstances. Also note that minimal and ideal levels are clearly defined, while all intermediary benchmarking stages are left up to local organizations. The benchmarks in the “suggested” section describe suggested priorities for organizations setting “better-than-minimal” benchmarks for their metadata that fall between minimal and ideal.
General benchmarks usage:
Each criterion is intended to be “system agnostic” but some may not apply to every situation (e.g., local field requirements)
Criteria are binary – i.e., the set being evaluated must meet all points or it does not meet the benchmarking standard for that level
These benchmarks focus solely on the quality of metadata entry, not the quality of information (i.e., available information is all entered correctly, although we might wish that additional information is known about an item to improve the record)
This framework is intended to be scalable (it is written in the context of 1 record, but could apply across a collection, resource type, or an entire system)
Minimal Benchmarks
Minimal-level benchmarks are mostly objective (e.g., a value is present/not present) and should apply to all records
Benchmark |
Metrics |
Examples |
---|---|---|
The record is specific/scoped correctly |
|
Values that may be correct:
|
|
Possible issues to review:
|
|
Every record has a title |
|
Possible issues to review:
|
Value content matches the field type |
|
Values that may be correct:
|
No values exceed applicable system character limits |
|
Possible issues to review:
|
There is no text encoding that “breaks” records |
|
Note: This is dependent on local system limitations |
Suggested Benchmarks
This category includes suggestions about what ought to be prioritized in local benchmarks to make records “better than minimal,” based on research and professional experience (noted in the justification column). These are mostly objective, but also include some subjective elements; suggested benchmarks are intended to be adjusted as needed and applied when applicable according to local requirements.
Benchmark |
Justification |
Metrics |
Examples |
---|---|---|---|
The record describes the item that it is attached to (i.e., there is not a mismatch between an item and a record describing a different item) |
This is a relatively fundamental need for metadata quality, but generally cannot be verified without manual review of every record (i.e., not scalable for large collections). |
|
Note:
|
All locally-required fields have values |
By definition, required fields should have values, but which fields are required (or available for usage) varies too much among schemas to be stated in a standardized way. |
|
Possible issues to remediate:
|
All conditionally-required fields have values |
By definition, required fields should have values, but which fields are required (or available for usage) varies too much among schemas to be stated in a standardized way. |
|
Possible issues to remediate:
|
|
Possible issues to remediate:
|
||
Fields that require multiple parts or qualifiers have all parts |
This is an extension of required field values, however, not all assessment considers values and other parts (like qualifiers) in tandem and this also incorporates non-required fields when they are in usage. |
|
Possible issues to remediate:
|
Records have some type of subject value |
Subject-based fields contain some of the only values that can help collocate related items across collections (including aggregations, like DPLA, Europeana, etc.) more broadly by topic. This also makes subject-based values a good candidate for review and normalization, if needed. |
|
Values that may be correct:
|
A rights statement is present |
Inclusion of rights information is broadly encouraged within the digital library community. Standardized rights statements were developed through international efforts. |
|
Values that may be correct:
|
Stray character encoding has been removed |
This is a problem that tends to be relatively easy to find programmatically and, depending on string matching, can make a significant difference when terms are normalized. |
|
Possible issues to remediate:
|
All “placeholder” values have been replaced/removed and are not present in the publicly accessible record |
Placeholders can be an indicator of information that is missing, or records that need review and may be easy to find programmatically if placeholders are applied consistently in local records. |
|
Possible issues to remediate:
|
Extremely problematic/offensive terms have been removed or handled appropriately |
Although comprehensive review and revision of records likely falls in the “ideal” category, it may be useful to think about that process iteratively and set first-level local priorities to address some problems more immediately. |
|
Note:
|
Ideal Benchmarks
Ideal-level benchmarks are intended to describe a “perfect” metadata record, i.e., if all available information about a specific item has been entered correctly, according to local standards. Many of these benchmarks are more subjective or item-dependent and not every benchmark will apply depending on system requirements. All applicable benchmarks must be met for a record to be “ideal.”
Benchmark |
Metrics |
Examples |
---|---|---|
All metadata values align with expectations for the material type |
|
Values that may be correct:
|
When applicable, relationships between items and parent collections are clearly represented |
|
Values that may be correct:
|
Relevant recommended/optional fields have values |
|
Note:
|
All relevant information about the item is included |
|
|
Non-required qualifiers or field parts are added to provide enhanced information or functionality |
|
|
“Null” values are used consistently, according to local guidelines |
|
Values that may be correct:
|
Fields/subfields that cannot be repeated occur only once |
|
Possible issues to remediate:
|
All values are appropriate lengths for their fields |
|
Possible issues to remediate:
Note:
|
All values that ought to align with standards conform to applicable vocabularies or rules |
|
Values that may be correct
|
All values are spelled correctly |
|
Notes:
|
Text fields use appropriate punctuation, grammar, abbreviations, etc. |
|
Values that may be correct:
|
Reading level & language use is appropriate for all (relevant) communities or audiences |
|
Values that may be correct:
|
Vocabulary usage aligns with the needs of the audience and material type |
|
Values that may be correct:
|
Values connected to interface functionality work |
|
Values that may be correct:
|
Record language has been evaluated/updated to align with best practices related to reparative metadata, inclusive language, etc. |
|
Note:
|