The Shape of Cocina Descriptive Metadata

The following sunburst diagrams attempt to visualize the shape of Cocina descriptive metadata for objects in the Stanford Digital Repository (SDR). Cocina is expressed as JSON. Each wedge (or node) in a diagram represents one segment of a JSON path used in Cocina objects and is sized by the number of property occurrences whose path passes through it.

The JSON path radiates outward from the center, and clicking a wedge zooms in on the deeper parts of its path. After zooming in, clicking the central wedge pops the view back out one level. The hover text shows the name of the JSON path property (useful when the wedge is tiny) and a count of the property occurrences that the path is part of (essentially the sum of all the leaf nodes it contains).

Each non-empty occurrence of a Cocina property value is counted, so if an object's descriptive metadata has 5 distinct subject values, they add 5 to the total count of subjects.
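The counting rule can be illustrated with a minimal Python sketch. It is not the code behind the descriptive shape report; the function names `leaf_paths` and `count_paths`, the choice to ignore list indexes, and the definition of "empty" as `None` or an empty string are all assumptions made for illustration.

```python
from collections import Counter


def leaf_paths(node, path=()):
    """Yield the JSON path (a tuple of keys) of every non-empty leaf value."""
    if isinstance(node, dict):
        for key, value in node.items():
            yield from leaf_paths(value, path + (key,))
    elif isinstance(node, list):
        # Assumption: list indexes are dropped, so repeated values share a path.
        for item in node:
            yield from leaf_paths(item, path)
    elif node is not None and node != "":
        # Assumption: "empty" means None or an empty string.
        yield path


def count_paths(description):
    """Return (leaf counts, per-wedge totals) for one description."""
    leaves = Counter(leaf_paths(description))
    totals = Counter()
    for path, n in leaves.items():
        # Every prefix of a leaf path is a wedge that contains that leaf.
        for depth in range(1, len(path) + 1):
            totals[path[:depth]] += n
    return leaves, totals


# Hypothetical description with 5 subject values, one empty title value.
description = {
    "title": [{"value": "Example"}, {"value": ""}],
    "subject": [{"value": v} for v in ["a", "b", "c", "d", "e"]],
}
leaves, totals = count_paths(description)
print(totals[("subject",)])        # total for the "subject" wedge
print(totals[("title", "value")])  # the empty title value is skipped
```

Under these assumptions, a wedge's total is the sum of the leaf counts beneath it, which is the figure the hover text describes.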

All Occurrences of All Properties for All SDR Cocina Objects

The data for the chart below was generated using the descriptive shape report, which produced this CSV file.

All Occurrences of All Properties for SDR Cocina Objects That Link to the ILS Catalog

This visualization only includes descriptions for objects that link to the ILS catalog, that is, objects with one or more catalogLinks in their Cocina identification metadata. The descriptive metadata for these objects was ostensibly derived from MARC. The data for the chart was generated using the descriptive shape report with the catalog: 'only' option, which resulted in this CSV file.

All Occurrences of All Properties for SDR Cocina Objects That Do NOT Link to the ILS Catalog

This visualization only includes descriptions for Cocina objects that do NOT contain catalogLinks in their Cocina identification metadata. The data was generated using the descriptive shape report with the catalog: 'none' option, which resulted in this CSV file.
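The split between the 'only' and 'none' views can be sketched as a simple filter on catalogLinks. This is a hypothetical illustration, not the report's implementation: the helper names, the `"all"` default, and the assumption that each object is a dict with an `identification` key holding a `catalogLinks` list are mine.

```python
def links_to_catalog(cocina_object):
    """True if the object has one or more catalogLinks in its identification."""
    identification = cocina_object.get("identification") or {}
    return bool(identification.get("catalogLinks"))


def select_objects(objects, catalog="all"):
    """Filter objects the way the 'only' and 'none' options are described.

    Assumption: "all" (no filtering) is used here as the default value.
    """
    if catalog == "only":
        return [o for o in objects if links_to_catalog(o)]
    if catalog == "none":
        return [o for o in objects if not links_to_catalog(o)]
    if catalog == "all":
        return list(objects)
    raise ValueError(f"unknown catalog option: {catalog!r}")


# Hypothetical objects: one linked to the catalog, two not.
objects = [
    {"identification": {"catalogLinks": [{"catalog": "example"}]}},
    {"identification": {"catalogLinks": []}},
    {"identification": {}},
]
print(len(select_objects(objects, "only")))
print(len(select_objects(objects, "none")))
```

In this sketch an empty catalogLinks list is treated the same as a missing one, so every object falls into exactly one of the two filtered views.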