There are two components to this service:
The Python 3 script, ead2dc.py, takes as input the Open Archives Initiative (OAI) output of an ArchivesSpace resource finding aid in Encoded Archival Description (EAD) format and writes an XML file. The XML output contains Dublin Core (DC) records for the digital resources found in the finding aid; only records that link to digital objects are included. The XML output is a ‘static repository’: it is inspired by the OAI Static Repository specification but does not adhere to it strictly. The static repository is the data source for the OAI Data Provider.
The OAI Data Provider adheres to the OAI standard and supports all the verbs (Identify, ListMetadataFormats, ListSets, ListIdentifiers, ListRecords, and GetRecord), resumption tokens, and sets. Only DC metadata is provided. Sets correspond to the archival collections in the Caltech Archives.
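As an illustration of the verbs listed above, the following sketch builds OAI-PMH request URLs with the standard library. The base URL and the set name are hypothetical placeholders, not the address or set names of the actual data provider; `oai_url` is a helper invented for this example.

```python
from urllib.parse import urlencode

# Hypothetical base URL; substitute the address where the data provider is deployed.
BASE_URL = "https://example.org/oai"


def oai_url(verb, **params):
    """Build an OAI-PMH request URL for the given verb and optional arguments."""
    query = {"verb": verb}
    # Skip arguments that were not supplied.
    query.update({k: v for k, v in params.items() if v is not None})
    return f"{BASE_URL}?{urlencode(query)}"


# Describe the repository.
identify = oai_url("Identify")

# Harvest Dublin Core records from one set (the set name is an assumption).
list_records = oai_url("ListRecords", metadataPrefix="oai_dc", set="ExampleSet")

# Continue an incomplete list using the token returned by the previous response.
next_page = oai_url("ListRecords", resumptionToken="abc 123")

print(identify)
print(list_records)
print(next_page)
```

Under the OAI-PMH protocol, a request that carries a `resumptionToken` includes no other arguments besides the verb, which is why the last call omits `metadataPrefix`.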
ead2dc - Main features and assumptions:
OAI Data Provider - Main features and assumptions:
The ead2dc.py file is designed to be run from the command line, or from within your favorite editing environment. It uses standard Python libraries and has been tested using Python 3.9.10 and 3.9.17.
The OAI Data Provider is a web application written in Python 3 using the Flask micro web framework. Installing Flask also installs its dependencies, such as Jinja2 and Werkzeug. No additional libraries are required.
Element | Encoded Archival Description | Dublin Core |
---|---|---|
Collection title | archdesc/did/unittitle | title |
Container titles | dsc/c??/did/unittitle | title |
Personal creators | dsc/c??/did/origination label="creator"/persname | creator |
Corporate creators | dsc/c??/did/origination label="creator"/corpname | creator |
Dates | dsc/c??/did/unitdate | date |
Extent | dsc/c??/did/physdesc/extent | extent |
Description | dsc/c??/did/abstract | description |
Subject, general | dsc/c??/controlaccess/subject | subject |
Subject, geographic | dsc/c??/controlaccess/geogname | subject |
Subject, person | dsc/c??/controlaccess/persname | subject |
Subject, corporate | dsc/c??/controlaccess/corpname | subject |
Subject, activity | dsc/c??/controlaccess/function | subject |
Identifier | dsc/c??/did/unitid | identifier |
Identifier, link | dsc/c??/did/daogrp/daoloc['xlink:href'] | identifier |
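The mapping in the table can be sketched in a few lines of Python using only the standard library. This is not the code in ead2dc.py; it is a minimal illustration under stated assumptions: the sample fragment is invented, it omits the EAD namespace that real ArchivesSpace output carries, and `MAPPING` and `component_to_dc` are hypothetical names.

```python
import xml.etree.ElementTree as ET

# ElementTree spells the xlink:href attribute with its full namespace URI.
XLINK_HREF = "{http://www.w3.org/1999/xlink}href"

# Invented, simplified component (no EAD namespace), for illustration only.
SAMPLE = """
<c01 xmlns:xlink="http://www.w3.org/1999/xlink">
  <did>
    <unittitle>Example item</unittitle>
    <unitid>EX-001</unitid>
    <unitdate>1975</unitdate>
    <origination label="creator"><persname>Doe, Jane</persname></origination>
    <daogrp><daoloc xlink:href="https://example.org/object/1"/></daogrp>
  </did>
  <controlaccess><subject>Aeronautics</subject></controlaccess>
</c01>
"""

# (path relative to the component, Dublin Core element), following the table.
MAPPING = [
    ("did/unittitle", "title"),
    ("did/origination[@label='creator']/persname", "creator"),
    ("did/origination[@label='creator']/corpname", "creator"),
    ("did/unitdate", "date"),
    ("did/physdesc/extent", "extent"),
    ("did/abstract", "description"),
    ("controlaccess/subject", "subject"),
    ("controlaccess/geogname", "subject"),
    ("controlaccess/persname", "subject"),
    ("controlaccess/corpname", "subject"),
    ("controlaccess/function", "subject"),
    ("did/unitid", "identifier"),
]


def component_to_dc(component):
    """Return (dc_element, value) pairs, or None if there is no digital object link."""
    links = [d.get(XLINK_HREF) for d in component.findall("did/daogrp/daoloc")]
    links = [link for link in links if link]
    if not links:
        return None  # only components that link to digital objects become records
    record = []
    for path, dc_name in MAPPING:
        for node in component.findall(path):
            text = "".join(node.itertext()).strip()
            if text:
                record.append((dc_name, text))
    record.extend(("identifier", link) for link in links)
    return record


record = component_to_dc(ET.fromstring(SAMPLE))
for dc_name, value in record:
    print(f"dc:{dc_name}: {value}")
```

Keeping the mapping as a data table rather than as branching code makes it easy to compare against the table above. Real input would also need the EAD namespace added to each path, and the collection title from archdesc/did/unittitle would be added separately, since it sits outside the component.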
Paul B. MacCready Papers ca. 1931-2002, Caltech Archives
Software produced by the Caltech Library is Copyright © 2023 California Institute of Technology. This software is freely distributed under a BSD-style license. Please see the LICENSE file for more information.
This work was funded by the California Institute of Technology Library.