Knowledge Base

Open Archives Initiative Protocol for Metadata Harvesting

The Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) is a low-barrier mechanism for repository interoperability. OAI-PMH is a set of six verbs or services that are invoked within HTTP. It is also possible to gather metadata of all public datasets on PURR through this protocol. On PURR, the metadata are available in Dublin Core.

Metadata Harvesting Guide

Start with the base URL: purr.purdue.edu/oaipmh

Add a verb:

verb=GetRecord&identifier=(a unique record ID)
This verb is used to retrieve an individual metadata record from a repository. This verb can be used only in combination with the argument metadataPrefix=oai_dc

Example: https://purr.purdue.edu/oaipmh/?verb=GetRecord&identifier=http://dx.doi.org/10.4231/D3DR2P83H&metadataPrefix=oai_dc

verb=ListSets
This verb is used to retrieve the set structure of a repository.

verb=Identify
This verb is used to retrieve information about a repository.

verb=ListMetadataFormats
Specify your metadata format in case there are records in multiple formats

verb=ListRecords
This verb is used to harvest records from a repository

verb=ListIdentifiers
This verb is an abbreviated form of ListRecords, retrieving only headers rather than records.

Example:
https://purr.purdue.edu/oaipmh?verb=ListRecords&metadataPrefix=oai_dc

If using ListIdentifiers or ListRecords, you may specify a date range:

&from=YYYY-MM-DD
&until=YYYY-MM-DD

To view the next page of results, find the <resumptionToken>XXXXX</resumptionToken> at the bottom of the XML and place it at the end of the query string:
&resumptionToken=XXXXX

0 Dislike

Last updated

The Purdue University Research Repository (PURR) is a university core research facility provided by the Purdue University Libraries, the Office of the Executive Vice President for Research and Partnerships, and Information Technology at Purdue (ITaP).