description-indexer

A tool for working with archival description for public access.


License
MIT
Install
pip install description-indexer==0.0.4

Documentation

description_indexer

A tool for working with archival description for public access. description_indexer reads archival description into a minimalist data model for public-facing archival description.

description_indexer can index directly from the ArchivesSpace API to an Arclight Solr instance using ArchivesSnake and PySolr.

This is still a bit drafty, as its only tested on ASpace v2.8.0 and needs better error handling.

Installation

pip install description_indexer

First, you need to configure ArchivesSnake by creating a ~/.archivessnake.ymlfile with your API credentials as detailed by the ArchivesSnake configuration docs.

Next, you also need a ~/.description_indexer.yml file that lists your Solr URL and the core you want to index to. These can also be overridden with args.

solr_url: http://127.0.0.1:8983/solr
solr_core: blacklight-core
last_query: 0

Indexing from ArchivesSpace API to Arclight

Once description_indexer is set up, you can index from the ASpace API to Arclight using the to-arclight command.

Index by id_0

You can provide one or more IDs to index using a resource's id_0` field

to-arclight --id ua807

to-arclight --id mss123 apap106

Index by URI

You can also use integers from ASpace URIs for resource, such as 263 for https://my.aspace.edu/resources/263

to-arclight --uri 435

to-arclight --uri 1 755

Indexing by modified time

Index collections modified in the past hour: to-arclight --hour

Index collections modified in the past day: to-arclight --today

Index collections modified since las run: to-arclight --new

Deleting collections

You can delete one or more collections using the --delete argument in addition to--id. This uses the Solr document ID, such as apap106 for https://my.arclight.edu/catalog/apap106.

to-arclight --id apap101 apap301 --delete