Transforming MARC and Metadata into RDF-based Applications

Tuesday, February 13, 2018 - 9:30 to 12:30

Jeremy Nelson
Metadata & Systems Librarian, Colorado College
Partner, KnowledgeLinks

Mike Stabile
Partner & CEO, KnowledgeLinks


Agenda

Tuesday, February 13, 2018

 9:30 - Welcome and Overview
10:00 - Computer setup
10:20 - Break
10:30 - BIBCAT Introduction
11:20 - Break
11:30 - Next Steps, Questions and Comments
12:30 - End Session

Resources


Overview


RDF Applications Technologies

Data Formats

Datastores

Query Languages

Python Modules

Open-Source Enterprise Tools

Source Control, Packaging, and Deployment


Our Approach

Data Sources

MARC 21 and MARC XML records are exported from an ILS

MODS XML, OAI-PMH Dublin Core, Fedora Commons XML, are harvested from OAI-PMH Ingesters typically from external repositories like Islandora, Omeka, or Content DM digital repositories.

Excel and Comma-Separate-Value, in a couple of instances, the data source are spreadsheets that either start as Microsoft Excel that are converted to CSV, or the original source is CSV.

JSON feeds, for some projects, a JSON feed of Dublin Core metadata is the record source

SPARQL endpoints

Application Programming Interfaces API

BIBCAT Ingesters

  1. Data harvested over OAI-PMH endpoints, HTTP connections, or provided data files
  2. Usually executed from the command-line and can be long-running processes

BIBCAT RML Processors

BIBCAT uses RDF Mapping Language to preform the following processing of incoming data to the final RDF Linked Data output, stored either in triplestore or on disk

BIBCAT uses the Rdfframework for configuration and data management

Access & Discovery

Using RML Processors, BIBCAT is easily able to generate multiple formats and vocabularies outputs depending on the project.


BIBCAT Introduction


https://bibcat.org/introduction/

We will now introduce the bibcat catalog pull platform for building RDF-based bibliographic applications. We'll go through a quick-start to get your Python environment set-up to use bibcat, followed by some useful helper functions bibcat provides for building RDF applications. We'll then go through and convert MARC records to Library of Congress BIBFRAME RDF and then demonstrate the use of a RML map to produce "lean" BIBFRAME RDF. From this lean BIBFRAME RDF, we'll generate schema.org JSON-LD. Finally, we will look at some of the other input data format/vocabulary workflows to produce BIBFRAME RDF.


Next Steps with the RDF Framework


RDF Applications

Building semantic solutions for knowledge organizations


RdfFramework for Managing Linked Data


Thanks!

Questions, & Comments

knowledgelinks.io@gmail.com

+1-719-301-7017

http://knowledgelinks.io/