HTML to MARC: Webscraping Using Googlesheets
- Conference Year
- Online Northwest 2019
- Session
- 2
- Type
- Breakout Session
- Presenters
- Brianne N. Hagen, Humboldt State University
- Abstract
When library information is messy and not easily indexed by discovery layers, it can be problematic to bring the data into a format for easy search and retrieval within our own information ecosystems. Thankfully, there are solutions available to help us get our data into a useful format that can be easily searched in our library catalogs with minimal effort.
With finding aids in html webpage for our example, we will use webscraping tools built into googlesheets to harvest the data. Using minimal coding skills, we will be able to create a CSV file and convert that file into MARC records in batch, ready to be used by any library system.
- Learning Outcomes
- Learn about available tools for webscraping
- Apply some easy tricks to harvest html data
- Links