Online Northwest

A conference on libraries, technology, and culture


HTML to MARC: Webscraping Using Googlesheets

Conference Year
Online Northwest 2019
Session
2
Type
Breakout Session
Presenters
  • Brianne N. Hagen, Humboldt State University
Abstract

When library information is messy and not easily indexed by discovery layers, it can be difficult to bring the data into a format that supports search and retrieval within our own information ecosystems. Fortunately, there are tools that can reshape this data, with minimal effort, into a format that is easily searched in our library catalogs.

Using finding aids published as HTML web pages as our example, we will harvest the data with the web scraping tools built into Google Sheets. With minimal coding skills, we will create a CSV file and convert it in batch into MARC records, ready to be used by any library system.
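
As a rough illustration of the final step, the sketch below turns rows of a CSV export into text records in a MarcEdit-style mnemonic layout. It is not the presenter's method: the column names (`title`, `creator`, `url`), the sample rows, the field mapping, and the placeholder leader are all assumptions made for this example, and a real workflow would adjust them to the finding aids at hand.

```python
import csv
import io

# Hypothetical CSV exported from the spreadsheet; column names and rows are assumptions.
SAMPLE_CSV = """title,creator,url
Example Family Papers,"Example, Jane",https://example.org/findingaids/1
Sample Lumber Company Records,,https://example.org/findingaids/2
"""

# Illustrative mapping from CSV column to (MARC tag, indicators, subfield code).
# A backslash stands for a blank indicator in this mnemonic text layout.
FIELD_MAP = {
    "creator": ("100", "1\\", "a"),
    "title": ("245", "10", "a"),
    "url": ("856", "40", "u"),
}

# Placeholder leader; real values depend on the material being described.
PLACEHOLDER_LEADER = "=LDR  00000nam a2200000 a 4500"


def row_to_mnemonic(row):
    """Render one CSV row as a block of mnemonic MARC lines."""
    lines = [PLACEHOLDER_LEADER]
    for column, (tag, indicators, code) in FIELD_MAP.items():
        value = (row.get(column) or "").strip()
        if value:  # skip empty cells rather than emit empty fields
            lines.append(f"={tag}  {indicators}${code}{value}")
    return "\n".join(lines)


def csv_to_mnemonic(csv_text):
    """Convert every row of the CSV text into a list of record strings."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [row_to_mnemonic(row) for row in reader]


records = csv_to_mnemonic(SAMPLE_CSV)
print("\n\n".join(records))
```

Keeping the column-to-field mapping in one dictionary means a different spreadsheet layout only requires editing `FIELD_MAP`. The resulting text would still need to be compiled into binary MARC by a separate tool before loading into a library system.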

Learning Outcomes
  • Learn about available tools for web scraping
  • Apply some easy tricks to harvest HTML data
Links
  • HTML to MARC (PDF)
Orbis Cascade Alliance

Phone: (541) 246-2470
Email: info@orbiscascade.org


©2025 Orbis Cascade Alliance