Presentation Type

Presentation

Conference Track

User Experience/Understanding Users

Description

As librarians, we work with an ever-increasing amount of data and metadata. However, these data are often messy, disorganized, and seemingly disparate from other caches of data. Free tools such as OpenRefine allow us to clean, organize, and connect data sets to one another – all without knowing how to write complicated code. In this presentation, I will demonstrate the refining I have undertaken to create better and more usable data sets.

Learning Outcomes

Learn how to install OpenRefine

Learn how to use the tools in OpenRefine to standardize data:

  • Facets
  • Transformations
  • Clusters
  • Filters

Learn how to augment your data with data from web services

Learn how to use the Google Refine Expression Language

Rights

© Copyright the author(s)

IN COPYRIGHT:
http://rightsstatements.org/vocab/InC/1.0/
This Item is protected by copyright and/or related rights. You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s).

DISCLAIMER:
The purpose of this statement is to help the public understand how this Item may be used. When there is a (non-standard) License or contract that governs re-use of the associated Item, this statement only summarizes the effects of some of its terms. It is not a License, and should not be used to license your Work. To license your own Work, use a License offered at https://creativecommons.org/

Comments/Notes

Room: ML 170

Start Date

3-31-2017 11:15 AM

End Date

3-31-2017 12:00 PM

Persistent Identifier

http://archives.pdx.edu/ds/psu/19101

Subjects

Big data -- Management, Big data -- Data processing, OpenRefine -- Applications to library science

Share

COinS
 
Mar 31st, 11:15 AM Mar 31st, 12:00 PM

Using OpenRefine to Standardize and Augment Your Data

As librarians, we work with an ever-increasing amount of data and metadata. However, these data are often messy, disorganized, and seemingly disparate from other caches of data. Free tools such as OpenRefine allow us to clean, organize, and connect data sets to one another – all without knowing how to write complicated code. In this presentation, I will demonstrate the refining I have undertaken to create better and more usable data sets.