Data Cleaning with Open Refine - Online

Overview

Got messy data? Open Refine is a powerful, free open-source software tool for cleaning and transforming data in a way that is easy to reproduce. If you have ever struggled to remember exactly how you modified your data in Excel, give Open Refine a try!

Learning Objectives

By the end of the class you should be able to:

  • Understand where OpenRefine lives on your computer
  • Use OpenRefine to:
    • Facet data
    • Cluster data
    • Split data into multiple columns
    • Undo changes
  • Export your cleaned data
  • Save your cleaning scripts so they can be re-used 

Prerequisites / Preparation

Please complete the following tasks before coming to class:

Download OpenRefine here: http://openrefine.org/download.html If you are having trouble with the download, you can refer to setup instructions here: https://datacarpentry.org/OpenRefine-ecology-lesson/setup.html or email me at ariel.deardorff@ucsf.edu

Download the 2 class data files from the course website

Instructors

Ariel Deardorff, Data Services Librarian, UCSF Library

This will be an online class via Zoom conferencing. The zoom link will be sent out a week in advance.

Tuesday, July 16 at 9:30am to 11:00am

Online

Event Type

Class/Info Session

Audience

Students, Postdocs, Faculty, Staff

Campus

Parnassus

Tags

Data Science Initiative, data management, data sharing, open refine

Website

https://calendars.library.ucsf.edu/ev...

Cost

Free

Department/Group
UCSF Library
Contact Info

ariel.deardorff@ucsf.edu

Subscribe

Event Registration Required

This event requires registration.