Library logo

Training on Dataset tools

Content Contributors: Robert Doiel, Tom Morrell

Lesson Maintainers: Robert Doiel, Tom Morrell

Lesson status: In Development

What you will learn:

Topics:

  1. Intro
  2. Basic Dataset
  3. Working with Larger Amounts of Data

Requirements

This lesson requires basic familiarity with the bash shell, similar to the experience gained through the Software Carpentry shell lesson. You’ll need to have a bash shell installed, you can follow these instructions.

Two tool collections developed at Caltech Library will also be used, datatools and dataset. From datatools we will be using a program called jsonmunge for extracting and re-formatting JSON content. datatools, a collection of tools for working with CSV, XLSX and JSON content, is available here. dataset, a data management tool, is available here.

References