Requirements: Data Carpentry's teaching is hands-on, so participants are encouraged to bring in and use their own laptops to insure the proper setup of tools for an efficient workflow once you leave the workshop. (We will provide instructions on setting up the required software several days in advance) There are no pre-requisites, and we will assume no prior knowledge about the tools.
Contact: Please email library@caltech.edu for questions and information not covered here.To participate in a Data Carpentry workshop, you will need working copies of the software described below. Please make sure to install everything and try opening it to make sure it works before the start of your workshop. If you run into any problems, please feel free to email the instructor or arrive early to your workshop on the first day. Participants should bring and use their own laptops to insure the proper setup of tools for an efficient workflow once you leave the workshop.
We maintain a list of common issues that occur during installation as a reference for instructors that may be useful on the Configuration Problems and Solutions wiki page.
For this workshop you will need a spreadsheet program. Many people already have Microsoft Excel installed, and if you do, you're set! If you need a spreadsheet program, there are a few other options, like OpenOffice and LibreOffice. Install instructions for LibreOffice, which is free and open source, are here.
OpenRefine (previously Google Refine) is a tool for data cleaning that runs through a web browser, and any browser - Safari, Firefox, Chrome, Explorer - should work fine. You will need to download OpenRefine and install it, and when you open it, it will run through the browser, but you don't need an internet connection, and the data will all be stored on your computer.
Git is a version control system that lets you track who made changes to what when and has options for easily updating a shared or public version of your code on github.com. You will need a supported web browser (current versions of Chrome, Firefox or Safari, or Internet Explorer version 9 or above).
You will need an account at github.com for parts of the Git lesson. Basic GitHub accounts are free. We encourage you to create a GitHub account if you don't have one already. Please consider what personal information you'd like to reveal. For example, you may want to review these instructions for keeping your email address private provided at GitHub.
We're going to be using GitHub Desktop to work with Git in this workshop. Download GitHub Desktop
We're going to be using GitHub Desktop to work with Git in this workshop. Download GitHub Desktop
We're going to be using GitHub Desktop to work with Git in this workshop. Unfortunatly Linux support for for GitHub Desktop desktop is still in development. This unofficial release seems to be working for most people Download GitHub Desktop
In the workshop, we will use RStudio. RStudio is a nice interface to the programming language R. To use RStudio, you need to install both R and RStudio.
sudo apt-get install
r-base
, and for Fedora run sudo yum install R
) but make sure that you have at least R 3.2.2
(as pre-packaged versions might be out of date).
sudo dpkg -i rstudio-x.yy.zzz-amd64.deb
at the terminal).