Home Science Computer Science Data Science
Data Science: Productivity Tools

Data Science: Productivity Tools

Data science projects involve keeping track of many data files and analysis scripts. Learn GitHub, git, Unix/Linux and RStudio to keep your projects organized and produce reproducible reports.
Video Beginner
Gallery
Description
A typical data analysis project may involve several parts, each including several data files and different scripts with code. Keeping all this organized can be challenging.

Part of our Professional Certificate Program in Data Science, this course explains how to use Unix/Linux as a tool for managing files and directories on your computer and how to keep the file system organized. You will be introduced to the version control systems git, a powerful tool for keeping track of changes in your scripts and reports. We also introduce you to GitHub and demonstrate how you can use this service to keep your work in a repository that facilitates collaborations.

Finally, you will learn to write reports in R markdown which permits you to incorporate text and code into a document. We'll put it all together using the powerful integrated desktop environment RStudio.

In this course, you will learn what to download and how. We recommend an up-to-date browser.

Pricing:
Free
Level:
Beginner
Duration:
4 weeks, 2h-4h/week
Educator:
Rafael Irizarry
Organization:
Harvard University
Submitted by:
Coursearena
Reviews
Would you recomment this course to a friend?
Discussion
There are no comments yet. Please sign in to start the discussion.