Skip to main content
Browse by:
GROUP

Steal this data: Using R and Github for Open, Collaborative Research

Event Image
Wednesday, January 18, 2017
9:00 am - 10:00 am
Angela Zoss and Joel Herndon
Data Science Sampler: Duke Research Computing Symposium

GitHub is a website where you can share data and code publicly or within a small group of collaborators. Sharing data projects on GitHub can be a great back-up solution and can also help you build a portfolio of data projects. "Stealing" and improving projects that have been shared on GitHub is also a great opportunity to contribute to a larger effort or to learn new skills from more experienced coders and analysts. If you already use RStudio to work on data projects, you can easily use the built-in "projects" to connect straight to an existing GitHub repository.

In this workshop, we will use RStudio to connect to a GitHub repository, and then we will walk through basic data analysis and visualization tasks with packages from Hadley Wickham's tidyverse (e.g., dplyr, tidyr, ggplot2). You should only need to bring a laptop with a web browser for this workshop, but we recommend that you try out the RStudio Container available at the OIT Docker Containers site before you arrive. Email askdata@duke.edu if you have any trouble accessing RStudio from the link above.

Type: RESEARCH
Contact: Joel Herndon