Course Description
It has been said that 80% of data analysis is spent on the process of cleaning and preparing the data. In this course we introduce some relatively-new additions to the R programming language; dplyr and ggplot2. In combination these provide a powerful toolkit to make the process of manipulating and visualising data easy and intuitive.
Course Prequisites
We will assume that you have basic familarity with R and are familiar with vectors, data frames, variables and using functions.
We have a crash course that covers these topics
Several Online videos are available that cover these materials. For example
Topics Covered
How R enables reproducible research
What constitues a tidy dataset
"Piping" commands together to form a workflow
Subseting and filtering datasets using dplyr
Producing summary statistics from a dataset
Joining datasets using dplyr
The grammar of graphics approach to plotting used in ggplot2
Producing publication-ready graphics using ggplot2
Course Data
Please click on this link to download all the files required to run the examples in the course:- CLICK HERE
Software
You will need an updated version of R and RStudio Desktop.
Please check this page to make sure you have the correct versions of the software
Schedule
Course Introduction
Introduction to dplyr
Writing analysis workflows in R
Summarising and Combining data
Plotting data with ggplot2
Extra (optional) exercises on dplyr and ggplot2
Software
You will need an updated version of R and RStudio Desktop.
Please check this page to make sure you have the correct versions of the software
Useful Resources