Data Manipulation and Visualisation in R
This course introduces some relatively new additions to the R programming language: dplyr and ggplot2. In combination, these R packages provide a powerful toolkit to make the process of manipulating and visualising data easy and intuitive.
Materials for this course can be found here.
The training room is located on the first floor and there is currently no wheelchair or level access available to this level.
Please note that if you are not eligible for a University of Cambridge Raven account you will need to book using the link here.
- Existing R users who are not familiar with dplyr and ggplot2
- Those with programming experience in other languages who want to know what R can offer them
- Graduate students, Postdocs and Staff members from the University of Cambridge, Affiliated Institutions and other external Institutions or individuals
- Please be aware that these courses are only free for registered University of Cambridge students. All other participants will be charged a registration fee in some form. Registration fees and further details regarding the charging policy are available here.
- Further details regarding eligibility criteria are available here
Attending the Introduction to Solving Biological Problems using R course would be beneficial, but not essential. We will assume that you know how to load RStudio, create variables and use functions. Some good introductory videos can be found here.
Number of sessions: 1
# | Date | Time | Venue | Trainers |
---|---|---|---|---|
1 | Tue 12 Mar 2019 | 09:30 - 17:30 | Bioinformatics Training Room, Craik-Marshall Building | Matthew Eldridge, Chandra Chilamakuri, Adrian Baez-Ortega |
Bioinformatics, Biology, Data handling, Data visualisation
After this course you should be able to:
- Create reproducible documents
- Import datasets into R and tidy them
- Use dplyr to explore a dataset interactively
- Produce simple analysis workflows in R
- Make publication-ready graphics using ggplot2
During this course you will learn about:
- How R enables reproducible research
- What constitutes a tidy dataset
- "Piping" commands together to form a workflow
- Subsetting and filtering datasets using dplyr
- Producing summary statistics from a dataset
- Joining datasets using dplyr
- The grammar of graphics approach to plotting used in ggplot2
Presentations, demonstrations and practicals
Day 1 | Topics |
---|---|
09:30 - 10:00 | Introduction |
10:00 - 12:00 | Visualisation with ggplot2 |
12:00 - 12:30 | Tidying and transforming data |
12:30 - 13:30 | Lunch (not provided) |
13:30 - 14:30 | Tidying and transforming data (cont.) |
14:30 - 15:30 | Workflows |
15:30 - 16:30 | Summarising, grouping, and combining data |
16:30 - 17:30 | Customising plots |
- Free for registered University of Cambridge students
- £50/day for all University of Cambridge staff, including postdocs, temporary visitors (students and researchers) and participants from Affiliated Institutions. Please note that these charges are recovered by us at the Institutional level.
- It remains the participant's responsibility to obtain prior approval from the relevant group leader, line manager or budget holder to attend the course. Please book only with the agreement of the relevant party, as costs will be charged back to your Lab Head or Group Supervisor.
- £50/day for all other academic participants from external Institutions and charitable organisations. These charges must be paid at registration.
- £100/day for all Industry participants. These charges must be paid at registration.
- Further details regarding the charging policy are available here
1
A number of times per year