skip to navigation skip to content
Wed 23 May 2018
09:30 - 17:30

Venue: Bioinformatics Training Room, Craik-Marshall Building, Downing Site

Provided by: Bioinformatics

This course is full - Add me to the waiting list

Other dates:

No more events

[ Show past events ]

Register interest
Register your interest - if you cannot make any of the currently scheduled dates and would be interested in additional dates being scheduled.

Booking / availability

Data Analysis and Visualisation in R

Wed 23 May 2018


This course introduces some relatively new additions to the R programming language: dplyr and ggplot2. In combination these R packages provide a powerful toolkit to make the process of manipulating and visualising data easy and intuitive.

Materials for this course can be found here.

Please note that if you are not eligible for a University of Cambridge Raven account you will need to book by linking here.

Target audience
  • Existing R users who are not familiar with dplyr and ggplot2
  • Those with programming experience in other languages that want to know what R can offer them
  • Graduate students, Postdocs and Staff members from the University of Cambridge, Affiliated Institutions and other external Institutions or individuals
  • Please be aware that these courses are only free for University of Cambridge students. All other participants will be charged a registration fee in some form. Registration fees and further details regarding the charging policy are available here.
  • Further details regarding eligibility criteria are available here

Attending the Introduction to Solving Biological Problems using R course would be beneficial, but not essential. We will assume that you know how to load RStudio, create variables and use functions. Some good introductory videos can be found here.


Number of sessions: 1

# Date Time Venue Trainers
1 Wed 23 May   09:30 - 17:30 09:30 - 17:30 Bioinformatics Training Room, Craik-Marshall Building, Downing Site map Hugo Tavares,  Dr Sandra Cortijo,  Anna Brestovitsky,  Dr Matthew Eldridge
Topics covered

Bioinformatics, Biology, Data handling, Data visualisation


After this course you should be able to:

  • Create reproducible documents
  • Import and tidy and datasets into R
  • Use dplyr to explore a dataset interactively
  • Produce simple analysis workflows in R
  • Make publication-ready graphics using ggplot2

During this course you will learn about:

  • How R enables reproducible research
  • What constitues a tidy dataset
  • "Piping" commands together to form a workflow
  • Subseting and filtering datasets using dplyr
  • Producing summary statistics from a dataset
  • Joining datasets using dplyr
  • The grammar of graphics approach to plotting used in ggplot2

Presentations, demonstrations and practicals

Registration Fees
  • Free for University of Cambridge students
  • £ 50/day for all University of Cambridge staff, including postdocs, and participants from Affiliated Institutions. Please note that these charges are recovered by us at the Institutional level
  • It remains the participant's responsibility to acquire prior approval from the relevant group leader, line manager or budget holder to attend the course. It is requested that people booking only do so with the agreement of the relevant party as costs will be charged back to your Lab Head or Group Supervisor.
  • £ 50/day for all other academic participants from external Institutions and charitable organizations. These charges must be paid at registration
  • £ 100/day for all Industry participants. These charges must be paid at registration
  • Further details regarding the charging policy are available here



A number of times per year

Related courses
Core skills

Booking / availability