skip to navigation skip to content
Instructor-led course

Provided by: Bioinformatics


This course is not scheduled to run.


[ Show past events ]



Register interest
Register your interest - if you would be interested in additional dates being scheduled.


Events available

Introduction to high-throughput sequencing data analysis


Description

This course provides an introduction to high-throughput sequencing (HTS) data analysis methodologies. Lectures will give insight into how biological knowledge can be generated from RNA-seq, ChIP-seq and DNA-seq experiments and illustrate different ways of analyzing such data. Practicals will consist of computer exercises that will enable the participants to apply statistical methods to the analysis of RNA-seq, ChIP-seq and DNA-seq data under the guidance of the lecturers and teaching assistants. It is aimed at researchers who are applying or planning to apply HTS technologies and bioinformatics methods in their research.

Please note that if you are not eligible for a University of Cambridge Raven account you will need to book or register your interest by linking here.

Target audience
  • Graduate students, Postdocs and Staff members from the University of Cambridge, Affiliated Institutions and other external Institutions or individuals
  • Please be aware that these courses are only free for University of Cambridge students. All other participants will be charged a registration fee in some form. Registration fees and further details regarding the charging policy are available here
  • Further details regarding eligibility criteria are available here
Prerequisites

The course will include an introduction to Unix and R but to get the most out of the course we suggest acquiring:

  • Basic experience of command line UNIX
  • Sufficient UNIX experience might be obtained from one of the many UNIX tutorials available online.
  • Basic knowledge of the R syntax
  • For a real beginner's introduction into R see here. More advanced R instructions can be found at Quick-R or An Introduction to R
Topics covered

Bioinformatics, ChIP-seq, Data handling, Data mining, Data visualisation, Epigenomics, Functional genomics, RNA-seq, Sequence variations, Transcriptomics

Objectives

After this course you should be able to:

  • Understand the advantages and limitations of the high-throughput assays presented
  • Assess the quality of your datasets
  • Perform alignment and peak calling of ChIP-seq datasets
  • Perform alignment, quantification of expression and guided transcriptome assembly of RNA-seq datasets
  • Understand the standard file formats for representing variant data
  • Apply filters to your list of variants and functionally annotate variants
Aims

During this course you will learn about:

  • HTS technology
  • Quality control of raw reads: FASTQC and fastx toolkit
  • Considerations on experiment design for ChIP-seq and RNA-seq
  • Read alignment to a reference genome: Bowtie and Tophat
  • File format conversion and processing: UCSC tools and samtools
  • Peak calling: MACS
  • Motif analysis: MEME
  • Quantification of expression and guided transcriptome assembly: Cufflinks
  • Analysis of variants
Format

Presentations, demonstrations and practicals

Timetable

Day 1 - Intro to NGS analysis, QC & Mapping Topics Speaker(s)
09:30 - 10:30 Introduction to Unix Roman Kreuzhuber
10:30 - 10:45 Tea/Coffee Break
10:45 - 12:00 Introduction to Unix Roman Kreuzhuber
12:00 - 13:00 Lunch
13:00 - 15:30  Introduction to R and Bioconductor Mukarram Hossain
15:30 - 15:45 Tea/Coffee Break
15:45 - 16:30  High-throughput sequencing overview Sandra Cortijo
16:30 - 17:30 High-throughput sequencing analysis workflow - group exercise Sandra Cortijo
Day 2 - ChIP-Seq analysis
9:30 - 10:45 High-throughput sequencing quality control Romina Petersen, Sandra Cortijo
10:45 - 11:00  Tea/Coffee Break
11:00 - 12:00  Sequence alignment  Romina Petersen, Sandra Cortijo
12:00 -13:00  Lunch
13:00 - 14:00  Introduction to ChIP-seq Romina Petersen, Sandra Cortijo
14:00 - 15:00 ChIP-seq - peak calling and annotation
15:00 - 15:30  ChIP-seq - motif analysis 
15:30 - 15:45  Tea/Coffee Break
15:45 - 17:30  ChIP-seq - enrichment plots  Romina Petersen, Sandra Cortijo
Day 3 - RNA-Seq analysis
9:30 - 11:00 Introduction to RNA-seq  Luigi Grassi, Tomas Di Domenico
11:00 - 11:15 Tea/Coffee Break
11:15 - 12:00  RNA-seq - Alignment and splice junction identification  Luigi Grassi, Tomas Di Domenico
12:00 - 13:00   RNA-seq - Transcriptome assembly 
13:00 - 14:00  Lunch
14:00 - 15:00  RNA-seq analysis - Differential expression analysis  Luigi Grassi, Tomas Di Domenico
15:00 - 15:30  RNA-seq - Functional annotation 
15:30 - 15:45 Tea/Coffee Break
15:45 - 16:30  RNA-seq - Functional annotation (Cont.)  Luigi Grassi, Tomas Di Domenico
16:30 - 17:30  RNA-seq - Advanced differential expression 
Day 4 - Variants analysis
9:30 - 11:00 Preparing a bam file for variant calling Mareike Herzog, Roman Kreuzhuber
11:00 - 11:15 Tea/Coffee Break
11:15 - 12:30 Calling variants Mareike Herzog, Roman Kreuzhuber
12:30 - 13:30 Lunch
13:30 - 15:00 Filtering and recalibrating variants Mareike Herzog, Roman Kreuzhuber
15:00 - 15:15 Tea/Coffee Break
15:15 - 17:00 Variant annotation Mareike Herzog, Roman Kreuzhuber
Registration Fees
  • Free for University of Cambridge students
  • £ 50/day for all University of Cambridge staff, including postdocs, and participants from Affiliated Institutions. Please note that these charges are recovered by us at the Institutional level
  • It remains the participant's responsibility to acquire prior approval from the relevant group leader, line manager or budget holder to attend the course. It is requested that people booking only do so with the agreement of the relevant party as costs will be charged back to your Lab Head or Group Supervisor.
  • £ 50/day for all other academic participants from external Institutions and charitable organizations. These charges must be paid at registration
  • £ 100/day for all Industry participants. These charges must be paid at registration
  • Further details regarding the charging policy are available here
Duration

4 days

Frequency

A number of times per year

Related courses
Theme
Specialized Training

Events available