skip to navigation skip to content
- Select training provider - (Researcher Development Programme (RDP))
Thu 30 Apr, Thu 7 May 2020
11:00 - 12:00

Venue: Cambridge Digital Humanities Online

Provided by: Cambridge Digital Humanities


Booking

Bookings cannot be made on this event (Event is completed).


Other dates:

No more events



Register interest
Register your interest - if you would be interested in additional dates being scheduled.


Booking / availability

Introduction to Text-mining with Python [remote delivery]
New

Thu 30 Apr, Thu 7 May 2020

Description

This online session will introduce basic methods for reading and processing text files in Python with Jupyter Notebooks. We'll discuss why you might wish to do text-mining, and whether coding with Python is the right choice for you. We'll run through the 5 steps of text-mining, and start to walk through an example that reads in a text corpus, splits it into words and sentences (tokens), removes unwanted words (stopwords), counts the tokens (frequency analysis), and visualises results.

This initial session is one hour long and will be delivered remotely by video conferencing. During the session we will cover the essentials of working with the Jupyter Notebooks provided so that you can carry on working through the materials in your own time. The first session will be followed by a second, optional Q&A session for troubleshooting issues and recapping essentials.

Required preparation: A short internet-based exercise in working with variables and text in Python will be sent out one week prior to the session. You will also get instructions on how to find the materials we will be using and how to log onto the video conferencing platform. Please make sure you have some time to prepare properly so that we can concentrate on teaching during the remote session.

Target audience

PhD students and staff

Prerequisites

No prior knowledge of Python is required, and no installations will be needed. We will use web services available in your browser to follow along.

Sessions

Number of sessions: 2

# Date Time Venue Trainer
1 Thu 30 Apr 2020   11:00 - 12:00 11:00 - 12:00 Cambridge Digital Humanities Online Mary Chester-Kadwell
2 Thu 7 May 2020   11:00 - 12:00 11:00 - 12:00 Cambridge Digital Humanities Online Mary Chester-Kadwell
Theme
Machine Reading the Archive

Booking / availability