
All Cambridge Digital Humanities courses

The Transkribus Guided Project (Wed 29 Jul 2020, 16:00; finished)

This course introduces Transkribus, a software system that can be taught to read handwriting from images of documents and rapidly convert it into useful digital formats. The guided course provides basic training by practical immersion in the software, which requires only basic IT skills. Transkribus was developed by READ under the Horizon 2020 funding framework and is now a co-operative. It had 20,000+ users in 2019 and is becoming a standard research tool for mass transcription of archival sources. Participants will transcribe anonymised data from pre-loaded scans of forms filled out for the French national census of 1999, using Transkribus's downloadable software interface. These manual transcriptions will help train a handwritten text recognition (HTR) model to transcribe many more of these forms automatically later on. The model will eventually allow the creation of one of the largest data sets ever attempted from manuscript sources. This course is a collaboration between Transkribus and Cambridge Digital Humanities. It is funded by a Cambridge Humanities Research Grant.

Image big data are increasingly being used to understand the built and natural environment and to observe behaviours within it. Data sources include satellite and airborne imagery, 360-degree street views, and fixed video or time-lapse traffic and CCTV cameras. While some of these sources are newer than others, what has been changing is the quality of the images, the geographical coverage, and the potential for assessing changes over time. At the same time, improvements in machine learning have made it possible to turn images into quantitative data at scale.

In this workshop we will explore the challenges that researchers face when using images at scale to understand environments and behaviours. We will build on work at Cambridge to estimate cycling levels, on the use of satellite data to estimate motor vehicle volume, and on planned data collection in Kenya using 360-degree cameras.

Join our Methods Fellow, Amira Moeding, in a workshop that introduces methods of historical enquiry into the development of digital technologies and digital data. How can we do the history of technology today? What are the limits of historical enquiry, and what are its strengths? Moreover, what can we learn from historical narratives about technologies? More concretely, what can the history of “Big Data” tell us about artificial intelligence today? What, for example, were seen as the pitfalls and problems with biases early on in the development of data-driven applications?

Together with you, Amira will think through and employ methods of historical enquiry and critical theory to gain a better understanding of the origin of ‘data-driven’ digital technologies. The workshop attempts to bring about an understanding of statistical or data-driven methods by asking how they came about, and why and to whom they became attractive. It thus links technologies back to the interests and contexts that rendered them viable. This line of enquiry will allow us to ask what ‘technological progress’ currently is, how stories of ‘progress’ are narrated by industry actors, and what ‘risks’ become apparent from their perspective. By providing this contextualisation and recovering the early interests that drove developments in artificial intelligence research and ‘Big Tech’, we will also see that progress, and the promises for the future that it holds, are not ‘objective’ or ‘necessary’ but localised in time and space. We will ask to what degree digital humanities can not only use digital methods to aid the humanities, but also employ historical and philosophical methods to provide a basis for criticising and theorising ‘the digital’, and for putting into perspective the methods on which so-called ‘artificial intelligences’ are based.

This CDH Basics session introduces the IIIF image data framework, which has been developed by a consortium of the world’s leading research libraries and image repositories, and methods of access to image collections, including the collections of Cambridge University Digital Library. We will also discuss a range of methods for using IIIF image data in humanities research.
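As a rough illustration of what access to IIIF image data can look like, the sketch below builds a request URL following the path pattern of the IIIF Image API (identifier, region, size, rotation, quality and format). It is not part of the session materials. The server address and the identifier are hypothetical placeholders, and the default values assume Image API 3.0 conventions; a real collection publishes its own base URL and identifiers.

```python
from urllib.parse import quote


def iiif_image_url(base, identifier, region="full", size="max",
                   rotation="0", quality="default", fmt="jpg"):
    """Build a IIIF Image API request URL from its path segments.

    Pattern: {base}/{identifier}/{region}/{size}/{rotation}/{quality}.{format}
    """
    # The identifier must be percent-encoded, including any slashes it contains.
    encoded_id = quote(identifier, safe="")
    return f"{base.rstrip('/')}/{encoded_id}/{region}/{size}/{rotation}/{quality}.{fmt}"


# Hypothetical server and identifier, used only for illustration.
BASE = "https://iiif.example.org/image"

# Request the whole image at the largest size the server offers.
full_image = iiif_image_url(BASE, "MS-EXAMPLE-00001")

# Request a 1000x800 pixel region from the top-left corner, scaled to 500 pixels wide.
detail = iiif_image_url(BASE, "MS-EXAMPLE-00001/page 1",
                        region="0,0,1000,800", size="500,")

print(full_image)
print(detail)
```

Because every parameter sits in the URL path, a researcher can script requests for crops or thumbnails across a whole collection without downloading full-resolution files first.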