Text processing for linguists and literary scholars with R
This course is a hands-on introduction to using the programming language R for the analysis of textual data (such as corpora, literary works, web data, etc.). It is based on the second edition (2016) of my textbook "Quantitative corpus linguistics with R" and introduces a variety of programming constructs required for text processing: functions and relevant data structures (e.g., vectors), control flow structures such as loops and conditionals, and a sizeable number of regular expressions. Time permitting, we will also cover the basics of data visualization. The data dealt with in this course come from a variety of differently formatted/annotated corpora and will also include one or two examples of literary works and/or XML processing.