Workshops
The Summer School offers a range of parallel weeklong workshops on important areas of Digital Humanities. Each workshop consists of a total of 15 sessions or 30 week-hours. The number of participants in each workshop is limited to 15. Each participant can only take part in one course at a time. Participants are requested to bring along their own materials and projects so that what is being taught can be directly applied and tested.
Taking into account the presented letters of motivation the following workshops will take place:
Christiane Fritze / Malte Rehbein: Transcribing and Describing Primary Sources using TEI-conformant XML
Alejandro Bia: Online Publishing using XML and TEI
- An introduction to XML technology
- Modeling document structure with DTDs and Schemas
- Text markup using XML and TEI
- Text rendering using XSLT (transforming XML to other output formats)
- Other concerns: engineering XML production, digital libraries, digital preservation, etc.
Jan Rybicki: Textual analysis in the Digital Humanities. Methods and Tools
Depending on the level of advancement of its participants, the workshop will consist of all or any of the stages: the basics of electronic text, the sources of electronic texts, OCR, correction, tagging, simple numerical observations (various lengths and frequencies), vocabulary richness, most-frequent-word lists, multivariate analysis, existing tools (with emphasis on Hoover spreadsheet tools and JGAAP).
The texts used for the workshops will be provided by the instructor; if necessary, the participants’ individual corpora will be expanded as needed and as available (online or elsewhere). The texts will be literary, multilingual, and include both originals and translations.
Stefan Th. Gries: Corpus and Corpus Analysis in Language (and Literary) Sciences
This course is a hands-on introduction to (quantitative) corpus linguistics. The course will cover the following contents:
- introduction: what are corpora, what kinds of corpora exist, why do corpus linguistics in the first place, aspects of corpus compilation;
- introduction to R: functions, arguments, and data structures;
- text manipulation 1: loading corpus files into R, performing simple search and replace operations;
- text manipulation 2: searching and replacing in corpora with regular expressions;
- corpus-linguistic applications: frequency lists, simple concordances, complex concordances;
- elementary statistical evaluation;
- examples of more advanced applications.
Data to be dealt with include plain text corpora, corpora with SGML or XML annotation, chat files from CHILDES, Unicode files. Case studies involve examples from morphology, syntax, first and second language acquisition, among other things.
The different levels of knowledgewill be taken into account.