Jump to Navigation

"Culture & Technology" European Summer University in Digital Humanities
University of Leipzig

Asking questions to data in the humanities: right, correct, efficient (Introducing and comparing XQuery, SQL, SPARQL for data from the humanities)

The amount of data in the digital humanities and its complexity is growing continuously. Modern database storage and access technologies are needed to handle this data. This course will give an introduction to three relevant technologies: relational databases and SQL, XQuery for XML-formatted data, and graph databases for highly-interconnected data. 

Relational databases organize their data in simple tables. SQL is the standard query language to search for and extract data from the database. The technology is mature, there are many excellent database systems available, most programming languages and application programs provide easy access to relational databases.

XML is a language to describe the structure of documents as a hierarchy. XQuery is the standard language to query XML single documents or document collections to search for and extract content from these documents. XML is used for large corpora of text, it is supported by many programming languages, there are numerous application programs and editors for XML data, and it is often used in web-based environments.

Graph databases are a relatively new development. Here, data is seen as information nodes, and the nodes are linked via named arcs. Graphs are highly dynamic and thus well suited in the exploration phase when working with corpora.

For the introduction to these technologies, we will bring sample data, which also allows us to compare the different query methods by using the same underlying information in all paradigms. In course projects, participants may work with sample data provided by us or their own data. By looking at the data, we will discuss ways of asking questions to the data and then try to express them in the query language(s)

Participants should have some basic knowledge of XML. In the course students will have access to the database system sqLite and the XML editor Oxygen.

In the first week we will work on the basics of SQL and XQuery, introducing  the basic concepts and syntax of the query languages SQL (for relational databases) and XQuery (for XML data). The key concepts here will be SQL's FROM-SELECT-WHERE, JOIN, aggregate functions and XQuery's FLWOR, functions, output formatting.

In the second week of this workshop we will look at advanced constructions in SQL and XQuery, apply both to the same data sets and compare them to each other. Additionally we will look at graph databases. Following this introduction we will assess  how to find questions based on the data, select the appropriate formalism and express the question in the query language. We will also look at applying XQuery to query TEI documents. Key concepts of this week will be inserting, updating, deleting data, SQL's stored procedures and XQuery's user defined functions, graph databases, SPARQL, application of query languages to participant's own research questions.

  • Deutsch
  • The Name
  • Background
  • Mission
  • Audience
  • Workshops
  • Lectures
  • Projects
  • Round Tables
  • Working Languages
  • Impressum
  • Kontakt

2022

  • Important dates
  • Application
  • Workshops
  • Experts
  • ConfTool
  • Scholarships etc.
  • Participation fees
  • Moodle
  • Scientific Committee

2021

  • ESU DH C&T 2021
  • Important dates 2021
  • ConfTool
  • Programme
  • Workshops
  • Experts
  • Application
  • Lectures
  • Scholarships
  • Participation fees
  • Moodle
  • Scientific Committee

2020

  • Important dates
  • Schedule
  • Workshops
  • Lectures (public)
  • Panel (public)
  • Experts
  • Lecturers
  • Application
  • Scholarships
  • Participation fees

2019

  • Schedule
  • Workshops
  • Lectures (public)
  • Projects (public)
  • Poster Session (public)
  • Panel (public)
  • Teasers (public)
  • Cultural programme
  • Experts
  • Lecturers
  • Scientific Committee
  • Important dates (new)
  • Application
  • Scholarships (updated)
  • Participation fees
  • Refund policy
  • T-Shirts
  • Child care
  • Birthday thoughts

2018

  • Schedule
  • Workshops
    • XML-TEI document encoding, structuring, rendering and transformation
    • Hands on Humanities Data Workshop - Creation, Discovery and Analysis
    • Collocations from a multilingual perspective: theory, tools, and applications
    • Reflected Text Analysis in the Digital Humanities
    • Humanities Data and Mapping Environments
    • Building and analysing multimodal corpora
    • Stylometry
    • Asking questions to data in the humanities: right, correct, efficient (Introducing and comparing XQuery, SQL, SPARQL for data from the humanities)
    • Computer Vision Intervention. How digital methods help to visually understand corpora of art and cultural heritage
    • Integrating Human Science Data using CIDOC-CRM as Formal Ontology: a practical approach
    • The humanities scholar's perspective on rule based machine translation
    • Word Vectors and Corpus Text Mining with Python
    • Text Mining with Canonical Text Services
    • How Research Infrastructures empower eHumanities and eHeritage Research(ers)
    • Introduction to Project Management
  • Lectures (public)
  • Projects (public)
  • Posters (public)
  • Panel discussion (public)
  • Teasers (public)
  • Cultural Programme
  • Experts
  • Lecturers
  • Scientific Committee
  • Important dates
  • Application
  • Scholarships
  • Fees
  • Refund policy
  • T-Shirt
  • The logo riddle
  • Child Care

2017

  • Schedule
  • Workshops
  • Lectures (public)
  • Projects (public)
  • Panel (public)
  • Teasers / Specials
  • Cultural Programme
  • Experts
  • Lecturers
  • Scientific Committee
  • Important dates
  • Application
  • Scholarships
  • Fees
  • Refund Policy
  • T-Shirt
  • Flyer
  • Child care

2016

  • Schedule
  • Workshops
  • Lectures (public)
  • Projects & Posters (public)
  • Panel
  • Teasers (public)
  • Slams
  • Experts
  • Lecturers
  • Scientific Committee
  • Important dates
  • Application
  • Scholarships
  • Fees
  • Refund policy
  • Flyer
  • Child Care

2015

  • Schedule
  • Workshops
  • Lectures
  • Projects
  • Posters
  • Panel
  • Teaser / Special sessions
  • Workshop Slams
  • Experts
  • Lecturers
  • Scientific Committee
  • Important dates
  • Application
  • Scholarships
  • Fees
  • Refund policy
  • Child Care
  • T-Shirt 2015
  • Flyer and Poster
  • Sponsorship
  • Questions

2014

  • Schedule
  • Workshops
  • Lectures
  • Projects
  • Panel
  • Experts
  • Lecturers
  • Scientific Committee
  • Important dates
  • Application
  • Scholarships
  • Fees
  • Child care
  • Flyer
  • Sponsorship

2013

  • Schedule
  • Workshops
  • Lectures
  • Projects & Posters
  • Panel
  • Experts
  • Lecturers
  • Project Presenters
  • Scientific Committee
  • Important dates
  • Application
  • Bursaries
  • Fees
  • Refund Policy
  • T-Shirt
  • Certificate
  • Sponsorship

2012

  • Home
  • Schedule
  • Workshops
  • Lectures
  • Project Presentations
  • Poster Slam & Session
  • Panel Discussions
  • Excursion
  • Lecturers
  • Certificate
  • Scientific Committee
  • Important Dates
  • Duration & Structure
  • Application
  • Registration Fees
  • Bursaries

2010

  • Schedule
  • Workshops
  • Instructors
  • Lectures
  • Round table
  • Important dates
  • Application
  • Fees
  • Bursaries

2009

  • Schedule
  • Workshops
  • Instructors
  • Lectures
  • Project presentations
  • Round tabel

Leipzig

  • Contact
  • Mailinglist
  • Host
  • Venue
  • Moodle
  • Accommodation (updated)
  • City Map
  • Arrival
  • Events
  • Weather

What the ESU means to me

ESU in the Media

ESU 2019 Experiences (DARIAH-EU)
ESU 2018 Experiences (CLARIN-D)
ESU 2017 (CLARIN-D Blog)
CLARIN-D at ESU 2015 (YouTube)
CLARIN-D ESU 2015 (YouTube)
Mephisto 97.6 10.07.13
Campus Online 10.08.2012
Mephisto 97.6 26.07.2010
infotvleipzig 26.07.2010
In India 03.09.2010

Reviews

INFOtheka: Review of ESU DH 2009
INFOtheka: Review of ESU DH 2012
Infoclio.ch: Review of ESU DH
2013

Publications

Multimodal Analysis of “well”

Users

  • Login

DAAD

 

CLARIN ERIC

 

Sächsische Akademie der Wissenschaften

 

Universität Leipzig

 

BMBF

 

Electronic Textual Cultures Lab at the University of Victoria & Digital Humanities Summer Institute

CLARIN-D

 

DARIAH-EU

 

Slovenian Language Technologies Society (SDJT)

 

Parthenos

International Centre/AAA

 

Computational Humanities

 

Oxygen XML Editor

 

Universitätsbibliothek