Tools for working with linguistic data
The members of the HZSK have developed tools, and maintain externally developed services, for processing and analyzing data, some of which have found widespread application in the field of digital humanities, among those are:
ANNIS is an open source, cross platform (Linux, Mac, Windows), web browser-based search and visualization architecture for complex multilayer linguistic corpora with diverse types of annotation. This metadata describes an instance running at HZSK repository.
Converter for CHAT to the ISO-TEI format for transcription of spoken language, version 0.9
For a corpus created with the Partitur-Editor and COMA, you can use EXAKT to: query for transcribed and annotated phenomena using regular expressions , view the transcript context for all matches, play the audio or video corresponding to a match, automatically or manually filter and sort query results, correlate query results with speaker or communication metadata, add your own analysis categories to a query result, save query results or export them to other applications (e.g.. Excel). EXAKT can also be used for corpora created with other tools (FOLKER, Transcriber, ELAN).
The EXMARaLDA Corpus Manager (Coma) is designed to assemble transcripts created with the EXMARaLDA Partiture-Editor with their corresponding recordings into corpora and enrich them with metadata. Metadata can be about speakers, communications (settings), recordings and the actual transcripts.
The EXMARaLDA Partitur-Editor is a tool for transcribing and annotating digital audio and video. With the Partitur-Editor, you can enter, edit and output transcriptions in partitur (“musical score”) notation. Transcription and digital audio or video recordings can be linked and aligned in this process. The Partitur-Editor offers the following functionality: Time-aligned transcription in multiple tiers, Processing of transcriptions according to different transcription conventions (CHAT, HIAT, GAT), Output of transcription data in different layouts (musical score, line-for-line), in different document formats (HTML, MS Word) and with multimedia links to audio or video, Adding manual annotations in an arbitrary number of additional tiers, semi-automatic annotation methods, support for systematic annotation schemes, Documentation of speakers, Data exchange with other widely-used systems (Praat, ELAN, Transcriber etc.), Calling CLARIN-D web services for automatization of workflows (alignment, token annotation).
Converter for EXMARaLDA Basic Transcriptions (EXB) to the ISO-TEI format for transcription of spoken language, version 0.9
Converter for FOLKER (FLN) to the ISO-TEI format for transcription of spoken language, version 0.9
The HZSK Corpus Services bundles functionality used for maintenance, curation, conversion, and visualization of corpus data.
Converter for ISO/TEI to TCF format for transcription of spoken language, version 0.9
Converter for TCF to the ISO-TEI format for transcription of spoken language, version 0.9
TEI Drop ist a tool for easy conversion of common transcription formats to TEI conformant files
This converter transforms a TextGrid file whose location is given by a reference in the wrapping TCF into an ELAN file. The reference inside the TCF will be updated after the conversion.
Converter for Transcriber to the ISO-TEI format for transcription of spoken language, version 0.9
Converter for several transcription and annotation formats
The Web-GIS for Wahrnehmungsdialektologie shows results of the collection of mental maps of the project "Der deutsche Sprachraum aus der Sicht linguistischer Laien".