Linguistic Corpora at the HZSK Repository
EXMARaLDA Demo Corpus 1.0
A selection of short audio and video recordings in various languages to be used for instruction or demonstration of the EXMARaLDA system.
Language: German, English, French, Spanish, Turkish, Polish, Vietnamese, Swedish, Norwegian, Italian, Russian, Afrikaans, Portuguese
License: HZSK-PUB (public)
Community Interpreting Database Pilot Corpus (ComInDat)
Audio and video recordings of various types of community interpreted discourse (doctor-patient communication, simulated doctor-patient communication, courtroom communication) in German (simulated and authentic doctor-patient communication) and US (courtroom communication) institutions with varying community languages. Video recordings only exist for the simulated communication. For the authentic interpreted doctor-patient communication, no audio files will be made available.
Language: German, English, Spanish, Turkish, Polish, Portuguese, Romanian, Russian, Haitian
License: HZSK-RES (restricted)
Hamburg Corpus of Polish in Germany (HamCoPoliG)
Audio recordings of German/Polish bilingual and Polish monolingual adults (16-46 years). Recordings of semi-spontaneous data (3 topics) and renarration of a picture story.
Language: Polish
License: HZSK-RES (restricted)
Hamburg Corpus of Polish in Germany (HamCoPoliG)
Audio recordings of German/Polish bilingual and Polish monolingual adults (16-46 years). Recordings of semi-spontaneous data (3 topics) and renarration of a picture story.
Language: Polish
License: HZSK-RES (restricted)