Linguistic Corpora at the HZSK Repository

The digital repository of the Hamburger Zentrum für Sprachkorpora stores and disseminates linguistic resources and tools. Further information can be found here:

Searched: public
X
Hits: 10
http://hdl.handle.net/11022/0000-0000-4F70-A
general corpus / spoken / discourse

EXMARaLDA Demo Corpus 1.0

A selection of short audio and video recordings in various languages to be used for instruction or demonstration of the EXMARaLDA system.

Language: German, English, French, Spanish, Turkish, Polish, Vietnamese, Swedish, Norwegian, Italian, Russian, Afrikaans, Portuguese

License: HZSK-PUB (public)

Open lock icon indicates accessible resource
CLARIN icon indicates integration into CLARIN Eye icon indicates online browsable resource Download icon indicates downloads available for this resource
http://hdl.handle.net/11022/0000-0006-473B-9

Referenzkorpus Mittelniederdeutsch/Niederrheinisch (1200-1650)

The reference corpus of Middle Low German and Low Rhenish texts is based on manuscripts, prints and inscriptions. It is intended to provide an insight into the culture of speech and writing in Middle Low German and Low Rhenish regions. This spectrum of texttypes can be used to trace the linguistic development on the base of diatopic and diacronic subcategorisation. The aim of the project is the publication of diplomatic transcribed, lemmatised and grammatically annotated texts. The processed data – especially on the grammatical level – enables a linguistic analysis of the Middle Low German and Low Rhenish language, which goes far beyond what was possible until now.

Language: Undefined

License: CC-BY 4.0 (public)

Open lock icon indicates accessible resource
CLARIN icon indicates integration into CLARIN Download icon indicates downloads available for this resource
http://hdl.handle.net/11022/0000-0007-C2FA-4

Referenzkorpus Mittelniederdeutsch/Niederrheinisch (1200-1650)

The reference corpus of Middle Low German and Low Rhenish texts is based on manuscripts, prints and inscriptions. It is intended to provide an insight into the culture of speech and writing in Middle Low German and Low Rhenish regions. This spectrum of texttypes can be used to trace the linguistic development on the base of diatopic and diacronic subcategorisation. The aim of the project is the publication of diplomatic transcribed, lemmatised and grammatically annotated texts. The processed data – especially on the grammatical level – enables a linguistic analysis of the Middle Low German and Low Rhenish language, which goes far beyond what was possible until now.

Language: Undefined

License: CC-BY 4.0 (public)

Open lock icon indicates accessible resource
CLARIN icon indicates integration into CLARIN Download icon indicates downloads available for this resource
http://hdl.handle.net/11022/0000-0007-C4B1-3

Referenzkorpus Mittelniederdeutsch/Niederrheinisch (1200-1650)

The reference corpus of Middle Low German and Low Rhenish texts is based on manuscripts, prints and inscriptions. It is intended to provide an insight into the culture of speech and writing in Middle Low German and Low Rhenish regions. This spectrum of texttypes can be used to trace the linguistic development on the base of diatopic and diacronic subcategorisation. The aim of the project is the publication of diplomatic transcribed, lemmatised and grammatically annotated texts. The processed data – especially on the grammatical level – enables a linguistic analysis of the Middle Low German and Low Rhenish language, which goes far beyond what was possible until now.

Language: Undefined

License: CC-BY 4.0 (public)

Open lock icon indicates accessible resource
CLARIN icon indicates integration into CLARIN Download icon indicates downloads available for this resource
http://hdl.handle.net/11022/0000-0007-C64C-5

Referenzkorpus Mittelniederdeutsch/Niederrheinisch (1200-1650)

The reference corpus of Middle Low German and Low Rhenish texts is based on manuscripts, prints and inscriptions. It is intended to provide an insight into the culture of speech and writing in Middle Low German and Low Rhenish regions. This spectrum of texttypes can be used to trace the linguistic development on the base of diatopic and diacronic subcategorisation. The aim of the project is the publication of diplomatic transcribed, lemmatised and grammatically annotated texts. The processed data – especially on the grammatical level – enables a linguistic analysis of the Middle Low German and Low Rhenish language, which goes far beyond what was possible until now.

Language: Undefined

License: CC-BY 4.0 (public)

Open lock icon indicates accessible resource
CLARIN icon indicates integration into CLARIN Download icon indicates downloads available for this resource
https://corpora.uni-hamburg.de/repository/text-corpus:ren-0.6

Referenzkorpus Mittelniederdeutsch/Niederrheinisch (1200-1650)

The reference corpus of Middle Low German and Low Rhenish texts is based on manuscripts, prints and inscriptions. It is intended to provide an insight into the culture of speech and writing in Middle Low German and Low Rhenish regions. This spectrum of texttypes can be used to trace the linguistic development on the base of diatopic and diacronic subcategorisation. The aim of the project is the publication of diplomatic transcribed, lemmatised and grammatically annotated texts. The processed data – especially on the grammatical level – enables a linguistic analysis of the Middle Low German and Low Rhenish language, which goes far beyond what was possible until now.

Language: Undefined

License: CC-BY 4.0 (public)

Open lock icon indicates accessible resource
http://hdl.handle.net/11022/0000-0001-B002-5

Referenzkorpus Mittelniederdeutsch/Niederrheinisch (1200-1650)

The reference corpus of Middle Low German and Low Rhenish texts is based on manuscripts, prints and inscriptions. It is intended to provide an insight into the culture of speech and writing in Middle Low German and Low Rhenish regions. This spectrum of texttypes can be used to trace the linguistic development on the base of diatopic and diacronic subcategorisation. The aim of the project is the publication of diplomatic transcribed, lemmatised and grammatically annotated texts. The processed data – especially on the grammatical level – enables a linguistic analysis of the Middle Low German and Low Rhenish language, which goes far beyond what was possible until now.

Language: Undefined

License: CC-BY 4.0 (public)

Open lock icon indicates accessible resource
Download icon indicates downloads available for this resource
http://hdl.handle.net/11022/0000-0004-D2C3-2

Referenzkorpus Mittelniederdeutsch/Niederrheinisch (1200-1650)

The reference corpus of Middle Low German and Low Rhenish texts is based on manuscripts, prints and inscriptions. It is intended to provide an insight into the culture of speech and writing in Middle Low German and Low Rhenish regions. This spectrum of texttypes can be used to trace the linguistic development on the base of diatopic and diacronic subcategorisation. The aim of the project is the publication of diplomatic transcribed, lemmatised and grammatically annotated texts. The processed data – especially on the grammatical level – enables a linguistic analysis of the Middle Low German and Low Rhenish language, which goes far beyond what was possible until now.

Language: Undefined

License: CC-BY 4.0 (public)

Open lock icon indicates accessible resource
CLARIN icon indicates integration into CLARIN Download icon indicates downloads available for this resource
http://hdl.handle.net/11022/0000-0000-9B1E-1
general corpus / written / religious text

B4 Tatian Corpus of Deviating Examples 2.1

The present corpus, the Tatian Corpus of Deviating Examples T-CODEX 2.1, provides morpho-syntactic and information structural annotation of parts of the Old High German translation attested in the MS St. Gallen Cod. 56, traditionally called the OHG Tatian, one of the largest prose texts from the classical OHG period. This corpus was designed and annotated by Project B4 of Collaborative Research Center on Information Structure at Humboldt University Berlin. The present corpus compiles ca. 2.000 deviating examples found in the text portions of the scribes α, β, γ and ε. Each clause structure represents an extra file annotated with the annotation tool EXMARaLDA and searchable via ANNIS, a general-purpose tool for the publication, visualisation and querying of linguistic data collections, developed by Project D1 of the Collaborative Research Center on Information Structure at Potsdam University.

Language: Latin, Old High German

License: Creative Commons Attribution 3.0 Unported License (public)

Closed lock icon indicates restricted resource
SSO icon indicates single sign-on resource Download icon indicates downloads available for this resource
http://hdl.handle.net/11022/0000-0007-C641-0
general corpus / spoken / encyclopedia

The Spoken Wikipedia Corpora

Language: English, German, Dutch

License: Creative Commons Attribution-ShareAlike 4.0 International (public)

Closed lock icon indicates restricted resource
Download icon indicates downloads available for this resource