Linguistic Corpora at the HZSK Repository

The digital repository of the Hamburger Zentrum für Sprachkorpora stores and disseminates linguistic resources and tools.

Searched: Selkup
Hits: 2
general corpus / spoken / discourse

Selkup Language Corpus (SLC)

The Selkup Language Corpus was created within the project "Syntactic description of the Central and Southern Selkup dialects: a corpus based analyses" (supported by the German Research Grant; WA 3153/3-1). The primary goal of the project is to build a corpus and to study syntactic structures on that basis. The corpus contains 144 texts already published in written form with glosses and annotations. All texts have been translated into English, and most also into Russian and German. The corpus also contains rich metadata on the communications and speakers. Both the transcription data and the metadata are processed and stored in EXMARaLDA format.

Language: Selkup, Russian

License: CC BY-NC-SA 4.0 (public)

general corpus / spoken / conv: conversations

INEL Selkup Corpus

Selkup is an endangered Samoyedic language (Uralic family). The INEL Selkup corpus is composed of texts from the archive of Angelina Ivanovna Kuzmina (1924–2002), who between 1962 and 1977 gathered a large amount of material on Selkup in almost all regions where the Selkup people lived. Most texts in the corpus originate from the handwritten part of the archive; the others come from sound recordings made by A.I. Kuzmina, which were transcribed and translated within the INEL project. Each text in the corpus is provided with morphological glossing, translations into English, Russian and German, and annotation of borrowings. Some texts also have annotations for syntactic structure, semantic roles and information status.

Language: Selkup

License: CC BY-NC-SA 4.0 (public)
