Domovská stránka repozitáře LINDAT/CLARIAH-CZ

Nově přidané

lexicalConceptualResource

Autoři

Popis:

Czech OOV Inflection Dataset is a Czech inflection dataset of nouns, focused on evaluation in out-of-vocabulary (OOV) conditions. It consists of two parts: a standard lemma-disjoint train-dev-test split of a subset of noun ...

Tento záznam obsahuje 1 soubor (17.08 MB).

Publicly Available Distributed under Creative Commons

lexicalConceptualResource

LINDAT / CLARIAH-CZ

Mapping Czech Verbal Valency to PropBank Argument Labels: LREC2024 - verification data

Autoři

Hajič, Jan ; Fučíková, Eva ; Lopatková, Markéta and Urešová, Zdeňka

Popis:

Mapping table for the article Hajič et al., 2024: Mapping Czech Verbal Valency to PropBank Argument Labels, in LREC-COLING 2024, as preprocess by the algorithm described in the paper. This dataset i smeant for verification ...

Tento záznam obsahuje 1 soubor (4.26 MB).

Publicly Available Distributed under Creative Commons

corpus

LINDAT / CLARIAH-CZ

Coreference in Universal Dependencies 1.2 (CorefUD 1.2)

Popis:

CorefUD is a collection of previously existing datasets annotated with coreference, which we converted into a common annotation scheme. In total, CorefUD in its current version 1.2 consists of 25 datasets for 16 languages. ...

Tento záznam obsahuje 1 soubor (83.66 MB).

Publicly Available Distributed under Creative Commons

Nejnavštěvovanější záznamy

Za poslední týden

toolService

LRT + Open Submissions

Estonian Text-to-Speech Synthesiser for the Blind

Autoři

Neznámý autor

Tento záznam neobsahuje soubory.

corpus

LRT + Open Submissions

BABEL Estonian Database

Autoři

Meister, Einar

Popis:

The database consists of three sets: - Many Talker Set: 30 males, 30 females; each to read 50 numbers, 1-2 connected passages, 1 block of "filler" sentences, and 1 block of syllables. - Few Talker Set: 4 males, 4 females; ...

Tento záznam neobsahuje soubory.

toolService

LRT + Open Submissions

Parole frequency list

Autoři

Neznámý autor

Popis:

frequency list of the Parole corpus, 1 339 787 words

Tento záznam neobsahuje soubory.

Lingvistická data a nástroje

Vyhledávání

Podpora citací (persistentní identifikátory)

Úschova zdarma a bezpečně

Volitelné licence (avšak preferujeme otevřené)

Snadné hledání

Snadná citace

Nově přidané

Nejnavštěvovanější záznamy