Nově přidané
lexicalConceptualResource
Popis:
Czech OOV Inflection Dataset is a Czech inflection dataset of nouns, focused on evaluation in out-of-vocabulary (OOV) conditions. It consists of two parts: a standard lemma-disjoint train-dev-test split of a subset of noun ...
Tento záznam obsahuje 1 soubor (17.08
MB).
Publicly Available
lexicalConceptualResource
Popis:
Mapping table for the article Hajič et al., 2024: Mapping Czech Verbal Valency to PropBank Argument Labels, in LREC-COLING 2024, as preprocess by the algorithm described in the paper. This dataset i smeant for verification ...
Tento záznam obsahuje 1 soubor (4.26
MB).
Publicly Available
corpus
Popis:
CorefUD is a collection of previously existing datasets annotated with coreference, which we converted into a common annotation scheme. In total, CorefUD in its current version 1.2 consists of 25 datasets for 16 languages. ...
Tento záznam obsahuje 1 soubor (83.66
MB).
Publicly Available
Nejnavštěvovanější záznamy
Za poslední týden
toolService
Tento záznam neobsahuje soubory.
corpus
Popis:
The database consists of three sets: - Many Talker Set: 30 males, 30 females; each to read 50 numbers, 1-2 connected passages, 1 block of "filler" sentences, and 1 block of syllables. - Few Talker Set: 4 males, 4 females; ...
Tento záznam neobsahuje soubory.
toolService
Popis:
frequency list of the Parole corpus, 1 339 787 words
Tento záznam neobsahuje soubory.