What's New
lexicalConceptualResource
Description:
Czech OOV Inflection Dataset is a Czech inflection dataset of nouns, focused on evaluation in out-of-vocabulary (OOV) conditions. It consists of two parts: a standard lemma-disjoint train-dev-test split of a subset of noun ...
This item contains 1 file (17.08
MB).
Publicly Available
lexicalConceptualResource
Description:
Mapping table for the article Hajič et al., 2024: Mapping Czech Verbal Valency to PropBank Argument Labels, in LREC-COLING 2024, as preprocess by the algorithm described in the paper. This dataset i smeant for verification ...
This item contains 1 file (4.26
MB).
Publicly Available
corpus
Description:
CorefUD is a collection of previously existing datasets annotated with coreference, which we converted into a common annotation scheme. In total, CorefUD in its current version 1.2 consists of 25 datasets for 16 languages. ...
This item contains 1 file (83.66
MB).
Publicly Available
Most Viewed Items
Top Last Week
toolService
This item contains no files.
corpus
Description:
The database consists of three sets: - Many Talker Set: 30 males, 30 females; each to read 50 numbers, 1-2 connected passages, 1 block of "filler" sentences, and 1 block of syllables. - Few Talker Set: 4 males, 4 females; ...
This item contains no files.
toolService
Description:
frequency list of the Parole corpus, 1 339 787 words
This item contains no files.