数据集:

alexandrainst/ddisco

语言:

da

计算机处理:

monolingual

大小:

1K<n<10K

语言创建人:

expert-generated

批注创建人:

expert-generated

许可:

afl-3.0
中文

Dataset Card for DDisco

Dataset Description

The DDisco dataset is a dataset which can be used to train models to classify levels of coherence in danish discourse. Each entry in the dataset is annotated with a discourse coherence label (rating from 1 to 3):

1: low coherence (difficult to understand, unorganized, contained unnecessary details and can not be summarized briefly and easily) 2: medium coherence 3: high coherence (easy to understand, well organized, only contain details that support the main point and can be summarized briefly and easily). Grammatical and typing errors are ignored (i.e. they do not affect the coherency score) and the coherence of a text is considered within its own domain.

Additional Information

DDisCo: A Discourse Coherence Dataset for Danish

Contributions

@ajders