Datasets:

Baybars/parla_text_corpus

Languages:

ca

Multilinguality:

monolingual

Size categories:

100K<n<1M

Language creators:

various

Annotations creators:

no-annotation

Source datasets:

found

License:

cc-by-4.0

ParlaTextCorpus

Spoken text corpus for Catalan, derived and cleaned from three sources: OpenSubtitles, Tv3Parla and Festcat.
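
A minimal sketch of how the corpus might be loaded, assuming the Hugging Face `datasets` library is installed and the repository can be fetched by the identifier listed above. The helper name `load_parla_corpus` is hypothetical, and the split and column names are not stated on this card, so the sketch does not assume any.

```python
# Dataset identifier as listed under "Datasets" on this card.
DATASET_ID = "Baybars/parla_text_corpus"


def load_parla_corpus():
    """Download the corpus from the Hugging Face Hub and return it.

    Hypothetical helper for illustration. The import is done lazily so
    that defining this function does not require `datasets` to be
    installed or a network connection to be available.
    """
    from datasets import load_dataset

    # Assumption: the repository loads with default arguments. Some
    # library versions may need extra options for certain repositories.
    return load_dataset(DATASET_ID)
```

Calling `print(load_parla_corpus())` should list whatever splits and columns the repository actually provides. Whether the call succeeds may depend on the installed `datasets` version and on how the repository is hosted.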