数据集:

strombergnlp/x-stance

子任务:

fact-checking

语言:

de fr

计算机处理:

multilingual

大小:

10K<n<100K

语言创建人:

found

批注创建人:

crowdsourced

预印本库:

arxiv:2003.08385

许可:

mit
中文

Dataset Card for X-Stance

Dataset Summary

The x-stance dataset contains more than 150 political questions, and 67k comments written by candidates on those questions. The comments are partly German, partly French and Italian. The data have been extracted from the Swiss voting advice platform Smartvote.

Languages

German, French/Italian

Dataset Structure

Data Instances

An example of 'train' looks as follows:

{
    'id': '0', 
    'question': 'Eine Volksinitiative fordert, dass die Gesamtfläche der Bauzonen in der Schweiz für die nächsten 20 Jahre auf dem heutigen Stand begrenzt wird. Befürworten Sie dieses Anliegen?', 
    'comment': 'Eine fixe Grösse verbieten, ist das falsche Mittel', '
    'label': 0
}

Data Fields

  • id : a 'string' feature.
  • question : a 'string' expressing a claim/topic.
  • comment : a 'string' to be classified for its stance to the source.
  • label :
            0: "AGAINST",
            1: "FAVOR"

Data Splits

languages name instances
de train 33850
de validation 2871
de test 11891
fr train 11790
fr validation 1055
fr test 5814

Dataset Creation

Curation Rationale

[More Information Needed]

Source Data

Initial Data Collection and Normalization

[More Information Needed]

Who are the source language producers?

[More Information Needed]

Annotations

Annotation process

[More Information Needed]

Who are the annotators?

[More Information Needed]

Personal and Sensitive Information

[More Information Needed]

Considerations for Using the Data

Social Impact of Dataset

[More Information Needed]

Discussion of Biases

[More Information Needed]

Other Known Limitations

[More Information Needed]

Additional Information

Dataset Curators

[More Information Needed]

Licensing Information

MIT License

Citation Information

@article{vamvas2020x,
  title={X-stance: A multilingual multi-target dataset for stance detection},
  author={Vamvas, Jannis and Sennrich, Rico},
  journal={arXiv preprint arXiv:2003.08385},
  year={2020}
}

Contributions

Thanks to mkonxd , leondz for adding this dataset.