数据集:
webis/args_me
任务:
文本检索子任务:
document-retrieval语言:
en计算机处理:
monolingual大小:
100K<n<1M语言创建人:
crowdsourced批注创建人:
machine-generated源数据集:
original许可:
cc-by-4.0args.me语料库(版本1.0,清理后)包含了从2019年中期的四个辩论门户网站中爬取的382,545个论点。辩论门户网站包括Debatewise、IDebate.org、Debatepedia和Debate.org。这些论点是使用为每个辩论门户设计的启发式方法提取的。
import datasets args = datasets.load_dataset('webis/args_me', 'corpus', streaming=True) args_iterator = iter(args) for arg in args_iterator: print(args['conclusion']) print(args['id']) print(args['argument']) print(args['stance']) break
文档检索,有争议问题的论点检索
args.me语料库是单语的;仅包括英语(主要是en-US)文档。
{'conclusion': 'Science is the best!', 'id': 'd6517702-2019-04-18T12:36:24Z-00000-000', 'argument': 'Science is aright I guess, but Physical Education (P.E) is better. Think about it, you could sit in a classroom for and hour learning about molecular reconfiguration, or you could play football with your mates. Why would you want to learn about molecular reconfiguration anyway? I think the argument here would be based on, healthy mind or healthy body. With science being the healthy mind and P.E being the healthy body. To work this one out all you got to do is ask Steven Hawkins. Only 500 words', 'stance': 'CON'}
[需要更多信息]
[需要更多信息]
[需要更多信息]
[需要更多信息]
源语言制作者是谁?[需要更多信息]
[需要更多信息]
注释者是谁?[需要更多信息]
[需要更多信息]
[需要更多信息]
[需要更多信息]
[需要更多信息]
[需要更多信息]
Creative Commons Attribution 4.0 International (CC BY 4.0)
@dataset{yamen_ajjour_2020_4139439, author = {Yamen Ajjour and Henning Wachsmuth and Johannes Kiesel and Martin Potthast and Matthias Hagen and Benno Stein}, title = {args.me corpus}, month = oct, year = 2020, publisher = {Zenodo}, version = {1.0-cleaned}, doi = {10.5281/zenodo.4139439}, url = {https://doi.org/10.5281/zenodo.4139439} }