数据集:
quail
任务:
多项选择子任务:
multiple-choice-qa语言:
en计算机处理:
monolingual大小:
10K<n<100K语言创建人:
found批注创建人:
crowdsourced源数据集:
original许可:
cc-by-nc-sa-4.0QuAIL 是一个阅读理解数据集。QuAIL 包含了来自四个领域(新闻、用户故事、小说、博客)中,长度为 300-350 个标记的多选问题共 15,000 个。QuAIL 是平衡的,并且标记了问题类型。
'train' 的一个示例如下所示。
This example was too long and was cropped: { "answers": ["the cousin is not friendly", "the cousin could have been pretier", "not enough information", "the cousin was too nice"], "context": "\"That fall came and I went back to Michigan and the school year went by and summer came and I never really thought about it. I'm...", "context_id": "f001", "correct_answer_id": 0, "domain": "fiction", "id": "f001_19", "metadata": { "author": "Joseph Devon", "title": "Black Eyed Susan", "url": "http://manybooks.net/pages/devonjother08black_eyed_susan/0.html" }, "question": "After the events in the text what does the author think about the cousin?", "question_id": "19", "question_type": "Subsequent_state" }
数据字段在所有拆分中都相同。
quailname | train | challenge | validation |
---|---|---|---|
quail | 10246 | 556 | 2164 |
@inproceedings{DBLP:conf/aaai/RogersKDR20, author = {Anna Rogers and Olga Kovaleva and Matthew Downey and Anna Rumshisky}, title = {Getting Closer to {AI} Complete Question Answering: {A} Set of Prerequisite Real Tasks}, booktitle = {The Thirty-Fourth {AAAI} Conference on Artificial Intelligence, {AAAI} 2020, The Thirty-Second Innovative Applications of Artificial Intelligence Conference, {IAAI} 2020, The Tenth {AAAI} Symposium on Educational Advances in Artificial Intelligence, {EAAI} 2020, New York, NY, USA, February 7-12, 2020}, pages = {8722--8731}, publisher = {{AAAI} Press}, year = {2020}, url = {https://aaai.org/ojs/index.php/AAAI/article/view/6398}, timestamp = {Thu, 04 Jun 2020 13:18:48 +0200}, biburl = {https://dblp.org/rec/conf/aaai/RogersKDR20.bib}, bibsource = {dblp computer science bibliography, https://dblp.org} }
感谢 @sai-prasanna 和 @ngdodd 添加此数据集。