Dataset:

castorini/triviaqa_gar-t5_expansions

English

Dataset Summary

This repository provides answer, title, and sentence expansions for the Trivia QA corpus, generated with gar-T5.

Dataset Structure

The dataset contains a dev split and a test split.

An example data entry from the dev set looks as follows:

{
    "id": "1",
    "predicted_answers": ["Bz"],
    "predicted_titles": ["Vehicle registration plates of Belize *** Vehicle registration plate"],
    "predicted_sentences": ["The international code for Belize is \"\"BZ\"\"."]
}

An example data entry from the test set looks as follows:

{
    "id": "1",
    "predicted_answers": ["Taurus"],
    "predicted_titles": ["Jamie Lee Curtis *** Under the Tuscan Sun *** Angels (Jamie Lee Curtis song) *** Under the Tuscan Sun (film) *** John Michael King *** Robert Earl *** Henry Jones, Sr. *** Jamie Lee (singer) *** Under the Tuscan Sun (1974 film) *** Richard Benjamin"],
    "predicted_sentences": ["In July 2007, several news outlets reported that the couple had quietly married in December 2007, and that Curtis had taken a liking to one another, sharing \"\"sweet nothings\"\" about their relationship."]
}
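
In both examples, each string in predicted_titles appears to pack several titles together, joined by " *** ". The following is a minimal sketch of splitting them apart. It assumes " *** " is the separator (inferred from the examples, not stated by the dataset authors); split_titles is a hypothetical helper name, and the record is the dev example above trimmed to the fields used here.

```python
import json

# Assumption: " *** " separates individual titles, as the examples suggest.
TITLE_SEPARATOR = " *** "

def split_titles(record):
    """Return the individual titles from every string in predicted_titles."""
    titles = []
    for joined in record["predicted_titles"]:
        titles.extend(t.strip() for t in joined.split(TITLE_SEPARATOR))
    return titles

# The dev example above, trimmed to the fields this sketch needs.
line = '{"id": "1", "predicted_titles": ["Vehicle registration plates of Belize *** Vehicle registration plate"]}'
record = json.loads(line)
titles = split_titles(record)
print(titles)
```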

Loading the Dataset

An example of loading the dataset:

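from datasets import load_dataset
# Map each split name to its JSON Lines file in the repository.
data_files = {"dev": "dev/dev.jsonl", "test": "test/test.jsonl"}
dataset = load_dataset('castorini/triviaqa_gar-t5_expansions', data_files=data_files)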
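
The .jsonl extension suggests one JSON object per line, so a locally downloaded copy of a split could also be read with the standard library alone. The sketch below assumes that layout; read_jsonl is a hypothetical helper, and the temporary file is only a stand-in so the example runs without downloading anything.

```python
import json
import os
import tempfile

def read_jsonl(path):
    """Read a JSON Lines file into a list of dicts, skipping blank lines."""
    with open(path, encoding="utf-8") as f:
        return [json.loads(line) for line in f if line.strip()]

# Stand-in file holding one trimmed record; with a real copy of the data,
# point read_jsonl at the local dev/dev.jsonl instead.
sample = {"id": "1", "predicted_answers": ["Bz"]}
with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "dev.jsonl")
    with open(path, "w", encoding="utf-8") as f:
        f.write(json.dumps(sample) + "\n")
    records = read_jsonl(path)

print(records[0]["predicted_answers"])
```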