tasksource/bigbench | ATYUN.COM 官网-人工智能教程资讯全方位服务平台

数据集:

tasksource/bigbench

许可:

apache-2.0

源数据集:

original

批注创建人:

machine-generated expert-generated crowdsourced

语言创建人:

machine-generated expert-generated crowdsourced

大小:

size_categories:unknown

计算机处理:

monolingual multilingual

语言:

子任务:

extractive-qa open-domain-qa multiple-choice-qa

任务:

文本分类

问答

多项选择

数据集介绍文件清单

英文

BIG-Bench但不需要官方版本的恶心依赖项（tensorflow，pypi-bigbench，protobuf）。

dataset = load_dataset("tasksource/bigbench",'movie_recommendation')

重现代码： https://colab.research.google.com/drive/1MKdLdF7oqrSQCeavAcsEnPdI85kD0LzU?usp=sharing

将数据集限制为50k个示例，以保持轻巧。我还删除了默认拆分，当训练可用时，默认=train+val，以节省空间。

@article{srivastava2022beyond,
  title={Beyond the imitation game: Quantifying and extrapolating the capabilities of language models},
  author={Srivastava, Aarohi and Rastogi, Abhinav and Rao, Abhishek and Shoeb, Abu Awal Md and Abid, Abubakar and Fisch, Adam and Brown, Adam R and Santoro, Adam and Gupta, Aditya and Garriga-Alonso, Adri{\`a} and others},
  journal={arXiv preprint arXiv:2206.04615},
  year={2022}
}

作者:

tasksource

数据集大小:

11.48 KB