Dataset:

HuggingFaceH4/helpful_instructions

Dataset Card for Helpful Instructions

Dataset Summary

Helpful Instructions is a dataset of (instruction, completion) pairs derived from public datasets. As the name suggests, it focuses on "helpful" instructions, i.e., the kinds of questions or tasks a human user might ask an AI assistant to handle. You can load the dataset as follows:

from datasets import load_dataset

# Load all subsets
helpful_instructions = load_dataset("HuggingFaceH4/helpful_instructions", name="all")

# Load a single subset
helpful_instructions_subset = load_dataset("HuggingFaceH4/helpful_instructions", name="self_instruct")

Supported Tasks and Leaderboards

This dataset can be used to fine-tune pretrained language models to follow instructions.
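As a rough illustration of how such pairs might be prepared for fine-tuning, the sketch below joins one (instruction, completion) pair into a single training string. The column names `instruction` and `completion`, the helper `format_example`, and the prompt template are all assumptions made for this example; check the dataset's actual schema before relying on them.

```python
from typing import Dict


def format_example(
    example: Dict[str, str],
    instruction_key: str = "instruction",  # assumed column name
    completion_key: str = "completion",    # assumed column name
) -> Dict[str, str]:
    """Join one (instruction, completion) pair into a single training string."""
    instruction = example[instruction_key].strip()
    completion = example[completion_key].strip()
    # Hypothetical prompt template; the dataset card does not prescribe one.
    text = f"Instruction: {instruction}\nResponse: {completion}"
    return {"text": text}


# A hand-written record standing in for a real dataset row.
sample = {"instruction": " Name a primary color. ", "completion": "Red."}
formatted = format_example(sample)
print(formatted["text"])
```

If the columns match, the same function could be applied to every row of a loaded split with `Dataset.map`, for example `split.map(format_example)`.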

Changelog

  • March 5, 2023: v1.0.0 release, with subsets from HuggingFaceH4/self_instruct (self_instruct, super_natural_instructions, prompt_source)