数据集:

vietgpt/mfaq_en

任务:

问答

语言:

en

大小:

1M<n<10M

其他:

LM
英文

MFAQ

from datasets import load_dataset

load_dataset("vietgpt/mfaq_en")
  • Format for QA task
def preprocess(
    sample,
    instruction_key="### Instruction:",
    response_key="<|endofprompt|>",
    end_key="<|endoftext|>",
):
    question = sample['question']
    completion = sample['answer']
    return {'text': """Below is an instruction that describes a task. Write a response that appropriately completes the request.
{instruction_key}
{question}
{response_key}
{completion}
{end_key}""".format(
    instruction_key=instruction_key,
    question=question,
    response_key=response_key,
    completion=completion,
    end_key=end_key,
)}

"""
Below is an instruction that describes a task. Write a response that appropriately completes the request.
### Instruction:
After the application of WOL, can we start the transmission before we have obtained the Wireless Operating License?
<|endofprompt|>
No, one is not legally supposed to begin the operational transmission. However, you can try doing test transmission.
<|endoftext|>
"""