数据集:
gabeorlanski/bc-humaneval
您可以使用原始的 BabelCode Repo ,或者您可以使用 bc_eval Metric 。
BabelCode-HumaneEval(BC-HumanEval)数据集将 HumanEval dataset released by OpenAI 转换为16种编程语言。
BC-HumanEval 支持以下语言:
>>> from datasets import load_dataset >>> load_dataset("gabeorlanski/bc-humaneval") DatasetDict({ test: Dataset({ features: ['qid', 'title', 'language', 'text', 'signature_with_docstring', 'signature', 'arguments', 'solution', 'question_info'], num_rows: 2576 }) })
注意:如果要为预测使用不同的函数名称(对于需要类名称的语言,则是类名称),必须相应地更新 entry_fn_name 和 entry_cls_name。例如,如果原始问题具有 entry_fn_name 为 add,但想要将其更改为 f,则必须更新 ds["question_info"]["entry_fn_name"] 为 f:
>>> from datasets import load_dataset >>> ds = load_dataset("gabeorlanski/bc-humaneval")['test'] >>> # The original entry_fn_name >>> ds[0]['question_info']['entry_fn_name'] hasCloseElements >>> # You MUST update the corresponding entry_fn_name >>> ds[0]['question_info']['entry_fn_name'] = 'f' >>> ds[0]['question_info']['entry_fn_name'] f
有关数据集如何转换的详细信息,请参阅 BabelCode Paper 第2节。
有关如何创建原始 HumanEval 数据集的信息,请参阅 Evaluating Large Language Models Trained on Code paper 。
Google Research
CC-BY-4.0
@article{orlanski2023measuring, title={Measuring The Impact Of Programming Language Distribution}, author={Orlanski, Gabriel and Xiao, Kefan and Garcia, Xavier and Hui, Jeffrey and Howland, Joshua and Malmaud, Jonathan and Austin, Jacob and Singh, Rishah and Catasta, Michele}, journal={arXiv preprint arXiv:2302.01973}, year={2023} } @article{chen2021codex, title={Evaluating Large Language Models Trained on Code}, author={Mark Chen and Jerry Tworek and Heewoo Jun and Qiming Yuan and Henrique Ponde de Oliveira Pinto and Jared Kaplan and Harri Edwards and Yuri Burda and Nicholas Joseph and Greg Brockman and Alex Ray and Raul Puri and Gretchen Krueger and Michael Petrov and Heidy Khlaaf and Girish Sastry and Pamela Mishkin and Brooke Chan and Scott Gray and Nick Ryder and Mikhail Pavlov and Alethea Power and Lukasz Kaiser and Mohammad Bavarian and Clemens Winter and Philippe Tillet and Felipe Petroski Such and Dave Cummings and Matthias Plappert and Fotios Chantzis and Elizabeth Barnes and Ariel Herbert-Voss and William Hebgen Guss and Alex Nichol and Alex Paino and Nikolas Tezak and Jie Tang and Igor Babuschkin and Suchir Balaji and Shantanu Jain and William Saunders and Christopher Hesse and Andrew N. Carr and Jan Leike and Josh Achiam and Vedant Misra and Evan Morikawa and Alec Radford and Matthew Knight and Miles Brundage and Mira Murati and Katie Mayer and Peter Welinder and Bob McGrew and Dario Amodei and Sam McCandlish and Ilya Sutskever and Wojciech Zaremba}, year={2021}, eprint={2107.03374}, archivePrefix={arXiv}, primaryClass={cs.LG} }