数据集:
AhmedSSoliman/DJANGO
Django数据集是Oda等人在2015年的ASE会议上使用的数据集。
Django数据集是用于代码生成的数据集,包括16000个训练样本、1000个开发样本和1805个测试样本。每个数据点由一行Python代码和一个手动创建的自然语言描述组成。
@inproceedings{oda2015ase:pseudogen1, author = {Oda, Yusuke and Fudaba, Hiroyuki and Neubig, Graham and Hata, Hideaki and Sakti, Sakriani and Toda, Tomoki and Nakamura, Satoshi}, title = {Learning to Generate Pseudo-code from Source Code Using Statistical Machine Translation}, booktitle = {Proceedings of the 2015 30th IEEE/ACM International Conference on Automated Software Engineering (ASE)}, series = {ASE '15}, month = {November}, year = {2015}, isbn = {978-1-5090-0025-8}, pages = {574--584}, numpages = {11}, url = {https://doi.org/10.1109/ASE.2015.36}, doi = {10.1109/ASE.2015.36}, acmid = {2916173}, publisher = {IEEE Computer Society}, address = {Lincoln, Nebraska, USA} }