数据集:

AhmedSSoliman/DJANGO

英文

Django代码翻译任务的数据集

Django数据集是Oda等人在2015年的ASE会议上使用的数据集。

Django数据集是用于代码生成的数据集,包括16000个训练样本、1000个开发样本和1805个测试样本。每个数据点由一行Python代码和一个手动创建的自然语言描述组成。

@inproceedings{oda2015ase:pseudogen1,
 author = {Oda, Yusuke and Fudaba, Hiroyuki and Neubig, Graham and Hata, Hideaki and Sakti, Sakriani and Toda, Tomoki and Nakamura, Satoshi},
 title = {Learning to Generate Pseudo-code from Source Code Using Statistical Machine Translation},
 booktitle = {Proceedings of the 2015 30th IEEE/ACM International Conference on Automated Software Engineering (ASE)},
 series = {ASE '15},
 month = {November},
 year = {2015},
 isbn = {978-1-5090-0025-8},
 pages = {574--584},
 numpages = {11},
 url = {https://doi.org/10.1109/ASE.2015.36},
 doi = {10.1109/ASE.2015.36},
 acmid = {2916173},
 publisher = {IEEE Computer Society},
 address = {Lincoln, Nebraska, USA}
}