数据集:
bigbio/paramed
NEJM 是从新英格兰医学杂志网站爬取的中英平行语料库。英文文章通过 https://www.nejm.org/ 分发,中文文章通过 http://nejmqianyan.cn/ 分发。该语料库包含自2011年以来的所有文章对(约2000对)。
@article{liu2021paramed, author = {Liu, Boxiang and Huang, Liang}, title = {ParaMed: a parallel corpus for English–Chinese translation in the biomedical domain}, journal = {BMC Medical Informatics and Decision Making}, volume = {21}, year = {2021}, url = {https://bmcmedinformdecismak.biomedcentral.com/articles/10.1186/s12911-021-01621-8}, doi = {10.1186/s12911-021-01621-8} }