Dataset:

ahazeemi/opus-medical-en-de

Size:

100K<n<1M

Languages:

de en

Task:

Translation

Other:

medical

Dataset Card for "opus-medical-en-de"

This is the multi-domain German-English parallel dataset introduced in Aharoni and Goldberg (2020). It is a new data split that avoids duplicate examples and leakage from the train split into the dev/test splits. The original multi-domain data first appeared in Koehn and Knowles (2017) and consists of five datasets available on the OPUS website.
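
To make the idea of a leakage-free split concrete, here is a minimal sketch of one way duplicates and train/dev overlap could be removed from sentence pairs. It is not the procedure used by the dataset authors. The helper names (`normalize`, `dedup`, `remove_leakage`), the whitespace-only normalization, and the toy sentence pairs are all assumptions made for illustration.

```python
from typing import List, Tuple

Pair = Tuple[str, str]  # (English sentence, German sentence)


def normalize(pair: Pair) -> Pair:
    """Collapse whitespace so trivially different copies compare equal."""
    en, de = pair
    return (" ".join(en.split()), " ".join(de.split()))


def dedup(pairs: List[Pair]) -> List[Pair]:
    """Drop repeated pairs, keeping the first occurrence in order."""
    seen = set()
    unique = []
    for pair in pairs:
        key = normalize(pair)
        if key not in seen:
            seen.add(key)
            unique.append(pair)
    return unique


def remove_leakage(train: List[Pair], held_out: List[Pair]) -> List[Pair]:
    """Drop held-out pairs that also occur in the train split."""
    train_keys = {normalize(p) for p in train}
    return [p for p in dedup(held_out) if normalize(p) not in train_keys]


# Toy data, invented for illustration only (not taken from the dataset).
train = [
    ("Take one tablet daily.", "Nehmen Sie täglich eine Tablette ein."),
    ("Take one tablet daily.", "Nehmen Sie täglich eine Tablette ein."),
    ("Store below 25 °C.", "Nicht über 25 °C lagern."),
]
dev = [
    ("Store below 25 °C.", "Nicht über 25 °C lagern."),
    ("Shake well before use.", "Vor Gebrauch gut schütteln."),
]

clean_train = dedup(train)
clean_dev = remove_leakage(clean_train, dev)
```

In this toy run the repeated train pair is collapsed and the dev pair that also appears in train is dropped, so evaluation would only see sentences the model was not trained on. A real pipeline might also compare on the source side alone or apply stronger normalization; that choice is left open here.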