The Swedish xsum dataset is a machine-translated version of the English xsum dataset. It has only been machine-translated, and it is intended to improve downstream fine-tuning on Swedish summarization tasks.
For full details, see the original English version: https://huggingface.co/datasets/xsum
The Swedish xsum dataset uses the same three splits as the original English version: train, validation, and test.
Dataset Split | Number of Instances in Split |
---|---|
Train | 204,045 |
Validation | 11,332 |
Test | 11,334 |
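
The split sizes above can be sanity-checked after download. The following is a minimal sketch, not an official loader: the helper names (`split_fractions`, `find_size_mismatches`, `load_split_sizes`) are hypothetical, and `repo_id` is a placeholder for this dataset's Hub identifier, which is not stated here. Only `load_split_sizes` needs the `datasets` library and network access.

```python
# Split sizes as listed in the table above.
EXPECTED_SPLIT_SIZES = {"train": 204_045, "validation": 11_332, "test": 11_334}


def split_fractions(sizes=EXPECTED_SPLIT_SIZES):
    """Return each split's share of the total number of instances."""
    total = sum(sizes.values())
    return {name: count / total for name, count in sizes.items()}


def find_size_mismatches(actual_sizes, expected_sizes=EXPECTED_SPLIT_SIZES):
    """Return {split: (expected, actual)} for every split whose size differs."""
    return {
        name: (expected, actual_sizes.get(name))
        for name, expected in expected_sizes.items()
        if actual_sizes.get(name) != expected
    }


def load_split_sizes(repo_id):
    """Load a dataset from the Hugging Face Hub and return its split sizes.

    `repo_id` is a placeholder: pass the Hub identifier of the Swedish xsum
    dataset. Requires the `datasets` library and network access.
    """
    # Imported lazily so the helpers above work without `datasets` installed.
    from datasets import load_dataset

    dataset = load_dataset(repo_id)
    return {name: split.num_rows for name, split in dataset.items()}
```

If the downloaded data matches the table, `find_size_mismatches(load_split_sizes(repo_id))` should return an empty dictionary; any entry it does return names a split whose row count differs from the listed one.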