数据集:
onestop_english
OneStopEnglish is a corpus of texts written at three reading levels, and demonstrates its usefulness for through two applications - automatic readability assessment and automatic text simplification.
[More Information Needed]
[More Information Needed]
An instance example:
{ "text": "When you see the word Amazon, what’s the first thing you think...", "label": 0 }
Note that each instance contains the full text of the document.
The OneStopEnglish dataset has a single train split.
Split | Number of instances |
---|---|
train | 567 |
[More Information Needed]
[More Information Needed]
Who are the source language producers?[More Information Needed]
[More Information Needed]
Who are the annotators?[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
Creative Commons Attribution-ShareAlike 4.0 International License
[More Information Needed]
Thanks to @purvimisal for adding this dataset.