要加载一个不在配置中的语言对,您只需要按照下面的方式指定语言代码。您可以在数据集描述的主页部分找到有效的语言对: http://opus.nlpl.eu/MultiParaCrawl.php 例如
dataset = load_dataset("multi_para_crawl", lang1="en", lang2="nl")
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
Initial Data Collection and Normalization[More Information Needed]
Who are the source language producers?[More Information Needed]
[More Information Needed]
Annotation process[More Information Needed]
Who are the annotators?[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
感谢 @abhishekkrthakur 添加此数据集。