Dataset:

AigizK/bashkir-russian-parallel-corpora

Task:

Translation

Languages:

ba ru

License:

cc-by-4.0

Dataset Card for "bashkir-russian-parallel-corpora"

How the dataset was assembled.

  • Find a text that exists in both languages, such as a translated book or a web page (Wikipedia, a news site).
  • Our algorithm tries to match each Bashkir sentence with its Russian translation.
  • The resulting pairs are given to people to check.
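
The card does not say which matching algorithm is used, so the following is only a minimal sketch of one common idea: monotonic alignment by sentence length, where sentences of similar character length are paired in order and unmatched sentences are skipped. The function name `align_by_length`, the `skip_cost` parameter, and the cost formula are all assumptions made for illustration, not the authors' method.

```python
from typing import List, Optional, Tuple


def align_by_length(
    src: List[str], tgt: List[str], skip_cost: float = 1.0
) -> List[Tuple[str, str]]:
    """Pair sentences in order, preferring pairs of similar character length.

    Hypothetical illustration only: dynamic programming over three moves
    (pair two sentences, skip a source sentence, skip a target sentence).
    """
    n, m = len(src), len(tgt)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    back: List[List[Optional[Tuple[int, int, bool]]]] = [
        [None] * (m + 1) for _ in range(n + 1)
    ]
    cost[0][0] = 0.0

    def relax(i: int, j: int, c: float, prev: Tuple[int, int, bool]) -> None:
        # Keep the cheapest way of reaching cell (i, j).
        if c < cost[i][j]:
            cost[i][j] = c
            back[i][j] = prev

    for i in range(n + 1):
        for j in range(m + 1):
            base = cost[i][j]
            if base == inf:
                continue
            if i < n and j < m:
                a, b = len(src[i]), len(tgt[j])
                # Relative length difference in [0, 1]; 0 means equal length.
                relax(i + 1, j + 1, base + abs(a - b) / max(a, b, 1), (i, j, True))
            if i < n:
                relax(i + 1, j, base + skip_cost, (i, j, False))  # drop a source sentence
            if j < m:
                relax(i, j + 1, base + skip_cost, (i, j, False))  # drop a target sentence

    # Walk back from the end and collect the sentences that were paired.
    pairs: List[Tuple[str, str]] = []
    i, j = n, m
    while (i, j) != (0, 0):
        pi, pj, paired = back[i][j]
        if paired:
            pairs.append((src[pi], tgt[pj]))
        i, j = pi, pj
    pairs.reverse()
    return pairs
```

Length alone is a weak signal, which is consistent with the card's final step: candidate pairs still go to people for checking.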

    @inproceedings{
    title={Bashkir-Russian parallel corpora},
    author={Iskander Shakirov and Aigiz Kunafin},
    year={2023}
    }