Dataset Card for "bashkir-russian-parallel-corpora"
How the dataset was assembled.
We find texts that exist in both languages, such as a translated book or a web page (Wikipedia, a news site).
Our algorithm tries to match each Bashkir sentence with its Russian translation.
We give these pairs to people to check
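The matching step can be illustrated with a minimal sketch. The card does not say which alignment algorithm was used, so everything below is an assumption made for illustration: the length-ratio score, the function names `length_score` and `align`, and the `skip_penalty` and `min_score` parameters are all hypothetical. The sketch aligns two sentence lists in order with dynamic programming, lets either side skip a sentence, and returns candidate pairs that would then go to human reviewers.

```python
from typing import List, Tuple


def length_score(src: str, tgt: str) -> float:
    """Score in [0, 1] based only on character lengths (assumed heuristic)."""
    a, b = len(src), len(tgt)
    if a == 0 or b == 0:
        return 0.0
    return min(a, b) / max(a, b)


def align(
    src_sents: List[str],
    tgt_sents: List[str],
    skip_penalty: float = 0.3,  # hypothetical cost of leaving a sentence unpaired
    min_score: float = 0.5,     # hypothetical threshold for keeping a pair
) -> List[Tuple[str, str, float]]:
    """Monotonic one-to-one alignment that allows skips on either side."""
    n, m = len(src_sents), len(tgt_sents)
    # best[i][j]: best total score for the first i source and j target sentences
    best = [[0.0] * (m + 1) for _ in range(n + 1)]
    back = [[""] * (m + 1) for _ in range(n + 1)]

    for i in range(n + 1):
        for j in range(m + 1):
            if i == 0 and j == 0:
                continue
            candidates = []
            if i > 0 and j > 0:
                score = length_score(src_sents[i - 1], tgt_sents[j - 1])
                candidates.append((best[i - 1][j - 1] + score, "match"))
            if i > 0:
                candidates.append((best[i - 1][j] - skip_penalty, "skip_src"))
            if j > 0:
                candidates.append((best[i][j - 1] - skip_penalty, "skip_tgt"))
            best[i][j], back[i][j] = max(candidates)

    # Walk back from the end to recover the chosen pairs.
    pairs: List[Tuple[str, str, float]] = []
    i, j = n, m
    while i > 0 or j > 0:
        move = back[i][j]
        if move == "match":
            score = length_score(src_sents[i - 1], tgt_sents[j - 1])
            if score >= min_score:
                pairs.append((src_sents[i - 1], tgt_sents[j - 1], score))
            i, j = i - 1, j - 1
        elif move == "skip_src":
            i -= 1
        else:
            j -= 1
    pairs.reverse()
    return pairs


if __name__ == "__main__":
    # Illustrative sentences only; not taken from the dataset.
    bashkir = ["Сәләм!", "Рәхмәт."]
    russian = ["Привет!", "Спасибо."]
    for ba, ru, score in align(bashkir, russian):
        print(f"{score:.2f}\t{ba}\t{ru}")
```

Length alone is a weak signal, which is one reason a human check after the automatic step is useful. A real pipeline would more likely score pairs with a bilingual dictionary or multilingual sentence embeddings.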
@inproceedings{
title={Bashkir-Russian parallel corpora},
author={Iskander Shakirov and Aigiz Kunafin},
year={2023}
}