英文

FERNET-C5

FERNET-C5(灵活的嵌入式表示网络)是一个单语的捷克BERT-base模型,使用93GB的捷克超大型清理抓取语料库(C5)进行预训练。详细信息请参阅我们的论文。

Paper

https://link.springer.com/chapter/10.1007/978-3-030-89579-2_3

我们论文的预印本可在 https://arxiv.org/abs/2107.10042 上获取。

Citation

如果您发现这个模型有用,请引用我们的论文:

@inproceedings{FERNETC5,
    title        = {Comparison of Czech Transformers on Text Classification Tasks},
    author       = {Lehe{\v{c}}ka, Jan and {\v{S}}vec, Jan},
    year         = 2021,
    booktitle    = {Statistical Language and Speech Processing},
    publisher    = {Springer International Publishing},
    address      = {Cham},
    pages        = {27--37},
    doi          = {10.1007/978-3-030-89579-2_3},
    isbn         = {978-3-030-89579-2},
    editor       = {Espinosa-Anke, Luis and Mart{\'i}n-Vide, Carlos and Spasi{\'{c}}, Irena}
}