数据集:
metaeval/sts-companion
https://ixa2.si.ehu.eus/stswiki/index.php/STSbenchmark
STS基准测试的伴随数据集包括我们在2012年至2017年SemEval语境中使用的其他英语数据集。作者汇编了两个数据集,一个数据集包含与机器翻译评估相关的句子对,另一个数据集包含用于领域自适应研究的其他数据集。
@inproceedings{cer-etal-2017-semeval, title = "{S}em{E}val-2017 Task 1: Semantic Textual Similarity Multilingual and Crosslingual Focused Evaluation", author = "Cer, Daniel and Diab, Mona and Agirre, Eneko and Lopez-Gazpio, I{\~n}igo and Specia, Lucia", booktitle = "Proceedings of the 11th International Workshop on Semantic Evaluation ({S}em{E}val-2017)", month = aug, year = "2017", address = "Vancouver, Canada", publisher = "Association for Computational Linguistics", url = "https://aclanthology.org/S17-2001", doi = "10.18653/v1/S17-2001", pages = "1--14", }