数据集:
bigbio/tmvar_v1
该数据集包含500篇手动注释的PubMed文章,涵盖了各种类型的突变提及。该数据集仅用于命名实体识别任务。数据集分为训练集(334篇)和测试集(166篇)。
@article{wei2013tmvar, title={tmVar: a text mining approach for extracting sequence variants in biomedical literature}, author={Wei, Chih-Hsuan and Harris, Bethany R and Kao, Hung-Yu and Lu, Zhiyong}, journal={Bioinformatics}, volume={29}, number={11}, pages={1433--1439}, year={2013}, publisher={Oxford University Press} }