数据集:
bigbio/tmvar_v2
该数据集包含158篇PubMed文章,手动注释了各种类型的突变提及以及每个突变的dbsnp规范化。可用于命名实体识别(NER)任务和命名实体消歧(NED)任务。该数据集仅有一个划分。
@article{wei2018tmvar, title={tmVar 2.0: integrating genomic variant information from literature with dbSNP and ClinVar for precision medicine}, author={Wei, Chih-Hsuan and Phan, Lon and Feltz, Juliana and Maiti, Rama and Hefferon, Tim and Lu, Zhiyong}, journal={Bioinformatics}, volume={34}, number={1}, pages={80--87}, year={2018}, publisher={Oxford University Press} }