数据集:

bigbio/tmvar_v3

语言:

en

计算机处理:

monolingual

预印本库:

arxiv:2204.03637
中文

Dataset Card for tmVar v3

This dataset contains 500 PubMed articles manually annotated with mutation mentions of various kinds and dbsnp normalizations for each of them. In addition, it contains variant normalization options such as allele-specific identifiers from the ClinGen Allele Registry It can be used for NER tasks and NED tasks, This dataset does NOT have splits.

Citation Information

@misc{https://doi.org/10.48550/arxiv.2204.03637,
  title        = {tmVar 3.0: an improved variant concept recognition and normalization tool},
  author       = {
    Wei, Chih-Hsuan and Allot, Alexis and Riehle, Kevin and Milosavljevic,
    Aleksandar and Lu, Zhiyong
  },
  year         = 2022,
  publisher    = {arXiv},
  doi          = {10.48550/ARXIV.2204.03637},
  url          = {https://arxiv.org/abs/2204.03637},
  copyright    = {Creative Commons Attribution 4.0 International},
  keywords     = {
    Computation and Language (cs.CL), FOS: Computer and information sciences,
    FOS: Computer and information sciences
  }
}