数据集:

bigbio/seth_corpus

语言:

en

计算机处理:

monolingual

许可:

apache-2.0
英文

SETH Corpus数据集介绍

SETH语义关联提取(Semantic Extraction from Text for Human)语料库由630篇PubMed引文组成,用于SNP命名实体识别。

引用信息

@Article{SETH2016,
    Title       = {SETH detects and normalizes genetic variants in text.},
    Author      = {Thomas, Philippe and Rockt{"{a}}schel, Tim and Hakenberg, J{"{o}}rg and Lichtblau, Yvonne and Leser, Ulf},
    Journal     = {Bioinformatics},
    Year        = {2016},
    Month       = {Jun},
    Doi         = {10.1093/bioinformatics/btw234},
    Language    = {eng},
    Medline-pst = {aheadofprint},
    Pmid        = {27256315},
    Url         = {http://dx.doi.org/10.1093/bioinformatics/btw234
}