Dataset:
bigbio/pubtator_central
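PubTator Central (PTC, https://www.ncbi.nlm.nih.gov/research/pubtator/) is a web service for exploring and retrieving bioconcept annotations in full-text biomedical articles. PTC provides automated annotations from state-of-the-art text mining systems for genes/proteins, genetic variants, diseases, chemicals, species, and cell lines, all available for immediate download. PTC annotates PubMed (30 million abstracts) as well as the PMC Open Access Subset and the Author Manuscript Collection (3 million full-text articles). Updated entity identification methods and a disambiguation module based on cutting-edge deep learning techniques provide increased accuracy.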
@article{10.1093/nar/gkz389,
  title   = {{PubTator central: automated concept annotation for biomedical full text articles}},
  author  = {Wei, Chih-Hsuan and Allot, Alexis and Leaman, Robert and Lu, Zhiyong},
  year    = 2019,
  month   = {05},
  journal = {Nucleic Acids Research},
  volume  = 47,
  number  = {W1},
  pages   = {W587-W593},
  doi     = {10.1093/nar/gkz389},
  issn    = {0305-1048},
  url     = {https://doi.org/10.1093/nar/gkz389},
  eprint  = {https://academic.oup.com/nar/article-pdf/47/W1/W587/28880193/gkz389.pdf}
}