Dataset:
bigbio/an_em
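The AnEM corpus is a domain- and species-independent resource manually annotated for anatomical entity mentions using a fine-grained classification system. The corpus consists of 500 documents (over 90,000 words) selected randomly from citation abstracts and full-text papers, with the aim of making the corpus representative of the entire available biomedical scientific literature. The annotation covers mentions of both healthy and pathological anatomical entities and contains over 3,000 annotated mentions.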
@inproceedings{ohta-etal-2012-open,
  author    = {Ohta, Tomoko and Pyysalo, Sampo and Tsujii, Jun{'}ichi and Ananiadou, Sophia},
  title     = {Open-domain Anatomical Entity Mention Detection},
  volume    = {W12-43},
  year      = {2012},
  url       = {https://aclanthology.org/W12-4304},
  publisher = {Association for Computational Linguistics}
}