数据集:
bigbio/ehr_rel
EHR-Rel是一个新颖的开源1生物医学概念相关性数据集,包含了3630对概念,是现有数据集的六倍。与以往的工作不同,该数据集是从电子健康记录(EHRs)中采样得到的,以确保概念与EHR概念检索任务相关。对数据集中概念的详细分析显示,其覆盖范围远远超过现有数据集。
@inproceedings{schulz-etal-2020-biomedical, title = {Biomedical Concept Relatedness {--} A large {EHR}-based benchmark}, author = {Schulz, Claudia and Levy-Kramer, Josh and Van Assel, Camille and Kepes, Miklos and Hammerla, Nils}, booktitle = {Proceedings of the 28th International Conference on Computational Linguistics}, month = {dec}, year = {2020}, address = {Barcelona, Spain (Online)}, publisher = {International Committee on Computational Linguistics}, url = {https://aclanthology.org/2020.coling-main.577}, doi = {10.18653/v1/2020.coling-main.577}, pages = {6565--6575}, }