Dataset:

liyucheng/chinese_metaphor_dataset


Chinese Metaphor Corpus (CMC)

Dataset Summary

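CMC is the first Chinese metaphor corpus built for both metaphor identification and metaphor generation. It is a large-scale resource of about 9,000 Chinese metaphorical sentences, each annotated with its tenor and vehicle. More details are available in the GitHub repo and in our COLING 2022 paper.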

Further details are also available on Zhihu.
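
As a rough illustration of how the corpus might be used, the sketch below wraps `datasets.load_dataset` with the repository ID listed on this card and adds a small sanity check that the annotated tenor and vehicle actually occur in the sentence. The field names `sentence`, `tenor`, and `vehicle` are assumptions made for this example, not the confirmed schema, and the dataset may also require a configuration or split name; inspect the loaded dataset's column names before relying on them.

```python
from typing import Dict, List, Tuple

DATASET_ID = "liyucheng/chinese_metaphor_dataset"  # repository ID from this card


def load_cmc():
    """Download the corpus from the Hugging Face Hub (needs network access)."""
    # Imported lazily so the offline helper below works without `datasets` installed.
    from datasets import load_dataset
    return load_dataset(DATASET_ID)


def missing_spans(record: Dict[str, str],
                  sentence_key: str = "sentence",
                  span_keys: Tuple[str, ...] = ("tenor", "vehicle")) -> List[str]:
    """Return the span keys whose text does not occur verbatim in the sentence.

    The default key names are hypothetical; pass the real column names
    once the schema has been checked.
    """
    sentence = record[sentence_key]
    return [key for key in span_keys if record[key] not in sentence]


# Toy record invented for illustration; it is NOT taken from the corpus.
toy = {"sentence": "时间像流水", "tenor": "时间", "vehicle": "流水"}
print(missing_spans(toy))  # prints []
```

Calling `load_cmc()` and printing the result shows the available splits and columns, which is the quickest way to replace the assumed key names with the real ones.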

Languages

Chinese

Citation Information

@inproceedings{li-etal-2022-cm,
    title = "{CM}-Gen: A Neural Framework for {C}hinese Metaphor Generation with Explicit Context Modelling",
    author = "Li, Yucheng  and
      Lin, Chenghua  and
      Guerin, Frank",
    booktitle = "Proceedings of the 29th International Conference on Computational Linguistics",
    month = oct,
    year = "2022",
    address = "Gyeongju, Republic of Korea",
    publisher = "International Committee on Computational Linguistics",
    url = "https://aclanthology.org/2022.coling-1.563",
    pages = "6468--6479",
}