AraGPT2 Detector

A detector model for machine-generated text, from the paper "AraGPT2: Pre-Trained Transformer for Arabic Language Generation".

The model was trained on long text passages and achieves an F1 score of 99.4%.

How to use it:

from transformers import pipeline
from arabert.preprocess import ArabertPreprocessor

# Text preprocessor and classification pipeline (model names as given by the authors)
processor = ArabertPreprocessor(model="aubmindlab/araelectra-base-discriminator")
pipe = pipeline("sentiment-analysis", model="aubmindlab/aragpt2-mega-detector-long")

text = " "  # placeholder: replace with the Arabic text to check
text_prep = processor.preprocess(text)
result = pipe(text_prep)
# Example output:
# [{'label': 'machine-generated', 'score': 0.9977743625640869}]
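
The pipeline returns a list of dictionaries with a label and a score, as in the example output above. A minimal sketch of turning that output into a yes/no decision follows. The helper name flag_machine_generated and the 0.9 threshold are illustrative assumptions and not part of the model or the authors' code. Only the 'machine-generated' label string is taken from the example output; any other label is treated as not flagged.

```python
from typing import Dict, List

# Label string as it appears in the example output above.
MACHINE_LABEL = "machine-generated"


def flag_machine_generated(
    results: List[Dict[str, object]], threshold: float = 0.9
) -> List[bool]:
    """Return True for each prediction labeled machine-generated
    whose score is at least `threshold` (threshold is an assumed value)."""
    return [
        r["label"] == MACHINE_LABEL and float(r["score"]) >= threshold
        for r in results
    ]


# Hypothetical usage with the example output shown above.
sample = [{"label": "machine-generated", "score": 0.9977743625640869}]
print(flag_machine_generated(sample))
```

A stricter threshold reduces false alarms on human-written text at the cost of missing some generated text; a suitable value depends on the application and should be checked on your own data.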

If you used this model, please cite our paper:

@misc{antoun2020aragpt2,
      title={AraGPT2: Pre-Trained Transformer for Arabic Language Generation},
      author={Wissam Antoun and Fady Baly and Hazem Hajj},
      year={2020},
      eprint={2012.15520},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

Contacts

Wissam Antoun: Linkedin | Twitter | Github | wfa07@mail.aub.edu | wissam.antoun@gmail.com

Fady Baly: Linkedin | Twitter | Github | fgb06@mail.aub.edu | baly.fady@gmail.com

Author:

AUB MIND LAB

Dataset size:

1.01 GB