symanto/xlm-roberta-base-snli-mnli-anli-xnli | ATYUN.COM 官网-人工智能教程资讯全方位服务平台

模型:

symanto/xlm-roberta-base-snli-mnli-anli-xnli

任务:

文本分类

类库:

PyTorch Transformers

数据集:

SNLI MNLI ANLI XNLI 3AXNLI 3AANLI 3AMNLI 3ASNLI

语言:

其他:

xlm-roberta 零样本分类

模型介绍文件清单

英文

一个经过训练的交叉注意力NLI模型，用于零-shot和少-shot文本分类。

基础模型是 xlm-roberta-base ，使用了 here 的代码进行训练；在 SNLI 、 MNLI 、 ANLI 和 XNLI 上进行训练。

用法：

from transformers import AutoModelForSequenceClassification, AutoTokenizer
import torch
import numpy as np

model = AutoModelForSequenceClassification.from_pretrained("symanto/xlm-roberta-base-snli-mnli-anli-xnli")
tokenizer = AutoTokenizer.from_pretrained("symanto/xlm-roberta-base-snli-mnli-anli-xnli")

input_pairs = [
               ("I like this pizza.", "The sentence is positive."),
               ("I like this pizza.", "The sentence is negative."),
               ("I mag diese Pizza.", "Der Satz ist positiv."),
               ("I mag diese Pizza.", "Der Satz ist negativ."),
               ("Me gusta esta pizza.", "Esta frase es positivo."),
               ("Me gusta esta pizza.", "Esta frase es negativo."),
]
inputs = tokenizer(input_pairs, truncation="only_first", return_tensors="pt", padding=True)
logits = model(**inputs).logits
probs = torch.softmax(logits, dim=1)
probs = probs[..., [0]].tolist()
print("probs", probs)
np.testing.assert_almost_equal(probs, [[0.83], [0.04], [1.00], [0.00], [1.00], [0.00]], decimal=2)

作者:

Symanto Research

数据集大小:

1.05 GB