模型:
GroNLP/gpt2-medium-italian-embeddings
Wietse de Vries • Malvina Nissim
该模型基于中等的OpenAI GPT-2模型( gpt2-medium )。
该模型中的Transformer层权重与原始的英语模型相同,但词汇层已经重新训练为意大利语词汇。
详细信息请参阅我们关于 arXiv 的论文以及 Github 上的代码。
from transformers import pipeline pipe = pipeline("text-generation", model="GroNLP/gpt2-medium-italian-embeddings")
from transformers import AutoTokenizer, AutoModel, TFAutoModel tokenizer = AutoTokenizer.from_pretrained("GroNLP/gpt2-medium-italian-embeddings") model = AutoModel.from_pretrained("GroNLP/gpt2-medium-italian-embeddings") # PyTorch model = TFAutoModel.from_pretrained("GroNLP/gpt2-medium-italian-embeddings") # Tensorflow
@misc{devries2020good, title={As good as new. How to successfully recycle English GPT-2 to make models for other languages}, author={Wietse de Vries and Malvina Nissim}, year={2020}, eprint={2012.05628}, archivePrefix={arXiv}, primaryClass={cs.CL} }