模型:
GroNLP/gpt2-medium-dutch-embeddings
Wietse de Vries • Malvina Nissim
该模型基于中等大小的 OpenAI GPT-2( gpt2-medium )模型。
此模型中的 Transformer 层权重与原始的英语模型相同,但词汇层的训练已重新针对荷兰语词汇进行了。
详情请参阅我们关于 arXiv 的论文以及代码 Github 。
from transformers import pipeline pipe = pipeline("text-generation", model="GroNLP/gpt2-medium-dutch-embeddings")
from transformers import AutoTokenizer, AutoModel, TFAutoModel tokenizer = AutoTokenizer.from_pretrained("GroNLP/gpt2-medium-dutch-embeddings") model = AutoModel.from_pretrained("GroNLP/gpt2-medium-dutch-embeddings") # PyTorch model = TFAutoModel.from_pretrained("GroNLP/gpt2-medium-dutch-embeddings") # Tensorflow
@misc{devries2020good, title={As good as new. How to successfully recycle English GPT-2 to make models for other languages}, author={Wietse de Vries and Malvina Nissim}, year={2020}, eprint={2012.05628}, archivePrefix={arXiv}, primaryClass={cs.CL} }