模型:
adalbertojunior/distilbert-portuguese-cased
这个模型是从 BERTimbau 中提炼出来的
from transformers import AutoTokenizer # Or BertTokenizer from transformers import AutoModelForPreTraining # Or BertForPreTraining for loading pretraining heads from transformers import AutoModel # or BertModel, for BERT without pretraining heads model = AutoModelForPreTraining.from_pretrained('adalbertojunior/distilbert-portuguese-cased') tokenizer = AutoTokenizer.from_pretrained('adalbertojunior/distilbert-portuguese-cased', do_lower_case=False)
你应该根据你自己的数据进行微调
在某些任务中,它可以达到比原始的BERTimbau高达99%的准确率