模型:

spacy/en_core_web_sm

类库:

spaCy

语言:

en

许可:

mit
英文

详细信息: https://spacy.io/models/en#en_core_web_sm

为 CPU 优化的英文处理流程。组件:tok2vec、标注器、解析器、分句器、命名实体识别器、属性规则器、词形归并器。

Feature Description
Name en_core_web_sm
Version 3.6.0
spaCy >=3.6.0,<3.7.0
Default Pipeline tok2vec , tagger , parser , attribute_ruler , lemmatizer , ner
Components tok2vec , tagger , parser , senter , attribute_ruler , lemmatizer , ner
Vectors 0 keys, 0 unique vectors (0 dimensions)
Sources 1232321 (Ralph Weischedel, Martha Palmer, Mitchell Marcus, Eduard Hovy, Sameer Pradhan, Lance Ramshaw, Nianwen Xue, Ann Taylor, Jeff Kaufman, Michelle Franchini, Mohammed El-Bachouti, Robert Belvin, Ann Houston) 1233321 (Emory University) 1234321 (Princeton University)
License MIT
Author 1235321

标签方案

查看标签方案(3个组件共有113个标签)
Component Labels
tagger $ , '' , , , -LRB- , -RRB- , . , : , ADD , AFX , CC , CD , DT , EX , FW , HYPH , IN , JJ , JJR , JJS , LS , MD , NFP , NN , NNP , NNPS , NNS , PDT , POS , PRP , PRP$ , RB , RBR , RBS , RP , SYM , TO , UH , VB , VBD , VBG , VBN , VBP , VBZ , WDT , WP , WP$ , WRB , XX , _SP , ````
parser ROOT , acl , acomp , advcl , advmod , agent , amod , appos , attr , aux , auxpass , case , cc , ccomp , compound , conj , csubj , csubjpass , dative , dep , det , dobj , expl , intj , mark , meta , neg , nmod , npadvmod , nsubj , nsubjpass , nummod , oprd , parataxis , pcomp , pobj , poss , preconj , predet , prep , prt , punct , quantmod , relcl , xcomp
ner CARDINAL , DATE , EVENT , FAC , GPE , LANGUAGE , LAW , LOC , MONEY , NORP , ORDINAL , ORG , PERCENT , PERSON , PRODUCT , QUANTITY , TIME , WORK_OF_ART

准确率

Type Score
TOKEN_ACC 99.86
TOKEN_P 99.57
TOKEN_R 99.58
TOKEN_F 99.57
TAG_ACC 97.25
SENTS_P 92.02
SENTS_R 89.21
SENTS_F 90.59
DEP_UAS 91.75
DEP_LAS 89.87
ENTS_P 84.55
ENTS_R 84.57
ENTS_F 84.56