Skip to content

No result difference using word2vec vectors for training NER in spacy 3 #9334

Discussion options

You must be logged in to vote

I have successfully regenerated my test results just like spacy 2(Not exact but absolute value is now similar).
Solution I found in spacy documentation https://spacy.io/api/architectures#parser
While generating base_config.cfg it's automatically selecting the tok2vec.model as

[components.ner.model.tok2vec]
@architectures = "spacy.Tok2VecListener.v1"
width = ${components.tok2vec.model.encode.width}
upstream = "*"

According to the documentation I replaced it by

@architectures = "spacy.HashEmbedCNN.v2"
pretrained_vectors = null
width = 96
depth = 4
embed_size = 2000
window_size = 1
maxout_pieces = 3
subword_features = true

Now it's working just like spacy 2.

Replies: 4 comments 5 replies

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
5 replies
@polm
Comment options

@polm
Comment options

@sagorbrur
Comment options

@polm
Comment options

@sagorbrur
Comment options

Comment options

You must be logged in to vote
0 replies
Answer selected by sagorbrur
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
training Training and updating models feat / vectors Feature: Word vectors and similarity
2 participants
Converted from issue

This discussion was converted from issue #9326 on September 30, 2021 10:09.