Skip to content

Downloading Spacy NLP Engine by default when using AzureAILanguageRecognizer #1551

Answered by omri374
BlaiseCz asked this question in Q&A
Discussion options

You must be logged in to vote

A spaCy pipeline is an inherent part in Presidio, as it's used not just for detecting entities but also for tokenization, lemmatization and more (used in the context awareness mechanism and other places). If you're not interested in downloading a large model, you can configure Presidio to use a small spaCy model, which contains the NLP pipeline components but doesn't use a model for NER.

from presidio_analyzer import AnalyzerEngine, RecognizerRegistry
from presidio_analyzer.nlp_engine import NlpEngineProvider
from presidio_analyzer.predefined_recognizers import AzureAILanguageRecognizer

# Set up the NLP engine
nlp_engine_provider = NlpEngineProvider(nlp_configuration={
    "nlp_engine_name"

Replies: 1 comment 3 replies

Comment options

You must be logged in to vote
3 replies
@BlaiseCz
Comment options

@omri374
Comment options

@BlaiseCz
Comment options

Answer selected by BlaiseCz
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants