Problem with using spacy.matcher.matcher.Matcher.add() method

Tags:

matcher

spacy

I am getting an error when trying to use spacy matcher:

~\Anaconda3\lib\site-packages\spacy\matcher\matcher.pyx in spacy.matcher.matcher.Matcher.add()
TypeError: add() takes exactly 2 positional arguments (3 given)

Is there any alternate function for spacy.matcher.matcher.Matcher.add()?

984

asked Feb 11 '21 22:02

Vignesh c s

3 Answers

See the SpaCy Matcher.add() documentation:

Changed in v3.0
As of spaCy v3.0, Matcher.add takes a list of patterns as the second argument (instead of a variable number of arguments). The on_match callback becomes an optional keyword argument.

patterns = [[{"TEXT": "Google"}, {"TEXT": "Now"}], [{"TEXT": "GoogleNow"}]] - matcher.add("GoogleNow", on_match, *patterns) + matcher.add("GoogleNow", patterns, on_match=on_match)

Example usage:

from spacy.matcher import Matcher

matcher = Matcher(nlp.vocab)
pattern = [{"LOWER": "hello"}, {"LOWER": "world"}]
matcher.add("HelloWorld", [pattern])
doc = nlp("hello world!")
matches = matcher(doc)

167

answered Oct 21 '22 01:10

Wiktor Stribiżew

Instead of using matcher.add('Relation_name', None, pattern)

You can use: matcher.add('Relation_name', [pattern], on_match=None)

answered Oct 20 '22 23:10

mpriya

In addition, if you have multiple patterns to be extracted, an example would be as below.

import spacy
nlp = spacy.load('en_core_web_sm')

from spacy.matcher import Matcher
matcher = Matcher(nlp.vocab)

pattern1 = [{'LOWER':'solarpower'}]
pattern2 = [{'LOWER':'solar'},{'IS_PUNCT':True},{'LOWER':'power'}]
pattern3 = [{'LOWER':'solar'},{'LOWER':'power'}]

matcher.add('SolarPower', [pattern1,pattern2,pattern3])
doc = nlp(u"The Solar Power industry continues to grow a solarpower increases. Solar-power is good")
found_matches = matcher(doc)


for _,start,end in found_matches:
    span = doc[start:end]
    print(span)

Output would be:

Solar Power 
solarpower 
Solar-power

answered Oct 21 '22 01:10

Thilee

Related questions
                            
                                Correct POS tags for numbers substituted with ## in spacy
                            
                                AttributeError: type object 'spacy.syntax.nn_parser.array' has no attribute '__reduce_cython__' , (adding Paths to virtual environments)
                            
                                Using different word2vec training data in spaCy
                            
                                Save SpaCy render file as SVG using DisplaCy
                            
                                How to train a sense2vec model
                            
                                Multithreading with spacy: Is joblib necessary?
                            
                                Can't find model 'en_core_web_md'. It doesn't seem to be a shortcut link, a Python package or a valid path to a data directory
                            
                                NLP, spaCy: Strategy for improving document similarity
                            
                                POS pattern mining with spacy
                            
                                Multi-Threaded NLP with Spacy pipe
                            
                                Spacy Japanese Tokenizer
                            
                                Formatting training dataset for SpaCy NER
                            
                                Extracting names from a text file using Spacy
                            
                                Custom sentence segmentation in Spacy
                            
                                Python: Chunking others than noun phrases (e.g. prepositional) using Spacy, etc
                            
                                'string' has incorrect type (expected str, got spacy.tokens.doc.Doc)
                            
                                Removing named entities from a document using spacy
                            
                                spaCy needs a file that is not there: strings.json
                            
                                ValueError: [E088] Text of length 1027203 exceeds maximum of 1000000. spacy
                            
                                Using RegEx for phrase pattern in EntityRuler

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With