I'm using the FuzzyWuzzy String Matching module from SeatGeek.
I find that when using the token_set_ratio search algorithm, small differences in case gives wildly differing results.
For example, if I am looking for the phrase "I am eating" in a file, I get a 100% match. But if the phrase is "i am eating", just the change in case of ONE letter, gives me a 65% match.
Is there any way to make the algorithm case insensitive?
token_set_ratio() is case insensitive by default.
from fuzzywuzzy import fuzz
fuzz.token_set_ratio("I am eating", "i am eating")
=> 100
I had the same issue, you probably were using Ratio and not TokenSetRatio...
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With