I have a large (~50 MB) file of Spanish sentences, and I want to find the ones that don't contain foreign words. My plan is to filter out any sentence containing a word that isn't in a spellchecker dictionary. Does such a tool exist? Or is it worth playing around with search trees and hash tables to build an efficient spellchecker myself?
You can try the spell checker in Whoosh via a short Python script, as described here: http://pythonhosted.org/Whoosh/spelling.html
Or use PyEnchant: http://pythonhosted.org/pyenchant/tutorial.html
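If you only need a yes/no answer per word, a plain Python set (a hash table) is already fast enough for a file of this size. Here is a minimal sketch of that idea, not a full spellchecker: the function names and the tiny word list are made up for illustration, and in practice you would load a real Spanish word list, one word per line.

```python
import re

# Matches runs of letters, including accented characters such as "ñ" or "á".
WORD_RE = re.compile(r"[^\W\d_]+")


def load_wordlist(lines):
    """Build a lowercase set of known words from an iterable of lines."""
    return {line.strip().lower() for line in lines if line.strip()}


def is_all_known(sentence, known_words):
    """Return True if every word of the sentence is in known_words."""
    return all(w.lower() in known_words for w in WORD_RE.findall(sentence))


def filter_sentences(sentences, known_words):
    """Yield only the sentences that contain no unknown words."""
    for sentence in sentences:
        if is_all_known(sentence, known_words):
            yield sentence


# Hypothetical toy dictionary; replace with open("your_wordlist.txt", encoding="utf-8").
known = load_wordlist(["el", "gato", "come", "pescado", "la", "casa", "es", "grande"])

sample = [
    "El gato come pescado.",
    "La casa es grande.",
    "El gato come sushi today.",
]
kept = list(filter_sentences(sample, known))
print(kept)
```

A bare word list will not cover every inflected form (verb conjugations, plurals), which is where a real spellchecker dictionary helps. With PyEnchant you could swap the set lookup for `enchant.Dict("es").check(word)`, assuming a Spanish dictionary is installed for Enchant on your system.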