Tesseract OCR cube files for Turkish

Question

Where can I find tesseract ocr Turkish language extension for cube mode ?

files:

tr.cube.fold
tr.cube.lm
tr.cube.nn
tr.cube.params
tr.cube.size
tr.cube.word-freq

mesutpiskin · Accepted Answer

It includes all files, just this file is enough "tur.traineddata"

https://github.com/tesseract-ocr/tessdata/blob/master/tur.traineddata

and

https://github.com/tesseract-ocr/langdata/tree/master/tur

--

You could also use the trained data from tessdata_fast if you really need performance and are willing to lose some accuracy.

Grab the Turkish version at https://github.com/tesseract-ocr/tessdata_fast/blob/master/tur.traineddata

user898678 · Answer

Nowhere. Cube is dead-end and will be eliminated from tesseract e.g. see https://github.com/tesseract-ocr/tesseract/issues/40

Donate For Us