
Tesseract OCR cube files for Turkish

Tags: ocr, tesseract

Where can I find the Turkish language files for Tesseract OCR's cube mode?

The files I am looking for:

tr.cube.fold
tr.cube.lm
tr.cube.nn
tr.cube.params
tr.cube.size
tr.cube.word-freq
Adem Aygun asked Dec 05 '25 02:12



2 Answers

The single file "tur.traineddata" is enough; it includes all of the files. Note that Tesseract's language code for Turkish is "tur", not "tr".

https://github.com/tesseract-ocr/tessdata/blob/master/tur.traineddata

and

https://github.com/tesseract-ocr/langdata/tree/master/tur

--

You could also use the trained data from tessdata_fast if you really need performance and are willing to lose some accuracy.

Grab the Turkish version at https://github.com/tesseract-ocr/tessdata_fast/blob/master/tur.traineddata
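
If it helps, here is a minimal Python sketch of fetching that file and pointing Tesseract at it. The helper names, the local `tessdata` folder, and `page.png` are made up for illustration, and swapping `/blob/` for `/raw/` in the link above is my assumption about how to get a direct download from GitHub. It also assumes the `tesseract` binary is installed and on your PATH.

```python
import subprocess
import urllib.request
from pathlib import Path

# The link from this answer; it points at the GitHub page, not the file itself.
BLOB_URL = "https://github.com/tesseract-ocr/tessdata_fast/blob/master/tur.traineddata"


def raw_url(blob_url: str) -> str:
    """Turn a GitHub 'blob' page URL into a direct download URL (assumed pattern)."""
    return blob_url.replace("/blob/", "/raw/", 1)


def fetch_traineddata(blob_url: str, tessdata_dir: Path) -> Path:
    """Download the traineddata file into tessdata_dir unless it is already there."""
    tessdata_dir.mkdir(parents=True, exist_ok=True)
    target = tessdata_dir / blob_url.rsplit("/", 1)[-1]
    if not target.exists():
        urllib.request.urlretrieve(raw_url(blob_url), target)
    return target


def tesseract_command(image: str, out_base: str, tessdata_dir: Path, lang: str = "tur") -> list:
    """Build the command line: -l selects the language, --tessdata-dir the data folder."""
    return ["tesseract", image, out_base, "-l", lang, "--tessdata-dir", str(tessdata_dir)]


def main() -> None:
    # Hypothetical paths; adjust to your own image and data directory.
    data_dir = Path("tessdata")
    fetch_traineddata(BLOB_URL, data_dir)
    subprocess.run(tesseract_command("page.png", "page", data_dir), check=True)
```

Calling `main()` should leave the recognized text in `page.txt`, since Tesseract appends `.txt` to the output base name. The same functions work for the regular tessdata link if you pass that URL instead.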

mesutpiskin answered Dec 07 '25 03:12



Nowhere. Cube is a dead end and will be removed from Tesseract; see, for example, https://github.com/tesseract-ocr/tesseract/issues/40

user898678 answered Dec 07 '25 04:12



