Description
Environment
Windows, 64-bit.
Tesseract v5.2.0.20220708, clean install from the binary installer (tesseract-ocr-w64-setup-v5.2.0.20220708.exe)
Current Behavior:
Command Line to Reproduce Problem:
C:\Program Files\Tesseract-OCR>tesseract --tessdata-dir "C:\Program Files\Tesseract-OCR\tessdata" "d:\temp\input.png" "d:\temp\output" --psm 3 -l eng
output.txt:
12/2/7 12174 /AH
Expected Behavior:
With such a clean input image, I would expect the output to be:
12/2/7 /2174 /AH
The inconsistent spacing is tolerable, but the digit 1 recognized in place of / (12174 instead of /2174) is a serious problem.
I tried all available --psm modes, and none of them fixed it.
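
For anyone who wants to repeat the --psm sweep, here is a minimal Python sketch of how it could be scripted. It assumes `tesseract` is on the PATH and reuses the options from the command line above. The helper names `build_command` and `run_all_modes` and the `_psm<N>` output suffix are made up for illustration and are not part of Tesseract.

```python
import subprocess
from pathlib import Path

# Page segmentation modes accepted by `tesseract --psm` (0 through 13).
PSM_MODES = range(14)


def build_command(image, out_base, psm, tessdata_dir=None, lang="eng"):
    """Return the argument list for one tesseract run with the given --psm.

    Hypothetical helper: it mirrors the command line in this report.
    """
    cmd = ["tesseract"]
    if tessdata_dir is not None:
        cmd += ["--tessdata-dir", str(tessdata_dir)]
    # Use one output base per mode so the runs do not overwrite each other.
    cmd += [str(image), f"{out_base}_psm{psm}", "--psm", str(psm), "-l", lang]
    return cmd


def run_all_modes(image, out_base, **kwargs):
    """Run tesseract once per mode and collect the recognized text.

    Modes that fail or do not write a .txt file (for example the
    OSD-only mode 0) are recorded as None.
    """
    results = {}
    for psm in PSM_MODES:
        cmd = build_command(image, out_base, psm, **kwargs)
        proc = subprocess.run(cmd, capture_output=True, text=True)
        out_file = Path(f"{out_base}_psm{psm}.txt")
        if proc.returncode == 0 and out_file.exists():
            results[psm] = out_file.read_text(encoding="utf-8").strip()
        else:
            results[psm] = None
    return results
```

Calling `run_all_modes(r"d:\temp\input.png", r"d:\temp\output", tessdata_dir=r"C:\Program Files\Tesseract-OCR\tessdata")` returns a dictionary keyed by mode number, which makes it easy to see side by side whether any mode reads the second slash correctly.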
Suggested Fix:
Improve recognition accuracy for text that mixes digits, letters, and special characters.
Thank you for the work on the new release.
