Floating point exception when trying to OCR an image #3991

@MerlijnWajer

Description

Basic Information

tesseract 5.3.0-1-gd3a4
 leptonica-1.80.0
  libgif 5.2.1 : libjpeg 6b (libjpeg-turbo 2.0.6) : libpng 1.6.37+apng : libtiff 4.3.0 : zlib 1.2.11 : libopenjp2 2.4.0
 Found AVX2
 Found AVX
 Found FMA
 Found SSE4.1
 Found OpenMP 201511
 Found libarchive 3.6.1 zlib/1.2.11 liblzma/5.2.5 bz2lib/1.0.8
 Found libcurl/7.79.1 GnuTLS/3.7.3 (OpenSSL/1.1.1n) zlib/1.2.11 nghttp2/1.45.1

Operating System

No response

Other Operating System

Gentoo Linux, but also Ubuntu 18.04, 20.04, etc

uname -a

Linux gentoo-x13 5.11.7-gentoo-dist #1 SMP Wed Mar 17 21:03:41 -00 2021 x86_64 AMD Ryzen 7 PRO 4750U with Radeon Graphics AuthenticAMD GNU/Linux

Compiler

GCC 11.2.1

Virtualization / Containers

No response

CPU

AMD Ryzen 7 PRO 4750U with Radeon Graphics

Current Behavior

tesseract -c hocr_char_boxes=1 --dpi 400 ~/Downloads/UNI_1918030101_0003.jp2 - hocr
Floating point exception

Expected Behavior

I would expect Tesseract not to receive SIGFPE.

Suggested Fix

No response

Other Information

The problem doesn't occur if I decompress the JPEG2000 to a TIFF, so perhaps there is a problem with the JPEG2000 handling.

The image is here: https://archive.org/~merlijn/UNI_1918030101_0003.jp2
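
For anyone hitting the same crash, the TIFF workaround described above could be scripted roughly as follows. This is only a sketch under assumptions: the report does not say which tool was used for the decompression, so `opj_decompress` from OpenJPEG is assumed here, and the helper names (`decompress_cmd`, `tesseract_cmd`, `killed_by_sigfpe`, `ocr_with_fallback`) are made up for illustration. The Tesseract options are the ones from the failing command.

```python
import signal
import subprocess
from pathlib import Path


def decompress_cmd(jp2: Path, tiff: Path) -> list[str]:
    # Assumption: opj_decompress (OpenJPEG) is installed; it chooses the
    # output format from the output file's extension.
    return ["opj_decompress", "-i", str(jp2), "-o", str(tiff)]


def tesseract_cmd(image: Path) -> list[str]:
    # Same options as the failing command in this report; "-" sends hOCR to stdout.
    return ["tesseract", "-c", "hocr_char_boxes=1", "--dpi", "400",
            str(image), "-", "hocr"]


def killed_by_sigfpe(returncode: int) -> bool:
    # On POSIX, subprocess reports "terminated by signal N" as returncode -N.
    return returncode == -signal.SIGFPE


def ocr_with_fallback(jp2: Path) -> str:
    # Try the JPEG2000 directly first.
    first = subprocess.run(tesseract_cmd(jp2), capture_output=True, text=True)
    if not killed_by_sigfpe(first.returncode):
        first.check_returncode()  # raise on any other failure
        return first.stdout
    # SIGFPE: decompress to TIFF next to the input and retry on that.
    tiff = jp2.with_suffix(".tif")
    subprocess.run(decompress_cmd(jp2, tiff), check=True)
    second = subprocess.run(tesseract_cmd(tiff), capture_output=True,
                            text=True, check=True)
    return second.stdout


if __name__ == "__main__":
    print(ocr_with_fallback(Path("UNI_1918030101_0003.jp2")))
```

This only sidesteps the crash; it does not say anything about where in the JPEG2000 path the SIGFPE originates.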
