Skip to content

Segfault with new image and rus+mrz in recent main #4361

@marcreichman-pfi

Description

@marcreichman-pfi

Current Behavior

With the attached image, and recent main (9f17a3fd) I receive a segfault in Release (SIGILL in Debug) with the mrz and rus langages. Both are fast. Rus is official and mrz is available here

Release:

<snip>/tesseract-fork/cmake-build-release/bin/tesseract <snip>/62553353F7495226759F9644269AA485-sample_000101.jpg - --tessdata-dir <snip>/tessdata/ -l mrz+rus
Estimating resolution as 971

Process finished with exit code 139 (interrupted by signal 11:SIGSEGV)

Debug:

Estimating resolution as 971
ELIST2_ITERATOR::add_before_stay_put:Error:Attempting to add an element with non nullptr links, to a list

Process finished with exit code 132 (interrupted by signal 4:SIGILL)

new-segfault-20241125

Expected Behavior

No segfault.

Suggested Fix

No response

tesseract -v

tesseract 5.5.0-26-g9f17a
 leptonica-1.82.0
  libgif 5.1.9 : libjpeg 8d (libjpeg-turbo 2.1.1) : libpng 1.6.37 : libtiff 4.3.0 : zlib 1.2.11 : libwebp 1.2.2 : libopenjp2 2.4.0
 Found AVX
 Found SSE4.1
 Found OpenMP 201511

Operating System

Ubuntu 22.04 Jammy

Other Operating System

This is running via WSL but it does not appear related to the issue.

uname -a

Linux hostname 5.10.16.3-microsoft-standard-WSL2 #1 SMP Fri Apr 2 22:23:49 UTC 2021 x86_64 x86_64 x86_64 GNU/Linux

Compiler

GCC 11.4

CPU

Intel(R) Core(TM) i7-3720QM CPU @ 2.60GHz

Virtualization / Containers

No explicit VM except for WSL2

Other Information

This works in 5.5.0 tag

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions