@fintelia
Contributor
I've been working off-and-on to build better compression for fdeflate. This draft PR captures the current state of my experimentation. So far I've mainly focused on the core compression algorithms, so some other details aren't handled yet: streaming (at the moment the input is accumulated into a Vec and compressed on flush) and compressing more than 4 GB of data (which requires a bit of handling code for 32-bit indices rolling over).
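A minimal sketch of the accumulate-then-compress-on-flush behavior described above (the type and `compress` function are hypothetical stand-ins, not fdeflate's actual API):

```rust
use std::io::{self, Write};

/// Hypothetical encoder that buffers all input and only compresses on
/// flush, mirroring the draft's current non-streaming behavior.
struct BufferingEncoder<W: Write> {
    inner: W,
    buffer: Vec<u8>,
}

impl<W: Write> BufferingEncoder<W> {
    fn new(inner: W) -> Self {
        Self { inner, buffer: Vec::new() }
    }
}

impl<W: Write> Write for BufferingEncoder<W> {
    fn write(&mut self, data: &[u8]) -> io::Result<usize> {
        // Accumulate everything; nothing is emitted until flush.
        self.buffer.extend_from_slice(data);
        Ok(data.len())
    }

    fn flush(&mut self) -> io::Result<()> {
        let compressed = compress(&self.buffer);
        self.inner.write_all(&compressed)?;
        self.buffer.clear();
        self.inner.flush()
    }
}

// Placeholder for the actual DEFLATE compression step.
fn compress(input: &[u8]) -> Vec<u8> {
    input.to_vec()
}
```

True streaming support would instead compress incrementally as data arrives, which is what makes the 32-bit index rollover handling necessary.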

Like other compression libraries, this uses different algorithms depending on the compression level:

  • Level 0 uses a dedicated stored-block encoding path.
  • Levels 1-2 look for RLE matches of zero bytes and 8-byte matches via a hash table, based on lz4's compressor.
  • Levels 3-6 use a more traditional hash chain match finder with a minimum match length of 4 bytes. They also do greedy matching, but with look-ahead to "fizzle" matches, inspired by zlib-ng.
  • Levels 7+ are the least polished. They use a binary tree match finder with lazy matching. Much slower than zlib-rs's level 9, but with better compression.
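To illustrate the hash-table match finding used at the lower levels, here is a simplified sketch (not the PR's actual implementation): hash the 4 bytes at the current position, check the single candidate previously stored for that hash, and extend the match for as long as the bytes agree. The constants and function names are illustrative.

```rust
const HASH_BITS: u32 = 15;
const MIN_MATCH: usize = 4;

/// Multiplicative hash of the 4 bytes at `pos`, in the style of
/// lz4-like matchers.
fn hash4(data: &[u8], pos: usize) -> usize {
    let v = u32::from_le_bytes(data[pos..pos + 4].try_into().unwrap());
    (v.wrapping_mul(0x9E37_79B1) >> (32 - HASH_BITS)) as usize
}

/// Greedy match finder with a single-entry hash table.
/// Returns (position, distance, length) for each match found.
fn find_matches(data: &[u8]) -> Vec<(usize, usize, usize)> {
    let mut table = vec![usize::MAX; 1 << HASH_BITS];
    let mut matches = Vec::new();
    let mut pos = 0;
    while pos + MIN_MATCH <= data.len() {
        let h = hash4(data, pos);
        let candidate = table[h];
        table[h] = pos;
        if candidate != usize::MAX
            && data[candidate..candidate + MIN_MATCH] == data[pos..pos + MIN_MATCH]
        {
            // Extend the match as far as the bytes keep agreeing.
            let mut len = MIN_MATCH;
            while pos + len < data.len() && data[candidate + len] == data[pos + len] {
                len += 1;
            }
            matches.push((pos, pos - candidate, len));
            pos += len;
        } else {
            pos += 1;
        }
    }
    matches
}
```

The hash chain variant at levels 3-6 generalizes this by keeping a linked list of previous positions per hash bucket instead of a single slot, so multiple candidates can be tried before settling on a match.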

Results on raw PNG IDAT data produced by re-encoding the QOI benchmark suite images; the numbers are the geometric mean of per-image speed and compression ratio:

Encoder              Speed     Ratio
----------     -----------    ------
fdeflate[0]: 12529.7 MiB/s   100.02%
fdeflate[1]:   396.4 MiB/s    24.77%
fdeflate[2]:   229.6 MiB/s    24.58%
fdeflate[3]:   112.5 MiB/s    24.30%
fdeflate[4]:    71.4 MiB/s    23.94%
fdeflate[5]:    67.5 MiB/s    23.90%
fdeflate[6]:    38.2 MiB/s    23.53%
   ...
fdeflate[9]:    10.3 MiB/s    22.72%

See this comment with comparable measurements from other compressors.

