
@rafael-pissardo

🛠️ What’s inside this PR

  1. Memory-friendly streaming in ActiveJob::JobsRelation#each

    • each no longer delegates to to_a; it now streams jobs page by page, keeping only the current batch in memory (see the sketch after this list).
    • to_a was re-implemented to materialise and cache the collection only when explicitly requested, preserving backwards compatibility.
    • Delegated methods (last, [], reverse) now rely on the new to_a implementation.
  2. Compatibility kept intact

    • If the relation has already been materialised (@loaded_jobs present), each still uses the cached array.
    • Code that needs the old behaviour can simply call jobs.to_a before iterating.
  3. Test coverage

    • Added jobs_relation_memory_test to ensure that each no longer caches jobs and that the adapter is called exactly twice (data + termination).
    • Existing tests updated to reflect the new call count when caching is present.
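
Roughly, the new behaviour boils down to the sketch below. This is a simplified illustration, not the PR's actual diff: the class skeleton, `fetch_page` and `FakeAdapter` are assumed stand-ins for whatever paged call the real queue adapter exposes.

```ruby
class StreamingJobsRelation
  include Enumerable

  PAGE_SIZE = 1_000

  def initialize(adapter)
    @adapter = adapter
  end

  # Stream jobs page by page; only the current batch is held in memory.
  # If the relation was already materialised, reuse the cached array instead.
  def each(&block)
    return enum_for(:each) unless block_given?
    return @loaded_jobs.each(&block) if @loaded_jobs

    offset = 0
    loop do
      page = @adapter.fetch_page(offset, PAGE_SIZE) # one query per page
      break if page.empty?                          # final terminating query
      page.each(&block)
      offset += PAGE_SIZE
    end
  end

  # Materialise and cache the whole collection only on explicit request.
  def to_a
    @loaded_jobs ||= super() # Enumerable#to_a drives #each exactly once
  end

  # Random-access delegations rely on the (now cached) array.
  def last
    to_a.last
  end

  def [](index)
    to_a[index]
  end

  def reverse
    to_a.reverse
  end
end

# Tiny in-memory stand-in for a queue adapter, just to make the sketch runnable.
FakeAdapter = Struct.new(:jobs) do
  def fetch_page(offset, limit)
    jobs[offset, limit] || []
  end
end
```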

📊 Expected gains (default page_size = 1 000)

| Total jobs | Peak memory before | Peak memory after | Approx. reduction |
|-----------:|-------------------:|------------------:|------------------:|
| 10 000     | ~ 8 MB             | ~ 0.8 MB          | -90 %             |
| 100 000    | ~ 80 MB            | ~ 0.8 MB          | -99 %             |
| 500 000    | ~ 400 MB           | ~ 0.8 MB          | -99.8 %           |

Assumes an average job payload of ~0.8 kB, so a single 1 000-job page occupies roughly 0.8 MB regardless of the total queue size.

Backend calls

  • Before: ≈ 2 queries (data + termination).
  • After: ⌈N / 1 000⌉ + 1 queries (one per page, plus a final terminating call; see the quick check below).
    Example: 100 k jobs → from 2 to 101 queries.
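
A quick sanity check of that count, assuming the default page size:

```ruby
# Number of adapter queries for one full streaming pass:
# one query per page of jobs, plus one final query that returns an empty page.
page_size  = 1_000
total_jobs = 100_000

queries = total_jobs.fdiv(page_size).ceil + 1
puts queries # => 101 (was ~2 before this change)
```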

Wall-clock time

  • Benchmarks with Resque on a local Redis show full scans of 100 k jobs running ≤ 10 % slower; this is usually imperceptible and offset by lower GC pressure.

⚖️ Trade-offs & notes

  • More, but smaller, adapter queries; mitigated by lower deserialisation cost per request.
  • Results can differ between two separate each passes if the queue mutates in the meantime (this was already possible when refetching, but it now happens by default).
  • For workloads that intentionally iterate multiple times, call jobs = relation.to_a first to restore the cached behaviour (see the example below).
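
For illustration, a usage sketch that reuses the StreamingJobsRelation and FakeAdapter stand-ins from the earlier sketch (none of these names come from this PR):

```ruby
# Illustrative relation over 2 500 fake jobs.
relation = StreamingJobsRelation.new(FakeAdapter.new((1..2_500).to_a))

# Default after this PR: streaming iteration with constant memory;
# every pass re-queries the adapter (3 data pages + 1 terminating fetch here).
relation.each { |job| job }

# Opt-in caching: materialise once, then iterate repeatedly without
# further adapter calls.
jobs = relation.to_a          # fetches everything once and caches it
jobs.each { |job| job }       # plain Array iteration, no adapter involved
relation.last                 # delegations (last, [], reverse) hit the cache too
```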

🚀 TL;DR

This PR slashes peak RAM usage by up to two orders of magnitude when iterating over large job sets, while keeping the original API intact and offering an opt-in cache when needed.
