Add single epoch optimization #239

csharrison · 2025-08-11T13:29:16Z

This fixes #78 and partially addresses #212 . The algorithm uses the configured lookback window to determine automatically if the attribution is only considering impressions from the current epoch. If so, budget deduction is altered by considering the report's sensitivity to be |l1Norm| rather than 2 * |value|.

In both cases, we assume that the noise scale for Laplace noise is `lambda = |maxValue| / |epsilon|.

Note: An alternative considered was to add a new option to the API to query epochs and apply the optimization if only a single epoch was chosen, but using that API seems difficult, and at a minimum requires exposing the epoch start map to all conversion sites to use it effectively. Additionally, it adds API surface bloat.

Preview | Diff

csharrison · 2025-08-11T14:07:02Z

cc @alexanderknop for initial review.

api.bs

martinthomson · 2025-08-11T23:10:27Z

api.bs

+        1.  Let |impressions| be the result of invoking [=common matching logic=]
+            with |options|, |topLevelSite|, |intermediarySite|, |epoch|, and |now|.


So we have this problem, whereby we let the multi-epoch query avoid deductions unless it selects an impression from an epoch. But the single epoch query does not. I see that this is how it is implemented in psdlib, so I'm not contesting the conclusion, but it's annoying.

If a multi-epoch query only selects impressions from a single epoch, we deduct 2 * value * epsilon / maxValue. That's mostly OK, but that factor of 2 is a real challenge. It is not going to be obvious to people using this API that the epsilon value in their browser is only half of what they get for most queries. That is, the browser might set an epsilon budget of 4, but the site can only reasonably make queries with epsilon = 2 within that budget, unless they are careful to stay within a single epoch. Given the length of our epoch - and its random start offset - that will rarely be useful to them, so I don't expect it to happen.

So we have this problem, whereby we let the multi-epoch query avoid deductions unless it selects an impression from an epoch. But the single epoch query does not. I see that this is how it is implemented in psdlib, so I'm not contesting the conclusion, but it's annoying.

That is not the case. In the single epoch case we:

Return early if matchedImpressions is empty, deducting no budget (before invoking attribution logic)

If there are matched impressings, we deduct proportional to the actual realized L1 norm of the histogram, so even if we removed step (1) we would still deduct nothing (the l1 norm of the empty histogram is 0).

If a multi-epoch query only selects impressions from a single epoch, we deduct 2 * value * epsilon / maxValue. That's mostly OK, but that factor of 2 is a real challenge. It is not going to be obvious to people using this API that the epsilon value in their browser is only half of what they get for most queries. That is, the browser might set an epsilon budget of 4, but the site can only reasonably make queries with epsilon = 2 within that budget, unless they are careful to stay within a single epoch. Given the length of our epoch - and its random start offset - that will rarely be useful to them, so I don't expect it to happen.

Two things:

a. Without the single-budget optimization, we still need the factor of 2 increase, per some of the discussion in #212. This PR only makes this more apparent (since in the existing spec, to achieve the privacy guarantees the noise factor needs to be 2 * |maxValue| / |epsilon|.

b. I actually think there are plenty of use-cases where you will hit the single-budget opt. For instance, I think it is relatively common to have O(day) lookback windows for view throughs. In those cases, I think it is perfectly fine to "silently" optimize and deduct less than the expected budget.

Co-authored-by: Martin Thomson <[email protected]>

api.bs

Co-authored-by: Andrew Paseltiner <[email protected]>

apasel422

LGTM spec-wise; I defer the math part to you and Martin. I can implement this in the simulator in a followup PR.

csharrison · 2025-08-14T15:08:35Z

Thanks folks, I'll wait for review by @alexanderknop before landing.

api.bs

alexanderknop · 2025-08-15T02:28:54Z

The algorithms loogs great to me so other than renaming variables for clarity, I think this is a great!

Add single epoch optimization

be2aef0

csharrison marked this pull request as ready for review August 11, 2025 14:06

martinthomson approved these changes Aug 11, 2025

View reviewed changes

Apply suggestions from code review

eb8a1cf

Co-authored-by: Martin Thomson <[email protected]>

apasel422 requested changes Aug 14, 2025

View reviewed changes

csharrison and others added 2 commits August 14, 2025 08:31

Apply suggestions from code review

17bf263

Co-authored-by: Andrew Paseltiner <[email protected]>

Properly scope |histogram|

4d0fbf8

apasel422 approved these changes Aug 14, 2025

View reviewed changes

alexanderknop approved these changes Aug 15, 2025

View reviewed changes

api.bs Show resolved Hide resolved

api.bs Outdated Show resolved Hide resolved

api.bs Outdated Show resolved Hide resolved

replace l1Norm with attributedValueForSingleEpochOpt

a3d8244

csharrison merged commit 01d9d61 into main Aug 15, 2025
1 of 2 checks passed

csharrison deleted the single-opt branch August 15, 2025 18:52

apasel422 mentioned this pull request Aug 18, 2025

Implement single-epoch optimization in simulator #253

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add single epoch optimization #239

Add single epoch optimization #239

Uh oh!

csharrison commented Aug 11, 2025 •

edited by pr-preview bot

Loading

Uh oh!

csharrison commented Aug 11, 2025

Uh oh!

Uh oh!

Uh oh!

martinthomson Aug 11, 2025

Uh oh!

csharrison Aug 12, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

apasel422 left a comment

Uh oh!

csharrison commented Aug 14, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

alexanderknop commented Aug 15, 2025

Uh oh!

Uh oh!

Uh oh!

		1. Let \|impressions\| be the result of invoking [=common matching logic=]
		with \|options\|, \|topLevelSite\|, \|intermediarySite\|, \|epoch\|, and \|now\|.

Add single epoch optimization #239

Add single epoch optimization #239

Uh oh!

Conversation

csharrison commented Aug 11, 2025 • edited by pr-preview bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

csharrison commented Aug 11, 2025

Uh oh!

Uh oh!

Uh oh!

martinthomson Aug 11, 2025

Choose a reason for hiding this comment

Uh oh!

csharrison Aug 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

apasel422 left a comment

Choose a reason for hiding this comment

Uh oh!

csharrison commented Aug 14, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

alexanderknop commented Aug 15, 2025

Uh oh!

Uh oh!

Uh oh!

csharrison commented Aug 11, 2025 •

edited by pr-preview bot

Loading

csharrison Aug 12, 2025 •

edited

Loading