MSC4354: Sticky Events #4354

Conversation
It wasn't particularly useful for clients, and doesn't help much with equivocation.
Implementation requirements:
- Client (usage)
- Client (receive/handle)
- Server
The client implementations do not yet implement the latest version of this MSC; this is currently in progress.
@Half-Shot which parts are those specifically? A review of the implementations appears to show them setting things up in a mostly-correct way. (I have no context on what transpired on this MSC between proto-MSC and now)
There were changes from 4-tuples to 3-tuples in the key mapping, and then the requirement for the key mapping was removed entirely. This is now implemented in matrix-org/matrix-js-sdk#5028. I'm happier that the SDK side is plausible now.
We have tested local calls with this MSC and they seem to work fine, but not federated calls. I don't actually see the need to block on federated calls myself; the application layer should be happy.
The 3-tuples have become 4-tuples again, so I'll need to check that this all still works.
Failed to check in; this still works.
Co-authored-by: Johannes Marbach <[email protected]>
Co-authored-by: Travis Ralston <[email protected]>
@mscbot resolve Unclear if addendum is normative for spec process purposes
Have split the comments into threads (#4354 (comment))
> To implement these properties, servers MUST:
>
> * Attempt to send their own[^origin] sticky events to all joined servers, whilst respecting per-server backoff times.
Moving from #4354 (comment)
> The lack of atomicity in `/send` means clients may flicker RTC member state (update to old values, then immediately to newer values). This happens today too with state events, but less often.
In Synapse this will be especially slow: when we process each sticky event we go and fetch the previous 10 events and then query the state (assuming a large enough gap). This doesn't happen for state, as we'll get the last event, calculate the state for that chunk, and atomically persist it. State flickering can happen if the server receives a chunk of events that contains a bunch of state changes, though empirically this is fairly rare.
> This doesn't happen for state, as we'll get the last event and calculate the state for that chunk and atomically persist it.
I don't follow this. If I send 50 PDUs all in the same room, a nice linear chain with no forks, we:
- treat all 50 PDUs as live (so will send them down /sync)
- calculate the state before each PDU (only the earliest incurring a state query hit)
- process each PDU atomically, but not the batch of 50.
So you will see flickering?
I think flickering of ephemeral per-user state is inevitable if we wish to hide the key we're modifying in the map from the server. It's definitely a security / UX tradeoff to make, though we've increasingly leant on the side of security for quite some time now. What would the implications be for flickering live-location shares or flickering RTC members? The former likely means the location is updated gradually as the server/client catch up. I think RTC members are reasonably static (they don't change mid-call), so flickering RTC members could make it appear that older members are joined to the call who then leave the call a few milliseconds later. Is this a problem for the call state machine? cc @toger5
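One client-side mitigation (a sketch, not something this MSC specifies): fold each `/sync` batch down to a single winner per `sticky_key` before rendering, so intermediate values never reach the UI and stale events can't overwrite a newer entry the client already holds. The `StickyEvent` shape and field names below are assumptions for illustration; the tie-break mirrors the rule quoted later in this thread (highest `origin_server_ts`, then lexicographically highest event ID).

```typescript
// Sketch: reduce a /sync batch of sticky events to one winner per key
// before touching the UI, so intermediate values never render.
// The StickyEvent shape is an assumption for illustration only.
interface StickyEvent {
  event_id: string;
  origin_server_ts: number;
  sticky_key: string; // assumed field name
  content: Record<string, unknown>;
}

// Tie-break per the MSC text quoted in this thread: highest
// origin_server_ts, then lexicographically highest event ID.
function newer(a: StickyEvent, b: StickyEvent): StickyEvent {
  if (a.origin_server_ts !== b.origin_server_ts) {
    return a.origin_server_ts > b.origin_server_ts ? a : b;
  }
  return a.event_id > b.event_id ? a : b;
}

/** Apply a whole batch at once: stale events can't overwrite a newer
 *  entry we already hold, and intermediate states never show. */
function applyBatch(map: Map<string, StickyEvent>, batch: StickyEvent[]): void {
  for (const ev of batch) {
    const current = map.get(ev.sticky_key);
    map.set(ev.sticky_key, current ? newer(current, ev) : ev);
  }
}
```

This doesn't remove flicker entirely (a genuinely newer event still replaces an older one across successive syncs), but it does stop a catch-up chunk from replaying the history one entry at a time.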
Obviously if someone sends 50 sticky events in quick succession then that will cause "flickering" as things come down live, but that reflects the reality that the state is flickering. That's totally fine.
However, if those 50 events happened over the course of an hour and you then see them as a flicker of state changes, that is a different thing. We have previously made efforts to avoid much flickering on clients.
> I think flickering of ephemeral per-user state is inevitable if we wish to hide the key we're modifying in the map from the server
Don't some of the encrypted state proposals allow encrypting the state key as well? Or potentially you could have a pointer to previous sticky events that get superseded, which are pulled in automatically (and if the server pulls them in then it knows not to treat them as "live")?
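To make the second idea concrete, a superseding sticky event could carry an explicit pointer to the event it replaces. The `m.supersedes` field, the event type, and the overall shape below are purely hypothetical and not part of this MSC:

```typescript
// Hypothetical shape only: a sticky event that points at the event it
// supersedes, so a server pulling the older event in automatically
// knows not to treat it as "live". Neither "m.supersedes" nor this
// event type is defined by the MSC.
const supersedingStickyEvent = {
  type: "org.example.rtc.member",             // hypothetical event type
  sticky: { duration_ms: 3_600_000 },         // sticky block per the MSC
  content: {
    "m.supersedes": "$older_sticky_event_id", // hypothetical pointer
    // ...the updated per-user payload goes here...
  },
};
```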
> To implement these properties, servers MUST:
>
> * Attempt to send their own[^origin] sticky events to all joined servers, whilst respecting per-server backoff times.
Moving from #4354 (comment)
> how does MatrixRTC handle push notifications for incoming calls? (tangential to this MSC but whatever)
The question is: do we want to use sticky events for MatrixRTC notifications, and if so will that make the flickering problem much more noticeable/problematic?
Naively it feels odd to me not to use sticky events for call notifications; e.g. I'd have thought you would want to be notified for all calls in a DM. If you don't use sticky events you could end up in a situation where you see the call in the UI but are not notified about it.
> To implement these properties, servers MUST:
>
> * Attempt to send their own[^origin] sticky events to all joined servers, whilst respecting per-server backoff times.
Moving from #4354 (comment)
> we will accumulate more forward extremities when catching up as we are now including sticky events in the initial batch of events when catching up. This is a concern, but having to deal with lots of forward extremities isn't a new concern.
One potential security concern here is that it makes it easier for users on one server to generate lots of extremities on another server, which can lead to performance issues in very large rooms. This only works when the connection between the two servers is down (e.g. the remote server is down).
> it makes it easier for users on one server to generate lots of extremities on another server
This is true today via message events surely? Like, I can trivially make lots of events and trickle every Nth message to cause forward extremity accumulation?
You can't as a user on the server, but yes the server can.
> users sending multiple events with the same `sticky_key`. To deterministically tie-break, clients which
> implement this behaviour MUST[^maporder]:
>
> - pick the one with the highest `origin_server_ts`,
With the text below we do try to mitigate any possible client desynchronization. It might be easier to just define the sticky map as:
"last to expire wins"
This way we actually prevent clients from diverging, and don't motivate client implementations to add additional checks for expiration on top of the `origin_server_ts` ordering.
If a client wants to update the sticky map, they are now forced to use the same (minus time passed) or greater expiration time, otherwise their event will not update other clients' local sticky maps.
This might help to reduce the text in the following section as well.
Suggested change:
- Old: pick the one with the highest `origin_server_ts`,
- New: pick the one with the highest `origin_server_ts + sticky.duration_ms`,
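To illustrate the difference between the two orderings (everything here is a sketch; only `origin_server_ts` and `sticky.duration_ms` come from the text quoted above):

```typescript
// Illustrative comparison of the two orderings for two events that
// share the same sticky_key.
interface StickyCandidate {
  event_id: string;
  origin_server_ts: number;
  sticky: { duration_ms: number };
}

// Current text: highest origin_server_ts wins.
function winnerBySentTime(a: StickyCandidate, b: StickyCandidate): StickyCandidate {
  return a.origin_server_ts >= b.origin_server_ts ? a : b;
}

// Suggested alternative: "last to expire wins", i.e. highest
// origin_server_ts + sticky.duration_ms. An update sent later but with
// a much shorter duration can lose here, which is what forces updaters
// to use an equal-or-greater remaining expiry.
function winnerByExpiry(a: StickyCandidate, b: StickyCandidate): StickyCandidate {
  const expiryA = a.origin_server_ts + a.sticky.duration_ms;
  const expiryB = b.origin_server_ts + b.sticky.duration_ms;
  return expiryA >= expiryB ? a : b;
}
```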
> If a client sends two sticky events in the same millisecond, the 2nd event may be replaced by the 1st if
> the event ID of the 1st event has a higher lexicographical event ID. To protect against this, clients should
> ensure that they wait at least 1 millisecond between sending sticky events.
This section should at some point mention "with the same `sticky_key`".
(This information can be guessed because "the 2nd event may be replaced by the 1st" is only the case if they have the same `sticky_key`, but the conclusion should include it explicitly imo.)
Suggested change (only the last line differs):
- Old: ensure that they wait at least 1 millisecond between sending sticky events.
- New: ensure that they wait at least 1 millisecond between sending sticky events with the same `sticky_key`.
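A sketch of the client-side guard the amended note implies, keyed per `sticky_key` as suggested. Note that `origin_server_ts` is assigned by the homeserver, so spacing sends out client-side only reduces the chance of a same-millisecond collision rather than eliminating it; the helper below is illustrative only.

```typescript
// Illustrative guard: avoid handing two sticky events with the same
// sticky_key to the send queue within the same millisecond.
const lastSendTs = new Map<string, number>();

async function sendStickySpaced(
  stickyKey: string,
  send: () => Promise<void>,
): Promise<void> {
  const last = lastSendTs.get(stickyKey);
  if (last !== undefined && Date.now() - last < 1) {
    // Same millisecond as the previous send for this key: wait it out.
    await new Promise((resolve) => setTimeout(resolve, 1));
  }
  lastSendTs.set(stickyKey, Date.now());
  await send();
}
```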
> - pick the one with the highest `origin_server_ts`,
> - tie break on the one with the highest lexicographical event ID (A < Z).
Redaction behaviour needs specifying.
> Sticky events are expected to be encrypted and so there is no "state filter" equivalent provided for sticky events
What does "state filter" refer to? I don't see that phrase anywhere in the C-S spec. Is it referring to https://spec.matrix.org/unstable/client-server-api/#filtering ?
On the topic of filtering, should events from ignored users be dropped?
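If the answer to the second question is yes, the existing `m.ignored_user_list` account data would presumably be the thing to consult. A minimal sketch of what client-side dropping could look like (whether it should happen at all is exactly the open question):

```typescript
// Sketch only: drop sticky events from ignored users before applying
// them, using the standard m.ignored_user_list account data shape.
interface IgnoredUserList {
  ignored_users: Record<string, object>;
}

function shouldApplySticky(
  ev: { sender: string },
  ignoredList?: IgnoredUserList,
): boolean {
  return !ignoredList || !(ev.sender in ignoredList.ignored_users);
}
```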
> Servers may send every event as a sticky event, causing a higher amount of events to be sent eagerly over federation
> and to be sent down `/sync` to clients. The former is already an issue as servers can simply `/send` many events.
> The latter is a new abuse vector, as up until this point the `timeline_limit` would restrict the amount of events
Unbounded number of sticky events in Sliding Sync response
The current Simplified Sliding Sync extension implementation in Synapse will return all unexpired sticky events in the room on initial sync. It is also unbounded for incremental syncs.
For example, it doesn't seem fine to return 100k+ sticky events. The problem is the amount of work and time needed to come up with the giant 100k response and the amount of network effort to get that back to the client. It's the same reason why we have a `timeline_limit` for timeline events. One may say that this won't happen because of rate limits, etc., but it seems possible for a large public event room. Anyone can send a sticky event and we already have examples of large rooms.
Adding a limit
At a minimum, I think we need some limit and a new endpoint to paginate further.
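To make that concrete, the response shape could look something like the sketch below; the `limited` flag, the token field, and the endpoint named in the comment are all hypothetical placeholders, not text from this MSC:

```typescript
// Hypothetical shapes only: a capped sticky-events section in the sync
// response plus an opaque token for paginating the remainder.
interface StickyEventsSection {
  events: object[];    // at most `limit` unexpired sticky events
  limited: boolean;    // true if further sticky events were withheld
  prev_batch?: string; // token for a follow-up pagination request, e.g.
                       // GET /rooms/{roomId}/sticky_events?from={prev_batch}
                       // (endpoint name and parameters hypothetical)
}
```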
Going further
We're tackling similar problems across a few MSCs now:
- `thread_subscriptions`: MSC4308: Thread Subscriptions extension to Sliding Sync #4308
- `threads`: MSC4360: Sliding Sync threads extension #4360
- `sticky_events`: (this MSC)
This feels like the exact same problem as the threads extension. In both of these cases, we are trying to tackle the same problem of gathering a subset of the events in the timeline. Here is what I previously wrote about it:
> I think the better way to go about it is to apply the same pattern that we just worked through with thread subscriptions. We would need a few things:
>
> - A way to return the new sticky events in `/sync`
> - A dedicated `sticky_events_limited` flag when there are other sticky events that we didn't return
> - A pagination endpoint for the sticky events
>
> To be more detailed, these could be normal `timeline` events. If there are more sticky events in between the given sync token and the current position, we set the `sticky_events_limited` flag. For the pagination endpoint, we could overload `/messages` with a new filter.
With some more thinking on the subject: the dedicated Sliding Sync extension worked well for `thread_subscriptions` because thread subscriptions aren't already part of the response, whereas sticky events and thread updates are already part of the timeline.
To break down the list of things needed (as listed above) and my current thinking:
"A way to return the new sticky events in
/sync"In the case of threads and sticky events, these events are already included as part of the
timelineand/syncalready keeps you up to date with all of the events."A pagination endpoint for the sticky events"
The dedicated [pagination endpoint] makes sense to back-paginate the gaps (whenever the
timelineislimited: true)"Dedicated
sticky_events_limitedflag when there are other sticky events that we didn't return"With more thinking, I think this one is optional.
We could have an extension that indicates whether [
sticky_events_limited] but this only saves a few extra requests in the cases where thetimelineislimited: truebut there weren't actually any new thread updates in the gap.
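A rough client-side sketch of that flow: `/messages` and its `filter` query parameter exist in the C-S API today, but the filter field used here (`org.example.sticky_only`) is hypothetical and would need to be defined by this or a companion MSC.

```typescript
// Sketch of the proposed pattern: sticky events arrive via the normal
// timeline; when a sync response is gappy (limited: true), back-paginate
// with a filtered /messages call to recover sticky events from the gap.
async function fillStickyGap(
  baseUrl: string,
  accessToken: string,
  roomId: string,
  prevBatch: string,
): Promise<object[]> {
  const filter = JSON.stringify({
    // Hypothetical filter field a future MSC would need to define:
    "org.example.sticky_only": true,
  });
  const url =
    `${baseUrl}/_matrix/client/v3/rooms/${encodeURIComponent(roomId)}/messages` +
    `?dir=b&from=${encodeURIComponent(prevBatch)}` +
    `&filter=${encodeURIComponent(filter)}&limit=100`;
  const res = await fetch(url, {
    headers: { Authorization: `Bearer ${accessToken}` },
  });
  if (!res.ok) throw new Error(`messages request failed: ${res.status}`);
  const body = await res.json();
  return body.chunk as object[]; // events from the gap, filtered server-side
}
```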
Previous piecemeal discussions:
Rendered
SCT Stuff:
- FCP tickyboxes
- MSC checklist