[CAS] Improve MappedFileRegionBumpPtr #11269

cachemeifyoucan · 2025-08-27T22:05:55Z

Improve MappedFileRegionBumpPtr so it can handle being opened with different capacities.

cachemeifyoucan · 2025-08-27T22:06:08Z

@swift-ci please test llvm

llvm/lib/CAS/MappedFileRegionBumpPtr.cpp

llvm/include/llvm/CAS/MappedFileRegionBumpPtr.h

benlangmuir · 2025-09-03T17:57:31Z

llvm/lib/CAS/MappedFileRegionBumpPtr.cpp

+    assert(InitLock.Locked == sys::fs::LockKind::Exclusive);
+    // If the BumpPtr larger than or equal to the size of the file (it can be
+    // larger if process is terminated when the out of memory allocation
+    // happens) and smaller than capacity, this was shrunken by a previous


If Result.H->BumpPtr == FileSize->Size we don't know if it's safe to resize the file. The "expected" case is that this happens when we truncated the file during the last close and now reopen it. But it could also happen if the CAS is already open with a smaller capacity, but has allocated all of that smaller capacity. The assert above is correct that we have "exclusive access" on the InitLock, but that only tells us that we are not racing to initialize it, not whether it is currently open (only exclusively locking SharedLockFD like we do in destroyImpl can determine that).

One possible fix would be to try to lock SharedLockFD exclusively here and if that succeeds we resize but if it fails we don't. The downside is we would have to do that almost every time we reopen the CAS, making it more expensive by several syscalls (unlock, try-lock exclusive, unlock, lock shared), and it would need careful handling since the file could have been grown and re-shrunk during the window where we don't hold the shared lock.

Maybe there is a better fix I haven't thought of.

I don't have a good fix for that so.... I am reimplemented it again with a even stricter model.

cachemeifyoucan · 2025-09-04T22:51:13Z

@swift-ci please test llvm

Improve MappedFileRegionBumpPtr so it can handle being opened with different capacities. Mismatching different capacities and header offsets will result in error on creation.

cachemeifyoucan · 2025-09-04T23:44:58Z

@swift-ci please test llvm

benlangmuir

I think storing the capacity in the lock file is not ideal. This hides information that is now necessary for using the CAS in a secondary location. It also complicates opening the file by needing to read an additional file first. I think it would be better to just store the capacity in the header and not worry about the header offset changing. We can do a pread at the header location if we want to avoid needing to fix up the mmap size.

benlangmuir · 2025-09-05T17:27:22Z

llvm/lib/CAS/MappedFileRegionBumpPtr.cpp

+
+  // Return true if succeed to lock the file exclusively.
+  bool tryLockExclusive() {
+    assert(!Locked && "can only try to lock if not locked");


This is inconsistent between lock and tryLock.... I suggest we revert the change to lock and make the caller unlock first if necessary for consistency.

benlangmuir · 2025-09-05T17:31:45Z

llvm/lib/CAS/MappedFileRegionBumpPtr.cpp

+
+  // Release the lock so it will not be unlocked on destruction.
+  void release() {
+    Locked = std::nullopt;


I would also set FD = -1 so that you cannot lock it again by mistake.

benlangmuir · 2025-09-05T17:42:58Z

llvm/lib/CAS/MappedFileRegionBumpPtr.cpp

+  uint64_t ExpectedOffset = support::endian::read<uint64_t, endianness::little>(
+      ConfigBuffer.data() + sizeof(uint64_t));
+
+  if (ExpectedCapacity != Capacity)


I'm leaning towards ignoring this and just using the existing capacity if the CAS is already open.

cachemeifyoucan · 2025-09-05T18:26:42Z

This hides information that is now necessary for using the CAS in a secondary location.

I don't see why this is a problem?

I think it would be better to just store the capacity in the header and not worry about the header offset changing.

We could reserve a real header in the beginning of the file and header offset starts after the reserve space?

We can do a pread at the header location if we want to avoid needing to fix up the mmap size.

We don't have pread. I guess we can readNativeFileSlice, which is the closest we have.

benlangmuir · 2025-09-05T18:58:25Z

I don't see why this is a problem?

I find it conceptually bad to store information in the lock file that is not part of the locking mechanism. From a performance perspective it means we need to read an extra file when opening the CAS. I admit these are not huge issues.

We could reserve a real header in the beginning of the file and header offset starts after the reserve space?

This seems fine too, but might need more work to restructure the client code since their own header moves later.

We don't have pread. I guess we can readNativeFileSlice, which is the closest we have.

readNativeFileSlice is implemented using pread when available

cachemeifyoucan requested review from akyrtzi and benlangmuir August 27, 2025 22:05

cachemeifyoucan force-pushed the eng/PR-mapped-file-bump-ptr branch from c227290 to fe4b0e1 Compare August 28, 2025 20:36

benlangmuir reviewed Sep 2, 2025

View reviewed changes

cachemeifyoucan force-pushed the eng/PR-mapped-file-bump-ptr branch from fe4b0e1 to fa08de2 Compare September 2, 2025 17:42

benlangmuir reviewed Sep 3, 2025

View reviewed changes

cachemeifyoucan force-pushed the eng/PR-mapped-file-bump-ptr branch 2 times, most recently from 5f3599f to c50a1b2 Compare September 4, 2025 22:50

[CAS] Improve MappedFileRegionBumpPtr

dec5bac

Improve MappedFileRegionBumpPtr so it can handle being opened with different capacities. Mismatching different capacities and header offsets will result in error on creation.

cachemeifyoucan force-pushed the eng/PR-mapped-file-bump-ptr branch from c50a1b2 to dec5bac Compare September 4, 2025 23:44

benlangmuir reviewed Sep 5, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[CAS] Improve MappedFileRegionBumpPtr #11269

[CAS] Improve MappedFileRegionBumpPtr #11269

cachemeifyoucan commented Aug 27, 2025

Uh oh!

cachemeifyoucan commented Aug 27, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

benlangmuir Sep 3, 2025

Uh oh!

cachemeifyoucan Sep 4, 2025

Uh oh!

cachemeifyoucan commented Sep 4, 2025

Uh oh!

cachemeifyoucan commented Sep 4, 2025

Uh oh!

benlangmuir left a comment

Uh oh!

benlangmuir Sep 5, 2025

Uh oh!

benlangmuir Sep 5, 2025

Uh oh!

benlangmuir Sep 5, 2025

Uh oh!

cachemeifyoucan commented Sep 5, 2025

Uh oh!

benlangmuir commented Sep 5, 2025

Uh oh!

Uh oh!

[CAS] Improve MappedFileRegionBumpPtr #11269

Are you sure you want to change the base?

[CAS] Improve MappedFileRegionBumpPtr #11269

Conversation

cachemeifyoucan commented Aug 27, 2025

Uh oh!

cachemeifyoucan commented Aug 27, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

benlangmuir Sep 3, 2025

Choose a reason for hiding this comment

Uh oh!

cachemeifyoucan Sep 4, 2025

Choose a reason for hiding this comment

Uh oh!

cachemeifyoucan commented Sep 4, 2025

Uh oh!

cachemeifyoucan commented Sep 4, 2025

Uh oh!

benlangmuir left a comment

Choose a reason for hiding this comment

Uh oh!

benlangmuir Sep 5, 2025

Choose a reason for hiding this comment

Uh oh!

benlangmuir Sep 5, 2025

Choose a reason for hiding this comment

Uh oh!

benlangmuir Sep 5, 2025

Choose a reason for hiding this comment

Uh oh!

cachemeifyoucan commented Sep 5, 2025

Uh oh!

benlangmuir commented Sep 5, 2025

Uh oh!

Uh oh!