Optimize proc maps parsing code size #729

Noratrieb · 2025-09-23T19:26:54Z

The current code size is really wastefully large. Originally, it was 1500 lines of assembly in Godbolt, now I reduced it to just under 800. The effect of .text size in hello world is from 297028 to 295453 (measured with -Clto=fat -Copt-level=s with a normal sysroot) which is small but not completely irrelevant. It's just a small fish in the bigger pond of DWARF parsing, but it's better than nothing.

I extracted the parsing of each component into a separate function to allow for better sharing. I replaced the string methods with manual iteration since that results in simpler code because it has to handle fewer cases. I also had to use unsafe because the bounds checks were sadly not optimized out and were really large.

I also made the parser less resilient against whitespace, now it no longer handles Unicode whitespace (an obvious simplification) but also no longer handles any whitespace except the normal SP. I think this is fine, it seems highly unlikely that a system would suddenly use another type of whitespace (but I guess not impossible?).

Another simplification was simply removing the parsing of unused fields that were not needed.

I can split it into separate commits if that helps review and maybe some of these changes are more of a hassle than it's worth (while some others like the field removals are obviously good), but I'll let you choose.

workingjubilee · 2025-09-23T19:36:54Z

I think we can safely assume that procfs will only ever use '\x20', yes.

This will require careful review for the unsafe usage, but I am fairly happy to see it. I made a similar pass at this but wasn't quite happy with it.

Noratrieb · 2025-09-23T19:39:06Z

apparently it's normal that these tests fail, lol

workingjubilee · 2025-09-23T19:55:49Z

I am certainly treating the current macOS failures as nonblocking as it is very possible a rustc change is the underlying reason. I have opened #730

workingjubilee · 2025-09-23T20:20:37Z

I can split it into separate commits if that helps review

@Noratrieb If you can cleanly split the changes that micro-optimize the parsing using bytes and such from those that do straight reduction of code, like removing fields, into separate commits, that would be appreciated. Order doesn't matter. Though if they don't have a reasonably clean split then it doesn't matter and it should remain one big mash.

The vec move is very funny, as moving it removes the vec drop from the early return, which is fairly irrelevant but more than 0 bytes.

The current code size is really wastefully large. Originally, it was 1500 lines of assembly in Godbolt, now I reduced it to just under 800. The effect of `.text` size in hello world is from 297028 to 295453 which is small but not completely irrelevant. It's just a small fish in the bigger pond of DWARF parsing, but it's better than nothing. I extracted the parsing of each component into a separate function to allow for better sharing. I replaced the string methods with manual iteration since that results in simpler code because it has to handle fewer cases. I also had to use unsafe because the bounds checks were sadly not optimized out and were really large. I also made the parser less resilient against whitespace, now it no longer handles Unicode whitespace (an obvious simplification) but also no longer handles any whitespace except the normal SP. I think this is fine, it seems highly unlikely that a system would suddenly use another type of whitespace (but I guess not impossible?).

workingjubilee · 2025-09-27T19:44:55Z

src/symbolize/gimli/parse_running_mmaps_unix.rs

        .read_to_string(&mut buf)
        .map_err(|_| "Couldn't read /proc/self/maps")?;
+
+    let mut v = Vec::new();


kinda insane that this matters that much

it's only a tiny improvement, so that much is a bit of an overstatement :). actually i don't have numbers on this one, but I did see the call to drop_in_place before but not after.

workingjubilee · 2025-09-27T19:45:51Z

src/symbolize/gimli/parse_running_mmaps_unix.rs

+       fn error(msg: &str) -> &str {
+            if cfg!(debug_assertions) {
+                msg
+            } else {
+                "invalid map entry"
+            }
+        }


yeah this makes sense.

workingjubilee · 2025-09-27T19:48:41Z

src/symbolize/gimli/parse_running_mmaps_unix.rs

+        // While there are nicer standard library APIs available for this, we aim for minimal code size.
+
+        let mut state = s;
+
+        fn parse_start<'a>(state: &mut &'a str) -> &'a str {
+            // Unsafe is unfortunately necessary to get the bounds check removed (for code size).
+
+            let start_idx = state.bytes().position(|b| b != b' ');
+            if let Some(start_idx) = start_idx {
+                // SAFETY: It comes from position, so it's in bounds.
+                //         It must be on a UTF-8 boundary as it's the first byte that isn't ' '.
+                *state = unsafe { state.get_unchecked(start_idx..) };
+            }
+            let match_idx = state.bytes().position(|b| b == b' ');
+            match match_idx {
+                None => {
+                    let result = *state;
+                    *state = "";
+                    result
+                }
+                Some(match_idx) => {
+                    // SAFETY: match_index comes from .bytes().position() of an ASCII character,
+                    //         so it's both in bounds and a UTF-8 boundary
+                    let result = unsafe { state.get_unchecked(..match_idx) };
+                    // SAFETY: Since match_idx is the ' ', there must be at least the end after it.
+                    *state = unsafe { state.get_unchecked((match_idx + 1)..) };
+                    result
+                }
+            }
+        }


@hkBst This is an example of the kind of microoptimization that is both very hard to do in the compiler and also can reap significant benefits since here we care mostly about code size, even in the dead code, as this code size is essentially multiplied by all Rust binaries.

@workingjubilee Thanks for thinking of me, although I'm not quite sure why you did.

Noratrieb · 2025-10-31T18:26:44Z

@workingjubilee is this still in your queue or did this fall through the cracks?

workingjubilee · 2025-11-04T19:39:47Z

src/symbolize/gimli/parse_running_mmaps_unix.rs

+        let line = unsafe { buf.get_unchecked(..match_idx) };
+
        v.push(line.parse()?);
+
+        // SAFETY: match_idx is the position of the newline, so the byte after it must be valid.
+        buf = unsafe { buf.get_unchecked((match_idx + 1)..) };


Ah, the reason I haven't already approved-and-merged this is because I wanted to doublecheck (maybe triplecheck by now) all the get_unchecked instances. I'll do that.

What's the proof that there's more content after a newline? Or is it simply "get_unchecked correctly returns an empty string in this situation"

Yeah,

const SOMETHING: &[u8] = b"something\n"; fn main() { println!("{:?}", SOMETHING.get(SOMETHING.iter().position(|b| *b == b'\n').unwrap() + 1..)); }

prints Some([]) so yeah we should be good but the comment is indeed slightly misleading.

workingjubilee

the question of the edge of the slice can be confusing to interact with given slice.get_unchecked(slice.len()..slice.len()) is valid but slice.get_unchecked(slice.len()) is not! some possible alternate wordings

workingjubilee · 2025-11-05T04:26:28Z

src/symbolize/gimli/parse_running_mmaps_unix.rs

+
        v.push(line.parse()?);
+
+        // SAFETY: match_idx is the position of the newline, so the byte after it must be valid.


Suggested change

// SAFETY: match_idx is the position of the newline, so the byte after it must be valid.

// SAFETY: match_idx is the position of the newline, so slicing after it is valid

// but may yield `&[]` if that was the last byte.

workingjubilee · 2025-11-05T04:28:38Z

src/symbolize/gimli/parse_running_mmaps_unix.rs

+                    // SAFETY: match_index comes from .bytes().position() of an ASCII character,
+                    //         so it's both in bounds and a UTF-8 boundary
+                    let result = unsafe { state.get_unchecked(..match_idx) };
+                    // SAFETY: Since match_idx is the ' ', there must be at least the end after it.


Suggested change

// SAFETY: Since match_idx is the ' ', there must be at least the end after it.

// SAFETY: Since match_idx is the ' ', there must be at least the `len..len` slice after it

workingjubilee closed this Sep 25, 2025

workingjubilee reopened this Sep 25, 2025

Noratrieb added 3 commits September 26, 2025 20:48

Remove unused fields from /proc/maps parsing code

344344b

Micro-optimize code size of /proc/maps parsing code

311a1d0

The vec move is very funny, as moving it removes the vec drop from the early return, which is fairly irrelevant but more than 0 bytes.

Noratrieb force-pushed the optimize-proc-self-map-size branch from b13e32d to 0b9eeb1 Compare September 26, 2025 18:54

workingjubilee reviewed Sep 27, 2025

View reviewed changes

workingjubilee reviewed Nov 4, 2025

View reviewed changes

workingjubilee reviewed Nov 5, 2025

View reviewed changes


		v.push(line.parse()?);

		// SAFETY: match_idx is the position of the newline, so the byte after it must be valid.

	// SAFETY: match_idx is the position of the newline, so the byte after it must be valid.
	// SAFETY: match_idx is the position of the newline, so slicing after it is valid
	// but may yield `&[]` if that was the last byte.

	// SAFETY: Since match_idx is the ' ', there must be at least the end after it.
	// SAFETY: Since match_idx is the ' ', there must be at least the `len..len` slice after it

Optimize proc maps parsing code size #729

Are you sure you want to change the base?

Optimize proc maps parsing code size #729

Conversation

Noratrieb commented Sep 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

workingjubilee commented Sep 23, 2025

Uh oh!

Noratrieb commented Sep 23, 2025

Uh oh!

workingjubilee commented Sep 23, 2025

Uh oh!

workingjubilee commented Sep 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Noratrieb commented Oct 31, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

riking Nov 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

workingjubilee left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Noratrieb commented Sep 23, 2025 •

edited

Loading

workingjubilee commented Sep 23, 2025 •

edited

Loading

riking Nov 5, 2025 •

edited

Loading