Implement tofile on tensors to reduce data write time by 40% #210

justinchuby · 2025-10-03T23:35:49Z

TODO: tests

This PR introduces the tofile method on tensors (similarly named as the one on numpy arrays), which allows for faster write and lower memory usage on external data by bypassing tobytes().

Compatibility with existing TensorProtocols is maintained in the external data module by using tofile only when it is available in the class. The TorchTensor class in PyTorch exporter should be updated accordingly to leverage the new logic when saving.

Note that io time to disk is reduced by 40% below.

Reference: https://github.com/microsoft/onnxscript/pull/2241/files/b2381658492510a9bcc8c0a8574db7368e33bceb

Before:

________________________________________________________
Executed in   48.08 secs    fish           external
   usr time   60.54 secs    0.00 millis   60.54 secs
   sys time   23.06 secs    1.22 millis   23.06 secs

After:

________________________________________________________
Executed in   45.69 secs    fish           external
   usr time   60.68 secs  244.00 micros   60.68 secs
   sys time   22.22 secs  518.00 micros   22.22 secs

Fix #207

Signed-off-by: Justin Chu <[email protected]>

codecov · 2025-10-03T23:36:58Z

Codecov Report

❌ Patch coverage is 55.10204% with 22 lines in your changes missing coverage. Please review.
✅ Project coverage is 76.57%. Comparing base (feb51e5) to head (40cb60d).
✅ All tests successful. No failed tests found.

Files with missing lines	Patch %	Lines
src/onnx_ir/_core.py	48.57%	14 Missing and 4 partials ⚠️
src/onnx_ir/external_data.py	66.66%	1 Missing and 1 partial ⚠️
src/onnx_ir/tensor_adapters.py	75.00%	2 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #210      +/-   ##
==========================================
- Coverage   76.83%   76.57%   -0.27%     
==========================================
  Files          40       40              
  Lines        4922     4965      +43     
  Branches      980      989       +9     
==========================================
+ Hits         3782     3802      +20     
- Misses        856      873      +17     
- Partials      284      290       +6

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

Signed-off-by: Justin Chu <[email protected]>

justinchuby · 2025-10-04T00:58:05Z

cc @iksnagreb

sonarqubecloud · 2025-10-04T18:56:40Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

justinchuby · 2025-10-06T21:46:12Z

src/onnx_ir/tensor_adapters.py

+        # Implement tobytes to support native PyTorch types so we can use types like bloat16
+        # Reading from memory directly is also more efficient because
+        # it avoids copying to a NumPy array
+        _, address = self._get_data_chunk()


Rename variable to be more accurate. This should be a list of some py objects.

justinchuby added 5 commits October 3, 2025 15:55

Fix endian

82b3f58

Signed-off-by: Justin Chu <[email protected]>

nvm

42d8edc

Signed-off-by: Justin Chu <[email protected]>

More implementations

63310c1

Signed-off-by: Justin Chu <[email protected]>

tofile

290ab6c

Signed-off-by: Justin Chu <[email protected]>

hasattr

1b53a6a

Signed-off-by: Justin Chu <[email protected]>

justinchuby added 5 commits October 3, 2025 17:19

tofile!

c05e189

Signed-off-by: Justin Chu <[email protected]>

write

6377435

Signed-off-by: Justin Chu <[email protected]>

always write numpy

3dc5704

Signed-off-by: Justin Chu <[email protected]>

Maintain reference

7fd35d7

Signed-off-by: Justin Chu <[email protected]>

Merge branch 'main' into justinchu/write

40cb60d

justinchuby marked this pull request as ready for review October 4, 2025 00:44

justinchuby requested review from titaiwangms and a team as code owners October 4, 2025 00:44

justinchuby requested a review from gramalingam October 4, 2025 00:44

justinchuby added the module: api label Oct 4, 2025

justinchuby mentioned this pull request Oct 4, 2025

Be smarter about torch tensors jambayk/torch-onnx-models#43

Merged

justinchuby added this to the 0.1.11 milestone Oct 4, 2025

justinchuby changed the title ~~Implement tofile on tensors~~ Implement tofile on tensors to reduce data write time by 40% Oct 6, 2025

justinchuby commented Oct 6, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Implement tofile on tensors to reduce data write time by 40% #210

Implement tofile on tensors to reduce data write time by 40% #210

Uh oh!

justinchuby commented Oct 3, 2025 •

edited

Loading

Uh oh!

codecov bot commented Oct 3, 2025 •

edited

Loading

Uh oh!

justinchuby commented Oct 4, 2025

Uh oh!

sonarqubecloud bot commented Oct 4, 2025

Uh oh!

justinchuby Oct 6, 2025

Uh oh!

Uh oh!

Implement tofile on tensors to reduce data write time by 40% #210

Are you sure you want to change the base?

Implement tofile on tensors to reduce data write time by 40% #210

Uh oh!

Conversation

justinchuby commented Oct 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

TODO: tests

Uh oh!

codecov bot commented Oct 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

justinchuby commented Oct 4, 2025

Uh oh!

sonarqubecloud bot commented Oct 4, 2025

Quality Gate passed

Uh oh!

justinchuby Oct 6, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

justinchuby commented Oct 3, 2025 •

edited

Loading

codecov bot commented Oct 3, 2025 •

edited

Loading