Improve consistency of default engine and return memoryview instead of bytes from to_netcdf() #10656

shoyer · 2025-08-19T19:15:37Z

This PR introduces two breaking changes:

The default backend engine used by Dataset.to_netcdf and DataTree.to_netcdf is now chosen consistently with open_dataset and open_datatree, using whichever netCDF libraries are available and valid, and preferring netCDF4 to h5netcdf to scipy. Previously, DataTree.to_netcdf was hard-coded to use scipy for writing to file-like objects or bytes, and DataTree.to_netcdf was hard-coded to use h5netcdf.
The return value of Dataset.to_netcdf without path is now a memoryview object instead of bytes. This removes an unnecessary memory copy and ensures consistency when using either engine="scipy" or engine="h5netcdf".

It also includes a minor bug-fix, raising an error when returning a memoryview with compute=False

Tests added
User visible changes (including notable bug fixes) are documented in whats-new.rst

This PR introduces a bug fix and a breaking changes: 1. The default backend ``engine`` used by `Dataset.to_netcdf` and `DataTree.to_netcdf` is now chosen consistently with `open_dataset` and `open_datatree`, using whichever netCDF libraries are available and preferring netCDF4 to h5netcdf to scipy. Previously, `DataTree.to_netcdf` was hard-coded to use h5netcdf. 2. The return value of `Dataset.to_netcdf` without ``path`` is now a ``memoryview`` object instead of ``bytes``. This removes an unnecessary memory copy and ensures consistency when using either ``engine="scipy"`` or ``engine="h5netcdf"``. Fixes pydata#10654

shoyer · 2025-08-19T19:22:07Z

xarray/backends/common.py

 @dataclass
-class BytesIOProxy(Generic[BytesOrMemory]):
-    """Proxy object for a write that returns either bytes or a memoryview."""
+class BytesIOProxy:


Note: I'm keeping around BytesIOProxy because we'll need it for #10624

github-actions bot added topic-backends topic-DataTree Related to the implementation of a DataTree class io labels Aug 19, 2025

shoyer added 2 commits August 19, 2025 12:16

Add PR number to whatsnew

3fd0de4

Consistently use BytesIOProxy

bfea52d

shoyer commented Aug 19, 2025

View reviewed changes

shoyer added 2 commits August 19, 2025 12:24

Fix test_engine

594b122

Clarify whats new

af7167b

shoyer mentioned this pull request Aug 19, 2025

Should Xarray prefer h5netcdf and scipy to netCDF4? #10657

Open

shoyer changed the title ~~Improve consistency and engine keyword argument for to_netcdf()~~ Improve consistency of default engine and return memoryview instead of bytes from to_netcdf() Aug 19, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Improve consistency of default engine and return memoryview instead of bytes from to_netcdf() #10656

Improve consistency of default engine and return memoryview instead of bytes from to_netcdf() #10656

Uh oh!

shoyer commented Aug 19, 2025 •

edited

Loading

Uh oh!

shoyer Aug 19, 2025

Uh oh!

Uh oh!

Uh oh!

Improve consistency of default engine and return memoryview instead of bytes from to_netcdf() #10656

Are you sure you want to change the base?

Improve consistency of default engine and return memoryview instead of bytes from to_netcdf() #10656

Uh oh!

Conversation

shoyer commented Aug 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

shoyer Aug 19, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

shoyer commented Aug 19, 2025 •

edited

Loading