NetCDF: HDF error when saving big array to netcdf4 with dask.distributed #10734
Unanswered
pingponghero12
asked this question in
Q&A
Replies: 2 comments 1 reply
-
It might be related to #10725 or #10736 ? I've been having similar issues since 2025.8.0 with the asynchronous update, you might have luck downgrading to 2025.7.0 until these bugs are sorted out, though it might be a bit slower. |
Beta Was this translation helpful? Give feedback.
1 reply
-
it seems to work with:
but I this makes it horrendously slow. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hello. I have problem when saving data to NETCDF4 using both netcdf4 and h5netcdf engines. The data is handled by dask.distributed. What I don't understand is that when I work on smaller data(cube of (60 or 600, 4500, 2000) chunked by 120 in last dimention) it works just fine, but for bigger data(cube of (700, 9000, 1000), chunked by 4 in last dim) some of the breakpoints I have fail at saving the NETCDF4.
The error:
And also at first at worker:
So from what I understand for some reason with this new data the engine backend fails to secure lock or sometimes file gets corrupted during writing.
Also when I skip save at this breakpoint and next one then there is no problem with writing. Those two that fail are relatively simple, rest is much longer and data is undersampled as well, so the size drop by around 3 times. The data that I work on takes approximately 32 GB of memmory, which is similar to what I have RAM.
I tried to save using this and other engine and also
NETCDF4_CLASSIC
but none helped.I opend client in following way:
Where 16 is also number of cores that I have avaible at the machine.
Could anyone help me understand why this error is happening and is it just a problem between the NETCDF saving and dask.distributed or something that I could fix easily.
I've tried to look for similar issues but I could not find any. This one is similar, but I get different error message and now the error is consistent, because for smaller data the program sometimes crashes in similar fassion but sometimes works.
dask/dask#11960
Beta Was this translation helpful? Give feedback.
All reactions