Canvas.raster should support distributed and out-of-core regridding of dask arrays #553

philippjfr · 2018-01-18T18:48:50Z

The Canvas.raster method up- or downsamples xarray DataArrays, which may wrap either a numpy or a dask array. Since dask arrays are often larger than available memory, it would be great if the resampling could be applied in a distributed and out-of-core manner. This requires loading the chunks one by one, up- or downsampling each of them, and then stitching the results back together. Technically it should be possible to do this with the new xr.apply_ufunc helper, but there are some things I haven't fully thought through yet. For example, during downsampling I believe you need some overlap between chunks, so that the aggregation at the edge of each chunk also covers the edge of the next chunk. A similar solution may be required for correct interpolation during upsampling.
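To make the overlap point concrete, here is a minimal NumPy-only sketch of chunk-by-chunk downsampling. It is not how Canvas.raster works: a plain block mean stands in for the aggregation, the helper names `block_mean` and `chunked_block_mean` are made up for illustration, and the array dimensions are assumed to be exact multiples of the downsampling factor. Each row chunk is extended to the next output-bin boundary, so the bin at the edge of a chunk also sees the rows that nominally belong to the next chunk.

```python
import numpy as np


def block_mean(arr, factor):
    """Downsample a 2D array by averaging non-overlapping factor x factor blocks.

    Assumes both dimensions of `arr` are exact multiples of `factor`.
    """
    h, w = arr.shape
    return arr.reshape(h // factor, factor, w // factor, factor).mean(axis=(1, 3))


def chunked_block_mean(arr, factor, chunk_rows):
    """Hypothetical helper: same result as block_mean, computed one row chunk at a time.

    `chunk_rows` is the nominal chunk height and need not be a multiple of
    `factor`. Each chunk is extended to the next multiple of `factor`, so
    every output bin is aggregated from complete input data.
    """
    h, w = arr.shape
    if h % factor or w % factor:
        raise ValueError("array dimensions must be multiples of factor")
    n_out = h // factor
    pieces = []
    out_row = 0  # index of the next output row still to be produced
    while out_row < n_out:
        start = out_row * factor
        nominal_stop = min(start + chunk_rows, h)
        # Round the end of the chunk up to a bin boundary (the "overlap").
        stop = min(-(-nominal_stop // factor) * factor, h)
        pieces.append(block_mean(arr[start:stop], factor))
        out_row = stop // factor
    return np.concatenate(pieces, axis=0)


# Usage: the chunked result should match the whole-array result.
data = np.random.default_rng(0).random((12, 8))
whole = block_mean(data, 4)
chunked = chunked_block_mean(data, 4, chunk_rows=5)
print(np.allclose(whole, chunked))
```

With a real dask array the chunks are fixed in advance rather than chosen on the fly, so the equivalent step would be to share boundary rows between neighbouring chunks before aggregating; the sketch only shows why the edge bins need data from both sides.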

This is a significant chunk of work, but it is a very well-defined task and would be very useful for very large arrays.

jbednar · 2018-02-09T18:10:38Z

The notebook at https://anaconda.org/philippjfr/tiling_demo/notebook shows part of how to do this manually, I think...

philippjfr · 2019-08-07T17:14:35Z

Implemented in #762

philippjfr added the enhancement label Jan 18, 2018

jbednar mentioned this issue Feb 9, 2018

Quick out-of-core downsampling to provide an overview of large data #560

Open

philippjfr closed this as completed Aug 7, 2019

darribas mentioned this issue Dec 9, 2020

Canvas().raster issues with out-of-core computation #975

Open
