
Behavior of reduce and apply_dimension #73

Closed · m-mohr opened this issue Sep 2, 2019 · 13 comments

m-mohr (Member) commented Sep 2, 2019

I realized that if you use apply_dimension, there are not many functions that can actually be used as the callback. The callback parameter has the data type array and the callback must return an array, but except for a few functions like cumsum, cummax etc. we don't have many processes that fit. One could for example be interested in applying absolute, but would need to reduce to a single dimension first, then call apply and then merge the cubes again, which seems a bit too complicated.
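To make the type mismatch concrete, here is a plain-Python sketch (not openEO code) of the two callback shapes involved; the function names are made up for illustration:

```python
import numpy as np

# Callback shape expected by apply_dimension: the whole array along one
# dimension goes in, an array comes out.
def cumulative_sum(values: np.ndarray) -> np.ndarray:
    return np.cumsum(values)

# Callback shape of absolute: a single number in, a single number out,
# which is why it fits apply (per value) but not apply_dimension (per array).
def absolute(x: float) -> float:
    return abs(x)

series = np.array([1.0, -2.0, 3.0])
print(cumulative_sum(series))   # [ 1. -1.  2.]
print(absolute(-2.0))           # 2.0
```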

m-mohr added the 'help wanted' and 'waiting' labels Sep 2, 2019
m-mohr added this to the v1.0 milestone Sep 2, 2019
mkadunc (Member) commented Sep 2, 2019

What did you mean by "applying absolute"?

m-mohr (Member, Author) commented Sep 2, 2019

@claxn created a process graph for GEE with apply_dimension and a callback that was just the absolute function, see https://github.com/Open-EO/openeo-earthengine-driver/blob/master/docs/s1.json. That is not valid, as apply_dimension's callback parameter is an array while absolute takes a number as its parameter.

Now I'm looking for a way out.

'Applying absolute' was therefore meant as apply with absolute as the callback, but seemingly just for a single dimension. Now that I'm thinking more about it, I'm not sure what @claxn was actually trying to do. Could you clarify, Claudio? It looks like it should maybe be apply instead?
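For reference, a minimal sketch of what the corrected graph would presumably look like with today's openEO Python client (which postdates this discussion); the back-end URL and collection id are placeholders:

```python
import openeo

connection = openeo.connect("https://example.openeo.test")  # placeholder back-end
cube = connection.load_collection("S1_GRD")                  # placeholder collection id

# apply: the callback receives each value individually, so absolute fits here.
abs_cube = cube.apply("absolute")

# apply_dimension: the callback receives the whole array along one dimension,
# so it needs an array -> array process such as cumsum.
cumsum_cube = cube.apply_dimension(process="cumsum", dimension="t")
```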

m-mohr added the 'question' label Sep 2, 2019
m-mohr (Member, Author) commented Sep 3, 2019

Discussed during a call with @claxn. The S1 example was wrong. The intended process was apply, not apply_dimension.

m-mohr closed this as completed Sep 3, 2019
mkadunc (Member) commented Sep 3, 2019

As for use-cases that need to operate on 1D-array partitions of the data cube, these are some off the top of my head:

  • smoothing along the temporal axis (some types of smoothing can be done with apply_kernel, but not all are representable that way; see the sketch after this list)
  • gap-filling no-data values in a (time) series from other available data
  • computing the (temporal) derivative of values (again, often representable with apply_kernel)
  • sorting values (e.g. to produce a cube ordered by pixel quality instead of time)
  • single-pixel atmospheric correction (fit a model to all bands of an image, then convert TOA to BOA reflectance)
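Two of these, gap-filling and smoothing along time, can be sketched in plain NumPy (not openEO code) to show why they need the full 1D array and keep its cardinality, which is exactly the array -> array shape apply_dimension offers:

```python
import numpy as np

# Toy (t, y, x) cube with one simulated no-data time step.
cube = np.random.rand(10, 4, 4)
cube[3, :, :] = np.nan

def fill_gaps(series):
    """Linearly interpolate no-data values in a 1D time series."""
    t = np.arange(series.size)
    missing = np.isnan(series)
    filled = series.copy()
    filled[missing] = np.interp(t[missing], t[~missing], series[~missing])
    return filled

def smooth(series, width=3):
    """Moving-average smoothing; output has the same length as the input."""
    kernel = np.ones(width) / width
    return np.convolve(series, kernel, mode="same")

filled = np.apply_along_axis(fill_gaps, 0, cube)
smoothed = np.apply_along_axis(smooth, 0, filled)
print(smoothed.shape)   # (10, 4, 4): cardinality preserved
```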

m-mohr reopened this Sep 3, 2019
m-mohr (Member, Author) commented Sep 3, 2019

Those are good use cases that we should check and try out. Maybe we need changes to the processes to make them work... Can anyone come up with example process graphs? I probably won't have time for it until late this year...

lforesta (Contributor) commented

So on the one hand the issue is that we don't have many processes which can be used as callbacks from apply_dimension, and on the other hand we should try to create process graphs for the examples proposed by @mkadunc, right?

About the first point, more processes will come up as we focus more on the defined use cases and other applications.
About the second, I don't have PGs yet, but one comment on the (time) derivative example: since the cardinality of the dimension is reduced by one, strictly speaking we can't use apply_dimension, no? apply_kernel as proposed is more flexible for this example.

mkadunc (Member) commented Sep 20, 2019

Re derivatives: cardinality gets reduced by one if the derivative is computed with a 2-point difference on all pairs of adjacent values, but the same process would also effectively move the positions of the samples on the time axis to the mid-point between the two values (at least that's where the 2-point derivative is most accurate).

Usually, the user would desire to keep the cardinality and positions along the time axis the same, which can be achieved:

  • with a symmetric central difference, using a kernel but extending the border — BTW (@m-mohr): The apply_kernel process description says that "resolution, cardinality and the number of dimensions are the same as for the original data cube"
  • with forward/backward difference formulae on the first and last values, and symmetric differences for 'interior' values (see the NumPy sketch after this list)
  • by fitting a model to the values (e.g. a linear combination of seasonal sine waves) and then replacing the measurements with the value of the fitted function at the same timestamps
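The second bullet can be made concrete with a small NumPy sketch (not openEO code); np.gradient implements exactly this scheme of central differences for interior samples and one-sided differences at the two ends, so the output keeps the input's cardinality and timestamps:

```python
import numpy as np

# Toy time series: values = t^2, so the true derivative is 2*t.
t = np.array([0.0, 1.0, 2.0, 3.0, 4.0])
values = t ** 2

# Central differences inside, forward/backward differences at the ends;
# same length and sample positions as the input.
derivative = np.gradient(values, t)
print(derivative)   # [1. 2. 4. 6. 7.]
```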

lforesta (Contributor) commented

@mkadunc I agree as long as any "extra" operation is declared in the process description.
In general it seems derivatives are mostly covered by apply_kernel (an exception is your third example), as you already pointed out earlier.

m-mohr (Member, Author) commented Oct 11, 2019

  • The apply_kernel process description says that "resolution, cardinality and the number of dimensions are the same as for the original data cube"

Yes, is there an issue with that? If we want to use apply_kernel for more things, we may need to change it, but so far I don't really understand what you actually want to change...

mkadunc (Member) commented Oct 28, 2019

The apply_kernel process description says that "resolution, cardinality and the number of dimensions are the same as for the original data cube"

Yes, is there an issue with that? If we want to use apply_kernel for more things, we may need to change it, but so far I don't really understand what you actually want to change...

The issue is that kernels will inevitably reduce cardinality unless the border is extended - we need to address this in the documentation somehow, e.g. either:

  1. drop the constraint that the cardinality stays the same, or
  2. define what the kernel yields when some part of the window is over the border (e.g. we could say that outside values are input as no_data), or
  3. define what the prescribed border-extension method is, or
  4. define how the user specifies their required border extension method

The data cubes that we're (virtually) dealing with are so huge that the border effect will mostly be negligible, so I'd go with 2 - the part of the window that is outside the (virtual) data cube will have no_data as its input.
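A minimal NumPy sketch of option 2 (not openEO code), treating everything outside the cube as no_data while keeping the cardinality; whether the no_data inputs are ignored (as below) or propagated is exactly the kind of detail the process description would have to spell out:

```python
import numpy as np

values = np.array([1.0, 2.0, 3.0, 4.0])
kernel = np.array([0.25, 0.5, 0.25])

# Pad with NaN ("no_data") so the kernel sees no_data outside the cube.
pad = len(kernel) // 2
padded = np.pad(values, pad, constant_values=np.nan)

# Ignore the no_data contributions at the border; the output length
# equals the input length, so cardinality is preserved.
out = np.array([np.nansum(padded[i:i + len(kernel)] * kernel)
                for i in range(len(values))])
print(out)   # [1.   2.   3.   2.75]
```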

m-mohr modified the milestones: v1.0, future Nov 26, 2019
m-mohr (Member, Author) commented Nov 26, 2019

So the point here is that the new reducer as proposed in #89 pretty much makes apply_dimension obsolete, but applying a sort in a reduce function is not very logical.

We discussed during the telco whether

  1. to remove apply_dimension and make reduce even more universal (+ a name change for reduce) or
  2. to limit the functionality of reduce (don't allow multiple return values / remove target_dimension) and make apply_dimension more universal (+ a name change for apply_dimension, could be renamed to apply_along_dimension).

The preferred alternative by most was to go for two separate functions. I'll issue a PR and let you all review.
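A conceptual NumPy sketch (not openEO code) of how the two processes would divide the work under alternative 2: reduce keeps only single-valued reducers and drops the dimension, while apply_dimension takes any array -> array process and keeps it:

```python
import numpy as np

cube = np.random.rand(10, 4, 4)        # (t, y, x)

# reduce: array -> single value per pixel, the dimension is dropped.
reduced = np.mean(cube, axis=0)
print(reduced.shape)                   # (4, 4)

# apply_dimension: array -> array per pixel, the dimension is kept
# (e.g. sorting along time, one of the use cases above).
applied = np.sort(cube, axis=0)
print(applied.shape)                   # (10, 4, 4)
```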

m-mohr added the 'accepted' label and removed the 'help wanted', 'waiting' and 'question' labels Nov 26, 2019
m-mohr modified the milestones: future, v1.0 Nov 26, 2019
m-mohr changed the title from 'apply_dimension callbacks' to 'Behavior of reduce and apply_dimension' Nov 26, 2019
m-mohr added the 'help wanted' and 'accepted' labels and removed the 'accepted' label Nov 26, 2019
m-mohr (Member, Author) commented Dec 13, 2019

Implementing point 2 also aligns better with aggregate_polygon, aggregate_temporal, resample_cube_temporal and merge_cubes, which all define reducers as processes returning a single value. So extrema, quantiles etc. should be used with apply_dimension, which is also where target_dimension will be moved to; this makes reduce simpler and might even help with (avoiding?) #94.
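In NumPy terms (again just a sketch, not openEO code), the split would look like this: quantiles along time yield several values per pixel and thus a new dimension, which is apply_dimension plus target_dimension territory, while a reducer like mean returns exactly one value per pixel:

```python
import numpy as np

cube = np.random.rand(10, 4, 4)                   # (t, y, x)

# apply_dimension with target_dimension: t is replaced by a new "quantiles" axis.
q = np.quantile(cube, [0.25, 0.5, 0.75], axis=0)  # shape (3, 4, 4)
print(q.shape)

# reduce: a single value per pixel, the dimension disappears.
m = np.mean(cube, axis=0)                         # shape (4, 4)
print(m.shape)
```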

m-mohr self-assigned this Dec 13, 2019
m-mohr removed the 'help wanted' label Dec 13, 2019
m-mohr added a commit that referenced this issue Dec 18, 2019
m-mohr (Member, Author) commented Dec 18, 2019

There's a PR up for discussion: #111
