Vector data cubes + processes #68

Closed
m-mohr opened this issue Aug 16, 2019 · 7 comments · Fixed by #382

Comments


m-mohr commented Aug 16, 2019

We have basically ignored vector data cubes and their processes until now and will need to add more of them in 1.0, which will be a major work package!

We also need to update existing processes that currently only support raster-cubes, and add vector-cube support to processes that currently only allow GeoJSON.

m-mohr added the help wanted and new process labels on Aug 16, 2019
m-mohr added this to the v1.0 milestone on Aug 16, 2019
m-mohr modified the milestones: v1.0, future on Sep 13, 2019

m-mohr commented Sep 13, 2019

Conclusion from the 3rd-year planning: this will not be tackled in the near future; we will add vector-related processes once they are required (e.g. filter_point for the Wageningen use case, see #37).

@jdries and @aljacob will explore this further and may define additional processes in the future. Also related is #2.

Currently, the processes always refer to "raster data cubes". Clarify (with @edzer?) whether it would be better to just call them data cubes and handle the "type" of cube internally.


m-mohr commented Nov 26, 2019

Telco: Still not needed at the moment.

I'll need to go through the processes for 1.0 and check whether the vector-cubes as used at the moment make sense. This also depends on #2.

m-mohr added a commit that referenced this issue on Nov 26, 2019
m-mohr modified the milestones: future, v1.0 on Nov 26, 2019
m-mohr removed the help wanted label on Nov 26, 2019
m-mohr added a commit that referenced this issue on Dec 2, 2019

m-mohr commented Dec 18, 2019

The recent comment from @mkadunc in #2 (comment) fits better here:

IMO the only thing we need to do in order to have vector-cubes support is allow objects as dimension labels (currently we only allow number, string, date, date-time and time). Then vector-cube is just a cube with simple-feature-geometry as dimension labels on the single spatial dimension. If we go for this approach, we already support vector cubes in all processes (but we treat the spatial dimension as nothing special).

We could also ignore vector-cubes altogether (for now), returning a raster cube with an ordinal dimension to encode the index of the corresponding polygon. This should be quite intuitive for the user...

What should we go for? At the moment we have the two types raster-cube (in most processes) and vector-cube (in a very limited set of processes), but we don't explain at all what the latter is or how it works.
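
To make the dimension-label idea above more concrete, here is a purely hypothetical sketch (not part of any spec) of cube metadata in the style of the STAC datacube extension, where the single spatial dimension carries simple-feature geometries as its labels. The dimension name "geometry", the use of "other" as its type, and the GeoJSON objects in values are all assumptions for illustration only:

    {
        "cube:dimensions": {
            "geometry": {
                "type": "other",
                "values": [
                    {"type": "Polygon", "coordinates": [[[5.0, 51.0], [5.1, 51.0], [5.1, 51.1], [5.0, 51.0]]]},
                    {"type": "Polygon", "coordinates": [[[5.2, 51.2], [5.3, 51.2], [5.3, 51.3], [5.2, 51.2]]]}
                ]
            },
            "t": {
                "type": "temporal",
                "extent": ["2021-06-01T00:00:00Z", "2021-07-01T00:00:00Z"]
            }
        }
    }

With such metadata, processes like filter or aggregate could address the geometry dimension like any other dimension, which is exactly the "nothing special" treatment described above.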

m-mohr added the help wanted label on Dec 18, 2019

m-mohr commented Jan 20, 2020

We only define vector-cube as part of aggregate_polygon and save_result and treat it as a "black box", so back-ends handle the transition. We'll dig into this again once it is needed.

m-mohr modified the milestones: v1.0-rc1, v1.0, future on Jan 20, 2020

m-mohr commented Jul 8, 2020

Recently, the question came up of how to support vector-cubes as input data for processes. In this case it was aggregate_spatial that needed to load more than GeoJSON. For reference, my answer:

[...]

Fortunately, openEO is extensible and you can add whatever you need. The simplest option is to modify the "geometries" parameter to allow other things to be loaded.

Allowing files is relatively easy. Replace:

        {
            "name": "geometries",
            "description": "Geometries as GeoJSON on which the aggregation will be based.",
            "schema": {
                "type": "object",
                "subtype": "geojson"
            }
        },

with:

        {
            "name": "geometries",
            "description": "Geometries as GeoJSON on which the aggregation will be based.",
            "schema": [
                {
                    "type": "object",
                    "subtype": "geojson"
                },
                {
                    "type": "string",
                    "subtype": "file-path"
                }
            ]
        },

and then it also allows specifying files uploaded to the user workspace. What can be read then depends on your implementation; input file formats should be exposed via GET /file_formats.
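
For reference, a minimal sketch of what a GET /file_formats response advertising vector input formats might look like; the format names, titles and empty parameters are back-end-specific and purely illustrative here:

    {
        "input": {
            "GeoJSON": {
                "title": "GeoJSON",
                "gis_data_types": ["vector"],
                "parameters": {}
            },
            "GPKG": {
                "title": "OGC GeoPackage",
                "gis_data_types": ["vector"],
                "parameters": {}
            }
        },
        "output": {}
    }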

A bit more complex, but probably the way we'd standardize it later, is to use load_uploaded_files. The issue here is that we haven't really thought about how vector-cubes would work, but you could change the return value of the load_uploaded_files process as follows:

    "returns": {
        "description": "A data cube for further processing.",
        "schema": [
            {
                "type": "object",
                "subtype": "raster-cube"
            },
            {
                "type": "object",
                "subtype": "vector-cube"
            }
        ]
    }

Now it supports loading vector data and returns it in a (virtual) vector data cube, which you can then accept in aggregate_spatial with the following definition for the geometries parameter:

        {
            "name": "geometries",
            "description": "Geometries as GeoJSON on which the aggregation will be based.",
            "schema": [
                {
                    "type": "object",
                    "subtype": "geojson"
                },
                {
                    "type": "object",
                    "subtype": "vector-cube"
                }
            ]
        },

Now you need to figure out how to pass the data between the processes, but as not much else can handle vector cubes yet, you can do that however works best internally.
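
As a rough sketch of how the pieces could be wired together, here is a hypothetical process graph that loads an uploaded vector file, loads a raster collection, and feeds both into the extended aggregate_spatial. The collection id, file name, format and band names are placeholders, and back-end support for vector-cube results of load_uploaded_files is assumed, as discussed above:

    {
        "process_graph": {
            "load_vector": {
                "process_id": "load_uploaded_files",
                "arguments": {
                    "paths": ["fields.gpkg"],
                    "format": "GPKG"
                }
            },
            "load_raster": {
                "process_id": "load_collection",
                "arguments": {
                    "id": "SENTINEL2_L2A",
                    "spatial_extent": null,
                    "temporal_extent": ["2021-06-01", "2021-09-01"],
                    "bands": ["B04", "B08"]
                }
            },
            "aggregate": {
                "process_id": "aggregate_spatial",
                "arguments": {
                    "data": {"from_node": "load_raster"},
                    "geometries": {"from_node": "load_vector"},
                    "reducer": {
                        "process_graph": {
                            "mean": {
                                "process_id": "mean",
                                "arguments": {"data": {"from_parameter": "data"}},
                                "result": true
                            }
                        }
                    }
                },
                "result": true
            }
        }
    }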


m-mohr commented Apr 12, 2021

As far as I understand, no use case in openEO Platform requires vector processes directly. Individual processes such as aggregate_spatial may be required and can be considered on a case-by-case basis. Nevertheless, it is listed as a separate requirement in the SoW.


edzer commented Dec 16, 2021

See also the confusion arising at #308.

A vector data cube is an n-D cube where (at least) one of the dimensions is associated with vector geometries (points, lines, polygons, or their multi-part variants). [Example figures for 3-D cubes]

Special, lower-dimensional cases:

  • one-D: if we have a single attribute associated with a set of geometries
  • two-D: if we have a single set of uniform (single-type) attributes associated with the geometries, e.g. NDVI for different times, or different spectral bands for a single moment in time.

A difficulty with this concept is that the vector data file formats we usually work with (those read/written by GDAL: shapefile, GeoPackage, GeoJSON, geodatabase) can only cover the two-D case; to use such formats we need to juggle the third dimension ("flatten" the cube somehow: either in wide form over the attribute space, or in long form by repeating the geometries). A format that can (properly) handle vector data cubes is NetCDF, e.g. this is an example of a multipolygon × time data cube in NetCDF.
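
As a small illustration of the two flattening strategies, the same polygons × time NDVI cube could be serialized either wide (one column per time step) or long (geometries repeated per time step). The geometry ids, dates and NDVI values below are made up, and in a real file the "geometry" field would hold the actual feature geometry rather than an id:

    {
        "wide": [
            {"geometry": "field_1", "NDVI_2021-06-01": 0.61, "NDVI_2021-07-01": 0.72},
            {"geometry": "field_2", "NDVI_2021-06-01": 0.55, "NDVI_2021-07-01": 0.68}
        ],
        "long": [
            {"geometry": "field_1", "time": "2021-06-01", "NDVI": 0.61},
            {"geometry": "field_1", "time": "2021-07-01", "NDVI": 0.72},
            {"geometry": "field_2", "time": "2021-06-01", "NDVI": 0.55},
            {"geometry": "field_2", "time": "2021-07-01", "NDVI": 0.68}
        ]
    }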
