🚸 Walkthrough on rasterizing vector polygons into label masks #31

weiji14 · 2022-07-18T04:34:33Z

Tutorial on creating vector label masks for an image segmentation task. Using flood water extent polygons digitized by UNITAR-UNOSAT over Johor, Malaysia on Sentinel-1 imagery taken 15 Dec 2019.

Preview at https://zen3geo--31.org.readthedocs.build/en/31/vector-segmentation-masks.html

TODO:

Setup blank datashader.Canvas from template xarray.DataArray input
Rasterize vector polygon labels onto blank canvas
Pair the Sentinel-1 and rasterized label inputs into a single datapipe and dataloader

References:

https://data.humdata.org/dataset/waters-extents-as-of-15-december-2019-over-kota-tinggi-and-mersing-district-johor-state-of

Initial draft tutorial on preparing vector labels for an image segmentation task. Using shapefiles of flood water extent digitized by UNITAR-UNOSAT over Johor, Malaysia on Sentinel-1 imagery taken 15 Dec 2019.

Fix readthedocs build failure because pyogrio was not installed.

New datashader DataPipe classes to use!

Picking a single Sentinel-1 image from 15 Dec 2019 over Johor, Malaysia corresponding to the mapped flood extent polygons. Perform reprojection from EPSG;4326 to UTM, and transform VV channel from linear to decibel. Did a bit of reorganizing to put the raster section before the vector section, may decide to change it later when the canvas part comes in.

Fix readthedocs `ERROR: Couldn't find cache key for notebook file vector-segmentation-labels.md`. Signed-off-by: Wei Ji <23487320+weiji14@users.noreply.github.com>

A comprehensive library for creating static, animated, and interactive visualizations in Python!

Point to https://docs.xarray.dev/en/latest/ instead.

Debugging why readthedocs build fails when reproducing the same environment and commands locally works.

See if this helps with fixing the KeyboardInterrupt when plotting the Sentinel-1 map using matplotlib, hopefully 5 minutes is more than enough.

Make it easier to debug errors when builds fail on Readthedocs, see https://jupyterbook.org/en/latest/content/execute.html#execution-tracebacks-in-the-terminal. Also reverting 1313d38 and ada85a8.

Read in the two shapefiles using PyogrioReaderIterDataPipe, and show a quick example of reprojecting and visualizing the vector polygons.

Using template Sentinel-1 xarray.DataArray grid to create blank datashader.Canvas. Show how the metadata information match! Also some small edits to the previous pyogrio section like fixing an incorrectly set projection system.

Show how the painting of vector geopandas.GeoDataFrame polygons onto the datashader.Canvas works! Got some words of caution and where to seek more advice. Also added spatialpandas to the 'docs' extras list of dependencies.

Simple Python interface for Graphviz!

Marie Kondo-ing the section on pre-processing the vector data, because working with two shapefiles was a pain. The merging of two GeoDataFrame objects using BatchMapper was a bad idea as it returned a DataPipe without a length, so DatashaderRasterizer complained. Decided too that focusing on just one vector was enough to teach the concept, and the masters can read the footnotes on more advanced combinatorial pipelines.

Combining two xarray.DataArray objects into a single xarray.Dataset before slicing. Using Zipper to pair the two DataPipes, and then Collator to do the actual combining. Showing the DataPipe graph after the zip stage to give people an overview of the (many) steps done so far. Double checked too that the image and mask looks ok, which led to the discovery of an issue with rounding numbers or something. Had to change the resolution from 80m to 100m because of this, otherwise the rasterized water mask is cut off at the Southern part.

Fixes `ExecutableNotFound: failed to execute PosixPath('dot'), make sure the Graphviz executables are on your systems' PATH`. Decided to use apt instead of conda because it's less hassle.

Just complete the whole vector segmentation tutorial in one go! Really simplified the ending with a minimal xbatcher slicing and conversion to torch.Tensor step only. Gave a shoutout to Edoardo's work at UNOSAT in the end. Decided to rename the file to vector-segmentation-masks. Added more emojis throughout the page, and should be good to go!

Refactoring the xarray collate functions to use `xr.merge` instead of the dictionary style way of appending data variables to an xarray.Dataset. Solution adapted from 7787f8e in #62 that is more robust to images being cut off due to rounding issues as with 6b18934 in #31. Downside is the need to verbosely rename the xarray.DataArray objects, and handle some conflicting coordinate labels.

* ♻️ Use xarray.merge with join="override" in collate functions Refactoring the xarray collate functions to use `xr.merge` instead of the dictionary style way of appending data variables to an xarray.Dataset. Solution adapted from 7787f8e in #62 that is more robust to images being cut off due to rounding issues as with 6b18934 in #31. Downside is the need to verbosely rename the xarray.DataArray objects, and handle some conflicting coordinate labels. * 📝 Minor tweaks to vector segmentation mask walkthrough A few whitespace fixes and fixing some DataPipe references.

🚧 Walkthrough on rasterizing vector polygons into label masks

f101b9a

Initial draft tutorial on preparing vector labels for an image segmentation task. Using shapefiles of flood water extent digitized by UNITAR-UNOSAT over Johor, Malaysia on Sentinel-1 imagery taken 15 Dec 2019.

weiji14 added the documentation Improvements or additions to documentation label Jul 18, 2022

weiji14 added this to the 0.2.1 milestone Jul 18, 2022

weiji14 self-assigned this Jul 18, 2022

💚 Install pyogrio for documentation build

919768d

Fix readthedocs build failure because pyogrio was not installed.

weiji14 modified the milestones: 0.2.1, 0.3.0, 0.4.0 Jul 24, 2022

weiji14 mentioned this pull request Jul 25, 2022

♻️ Let PyogrioReader return geodataframe only instead of tuple #33

Merged

weiji14 added 4 commits August 14, 2022 18:02

🔀 Merge branch 'main' into vector-segmentation-labels

f5150e2

New datashader DataPipe classes to use!

🚑 Temporarily force re-executing build

1313d38

Fix readthedocs `ERROR: Couldn't find cache key for notebook file vector-segmentation-labels.md`. Signed-off-by: Wei Ji <23487320+weiji14@users.noreply.github.com>

➕ Add matplotlib

ebf26fd

A comprehensive library for creating static, animated, and interactive visualizations in Python!

weiji14 force-pushed the vector-segmentation-labels branch from e0f9a83 to ebf26fd Compare August 16, 2022 13:05

weiji14 added 12 commits August 16, 2022 09:31

🔧 Update intersphinx link for xarray

03dccb8

Point to https://docs.xarray.dev/en/latest/ instead.

🔇 Don't fail when sphinx build encounters a warning

ada85a8

Debugging why readthedocs build fails when reproducing the same environment and commands locally works.

👷 Increase execution timeout from 30 to 300 seconds

2f12d75

See if this helps with fixing the KeyboardInterrupt when plotting the Sentinel-1 map using matplotlib, hopefully 5 minutes is more than enough.

🔊 Log execution traceback in terminal

b2c7c7f

Make it easier to debug errors when builds fail on Readthedocs, see https://jupyterbook.org/en/latest/content/execute.html#execution-tracebacks-in-the-terminal. Also reverting 1313d38 and ada85a8.

📝 Show how PyogrioReader is used and plot vector geodataframe too

4847c73

Read in the two shapefiles using PyogrioReaderIterDataPipe, and show a quick example of reprojecting and visualizing the vector polygons.

📝 Writeup section on rasterizing vector polygons onto canvas

563aaa1

Show how the painting of vector geopandas.GeoDataFrame polygons onto the datashader.Canvas works! Got some words of caution and where to seek more advice. Also added spatialpandas to the 'docs' extras list of dependencies.

➕ Add graphviz

8595a75

Simple Python interface for Graphviz!

💚 Install graphviz via apt in readthedocs build

100f590

Fixes `ExecutableNotFound: failed to execute PosixPath('dot'), make sure the Graphviz executables are on your systems' PATH`. Decided to use apt instead of conda because it's less hassle.

weiji14 marked this pull request as ready for review August 18, 2022 22:55

weiji14 merged commit ce0f4da into main Aug 18, 2022

weiji14 deleted the vector-segmentation-labels branch August 18, 2022 23:06

weiji14 mentioned this pull request Aug 19, 2022

♻️ Refactor DatashaderRasterizer to be up front about datapipe lengths #39

Merged

weiji14 mentioned this pull request Sep 9, 2022

VectorShapesDataset for loading geometries from vector files microsoft/torchgeo#458

Closed

weiji14 mentioned this pull request Oct 2, 2022

♻️ Use xarray.merge with join="override" in collate functions #72

Merged

weiji14 mentioned this pull request Nov 10, 2023

✏️ Edit Sentinel-1 preview PNG urls from Planetary Computer #129

Merged

weiji14 mentioned this pull request Feb 2, 2024

✏️ Edit URLs to shapefiles from UNOSAT #138

Merged

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

🚸 Walkthrough on rasterizing vector polygons into label masks #31

🚸 Walkthrough on rasterizing vector polygons into label masks #31

weiji14 commented Jul 18, 2022 •

edited

Loading

🚸 Walkthrough on rasterizing vector polygons into label masks #31

🚸 Walkthrough on rasterizing vector polygons into label masks #31

Conversation

weiji14 commented Jul 18, 2022 • edited Loading

weiji14 commented Jul 18, 2022 •

edited

Loading