Version 0.16.3 (2024-07-04)

This release adds an enhancement and compatibility changes with upstream libraries. Thanks to @raphaelquast, @droumis and @hoxbro.

Enhancements:

  • Add fail-fast for datasets outside the visible extent (#1345)
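
The fail-fast idea can be illustrated with a small pure-Python sketch. This is not Datashader's implementation; `extents_overlap`, `aggregate_if_visible` and the tuple layout are hypothetical names chosen for illustration. The point is only that a dataset whose bounding box misses the canvas ranges can be skipped before any per-row work is done.

```python
def extents_overlap(data_bounds, x_range, y_range):
    """Return True if the data bounding box intersects the visible extent.

    data_bounds is (xmin, ymin, xmax, ymax); x_range and y_range are
    (low, high) pairs. All names here are hypothetical.
    """
    xmin, ymin, xmax, ymax = data_bounds
    return (xmax >= x_range[0] and xmin <= x_range[1]
            and ymax >= y_range[0] and ymin <= y_range[1])


def aggregate_if_visible(data_bounds, x_range, y_range, aggregate):
    # Fail fast: skip the expensive aggregation when nothing can be visible.
    if not extents_overlap(data_bounds, x_range, y_range):
        return None
    return aggregate()
```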

Compatibility:

  • Compatibility with cudf 2024.06 (#1344)
  • Compatibility with geopandas 1.0 and dask-geopandas 0.4.0 (#1347)

Maintenance:

  • Update docs.yaml (#1346)

Version 0.16.2 (2024-05-31)

This release adds compatibility with Numpy 2.0, along with other improvements and bugfixes. Thanks to @hoxbro for his contributions.

Bugfixes:

  • Remove artifact from Polygon rendering (#1329)

Compatibility:

  • Test dev releases of numpy 2.0 and numba 0.60.0 (#1332)
  • Improve compatibility with dask-expr (#1335)
  • Add gpu marker for tests, and test both classic and dask-expr Dask DataFrames (#1341)

Documentation:

Maintenance:

  • Update list of maintainers (#1336)
  • Parallelize the test suite and fix a test pollution bug (#1338)
  • Update test workflow (#1340)

Version 0.16.1 (2024-04-19)

This release brings compatibility with new releases of upstream packages. Thanks to first-time contributor @alexander-beedie, and the regular contributors @philippjfr, @ianthomas23, @maximlt, and @hoxbro.

Enhancements:

  • Improved antialiased mean reduction (#1300)
  • Update the docstring of eq_hist (#1322)

Compatibility:

  • Python 3.12 support (#1317)
  • Basic dask_expr support (#1317)
  • Numpy 2.0 support (#1306)
  • Remove redundant py2 helper code (#1316)

Maintenance:

  • Replace Google Analytics with GoatCounter (#1309)
  • Docs: ignore numpydoc validation checks (#1310)
  • Fix test suite (#1314)
  • General maintenance (#1320)

Version 0.16.0 (2023-10-26)

Datashader 0.16.0 is a significant release adding support for rendering GeoPandas GeoDataFrames directly rather than having to convert them to SpatialPandas first. Support for GeoPandas geometry types in Datashader Canvas functions is as follows:

  • Canvas.line: LineString, MultiLineString, MultiPolygon, Polygon
  • Canvas.points: MultiPoint, Point
  • Canvas.polygons: MultiPolygon, Polygon

There is also support in Canvas.line for a new data type which is a 2D xarray.DataArray (within an xarray.Dataset) containing the coordinates of multiple lines that share the same x coordinates.

The DataShape package is now vendored in Datashader as it has not been maintained for a number of years and is not accepting updates.

Thanks to new contributor @J08ny and regular contributors @Hoxbro and @ianthomas23.

Enhancements:

  • Support rendering of GeoPandas GeoDataFrames as lines, points and polygons (#1285, #1293, #1297)
  • Implement lines using 2D xarray with common x coordinates (#1282)

General code improvements:

  • Add debug logging to compiler module (#1280)
  • Vendor DataShape (#1284)
  • Don't use object as base class (#1286)
  • Fix typos using codespell (#1288)
  • Fix float16 being a floating type. (#1290)
  • Simplify line _internal_build_extend (#1294)

Improvements to CI:

  • Update to latest holoviz_tasks (#1281)
  • Update codecov configuration (#1292)
  • Add pre-commit (#1295, #1296)

Compatibility:

  • Support Pandas 2.1 (#1276, #1287)
  • Replace np.NaN with np.nan (#1289)
  • Drop support for Python 3.8 (#1291)

Version 0.15.2 (2023-08-17)

This release adds antialiased line support for inspection reductions such as max_n and where, including within categorical by reductions. It also improves support for summary reductions and adds CUDA implementations of std and var reductions.

Thanks to regular contributors @Hoxbro, @ianthomas23, @maximlt and @thuydotm.

Enhancements:

  • Antialiasing line support for inspection reductions:
    • Pre-compile antialias stage 2 combination (#1258)
    • Antialiased min and max row index reductions (#1259)
    • CPU shift_and_insert function (#1260)
    • Refactor of CUDA *_n reductions (#1261)
    • Support antialiased lines in *_n reductions (#1262)
    • Replace accumulate with copy on first call to antialiased stage 2 combine (#1264)
    • Separate where combine_cpu functions by ndim (#1265)
    • Antialiased line support for where reductions (#1269)
  • Improved support for summary reductions:
    • Support by reduction within summary reduction (#1254)
    • Support summary containing by reduction with other reductions (#1257)
    • Support summary containing multiple where with the same selector (#1271)
  • CUDA support for std and var reductions (#1267)

General code improvements:

  • Remove pyarrow pin (#1248)

Improvements to CI:

  • Update holoviz_tasks to v0.1a15 (#1251)
  • Use holoviz_tasks/install action for docs (#1272)

Improvements to documentation:

  • Update readme to include Python 3.11 (#1249)
  • Correct links to pandas docs (#1250)
  • Remove twitter from index page (#1253)
  • Create FUNDING.yml (#1263)

Version 0.15.1 (2023-07-05)

This release contains an important bug fix to ensure that categorical column order is maintained across dask partitions. It also adds support for categorical inspection reductions such as by(max_n). The only missing functionality for inspection reductions is now antialiased lines, which is planned for the next release.

Thanks to contributors @ianthomas23, @maximlt and @philippjfr.

Bug fixes:

  • Fix single category reductions (#1231)
  • Ensure categorical column order is the same across dask partitions (#1239)

Enhancements:

  • Categorical inspection reductions:
    • Support by(max_n) and by(min_n) (#1229)
    • Categorical max_row_index, max_n_row_index and min equivalents (#1233)
    • Use enum for row index column rather than None (#1234)
    • Add support for categorical where reductions (#1237)
    • Add tests for handling of NaNs in where reductions (#1241)
  • General code improvements:
    • Only check dask.DataFrame dtypes of columns actually used (#1236)
    • Remove all use of OrderedDict (#1242)
    • Separate out 3d and 4d combine functions (#1243)
    • Reorganise antialiasing code (#1245)

Improvements to CI:

  • Bump holoviz tasks (#1240)
  • Add image is close test helper (#1244)

Improvements to documentation:

  • Update to Google Analytics 4 (#1228)
  • Rename pyviz-dev as holoviz-dev (#1232)

Version 0.15.0 (2023-05-30)

This release provides significant improvements for inspection reductions by adding new first_n, last_n, max_n and min_n reductions, and providing Dask and CUDA support for all existing and new inspection reductions including where. It also provides support for Numba 0.57, NumPy 1.24 and Python 3.11, and drops support for Python 3.7.

Thanks to first-time contributors @danigm and @Jap8nted, and also regulars @Hoxbro, @philippjfr and @ianthomas23.

Enhancements:

  • Inspection reductions:
    • Reduction append functions return index not boolean (#1180)
    • first_n, last_n, max_n and min_n reductions (#1184)
    • Add cuda argument to _build_combine (#1194)
    • Support max_n and min_n reductions on GPU (#1196)
    • Use fast cuda mutex available in numba 0.57 (#1212)
    • Dask support for first, last, first_n and last_n reductions (#1214)
    • Wrap use of cuda mutex in where reductions (#1217)
    • Cuda and cuda-with-dask support for inspection reductions (#1219)
  • x and y range attributes on returned aggregations (#1198)
  • Make datashader.composite imports lazy for faster import time (#1222)
  • Improvements to CI:
    • Cancel concurrent test workflows (#1208)
  • Improvements to documentation:
    • Inspection reduction documentation (#1186, #1190)
    • Upgrade to latest nbsite and pydata-sphinx-theme (#1221)
    • Use geodatasets in example data

Bug fixes:

  • Fix conversion from cupy in categorical rescale_discrete_levels (#1179)
  • Validate canvas width, height (#1183)
  • Support antialiasing in pipeline API (#1213)

Compatibility:

Version 0.14.4 (2023-02-02)

This release adds a new where reduction that provides improved inspection capabilities and adds support for colormaps that are tuples of hex values. There are also various bug fixes and compatibility improvements.

Thanks to @ianthomas23, @maximlt and @Hoxbro.

Enhancements:

  • New where reduction to provide improved inspection functionality:
    • Add new where reduction (#1155)
    • Where reduction using dataframe row index (#1164)
    • CUDA support for where reduction (#1167)
    • User guide page for where reduction (#1172)
  • Support colormaps that are tuples of hex values (#1173)
  • Add governance docs (#1165)
  • Improve documentation build system (#1170, #1171)
  • Improvements to CI:
    • Rename default branch from master to main (#1156)
    • Use holoviz_task install action (#1163)

Bug fixes:

  • Validate calculated log canvas range (#1154)
  • Better validate canvas.line() coordinate lengths (#1160)
  • Return early in eq_hist() if all data masked out (#1168)

Compatibility:

  • Follow recommended numba best practice.
    • Ensure cuda functions are correctly jitted (#1153)
    • nopython=True everywhere (#1162)
  • Update dependencies:
    • Pip pyarrow in tests dependencies (#1174)

Version 0.14.3 (2022-11-17)

This release fixes a bug related to spatial indexing of spatialpandas.GeoDataFrames, and introduces enhancements to antialiased lines, benchmarking and GPU support.

Thanks to first-time contributors @eriknw and @raybellwaves, and also @ianthomas23 and @maximlt.

Enhancements:

  • Improvements to antialiased lines:
    • Fit antialiased line code within usual numba/dask framework (#1142)
    • Refactor stage 2 aggregation for antialiased lines (#1145)
    • Support compound reductions for antialiased lines on the CPU (#1146)
  • New benchmark framework:
    • Add benchmarking framework using asv (#1120)
    • Add cudf, dask and dask-cudf Canvas.line benchmarks (#1140)
  • Improvements to GPU support:
    • Cupy implementation of eq_hist (#1129)
  • Improvements to documentation:
  • Improvements to dependency management (#1111, #1116)
  • Improvements to CI (#1132, #1135, #1136, #1137, #1143)

Bug fixes:

  • Ensure spatial index _sindex is retained on dataframe copy (#1122)

Version 0.14.2 (2022-08-10)

This is a bug fix release to fix an important divide by zero bug in antialiased lines, along with improvements to documentation and handling of dependencies.

Thanks to @ianthomas23 and @adamjhawley.

Enhancements:

  • Improvements to documentation:
    • Fix links in docs when viewed in browser (#1102)
    • Add release notes (#1108)
  • Improvements to handling of dependencies:
    • Correct dask and bokeh dependencies (#1104)
    • Add requests as an install dependency (#1105)
    • Better handle returned dask npartitions in tests (#1107)

Bug fixes:

  • Fix antialiased line divide by zero bug (#1099)

Version 0.14.1 (2022-06-21)

This release provides a number of important bug fixes and small enhancements from Ian Thomas along with infrastructure improvements from Maxime Liquet and new reductions from @tselea.

Enhancements:

  • Improvements to antialiased lines:
    • Support antialiased lines for categorical aggregates (#1081, #1083)
    • Correctly handle NaNs in antialiased line coordinates (#1097)
  • Improvements to rescale_discrete_levels for how='eq_hist':
    • Correct implementation of rescale_discrete_levels (#1078)
    • Check before calling rescale_discrete_levels (#1085)
    • Remove empty histogram bins in eq_hist (#1094)
  • Implementation of first and last reduction (#1093) for data types other than raster.

Bug fixes:

  • Do not snap trimesh vertices to pixel grid (#1092)
  • Correctly orient (y, x) arrays for xarray (#1095)
  • Infrastructure/build fixes (#1080, #1089, #1096)

Version 0.14.0 (2022-04-25)

This release has been nearly a year in the making, with major new contributions from Ian Thomas, Thuy Do Thi Minh, Simon Høxbro Hansen, Maxime Liquet, and James Bednar, and additional support from Andrii Oriekhov, Philipp Rudiger, and Ajay Thorve.

Enhancements:

  • Full support for antialiased lines of specified width (#1048, #1072). Previous antialiasing support was limited to single-pixel lines and certain floating-point reduction functions. Now supports arbitrary widths and arbitrary reduction functions, making antialiasing fully supported. Performance ranges from 1.3x to 14x slower than the simplest zero-width implementation; see benchmarks.
  • Fixed an issue with visibility on zoomed-in points plots and on overlapping line plots that was first reported in 2017, with a new option rescale_discrete_levels for how='eq_hist' (#1055)
  • Added a categorical color_key for 2D (unstacked) aggregates (#1020), for producing plots where each pixel has at most one category value
  • Improved docs:

Bugfixes:

Compatibility:

  • Canvas.line() option antialias=True is now deprecated; use line_width=1 (or another nonzero value) instead. (#1048)
  • Removed long-deprecated bokeh_ext.py (#1059)
  • Dropped support for Python 2.7 (actually already dropped from the tests in Datashader 0.12) and 3.6 (no longer supported by many downstream libraries like rioxarray, but several of them are not properly declaring that restriction, making 3.6 much more difficult to support.) (#1033)
  • Now tested on Python 3.7, 3.8, 3.9, and 3.10. (#1033)

Version 0.13.0 (2021-06-10)

Thanks to Jim Bednar, Nezar Abdennur, Philipp Rudiger, and Jean-Luc Stevens.

Enhancements:

  • Defined new dynspread metric based on counting the fraction of non-empty pixels that have non-empty pixels within a given radius. The resulting dynspread behavior is much more intuitive than the old behavior, which counted already-spread pixels as if they were neighbors (#1001)
  • Added ds.count() as the default reduction for ds.by (#1004)
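
The new dynspread metric described above can be sketched in plain Python as follows. This is a hedged illustration, not the library's code: the function name is hypothetical, the input is a simple boolean grid, and measuring the radius as Euclidean distance between pixel centres is an assumption. How the resulting fraction is compared against a threshold is not shown.

```python
def neighbor_fraction(mask, radius):
    """Fraction of non-empty pixels with another non-empty pixel within radius.

    mask is a 2D list of booleans (True = non-empty). Distance is Euclidean
    between pixel centres, which is an assumption of this sketch.
    """
    height, width = len(mask), len(mask[0])
    filled = [(r, c) for r in range(height) for c in range(width) if mask[r][c]]
    if not filled:
        return 0.0
    with_neighbor = 0
    for r, c in filled:
        found = False
        for dr in range(-radius, radius + 1):
            for dc in range(-radius, radius + 1):
                # Skip the pixel itself and offsets outside the circle.
                if (dr, dc) == (0, 0) or dr * dr + dc * dc > radius * radius:
                    continue
                rr, cc = r + dr, c + dc
                if 0 <= rr < height and 0 <= cc < width and mask[rr][cc]:
                    found = True
        with_neighbor += found
    return with_neighbor / len(filled)
```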

Bugfixes:

  • Fixed array-bounds reading error in dynspread (#1001)
  • Fix color_key argument for dsshow (#986)
  • Added Matplotlib output to the 3_Interactivity getting started page. (#1009)
  • Misc docs fixes (#1007)
  • Fix nan assignment to integer array in RaggedArray (#1008)

Compatibility:

  • Any usage of dynspread with datatypes other than points should be replaced with spread(), which will do what was probably intended by the original dynspread call, i.e. to make isolated lines and shapes visible. Strictly speaking, dynspread could still be useful for other glyph types if that glyph is contained entirely in a pixel, e.g. if a polygon or line segment is located within the pixel bounds, but that seems unlikely.
  • Dynspread may need to have the threshold or max_px arguments updated to achieve the same spreading as in previous releases, though the new behavior is normally going to be more useful than the old.

Version 0.12.1 (2021-03-22)

Major release with new features that should really be considered part of the upcoming 0.13 release; please treat all the new features as experimental in this release due to it being officially a minor release (unintentionally).

Massive thanks to these contributors for substantial new functionality:

  • Nezar Abdennur (nvictus), Trevor Manz, and Thomas Caswell for their contributions to the new dsshow() support for using Datashader as a Matplotlib Artist, providing seamless interactive Matplotlib+Datashader plots.
  • Oleg Smirnov for category_modulo and category_binning for by(), making categorical plots vastly more powerful.
  • Jean-Luc Stevens for spread and dynspread support for numerical aggregate arrays and not just RGB images, allowing isolated datapoints to be made visible while still supporting hover, colorbars, and other plot features that depend on the numeric aggregate values.
  • Valentin Haenel for the initial anti-aliased line drawing support (still experimental).

Thanks to Jim Bednar, Philipp Rudiger, Peter Roelants, Thuy Do Thi Minh, Chris Ball, and Jean-Luc Stevens for maintenance and other contributions.

New features:

  • Expanded (and transposed) performance guide table (#961)
  • Add category_modulo and category_binning for grouping numerical values into categories using by() (#927)
  • Support spreading for numerical (non-RGB) aggregate arrays (#771, #954)
  • Xiaolin Wu anti-aliased line drawing, enabled by adding antialias=True to the Canvas.line() method call. Experimental; currently restricted to sum and max reductions and only supporting a single-pixel line width. (#916)
  • Improve Dask performance issue using a tree reduction (#926)
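
As a rough picture of what a tree reduction means for the Dask item above: per-partition aggregates are combined pairwise, level by level, rather than folded one at a time into a single accumulator. The sketch below is generic Python with hypothetical names; it does not show how Datashader or Dask actually schedule the work.

```python
def tree_reduce(partials, combine):
    """Combine partial results pairwise, level by level."""
    if not partials:
        raise ValueError("need at least one partial result")
    level = list(partials)
    while len(level) > 1:
        nxt = [combine(level[i], level[i + 1])
               for i in range(0, len(level) - 1, 2)]
        if len(level) % 2:
            nxt.append(level[-1])  # odd one out moves up unchanged
        level = nxt
    return level[0]


def add_grids(a, b):
    # Element-wise sum of two equally sized 2D count grids.
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]
```

With k partitions the combine depth is about log2(k) instead of k, and the combines within one level are independent of each other.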

Bugfixes:

  • Fix for xarray 0.17 raster files, supporting various nodata conventions (#991)
  • Fix RaggedArray tests to keep up with Pandas test suite changes (#982, #993)
  • Fix out-of-bounds error on Points aggregation (#981)
  • Fix CUDA issues (#973)
  • Fix Xarray handling (#971)
  • Disable the interactivity warning on the homepage (#983)

Compatibility:

  • Drop deprecated modules ds.geo (moved to xarray_spatial) and ds.spatial (moved to SpatialPandas) (#955)

Version 0.12.0 (2021-01-07)

No release notes produced.

Version 0.11.1 (2020-08-16)

This release is primarily a compatibility release for newer versions of Rapids cuDF and Numba versions along with a small number of bug fixes. With contributions from @jonmmease, @stuartarchibald, @AjayThorve, @kebowen730, @jbednar and @philippjfr.

  • Fixes support for cuDF 0.13 and Numba 0.48 (#933)
  • Fixes for cuDF support on Numba>=0.51 (#934, #947)
  • Fixes tile generation using aggregators with output of boolean dtype (#949)
  • Fixes for CI and build infrastructure (#935, #948, #951)
  • Updates to docstrings (b1349e3, #950)

Version 0.11.0 (2020-05-25)

This release includes major contributions from @maihde (generalizing count_cat to by, span for colorize), @jonmmease (Dask quadmesh support), @philippjfr and @jbednar (count_cat/by/colorize/docs/bugfixes), and Barry Bragg, Jr. (TMS tileset speedups).

New features (see getting_started/2_Pipeline.ipynb for examples):

  • New by() categorical aggregator, extending count_cat to work with other reduction functions, no longer just count. Allows binning of aggregates separately per category value, so that you can compare how that aggregate is affected by category value. (#875, #902, #904, #906). See example in the holoviews docs.
  • Support for negative and zero values in tf.shade for categorical aggregates. (#896, #909, #910, #908)
  • Support for span in _colorize(). (#875, #910)
  • Support for Dask-based quadmesh rendering for rectilinear and curvilinear mesh types (#885, #913)
  • Support for GPU-based raster mesh rendering via Canvas.quadmesh (#872)
  • Faster TMS tileset generation (#886)
  • Expanded performance guide (#868)

Bugfixes:

Compatibility (breaking changes and deprecations):

  • To allow negative-valued aggregates, count_cat now weights categories according to how far they are from the minimum aggregate value observed, while previously they were referenced to zero. Previous behavior can be restored by passing color_baseline=0 to count_cat or by().
  • count_cat is now deprecated and removed from the docs; use by(..., count()) instead.
  • Result of a count() aggregation is now uint32 not int32 to distinguish counts from other aggregation types (#910).
  • tf.shade now only treats zero values as missing for count aggregates (uint); zero is otherwise a valid value distinct from NaN (#910).
  • alpha is now respected as the upper end of the alpha range for both _colorize() and _interpolate() in tf.shade; previously only _interpolate respected it.
  • Added new nansum_missing utility for working with Numpy>1.9, where nansum no longer returns NaN for all-NaN values.
  • ds.geo and ds.spatial modules are now deprecated; their contents have moved to xarray_spatial and spatialpandas, respectively. (#894)
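
The behaviour that nansum_missing restores, as described above, can be shown with a pure-Python sketch. The function below is a hypothetical stand-in that works on a flat list; it is not the Datashader utility and makes no claim about that utility's signature.

```python
import math


def nansum_missing_sketch(values):
    """Sum values ignoring NaN, but return NaN when every value is NaN."""
    valid = [v for v in values if not math.isnan(v)]
    if not valid:
        return math.nan  # all-NaN input stays "missing" instead of becoming 0
    return sum(valid)
```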

Download and install: https://datashader.org/getting_started

Version 0.10.0 (2020-01-21)

This release includes major contributions from @jonmmease (polygon rendering, spatialpandas), along with contributions from @philippjfr and @brendancol (bugfixes), and @jbednar (docs, warnings, and import times).

New features:

  • Polygon (and points and lines) rendering for spatialpandas extension arrays (#826, #853)
  • Quadmesh GPU support (#861)
  • Much faster import times (#863)
  • New table in docs listing glyphs supported for each data library (#864, #867)
  • Support for remote Parquet filesystems (#818, #866)

Bugfixes and compatibility:

  • Misc bugfixes and improvements (#844, #860, #866)
  • Fix warnings and deprecations in tests (#859)
  • Fix Canvas.raster (padding, mode buffers, etc. #862)

Download and install: https://datashader.org/getting_started

Version 0.9.0 (2019-12-08)

This release includes major contributions from @jonmmease (GPU support), along with contributions from @brendancol (viewshed speedups), @jbednar (docs), and @jsignell (examples, maintenance, website).

New features:

  • Support for CUDA GPU dataframes (cudf and dask_cudf) (#794, #793, #821, #841, #842)
  • Documented new quadmesh support (renaming user guide section 5_Rasters to 5_Grids to reflect the more-general grid support) (#805)

Bugfixes and compatibility:

  • Avoid double-counting line segments that fit entirely into a single rendered pixel (#839)
  • Improved geospatial toolbox, including 75X speedups to viewshed algorithm (#811, #824, #844)

Version 0.8.0 (2019-10-08)

This release includes major contributions from @jonmmease (quadmesh and filled-area support), @brendancol (geospatial toolbox, tile previewer), @philippjfr (distributed regridding, dask performance), and @jsignell (examples, maintenance, website).

New features:

  • Native quadmesh (canvas.quadmesh()) support (for rectilinear and curvilinear grids -- 3X faster than approximating with a trimesh; #779)
  • Filled area (canvas.area()) support (#734)
  • Expanded geospatial toolbox, with support for:
    • Zonal statistics (#782)
    • Calculating viewshed (#781)
    • Calculating proximity (Euclidean and other distance metrics, #772)
  • Distributed raster regridding with Dask (#762)
  • Improved dask performance (#798, #801)
  • tile_previewer utility function (simple Bokeh-based plotting of local tile sources for debugging; #761)

Bugfixes and compatibility:

  • Compatibility with latest Numba, Intake, Pandas, and Xarray (#763, #768, #791)
  • Improved datetime support (#803)
  • Simplified docs (now built on Travis, and no longer requiring GeoViews) and examples (now on examples.pyviz.org)
  • Skip rendering of empty tiles (#760)
  • Improved performance for point, area, and line glyphs (#780)
  • InteractiveImage and Pipeline are now deprecated; removed from examples (#751)

Version 0.7.0 (2019-04-08)

This release includes major contributions from @jonmmease (ragged array extension, SpatialPointsFrame, row-oriented line storage, dask trimesh support), @jsignell (maintenance, website), and @jbednar (Panel-based dashboard).

New features:

  • Simplified Panel based dashboard using new Param features; now only 48 lines with fewer new concepts (#707)
  • Added pandas ExtensionArray and Dask support for storing homogeneous ragged arrays (#687)
  • Added SpatialPointsFrame and updated census, osm-1billion, and osm examples to use it (#702, #706, #708)
  • Expanded 8_Geography.ipynb to document other geo-related functions
  • Added Dask support for trimesh rendering, though computing the mesh initially still requires vertices and simplices to fit into memory (#696)
  • Add zero-copy rendering of row-oriented line coordinates, using a new axis argument (#694)

Bugfixes and compatibility:

  • Added lnglat_to_meters to geo module; new code should import it from there (#708)

Version 0.6.9 (2019-01-29)

This release includes major contributions from @jonmmease (fixing several long-standing bugs), @jlstevens (updating all example notebooks to use current syntax, #685), @jbednar, @philippjfr, and @jsignell (Panel-based dashboard), and @brendancol (geo utilities).

New features:

  • Replaced outdated 536-line Bokeh dashboard.py with 71-line Panel+HoloViews dashboard (#676)
  • Allow aggregating xarray objects (in addition to Pandas and Dask DataFrames) (#675)
  • Create WMTS tiles from Datashader data (#636)
  • Added various geographic utility functions (ndvi, slope, aspect, hillshade, mean, bump map, Perlin noise) (#661)
  • Made OpenSky data public (#691)

Bugfixes and compatibility:

  • Fix array bounds error on line glyph (#683)
  • Fixed the span argument to tf.shade (#680)
  • Fixed composite.add (for use in spreading) to clip colors rather than overflow (#689)
  • Fixed gerrymandering shape file (#688)
  • Updated to match Bokeh (#656), Dask (#681, #667), Pandas/Numpy (#697)
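
The difference between clipping and overflowing in the composite.add fix above comes down to how an 8-bit colour channel handles a sum above 255. The two helper functions below are hypothetical and show only that channel-level arithmetic, not the operator's actual code or its alpha handling.

```python
def add_wrapping(a, b):
    # What 8-bit unsigned addition does on overflow: values wrap around.
    return (a + b) % 256


def add_clipped(a, b):
    # Saturating addition: results above 255 are clamped to 255.
    return min(a + b, 255)
```

For two bright channel values such as 200 and 100, wrapping yields a dark 44 while clipping yields 255.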

Version 0.6.8 (2018-09-11)

Minor, mostly bugfix, release with some speed improvements.

New features:

  • Added Strange Attractors example (#632)
  • Major speedup: optimized dask datashape detection (#634)

Bugfixes and compatibility:

  • Silenced inappropriate warnings (#631)
  • Fixed various other bugs, including #644
  • Added handling for zero data and zero range (#612, #648)

Version 0.6.7 (2018-07-07)

Minor compatibility release.

  • Supports dask >= 0.18.
  • Updated installation and usage instructions

Version 0.6.6 (2018-05-20)

Minor bugfix release.

  • Now available to install using pip (pip install datashader) or conda defaults (conda install datashader)
  • InteractiveImage is now deprecated; please use the Datashader support in HoloViews instead.
  • Updated installation and example instructions to use new datashader command.
  • Made package building automatic, to allow more frequent releases
  • Ensured transparent (not black) image is returned when there is no data to plot (thanks to Nick Xie)
  • Simplified getting-started example (thanks to David Jones)
  • Various fixes and compatibility updates to examples

Version 0.6.5 (2018-02-01)

Major release with extensive support for triangular meshes and changes to the raster API.

New features:

  • Trimesh support: Rendering of irregular triangular meshes using Canvas.trimesh() (see user guide) (#525, #552)
  • Added a new website at datashader.org, with new Getting Started pages and an extensive User Guide, with about 50% new material not previously in example notebooks. Built entirely from Jupyter notebooks, which can be run in the examples/ directory. Website is now complete except for sections on points (see the nyc_taxi example in the meantime).
  • Canvas.raster() now accepts xarray Dataset types, not just DataArrays, with the specific DataArray selectable from the Dataset using the column= argument of a supplied aggregation function.
  • tf.Images() now displays anything with an HTML representation, to allow laying out Pandas dataframes alongside datashader output.

Bugfixes and compatibility:

  • Changed Raster API to match other glyph types:
    • Now accepts a reduction function via an agg= argument like Canvas.line(), Canvas.points(), etc. The previous downsample_method is still accepted for this release, but is now deprecated.
    • upsample_method is now interpolate, accepting linear=True or linear=False; the previous spelling is now deprecated.
    • The layer= argument previously accepted a 1-based integer index, which was confusing given the standard Python 0-based indexing elsewhere. Changed to accept an xarray coordinate, which can be a 1-based index if that's what is defined on the array, but also works with arbitrary floating-point coordinates (e.g. for a depth parameter in an image stack).
    • Now auto-ranges in x and y when not given explicit ranges, instead of raising an error.
  • Fixed various bugs, including one generating incorrect output in Canvas.raster(agg='mode')

Version 0.6.4 (2017-12-05)

Minor compatibility release to track changes in external packages.

  • Updated imports for bokeh 0.12.11 (fixes #535), though there are issues in 0.12.11 itself and so 0.12.12 should be used instead (to be released shortly).
  • Pinned pillow version on Windows (fixes #534).

Version 0.6.3 (2017-12-01)

Apart from the new website, this is a minor release primarily to catch up with changes in external libraries.

New features:

  • Reorganized examples directory as the basis for a completely new website at https://bokeh.github.io/datashader-docs (#516).
  • Added tf.Images() class to format multiple labeled Datashader images as a table in a Jupyter notebook, now used extensively in the new website.
  • Added utility function dataframe_from_multiple_sequences(x_values, y_values) to convert large numbers of sequences stored as 2D numpy arrays to a NaN-separated pandas dataframe that can be displayed efficiently (see new example in tseries.ipynb) (#512).
  • Improved streaming support (#520).
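
The NaN-separated layout mentioned for dataframe_from_multiple_sequences can be pictured with a small pure-Python sketch. This is not the utility itself: the function name is hypothetical, it returns plain lists rather than a dataframe, and it assumes all sequences share one set of x values.

```python
import math


def nan_separated(x_values, y_rows):
    """Flatten several sequences sharing x_values into NaN-separated columns.

    Returns (xs, ys): each sequence is followed by a NaN entry so that a line
    renderer breaks the line between consecutive sequences.
    """
    xs, ys = [], []
    for row in y_rows:
        if len(row) != len(x_values):
            raise ValueError("each y sequence must match x_values in length")
        xs.extend(x_values)
        ys.extend(row)
        xs.append(math.nan)  # separator: no segment is drawn across a NaN
        ys.append(math.nan)
    return xs, ys
```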

Bugfixes and compatibility:

  • Added support for Dask 0.15 and 0.16 and pandas 0.21 (#523, #529) and declared minimum required Numba version.
  • Improved and fixed issues with various example notebooks, primarily to update for changes in dependencies.
  • Changes in network graph support: ignore id field by default to avoid surprising dependence on column name, rename directly_connect_edges to connect_edges for accuracy and conciseness.

Version 0.6.2 (2017-10-25)

Release with bugfixes, changes to match external libraries, and some new features.

Backwards compatibility:

  • Minor changes to network graph API, e.g. to ignore weights by default in forcelayout2 (#488)
  • Fix upper-bound bin error for auto-ranged data (#459). Previously, points falling on the upper bound of the plotted area were excluded from the plot, which was consistent with the behavior for individual grid cells, but which was confusing and misleading for the outer boundaries. Points falling on the very outermost boundaries are now folded into the final grid cell, which should be the least surprising behavior.
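
The upper-bound rule described above can be sketched as a one-dimensional binning function. This is an illustration under assumed names, not Datashader's code: interior cells stay half-open, and only a value sitting exactly on the outermost upper bound is folded into the final cell.

```python
def bin_index(value, low, high, n_bins):
    """Map value in [low, high] to a bin index in 0..n_bins-1, else None."""
    if value < low or value > high:
        return None  # outside the plotted range
    idx = int((value - low) / (high - low) * n_bins)
    # value == high computes n_bins; clamping folds it into the last bin.
    return min(idx, n_bins - 1)
```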

New or updated examples (.ipynb files in examples/):

  • streaming-aggregation.ipynb: Illustrates combining incoming streams of data for display (also see holoviews streaming).
  • landsat.ipynb: simplified using HoloViews; now includes plots of full spectrum for each point via hovering.
  • Updated and simplified census-hv-dask (now called census-congressional), census-hv, packet_capture_graph.

New features and improvements

  • Updated Bokeh support to work with new bokeh 0.12.10 release (#505)
  • More options for network/graph plotting (configurable column names, control over weights usage; #488, #494)
  • For line plots (time series, trajectory, network graphs), switch line-clipping algorithm from Cohen-Sutherland to Liang-Barsky. The performance gains for random lines range from 50-75% improvement for a million lines. (#495)
  • Added tf.Images class to format a list of images as an HTML table (#492)
  • Faster resampling/regridding operations (#486)
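
For readers unfamiliar with the Liang-Barsky clipping named above, here is a textbook version in plain Python. It is a sketch of the general algorithm with a hypothetical function signature, not Datashader's compiled implementation.

```python
def liang_barsky(x0, y0, x1, y1, xmin, ymin, xmax, ymax):
    """Clip segment (x0, y0)-(x1, y1) to an axis-aligned rectangle.

    Returns the clipped endpoints (x0, y0, x1, y1), or None if the segment
    lies entirely outside the rectangle.
    """
    dx, dy = x1 - x0, y1 - y0
    t0, t1 = 0.0, 1.0
    # Each (p, q) pair tests one edge: left, right, bottom, top.
    for p, q in ((-dx, x0 - xmin), (dx, xmax - x0),
                 (-dy, y0 - ymin), (dy, ymax - y0)):
        if p == 0:
            if q < 0:
                return None  # parallel to this edge and outside it
            continue
        t = q / p
        if p < 0:
            if t > t1:
                return None
            t0 = max(t0, t)  # entering the rectangle
        else:
            if t < t0:
                return None
            t1 = min(t1, t)  # leaving the rectangle
    return (x0 + t0 * dx, y0 + t0 * dy, x0 + t1 * dx, y0 + t1 * dy)
```

The algorithm works on the segment's parameter interval [t0, t1] and narrows it once per edge, so it can reject a segment as soon as the interval becomes empty.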

Known issues:

  • examples/dashboard has not yet been updated to match other libraries, and is thus missing functionality like hovering and legends.
  • A full website with documentation has been started but is not yet ready for deployment.

Version 0.6.1 (2017-09-13)

Minor bugfix release, primarily updating example notebooks to match API changes in external packages.

Backwards compatibility:

  • Made edge bundling retain edge order, to allow indexing, and absolute coordinates, to allow overlaying on external data.
  • Updated examples to show that xarray now requires dimension names to match before doing arithmetic or comparisons between arrays.

Known issues:

  • If you use Jupyter notebook 5.0 (earlier or later versions should be ok), you will need to override a setting that prevents visualizations from appearing, e.g.: jupyter notebook --NotebookApp.iopub_data_rate_limit=100000000 census.ipynb &
  • The dashboard needs to be rewritten entirely to match current Bokeh and HoloViews releases, so that hover and legend support can be restored.

Version 0.6.0 (2017-08-19)

New release of features that may still be in progress, but are already usable:

  • Added graph/network plotting support (still may be in flux) (#385, #390, #398, #408, #415, #418, #436)
  • Improved raster regridding based on gridtools and xarray (still may be in flux); no longer depends on rasterio and scikit-image (#383, #389, #423)
  • Significantly improved performance for dataframes with categorical fields

New examples (.ipynb files in examples/):

  • osm-1billion: 1-billion-point OSM example, for in-core processing on a 16GB laptop.
  • edge_bundling: Plotting graphs using "edgehammer" bundling of edges to show structure.
  • packet_capture_graph: Laying out and visualizing network packets as a graph.

Backwards compatibility:

  • Remove deprecated interpolate and colorize functions
  • Made raster processing consistently use bin centers to match xarray conventions (requires recent fixes to xarray; only available on a custom channel for now) (#422)
  • Fixed various limitations and quirks for NaN values
  • Made alpha scaling respect min_alpha consistently (#371)

Known issues:

  • If you use Jupyter notebook 5.0 (earlier or later versions should be ok), you will need to override a setting that prevents visualizations from appearing, e.g.: jupyter notebook --NotebookApp.iopub_data_rate_limit=100000000 census.ipynb &
  • The dashboard needs updating to match current Bokeh releases; most parts other than hover and legends should be functional, but it needs a rewrite to use currently recommended approaches.

Version 0.5.0 (2017-05-12)

Major release with extensive optimizations and new plotting-library support, incorporating 9 months of development from 5 main contributors:

  • Extensive optimizations for speed and memory usage, providing at least 5X improvements in speed (using the latest Numba versions) and 2X improvements in peak memory requirements.
  • Added HoloViews support for flexible, composable, dynamic plotting, making it simple to switch between datashaded and non-datashaded versions of a Bokeh or Matplotlib plot.
  • Added examples/environment.yml to make it easy to install dependencies needed to run the examples.
  • Updated examples to use the now-recommended, supported, and fast Apache Parquet file format
  • Added support for variable alpha for non-categorical aggregates, by specifying a single color rather than a list or colormap (#345)
  • Added datashader.utils.lnglat_to_meters utility function for working in Web Mercator coordinates with Bokeh
  • Added discussion of why you should be using uniform colormaps, and examples of using uniform colormaps from the new colorcet package
  • Numerous bug fixes and updates, mostly in the examples and Bokeh extension
  • Updated reference manual and documentation
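
The longitude/latitude conversion mentioned above can be illustrated with the standard spherical Web Mercator formulas. This is a standalone sketch using only the `math` module, not the library function itself; the name `lnglat_to_meters_sketch` and the constant `EARTH_RADIUS_M` are hypothetical, and the sketch handles scalars only.

```python
import math

# Sphere radius used by Web Mercator (the WGS84 semi-major axis), in meters.
EARTH_RADIUS_M = 6378137.0


def lnglat_to_meters_sketch(longitude, latitude):
    """Project degrees of longitude/latitude to Web Mercator meters.

    Illustrative scalar version; latitude must lie strictly between
    -90 and 90 degrees, since the projection diverges at the poles.
    """
    easting = math.radians(longitude) * EARTH_RADIUS_M
    northing = math.log(
        math.tan(math.pi / 4 + math.radians(latitude) / 2)
    ) * EARTH_RADIUS_M
    return easting, northing
```

For example, `lnglat_to_meters_sketch(180, 0)` gives an easting of roughly 20,037,508 m, which is the half-width of the Web Mercator world extent used by common tile providers.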

New examples (.ipynb files in examples/):

  • holoviews_datashader: Using HoloViews to create dynamic Datashader plots easily
  • census-hv-dask: Using GeoViews for overlaying shape files, demonstrating gerrymandering by race
  • nyc_taxi-paramnb: Using ParamNB to make a simple dashboard
  • lidar: Visualizing point clouds
  • solar: Visualizing solar radiation data
  • Dynamic 1D histogram example (last code cell in examples/nyc_taxi-nongeo.ipynb)
  • dashboard: Now includes opensky example (python dashboard/dashboard.py -c dashboard/opensky.yml)

Backwards compatibility:

  • To improve consistency with Numpy and Python data structures and eliminate issues with an empty column and row at the edge of the aggregated raster, the provided xrange,yrange bounds are now treated as upper exclusive. Results will thus differ between 0.5.0 and earlier versions. See #259 for discussion.
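
The upper-exclusive convention described above is the same half-open interval used by Python's `range()` and NumPy slicing. The sketch below shows the idea for one axis; `bin_index` is a hypothetical helper written for this illustration and is not part of Datashader.

```python
def bin_index(x, x_min, x_max, n_bins):
    """Return the bin for x in the half-open range [x_min, x_max).

    Values below x_min, or at or above x_max, fall outside the raster
    and yield None. Illustrative sketch only.
    """
    if not (x_min <= x < x_max):
        return None
    index = int((x - x_min) / (x_max - x_min) * n_bins)
    # Guard against floating-point rounding pushing x onto the upper edge.
    return min(index, n_bins - 1)
```

With `x_min=0`, `x_max=10`, and `n_bins=5`, a value of 0 lands in bin 0 and 9.99 lands in bin 4, while a value of exactly 10 is excluded, so no extra row or column is needed at the edge.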

Known issues:

  • If you use Jupyter notebook 5.0 (earlier or later versions should be ok), you will need to override a setting that prevents visualizations from appearing, e.g.: jupyter notebook --NotebookApp.iopub_data_rate_limit=100000000 census.ipynb &
  • Legend and hover support is currently disabled for the dashboard, due to ongoing development of a simpler approach.

Version 0.4.0 (2016-08-18)

Minor bugfix release to support Bokeh 0.12.1, with some API and defaults changes.

  • Added examples() function to obtain the notebooks and other examples corresponding to the installed datashader version; see examples/README.md.
  • Updated dashboard example to match changes in Bokeh
  • Added default color cycle with distinguishable colors for shading categorical data; now tf.shade(agg) with no other arguments should give a usable plot for both categorical and non-categorical data.

Backwards compatibility:

  • Replaced confusing tf.interpolate() and tf.colorize() functions with a single shading function tf.shade(). The previous names are still supported, but give deprecation warnings. Calls to the previous functions using keyword arguments can simply be renamed to use tf.shade as all the same keywords are accepted, but calls to colorize that used a positional argument for e.g. the color_key will now need to use a keyword when calling shade()
  • Increased default threshold for tf.dynspread() to improve visibility of sparse dots
  • Increased default min_alpha for tf.shade() (formerly tf.colorize()) to avoid undersaturation

Known issues:

  • For Bokeh 0.12.1, some notebooks will give warnings for Bokeh plots when used with Jupyter's "Run All" command. Bokeh 0.12.2 will fix this problem when it is released, but for now you can either downgrade to 0.12.0 or use single-cell execution.
  • There are some Bokeh compatibility issues with the dashboard example that are still being investigated and may require a new Bokeh or datashader release in this series.

Version 0.3.2 (2016-07-18)

Minor bugfix release to support Bokeh 0.12:

  • Fixed InteractiveImage zooming to work with Bokeh 0.12.
  • Added more responsive event throttling for DynamicImage; the throttle parameter is no longer needed and is now deprecated
  • Fixed datashader-download-data command
  • Improved non-geo Taxi example
  • Temporarily disabled dashboard legends; will re-enable in future release

Version 0.3.0 (2016-06-23)

The major feature of this release is support of raster data via Canvas.raster. To use this feature, you must install the optional dependencies via conda install rasterio scikit-image. Rasterio relies on gdal, whose conda package has some known bugs, including a missing dependency on krb5 (install it with conda install krb5). InteractiveImage in this release requires bokeh 0.11.1 or earlier, and will not work with bokeh 0.12.

  • PR #160 #187 Improved example notebooks and dashboard
  • PR #186 #184 #178 Add datashader-download-data cli command for grabbing example datasets
  • PR #176 #177 Changed census example data to use HDF5 format (slower but more portable)
  • PR #156 #173 #174 Added Landsat8 and race/ethnicity vs. elevation example notebooks
  • PR #172 #159 #157 #149 Added support for images using Canvas.raster (requires rasterio and scikit-image).
  • PR #169 Added legends notebook demonstrating create_categorical_legend and create_ramp_legend
  • PR #162. Added notebook example for datashader.bokeh_ext.HoverLayer
  • PR #152. Added alpha arg to tf.interpolate
  • PR #151 #150, etc. Small bugfixes
  • PR #146 #145 #144 #143 Added streaming example
  • Added hold decorator to utils, summarize_aggregate_values helper function
  • Added FAQ to docs

Backwards compatibility:

  • Removed memoize_method
  • Renamed datashader.callbacks --> datashader.bokeh_ext
  • Renamed examples/plotting_problems.ipynb --> examples/plotting_pitfalls.ipynb

Version 0.2.0 (2016-04-01)

A major release with significant new functionality and some small backwards-incompatible changes.

New features:

  • PR #124, census New census notebook example, showing how to work with categorical data.
  • PR #79, tseries, trajectory Added line glyph and .any() reduction, used in new time series and trajectory notebook examples.
  • PR #76, #77, #131 Updated all of the other notebooks in examples/, including nyc_taxi.
  • PR #100, #125: Improved dashboard example: added categorical data support, census and osm datasets, legend and hover support, better performance, out of core option, and more
  • PR #109, #111: Added full colormap support via a new cmap argument to interpolate and colorize; supports color ranges as lists, plus Bokeh palettes and matplotlib colormaps
  • PR #98: Added set_background to make it easier to work with images having a different background color than the default white of notebooks
  • PR #119, #121: Added eq_hist option for how in interpolate, performing histogram equalization on the data to reveal structure at every intensity level
  • PR #80, #83, #128: Greatly improved InteractiveImage performance and responsiveness
  • PR #74, #123: Added operators for spreading pixels (to make individual datapoints visible, as circles, squares, or arbitrary mask shapes) and compositing (for simple and flexible composition of images)
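
The idea behind the eq_hist option listed above, histogram equalization, can be sketched in a few lines. This is a simplified rank-based version in plain Python, assuming a flat list of aggregate values; it is not the library's implementation, and the name `eq_hist_sketch` is hypothetical.

```python
from bisect import bisect_right


def eq_hist_sketch(values):
    """Map each value to its empirical CDF position in (0, 1].

    After the mapping, output levels are used about equally often, so
    structure stays visible at both low and high intensities even when
    the input is heavily skewed. Illustrative sketch only.
    """
    ordered = sorted(values)
    n = len(ordered)
    # bisect_right counts how many inputs are <= v, i.e. n * CDF(v).
    return [bisect_right(ordered, v) / n for v in values]
```

For instance, the skewed input `[1, 10, 100, 1000]` maps to evenly spaced levels, whereas a linear mapping would squeeze the first three values into the bottom tenth of the color range.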

Backwards compatibility:

  • The low and high color options to interpolate and colorize are now deprecated and will be removed in the next release; use cmap=[low,high] instead.
  • The transfer function merge has been removed to avoid confusion. stack and others can be used instead, depending on the use case.
  • The default how for interpolate and colorize is now eq_hist to reveal the structure automatically regardless of distribution.
  • Pipeline now has a default dynspread step, to make isolated points visible when zooming in, and the default sizes have changed.

Version 0.1.0 (2016-04-01)

Initial public release.