zonal stats: speed up dask case #572

thuydotm · 2021-11-10T12:17:16Z

This PR uses the same approach as #568 to improve the performance of zonal stats when the input data arrays are dask-backed. It computes the stats chunk by chunk, then combines the per-chunk results and returns the output as a dask DataFrame.
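To illustrate the chunk-by-chunk idea, here is a minimal pure-Python sketch. It is not the code from this PR and does not use dask or xarray-spatial; the helper names `chunk_partials` and `combine_partials` are hypothetical. It assumes each chunk is a pair of flat sequences (zone labels and values) and keeps only aggregates that can be merged across chunks.

```python
import math


def _empty():
    # Neutral starting point for the mergeable aggregates.
    return {"sum": 0.0, "count": 0, "min": math.inf, "max": -math.inf}


def chunk_partials(zones, values):
    """Compute per-zone partial aggregates for a single chunk."""
    partials = {}
    for zone, value in zip(zones, values):
        p = partials.setdefault(zone, _empty())
        p["sum"] += value
        p["count"] += 1
        p["min"] = min(p["min"], value)
        p["max"] = max(p["max"], value)
    return partials


def combine_partials(per_chunk):
    """Merge the per-chunk partials and derive the final stats per zone."""
    merged = {}
    for partials in per_chunk:
        for zone, p in partials.items():
            m = merged.setdefault(zone, _empty())
            m["sum"] += p["sum"]
            m["count"] += p["count"]
            m["min"] = min(m["min"], p["min"])
            m["max"] = max(m["max"], p["max"])
    # The mean is derived last, from the merged sum and count.
    return {
        zone: {**m, "mean": m["sum"] / m["count"]}
        for zone, m in sorted(merged.items())
    }


# Two chunks of (zone labels, values); zone 1 spans both chunks.
chunks = [
    ([0, 0, 1], [1.0, 3.0, 10.0]),
    ([1, 1, 2], [20.0, 30.0, 5.0]),
]
stats = combine_partials(chunk_partials(z, v) for z, v in chunks)
```

In this sketch the mean is never averaged across chunks directly. It is rebuilt from the merged sum and count, which is what makes the per-chunk computation safe to combine.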

This also limits the stats supported in the dask case to a subset of the default stats. This is safer because a custom statistic is not always element-wise and can therefore produce unexpected results when computed per chunk.
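One way to read this restriction: some statistics cannot be recovered from per-chunk results. The following standard-library sketch (not taken from the PR; the data is made up for illustration) compares a sum, which combines cleanly across chunks, with a median, which does not.

```python
import statistics

# One zone whose values are split across two chunks (illustrative data).
chunks = [[1, 2, 9], [3, 4]]
all_values = [v for chunk in chunks for v in chunk]

# Sum: combining the per-chunk sums gives the global sum.
sum_of_sums = sum(sum(chunk) for chunk in chunks)
global_sum = sum(all_values)

# Median: combining the per-chunk medians does not give the global median.
median_of_medians = statistics.median(statistics.median(c) for c in chunks)
global_median = statistics.median(all_values)

print(sum_of_sums, global_sum)            # the two sums agree
print(median_of_medians, global_median)   # the two medians differ
```

A user-supplied function could behave like the median here, which is why applying it chunk by chunk may silently give a wrong answer.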

nodata_zones is removed because we already support zone_ids and exclude invalid values (nan, inf) from the calculations.
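As a rough sketch of why zone_ids plus filtering of invalid values can stand in for a separate nodata argument, the snippet below selects only the requested zones and drops non-finite values before any statistic is computed. The function `valid_zone_values` is a hypothetical helper written for this illustration, not part of xarray-spatial.

```python
import math


def valid_zone_values(zones, values, zone_ids=None):
    """Group finite values by zone, keeping only zones listed in zone_ids."""
    keep = None if zone_ids is None else set(zone_ids)
    grouped = {}
    for zone, value in zip(zones, values):
        if keep is not None and zone not in keep:
            continue  # zone not requested, e.g. a "nodata" zone
        if not math.isfinite(value):
            continue  # skip nan, inf and -inf
        grouped.setdefault(zone, []).append(value)
    return grouped


zones = [0, 1, 1, 2, 2]
values = [5.0, float("nan"), 2.0, float("inf"), 7.0]

# Zone 0 plays the role of a nodata zone here, so it is left out of zone_ids.
grouped = valid_zone_values(zones, values, zone_ids=[1, 2])
```

Leaving a zone out of `zone_ids` has the same effect as declaring it a nodata zone in this sketch.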

xrspatial/zonal.py

ianthomas23 · 2021-11-15T15:00:32Z

Just a few minor comments, otherwise it looks good to merge.

thuydotm · 2021-11-16T05:12:10Z

Thanks Ian, I just updated the code. I'll merge into master once all the tests have passed.

thuydotm added 5 commits October 27, 2021 16:45

safely removed nodata_zones arg

2c6d6ea

dask zonal stats

b2d1081

dask case: support zone_ids

46b6505

refactor

8039d63

update docs

5cd5fe4

thuydotm requested a review from ianthomas23 November 15, 2021 08:31

thuydotm added the "ready to merge" label Nov 15, 2021

ianthomas23 reviewed Nov 15, 2021

xrspatial/zonal.py (outdated, resolved)

ianthomas23 reviewed Nov 15, 2021

xrspatial/zonal.py (outdated, resolved)

ianthomas23 reviewed Nov 15, 2021

xrspatial/zonal.py (outdated, resolved)

ianthomas23 reviewed Nov 15, 2021

xrspatial/zonal.py (outdated, resolved)

clean code

632c54a

thuydotm merged commit 9d2ee7c into master Nov 16, 2021

thuydotm deleted the zonal_stats_dask_speedup branch December 23, 2021 06:03
