
feat: new blockwise for large data #707

Open · wants to merge 3 commits into main
Conversation

adebardo (Contributor)

Lots of work still to do, but we will use this PR as the basis for tomorrow's discussion.

@adebardo (Contributor Author)

[image attachment]

@rhugonnet (Member) commented Mar 29, 2025

Great! 🙂

I went through all the code, as we mentioned in our discussion yesterday. All good on the functionality. In terms of the structure and consistency with the rest of the package, here are my remarks (difficulty level of each change in parentheses):

  1. (Easy) As mentioned in the call, we should likely add plane_fit_ransac as an option of apply(), keeping the behaviour otherwise consistent with BlockwiseCoreg,
  2. (Easy-Moderate) I don't understand why reference_dem and to_be_aligned_dem need to be passed at __init__. The tiling_grid could be defined during fit(), as it only relates to reference_dem (and not to the coregistration method itself), and the profile could be saved there too?
  3. (Moderate) After reviewing the functions, it would be a bit of effort, but not as difficult as I anticipated, to integrate these changes into BlockwiseCoreg directly, and to trigger the out-of-memory behaviour by passing a MultiprocConfig to fit() and apply() (see the usage sketch just below).
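
To make 3. concrete, here is a minimal usage sketch of what triggering the out-of-memory path could look like. Everything here is assumed for illustration: the import path of MultiprocConfig, its constructor arguments, and the mp_config keyword are hypothetical, not the final API.

```python
# Hypothetical usage sketch only: the MultiprocConfig import path, its
# constructor arguments, and the mp_config keyword are assumptions.
import xdem
from geoutils.raster import MultiprocConfig  # assumed import path

reference_dem = xdem.DEM("reference_dem.tif")
to_be_aligned_dem = xdem.DEM("to_be_aligned_dem.tif")

# Tile size and output file for out-of-memory processing.
mp_config = MultiprocConfig(chunk_size=1024, outfile="aligned_dem.tif")

blockwise = xdem.coreg.BlockwiseCoreg(step=xdem.coreg.NuthKaab(), subdivision=16)

# Passing the config would trigger the out-of-memory behaviour; omitting it
# would keep the current in-memory behaviour.
blockwise.fit(reference_dem, to_be_aligned_dem, mp_config=mp_config)
aligned_dem = blockwise.apply(to_be_aligned_dem, mp_config=mp_config)
```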

For 3.: Why? I realized we actually don't need to mirror the full complexity of #525 for BlockwiseCoreg. This is thanks to BlockwiseCoreg having its own fit()/apply() functions independent of Coreg.fit()/apply().
As BlockwiseCoreg splits the input array very early on (and once we have blocks, we don't need any memory considerations), we don't need as many complex out-of-memory processing steps during the rest of the coregistration as we do for the generic Coreg.fit()/apply() steps in #525. For instance, ignoring the preprocessing, coreg.NuthKaab() still requires out-of-memory subsampling, then out-of-memory interpolation that re-runs at each iteration of the algorithm, depending on the shift of that iteration.
Here, we'd only need two simple, one-time, out-of-memory steps at the very beginning, and that's it! Those are:

  • Reprojecting the to-be-aligned DEM onto the reference grid, written to disk as the "reprojected_dem" file,
  • Creating the "inlier_mask", also written to disk.

In short: we could pass the MultiprocConfig down to the preprocess_fit/apply functions, and define filenames in MultiprocConfig for these two "break points" (which will create a file for the "reprojected_dem" and for the "inlier_mask"). Then we just continue the rest of the coregistration with these two files!
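
As a rough, self-contained sketch of that idea (none of the names below are the real xdem/geoutils API, and the reprojection is simplified to a plain tile-by-tile copy), the preprocessing would write the two break-point files block-by-block and hand the rest of the coregistration their paths:

```python
# Hypothetical sketch of the two one-time out-of-memory "break points".
# All names are illustrative, not the real xdem/geoutils API.
from dataclasses import dataclass

import rasterio
from rasterio.windows import Window


@dataclass
class BreakPointConfig:
    """Stand-in for the filenames a MultiprocConfig could carry."""
    chunk_size: int
    reprojected_dem_path: str
    inlier_mask_path: str


def _write_blockwise(src_path: str, dst_path: str, chunk_size: int) -> None:
    """Copy a raster to disk tile-by-tile, never holding it fully in memory."""
    with rasterio.open(src_path) as src, \
            rasterio.open(dst_path, "w", **src.profile) as dst:
        for row in range(0, src.height, chunk_size):
            for col in range(0, src.width, chunk_size):
                win = Window(col, row,
                             min(chunk_size, src.width - col),
                             min(chunk_size, src.height - row))
                dst.write(src.read(window=win), window=win)


def preprocess_blockwise(tba_dem_path: str, mask_path: str,
                         config: BreakPointConfig) -> tuple[str, str]:
    """Break point 1: reproject the to-be-aligned DEM onto the reference grid
    (simplified here to a plain copy). Break point 2: write the inlier mask.
    The rest of the coregistration then only reads blocks from these files."""
    _write_blockwise(tba_dem_path, config.reprojected_dem_path, config.chunk_size)
    _write_blockwise(mask_path, config.inlier_mask_path, config.chunk_size)
    return config.reprojected_dem_path, config.inlier_mask_path
```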

So, in short, I think 1-2. shouldn't be a problem.
For 3., it's a bit more effort but not as much as initially thought, and would be ideal to avoid splitting into two classes. I'm happy to help out on this to finalize it in time 😉

Finally, I also have one question: is there any core difference between our interp_points() and resample_dem()?
We have a grid-resample wrapper based on interp_points() (tested to match GDAL.warp exactly within the same CRS), which would be easy to run with multiprocessing:

`_reproject_horizontal_shift_samecrs()`

We generally prefer to use this interp_points() implementation for consistency (see the usage sketch after this list), because of:

  • Its integrated nodata propagation scheme, which depends on the resampling method, to avoid unexpected outliers,
  • Its understanding of pixel interpretation (center or corner of pixel), to avoid unexpected shifts,
  • It being the resampling used by the coregistration algorithms, which ensures the resulting DEM alignment is fully consistent (otherwise, with a different resampling here, re-running the coregistration algorithm would likely find another shift again instead of (0,0,0)).
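
For reference, a minimal usage sketch of interp_points() (the exact signature may vary between geoutils versions, and the file path is a placeholder):

```python
# Minimal sketch: sample a DEM at arbitrary points with interp_points().
# The exact signature may differ between geoutils versions.
import numpy as np
import geoutils as gu

dem = gu.Raster("dem.tif")  # placeholder path

# Points given in the raster's CRS; interp_points propagates nodata and
# accounts for pixel interpretation (center vs corner), as described above.
x = np.array([500000.0, 500100.0])
y = np.array([4500000.0, 4500050.0])
values = dem.interp_points((x, y), method="linear")
```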

Some of the issues I mention above sound like the ones you described in the call, so this might be part of the solution.
In terms of performance of our resampling: the default "bilinear" sampling relies on scipy.ndimage.map_coordinates (to harness the regular-grid nature of a raster, where the X and Y axes each have uniform spacing), and is therefore much faster than Xarray/SciPy's interpn (which works on a rectilinear grid, where the X and Y coordinates can be arbitrary).
It's likely not as fast as cars-resample, but shouldn't be too far off 😉
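
To illustrate why the regular-grid assumption helps (a simplified sketch, not the actual interp_points implementation): on a regular grid, map coordinates convert to fractional pixel indices through a simple affine transform, so scipy.ndimage.map_coordinates can interpolate directly, without the per-axis coordinate lookups that interpn needs for a rectilinear grid.

```python
# Simplified sketch (not the xdem/geoutils implementation): bilinear sampling
# on a regular grid via scipy.ndimage.map_coordinates.
import numpy as np
from scipy.ndimage import map_coordinates


def bilinear_sample(arr, x, y, x0, y0, dx, dy):
    """Sample `arr` at map coordinates (x, y) on a regular grid whose
    top-left pixel center is (x0, y0) with pixel sizes (dx, dy)."""
    # Converting map coordinates to fractional pixel indices is a
    # constant-time affine transform per point on a regular grid.
    cols = (x - x0) / dx
    rows = (y - y0) / dy
    # order=1 selects bilinear interpolation.
    return map_coordinates(arr, [rows, cols], order=1, mode="nearest")


# Tiny example: 4x4 grid, origin (0, 0), 1 m pixels.
grid = np.arange(16, dtype=float).reshape(4, 4)
print(bilinear_sample(grid, np.array([1.5]), np.array([2.5]), 0.0, 0.0, 1.0, 1.0))
# -> [11.5]
```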
