-
-
Notifications
You must be signed in to change notification settings - Fork 2.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
netCDF: optimize loading of network located dataset with unlimited dimension #7737
Conversation
ef8e145
to
8be3d1a
Compare
int Taken = 0; | ||
|
||
for (int i = 0; i < nd - 2; i++) | ||
{ | ||
int result; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
isn't result
being used uninitialized if the following condition is false?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
no, it is also initialized in the else branch at line 2055
…ge (no functional change)
…nd way (helps with network accesses)
…variables of unlimited dimensions on network access for performance reasons Helps for https://lists.osgeo.org/pipermail/gdal-dev/2023-May/057209.html
8be3d1a
to
0ae885e
Compare
this is looking good, I'm not getting blocked from usage now :0 one side question, when I don't use the '/vsicurl' prefix on the bare url I get these vsimem truncated subdataset forms:
with the prefix it's correct
is that just a "don't use bare url" situation? |
yes, the HTTP driver is a bit of a hack, and is for one-time usage of small remote files, as it creates either a in-memory (or on-disk as fallback) temporary file with the whole content of the file before passing it to any other driver that accepts it. This doesn't fly well with subdatasets, at least in a command line usage. Within the same process, and if you keep the original dataset opened, opening the subdatasets could potentially work, but I'm not totally sure. |
awesome, as always much appreciated 🤘 |
Helps for https://lists.osgeo.org/pipermail/gdal-dev/2023-May/057209.html
The time dimension is unlimited, and is the one used to determine GDAL bands. Retrieving the values of the time variable for the metadata requires scanning a lot of places within the file, which is network access unfriendly.
So:
All that combines enables access of a given band with a more reasonable amount of HTTP access.
gdalwarp "vrt://netCDF:\"/vsicurl/http://localhost:8080/tos_day_ACCESS1-3_rcp85_r1i1p1_20560101-20651231.nc\":tos?bands=1" out2.tif -overwrite