-
Notifications
You must be signed in to change notification settings - Fork 29
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ProjectIDs on individual records, rather than a dataset as a whole #836
Comments
Adding a multivalue I'd suggest we support any project ID but give clear guidance on how people should refer to GBIF-issued project IDs (e.g. Would that be desirable? If so, we should move this request into gbif/pipelines. * note |
I would think it desirable, thanks - and agree about a prescribed syntax for the record level. Some follow-up considerations, just off the top of my head:
|
@dagendresen FYI |
Moving this into pipelines then. |
Great @ahahn-gbif, we are highly interested in adopting this solution as with the BID project we had to create many matadata only datasets to fullfill the BID report needs |
I was exploring what we need to do in the development side in pipelines. We already have a
Then we need to adapt the IPT, search, downloads, portal, etc. and the field will be used as the other multivalue fields that we already have. Is there anything that we are missing or has to be done differently? |
@marcos-lg This means that projectID from metadata will also become a multivalue field? That also will be useful as a collection o monitoring programs will have multiple financial sources across the years. |
@camiplata we can make the projectID from the metadata multivalue too but it has more implications so we need to plan it more carefully. I created this issue in the IPT so we can track it gbif/ipt#1927 |
I concur that allowing adding projectID to individual occurrences would be very useful to monitor/acknowledge contribution of various projects to bigger datasets. Right now, projects can only refer to metadata only datasets, which is not really representative of their data mobilzation work. |
While this may be over-interpreting the current suggestion, I can see this approach being very useful in a number of contexts for GBIF, e.g.
|
@debpaul this reminds me of your question at tdwg/dwc-qa#199 |
Deployed to PROD. |
Idea/wish captured from feedback of the regional support contractors (BID) to GBIFS:
"It is being defined with SiB Colombia how to identify in each record of a dataset its link with the BID project, within the framework of the publication of data from partner organizations/collections in the Colombian BID-CA2020 projects. The use of DwC fields such as datasetID or datasetName has been proposed by the Regional Support, but in some cases that could create conflict when the field was filled with previous data. GBIF is encouraged in building its new data model to look for a more effective mechanism to accomplish this and clarify it for the BID projects (and project partners)."
There are two main reasons for this request:
Unfortunately, this is not easy – individual records would have to carry the project ID right from the point they are captured at record level – our transfer schema does not really allow for that. We are presently getting around the delivery-reporting requirement, e.g. in cases where records are published through eBird or iNaturalist, by requesting an explicit report on the data published in the project context. This is only for internal evaluation though. The second part is not easily possible, since there is no “project ID” field at record level.
Open question: do the benefits outweigh the added requirements, including internal data management and UI needs for surfacing this information?
The text was updated successfully, but these errors were encountered: