-
Notifications
You must be signed in to change notification settings - Fork 16
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Possible examples and need for sharing mixed observation and vouchered specimen record datasets #4432
Comments
Realize I'm not the people you were reaching out to on this but I am currently working with a data provider from USDA with exactly this type of information collected. For one project they have vouchered specimens ( For your other thoughts- users get back data of mixed Hopefully GBIF-S will correct me where I've misspoken. |
Marvelous, your response is wonderful and timely. Thank you for taking the time to offer your experience, and examples! Much appreciated. We can both look forward to further insights from GBIF-S. |
Thanks @debpaul @albenson-usgs I'm sure others will jump in, but I will make a start and try and provide some background information. Firstly, Abby is correct in her reply (thank you). BasisOfRecord can be mixed within a dataset and is commonly varied in a download. Secondly, we're aware that the current dataset classes are insufficient and confusing, and we have a discussion underway to provide a better categorization of datasets. An example of where confusion appears is that checklists can have occurrence data, and an occurrence dataset can hold IDs for sampling events in the occurrence core. The origin of the dataset classes comes from the core record type used in the DwC-A format. Checklists using Taxon, Occurrence using Occurrence and Sampling Event using Event. However, that is not enforced by the system and it just represents the option chosen when registering a dataset (i.e. "I am registering a new dataset of Please also note that the dataset type does not appear in the occurrence search interface of GBIF, nor the occurrence API. They are all occurrence records at this point, with a basisOfRecord. It only appears in the dataset listing and search along with summary statistics (e.g. by country). So where does that leave you? My advice would be to:
With that said, I may be out of touch and you may get a revised suggestion on 4). |
Hi Deb, All, Thanks, @albenson-usgs and @timrobertson100 for your responses - spot on! Just to add that yes, it would indeed be preferable to use the "sampling event" class for datasets that have well-documented sampling methodology and other relevant metadata. This said, with the current limitations of the DwC-A "star schema" (linked extensions only one level deep), the decision is often rather based on the extension data that should/need to be included. As Tim says, we are aware of that particular limitation, and working on solutions in model discussions. |
Greetings GBIF,
RE: datasets containing both observation records (e. g. remote monitoring, human observation) and vouchered specimen data records
Scenario: In monitoring native bees for a given region on the planet (i. e. humans looking at bees and identifying them, maybe imaging them) data are to be collected in a spreadsheet. To start with, for each morphospecies encountered at a given monitoring site, a single specimen will be collected and then vouchered in a natural science collection. Data about this specimen will be entered in a record row in the aforementioned spreadsheet.
Questions:
Other thoughts
@MattBlissett @timrobertson100 and GBIF folks thanks for your help. Also please note I wasn't sure which GBIF repository this ticket would go in. Perhaps it will need to be moved to a different repo. Tagging @seltmann
The text was updated successfully, but these errors were encountered: