Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

MIDS Element - MaterialType #14

Open
RBGE-Herbarium opened this issue Jan 20, 2021 · 15 comments
Open

MIDS Element - MaterialType #14

RBGE-Herbarium opened this issue Jan 20, 2021 · 15 comments
Labels
MIDS Element Defines/tracks a MIDS information element that appears at one or more MIDS levels status: not accepted at present

Comments

@RBGE-Herbarium
Copy link

RBGE-Herbarium commented Jan 20, 2021

MIDS information element MaterialType
Definition The material the object is composed of.
DwC term (latest, 2014-11-08)
ABCD term name (3.0)
Applicable standard(s)/recommendation(s)
Element identifier
Required Yes
Repeatable No
Constraints Controlled vocabulary
Examples To be added
Element specification status agreed; accepted in specification
Notes Definition of controlled vocabulary is needed.
@RBGE-Herbarium
Copy link
Author

CETAF DWG Discussion:

Preparation not searchable in GBIF. Element required to find specimen in Institute. Element also used to determine digitisation cost and pipeline

@RBGE-Herbarium
Copy link
Author

GBIF data in gbif_export_20200611_v2

SELECT count(*) FROM elspeth-mids-gbif-280011.gbif_export_20200611_v2.occurrence as oc
where oc.v_preparations is not null

Result: 66,648,857 records

SELECT oc.v_preparations, oc.institutionCode, count(oc.v_preparations) FROM elspeth-mids-gbif-280011.gbif_export_20200611_v2.occurrence as oc
where oc.v_preparations is not null
and oc.institutionCode in ("W","MNHN","NHMD","TAMZ","TU(M)","MZH","MfN","BGBM","snmb","STU","ZFMK","National Museum of the Czech Republic","HNHM","MNHNL","Naturalis Biodiversity Center","NHMO","MIZPAN","MNCN","MA","GNM","GB","GBG","G","MHNG","NHMUK","National Museums Scotland","E","K","SAV")
group by oc.institutionCode, oc.v_preparations

bq-results-20210120-122837-ydhq0a99j5dl.xlsx

@RBGE-Herbarium
Copy link
Author

RBGE-Herbarium commented Jan 20, 2021

Recommendations for controlled vocabulary and standards

We could consider creating a definition of this element that aligns with the work of DiSSCo and iDigBio and GBIF as much as possible and provide recommendations for standards and controlled vocabularies such as the one below. This links with the work of the TDWG CD Group (https://github.com/tdwg/cd).

Join the Dots and the Collections Digitisation Dashboard use the list below as a draft. There isn't a consensus around definitions and e.g. object type vs preservation method.

preservationMethod list at the moment from CDD:

  • Artefacts: climate controlled conditions
  • Artefacts: non climate controlled conditions
  • Cores
  • Cryopreserved/frozen -80C
  • Cryopreserved DNA/RNA
  • Cut/polished gemstones
  • Dried
  • Dried - assembled
  • Dried - not assembled
  • Dried and pinned
  • Fluid preserved
  • Fluids
  • Fossils preserved in amber, natural resin
  • Hazardous material/objects
  • Macrofossils (dry preserved)
  • Macrofossils (fluid preserved)
  • Macro-objects
  • Mesofossils (dry preserved)
  • Mesofossils (fluid preserved)
  • Microfossils (dry preserved)
  • Microfossils (fluid preserved)
  • Micro-objects
  • Microscopic slides
  • Other
  • Other geo/biodiversity
  • Oversized fossils
  • Oversized objects
  • Pressed and dried
  • Spore print
  • Unspecified

@RBGE-Herbarium
Copy link
Author

RBGE-Herbarium commented Jan 20, 2021

RBGE currently submit the following to GBIF:

v_preparations | institutionCode | f0_

wood sample | E | 60
other | E | 85
photograph of unspecified type (including photocopy) | E | 4586
DNA sample | E | 430
liquid-preserved material | E | 6855
herbarium specimen of unspecified type | E | 887309
seed | E | 36
fruit collection/cone collection (unmounted) (carpological collection) | E | 3310
chromosome / cytological specimen | E | 3
medicine/drug (prepared or semi-prepared sample used medicinally) | E | 21
bark | E | 52
herbarium sheet | E | 6

I think that the comment above about the confusion between the object type vs preservation method is significant. This needs to be discussed by the CD groups.

In terms of use cases the following are relevant for this element:

Finding the specimen - specimens are filed by taxonomic group, object type and preservation method. Thus a conifer specimen may be filed as:

  • Dried (but may be carpological, or wood sample and these are all held separately)
  • Pressed and Dried (this would refer to herbarium sheets for us)
  • Fluid preserved (held in a separate location within institute)
  • Microscope slides
  • Oversized objects (but Dried carpological outisize objects are held separately from Pressed and Dried outsize herbarium sheets)
  • Seed

Determining the suitability of the specimen for research (eg DNA extraction)

  • Dried
  • Fluid preserved
  • Spore print
  • etc

However, I think that RBGE could look at using a system that would either follow the list above or be easily mapped to it.

@hardistyar hardistyar added MIDS-1 Info element appears at MIDS level 1 MIDS-2 Info element appears at MIDS level 2 MIDS-3 Info element appears at MIDS level 3 MIDS Element Defines/tracks a MIDS information element that appears at one or more MIDS levels status: not yet discussed Status value of MIDS Element definition labels Jan 20, 2021
@emhaston
Copy link
Contributor

emhaston commented Feb 1, 2021

GeoCase Specimen Type

image

@hardistyar hardistyar added status: under discussion Status value of MIDS Element definition and removed status: not yet discussed Status value of MIDS Element definition labels Feb 1, 2021
@falkogloeckler
Copy link

falkogloeckler commented Feb 1, 2021

SpecimenTypes in GeoCASe

Please be aware that these terms are not yet based on controlled vocabulary. The list is just a facette of indexed raw terms as provided by the different institutions.

@only1chunts
Copy link

only1chunts commented Mar 4, 2021

Could be split into 3 terms:

MIDS level1:
collection type - e.g collectionType from NCD:
Archival | Art | Audio | Cell Cultures | Electronic | Facsimiles | Fossils | Genetic | Geological | Herbarium | Living | Manuscripts | Mineralogical | Observations | Preserved | Products | Specimens | Texts | Tissue | Visual

MIDS level1:
preservation type - e.g. the way in which the material has been preserved
There must be some controlled vocabularies for this somewhere?

MIDS level2:
material type - e.g. material entity from OBIB
https://www.ebi.ac.uk/ols/ontologies/obib/terms?iri=http%3A%2F%2Fpurl.obolibrary.org%2Fobo%2FBFO_0000040&viewMode=All&siblings=false

@smrgeoinfo
Copy link

smrgeoinfo commented Apr 6, 2021

I'm working on a cross domain physical specimen (sample) metadata scheme for the iSample project. Scope includes earth/environmental science, archaeology/anthropology and biology. After studying various sample description schemes from these domains, we're focusing on a core metadata scheme with high level categories for specimenType, materialType, and sampledFeatureType with a controlled vocabulary of 10-20 classes for each type. The design then accommodates domain specific categories extending these type for more granular searching. SpecimenType is concerned with the kind of object (specimen)-similar to MIDS level 1 proposed above, and materialType with what the object (specimen) is composed of-- simlilar to MIDS level 3 proposed above. SampledFeature categories serve to identify broad context for the collection event. This is a work in progress, and we're interested in as much alignment as possible with TDWG work.

Preservation type is pretty specific to biological samples. I looked at the GBIF oc.v_preparations from the query in the comment above, and found quite a variety of things there, many of which I'd suggest are specimen types, roughly: object, biological specimen, tissue, animal, whole animal, animal part, tooth, bone, bird, bird part, plant, plant part, seed, tree part, DNA, human remains, bird nest, egg, bird nest with egg, cast, slide (microscope?), drug, image

@wouteraddink
Copy link
Contributor

Hi Stephen that is interesting. The approach based on decision trees may help in providing guidelines or automated identification of the term needed. The current schema lacks detail however for biological specimen, for example feature of interest can also be a product of an organism such as a birds nest, and besides sampling of whole plants or leaves also other things can be sampled like pollen.

@smrgeoinfo
Copy link

@wouteraddink yes, things like bird's nest are a problem; seems like that would be in a sampled feature category, maybe with other things like cocoon, spider web, . Stuff made by human organisms is already accounted for (specimen type artifact). Also problematic is the material type for bone, egg shell, mollusc/clam/snail type shells.
I was thinking pollen could be considered a part of a plant (specimen type organism part)?

@smrgeoinfo
Copy link

I added a new category on Sampled FEature for 'animal product' (Sampled feature is the product of an animal other than human being, e.g. bird nest, egg, cocoon, fecal matter, dung ball.) to account for Birds nest etc.

@hardistyar hardistyar added this to the MIDS level 1 proposal milestone Jul 28, 2021
@hardistyar hardistyar added status: accepted in specification Status value of MIDS Element definition status: agreed Status value of MIDS Element definition and removed status: under discussion Status value of MIDS Element definition labels Jul 29, 2021
hardistyar added a commit that referenced this issue Jul 29, 2021
Issues #10, #11, #14, #44, #45 agreements carried into text. Proposed draft text to meet milestone [MIDS level 1 proposal](https://github.com/tdwg/mids/milestone/1
@hardistyar
Copy link
Contributor

It's been pointed out by @mswoodburn that I missed the material property from the CD standard work when I was preparing my comparison of terms used by different initiatives.

This is another direct cross-connection into the CIDOC-CRM standard via E57 Material.

@smrgeoinfo
Copy link

smrgeoinfo commented Aug 2, 2021

E57 Material does not seem to have a definition, and inclusion of 'brick' (which by most reckoning would be an Object I suspect) and 'gold' as examples looks to be incoherent.

@wouteraddink
Copy link
Contributor

iSamples version is now: https://github.com/isamplesorg/metadata/blob/main/vocabulary/MaterialTypeDecisionTreev3.pdf
IGSN is using http://vocabulary.odm2.org/medium/ which also seems to fit. Most specimens would be 'organism' in that one and in the iSamples version would be 'organic material'. 'Organism' seems closer to ObjectType (countable thing). the iSamples approach seems cleaner, however interoperability with IGSN would be a nice to have. Both are missing a category for meteorites. In geocase there is no field for this, however geocase distinguises between fossils, minerals, rocks and meteorites.

@matdillen
Copy link
Contributor

This type of information is often currently available/known at a dataset or collection level, although not in a machine-readable manner. Specimens are published in sets that have similar material types and these types are considered to be evident (to humans) based on the dataset title, keywords or provenance.

Implementations of the currently in development Latimer Core would enable machine readability and facilitate MIDS calculation using data from the dataset/collection level, rather than individual specimen records. The downside to this approach is that some collections/datasets are heterogeneous and would not be covered, although it could be argued that these sets, in the absence of any material-and-other-type information at record level, are not worthy of MIDS > 0.

Some sets are also mostly homogeneous, but with a few artifacts/curiosities/errors. In this case, the MIDS score will at least be 'mostly' correct - but this may not be where we want to go. Although MIDS has been defined from the start to be agnostic about data quality, so herbarium sheets mislabeled as preserved birds would not violate MIDS conditions.

@emhaston emhaston removed the MIDS-1 Info element appears at MIDS level 1 label May 5, 2022
@emhaston emhaston added status: not yet discussed Status value of MIDS Element definition and removed status: agreed Status value of MIDS Element definition status: accepted in specification Status value of MIDS Element definition labels Sep 1, 2022
@emhaston emhaston removed this from the MIDS level 1 proposal milestone Sep 1, 2022
@emhaston emhaston removed the MIDS-2 Info element appears at MIDS level 2 label Oct 14, 2022
@emhaston emhaston added status: not accepted at present and removed MIDS-3 Info element appears at MIDS level 3 status: not yet discussed Status value of MIDS Element definition isMaterial labels Dec 15, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
MIDS Element Defines/tracks a MIDS information element that appears at one or more MIDS levels status: not accepted at present
Projects
None yet
Development

No branches or pull requests

7 participants