Here are instructions for structuring incoming data and preparing it for ingest:
- The loader script expects columns named according to the values in the data/columns.csv file. Data files must be in CSV (comma-separated value) format.
- basis_of_record: controlled vocabulary
- certainty: controlled vocabulary
- prediction_class: controlled vocabulary
- trait: controlled vocabulary (see the "trait" column)
- datasource: controlled vocabulary covering all datasources we are working with. Put each new datasource in a directory named after the datasource itself; for example, "sample" goes in a directory called "data/sample". Only commit data files with fewer than 10,000 records to GitHub; all others will be added to the .gitignore file.
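The checks above can be scripted before loading. The sketch below assumes data/columns.csv lists one expected column name per line (the exact layout of that file is an assumption), verifies a data file's header against it, and counts records to apply the 10,000-record GitHub limit:

```python
import csv

def expected_columns(columns_file):
    """Read the expected column names, assumed one per line, from columns.csv."""
    with open(columns_file, newline="") as f:
        return [row[0].strip() for row in csv.reader(f) if row]

def check_data_file(data_file, columns_file, record_limit=10_000):
    """Report missing columns and whether the file is small enough for GitHub."""
    expected = set(expected_columns(columns_file))
    with open(data_file, newline="") as f:
        reader = csv.reader(f)
        header = next(reader, [])
        n_records = sum(1 for _ in reader)  # data rows, excluding the header
    return {
        "missing_columns": sorted(expected - set(header)),
        "record_count": n_records,
        "commit_to_github": n_records < record_limit,
    }
```

Files flagged with `"commit_to_github": False` should go into .gitignore rather than the repository.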
We can visualize loaded data at our beta Phenobase Query Page.
```shell
# Load the 09.06.2024 snapshot and do not drop existing records (the default is False)
python loader.py /home/exouser/code/phenobase_data/data/iNaturalist.09.06.2024 False
```
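The second positional argument controls whether existing records are dropped before loading. The loader's actual argument handling is not shown here, but the invocation above is consistent with a sketch like this (parse_args and its behavior are assumptions, not the real loader.py code):

```python
def parse_args(argv):
    """Hypothetical sketch of the loader's CLI: the first argument is the data
    directory, the second an optional drop-existing flag that defaults to
    False (i.e. keep existing records)."""
    data_dir = argv[1]
    drop_existing = len(argv) > 2 and argv[2].lower() == "true"
    return data_dir, drop_existing
```

Under this reading, `parse_args(["loader.py", "data/sample", "False"])` keeps existing records, matching the comment in the command above.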