Releases: NeotomaDB/DataBUS

DataBUSv1.0.0

01 Dec 18:32

DataBUS is a Python-based bulk uploader for the Neotoma Paleoecology Database. It helps users prepare, validate, and upload large sets of paleoecological records using a YAML + CSV template, validation routines, and an upload script that pushes data into a temporary holding database for subsequent ingestion into Neotoma.

Key Features (v1.0.0)

  • Template-based uploads: Define data using a standardized YAML + CSV template structure that maps CSV columns to the Neotoma database schema (tables and columns) via a “cross-walk.” This enables consistent and repeatable bulk uploads.
  • Validation suite: Before upload, DataBUS validates the submitted CSV data against the template definitions. Checks cover Site, Collection Unit, Analysis Unit, Dataset, Sample, data values, dating horizons, and more, reducing the risk of malformed or invalid uploads.
  • Automated upload script: Once validation passes, users can run a single command (python3 data_upload.py) to push data into either the neotomaholdingtank database or the main neotoma database.
  • Open-source & MIT licensed: DataBUS is released under the MIT license, enabling free use, modification, and redistribution under standard open-source terms. 
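
To make the cross-walk idea concrete, here is a minimal sketch. It is not DataBUS's own code: the `CROSSWALK` dictionary, the `apply_crosswalk` helper, and the table and column names are hypothetical stand-ins for what a YAML template might express, and the real template schema may differ.

```python
import csv
import io

# Hypothetical cross-walk: CSV column name -> (target table, target column).
# These names are illustrative assumptions, not the real DataBUS template schema.
CROSSWALK = {
    "Site Name": ("sites", "sitename"),
    "Latitude": ("sites", "latitude"),
    "Core Handle": ("collectionunits", "handle"),
}


def apply_crosswalk(csv_text, crosswalk):
    """Regroup each CSV row into {table: {column: value}} using the cross-walk."""
    rows = []
    for record in csv.DictReader(io.StringIO(csv_text)):
        grouped = {}
        for csv_column, (table, column) in crosswalk.items():
            grouped.setdefault(table, {})[column] = record[csv_column]
        rows.append(grouped)
    return rows


SAMPLE_CSV = "Site Name,Latitude,Core Handle\nExample Lake,45.5,EXLAKE\n"
mapped = apply_crosswalk(SAMPLE_CSV, CROSSWALK)
print(mapped[0]["sites"]["sitename"])
```

Keeping the mapping in data rather than in code is what would let the same CSV layout be reused across uploads without editing the uploader itself.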

Known limitations / Scope & Considerations

  • DataBUS currently expects templates in YAML + CSV format, and the data itself must be supplied as CSV.
  • Users must follow the template rules carefully (column names, vocabularies, types, etc.); misconfigured templates or CSVs may result in validation failures.
  • Because this is the first official release, the tool is expected to keep evolving; future versions might include usability enhancements, more automated checks, or UI tooling.
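
The kinds of template rules listed above (column names, vocabularies, types) can be illustrated with a small pre-flight check. This is a sketch under stated assumptions, not the DataBUS validation suite: the `RULES` structure, the `check_csv` helper, and the example vocabulary are all hypothetical.

```python
import csv
import io

# Hypothetical template rules: expected type and optional vocabulary per column.
# The column names and vocabulary are assumptions made for this example only.
RULES = {
    "Site Name": {"type": str},
    "Depth": {"type": float},
    "Depositional Environment": {"type": str, "vocab": {"lake", "bog", "fen"}},
}


def check_csv(csv_text, rules):
    """Return a list of problems found; an empty list means the CSV passed."""
    reader = csv.DictReader(io.StringIO(csv_text))
    header = reader.fieldnames or []
    missing = [name for name in rules if name not in header]
    if missing:
        # Without the expected columns, row-level checks are meaningless.
        return [f"missing column: {name}" for name in missing]
    problems = []
    for line_no, row in enumerate(reader, start=2):  # line 1 is the header
        for name, rule in rules.items():
            value = row[name]
            try:
                rule["type"](value)  # attempt the conversion the rule expects
            except (TypeError, ValueError):
                problems.append(
                    f"line {line_no}: {name}={value!r} is not {rule['type'].__name__}"
                )
            if "vocab" in rule and value not in rule["vocab"]:
                problems.append(f"line {line_no}: {name}={value!r} not in vocabulary")
    return problems


HEADER = "Site Name,Depth,Depositional Environment\n"
good = check_csv(HEADER + "Example Lake,10.5,lake\n", RULES)
bad = check_csv(HEADER + "Example Lake,ten,swamp\n", RULES)
print(good, bad)
```

Collecting every problem into a list, rather than stopping at the first one, lets a user fix a misconfigured CSV in one pass.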

Full Changelog: v0.0.1...v1.0.0

Alpha DataBUS release

23 Nov 19:29
e4e342e

This is the alpha release of DataBUS, including template generation and initial package development.

Full Changelog: https://github.com/NeotomaDB/DataBUS/commits/v0.0.1