What is it about?

Geospatial datasets have complex lineages that are crucial for reproducibility and for understanding data provenance, yet current metadata standards such as STAC (SpatioTemporal Asset Catalog) provide limited support for capturing complete processing workflows. We propose STACD (STAC extension with DAGs), an extension to the STAC specification that represents workflows as Directed Acyclic Graphs (DAGs) and records the algorithms used and the version changes within them. We also provide a reference implementation on Apache Airflow that demonstrates STACD capabilities: selective recomputation when some datasets or algorithms in a DAG are updated, complete lineage construction for a dataset, and the opportunities for improved collaboration and distributed processing that arise with this standard.
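
The two DAG capabilities named above, lineage construction and selective recomputation, can be illustrated with a minimal Python sketch. It is not the STACD schema or the Airflow reference implementation: the dataset names, the DEPENDS_ON mapping, and the functions lineage and stale_nodes are all hypothetical and chosen only for illustration. The sketch assumes the workflow is stored as a mapping from each derived dataset to the inputs it is computed from.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical workflow: each key is a derived dataset, each value is the set
# of inputs it is computed from. These names are illustrative assumptions.
DEPENDS_ON = {
    "ndvi": {"red_band", "nir_band"},
    "cloud_mask": {"red_band"},
    "masked_ndvi": {"ndvi", "cloud_mask"},
    "crop_map": {"masked_ndvi", "field_boundaries"},
}


def lineage(target, depends_on):
    """Return every upstream node that `target` is directly or indirectly derived from."""
    seen = set()
    stack = [target]
    while stack:
        node = stack.pop()
        for parent in depends_on.get(node, ()):
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen


def stale_nodes(updated, depends_on):
    """Return the downstream nodes to recompute, in a valid execution order."""
    # static_order() yields every node after all of its predecessors, so a
    # single pass is enough to propagate staleness through the DAG.
    order = TopologicalSorter(depends_on).static_order()
    stale = set(updated)
    to_recompute = []
    for node in order:
        if node in stale:
            continue  # the updated inputs themselves are already current
        if stale & set(depends_on.get(node, ())):
            stale.add(node)
            to_recompute.append(node)
    return to_recompute
```

With this mapping, stale_nodes({"cloud_mask"}, DEPENDS_ON) returns only masked_ndvi and crop_map and leaves ndvi untouched, which is the point of selective recomputation: nodes whose inputs did not change are skipped.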

Read the Original

This page is a summary of: STACD: STAC Extension with DAGs for Geospatial Data and Algorithm Management, October 2025, ACM (Association for Computing Machinery), DOI: 10.1145/3759536.3763803.
