SEDAR: A Semantic Data Reservoir for Heterogeneous Datasets

Sayed Hoseini; Ahmed Ali; Haron Shaker; Christoph Quix

doi:10.1145/3583780.3614753

What is it about?

SEDAR is an open-source semantic data lake built on big data technology. We establish a semantic layer that annotates data sources with concepts from knowledge graphs. A pipeline of components assists the user in creating semantic models. These models are then used to perform ontology-based data access (OBDA), a mechanism for querying the different underlying storage systems in the lake uniformly.
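
To make the OBDA idea concrete, here is a minimal sketch in Python. It assumes each source carries a semantic model that maps ontology concepts to its own field names, so that a single concept-level query can be answered across differently structured sources. The `Source` class, the `query` function, the `schema:` concept names, and the sample data are all hypothetical illustrations and are not taken from SEDAR's actual implementation or API.

```python
from dataclasses import dataclass


@dataclass
class Source:
    """A hypothetical data source with a semantic model attached."""
    name: str
    rows: list      # records as stored in the source
    mapping: dict   # ontology concept -> source-specific field name


def query(sources, concepts):
    """Answer a concept-level query uniformly across all given sources."""
    results = []
    for src in sources:
        # Skip sources whose semantic model does not cover every requested concept.
        if not all(c in src.mapping for c in concepts):
            continue
        for row in src.rows:
            # Translate each concept to the field name used by this source.
            results.append({c: row[src.mapping[c]] for c in concepts})
    return results


# Two sources with different schemas, annotated with shared concepts (assumed data).
csv_source = Source(
    name="patients.csv",
    rows=[{"pname": "Ada", "yob": 1990}],
    mapping={"schema:name": "pname", "schema:birthDate": "yob"},
)
json_source = Source(
    name="staff.json",
    rows=[{"full_name": "Bob", "dept": "IT"}],
    mapping={"schema:name": "full_name"},
)

names = query([csv_source, json_source], ["schema:name"])
```

In this sketch the caller only ever names ontology concepts; the per-source field names stay hidden behind the mapping. A real OBDA system would typically rewrite a SPARQL query into the native query language of each storage system rather than iterate over in-memory rows.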


Why is it important?

SEDAR is the first system to combine semantic modelling, scalable data management, and OBDA in one uniform system. In addition, it provides features such as MLOps, source-independent ingestion, metadata extraction, and a data catalog, and is intended to serve as a benchmark for future work on data lakes.

This page is a summary of: SEDAR: A Semantic Data Reservoir for Heterogeneous Datasets, October 2023, ACM (Association for Computing Machinery),
DOI: 10.1145/3583780.3614753.

