Content-based Union and Complement Metrics for Dataset Search over RDF Knowledge Graphs

Michalis Mountantonakis; Yannis Tzitzikas

doi:10.1145/3372750

What is it about?

Dataset search systems are mainly based on metadata and ignore the contents of datasets. However, tasks related to data integration and enrichment require the contents to be considered. For instance, dataset owners often want to enrich their dataset by selecting other datasets that provide complementary information. We propose an approach that relies on (a) a set of pre-constructed (and periodically refreshed) semantics-aware indexes, and (b) "lattice-based" incremental algorithms that exploit the posting lists of these indexes, as well as set theory properties, to enable efficient responses at query time. We also discuss the efficiency of the proposed methods by presenting comparative results, and we report measurements obtained with the proposed metrics for 400 real RDF datasets (containing over 2 billion triples).
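
As an illustration only, and not the authors' implementation, the following Python sketch shows how union (coverage) and complement counts could be derived from the posting lists of an entity index. The toy index, the dataset IDs, and all function names are hypothetical. Entities are first grouped by identical posting list, and a simple set-theory property is used to extend a subset of datasets by one dataset without recounting from scratch.

```python
from collections import Counter

# Hypothetical toy index: entity -> posting list (IDs of datasets containing it).
index = {
    "e1": {"D1", "D2"},
    "e2": {"D1"},
    "e3": {"D2", "D3"},
    "e4": {"D3"},
    "e5": {"D1", "D2", "D3"},
    "e6": {"D2"},
}

def direct_counts(index):
    """Group entities by identical posting list and count each group."""
    return Counter(frozenset(posting) for posting in index.values())

def coverage(subset, counts):
    """Size of the union of the datasets in `subset`:
    every entity whose posting list shares a dataset with `subset`."""
    subset = frozenset(subset)
    return sum(n for posting, n in counts.items() if posting & subset)

def extend_coverage(prev_cov, subset, new_dataset, counts):
    """Incremental step: coverage of `subset` plus `new_dataset`, reusing
    the coverage already computed for `subset`. Only entities that are in
    `new_dataset` and in none of the datasets of `subset` are added."""
    subset = frozenset(subset)
    gained = sum(
        n for posting, n in counts.items()
        if new_dataset in posting and not (posting & subset)
    )
    return prev_cov + gained

def complement(subset, target, counts):
    """Entities offered by the datasets in `subset` that `target` lacks."""
    subset = frozenset(subset)
    return sum(
        n for posting, n in counts.items()
        if posting & subset and target not in posting
    )

counts = direct_counts(index)
cov_d1 = coverage({"D1"}, counts)
cov_d1_d3 = extend_coverage(cov_d1, {"D1"}, "D3", counts)
new_for_d1 = complement({"D2", "D3"}, "D1", counts)
```

In this sketch, `cov_d1_d3` equals what `coverage({"D1", "D3"}, counts)` would return, and `new_for_d1` counts the entities an owner of the hypothetical dataset `D1` would gain from `D2` and `D3`.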

Why is it important?

The approach aims to improve the discoverability, interlinking, and reusability of datasets.

This page is a summary of: Content-based Union and Complement Metrics for Dataset Search over RDF Knowledge Graphs, Journal of Data and Information Quality, April 2020, ACM (Association for Computing Machinery), DOI: 10.1145/3372750.

Contributors

The following have contributed to this page

Yannis Tzitzikas
University of Crete