Iterative algorithms for the post-processing of high-dimensional data

Mike Espig; Wolfgang Hackbusch; Alexander Litvinenko; Hermann G. Matthies; Elmar Zander

doi:10.1016/j.jcp.2020.109396

What is it about?

We provide algorithms to compute the maxima, minima, the number of values in a given interval, frequencies, the mean, and the variance of a large high-dimensional data set. All these post-processing operations are carried out directly in the compressed data format (e.g., a low-rank tensor format). The particular representation is not essential: all algorithms are formulated in an abstract setting without reference to a specific compressed format.
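To make "working in the compressed format" concrete, here is a minimal sketch for one possible format, the canonical (CP) low-rank tensor format. The choice of format, the nested-list layout `factors[r][k]`, and all function names are assumptions made for illustration; they are not taken from the paper, which stays format-independent. The sketch computes the mean and the (population) variance from sums and dot products of the factor vectors only, so the full tensor is never formed.

```python
from math import prod

# Assumed layout (hypothetical): factors[r][k] is the k-th factor vector of
# the r-th rank-1 term, so the tensor is sum_r (v_r1 (x) v_r2 (x) ... (x) v_rd).

def cp_sum(factors):
    """Sum of all tensor entries: for each term, multiply the factor sums."""
    return sum(prod(sum(vec) for vec in term) for term in factors)

def cp_inner(a, b):
    """Inner product <a, b> of two CP tensors via factor-wise dot products."""
    return sum(
        prod(sum(x * y for x, y in zip(u, v)) for u, v in zip(term_a, term_b))
        for term_a in a
        for term_b in b
    )

def cp_mean_variance(factors):
    """Mean and population variance over all entries of the CP tensor."""
    n_total = prod(len(vec) for vec in factors[0])
    mean = cp_sum(factors) / n_total
    second_moment = cp_inner(factors, factors) / n_total
    return mean, second_moment - mean ** 2

# Rank-2 example of shape 2 x 3; the full tensor is [[1, 0, 1], [3, 1, 3]].
factors = [
    [[1, 2], [1, 0, 1]],
    [[0, 1], [1, 1, 1]],
]
mean, variance = cp_mean_variance(factors)
print(mean, variance)  # 1.5 1.25
```

The cost depends on the rank and on the lengths of the factor vectors, not on the total number of entries. Computing the variance as the second moment minus the squared mean is the simplest choice but can lose accuracy to cancellation when the variance is small relative to the mean.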


Why is it important?

The amount of data is growing constantly. Very often the data are high-dimensional (e.g., each sample or point is characterized by many features). This calls for a new type of algorithm that requires only linear storage and computational cost. We look at some common post-processing tasks that are too time- and storage-consuming in the uncompressed data format and not obvious in the compressed format: such huge data sets cannot be stored in their entirety, and the value of an element is not readily accessible through simple look-up.
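A rough storage count shows why the uncompressed format is out of reach. The sketch below compares the number of entries of a full tensor with the number of values stored in a canonical (CP) representation. The CP format, the rank `r = 10`, and the function names are assumptions chosen purely for illustration.

```python
def full_entries(n, d):
    """Entries of a full tensor with d dimensions and n points per dimension."""
    return n ** d

def cp_storage(n, d, r):
    """Values stored by a rank-r CP representation: r terms, d vectors of length n."""
    return r * n * d

n, d = 100, 300
r = 10  # hypothetical rank, not a value from the paper

# 100**300 equals 10**600, so the decimal string has 601 digits.
print(len(str(full_entries(n, d))) - 1)  # 600
print(cp_storage(n, d, r))               # 300000
```

For a fixed rank, the compressed storage grows linearly in both `n` and `d`, while the full tensor grows exponentially in `d`.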

Perspectives

Under certain assumptions, we will be able to solve very large high-dimensional problems, for instance of size 10^20 or 100^300. Such high-dimensional problems appear in chemistry and physics (Hartree-Fock, Schrödinger, or master equations).
Dr. Alexander Litvinenko
Rheinisch Westfalische Technische Hochschule Aachen

This page is a summary of: Iterative algorithms for the post-processing of high-dimensional data, Journal of Computational Physics, March 2020, Elsevier,
DOI: 10.1016/j.jcp.2020.109396.

Resources

Presentation
Efficient analysis of high-dimensional data
slides

Contributors

The following have contributed to this page

Dr. Alexander Litvinenko
Rheinisch Westfalische Technische Hochschule Aachen

How to compute level sets, histograms, maxima, minima in a very large data set?
