What is it about?

We consider a weakly supervised regression problem for large data sets with uncertainties. "Weakly" means that some labels are unknown or uncertain. This is a typical situation due to random noise, poor equipment, expensive measurements in terms of money and time, lack of human resources. To optimise the loss function, we use some advanced tricks, such as manifold regularisation and low-rank techniques. As a result, we speed up all matrix computations and reduce memory requirements. Fast algorithms allow us to process larger datasets faster and to get more accurate results and predictions.

Featured Image

Why is it important?

Nowadays, machine learning (ML) theory and methods are rapidly developing and increasingly used in various fields of science and technology. An urgent problem remains a further improvement of ML methodology: the development of methods that allow obtaining accurate and reliable solutions in a reasonable time in conditions of noise distortions, large data size, and lack of training information. In many applications, only a small part of the data can be labeled, i.e., the values of the predicted feature are not provided for all data objects. In the case of a large amount of data and limited resources for its processing, some data objects can be inaccurately labeled.

Perspectives

We see great potential in typical machine learning applications, such as automatic pre-processing of medical or satellite images. As a real-world example of such a problem, one can address the task of manual annotation of a large number of computed tomography (CT) digital images. In order to distinguish the brain areas affected by stroke, it is required to engage a highly qualified radiologist, and the process is rather time-consuming. It is possible that some parts of the images will stay without specifying specific regions (for example, it is simply indicated that pathological signs are present in the given CT scan) or are labeled inaccurately. In this case, the assumed region can be outlined with a frame; the closer to the center of the frame, the greater the confidence that the brain tissue is damaged.

Dr. Alexander Litvinenko
Rheinisch Westfalische Technische Hochschule Aachen

Read the Original

This page is a summary of: Weakly Supervised Regression Using Manifold Regularization and Low-Rank Matrix Representation, January 2021, Springer Science + Business Media,
DOI: 10.1007/978-3-030-77876-7_30.
You can read the full text:

Read

Resources

Contributors

The following have contributed to this page