What is it about?

The most efficient ER method is blocking, inherently uses exponential pair-wise comparisons for the large databases, leading to poor efficiency in resolving the entities. The real world data can either be homogeneous or heterogeneous, generally of two forms, clean-clean ER which does not have any duplicates or dirty-ER which have duplicates within the dataset.

Featured Image

Why is it important?

Entity Resolution (ER) is the method of resolving two similar entities used in the process of data cleaning and data integration. However, the existing ER Framework lead to exhaustive pairwise comparisons.

Read the Original

This page is a summary of: Entity resolution framework using rough set blocking for heterogeneous web of data, Journal of Intelligent & Fuzzy Systems, January 2018, IOS Press,
DOI: 10.3233/jifs-17946.
You can read the full text:

Read

Contributors

The following have contributed to this page