Smart combination of web measures for solving semantic similarity problems

Jorge Martinez‐Gil; José F. Aldana‐Montes

doi:10.1108/14684521211276000

What is it about?

Semantic similarity measures play an important role in many computing fields. Applications such as data integration, query expansion, tag refactoring, and text clustering have all relied on them. Despite their usefulness in these applications, measuring the similarity between two text expressions remains a key challenge, which this paper aims to address.

Why is it important?

Experimental results on two benchmark datasets, Miller and Charles and Gracia and Mena, show that the proposed approach outperforms classic probabilistic web-based algorithms by a wide margin.

Perspectives

This paper presents two main contributions. The authors propose a novel technique that outperforms classic probabilistic techniques for measuring semantic similarity between terms. Rather than relying on a single search engine to compute web page counts, the technique uses a smart combination of several popular web search engines. The approach is evaluated on two benchmark datasets, Miller and Charles and Gracia and Mena, and compared with existing probabilistic web extraction techniques.
Dr Jorge Martinez-Gil
Software Competence Center Hagenberg GmbH
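
To make the idea of combining page counts from several search engines concrete, here is a minimal sketch. It is not the authors' method: the page-count Jaccard coefficient, the weighted average, the engine names, the weights, and the hit counts are all assumptions chosen for illustration. This summary does not say which per-engine measure the paper uses or how the combination is tuned.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PageCounts:
    """Hit counts one search engine reports for a pair of terms (hypothetical values)."""
    hits_a: int     # pages containing term a
    hits_b: int     # pages containing term b
    hits_both: int  # pages containing both terms


def web_jaccard(counts: PageCounts) -> float:
    """Page-count Jaccard coefficient in [0, 1]; returns 0.0 when nothing matches."""
    union = counts.hits_a + counts.hits_b - counts.hits_both
    if union <= 0:
        return 0.0
    return counts.hits_both / union


def combined_similarity(counts_by_engine: dict[str, PageCounts],
                        weights: dict[str, float]) -> float:
    """Weighted average of the per-engine scores (assumed combination rule)."""
    total_weight = sum(weights[engine] for engine in counts_by_engine)
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    weighted_sum = sum(weights[engine] * web_jaccard(counts)
                       for engine, counts in counts_by_engine.items())
    return weighted_sum / total_weight


if __name__ == "__main__":
    # Hypothetical hit counts for one term pair; no real engine was queried.
    counts = {
        "engine_a": PageCounts(hits_a=1000, hits_b=500, hits_both=300),
        "engine_b": PageCounts(hits_a=800, hits_b=400, hits_both=200),
    }
    weights = {"engine_a": 0.5, "engine_b": 0.5}  # illustrative, not learned
    print(f"{combined_similarity(counts, weights):.3f}")  # prints 0.225
```

With equal weights the result is simply the mean of the per-engine scores. Shifting weight toward one engine moves the combined score toward that engine's estimate, which is the property a tuned combination would exploit.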

This page is a summary of: Smart combination of web measures for solving semantic similarity problems, Online Review, September 2012, Emerald, DOI: 10.1108/14684521211276000.
Resources

Related Content
Smart Combination of Web Measures for Solving Semantic Similarity Problems
Smart Combination of Web Measures for Solving Semantic Similarity Problems - Preprint
