Replicability Analysis for Natural Language Processing: Testing                     Significance with Multiple Datasets

Rotem Dror; Gili Baumer; Marina Bogomolov; Roi Reichart

doi:10.1162/tacl_a_00074

What is it about?

It has become standard to evaluate Natural Language Processing (NLP) algorithms on multiple datasets in order to ensure a consistent performance across varied setups. When doing so, one has to change the statistical analysis of the results to consider all tested hypotheses (one for each dataset). In this paper we explain how to perform such an analysis with a special consideration to NLP applications.

Why is it important?

It is crucial to perform a correct and valid statistical analysis in an empirical research area such as NLP. This paper goal is to ensure the researchers perform the statistical analysis in a valid way.

This page is a summary of: Replicability Analysis for Natural Language Processing: Testing Significance with Multiple Datasets, December 2017, The MIT Press,
DOI: 10.1162/tacl_a_00074.
You can read the full text:

Read

Resources

Video
Replicability Analysis for Natural Language Processing - ACL 2018
Replicability Analysis for Natural Language Processing: Testing Significance with Multiple Datasets - A presentation of the paper in ACL 2018 conference in Melbourne.

Contributors

The following have contributed to this page

Testing Statistical Significance when Testing with Multiple Datasets in Natural Language Processing

What is it about?

Why is it important?

Resources

Replicability Analysis for Natural Language Processing - ACL 2018

Contributors

Discover more

Medical Research

Life Sciences

Physical Sciences

Technology and Engineering

Environmental Research

Arts and Humanities

Social Sciences

Business and Management

Testing Statistical Significance when Testing with Multiple Datasets in Natural Language Processing

What is it about?

Featured Image

Why is it important?

Read the Original

Resources

Replicability Analysis for Natural Language Processing - ACL 2018

Contributors

Share this page:

Discover more

Medical Research

Life Sciences

Physical Sciences

Technology and Engineering

Environmental Research

Arts and Humanities

Social Sciences

Business and Management