Testing the Difference of Correlated Agreement Coefficients for Statistical Significance

Kilem L. Gwet

doi:10.1177/0013164415596420

What is it about?

Researchers often compute Cohen's kappa coefficient for example on two occasions using the same subjects or two overlapping groups of subjects. The first kappa may be calculated before the raters receive any training, while the second aims at quantifying the extent of agreement among raters after they have received a formal training. The fundamental research question is to determine whether the observed difference between the 2 coefficients is statistically significant. This article shows the blueprint for executing this task.

Why is it important?

Before the publication of this article, there was no known formal procedure with broad applicability that could be used for testing two agreement coefficients for statistical significance. Our procedure handles every structure of dependency that may exist in the design of the inter-rater reliability experiment.

Perspectives

I wrote this article primarily to address the multiple requests I received from researchers across the world. My solution is based on a very simple principle that Jordan Ellenberg explained in a very eloquent way "If the universe hands you a hard problem, try to solve an easier one instead, and hope the simple version is close enough to the original problem that the universe doesn't object." The difficulty of testing 2 agreement coefficients for statistical significance stems from the non-linear form of most agreement coefficients. So, I decided to use the linear approximations of these agreement coefficients, which are generally valid for a large number of subjects. This became an easier problem. My simulations proved that the solution works well even when the number of subjects is limited.
KILEM GWET

This page is a summary of: Testing the Difference of Correlated Agreement Coefficients for Statistical Significance, Educational and Psychological Measurement, July 2015, SAGE Publications,
DOI: 10.1177/0013164415596420.
You can read the full text:

Read

Contributors

The following have contributed to this page

KILEM GWET

Two dependent agreement coefficients can be tested for statistical significance

What is it about?

Why is it important?

Perspectives

Contributors

You might also like

Discover more

Medical Research

Life Sciences

Physical Sciences

Technology and Engineering

Environmental Research

Arts and Humanities

Social Sciences

Business and Management

Two dependent agreement coefficients can be tested for statistical significance

What is it about?

Featured Image

Why is it important?

Perspectives

Read the Original

Contributors

Share this page:

You might also like

Predicting Recidivism in a High-Risk Sample of Intimate Partner Violent Men Referred for Police Threat Assessment

Systemic social and emotional learning: Promoting educational success for all preschool to high school students.

The effect of the Internet on decision-making during pregnancy: a systematic review

Discover more

Medical Research

Life Sciences

Physical Sciences

Technology and Engineering

Environmental Research

Arts and Humanities

Social Sciences

Business and Management