Testing the Tests: What Are the Impacts of Incorrect Assumptions When Applying Confidence Intervals or Hypothesis Tests to Compare Competing Forecasts?

Eric Gilleland; Amanda S. Hering; Tressa L. Fowler; Barbara G. Brown

doi:10.1175/mwr-d-17-0295.1

What is it about?

Several common statistical hypothesis tests are applied to simulated series with varying degrees (and types) of temporal dependence and contemporaneous correlation. It is shown that the Hering and Genton (HG) test is the most accurate in terms of type-I errors, and is also a powerful test. The circular block bootstrap approach, while not as accurate as the HG test, is generally reasonably accurate when the series are temporally dependent, and the iid bootstrap when they are independent; like the HG test, both are robust to contemporaneous correlation. Other tests, such as the usual t-test and normal approximation, even when a variance-inflation factor is applied, are not robust to the contemporaneous correlation.

Photo by Nancy Cantwell on Unsplash

Why is it important?

In weather forecast verification, the it is fairly uncommon to account for dependence, especially contemporaneous correlation, when testing whether one forecast model is better than another on average. Sometimes the variance-inflation factor (VIF) is used, but in this case, usually only the AR(1) version is used, and it can be very inaccurate if the series does not follow an AR(1) process.

This page is a summary of: Testing the Tests: What Are the Impacts of Incorrect Assumptions When Applying Confidence Intervals or Hypothesis Tests to Compare Competing Forecasts?, Monthly Weather Review, June 2018, American Meteorological Society,
DOI: 10.1175/mwr-d-17-0295.1.
You can read the full text:

Read

Contributors

The following have contributed to this page

Eric Gilleland
National Center for Atmospheric Research

Testing statistical hypothesis methods under difficult conditions

What is it about?

Why is it important?

Contributors

You might also like

Discover more

Medical Research

Life Sciences

Physical Sciences

Technology and Engineering

Environmental Research

Arts and Humanities

Social Sciences

Business and Management

Testing statistical hypothesis methods under difficult conditions

What is it about?

Featured Image

Why is it important?

Read the Original

Contributors

Share this page:

You might also like

Mitigating the Desynchronization Attack in Multiserver Environment

Constrained exact boundary controllability of a semilinear model for pipeline gas flow

Metrics for Assessing Overall Performance of Inland Waterway Ports: A Bayesian Network Based Approach

Discover more

Medical Research

Life Sciences

Physical Sciences

Technology and Engineering

Environmental Research

Arts and Humanities

Social Sciences

Business and Management