What is it about?

A local false discovery rate, like any other non-subjective probability, depends on a reference class. The probability that a car weighs over a ton depends on whether the reference class is all cars, all American cars, all sports cars, or all American sports cars. In the same way, the probability that a SNP with certain GERP and polyphen2 scores depends on whether the reference class is all SNPs, all SNPs with similar polyphen2 scores, etc. This type of information is used to obtain more SNP-specific proportions of associated SNPs since the subclass affects the probability of association.

Featured Image

Why is it important?

In empirical Bayes multiple testing, a prior distribution is estimated by considering each gene or other feature corresponding to a null hypothesis to be randomly selected from some reference class of other features. That leads to the problem of selecting a reference class for the application of empirical Bayes methods since each feature belongs to a large number of reference classes.

Perspectives

For a simple introduction to the reference class problem, see chapter 5 of D. R. Bickel (2019) Genomics Data Analysis: False Discovery Rates and Empirical Bayes Methods, Chapman and Hall/CRC, New York. https://davidbickel.com/genomics/

David R. Bickel
University of North Carolina at Greensboro

Read the Original

This page is a summary of: Incorporating prior knowledge about genetic variants into the analysis of genetic association data: An empirical Bayes approach, IEEE/ACM Transactions on Computational Biology and Bioinformatics, January 2019, Institute of Electrical & Electronics Engineers (IEEE),
DOI: 10.1109/tcbb.2018.2865420.
You can read the full text:

Read

Resources

Contributors

The following have contributed to this page