Abstract
Efron’s 2-group model is widely used in large-scale multiple testing. This model assumes that test statistics are drawn independently from a mixture of a null and a non-null distribution. The marginal local false discovery rate (locFDR) is the probability that the hypothesis is null given its test statistic. The procedure that rejects null hypotheses with marginal locFDRs below a fixed threshold maximizes power (the expected number of non-nulls rejected) while controlling the marginal false discovery rate in this model. However, in realistic settings the test statistics are dependent, and taking the dependence into account can boost power. Unfortunately, the resulting calculations are typically exponential in the number of hypotheses, which is impractical. Instead, we propose using locFDRN, which is the probability that the hypothesis is null given the test statistics in its N-neighborhood. We prove that rejecting for small locFDRN is optimal in the restricted class where the decision for each hypothesis is only guided by its N-neighborhood, and that power increases with N. The computational complexity of computing the locFDRNs increases with N, so the analyst should choose the largest N-neighborhood that is still computationally feasible. We show through extensive simulations that our proposed procedure can be substantially more powerful than alternative practical approaches, even with small N-neighborhoods. We demonstrate the utility of our method in a genome-wide association study of height.
Original language | English |
---|---|
Article number | ujaf035 |
Journal | Biometrics |
Volume | 81 |
Issue number | 2 |
DOIs | |
State | Published - 1 Jun 2025 |
Keywords
- dependent test statistics
- false discovery rate
- genome-wide association studies
- large scale inference
- multiple testing
All Science Journal Classification (ASJC) codes
- Statistics and Probability
- General Biochemistry,Genetics and Molecular Biology
- General Immunology and Microbiology
- General Agricultural and Biological Sciences
- Applied Mathematics