Relationship between McNemar's test and conditional logistic regression

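I am interested in modeling binary response data from paired observations. We aim to make inference about the effectiveness of a pre-post intervention in a group, potentially adjusting for several covariates, and to determine whether there is effect modification by a subgroup that received notably different training as part of the intervention.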

Given data of the following form:

id phase resp
1  pre   1
1  post  0
2  pre   0
2  post  0
3  pre   1
3  post  0

and the $2 \times 2$ contingency table of paired responses:

$\begin{array}{cc|cc} & & \mbox{Pre} & \\ & & \mbox{Correct} & \mbox{Incorrect} \\ \hline \mbox{Post} & \mbox{Correct} & a & b \\ & \mbox{Incorrect} & c & d \end{array}$
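As a minimal sketch of how the long-format rows could be turned into the paired table, the snippet below reshapes the three example subjects to one row per `id` and cross-tabulates post against pre. The object names (`dat`, `wide`, `paired`) are illustrative, and it assumes `resp = 1` means "Correct".

```r
# Toy long-format data, copied from the example rows above
dat <- data.frame(
  id    = c(1, 1, 2, 2, 3, 3),
  phase = c("pre", "post", "pre", "post", "pre", "post"),
  resp  = c(1, 0, 0, 0, 1, 0)
)

# One row per subject, with columns resp.pre and resp.post
wide <- reshape(dat, idvar = "id", timevar = "phase", direction = "wide")

# Fix the factor levels so all four cells appear even when a cell is empty
# (assumption: resp = 1 means "Correct", resp = 0 means "Incorrect")
lev <- c(1, 0)
lab <- c("Correct", "Incorrect")
paired <- table(
  Post = factor(wide$resp.post, levels = lev, labels = lab),
  Pre  = factor(wide$resp.pre,  levels = lev, labels = lab)
)
print(paired)
```

In this layout the discordant cells are `paired["Correct", "Incorrect"]` (that is, $b$) and `paired["Incorrect", "Correct"]` (that is, $c$).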

We are interested in testing the hypothesis $\mathcal{H}_0: \theta_c = 1$.

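McNemar's test gives $Q = \frac{(b-c)^2}{b+c} \sim \chi^2_1$ under $\mathcal{H}_0$ (asymptotically). This is intuitive because, under the null, we would expect equal proportions of the discordant pairs ($b$ and $c$) to favor a positive effect ($b$) or a negative effect ($c$). Define the probability that a discordant pair is positive as $p = \frac{b}{b+c}$, with $n = b+c$ discordant pairs. The odds of observing a positive discordant pair are then $\frac{p}{1-p} = \frac{b}{c}$.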
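The arithmetic can be checked with a short sketch. The cell counts below are hypothetical, chosen only for illustration, and the variable names are mine. The hand-computed $Q$ is compared with `stats::mcnemar.test` called with `correct = FALSE`, since the formula above has no continuity correction.

```r
# Hypothetical cell counts, chosen only for illustration (not from the post)
n_a <- 30; n_b <- 25; n_c <- 12; n_d <- 33

# Rows are Post, columns are Pre; matrix() fills column by column
tab <- matrix(c(n_a, n_c, n_b, n_d), nrow = 2,
              dimnames = list(Post = c("Correct", "Incorrect"),
                              Pre  = c("Correct", "Incorrect")))

# McNemar statistic computed directly from the discordant cells
Q <- (n_b - n_c)^2 / (n_b + n_c)
p_manual <- pchisq(Q, df = 1, lower.tail = FALSE)

# Same test via stats::mcnemar.test, without the continuity correction
mt <- mcnemar.test(tab, correct = FALSE)

# Proportion and odds of a positive discordant pair
p_pos <- n_b / (n_b + n_c)
odds_pos <- p_pos / (1 - p_pos)

print(c(Q = Q, p_manual = p_manual, p_mcnemar = mt$p.value, odds = odds_pos))
```

Only the off-diagonal cells enter the statistic, so changing `n_a` or `n_d` leaves `Q` unchanged.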

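Conditional logistic regression, on the other hand, takes a different approach to testing the same hypothesis, by maximizing the conditional likelihood: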

$\mathcal{L}(X ; \beta) = \prod_{j=1}^n \frac{\exp(\beta X_{j,2})}{\exp(\beta X_{j,1}) + \exp(\beta X_{j,2})}$

where $\exp(\beta) = \theta_c$.
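To see how this likelihood relates to the table, here is a sketch under one assumption the post does not spell out: the product runs over the $n = b + c$ discordant pairs, $X_{j,2}$ is the phase indicator (0 = pre, 1 = post) of the observation with the positive response in pair $j$, and $X_{j,1}$ is that of the other observation. Each of the $b$ pairs then contributes $e^{\beta}/(1+e^{\beta})$ and each of the $c$ pairs contributes $1/(1+e^{\beta})$, so that

$$
\begin{aligned}
\mathcal{L}(\beta) &= \left(\frac{e^{\beta}}{1+e^{\beta}}\right)^{b} \left(\frac{1}{1+e^{\beta}}\right)^{c}, \\
\ell(\beta) &= b\,\beta - (b+c)\log\left(1+e^{\beta}\right), \\
U(\beta) &= \frac{\partial \ell}{\partial \beta} = b - (b+c)\,\frac{e^{\beta}}{1+e^{\beta}}, \qquad U(0) = \frac{b-c}{2}, \\
I(\beta) &= -\frac{\partial^2 \ell}{\partial \beta^2} = (b+c)\,\frac{e^{\beta}}{\left(1+e^{\beta}\right)^{2}}, \qquad I(0) = \frac{b+c}{4}, \\
\frac{U(0)^2}{I(0)} &= \frac{(b-c)^2}{b+c} = Q .
\end{aligned}
$$

Under this assumption, setting $U(\hat\beta) = 0$ gives $\exp(\hat\beta) = \frac{b}{c}$, the odds of a positive discordant pair, and the score test of $\beta = 0$ reproduces $Q$. The Wald and likelihood-ratio tests built from the same likelihood agree with it only asymptotically.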

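So, what is the relationship between these tests? How can one do a simple test of the contingency table presented earlier? Looking at the calibration of p-values from the clogit and McNemar approaches under the null, you would think they were completely unrelated!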

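library(survival)
n <- 100  # number of subjects (pairs) per simulated data set
do.one <- function(n) {  # simulate one null data set, return both p-values
  id <- rep(1:n, each=2)     # subject identifier, two rows per subject
  ph <- rep(0:1, times=n)    # phase indicator: 0 = pre, 1 = post
  rs <- rbinom(n*2, 1, 0.5)  # responses, independent of phase and subject
  c(
    # Wald p-value for ph: 5th entry of the 1 x 5 coefficient table
    'pclogit' = coef(summary(clogit(rs ~ ph + strata(id))))[5],
    # McNemar p-value from the phase-by-response cross-tabulation
    'pmctest' = mcnemar.test(table(ph,rs))$p.value
  )
}

out <- replicate(1000, do.one(n))  # 2 x 1000 matrix, one column per replicate
# Pass x and y explicitly so the axis labels match the plotted quantities
plot(out['pmctest', ], out['pclogit', ],
  main='Calibration plot of pvalues for McNemar and Clogit tests',
  xlab='p-value McNemar', ylab='p-value conditional logistic regression')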

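[Figure: scatter plot of the simulated p-values, McNemar on the x-axis and conditional logistic regression on the y-axis]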