机器算法验证 - 大数强与弱定律背后的直觉（使用 R 模拟） - 吾爱随笔录

大数强与弱定律背后的直觉（使用 R 模拟）

机器算法验证 r 直觉大数定律

2022-04-01 16:38:33

这看起来相当基本，但是当提到大数的弱和强定律时，这是我看到的定义（卡塞拉和伯杰）

$方程$

您能否给出一个“直觉”来理解它们之间的区别。

另外，概率内的限制对于强定律意味着什么？

你能给我一个R中的模拟来表示它们之间的区别吗？

1个回答

将弱定律表述为强定律为

{\bar{Y}}_{n} \overset{P}{\to} μ when n \to \infty, i.e. \forall ε > 0 : lim_{n \to \infty} Pr (| {\bar{Y}}_{n} - μ | < ε) = 1

$\overline{Y}_n\ \xrightarrow{P}\ \mu \,\textrm{ when }\ n \to \infty , \text{ i.e. } \forall \varepsilon \gt 0: \lim_{n\to\infty}\Pr\!\left(\,|\overline{Y}_n-\mu| \lt \varepsilon\,\right) = 1$

{\bar{Y}}_{n} \overset{a . s .}{\to} μ when n \to \infty, i.e. Pr (lim_{n \to \infty} {\bar{Y}}_{n} = μ) = 1

$\overline{Y}_n\ \xrightarrow{a.s.}\ \mu \,\textrm{ when }\ n \to \infty , \text{ i.e. } \Pr\!\left( \lim_{n\to\infty}\overline{Y}_n = \mu \right) = 1$

您可能会认为弱定律是说当样本量大时样本平均值通常接近均值，而强定律是说随着样本量的增加，样本平均值几乎肯定会收敛到均值。

当样本平均值接近平均值的失败大到足以阻止收敛时，就会发生差异。

作为使用 R 的说明，以维基百科的第一个示例为例，其中是指数分布的随机变量，参数为且所以。让我们考虑案例： $X$ $1$ $Y= \dfrac{\sin(x) e^x}{x}$ $E[Y]=\frac{\pi}{2}$ $100$ $10000$

set.seed(1)
cases <- 100
samplesize <- 10000
Xmat <- matrix(rexp(samplesize*cases, rate=1), ncol=samplesize)
Ymat <- sin(Xmat) * exp(Xmat) / Xmat
plot(samplemeans <- rowMeans(Ymat), 
    main="most sample averages close to expectation")
abline(h=pi/2, col="red")

但现在看看运行样本平均值在相同的万次观察中未能达到平均值并保持在那里 $1$

plot(cumsum(Ymat)/(1:(samplesize*cases)),
    main="running sample average not always converging to expectation")
abline(h=pi/2, col="red")

其它你可能感兴趣的问题

上一篇在 R 中具有时变协变量的 Weibull 生存模型下一篇主成分分析是参数方法吗？