如何计算 R 中的引导标准误差？

在本文中，我们将研究使用各种包及其在 R 编程语言中的功能来计算引导标准误差的不同方法。

引导标准错误：

bootstrap 样本的标准差（也称为 bootstrap 标准误差）是对均值抽样分布的标准差的估计。

计算给定数据的引导标准误差的步骤：

从给定的数据集中取 k 个重复样本并进行替换。
对于每个样本，计算标准误差：s/√n。
这导致标准误差的 k 个不同估计值。要找到自举标准误，请取 k 个标准误的平均值。

方法一：使用引导包中的 boot()函数

在该方法中，用户首先需要在工作的 R 控制台中安装并导入引导包，然后用户需要调用 boot()函数并将所需的数据作为其参数传入，该函数将进一步返回 Bootstrap 标准R 编程语言中给定数据的错误。

安装和导入引导包的语法：

install.packages('boot')
library('boot')

Boot()函数：该函数为 boot 包中的 boot函数提供了一个简单的前端，该函数针对基于回归模型的引导进行了定制。

Syntax: Boot(object, f=coef, labels=names(f(object)),R=999, …)

Parameters:

object: An object of the class.
f: A function whose one argument is the name of a regression object that will be applied to the updated regression object to compute the statistics of interest.
labels: Provides labels for the statistics computed by f.
R: Number of bootstrap samples.

编程需要懂一点英语

例子：

在此示例中，我们将使用 boot 包中的 boot()函数来计算包含 10 个元素的向量的引导标准误差，并在 R 编程语言中使用 100 个引导样本。

R

library(boot)
  
# Create data
gfg <- c(244,753,596,645,874,141,639,465,999,654)
  
# Calculating the mean 
m <- function(gfg,i){mean(gfg[i])}
  
# Calculate standard error using 100
# bootstrapped samples
boot(gfg, m, 100)

R

# Create data
gfg <- c(244,753,596,645,874,141,639,465,999,654)
  
# Calculating the bootstrap standard error 
mean(replicate(100, sd(sample(
  gfg, replace=T))/sqrt(length(gfg))))

输出：

ORDINARY NONPARAMETRIC BOOTSTRAP


Call:
boot(data = gfg, statistic = m, R = 100)


Bootstrap Statistics :
    original  bias    std. error
t1*      601  -3.231    80.02706

方法 2：使用公式计算 Bootstrap 标准误差

在这种计算引导标准误差的方法中，用户需要使用直接公式来获得相同的结果，而无需使用 R 编程语言中的任何包。

例子：

在此示例中，我们将使用直接公式来获取包含 R 编程语言中 10 个元素的向量的引导标准误差。

R

# Create data
gfg <- c(244,753,596,645,874,141,639,465,999,654)
  
# Calculating the bootstrap standard error 
mean(replicate(100, sd(sample(
  gfg, replace=T))/sqrt(length(gfg))))

输出：

[1] 78.53055