我试图了解因子在 R 中的确切工作原理。假设我想使用 R 中的一些样本数据进行回归:
> data(CO2)
> colnames(CO2)
[1] "Plant" "Type" "Treatment" "conc" "uptake"
> levels(CO2$Type)
[1] "Quebec" "Mississippi"
> levels(CO2$Treatment)
[1] "nonchilled" "chilled"
> lm(uptake ~ Type + Treatment, data = CO2)
Call:
lm(formula = uptake ~ Type + Treatment, data = CO2)
Coefficients:
(Intercept) TypeMississippi Treatmentchilled
36.97 -12.66 -6.86
我理解这一点TypeMississippi
并被Treatmentchilled
视为布尔值:对于每一行,初始摄取量为,如果它是密西西比类型并且是否被冷藏36.97
,我们减去它。我无法理解这样的事情:12.66
6.86
> lm(uptake ~ Type * Treatment, data = CO2)
Call:
lm(formula = uptake ~ Type * Treatment, data = CO2)
Coefficients:
(Intercept) TypeMississippi
35.333 -9.381
Treatmentchilled TypeMississippi:Treatmentchilled
-3.581 -6.557
两个因数相乘是什么意思lm
?