r - 重要な二次項 - 線形回帰 - R

Question

次のデータが与えられます。

ここに画像の説明を入力

最初に二次モデルに適合するように言われました。

ここに画像の説明を入力

 > time = c(10,20,15,11,11,19,11,13,17,18,16,16,17,18,10)
 > experience = c(24,1,10,15,17,3,20,9,3,1,7,9,7,5,20)
 > fit = lm (time ~ experience + I(experience^2))
> summary(fit)

Call:
lm(formula = y ~ x + I(x^2))

Residuals:
    Min      1Q  Median      3Q     Max 
-1.8287 -0.8300  0.5054  0.7476  1.1713 

Coefficients:
             Estimate Std. Error t value Pr(>|t|)    
(Intercept) 20.091108   0.724705  27.723    3e-12 ***
x           -0.670522   0.154706  -4.334 0.000972 ***
I(x^2)       0.009535   0.006326   1.507 0.157605    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 1.091 on 12 degrees of freedom
Multiple R-squared:  0.9162,    Adjusted R-squared:  0.9022 
F-statistic: 65.59 on 2 and 12 DF,  p-value: 3.465e-07

すべてがうまくいくようです。

私のモデルは

y = 20.091-.671x+.0095x^2

それをプロットする：

> x = seq(0,25, by = .1)
> y = fit$coefficient[1]+fit$coefficient[2]*x+fit$coefficient[3]*x^2
> lines(x,y)

ここに画像の説明を入力

繰り返しますが、すべて問題ないようです。

しかし、二次項が a = .1 の有意水準で有意であるかどうかをテストするように言われました。

私もです

> fit1 = lm (time ~ experience + I(experience^2))
> fit2 = lm(time~experience)
> anova(fit2, fit1)

Analysis of Variance Table

Model 1: time ~ experience
Model 2: time ~ experience + I(experience^2)
  Res.Df    RSS Df Sum of Sq      F Pr(>F)
1     13 16.984                           
2     12 14.280  1    2.7037 2.2719 0.1576

したがって、二次項の F 値は 2.27 です。.1576 の確率に相当します。.1576 > .1 したがって、二次項は a = .1 で有意です。

しかし、私の教授は、二次項がモデルにとって重要ではないことを確認する必要があることを示しました。ここで何が間違っていますか？

score 1 · Accepted Answer

あなたが失敗しているのは、直交多項式の項を構築することです。R の poly() 関数は、その目的のために設計されています。

 time = c(10,20,15,11,11,19,11,13,17,18,16,16,17,18,10)
 experience = c(24,1,10,15,17,3,20,9,3,1,7,9,7,5,20)
 fit = lm (time ~ poly(experience, degree=2))
 summary(fit)
#--------------
Call:
lm(formula = time ~ poly(experience, degree = 2))

Residuals:
    Min      1Q  Median      3Q     Max 
-1.8287 -0.8300  0.5054  0.7476  1.1713 

Coefficients:
                              Estimate Std. Error t value Pr(>|t|)    
(Intercept)                    14.8000     0.2817  52.544 1.48e-15 ***
poly(experience, degree = 2)1 -12.3861     1.0909 -11.354 8.94e-08 ***
poly(experience, degree = 2)2   1.6443     1.0909   1.507    0.158    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 1.091 on 12 degrees of freedom
Multiple R-squared:  0.9162,    Adjusted R-squared:  0.9022 
F-statistic: 65.59 on 2 and 12 DF,  p-value: 3.465e-07

F 統計は二次項に固有のものではありませんが、null モデルと 2 項モデルを実際に比較しています。

r - 重要な二次項 - 線形回帰 - R

1 に答える 1

Related

Reference