Drawing Inference

Lucy D’Agostino McGowan

Is there a difference in frequency score between circle and square beads?

What if I want to know the relationship between bead shape and the frequency score?


How can we quantify how much we’d expect the difference in means to differ from one random sample to another?

Is there a difference in frequency score between circle and square beads?

  • We need a measure of uncertainty
  • How about the standard error of the difference in means?
  • The standard error is how much we expect the mean difference to vary from one random sample to another (see the sketch below).
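
A minimal sketch of this calculation, assuming sample is the bead data frame used in the code below (note that lm() uses a pooled residual variance, so its reported standard error can differ slightly from this unpooled version):

circle <- sample$frequency_score[sample$group == "circle"]
square <- sample$frequency_score[sample$group == "square"]
# SE of the difference in means: sqrt(s_c^2 / n_c + s_s^2 / n_s)
sqrt(var(circle) / length(circle) + var(square) / length(square))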

Difference in means

\[ \Large \bar{y}_{circle} - \bar{y}_{square} \]
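
We can compute this difference directly in R; a sketch, assuming the sample data frame used below holds frequency_score and group:

# difference in sample means, circle minus square
with(sample, mean(frequency_score[group == "circle"]) -
  mean(frequency_score[group == "square"]))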

Difference in means

\[ \Large y = \beta_0 + \beta_1(group == square) + \varepsilon \]

Difference in means

\[ \Large \hat{y} = \hat\beta_0 + \hat\beta_1(group == square) \]

lm(frequency_score ~ group, data = sample)

Call:
lm(formula = frequency_score ~ group, data = sample)

Coefficients:
(Intercept)  groupsquare  
       39.8         -8.0  
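
These coefficients map directly onto the group means: the intercept (39.8) is the mean for circle beads, and the intercept plus the slope (39.8 - 8.0 = 31.8) is the mean for square beads. A quick check, a sketch assuming the same sample data frame:

fit <- lm(frequency_score ~ group, data = sample)
# fitted values for each group are the group means:
# intercept for circle, intercept + slope for square
predict(fit, newdata = data.frame(group = c("circle", "square")))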

Bead data

How can we quantify how much we’d expect the slope to differ from one random sample to another?

mod <- lm(frequency_score ~ group, data = sample)
summary(mod)

Call:
lm(formula = frequency_score ~ group, data = sample)

Residuals:
   Min     1Q Median     3Q    Max 
-38.80 -22.55 -11.80  30.20  95.20 

Coefficients:
            Estimate Std. Error t value Pr(>|t|)    
(Intercept)   39.800      7.757   5.131 8.82e-06 ***
groupsquare   -8.000     10.970  -0.729     0.47    
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Residual standard error: 34.69 on 38 degrees of freedom
Multiple R-squared:  0.0138,    Adjusted R-squared:  -0.01215 
F-statistic: 0.5318 on 1 and 38 DF,  p-value: 0.4703
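
The Std. Error column answers this question: 10.970 is the estimated variability of the slope from one random sample to another. One informal way to see what that means (not how lm() computes it, which is formula-based) is to refit the model on many bootstrap resamples of the rows and look at the spread of the slopes; a rough sketch, assuming the sample data frame:

set.seed(1)
boot_slopes <- replicate(2000, {
  rows <- sample.int(nrow(sample), replace = TRUE)
  coef(lm(frequency_score ~ group, data = sample[rows, ]))["groupsquare"]
})
sd(boot_slopes)  # should land near the Std. Error reported above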

Bead data

We need a test statistic that incorporates \(\hat{\beta}_1\) and the standard error

mod <- lm(frequency_score ~ group, data = sample)
summary(mod)

Call:
lm(formula = frequency_score ~ group, data = sample)

Residuals:
   Min     1Q Median     3Q    Max 
-38.80 -22.55 -11.80  30.20  95.20 

Coefficients:
            Estimate Std. Error t value Pr(>|t|)    
(Intercept)   39.800      7.757   5.131 8.82e-06 ***
groupsquare   -8.000     10.970  -0.729     0.47    
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Residual standard error: 34.69 on 38 degrees of freedom
Multiple R-squared:  0.0138,    Adjusted R-squared:  -0.01215 
F-statistic: 0.5318 on 1 and 38 DF,  p-value: 0.4703
  • \(t = \frac{\hat{\beta}_1}{SE_{\hat{\beta}_1}}\)
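
The arithmetic is easy to verify from the fitted model: dividing the estimate by its standard error reproduces the t value in the output (-8.000 / 10.970 ≈ -0.729).

est <- coef(summary(mod))["groupsquare", "Estimate"]
se <- coef(summary(mod))["groupsquare", "Std. Error"]
est / se  # reproduces the t value reported by summary(mod)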

Bead data

How do we interpret this?

mod <- lm(frequency_score ~ group, data = sample)
summary(mod)

Call:
lm(formula = frequency_score ~ group, data = sample)

Residuals:
   Min     1Q Median     3Q    Max 
-38.80 -22.55 -11.80  30.20  95.20 

Coefficients:
            Estimate Std. Error t value Pr(>|t|)    
(Intercept)   39.800      7.757   5.131 8.82e-06 ***
groupsquare   -8.000     10.970  -0.729     0.47    
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Residual standard error: 34.69 on 38 degrees of freedom
Multiple R-squared:  0.0138,    Adjusted R-squared:  -0.01215 
F-statistic: 0.5318 on 1 and 38 DF,  p-value: 0.4703
  • “\(\hat{\beta}_1\) is -0.73 standard errors below a slope of zero”

Bead data

How do we know what values of this statistic are worth paying attention to?

mod <- lm(frequency_score ~ group, data = sample)
summary(mod)

Call:
lm(formula = frequency_score ~ group, data = sample)

Residuals:
   Min     1Q Median     3Q    Max 
-38.80 -22.55 -11.80  30.20  95.20 

Coefficients:
            Estimate Std. Error t value Pr(>|t|)    
(Intercept)   39.800      7.757   5.131 8.82e-06 ***
groupsquare   -8.000     10.970  -0.729     0.47    
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Residual standard error: 34.69 on 38 degrees of freedom
Multiple R-squared:  0.0138,    Adjusted R-squared:  -0.01215 
F-statistic: 0.5318 on 1 and 38 DF,  p-value: 0.4703
  • confidence intervals, p-values
  • Hypothesis testing: \(H_0: \beta_1 = 0\) vs. \(H_A: \beta_1 \neq 0\)

Confidence interval for \(\hat\beta_1\)

How do we calculate the confidence interval for the slope?

\[\hat\beta_1\pm t^*SE_{\hat\beta_1}\]

  • \(t^*\) is the critical value for the \(t_{n-2}\) density curve to obtain the desired confidence level
  • Why \(n-2\) now?

Let’s do it in R!

qt(0.975, df = 40 - 2)
[1] 2.024394
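
Plugging this critical value into \(\hat\beta_1 \pm t^* SE_{\hat\beta_1}\), with the (rounded) estimate and standard error from the summary output:

-8.000 + c(-1, 1) * qt(0.975, df = 40 - 2) * 10.970

This should agree with confint() on the next slide up to rounding.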

Bead data

How do we get a confidence interval for \(\hat{\beta}_1\)? What function can we use in R?


confint(mod)
                2.5 %   97.5 %
(Intercept)  24.09636 55.50364
groupsquare -30.20830 14.20830

How do we interpret this value?

Application Exercise

  1. Calculate the difference in means between your two groups
  2. Create a new project from this template in RStudio Pro:
https://github.com/sta-112-s24/appex-06.git
  3. Fill in your data values for the circle and square beads
  4. Fit the model of frequency_score and group in your data
  5. Calculate a confidence interval for the estimate \(\hat\beta_1\)
  6. Interpret this value

Hypothesis testing

  • So far, we have estimated the relationship between frequency score and group
  • This could be useful if we wanted to understand, on average, how these variables are related (estimation)
  • This could also be useful if we wanted to guess a future bead’s frequency score using its group (prediction)
  • What if we just want to know whether there is some relationship between the two? (hypothesis testing)

Hypothesis testing

  • Null hypothesis: There is no relationship between frequency score and group
    • \(H_0: \beta_1 = 0\)
  • Alternative hypothesis: There is a relationship between frequency score and group
    • \(H_A: \beta_1 \neq 0\)

Hypothesis testing

summary(mod)

Call:
lm(formula = frequency_score ~ group, data = sample)

Residuals:
   Min     1Q Median     3Q    Max 
-38.80 -22.55 -11.80  30.20  95.20 

Coefficients:
            Estimate Std. Error t value Pr(>|t|)    
(Intercept)   39.800      7.757   5.131 8.82e-06 ***
groupsquare   -8.000     10.970  -0.729     0.47    
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Residual standard error: 34.69 on 38 degrees of freedom
Multiple R-squared:  0.0138,    Adjusted R-squared:  -0.01215 
F-statistic: 0.5318 on 1 and 38 DF,  p-value: 0.4703

Is \(\hat\beta_1\) different from 0?

Hypothesis testing

summary(mod)

Call:
lm(formula = frequency_score ~ group, data = sample)

Residuals:
   Min     1Q Median     3Q    Max 
-38.80 -22.55 -11.80  30.20  95.20 

Coefficients:
            Estimate Std. Error t value Pr(>|t|)    
(Intercept)   39.800      7.757   5.131 8.82e-06 ***
groupsquare   -8.000     10.970  -0.729     0.47    
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Residual standard error: 34.69 on 38 degrees of freedom
Multiple R-squared:  0.0138,    Adjusted R-squared:  -0.01215 
F-statistic: 0.5318 on 1 and 38 DF,  p-value: 0.4703

Is \(\beta_1\) different from 0? (notice the lack of the hat!)

p-value

The probability of observing a statistic as extreme as or more extreme than the observed test statistic, given that the null hypothesis is true
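
Under \(H_0\), the test statistic follows a \(t_{n-2}\) distribution, so the two-sided p-value doubles the tail area beyond the observed statistic. A quick check with the (rounded) t value from the output:

t_obs <- -0.729  # t value for groupsquare from summary(mod)
2 * pt(-abs(t_obs), df = 40 - 2)  # roughly the 0.47 reported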

p-value

summary(mod)

Call:
lm(formula = frequency_score ~ group, data = sample)

Residuals:
   Min     1Q Median     3Q    Max 
-38.80 -22.55 -11.80  30.20  95.20 

Coefficients:
            Estimate Std. Error t value Pr(>|t|)    
(Intercept)   39.800      7.757   5.131 8.82e-06 ***
groupsquare   -8.000     10.970  -0.729     0.47    
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Residual standard error: 34.69 on 38 degrees of freedom
Multiple R-squared:  0.0138,    Adjusted R-squared:  -0.01215 
F-statistic: 0.5318 on 1 and 38 DF,  p-value: 0.4703

What is the p-value? What is the interpretation?

Hypothesis testing

  • Null hypothesis: \(\beta_1 = 0\) (there is no relationship between frequency score and group)
  • Alternative hypothesis: \(\beta_1 \neq 0\) (there is a relationship between frequency score and group)
  • Often we have an \(\alpha\) level cutoff to compare the p-value to, for example 0.05.
  • If p-value < 0.05, we reject the null hypothesis
  • If p-value > 0.05, we fail to reject the null hypothesis
  • Why don’t we ever “accept” the null hypothesis?
  • Absence of evidence is not evidence of absence

p-value

summary(mod)

Call:
lm(formula = frequency_score ~ group, data = sample)

Residuals:
   Min     1Q Median     3Q    Max 
-38.80 -22.55 -11.80  30.20  95.20 

Coefficients:
            Estimate Std. Error t value Pr(>|t|)    
(Intercept)   39.800      7.757   5.131 8.82e-06 ***
groupsquare   -8.000     10.970  -0.729     0.47    
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Residual standard error: 34.69 on 38 degrees of freedom
Multiple R-squared:  0.0138,    Adjusted R-squared:  -0.01215 
F-statistic: 0.5318 on 1 and 38 DF,  p-value: 0.4703

Do we reject the null hypothesis?

Application Exercise

  1. Open appex-06.qmd
  2. Examine the summary of the model of frequency_score and group with your data
  3. Test the null hypothesis that there is no relationship between frequency score and group
  4. What is the p-value? What is the result of your hypothesis test?
  5. Turn this in on Canvas