Chapter 2 Introduction to Correlated Data

Chapter 2 Learning Outcomes

Identify the response variable and fixed and random effects associated with a given set of data.
Interpret parameter estimates, confidence intervals, and hypothesis tests associated with fixed effects in LME models.
Interpret variance component estimates (\(\sigma\) and \(\sigma_u\)) in linear mixed effects (LME) models.
Determine how standard errors in a LME model would compare to the standard errors we would get if we used a LLSR model on the same data.
Determine how changes in sample size and/or experimental design will impact standard errors, confidence intervals, and p-values in a LME model.
Compare the size of different variance component estimates given a LME model and a small set of data (i.e. say whether \(\sigma_u\), or \(\sigma\) will larger).

library(tidyverse)
library(lme4)
library(lmerTest)
library(knitr)

2.1 Introduction to Correlated Data

2.1.1 Weight Gain in Mice: Experiment Design #1

Consider an experiment designed to assess the impact of three different diets on weight gain in mice. We observe six different litters of mice, with six mice in each litter. Within each litter, two mice are randomly assigned to each of the three diets. Researchers recorded the mean weight gain in each mouse over a four-week period.

Experimental Design 1:

Figure 2.1: Mouse Experiment Version 1

2.1.2 Mice Experiment 1 Data

The first 10 rows of the dataset look like this:

head(mice,10)

##    litter diet weight_gain
## 1       1    A -0.03526041
## 2       1    B -0.06841666
## 3       1    C -0.06351305
## 4       1    A -0.05171452
## 5       1    B -0.06917391
## 6       1    C -0.05951775
## 7       2    A  0.07584083
## 8       2    B  0.04913527
## 9       2    C  0.07958384
## 10      2    A  0.07278071

2.1.3 A Naive Graphical Analysis

Let’s temporarily ignore the fact that some mice came from the same litter, and treat all observations as independent.

The plot below shows the weight gain (or loss) for each of the mice, by diet. The red dots and connecting lines show the mean weight gain for each diet.

The ensuing table shows the mean and standard deviation in weight gain for each of the three diets.

ggplot(data=mice, aes(x=factor(diet), y=weight_gain))  + geom_point()  + 
  stat_summary(fun="mean", geom="line", aes(group=factor(1)))  +
  stat_summary(fun="mean", geom="point", color="red", size=2)

Question:

Based on this graph, do you think there is evidence of diets have an effect on weight gain in mice? Why or why not?

2.1.4 A More Informed Graphical Analysis

So far, we’ve ignored the fact that some of the mice came from the same litter. Now, let’s take litter into account. The figure below colors the mice by litter.

The lines represent the average weight gain (or loss) in each litter.

ggplot(data=mice, aes(x=factor(diet), y=weight_gain, color=factor(litter))) + 
  geom_point() +  stat_summary(fun="mean", geom="line", aes(group=factor(litter)))

It may be helpful to examine each litter individually.

ggplot(data=mice, aes(x=factor(diet), y=weight_gain, color=factor(litter))) + 
  geom_point()  + 
  facet_grid(.~litter, scales = "free") + 
  stat_summary(fun="mean", geom="line", aes(group=factor(litter)))

Question:

Based on the information about litters, do your thoughts about whether diets have an effect on weight gain in mice change? Does there appear to be stronger evidence of differences between groups? Weaker? Same?

2.1.5 Table of Mean Weight By Diet

mouse_groups <- mice %>% group_by(diet)%>% 
  summarize(Mean_Weight=mean(weight_gain), SD_Weight = sd(weight_gain), N=n())
mouse_groups

## # A tibble: 3 × 4
##   diet  Mean_Weight SD_Weight     N
##   <chr>       <dbl>     <dbl> <int>
## 1 A          0.0580    0.0567    12
## 2 B          0.0340    0.0583    12
## 3 C          0.0471    0.0612    12

2.1.6 Table of Mean Weight By Diet and Litter

mouse_groups <- mice %>% group_by(diet, litter)%>% 
  summarize(Mean_Weight=mean(weight_gain), SD_Weight = sd(weight_gain), N=n())
mouse_groups

## # A tibble: 18 × 5
## # Groups:   diet [3]
##    diet  litter Mean_Weight SD_Weight     N
##    <chr>  <int>       <dbl>     <dbl> <int>
##  1 A          1   -0.0435    0.0116       2
##  2 A          2    0.0743    0.00216      2
##  3 A          3    0.0192    0.00282      2
##  4 A          4    0.104     0.00487      2
##  5 A          5    0.110     0.00145      2
##  6 A          6    0.0833    0.00721      2
##  7 B          1   -0.0688    0.000535     2
##  8 B          2    0.0554    0.00881      2
##  9 B          3   -0.0108    0.00781      2
## 10 B          4    0.0731    0.00482      2
## 11 B          5    0.0873    0.00890      2
## 12 B          6    0.0682    0.00430      2
## 13 C          1   -0.0615    0.00283      2
## 14 C          2    0.0746    0.00711      2
## 15 C          3    0.000839  0.00489      2
## 16 C          4    0.0918    0.00638      2
## 17 C          5    0.0979    0.0214       2
## 18 C          6    0.0792    0.00310      2

Since each mouse in a litter received a different treatment, we can use the standard deviations between mice in the same litter to assess the amount of unexplained variability after accounting for litter and diet.

Notice standard deviations are much smaller when we account for litter. Most of the unexplained variability in the first table is explained when we account for litter.

When we fail to account for litter, variability thqt can be explained by differences between litters becomes conflated with unexplained variability.

This causes us to overestimate unexplained variability and makes differences between groups look less meaningful than they really are.

2.1.7 Assessing Evidence of Differences

Conceptually, we can assess whether there is evidence of differences between the diets by considering differences in mean weights, relative to the amount of unexplained variability between mice in the same litter and on the same diet.

Recall that when testing for a difference between two groups, a t-statistic is calculated using the formula

\[ t= \frac{\bar{x}_1-\bar{x}_2}{\textrm{SE}(\bar{x}_1-\bar{x}_2)}=\frac{\bar{x}_1-\bar{x}_2}{\sqrt{\frac{s_1^2}{n_1}+\frac{s_2^2}{n_2}}} \]

The numerator measures the size of the differences between the groups, and the denominator measures the amount of unexplained variability between individuals in the same group.

2.1.8 An Improper Statistical Analysis

We’ve seen graphically, and in table form, how accounting for litter impacts our ability to discern differences between diets. Now, let’s look at how this happens in a statistical model.

If we ignore the fact that some mice are from the same litter, and wrongly treat them as independent, we might use an ordinary linear least squares regression model.

M_LLSR <- lm(data=mice, weight_gain~factor(diet))
summary(M_LLSR)$coefficients

##                  Estimate Std. Error    t value    Pr(>|t|)
## (Intercept)    0.05798494 0.01696450  3.4180166 0.001693572
## factor(diet)B -0.02394185 0.02399142 -0.9979334 0.325573272
## factor(diet)C -0.01085590 0.02399142 -0.4524907 0.653876252

The p-values on line 2 is large, indicating that there is not evidence of differences in weight gain between mice on diet 2, and the baseline diet (diet 1).

The same is true for a comparison of diets 3 and 1, as seen on the third line.

2.1.9 A More Appropriate Statistical Analysis

The following command fits a linear mixed effects model (or multilevel model) that accounts for correlation between mice in the same litter.

M_LME <- lmer(data=mice, weight_gain~factor(diet) + (1 | factor(litter)))
summary(M_LME)$coeff

##                  Estimate  Std. Error        df   t value          Pr(>|t|)
## (Intercept)    0.05798494 0.025057550  5.046906  2.314071 0.068074390235216
## factor(diet)B -0.02394185 0.002962717 28.000001 -8.081043 0.000000008464648
## factor(diet)C -0.01085590 0.002962717 28.000001 -3.664169 0.001025965505479

Estimates are unchanged, but standard errors decrease by a factor of almost 10.
t-statistics are now high, and p-values low, indicating differences between the diets

Just as we saw graphically, and in tables, accounting for differences between litters allows us to accurately quantify unexplained variability, and assess whether there is evidence of differences in weight gain between the diets.

2.2 Linear Mixed Effects Models

2.2.1 LLSR Model for Mice Experiment

Let \(Y_{ij}\) denote the weight gain of mouse \(j\) in litter \(i\). \(j=1,2,\ldots, 6\), \(j=1,2,\ldots, 6\).

In an ordinary linear least squares regression model, we assume that:

each diet has an expected (or average) weight gain
individual mice vary from their expected weights randomly, according to normal distributions with constant variance \(\sigma^2\)
no two mice are any more or less alike than any others, except for diet (which is not true in this context)

A model would have the form:

\[ Y_{ij} = \beta_{0}+\beta_{1}\textrm{DietB}_{ij} +\beta_{2}\textrm{DietC}_{ij} + \epsilon_{ij}, \]

where \(\epsilon_{ij} \sim\mathcal{N}(0, \sigma^2)\).

Examples:

	Diet	Expected Weight	Random Deviation
Litter 1, Mouse 1	A	\(\beta_0\)	\(\epsilon_{11}\)
Litter 1, Mouse 3	B	\(\beta_0 + \beta_1\)	\(\epsilon_{13}\)
Litter 1, Mouse 5	C	\(\beta_0 + \beta_2\)	\(\epsilon_{15}\)
Litter 2, Mouse 1	A	\(\beta_0\)	\(\epsilon_{21}\)
Litter 2, Mouse 3	B	\(\beta_0 + \beta_1\)	\(\epsilon_{23}\)
Litter 2, Mouse 5	C	\(\beta_0 + \beta_2\)	\(\epsilon_{25}\)

2.2.2 Model Accounting For Correlation

Let \(Y_{ij}\) denote the weight gain of mouse \(j\) in litter \(i\). \(j=1,2,\ldots, 6\), \(j=1,2,\ldots, 6\).

We assume that:

expected (or average) weight gain differs between diets
individual litters vary from their expected weight randomly, according to normal distributions with constant variance \(\sigma^2_l\)
within each litter, individual mice vary from their expected weights randomly, according to normal distributions with constant variance \(\sigma^2\)

A model would have the form:

\[ Y_{ij} = \beta_{0}+\beta_{1}\textrm{DietB}_{ij} +\beta_{2}\textrm{DietC}_{ij} + l_{i} + \epsilon_{ij}, \]

where \(l_i\sim\mathcal{N}(0, \sigma^2_l)\), and \(\epsilon_{ij} \sim\mathcal{N}(0, \sigma^2)\).

	Diet	Expected Weight	Random Deviation
Litter 1, Mouse 1	A	\(\beta_0\)	\(l_1 + \epsilon_{11}\)
Litter 1, Mouse 3	B	\(\beta_0 + \beta_1\)	\(l_1 + \epsilon_{13}\)
Litter 1, Mouse 5	C	\(\beta_0 + \beta_2\)	\(l_1 + \epsilon_{15}\)
Litter 2, Mouse 1	A	\(\beta_0\)	\(l_2 + \epsilon_{21}\)
Litter 2, Mouse 3	B	\(\beta_0 + \beta_1\)	\(l_2 + \epsilon_{23}\)
Litter 2, Mouse 5	C	\(\beta_0 + \beta_2\)	\(l_2 + \epsilon_{25}\)

2.2.3 Questions of Interest

The model

\[ Y_{ij} = \beta_{0}+\beta_{1}\textrm{DietB}_{ij} +\beta_{2}\textrm{DietC}_{ij} + l_{i} + \epsilon_{ij}, \]

where \(l_i\sim\mathcal{N}(0, \sigma^2_l)\), and \(\epsilon_{ij} \sim\mathcal{N}(0, \sigma^2)\)

has 5 parameters:

\(\beta_0\) - average weight gain for mice on diet A.
\(\beta_1\) - difference in average weight gain for mice on diet B, compared to to diet A
\(\beta_2\) - difference in average weight gain for mice on diet C, compared to to diet A
\(\sigma_l\) - standard deviation in the distribution of differences between litters (i.e. variability explained by litter)
\(\sigma\) - standard deviation in the distribution of differences between individual mice in the same litter (i.e. unexplained variability)

Thus, for a mouse on Diet A, the expected weight gain follows a normal distribution with mean \(\beta_0\) and variance \(\sigma^2_l + \sigma^2\).

For a mouse on Diet C, the expected weight gain follows a normal distribution with mean \(\beta_0 + \beta_2\) and variance \(\sigma^2_l + \sigma^2\)

This is based on the fact that the sum of two independent normal random variables is normal, with mean equal to the sum of the means and variance equal to the sum of the variances.

2.2.4 Fixed and Random Effects

In this case, we want to test for whether there are differences in weight gain between the diets.

There is no reason to test for differences in weight gain between the litters. Differences between these specific litters of mice are not important to us. We’re not interested in drawing conclusions about these specific mice. They’re just a sample of participants, being used to investigate the diets. It’s likely that we’ll never acually see these specific litters of mice beyond this study.

We still need to account for litter, though, because it helps explain, or account for, variability that would otherwise go unexplained.

Variables for which we want to investigate differences or relationships are called fixed effects. We should build these into the “expectation structure” of the model, using \(\beta_j\)’s.
Variables that we are not interested in testing for differences or relationships between, but that we still want to include in our model in order to account for correlation and explain variability are called random effects. We should add these to the model as normally distributed error terms.
Accounting for random effects allows us to accurately calculate standard errors associated with fixed effects.
A model that involves both fixed and random effects is called a linear mixed effects model.

2.2.5 Fitting the Model in R

To fit a linear mixed effects model in R, we use the lmer() command that is part of the lme4 package. It is also helpful to load the lmerTest package, in order to obtain p-values in the output.

To add a random effect for a variable add (1 | variable_name) in the model.

M_LME <- lmer(data=mice, weight_gain~factor(diet) + (1 | factor(litter)))
summary(M_LME)

## Linear mixed model fit by REML. t-tests use Satterthwaite's method [
## lmerModLmerTest]
## Formula: weight_gain ~ factor(diet) + (1 | factor(litter))
##    Data: mice
## 
## REML criterion at convergence: -193.7
## 
## Scaled residuals: 
##     Min      1Q  Median      3Q     Max 
## -2.2468 -0.7088  0.0247  0.6028  1.9280 
## 
## Random effects:
##  Groups         Name        Variance   Std.Dev.
##  factor(litter) (Intercept) 0.00374095 0.061163
##  Residual                   0.00005267 0.007257
## Number of obs: 36, groups:  factor(litter), 6
## 
## Fixed effects:
##                Estimate Std. Error        df t value      Pr(>|t|)    
## (Intercept)    0.057985   0.025058  5.046906   2.314       0.06807 .  
## factor(diet)B -0.023942   0.002963 28.000001  -8.081 0.00000000846 ***
## factor(diet)C -0.010856   0.002963 28.000001  -3.664       0.00103 ** 
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## 
## Correlation of Fixed Effects:
##             (Intr) fct()B
## factor(dt)B -0.059       
## factor(dt)C -0.059  0.500

Estimates and Interpretations

\(\beta_0 = 0.058\): We estimate that average weight gain for Diet A is 0.058 g.
\(\beta_1 = -0.024\): We estimate that mice on diet B gain 0.024 g. less than mice on diet A, on average.
\(\beta_2 = -0.011\): We estimate that mice on diet C gain 0.011 g. less than mice on diet A, on average.

The “Random effects” table gives estimates of \(\sigma^2_l\) (litters) and \(\sigma^2\) (Residual).

\(\sigma_l = 0.061\): We estimate that the standard deviation in differences in weights between litters, after accounting for diet, is 0.061 g.
\(\sigma = 0.007\): We estimate that the standard deviation in differences in weights between mice within a litter, after accounting for diet, is 0.007 g.

There is evidence of differences between the diets.
There is more variability in weight between different litters than between mice in the same litter, after accounting for diet.

2.2.6 Why Not Fixed Effect for Litter?

Why would a model like the following, which uses litter as an expanatory variable in Example 1, not be very useful?

M_fixed_litter <- lm(data=mice, weight_gain~ factor(diet) + factor(litter))
summary(M_fixed_litter)

## 
## Call:
## lm(formula = weight_gain ~ factor(diet) + factor(litter), data = mice)
## 
## Residuals:
##        Min         1Q     Median         3Q        Max 
## -0.0164273 -0.0051744  0.0001942  0.0044281  0.0138701 
## 
## Coefficients:
##                  Estimate Std. Error t value             Pr(>|t|)    
## (Intercept)     -0.046333   0.003421 -13.544   0.0000000000000815 ***
## factor(diet)B   -0.023942   0.002963  -8.081   0.0000000084646521 ***
## factor(diet)C   -0.010856   0.002963  -3.664              0.00103 ** 
## factor(litter)2  0.126011   0.004190  30.075 < 0.0000000000000002 ***
## factor(litter)3  0.061021   0.004190  14.564   0.0000000000000136 ***
## factor(litter)4  0.147712   0.004190  35.254 < 0.0000000000000002 ***
## factor(litter)5  0.156366   0.004190  37.320 < 0.0000000000000002 ***
## factor(litter)6  0.134801   0.004190  32.173 < 0.0000000000000002 ***
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## 
## Residual standard error: 0.007257 on 28 degrees of freedom
## Multiple R-squared:  0.9874, Adjusted R-squared:  0.9843 
## F-statistic: 314.5 on 7 and 28 DF,  p-value: < 0.00000000000000022

We’re now estimating 8 \(\beta's\) instead of 3.

Each time we estimate an additional parameter, we lose a degree of freedom, making estimates and predictions less precise.

We don’t care about differences between the litters, so \(\beta_3, \beta_4, \ldots, \beta_7\) are not useful.

Imagine if we had many more litters. Things would get really messy, and unnecessarily so.

If, for some reason, we really wanted to test for differences in weight gain between these specific litters of mice, then we would treat them as fixed effects, but it’s hard to see why we would want to do that.

2.3 A Second Mice Experiment

2.3.1 Weight Gain in Mice: Experiment 2

Now consider a different structure of the mouse experiment.

In this version of the experiment, the three diets were randomly assigned to six pregnant mice, so that two mice were assigned to each diet. Each of the six mice (dams), gave birth to six pups, creating six litters of six, as seen before. Researchers the observed the mean weight gain of the pups over a four week period.

Now, each pup in a litter has necessarily been assigned to the same diet, since diets were assigned to the dams, before the pups were born.

Experimental Design 2:

Figure 2.2: Mouse Experiment Version 2

2.3.2 Mice Experiment 2 Data

The first 15 rows of the dataset look like this:

head(mice2,15)

##    litter diet   weight_gain
## 1       1    A -0.0352604099
## 2       1    A -0.0474166604
## 3       1    A -0.0545130532
## 4       1    A -0.0517145212
## 5       1    A -0.0481739132
## 6       1    A -0.0505177486
## 7       2    A  0.0758408263
## 8       2    A  0.0701352749
## 9       2    A  0.0885838352
## 10      2    A  0.0727807107
## 11      2    A  0.0826007925
## 12      2    A  0.0785294998
## 13      3    B  0.0002306143
## 14      3    B -0.0163362762
## 15      3    B -0.0146213462

2.3.3 A Naive Graphical Analysis for Experiment 2

Again, we’ll temporarily ignore the fact that mice come from the same litter and treat all observations as independent.

ggplot(data=mice2, aes(x=factor(diet), y=weight_gain))  + geom_point()  + 
  stat_summary(fun="mean", geom="line", aes(group=factor(1)))+  
  stat_summary(fun="mean", geom="point", color="red", size=2)

2.3.4 A More Informed Graphical Analysis for Experiment 2

Now, we’ll account for the fact that mice in the same litter got the same diets. The plot below adds color to show litter.

ggplot(data=mice2, aes(x=factor(diet), y=weight_gain, color=factor(litter))) + geom_point()

Since diets were assigned to litters, not individual mice, it is the litters we should be comparing.

When comparing diets, our observational units are litters, not individual mice. Since there are only 6 litters, our sample size is 6, rather than 36.

Thus, the best graphical analysis would come from the following plot, which displays average weight for each litter:

Litters <- mice2 %>% group_by(diet, litter)%>% 
  summarize(mean_weight_gain=mean(weight_gain), N=n())
head(Litters)

## # A tibble: 6 × 4
## # Groups:   diet [3]
##   diet  litter mean_weight_gain     N
##   <chr>  <int>            <dbl> <int>
## 1 A          1         -0.0479      6
## 2 A          2          0.0781      6
## 3 B          3         -0.00791     6
## 4 B          4          0.0788      6
## 5 C          5          0.0994      6
## 6 C          6          0.0779      6

ggplot(data=Litters, aes(x=factor(diet), y=mean_weight_gain, color=factor(litter))) + geom_point()

2.3.5 Table of Mean Weight By Diet for Experiment 2

mouse_groups <- mice2 %>% group_by(diet)%>% 
  summarize(Mean_Weight=mean(weight_gain), SD_Weight = sd(weight_gain), N=n())
mouse_groups

## # A tibble: 3 × 4
##   diet  Mean_Weight SD_Weight     N
##   <chr>       <dbl>     <dbl> <int>
## 1 A          0.0151    0.0661    12
## 2 B          0.0354    0.0457    12
## 3 C          0.0887    0.0137    12

This standard deviations in this table pertain to variability between the 12 mice that got each diet (6 from one litter and 6 from another).

These are not useful here, since diets were assigned to litters, not individual mice.

2.3.6 Table Comparing Litters Means by Diet for Experiment 2

We now look at means and standard deviations between litter means, using the average rate in each litter as the response variable.

litter_groups <- Litters %>% group_by(diet)%>% 
  summarize(Mean_Weight=mean(mean_weight_gain), SD_Weight = sd(mean_weight_gain), N=n())
litter_groups

## # A tibble: 3 × 4
##   diet  Mean_Weight SD_Weight     N
##   <chr>       <dbl>     <dbl> <int>
## 1 A          0.0151    0.0891     2
## 2 B          0.0354    0.0613     2
## 3 C          0.0887    0.0152     2

2.3.7 An Inappropriate Model for the Second Design

An ordinarly linear least squares regression model fails to account for the fact that treatments were assigned to litters, not mice. It is based on the assumption that we have 36 independent mice (which is incorret).

Such a model would have the form

\[ Y_{ij} = \beta_{0}+\beta_{1}\textrm{DietB}_{ij} +\beta_{2}\textrm{DietC}_{ij} + \epsilon_{ij}, \]

Output for such a model is shown below.

M2_LLSR <- lm(data=mice2, weight_gain~factor(diet))
summary(M2_LLSR)

## 
## Call:
## lm(formula = weight_gain ~ factor(diet), data = mice2)
## 
## Residuals:
##       Min        1Q    Median        3Q       Max 
## -0.069586 -0.041328 -0.005675  0.041915  0.073511 
## 
## Coefficients:
##               Estimate Std. Error t value Pr(>|t|)    
## (Intercept)    0.01507    0.01359   1.109 0.275297    
## factor(diet)B  0.02036    0.01922   1.060 0.297007    
## factor(diet)C  0.07358    0.01922   3.829 0.000545 ***
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## 
## Residual standard error: 0.04707 on 33 degrees of freedom
## Multiple R-squared:  0.3215, Adjusted R-squared:  0.2804 
## F-statistic: 7.819 on 2 and 33 DF,  p-value: 0.001662

2.3.8 A More Appropriate Model for Experiment 2

A linear mixed effects model with a random term for litter accounts for the fact that treatments were applied to litters, not individual mice. This model has the form:

\[ Y_{ij} = \beta_{0}+\beta_{1}\textrm{DietB}_{ij} +\beta_{2}\textrm{DietC}_{ij} + l_{i} + \epsilon_{ij}, \]

where \(l_i\sim\mathcal{N}(0, \sigma^2_l)\), and \(\epsilon_{ij} \sim\mathcal{N}(0, \sigma^2)\)

Output for a model that accounts for correlation between mice in the same litter is shown.

M2_LME <- lmer(data=mice2, weight_gain~factor(diet) + (1 | factor(litter)))
summary(M2_LME)

## Linear mixed model fit by REML. t-tests use Satterthwaite's method [
## lmerModLmerTest]
## Formula: weight_gain ~ factor(diet) + (1 | factor(litter))
##    Data: mice2
## 
## REML criterion at convergence: -206.7
## 
## Scaled residuals: 
##      Min       1Q   Median       3Q      Max 
## -2.33470 -0.59248  0.03505  0.59073  1.91089 
## 
## Random effects:
##  Groups         Name        Variance   Std.Dev.
##  factor(litter) (Intercept) 0.00396804 0.062992
##  Residual                   0.00005093 0.007136
## Number of obs: 36, groups:  factor(litter), 6
## 
## Fixed effects:
##               Estimate Std. Error      df t value Pr(>|t|)
## (Intercept)    0.01507    0.04459 3.00000   0.338    0.758
## factor(diet)B  0.02036    0.06306 3.00000   0.323    0.768
## factor(diet)C  0.07358    0.06306 3.00000   1.167    0.328
## 
## Correlation of Fixed Effects:
##             (Intr) fct()B
## factor(dt)B -0.707       
## factor(dt)C -0.707  0.500

Estimates are the same
Standard errors are larger, due mostly to the fact that our sample size is 6, instead of 36
after accounting for diet, standard deviation in weights between litters is estimated to be 0.0629 g. (estimate of \(\sigma_l\))
after accounting for diet, standard deviation in weights between mice in the same litter is estimated to be 0.0071 g. (estimate of \(\sigma\))

2.3.9 Model for Litter Means

Alternatively, we could fit a model, using the 6 litters as our observations, with the mean weight in each litter as the response variable. If we now let \(Y_i\) represent the mean weight in litter i, our model has the form:

\[ Y_{i} = \beta_{0}+\beta_{1}\textrm{DietB}_{i} +\beta_{2}\textrm{DietC}_{i} + \epsilon_{i}, \]

where \(\epsilon_{i} \sim\mathcal{N}(0, \sigma^2)\)

Since it is reasonable to assume that litters are independent, we could use an ordinary LLSR model in this context.

M2_Means <- lm(data=Litters, mean_weight_gain~factor(diet))
summary(M2_Means)

## 
## Call:
## lm(formula = mean_weight_gain ~ factor(diet), data = Litters)
## 
## Residuals:
##        1        2        3        4        5        6 
## -0.06301  0.06301 -0.04335  0.04335  0.01078 -0.01078 
## 
## Coefficients:
##               Estimate Std. Error t value Pr(>|t|)
## (Intercept)    0.01507    0.04459   0.338    0.758
## factor(diet)B  0.02036    0.06306   0.323    0.768
## factor(diet)C  0.07358    0.06306   1.167    0.328
## 
## Residual standard error: 0.06306 on 3 degrees of freedom
## Multiple R-squared:  0.3261, Adjusted R-squared:  -0.1231 
## F-statistic: 0.7259 on 2 and 3 DF,  p-value: 0.5532

Estimates, standard errors, and p-values are identical to the ones seen in the mixed-effects model.

2.3.10 Fixed Effect for Litter in Experiment 2

If we try to treat litter as a fixed effect in Experiment 2, we would not even be able to estimate all of the parameters.

M2_fixed_litter <- lm(data=mice2, weight_gain~ factor(diet) + factor(litter))
summary(M2_fixed_litter)

## 
## Call:
## lm(formula = weight_gain ~ factor(diet) + factor(litter), data = mice2)
## 
## Residuals:
##        Min         1Q     Median         3Q        Max 
## -0.0166840 -0.0041608  0.0003311  0.0042513  0.0136135 
## 
## Coefficients: (2 not defined because of singularities)
##                  Estimate Std. Error t value             Pr(>|t|)    
## (Intercept)     -0.047933   0.002913 -16.453 < 0.0000000000000002 ***
## factor(diet)B    0.126712   0.004120  30.755 < 0.0000000000000002 ***
## factor(diet)C    0.125801   0.004120  30.533 < 0.0000000000000002 ***
## factor(litter)2  0.126011   0.004120  30.585 < 0.0000000000000002 ***
## factor(litter)3 -0.086691   0.004120 -21.041 < 0.0000000000000002 ***
## factor(litter)4        NA         NA      NA                   NA    
## factor(litter)5  0.021565   0.004120   5.234             0.000012 ***
## factor(litter)6        NA         NA      NA                   NA    
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## 
## Residual standard error: 0.007136 on 30 degrees of freedom
## Multiple R-squared:  0.9858, Adjusted R-squared:  0.9835 
## F-statistic: 417.2 on 5 and 30 DF,  p-value: < 0.00000000000000022

2.4 A Multilevel Experiment

2.4.1 Experiment 2 Illustration

In example 2, we saw that treatments (diets) were assigned to litters (dams), but measurements were taken on the individual mice (pups).

2.4.2 Experiment with Variables Assigned at Different Levels

Now imagine an experiment with the following setup

diets are still assigned to dams, prior to the birth of the pups, so all mice in the same litter get the same diet
within each litter three mice are given nutritional supplements after their birth and the other three are not

We want to study the effect of diet and supplement on weight gain.

One treatment (diet) is assigned to litters, while the other (supplement) is assigned to individual mice.

For the purpose of comparing diets, our observational units are 6 independent litters.
For the purpose of comparing supplements, our observational units are 36 individual mice (who are not independent)

An experiment where treatments are assigned at different levels is called a multilevel experiment

level 1 observational units are mice, and level 1 treatment is supplement
level 2 observational units are litters, and level 2 treatment is diet

2.4.3 Experiment 3 Data

head(mice3,15)

##    litter diet supplement  weight_gain
## 1       1    A          1 -0.033260410
## 2       1    A          0 -0.047416660
## 3       1    A          1 -0.052513053
## 4       1    A          0 -0.051714521
## 5       1    A          1 -0.046173913
## 6       1    A          0 -0.050517749
## 7       2    A          1  0.077840826
## 8       2    A          0  0.070135275
## 9       2    A          1  0.090583835
## 10      2    A          0  0.072780711
## 11      2    A          1  0.084600792
## 12      2    A          0  0.078529500
## 13      3    B          1  0.002230614
## 14      3    B          0 -0.016336276
## 15      3    B          1 -0.012621346

2.4.4 Graphical Analysis for Experiment 3

We use color to represent litter, and shape to represent supplement. We’ll use the argument position=position_jitterdodge() to stagger litters and supplement levels, which avoids overlap and makes the graph easier to read.

ggplot(data=mice3, aes(x=factor(diet), y=weight_gain, color=factor(litter), 
       shape=factor(supplement))) + geom_point(position=position_jitterdodge())

2.4.5 An Inappropriate Model for the 3rd Design

The model has the form

\[ Y_{ij} = \beta_{0} + \alpha\textrm{Supplement}_i + \beta_{1}\textrm{DietB}_{ij} +\beta_{2}\textrm{DietC}_{ij} + \epsilon_{ij}, \]

where \(\epsilon_{ij} \sim\mathcal{N}(0, \sigma^2)\)

Output for such a model is shown below.

M3_LLSR <- lm(data=mice3, weight_gain~supplement + factor(diet))
summary(M3_LLSR)

## 
## Call:
## lm(formula = weight_gain ~ supplement + factor(diet), data = mice3)
## 
## Residuals:
##       Min        1Q    Median        3Q       Max 
## -0.072685 -0.040982 -0.005618  0.040366  0.070412 
## 
## Coefficients:
##               Estimate Std. Error t value Pr(>|t|)    
## (Intercept)   0.011974   0.015895   0.753 0.456766    
## supplement    0.008198   0.015895   0.516 0.609538    
## factor(diet)B 0.020361   0.019467   1.046 0.303431    
## factor(diet)C 0.073578   0.019467   3.780 0.000648 ***
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## 
## Residual standard error: 0.04768 on 32 degrees of freedom
## Multiple R-squared:  0.3263, Adjusted R-squared:  0.2632 
## F-statistic: 5.167 on 3 and 32 DF,  p-value: 0.005021

2.4.6 A More Appropriate Model for Experiment 3

We instead fit a linear mixed effects model.

Since we want to study the effects of diet and supplement, we treat supplement and diet as fixed effects.

Since we want to account for correlation due to mice being in the same litter, we treat litter as a random effect.

Our model has the form

\[ Y_{ij} = \beta_{0} + \alpha\textrm{Supplement}_i + \beta_{1}\textrm{DietB}_{ij} +\beta_{2}\textrm{DietC}_{ij} + l_{i} + \epsilon_{ij}, \]

where \(l_i\sim\mathcal{N}(0, \sigma^2_l)\), and \(\epsilon_{ij} \sim\mathcal{N}(0, \sigma^2)\)

Output for a model that accounts for correlation between mice in the same litter is shown.

M3_LME <- lmer(data=mice3, weight_gain ~ supplement + factor(diet) + (1 | factor(litter)))
summary(M3_LME)

## Linear mixed model fit by REML. t-tests use Satterthwaite's method [
## lmerModLmerTest]
## Formula: weight_gain ~ supplement + factor(diet) + (1 | factor(litter))
##    Data: mice3
## 
## REML criterion at convergence: -203.9
## 
## Scaled residuals: 
##      Min       1Q   Median       3Q      Max 
## -2.12499 -0.76119  0.06055  0.60135  1.64980 
## 
## Random effects:
##  Groups         Name        Variance   Std.Dev.
##  factor(litter) (Intercept) 0.00396973 0.063006
##  Residual                   0.00004076 0.006384
## Number of obs: 36, groups:  factor(litter), 6
## 
## Fixed effects:
##                Estimate Std. Error        df t value Pr(>|t|)    
## (Intercept)    0.011974   0.044603  3.003417   0.268 0.805735    
## supplement     0.008198   0.002128 29.000000   3.853 0.000596 ***
## factor(diet)B  0.020361   0.063060  3.000000   0.323 0.767980    
## factor(diet)C  0.073578   0.063060  3.000000   1.167 0.327609    
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## 
## Correlation of Fixed Effects:
##             (Intr) spplmn fct()B
## supplement  -0.024              
## factor(dt)B -0.707  0.000       
## factor(dt)C -0.707  0.000  0.500

Interpretations

Interpretations for fixed effects are the same as in LLSR.

We expect mice on the supplement to gain 0.008 g. more than mice not on the supplement, assuming they get the same diet.
We expect mice in diet B to gain 0.02 g. more than mice on diet A, assuming supplement is held constant.
We expect mice in diet C to gain 0.07 g. more than mice on diet A, assuming supplement is held constant
After accounting for differences in diet and supplement, the standard deviation in weights between litters is estimated to be 0.063 g. (an estimate of \(\sigma_l\)).
After accounting for differences in diet and supplement, the standard deviation in weights between mice in the same litter is estimated to be 0.00638 g. (an estimate of \(\sigma\)).

2.4.7 Comparison of LLSR and LME Models

Compared to the incorrect LLSR model, when we use the linear mixed effects model:

estimates for fixed effects supplement and diet do not change
standard error for supplement is smaller - we get a more precise comparison for supplements because the model has accounted for variability due to differences in litters
standard error for diet is larger - since diets were assigned to litters, our sample size is 6, not 36, so standard errors are higher

The mixed effects model suggests evidence of differences due to supplement, but not evidence of differences due to diet. This is the opposite of the incorrect LLSR model.

2.5 Practice Questions

Refer to the following scenarios to answer the questions in this section.

Scenario 1

An experiment was conducted to compare the effects of two different fertilizers on plants. Twelve plants were grown in each of eight trays, for a total of 96 plants. Fertilizers were randomly assigned to entire trays, so that all plants in a tray received the same fertilizer, and each fertilizer was used on four different trays. The amount of fertilizer applied to each plant (in ounces) also varied between trays. Two trays received 2, oz, two received, 4 oz, two received 6 oz, and two received 8 oz. Plants in the same tray received the same amount of fertilizer. (See the diagram for an illustration.) The growth of each plant (in cm.) over a two week period was recorded.

Scenario 1 Illustration

The diagram shows values of the explanatory variables, brand and amount, but not the response variable, growth.

Figure 2.3: Plant Experiment Scenario 1

The first few rows of the data are shown below.

head(plants)

## # A tibble: 6 × 5
##    ...1  tray brand amount growth
##   <dbl> <dbl> <dbl>  <dbl>  <dbl>
## 1     1     1     1      2   2.82
## 2     2     1     1      2   2.9 
## 3     3     1     1      2   3.06
## 4     4     1     1      2   2.87
## 5     5     1     1      2   3.28
## 6     6     1     1      2   2.94

A scatterplot showing growth by amount of fertilizer, brand, and tray is shown below.

ggplot(data=plants, aes(x=amount, y=growth, shape=factor(brand), color=factor(tray))) + geom_point()

Figure 2.4: Scenario 1

Scenario 2

Now consider a different version of the experiment in Question 2. In this version, the plants are planted in separate plots, which are then placed in the 8 trays. All 12 plants in each tray still receive the same brand of fertilizer, but the amount applied is randomly assigned to pots, so that within each tray, three pots get 2 oz, three get 4 oz, three get 6 oz, and three get 8 oz. (See illustration).

Scenario 2 Illustration

The diagram shows values of the explanatory variables, brand and amount, but not the response variable, growth.

Figure 2.5: Plant Experiment Scenario 2

The first few rows of the data are shown below.

head(plants2)

## # A tibble: 6 × 5
##    ...1  tray brand amount growth
##   <dbl> <dbl> <dbl>  <dbl>  <dbl>
## 1     1     1     1      2   2.82
## 2     2     1     1      2   2.9 
## 3     3     1     1      2   3.06
## 4     4     1     1      4   3.41
## 5     5     1     1      4   3.82
## 6     6     1     1      4   3.48

A scatterplot showing growth by amount of fertilizer, brand, and tray in Scenario 2 is shown below.

ggplot(data=plants2, aes(x=amount, y=growth, shape=factor(brand), color=factor(tray))) + geom_point()

Figure 2.6: Scenario 2

7. Identify the response variable and fixed and random effects associated with a given set of data.

Identify the response variable and all fixed and random effects in scenarios 1 and 2.

8. Interpret variance component estimates (\(\sigma\) and \(\sigma_u\)) in linear mixed effects (LME) models.

Let \(Y_{ij}\) represent growth of plant \(j\) in tray \(i\). We will work with a random effects model of the form:

\[ Y_{ij} = \beta_{0}+\beta_1\textrm{Brand2}_{ij} + \alpha\textrm{Amount}_{ij} + t_{i} + \epsilon_{ij}, \]

where \(t_i\sim\mathcal{N}(0, \sigma^2_t)\), and \(\epsilon_{ij} \sim\mathcal{N}(0, \sigma^2)\).

a) Explain, in words what \(\sigma\) and \(\sigma_t\) represent in this model.

R output for a linear mixed effects model for scenario 1 is shown below.

M1 <- lmer(data=plants, growth ~ factor(brand) + amount + (1|tray))
summary(M1)

## Linear mixed model fit by REML. t-tests use Satterthwaite's method [
## lmerModLmerTest]
## Formula: growth ~ factor(brand) + amount + (1 | tray)
##    Data: plants
## 
## REML criterion at convergence: 29.9
## 
## Scaled residuals: 
##      Min       1Q   Median       3Q      Max 
## -2.56962 -0.63778 -0.07834  0.54127  2.67454 
## 
## Random effects:
##  Groups   Name        Variance Std.Dev.
##  tray     (Intercept) 0.24871  0.4987  
##  Residual             0.05609  0.2368  
## Number of obs: 96, groups:  tray, 8
## 
## Fixed effects:
##                Estimate Std. Error      df t value Pr(>|t|)   
## (Intercept)     2.43906    0.47086 5.00000   5.180  0.00353 **
## factor(brand)2  0.54312    0.35594 5.00000   1.526  0.18756   
## amount          0.36027    0.07959 5.00000   4.527  0.00625 **
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## 
## Correlation of Fixed Effects:
##             (Intr) fct()2
## fctr(brnd)2 -0.378       
## amount      -0.845  0.000

Give the estimates of \(\sigma\) and \(\sigma_u\). Which is bigger and what does that tell us?

9. Determine how standard errors in a LME model would compare to the standard errors we would get if we used a LLSR model on the same data.

In each of scenarios 1 and 2, suppose we inappropriately fit a LLSR model of the form
\[ Y_{ij} = \beta_{0}+\beta_1\textrm{Brand2}_{ij} + \alpha\textrm{Amount}_{ij} + \epsilon_{ij}. \]

to the same data. For each scenario, state whether the standard errors associated with brand and amount would increase, decrease, or stay the same. How would this affect the p-value and the amount of evidence of effects associated with these variables?

For the data in Scenario 1, we now create a dataset grouped by tray, with the average growth displayed for each tray. The data are displayed below.

plants_summary <- plants %>% group_by(tray) %>% summarize(brand=mean(brand),
                                                          amount=mean(amount),
                                                          growth=mean(growth), 
                                                          N=n())
plants_summary

## # A tibble: 8 × 5
##    tray brand amount growth     N
##   <dbl> <dbl>  <dbl>  <dbl> <int>
## 1     1     1      2   2.97    12
## 2     2     2      4   3.90    12
## 3     3     1      6   4.11    12
## 4     4     2      8   5.67    12
## 5     5     1      8   5.45    12
## 6     6     2      6   5.72    12
## 7     7     1      4   4.43    12
## 8     8     2      2   3.85    12

If we fit a linear least squares regression model with brand and amount as explanatory variables, how would the estimates, standard errors, and p-values compare to those obtained from the LME model for Scenario 1?

Explain why it would not make sense to group the data by tray and fit a LLSR model in scenario 2?

10. Determine how changes in sample size and/or experimental design will impact standard errors, confidence intervals, and p-values in a LME model.

In Scenario 1, explain why it is inappropriate to think of our sample size as being 96 plants in this scenario. What are the appropriate observational units, and what is the relevant sample size for determining the effects of brand and amount?

In Scenario 2, what is the relevant sample size associated with brand variable? What about the amount variable?

Consider the experimental designs in Questions 1 and 2. If you were designing the experiment and could choose either design, at equal cost, which would you choose? Why?

11. Compare the size of different variance component estimates given a LME model and a small set of data (i.e. say whether \(\sigma_u\), or \(\sigma\) will be larger).

Consider the simplified version of the plant experiment shown below. Here, there are no fixed effects, and we simply seek to estimate the average height of the plants and measure the amount of variability associated with plants and trays. Now note that unlike the previous diagrams, the values of the response variable, growth, are shown.

Let \(Y_{ij}\) represent growth of plant \(j\) in tray \(i\). (i=1, , 4 and j=1, , 6). We will work with a random effects model of the form:

\[ Y_{ij} = \beta_{0} + t_{i} + \epsilon_{ij}, \]

where \(t_i\sim\mathcal{N}(0, \sigma^2_t)\), and \(\epsilon_{ij} \sim\mathcal{N}(0, \sigma^2)\).

Will \(\sigma_t\) or \(\sigma\) be larger? Explain your answer. (Don’t actually try to calculate these, just answer based on the information given in the diagram)

Answer the same question as in (a), for the following data. In this scenario, there are only 3 trays.

Now, suppose 3 plants in each tray receive fertilizer brand A and 3 receive fertilizer brand B. The fertilizer brands are shown in the plots below.

Consider Model 1 below, fit to data shown underneath it. State whether \(\sigma_t\) or \(\sigma\) will be larger? Explain your answer.

Let \(Y_{ij}\) represent growth of plant \(j\) in tray \(i\).

Model 1:

\[ Y_{ij} = \beta_{0} + t_{i} + \epsilon_{ij}, \]

where \(t_i\sim\mathcal{N}(0, \sigma^2_t)\), and \(\epsilon_{ij} \sim\mathcal{N}(0, \sigma^2)\).

Now suppose that Model 2 is fit to the same data. State whether \(\sigma_t\) or \(\sigma\) will be larger? Explain your answer.

Model 2:

\[ Y_{ij} = \beta_{0} + \beta_1\text{BrandB}_{ij} + t_{i} + \epsilon_{ij}, \]

where \(t_i\sim\mathcal{N}(0, \sigma^2_t)\), and \(\epsilon_{ij} \sim\mathcal{N}(0, \sigma^2)\).