Week 12 Lab -- Confidence Intervals in (G)LMM

This week we’ll continue looking at linear and generalized linear mixed effects models, emphasizing how to get confidence intervals on the predictions. We’ll start with something easy: a linear mixed effects model with the sleepstudy data.

library(tidyverse)
library(lme4)  ## notice conflicts with tidyr
library(broom)
library(boot)

ssBase <- ggplot(sleepstudy, aes(x = Days, y = Reaction)) + geom_point(aes(color = Subject)) + 
    scale_color_discrete() + labs(x = "Days with no sleep", y = "Reaction time [ms]")
ssBase + geom_smooth(method = "lm") + facet_wrap(~Subject) + guides(color = FALSE)

So there’s the basic data, and now we fit a mixed model with both the intercept and the slope for Days varying across subjects.

# fit a mixed model
summary(ss.mm <- lmer(Reaction ~ Days + (1 + Days | Subject), sleepstudy, REML = FALSE))

Let’s check the profile confidence limits on the estimates before we look at getting confidence limits on predictions.

# remember this is SLOOOOW:
# ss.parms <- tidy(ss.mm, conf.int = TRUE, conf.method = "profile")
# unfortunately tidy.merMod doesn't do profile intervals
confint(ss.mm, oldNames = FALSE)

So the most interesting thing here is that the confidence interval for the correlation between the random intercept and slope includes 0. The other thing to recognize is that all of the variance parameters are uncertain as well. Even the residual variance has a range of possible values! This is the uncertainty that is not easy to include in our confidence intervals on predictions.

What I want to do now is get confidence intervals on predicted values so we can add nice confidence polygons to our prediction plots. Unlike the methods for glm() or lm() objects, the predict() method for merMod objects produced by (g)lmer() doesn’t give us standard errors or confidence limits. The reason given by the developers is that there isn’t a clear method to account for the variance in the random effects parameters. Recognizing that those parameters are uncertain would increase the width of the confidence intervals on our predictions.
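To see the contrast, here is a quick check against an ordinary regression (ss.lm is just a plain lm() fit made for this comparison, not part of the analysis):

# lm() happily returns standard errors on its predictions ...
ss.lm <- lm(Reaction ~ Days, sleepstudy)
str(predict(ss.lm, newdata = data.frame(Days = 0:9), se.fit = TRUE))
# ... but predict() on a merMod object has no se.fit option, so we will
# have to build interval estimates ourselves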

First I want to make a plot that has both the population level fitted values and the subject level lines. This is going to get messy.

# get fitted values
ss.fitted <- augment(ss.mm)
names(ss.fitted)

augment() gives us a bunch of columns for merMod objects; the one we’re interested in here is .fitted. There is also a column .mu that has identical values in it. For a model that has a link function other than the identity link (e.g. the default logit link for the binomial family), .fitted will be on the link scale and .mu will be on the response scale. We’ll come back to that in the next example.
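A quick check that the two columns really are identical for this identity-link model (column names as returned by broom’s augment() here; other versions of broom may differ):

# identity link, so the link-scale and response-scale fits coincide
all.equal(ss.fitted$.fitted, ss.fitted$.mu)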

# x is inherited from ssBase. If we don't specify color = Subject in
# geom_line() we only get one line, because color = Subject was set in
# geom_point(), not in ggplot()
ssBase + geom_line(aes(y = .fitted, color = Subject), data = ss.fitted) + guides(color = FALSE)

So each line represents the fitted values including the effects of the random perturbations on the intercept and slope. Now get the population level predictions using predict() and a new data.frame.

nd <- data.frame(Days = 0:9)
# re.form tells predict() what to do with the random effects;
# ~0 says ignore them, giving population-level predictions
nd$PopPred <- predict(ss.mm, newdata = nd, re.form = ~0)
ssBase + geom_line(aes(y = .fitted, color = Subject), data = ss.fitted, alpha = 0.5) + 
    geom_line(aes(y = PopPred), data = nd, size = 2) + guides(color = FALSE)

OK, but we want to know how much confidence to have in that prediction. Really the only option here is bootstrapping. This is slow, but it captures all the uncertainty in our prediction. Fortunately lme4 includes a function, bootMer(), that bootstraps by simulating a new sample of data from the fitted model and then refitting the model to those data. Repeat that thousands of times and you get a distribution of possible model fits. Unfortunately we have to write a function to extract the predictions from the refitted models the way we want them. Start by getting a single random sample.

# bootMer() takes a fitted merMod object, then FUN: a function that takes a
# fitted merMod object and returns the thing we want as a vector.
# fixef() gives us the fixed effects coefficients from a fit
test <- bootMer(ss.mm, fixef)
# a (fairly) compact look at the *str*ucture of an object
str(test)

There’s a lot of stuff in there, most of which relates to internals that would allow us to recreate the simulation. We are mostly interested in t0 and t. t0 is the result of applying our FUN to the original fitted object; in this case it gives us a vector of the fixed effects coefficients. t is a matrix of the simulation output: each row is the result of our FUN applied to one simulated fit. By default bootMer() only does one simulation. If we want more:

test <- bootMer(ss.mm, fixef, nsim = 10)
test$t

If we set nsim = 10000 and then took quantiles of the resulting columns, we could get bootstrapped confidence limits on our fixed effects. That’s what is happening when we do confint(ss.mm, method = "boot"); a quick demo follows below. For predictions, we want a function that makes a prediction from a fitted model. Just using predict() won’t work, because bootMer() calls FUN with only one argument, the fitted model. So we have to make a “wrapper” that will call predict() with the right arguments.
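First, the built-in version (nsim kept small here so it finishes quickly; real intervals need many more simulations):

# bootstrapped confidence limits on the model parameters, built in to lme4
confint(ss.mm, method = "boot", nsim = 100)

Now the wrapper: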

myFunc <- function(mm) {
    # forgot the re.form argument on first try
    predict(mm, newdata = nd, re.form = ~0)
}
myFunc(ss.mm)  # works
# try with bootMer()
test <- bootMer(ss.mm, myFunc)
test$t  #works

OK, this next bit takes a while … about 1 minute on my (old) home computer

bigBoot <- bootMer(ss.mm, myFunc, nsim = 1000)
head(bigBoot$t)

Now we want quantiles of each column of that matrix, transposed so the rows line up with the rows of the original nd.

# apply(xx, MARGIN = 2, FUN, ...) 'applies' FUN to each column (because
# MARGIN = 2); arguments in ... get passed to FUN. t() transposes the result
predCL <- t(apply(bigBoot$t, MARGIN = 2, FUN = quantile, probs = c(0.025, 0.975)))
# ggplot only works with data.frames, so add the limits to nd;
# '%' in column names is a disaster, so be explicit
nd$lci <- predCL[, 1]
nd$uci <- predCL[, 2]

ssBase + geom_line(aes(y = .fitted, color = Subject), data = ss.fitted, alpha = 0.5) + 
    geom_line(aes(y = PopPred), data = nd, size = 2) + geom_ribbon(aes(x = Days, 
    ymin = lci, ymax = uci), data = nd, inherit.aes = FALSE, alpha = 0.2) + 
    guides(color = FALSE)

So that’s pretty nice. I can add a geom_smooth() to show how much wider the bootstrapped confidence intervals are compared to a simple regression that ignores all the extra variability.

ssBase + geom_line(aes(y = .fitted, color = Subject), data = ss.fitted, alpha = 0.5) + 
    geom_line(aes(y = PopPred), data = nd, size = 2) + 
    # set inherit.aes to FALSE here, otherwise geom_ribbon() looks for a y
    # aesthetic called Reaction
    geom_ribbon(aes(x = Days, ymin = lci, ymax = uci), data = nd, inherit.aes = FALSE, 
        alpha = 0.2) + 
    geom_smooth(method = "lm", alpha = 0.8, color = "black") + 
    guides(color = FALSE)

It is interesting that there isn’t much difference when Days == 0. Note that you wouldn’t ever use that simple regression in practice, because it ignores the repeated measurements on each subject! However, it might be nice to build a few polygons at different levels of confidence.

predCL <- t(apply(bigBoot$t, MARGIN = 2, FUN = quantile, probs = c(0.05, 0.95, 
    0.1, 0.9, 0.25, 0.75)))
# try making it into a data.frame
predCL <- data.frame(predCL)
# check names(predCL): data.frame() puts an X at the front because column
# names can't start with a number, and replaces % with '.'
# put them into nd
nd <- bind_cols(nd, predCL)
# brute force multi-ribbons:
ssBase + geom_line(aes(y = .fitted, color = Subject), data = ss.fitted, alpha = 0.5) + 
    geom_line(aes(y = PopPred), data = nd, size = 2) + 
    # set inherit.aes to FALSE here, otherwise geom_ribbon() looks for a y
    # aesthetic called Reaction; 95% ribbon first
    geom_ribbon(aes(x = Days, ymin = lci, ymax = uci), data = nd, inherit.aes = FALSE, 
        alpha = 0.2) + 
    # add 90% ribbon
    geom_ribbon(aes(x = Days, ymin = X5., ymax = X95.), data = nd, inherit.aes = FALSE, 
        alpha = 0.2) + 
    # add 80% ribbon
    geom_ribbon(aes(x = Days, ymin = X10., ymax = X90.), data = nd, inherit.aes = FALSE, 
        alpha = 0.2) + 
    # add 50% ribbon
    geom_ribbon(aes(x = Days, ymin = X25., ymax = X75.), data = nd, inherit.aes = FALSE, 
        alpha = 0.2) + 
    guides(color = FALSE)

That doesn’t look as good as I’d hoped. The idea here is to draw the eye closer to the expected value rather than the outer edges of the confidence region.
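One way to strengthen that effect is to set the alpha for each ribbon explicitly, so the stack darkens toward the center rather than relying on four identical alphas overlapping. A sketch (untested polish, using the X5., X95., etc. columns created above; a list of layers can be added to a ggplot with +):

# build the ribbons once, darkest nearest the expected value;
# the subject-level lines are dropped to reduce clutter
ribbons <- list(
    geom_ribbon(aes(x = Days, ymin = lci, ymax = uci), data = nd,
        inherit.aes = FALSE, alpha = 0.1),   # 95%
    geom_ribbon(aes(x = Days, ymin = X5., ymax = X95.), data = nd,
        inherit.aes = FALSE, alpha = 0.2),   # 90%
    geom_ribbon(aes(x = Days, ymin = X10., ymax = X90.), data = nd,
        inherit.aes = FALSE, alpha = 0.3),   # 80%
    geom_ribbon(aes(x = Days, ymin = X25., ymax = X75.), data = nd,
        inherit.aes = FALSE, alpha = 0.4))   # 50%
ssBase + ribbons + geom_line(aes(y = PopPred), data = nd, size = 2) + 
    guides(color = FALSE)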

Doing it with a GLMM

So now we want to redo the deer model from last week.

library(NRES803)
data(ecervi)
ecervi <- ecervi %>% mutate(Ecervi.pa = Ecervi > 0, fSex = factor(Sex, labels = c("Male", 
    "Female")), cLength = Length - mean(Length), csLength = cLength/sd(Length))

contrasts(ecervi$fSex) <- contr.sum(2)

M0 <- glmer(Ecervi.pa ~ fSex * csLength + (1 + csLength | Farm), data = ecervi, 
    family = binomial)

lr <- range(ecervi$csLength)
nd <- expand.grid(csLength = seq(lr[1], lr[2], length = 20), fSex = factor(levels(ecervi$fSex)))

nd$pp <- predict(M0, newdata = nd, type = "response", re.form = ~0)

ggplot(nd, aes(x = csLength, y = pp, col = fSex)) + geom_line(alpha = 0.5) + 
    ylab("Probability of E. cervi infection") + geom_line(aes(x = csLength, 
    y = pp, col = fSex), data = nd, inherit.aes = FALSE, size = 2) + geom_rug(aes(x = csLength), 
    data = filter(ecervi, Ecervi.pa), sides = "t", inherit.aes = FALSE) + geom_rug(aes(x = csLength), 
    data = filter(ecervi, !Ecervi.pa), sides = "b", inherit.aes = FALSE) + facet_wrap(~fSex) + 
    guides(color = FALSE)

I like this one with facet_wrap() better than the one from last week. OK, now we want to do the bootstrap. But fitting this model once is already slow, so fitting it 1000 times is even slower! Bootstrapping is one of the things that benefits very easily from parallel computing. There are options to bootMer() that split the computation up 4 ways, making it about 4 times faster. Most modern CPUs have multiple cores that can run independently; in normal operation R uses just one core. My office Windows computer has 8 cores, so it could speed up this computation by a factor of 8! This seems a bit fussy, so use at your own risk! You can download a binary file with my bigBoot2 object in it here.
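You can check how many cores your machine has with the parallel package, which ships with base R:

# how many cores does this machine have?
parallel::detectCores()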

# Don't run this code!  takes a loooong time ...
bigBoot2 <- bootMer(M0, myFunc, nsim = 1000, parallel = "multicore", ncpus = 4)
save(bigBoot2, file = "data/bigBoot2.RData")
# remember to download the bigBoot2.RData file
if (!exists("bigBoot2")) {
    load("data/bigBoot2.Rdata")
}
# the object returned by bootMer is of type boot and there are some useful
# functions there.
envPred <- envelope(bigBoot2)
str(envPred)

The matrix point in there contains the “pointwise” 95% confidence intervals for each of the 40 rows in nd. The error rate for those intervals is 5% at each value, which means the chance that at least one point along the entire curve falls outside the envelope is much higher (about 24% here). The matrix overall expands those limits until the error rate over the entire curve is 5%. Time to add these to a plot. First we have to get them into the nd data.frame.

nd$lower.point <- envPred$point[1, ]
nd$upper.point <- envPred$point[2, ]
ggplot(nd, aes(x = csLength, y = pp, col = fSex)) + geom_line(alpha = 0.5) + 
    ylab("Probability of E. cervi infection") + geom_line(aes(x = csLength, 
    y = pp, col = fSex), data = nd, inherit.aes = FALSE, size = 2) + geom_rug(aes(x = csLength), 
    data = filter(ecervi, Ecervi.pa), sides = "t", inherit.aes = FALSE) + geom_rug(aes(x = csLength), 
    data = filter(ecervi, !Ecervi.pa), sides = "b", inherit.aes = FALSE) + geom_ribbon(aes(x = csLength, 
    ymin = lower.point, ymax = upper.point, color = fSex), data = nd, inherit.aes = FALSE) + 
    facet_wrap(~fSex) + guides(color = FALSE)

OK. That’s not right! And I mean more than just the fact that the ribbons are completely black. Look at the range: those values go from -5 to nearly 8. They can’t be probabilities.

And in fact they aren’t. Back when we created myFunc(), we just used predict() without specifying the type argument (think back to weeks 5 and 6 with glm() fits). By default, predict() produces values on the link scale, the scale of the linear predictor part of the model. But we want to plot probabilities, so we have two choices:

  1. change myFunc to use type="response" and rerun bootMer()
  2. transform the link scale values to probabilities with the logistic function.

I’m going with (2), because it took hours to do (1) the first time.
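As a quick sanity check, inv.logit() is just the logistic function, 1/(1 + exp(-x)); base R’s plogis() computes the same thing:

x <- c(-5, 0, 5)
all.equal(inv.logit(x), 1/(1 + exp(-x)))
all.equal(inv.logit(x), plogis(x))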

# boot::inv.logit() does the job
nd$lower.point <- inv.logit(envPred$point[1, ])
nd$upper.point <- inv.logit(envPred$point[2, ])
# pull out one (arbitrary) bootstrap replicate to show a single sample curve
nd$sample.curve <- inv.logit(bigBoot2$t[127, ])
ggplot(nd, aes(x = csLength, y = pp, col = fSex)) + 
    # and don't forget to change alpha
    geom_ribbon(aes(x = csLength, ymin = lower.point, ymax = upper.point, fill = fSex), 
        data = nd, inherit.aes = FALSE, alpha = 0.5) + 
    geom_line(aes(col = fSex), alpha = 0.5) + 
    ylab("Probability of E. cervi infection") + 
    geom_line(aes(x = csLength, y = pp, col = fSex), data = nd, inherit.aes = FALSE, size = 2) + 
    geom_rug(aes(x = csLength), data = filter(ecervi, Ecervi.pa), sides = "t", inherit.aes = FALSE) + 
    geom_rug(aes(x = csLength), data = filter(ecervi, !Ecervi.pa), sides = "b", inherit.aes = FALSE) + 
    facet_wrap(~fSex) + 
    guides(color = FALSE, fill = FALSE) + 
    geom_line(aes(y = sample.curve))

Phew! The reason this lab is so late getting up is that I could not get that d#$(*&#) graph to have matching colors on the line and the fill. I spent a couple of hours trying everything I could think of. As it turns out, it was failing because I had copied the graph code but not changed the object names, and because I was working interactively the old objects were still around! I only realized my error after wiping the workspace and running the script from the top. The moral of the story is: when nothing makes sense, take it from the top!
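If you want to make a habit of that, clear the workspace before re-running a script (in RStudio, Session > Restart R does an even more thorough job):

# wipe everything in the global environment, then run the script from the top
rm(list = ls())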

  1. Redo the plot with the “overall” 95% confidence limits and compare with the previous plot. Does that change your inference?