In this exercise you will:
family="binomial"
versus
family="quasibinomial"
Wedderburn’s leaf blotch data represents the proportion of leaf blotch on ten varieties of barley grown at nine sites in 1965. The amount of leaf blotch is recorded as a percentage. Obtain the data using:
and answer the following questions.
obtain the data and check that variety and site are correctly defined.
fit a GLM that is appropriate for this data, assuming the data are binomially distributed.
Why can’t we fit the interaction model here.
produce a plot of the residuals vs the predicted values for this model and identify any problems that might exist.
:\{r plotLeaf.glm} plot(Leaf.glm,which=1)
Leaf.glm.quasi = glm(Prop~Site+Variety, family=quasibinomial(), data=LeafBlotch, weights=rep(100, nrow(LeafBlotch)))
:\{r plotLeaf.glm.quasi} plot(Leaf.glm.quasi, which=1)
What is the dispersion parameter for this model?
Describe the effect site and variety have on the proportion of leaf blotch.
You should compare your work with the solutions for this workshop.
The agridat
package includes this dataset and calls it
wedderburn.barley
. The help page for it presents several
other ways to model the proportion that are beyond the expected learning
for the 161.331 course. You might investigate if you have lots of time
on your hands, or after the conclusion of the semester.