Lecture 23 Orthogonal Factorial Models

In class version

We know that the importance of one predictor can depend on whether or not we have adjusted for another. This is also true if the predictors are factors.
This means that we need to look at multiple ANOVA tables in order to perform all the possible tests (with particular patterns of adjustment).
However, for orthogonal factorial designs it turns out that the issue of adjustment is of no concern.

23.1 What is an Orthogonal Factorial Design?

The pattern of factor levels (i.e. the number of observations in the different level groups of the different factors), when chosen by an experimenter, is called the factorial design.
In essence, the pattern of factor levels (the factorial design if this was chosen by an experimenter) is orthogonal if the sum of squares attributable to one factor is the same whether or not the other factor has been included in the model.
In terms of ANOVA tables, this means that SSA will be the same whether A appears on the first or second line of the ANOVA table.
A balanced and complete factorial design is orthogonal. We will see what these two terms mean.

23.2 What is a balanced and complete factorial design?

A factorial design is complete if observations are made at every possible combination of factor levels, or treatment.
For example, if factor A has 3 levels, and factor B has 4 levels, then a complete design requires that we observe responses at each of the possible \(3\times4 = 12\) treatments.
A factorial design is balanced if the same number of experimental units are observed at each treatment. In other words, n_ij = r is the (constant) number of replications, i.e. the number of observations in each treatment.
Balance and completeness need to be achieved by design (they will not usually be the case in an observational study or survey).

23.3 Consequences of an Orthogonal Factorial Design

In a two-way orthogonal design:

The P-values for each factor in the ANOVA table remain precisely the same irrespective of the order in which the factors are listed.
In considering the importance of factor B it does not matter whether or not we have adjusted for A (and vice versa).

The idea of orthogonality can be extended to three or more factor models.

23.4 A Designed Experiment for Dairy Cattle

An experiment was performed to investigate butterfat content of milk (the response variable, measured as a percentage). The factors are:

Cow breed, with five levels: Ayrshire, Canadian, Guernsey, Holstein-Fresian, Jersey.
Cow age, with two levels: mature, and 2 years old.

Some cows

10 replicates (cows) observed at each treatment (i.e. combination of breed and maturity).
The design is complete and balanced, so therefore orthogonal.

23.4.1 Analysis of Dairy Cattle Data

Download cows.csv

## Cows <- read.csv(file = "cows.csv", header = TRUE)
head(Cows)

  X Butterfat    Breed    Age
1 1      3.74 Ayrshire Mature
2 2      4.01 Ayrshire  2year
3 3      3.77 Ayrshire Mature
4 4      3.78 Ayrshire  2year
5 5      4.10 Ayrshire Mature
6 6      4.06 Ayrshire  2year

Cows.lm.1 <- lm(Butterfat ~ Breed + Age, data = Cows)
anova(Cows.lm.1)

Analysis of Variance Table

Response: Butterfat
          Df Sum Sq Mean Sq F value Pr(>F)    
Breed      4 34.321  8.5803 50.1150 <2e-16 ***
Age        1  0.274  0.2735  1.5976 0.2094    
Residuals 94 16.094  0.1712                   
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Cows.lm.2 <- lm(Butterfat ~ Age + Breed, data = Cows)
anova(Cows.lm.2)

Analysis of Variance Table

Response: Butterfat
          Df Sum Sq Mean Sq F value Pr(>F)    
Age        1  0.274  0.2735  1.5976 0.2094    
Breed      4 34.321  8.5803 50.1150 <2e-16 ***
Residuals 94 16.094  0.1712                   
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

The figures in the ANOVA tables for models Cows.lm.1 and Cows.lm.2 are identical, despite the difference in order in which the factors are considered. This occurs because of the orthogonal design.

There is overwhelming evidence of a breed effect (P-value smaller than \(2 \times 10^{-16}\)) on mean butterfat content.

There is no evidence of an age effect.

23.4.2 Summary Table

summary(Cows.lm.1)$coefficients

                      Estimate Std. Error   t value     Pr(>|t|)
(Intercept)             4.0077 0.10135464 39.541356 2.234024e-60
BreedCanadian           0.3785 0.13084828  2.892663 4.746491e-03
BreedGuernsey           0.8900 0.13084828  6.801771 9.480645e-10
BreedHolstein-Fresian  -0.3905 0.13084828 -2.984372 3.621196e-03
BreedJersey             1.2325 0.13084828  9.419306 3.155879e-15
AgeMature               0.1046 0.08275572  1.263961 2.093694e-01

summary(Cows.lm.2)$coefficients

                      Estimate Std. Error   t value     Pr(>|t|)
(Intercept)             4.0077 0.10135464 39.541356 2.234024e-60
AgeMature               0.1046 0.08275572  1.263961 2.093694e-01
BreedCanadian           0.3785 0.13084828  2.892663 4.746491e-03
BreedGuernsey           0.8900 0.13084828  6.801771 9.480645e-10
BreedHolstein-Fresian  -0.3905 0.13084828 -2.984372 3.621196e-03
BreedJersey             1.2325 0.13084828  9.419306 3.155879e-15

Note that by default, level one of Age (2 years old) and level one of Breed, Ayrshire, are set as the reference levels for the treatment constraint.

The Jerseys seem to provide the highest butterfat concentration.

We should look at model diagnostics.

23.5 Rats – Not Another Task!

This task concerns a complete and balanced experiment into rat weight gain.

Two factors:

Protein: either beef or cereal;
Amount: either low or high.

We have ten replicates at each treatment.

The rats again

The following ANOVA table (with certain elements obscure by #) was obtained using R.

	Df	Sum Sq	Mean Sq	F value	Pr(>F)
Amount	1	1299.6	#	#	0.026
Protein	1	220.9	#	#	0.345
Residuals	37	8933.0	#

Calculate the obscured values.
What can you conclude (if anything) about the effect of Protein ignoring the effects of Amount?

23.6 Mathematical formulation of factorial models

As we move towards more complex factorial models, with more than two factors and interactions, we need to start using the mathematical formulation of these models as they are more concise.

23.6.1 One-way ANOVA model

Remember that the one-way model \[Y_i = \mu + \alpha_2 z_{i2} + \ldots + \alpha_K z_{iK} + \varepsilon_i~~~~~(i=1,2,\ldots,n)\]

can be written as \[\boldsymbol{Y_{ij} = \mu + \alpha_i + \varepsilon_{ij}}\] where

\(Y_{ij}\) is the response of the \(j\)th unit at the \(i\)th level of the factor \((i=1,\ldots,K;~j=1,\ldots,n_i)\);
\(K\) denotes the number of levels, and \(n_i\) the number of observations at level \(i\) of the factor.
Assume treatment constraint \(\alpha_1=0\).
\(\varepsilon_{11}, \varepsilon_{12}, \ldots, \varepsilon_{Kn_K}\) are random errors satisfying assumptions A1 to A4.

23.6.2 Two-way (main effects) ANOVA model

A two-way model with factors A and B in multiple regression form \[Y_i = \mu + \alpha_2 z_{Ai2} + \ldots + \alpha_K z_{AiK} + \beta_2 z_{Bi2} + \ldots + \beta_L z_{BiL} + \varepsilon_i~~~~~(i=1,2,\ldots,n)\] becomes: \[\boldsymbol{Y_{ijk} = \mu + \alpha_i + \beta_j + \varepsilon_{ijk}}\]

where \(Y_{ijk}\) is the response for the \(k\)th unit at level \(i\) of factor A and level \(j\) of factor B (\(i=1,\ldots,K;~j=1,\ldots,L;~k=1,\ldots,n_{ij}\)).

\(\alpha_1, \ldots, \alpha_K\) and \(\beta_1, \ldots, \beta_L\) are parameters describing the ‘main effects’ of A and B respectively.
Assume treatment constraints, \(\alpha_1 = 0\) and \(\beta_1 = 0\).
\(\varepsilon_{111}, \varepsilon_{112}, \ldots, \varepsilon_{KLn_{KL}}\), are error terms satisfying assumptions A1–A4.