Repeated measures and correlated data
Repeated measurements can be obtained in many situations. A simple example is a pre-post design to assess an intervention effect. With respect to conditions, e.g., treatments A, B, and C, different treatments may be applied to the same group of subjects according to a designated schedule to compare their effects. With respect to time, we may measure children's heights repeatedly to describe their growth pattern. In dentistry, we measure the periodontal pocket depth of a tooth repeatedly at different surfaces: buccal, lingual, mesial, and distal. The common characteristic of these repeated measurements is that data points obtained from the same object (person, tooth, etc.) are correlated. The paired t-test is a method for analyzing correlated samples with two time points or occasions. For three or more time points or repeated conditions, we may use the repeated measures ANOVA, which is the counterpart of the one-way ANOVA for independent samples. Accordingly, a continuous outcome variable and categorical independent variables are the basic requirements.
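For the two-occasion case, a minimal sketch of a paired t-test in Python is shown below; the scores and variable names are invented for illustration only.

```python
# Minimal sketch (hypothetical data): a paired t-test for two correlated occasions.
from scipy import stats

pre  = [68, 72, 75, 70, 74, 69]   # scores before the intervention (invented)
post = [71, 74, 79, 73, 77, 70]   # scores after the intervention (invented)

# ttest_rel pairs the observations by subject, so the within-subject
# correlation is taken into account.
t, p = stats.ttest_rel(post, pre)
print(f"paired t = {t:.3f}, p = {p:.4f}")
```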
Comparison of one-way repeated measures ANOVA and classical one-way ANOVA
Table 1 shows hypothetical data with repeated observations, P1, P2, and P3, which may represent scores measured under different conditions or at different time points. If we disregard the correlated structure of the three scores, the data can be analyzed using the one-way ANOVA. The classical one-way ANOVA decomposes the total variation of the scores (SST, total sum of squares) into between-group variation (SSB, SS by conditions in Table 2) and within-group variation (SSW, SS residual in Table 2). Part of the total variation is explained by the SSB of the different conditions (P1, P2, and P3), and the remainder is a substantial amount of unexplained within-group variation (Table 2). In the one-way repeated measures ANOVA, on the other hand, SSW is further divided into SS by subject and unexplained SS (SSE, error sum of squares), which reveals a significant difference among conditions after accounting for subject effects, as shown in Table 3. In the classical ANOVA, the conditions do not differ significantly (p = 0.1503), while the repeated measures ANOVA yields a significant difference (p < 0.0001).
p < 0.0001). The variance proportion explained in ANOVA is used as an 'effect size' which represents the degree how the analysis is effective in explaining the variance of outcome variable. In result of the repeated measures ANOVA, 'the partial eta squared' is calculated by dividing the variance explained with the condition (SS by condition) by the total variance (SS by condition +SS residual), excluding variance by subjects. In
Table 3, the partial eta squared is calculated as 0.69.
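The contrast between the two analyses can be sketched in Python with statsmodels; the scores, subject identifiers, and column names below are invented for illustration and are not the values in Table 1.

```python
# Sketch (hypothetical data): classical one-way ANOVA vs. repeated measures ANOVA.
import pandas as pd
from scipy import stats
from statsmodels.stats.anova import AnovaRM

# Long-form data: one row per subject x condition.
long = pd.DataFrame({
    "subject":   [1, 2, 3, 4, 5] * 3,
    "condition": ["P1"] * 5 + ["P2"] * 5 + ["P3"] * 5,
    "score":     [10, 12, 9, 11, 13,      # P1 (invented)
                  11, 13, 10, 12, 14,     # P2
                  16, 18, 14, 17, 19],    # P3
})

# Classical one-way ANOVA: ignores that the same subjects appear in every
# condition, so all within-group variation goes into the error term.
groups = [g["score"].values for _, g in long.groupby("condition")]
print(stats.f_oneway(*groups))

# One-way repeated measures ANOVA: removes subject-to-subject variation from
# the error term before testing the condition effect.
res = AnovaRM(long, depvar="score", subject="subject", within=["condition"]).fit()
print(res.anova_table)

# Partial eta squared = SS_condition / (SS_condition + SS_residual); from the
# AnovaRM output it can be recovered using F and the degrees of freedom.
F, df1, df2 = res.anova_table.loc["condition", ["F Value", "Num DF", "Den DF"]]
partial_eta_sq = (F * df1) / (F * df1 + df2)
print(f"partial eta squared = {partial_eta_sq:.2f}")
```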
Multivariate and univariate tests
The multivariate tests analyze the multiple variables (P1, P2, P3) as one observation, using 'wide-form' data in which one person has one record containing P1 to P3. The multivariate approach requires a sufficient number of subjects (e.g., more than 30) because the number of persons is the number of units of information. Also, a record with any missing value is deleted entirely, so even a small number of missing values may lead to a substantial loss of information. When the sample size is large, Pillai's trace and Wilks' lambda have similar power and show similar results (please refer to [g] of the SPSS analysis results) [1].
The univariate tests analyze 'long-form' data in which one person has multiple records, one per occasion, e.g., three records per person in this example. The univariate tests have more power because of the increased number of records, and they have a relative advantage in handling missing values because only the missing occasions are deleted. When there are many missing values, only the univariate approach works. However, an additional assumption, 'sphericity', is required for the univariate tests to give reasonable results.
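The difference between the two layouts, and its consequence for missing values, can be illustrated with pandas; the values and column names below are hypothetical.

```python
# Sketch (hypothetical data): wide-form vs. long-form layouts.
import numpy as np
import pandas as pd

# Wide form: one record per person; the multivariate approach drops the whole
# record if any of P1-P3 is missing.
wide = pd.DataFrame({
    "subject": [1, 2, 3, 4],
    "P1": [10, 12, 9, 11],
    "P2": [11, 13, np.nan, 12],   # subject 3 missed the second occasion
    "P3": [16, 18, 14, 17],
})
print(wide.dropna())              # listwise deletion: subject 3 disappears entirely

# Long form: one record per person per occasion; only the missing occasion is lost.
long = wide.melt(id_vars="subject", var_name="occasion", value_name="score")
print(long.dropna())              # subject 3 still contributes P1 and P3
```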
Sphericity assumption of univariate tests
The univariate tests assume a specific correlation structure among the repeated measures, the so-called 'sphericity' condition. Sphericity refers to the condition in which the variances of the differences between all possible pairs of groups (i.e., levels of the independent variable, the 'conditions' here) are equal.
Table 3 shows the result based on the sphericity assumption. However, the variances of the differences between pairs of conditions, P2 - P1, P3 - P1, and P3 - P2, are 11.9, 16.0, and 1.9, respectively, which differ considerably (shown on the right side of Table 1). Mauchly's test, which evaluates the sphericity condition, showed that the assumption of sphericity was not satisfied (p = 0.025, [h] of the SPSS results), so an additional remedy is needed. The remedy is to adjust the degrees of freedom of the numerator and denominator by multiplying them by an adjustment factor, ε (epsilon). Two kinds of epsilon, the Greenhouse-Geisser and Huynh-Feldt epsilons ([h] of the SPSS results), may be applied to the test of within-subject effects ([i] of the SPSS results). If the epsilon is close to one, meaning the sphericity assumption is met, no adjustment is needed. A common guideline is based on the Greenhouse-Geisser epsilon, with a cutoff of 0.75: if it is larger than 0.75, apply the Huynh-Feldt epsilon; otherwise, apply the Greenhouse-Geisser epsilon [2].
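For readers who want to reproduce these diagnostics outside SPSS, the following is a minimal numpy sketch, with invented scores, of the pairwise-difference variances and the Greenhouse-Geisser epsilon computed from the double-centered covariance matrix of the repeated measures.

```python
# Sketch (hypothetical data): sphericity diagnostics for k = 3 repeated measures.
import numpy as np

# Rows = subjects, columns = P1, P2, P3 (invented scores, not Table 1).
X = np.array([[10, 11, 16],
              [12, 13, 18],
              [ 9, 10, 14],
              [11, 12, 17],
              [13, 14, 19]], dtype=float)
k = X.shape[1]

# Sphericity requires these variances of pairwise differences to be (roughly) equal.
print("var(P2-P1) =", np.var(X[:, 1] - X[:, 0], ddof=1))
print("var(P3-P1) =", np.var(X[:, 2] - X[:, 0], ddof=1))
print("var(P3-P2) =", np.var(X[:, 2] - X[:, 1], ddof=1))

# Greenhouse-Geisser epsilon: double-center the sample covariance matrix S,
# then epsilon = trace(S*)^2 / ((k-1) * sum(S*_ij^2)).  Epsilon equals 1 under
# perfect sphericity and approaches 1/(k-1) as the violation gets worse.
S = np.cov(X, rowvar=False)
C = np.eye(k) - np.ones((k, k)) / k
S_dc = C @ S @ C
eps_gg = np.trace(S_dc) ** 2 / ((k - 1) * np.sum(S_dc ** 2))
print("Greenhouse-Geisser epsilon =", round(eps_gg, 3))
```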
In this example, because the Greenhouse-Geisser epsilon was 0.586, we interpret the corrected results in [i] of the SPSS results in the row headed Greenhouse-Geisser, which applies the epsilon to the degrees of freedom of 'repeat 1' and the error term (p = 0.003). The pairwise comparison of the main effects of P1, P2, and P3, using the Bonferroni correction to control the type I error rate ([k] of the SPSS results), shows that P3 differs significantly from the other two occasions: the mean of P3 is significantly different from those of P1 (p = 0.011) and P2 (p < 0.001), while the means of P1 and P2 are similar (p = 0.585), as shown in table [k] of the SPSS output.
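The Bonferroni-corrected pairwise comparisons amount to ordinary paired t-tests whose p-values are multiplied by the number of comparisons; a sketch with invented scores follows.

```python
# Sketch (hypothetical data): Bonferroni-corrected pairwise comparisons of the occasions.
from itertools import combinations
from scipy import stats

scores = {
    "P1": [10, 12, 9, 11, 13],
    "P2": [11, 13, 10, 12, 14],
    "P3": [16, 18, 14, 17, 19],
}

pairs = list(combinations(scores, 2))
for a, b in pairs:
    t, p = stats.ttest_rel(scores[a], scores[b])
    p_bonf = min(p * len(pairs), 1.0)   # Bonferroni: multiply p by the number of tests
    print(f"{a} vs {b}: t = {t:.2f}, corrected p = {p_bonf:.4f}")
```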
Other outputs
The assessment of polynomial terms shows that assuming a linear relationship among the repeated measurements is appropriate ([j] of the SPSS results). The profile plot ([l] of the SPSS results) shows the trend of the repeated measurements. The tests of between-subjects effects traditionally provided in the repeated measures ANOVA SPSS output (not shown) compare simple averaged values among groups without accounting for the correlated structure; therefore, this result is of little importance for the analysis of correlated data.
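A profile plot is simply a line plot of the estimated marginal means across the occasions; a minimal matplotlib sketch with hypothetical means is shown below.

```python
# Sketch (hypothetical means): a simple profile plot of the condition means.
import matplotlib.pyplot as plt

means = {"P1": 11.0, "P2": 12.0, "P3": 16.8}   # invented condition means

plt.plot(list(means.keys()), list(means.values()), marker="o")
plt.xlabel("Condition")
plt.ylabel("Estimated marginal mean")
plt.title("Profile plot of the repeated measurements")
plt.show()
```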
The procedure of the one-way repeated measures ANOVA using the SPSS statistical package (SPSS Inc., Chicago, IL, USA):
REFERENCES
- 1. Park E, Cho M, Ki CS. Correct use of repeated measures analysis of variance. Korean J Lab Med 2009;29:1-9.
- 2. Field A. Repeated measures designs (Chapter 13). In: Discovering statistics using SPSS. 3rd ed. London: SAGE Publications Ltd; 2009.
Table 1. Hypothetical data with repeated observations
Table 2. Summary table for the classical one-way ANOVA (incorrect)
Table 3. Summary table for the one-way repeated measures ANOVA