1.4 Factors and Groups
You often see terms like between-subject and within-subject What are they and how do they relate to experiment design?
One IV vs DV
Subjects are measured on DV and belongs to one of two groups as denoted by IV level value. In this example, IV, let’s say treatment
, has two levels: control and experiment. If there are more than 2 levels (e.g. subject belongs to one of three levels control, experiment 1, or experiment 2), then such experiment would still fall under “one IV” case.
In a table representation, the cell values are generally the average \(\overline{y}\) of the measured DV (where \(y\) is the DV). The subscripts distinguish which subject (\(i\)) and groups (\(j\)) the variable \(y_{ij}\) refers to.
- For example, for an experiment with \(n_1\) number of subjects in
Group 1
(\(j = 1\)) and \(n_2\) number of subjects inGroup 2
(\(j = 2\)),
there are \(\langle y_{1,1}, y_{2,1}, y_{3,1}, ..., y_{n_1, 1}\rangle\) measurements associated with subjects inGroup 1
, and
there are \(\langle y_{1,2}, y_{2,2}, y_{3,2}, ..., y_{n_2, 2}\rangle\) measurements associated with subjects inGroup 2
.
When it is obvious that first subscript (subject index) and second subscript (group index) is obviously distinguishable, the commas between the indices are omitted. When we aggregate over the subjects (as we are doing with averages), we often put a dot \(\bullet\) in place of the \(i\) as a placeholder.
Control | Experiment | |
---|---|---|
DV | \(\overline{y}_{\bullet 1}\) | \(\overline{y}_{\bullet 2}\) |
You can also be more explicit and say \(\overline{y}_{\bullet,Control}\), \(\overline{y}_{\bullet,Teatment}\) instead of \(\overline{y}_{\bullet 1}\), \(\overline{y}_{\bullet 2}\), but writing out numbered index instead of full level name is more practical if it is unambiguous that what 1 and 2 refers to. The dot (\(\bullet\)) is also sometimes omitted if it is obvious to do so and if it is acceptable in certain field of study.
Two IVs with two levels vs DV
When we add another IV, we can apply the same logic as 1 IV case, except the rows in the table now represent IV levels. This is called factorial design.
Control | Experiment | ||
---|---|---|---|
IV2 (Sex) | Male | \(\overline{y}_{11}\) | \(\overline{y}_{12}\) |
Female | \(\overline{y}_{21}\) | \(\overline{y}_{22}\) |
With 2 IVs, there are three research questions (one for each IV and their interaction):
Does IV1 have an effect on the DV? (Not accounting for any other variables, such as IV2)
For example if DV is student’s GPA and IV1 is traditional vs novel teaching method, does the teaching method effect GPA?Does IV2 have an effect on the DV? (Not accounting for any other variables, such as IV1).
For example if DV is student’s GPA and IV2 is student’s sex, is student sex related to differences in GPA? Note that we do not use the wordeffect
here because student’s sex is demographic information about a student that researcher did not manipulate. Such variable is called ex post facto.
Such ex post facto research design and is considered to be a quasi-experimental design. This variable is measuring characteristic or trait about the subject that already existed prior to researcher’s intervention (Treatment
), and using groups from such variable to compare the differences in DV is “after the fact” research.Does effect IV1 have on DV depend on the value IV2? Conversely, does effect IV2 have on DV depend on the value of IV1?
Factors, Groups, and Between-Subject Design vs Within-Subject Design
Factor happens to be a synomym for IV when it takes a fixed number of values (levels). This term has a slightly deeper meaning, but let’s just simplify things and say factor variable is a variable that one of the possible categorical values.
We saw that IV with 2 levels can split our subjects into two groups. If the IV had 3 or mor possible levels, then we can obviously split into 3 or more groups. In experimental design. the number of levels do not matter as much as how many IVs we have.
Between-Subject Design: Experimental treatments are given to different groups of subject – i.e. no subject can receive more than one treatment at the same time.
In Teacher Rating Case Study, there is 1 IV with 2 conditions (instructor review) and 1 DV (professor rating). Students (subjects) were split into one of the two groups by IV / factor. Comparing the mean DV of one group to the mean DV of another group is effectively a comparison between (group of subjects in one condtion) vs (group of subjects in another condition).
Two conditions only vary by whether or not IV determined which group the subjects were assigned to. If there were no differences among subjects between groups, then we can interpret the IV directly effecting mean difference. If for some reason one group had particularly generous students who rated professors highly regardless, then we don’t know if the mean difference in professor rating is due to the effect of instructor review condition (IV) or because of generous students. If we randomly assigned which condition a student was exposed to, then we can try to limit accidentally putting all the generous students into one group. However, this does not guarantee that there will not be any differences between groups. There could be another factor unknown to the researcher that is creating differences between groups.Multi-factor Between-Subject Design: same thing as Between-Subject Design with multiple IVs.
Within-Subject Design: Unlike Between-Subject Design where a subject is exposed to only one of the levels for a factor (or unique combination of factors in Multi-factor Between Subject Design), each subject is exposed to all levels of a factor. This is done via applying each factor level to each subject in order. Obviously, this design is not possible for all types of factors. Sometimes once a treatment is applied, there’s no reasonable way of “removing” the treatment. Time between each treatment also comes into play here.
The order of application of factor level can also become confounding, so we counterbalance by applying factor levels in different order for different subjects. When we consider all the subjects, the number of times a specific order was applied should be same for all possible orders.
Because each subject is measured multiple times after different factor levels are applied, this design is often called “repeated measures” design. Within-Subject Design has an advantage that individual differences in subjects’ overall levels of performance are controlled. Meaning the effect of individual subject differences on DV are removed because each subject essentially serves as its own control (e.g. each subject received both control and treatment level. The difference in DV taken after each treatment can be attributed to the most recent factor, ignoring the baseline of other subjects).