Grouping

Lesson

Elijah Galvan

September 1, 2023

13 min read

Goal During this Stage

To examine our model predictions in an efficient way and to potentially generate theoretical, sampling invariant, theoretically valid a priori groupings. Essentially, this a valid, unbiased way to classify subjects’ behavior in your task without making any arbitrary assertions about who falls into which group.

How to Achieve this Goal

Note

This section will differ from the previous sections: rather than provide conceptual examples about how to implement it, we’ll primarily talk about the reasons to use this approach and what it can be used for. The implementation examples will be in reference to the first tutorial.

Choosing an Approach to Grouping

Canonically, there have been 2 approaches to grouping: clustering and binning.

Binning is simply the researcher asserting that it is the case that groups 1 and 2 are differentiable on X or Y: the grouping is only as valid as the researcher’s reasoning. Further, this approach is theoretically only valid when specified a priori (in a preregistration).

On the other hand, Clustering is an empirical, data-driven approach which provides post-hoc explanations. Essentially, whichever Clustering Algorithm you use searches for the best solution to the problem you offer it. Thus, the groups are determined by the observed data and can obviously be biased therein.

Here, we clearly don’t have any real data: we want to cluster the simulated model predictions. This is the most ideal situation: since we can simulate as much data as we want and therefore exhaustively represent the variance in expected behavior. Here, we also group based on the model predictions (i.e. our hypotheses) - which means that our clustered groupings are a logical extension of our psychological theory in the context of our Experimental Paradigm and Trial Set. This enables us to overcome the limitations of both clustering (post-hoc, sampling dependency, atheoretical) and binning (arbitrariness, overreliance on reasoning, etc.).

However, the validity of a priori clustering requires that our model be highly generalizable: all Constructs must have the same value on the same Trial for each subject and there are no Free Parameters in your model which do not translate to psychologically meaningful differences.. So, for instance, this might preclude using binary choice tasks which often require Free Parameters to model response bias parameters (preference for left-versus-right) and inverse heat parameters (probability of behaving preference-congruent) for example. It also requires that all Constructs are directly computed from Experimental Variables and not self-report measures for instance. In this study, the experimenters asked what the participant thought the Investor expected for all trials: although this would be a theoretically superior way to mathematically calculate Guilt, using the a priori clustering to group subjects would be conceptually problematic. In these cases, it is always good to make sure that the generalized Constructs is highly correlated with the questionnaire measure and that using either value leads to the same behavioral conclusions - not just taking for granted that these are distinctions without differences.

Grouping

Lesson

Goal During this Stage

How to Achieve this Goal

Tutorials

Tutorial 1 - van Baar, Chang, & Sanfey, 2019

Tutorial 2 - Galvan & Sanfey, 2024

Tutorial 3 - Crockett et al., 2014

Tutorial 4 - Li et al., 2022