Behavioral & Survey Research

What do customers truly want, and how would we know?

Years ago I ran an experiment: people read about crime in a city described either as a beast preying on it or a virus infecting it, then chose how to respond. The wording shifted their policy choices. But "shifted" is a causal claim, and causal claims are hard. This walks through how you actually earn one.

Inspired by Thibodeau & Boroditsky (2011), "Metaphors We Think With," PLoS ONE. Numbers here are illustrative, the method is the point.

1The fundamental problem, you only see one outcome

Every person has two potential outcomes: what they'd choose under "beast" and what they'd choose under "virus." Causation is the difference between them. The trouble: each person reads only one version, so one of those two is forever unobserved. Reveal the hidden column to see the counterfactual we never get in real life.

Each person was randomly shown one frame. The colored cell is what we observed; the other is what they would have done under the other frame, invisible in reality. Causal inference is the science of estimating that missing column for a whole population.

2Why you can't just compare the groups you find

Suppose you skipped the experiment and compared people who naturally talk about crime as a beast vs. a virus. Political leaning drives both which metaphor someone favors and how punitive they are, so the difference you measure is part metaphor, part politics. That's confounding. Slide how lopsided the groups are by party.

Party imbalance between the groups

Conservatives Liberals Dashed = true causal effect (+18 pts)

3Randomization, the move that earns the claim

Assign the frame by a coin flip and the confounder loses its grip: liberals and conservatives end up split roughly evenly across both arms, so the two groups are alike in every way except the metaphor. Now any difference in their choices is caused by the wording. Randomize the sample and watch the party mix balance out, and steady as the sample grows.

Sample size

Conservatives Liberals

4The effect, and how sure we are

With a clean randomized comparison, the estimate is just the difference in the share choosing enforcement: beast minus virus. But an estimate without uncertainty is a guess in a lab coat. The 95% confidence interval shows the range consistent with the data, and it tightens as the sample grows. The red line is "no effect"; the gray marker is the partisan gap, for scale.

Sample size

Work together on behavioral & survey research

What a typical engagement looks like, end to end.

What you get

A survey or experiment instrument that measures what you think it measures
A sampling and fielding plan
Clean analysis with the caveats stated honestly
An insights report you can act on

How it works

Free 30-minute call to scope the question
Instrument and study design
Fielding and data collection
Analysis and an insights readout

Book Call →