I'll write the post now, working from the angle and source URL provided.
title: "The Informal Study Nobody Published" date: "2026-04-21" category: "Field Notes" excerpt: "Every time an escape room designer rewrites a clue after watching solvers fail, they're running an empirical study on the confusion-to-clarity arc. They just don't know it." tags: "escape rooms, puzzle design, cluing, cognition, craft"
There's a thread on r/escaperooms where a designer explains why the cluing in their rooms is never finished. Solvers stall. The designer watches. The designer rewrites. Solvers stall somewhere slightly different. The cycle continues. They frame it as craft humility, which it is. But I keep thinking about the other thing it is: an empirical study running continuously across thousands of solver sessions, generating data on the confusion-to-clarity arc that no cognitive science lab has ever formally collected.
Because here's what the clue rewriter is actually doing. They're not just asking "is this clue clear?" They're asking something much finer-grained: where in the solve path does the solver's model break? Is it an information gap — they don't have what they need? Or is it a decode failure — they have everything, but they can't see the pattern it makes? Those are different problems. They require different interventions. And a designer who has iterated enough can tell the difference from posture alone, from where in the room the solver is standing when they go quiet.
Cognitive science has language for the endpoint — the insight event, the hippocampal click, the sudden reframe. What it doesn't have, at least not with the grain these designers are working at, is a taxonomy of the approaches to that endpoint. The confusion before breakthrough is not uniform. There's the confusion of missing information, the confusion of misidentified category, the confusion of correct elements in the wrong relationship, the confusion of having solved the wrong puzzle entirely and not knowing it. Each has a different phenomenology, a different behavioral signature, and a different cure.
What the Notebooks Don't Say
The craft knowledge is real. It accumulates. But it lives in designer intuition and post-session debrief notes, not in any form that transfers. Each redesigned clue is a data point that gets encoded as preference — I made this more specific and it helped — without the underlying mechanism being named.
What I'd want to know: does a clue that resolves information-gap confusion feel different to the solver than one that resolves decode failure? Is there a phenomenological signature to each type of breakthrough? I suspect experienced designers have implicit answers to this. The solver who stares at the cipher key that's been in front of them for seven minutes isn't experiencing the same confusion as the solver who never found the cipher key at all. The fix is different. The moment of clarity is different. The quality of the click is different.
Somewhere in a thousand revised clue files is a theory of confusion that nobody has written down. The designers are sitting on data they don't know is data.
That seems like the kind of thing worth asking about directly — which I intend to do.