Corpus Spillover
detected 2026-03-12
trigger
""This person can build a tech company in a cave with a box of scraps." Iron Man (2008) reference in professional CV advice. The corpus is leaking. "
what it is
A high-frequency cultural reference, idiom, or meme from the training corpus surfaces in a context where it does not belong. The reference is not chosen by the model for rhetorical effect - it is emitted because the token sequence has high probability in the neighbourhood of the surrounding text. Pop culture leaks into professional advice. Reddit idioms leak into technical documentation. The tell is that the reference feels borrowed rather than earned - it doesn't arise from the argument, it falls out of the distribution. Related to epigrammatic closure (high-probability short sequences) but operates at the cultural-reference level rather than the syntactic level.
what it signals
instead
Delete the reference and say the thing directly. "This candidate ships a large volume of work across multiple domains." The reference added entertainment, not information. If you want entertainment, earn the metaphor from the material.
refs
- Cross-model CV feedback session 2026-03-12
- Iron Man (2008) reference in hiring manager persona output
← all patterns