Why composition breaks reasoning. As a session draws on more scientific domains (order k = 1 → 4), accuracy falls non-linearly — combining disciplines compounds, rather than adds.
Video tutorial · about 5 minutes · narrated & subtitled
A guided tour of XDomainBench
The problem, the idea, and the evidence — animated from the paper's own analysis. Why interdisciplinary reasoning collapses as scientific domains combine, and the two mechanisms behind it.
What the tutorial covers
1
2
A controllable design. Composition order and mixture structure, over 8,598 sessions, 20 domains, and realistic difficulty / mixture trajectories — reasoning made measurable.
3
Two mechanisms of collapse. A direct integrative load at the very first turn, and indirect, trajectory-amplified failures — error accumulation, reasoning breaks, and domain confusion.