Routed SSM · MQAR (all runs)
Generalized, auto-discovering MQAR training-curve view: every routed and dense run on one page,
one shared filter (router × difficulty × seed × variant) driving all charts,
and mean±band aggregation across seeds.
hard top-k routed ~0.91 vs dense ~0.77 (11 seeds)
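
The shared filter and the mean±band aggregation described above could be sketched roughly as follows. This is a minimal illustration, not the view's actual code: the run record layout, the helper names `select` and `mean_band`, and the choice of ±1 standard deviation for the band are all assumptions, and the curve values are made-up toy numbers, not results from these runs.

```python
import numpy as np

def select(runs, **filters):
    """Keep runs whose fields match every given filter (hypothetical record layout)."""
    return [r for r in runs if all(r.get(k) == v for k, v in filters.items())]

def mean_band(curves):
    """Aggregate per-seed curves into a mean and a band.

    Assumption: the band is mean ± 1 population standard deviation;
    the real view may use a different band (e.g. min/max or a CI).
    """
    arr = np.asarray(curves, dtype=float)  # shape: (n_seeds, n_steps)
    mean = arr.mean(axis=0)
    std = arr.std(axis=0)
    return mean, mean - std, mean + std

# Toy records for illustration only (not real training curves).
runs = [
    {"router": "topk", "difficulty": "hard", "seed": 0, "variant": "routed",
     "curve": [0.2, 0.6, 0.9]},
    {"router": "topk", "difficulty": "hard", "seed": 1, "variant": "routed",
     "curve": [0.4, 0.8, 0.9]},
    {"router": "none", "difficulty": "hard", "seed": 0, "variant": "dense",
     "curve": [0.1, 0.5, 0.7]},
]

# One filter selection feeds the aggregation for a chart.
picked = select(runs, variant="routed", difficulty="hard")
mean, lo, hi = mean_band([r["curve"] for r in picked])
```

Applying the same `select` call before every chart is one way a single filter could drive all charts; the band is then recomputed over whichever seeds remain selected.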