The result of comparing variant A against variant B on one task.
winner is one of "a", "b", or "tie". detail carries audit information, such as the per-order verdicts behind a swapped LLM judgment.
winner instance-attribute
reasoning class-attribute instance-attribute
detail class-attribute instance-attribute
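A minimal sketch of how such a result object might look and be filled in, for illustration only. The class name `PairwiseResult`, the helper `combine_swapped`, the field types, the default values, and the `detail` keys are all assumptions and do not come from the library; only the three attribute names and the "a" / "b" / "tie" values are taken from the description above. The agreement rule (a variant wins only if both presentation orders pick it) is one common way to combine swapped judgments, not necessarily the one used here.

```python
from dataclasses import dataclass, field
from typing import Any, Literal

Winner = Literal["a", "b", "tie"]


@dataclass
class PairwiseResult:  # hypothetical name, not the library's
    winner: Winner                      # required, no default
    reasoning: str = ""                 # assumed type and default
    detail: dict[str, Any] = field(default_factory=dict)  # assumed type and default


def combine_swapped(ab_verdict: Winner, ba_verdict: Winner) -> PairwiseResult:
    """Combine two verdicts from a judge run in both presentation orders.

    Both verdicts are assumed to be already mapped back to variant
    labels ("a", "b", "tie"), not to positions. If the two orders
    agree, that verdict stands; otherwise the result is a tie.
    """
    winner: Winner = ab_verdict if ab_verdict == ba_verdict else "tie"
    return PairwiseResult(
        winner=winner,
        reasoning="orders agree" if ab_verdict == ba_verdict else "orders disagree",
        # Keep the per-order verdicts so the final call can be audited.
        detail={"ab_order": ab_verdict, "ba_order": ba_verdict},
    )


result = combine_swapped("a", "b")
print(result.winner, result.detail)
```

Keeping the per-order verdicts in `detail` means a "tie" caused by disagreement between orders can be told apart from a "tie" that both orders returned.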