ML4CO-Bench-101 ATSP instance, scale 50, #1782
Subject outcomes
- ML4CO-SL-solverscore 0.988
ML Engineering & Research
ML4CO-Bench-101: a modular benchmark of neural combinatorial-optimization solvers under a paradigm-model-learning taxonomy, over 7 graph CO problems (TSP, ATSP, CVRP, MIS, MCl, MVC, MCut). This build ingests the released per-instance neural-solver outputs (ML4CO/ML4CO-Bench-101-SL, ml4co_result/) and computes the achieved objective value per instance, with the reference (optimal/near-optimal) objective as correct_answer.
Response matrix
Each row is an AI model and each column an item, ordered so the strongest models and easiest items gather toward one corner. 1 subjects × 35,578 items, 100% of cells evaluated.
Fit to width. Hover for subject & item; click a cell for details.

Scale: Achieved objective value per instance — tour cost (TSP/ATSP/CVRP), set size (MIS/MVC/MCl), or cut value (MCut); each combinatorial problem is shown on its own scale.
Sample items
A spread of items across the difficulty range. This benchmark does not publish per-answer traces, so each item shows which subjects succeeded.
ML4CO-Bench-101 ATSP instance, scale 50, #1782
Subject outcomes
ML4CO-Bench-101 TSP instance, scale 100, #662
Subject outcomes
ML4CO-Bench-101 CVRP instance, scale 50, #9278
Subject outcomes
ML4CO-Bench-101 CVRP instance, scale 100, #6429
Subject outcomes
ML4CO-Bench-101 CVRP instance, scale 100, #6151
Subject outcomes
ML4CO-Bench-101 MCut instance, scale ba-giant, #5
Subject outcomes
Subjects
1 subjects, ranked by mean response across this benchmark's items.