STEP classification dataset: cifar100
Subject outcomes
- QKFormerscore 0.947
- SGLFormerscore 0.944
- Spikformerscore 0.774
ML Engineering & Research
STEP: a unified Spiking Transformer evaluation platform reproducing and fairly comparing representative spiking transformers under one training pipeline across image and sequential image classification, plus module-wise ablations.
Response matrix
Each row is an AI model and each column an item, ordered so the strongest models and easiest items gather toward one corner. 8 subjects × 8 items, 53% of cells evaluated.
Fit to width. Hover for subject & item; click a cell for details.

Scale: 0 to 1 (per task): accuracy (classification) or mIoU / mAP (segmentation, detection) for spiking transformers; percentages ÷ 100.
Sample items
A spread of items across the difficulty range. This benchmark does not publish per-answer traces, so each item shows which subjects succeeded.
STEP classification dataset: cifar100
Subject outcomes
STEP encoding ablation scheme: rate
Subject outcomes
STEP classification dataset: scifar
Subject outcomes
STEP encoding ablation scheme: phase
Subject outcomes
STEP classification dataset: psmnist
Subject outcomes
STEP classification dataset: smnist
Subject outcomes
Subjects
8 subjects, ranked by mean response across this benchmark's items.