WildDash 2 Benchmark

View Visualizations: Algorithm Results / Validation Ex. (1) / Validation Ex. (2)

For all metrics, higher scores are better. To participate in the benchmark, check our submission instructions.

	Meta AVG	Classic				Negative	Impact (PQ)
Algorithm	PQ	PQ	RQ	SQ	Cat. PQ	PQ	Blur	Coverage	Distortion	Hood	Occ.	Overexp.	Particles	Screen	Underexp.	Var.
seamseg_rvcsubset	27.8%	32.4%	42.6%	37.0%	53.3%	15.1%	-21%	-23%	0%	-5%	-6%	-15%	-22%	-39%	-24%	-13%
seamseg_mvd_ps	27.4%	32.1%	42.1%	36.9%	53.4%	13.4%	-19%	-18%	0%	-9%	-5%	-11%	-42%	-39%	-28%	-12%
EffPS_b1bs4_RVC	24.2%	28.2%	36.4%	31.8%	54.4%	11.2%	-9%	-9%	-6%	-7%	-4%	-7%	-29%	-33%	-20%	-18%

Cached July 30, 2025, 5:45 p.m. UTC+0

Click here for the extended metrics table

Methodology:
Our benchmark evaluates the negative Impact of common visual hazards on algorithm output performance. It is calculated by this formula:
impact = min(metric_low,metric_high) / max(metric_none,metric_low) - 1.0
The metrics_{none/low/high} are evaluated on subsets of the benchmark dataset that correspond to the identified severity of the hazard (e.g. the subset Blur_high contains images which have a lot of blur visible). Positive impacts are truncated to zero.
An impact of -10% at Blur translates to an expected performance degradation for the algorithm of 10 percent when there is a considerable blur in the input image as opposed to supplying the same algorithm a similar image without noticeable image blur.
These are all currently evaluated hazards:
Blur: Image is noticeably affected by blur (e.g. motion blur, defocusing, compression artifacts...)
Coverage: Normally visible parts of the road are covered (e.g. unusual lane markings, snow, leaves...)
Distortion: Visible lens distortion
Hood: Ego-vehicle is visible, non-windscreen parts (e.g. car hood, mirrors)
Occl: Objects are partially occluded or cut off by image border
Overexp.: The scene is overexposed
Particle: Particles in the air obstruct the view (e.g. heavy rain, snow, fog)
Screen: The windscreen is interfering (e.g. interior reflections, wipers, rain on the windscreen,...)
Underexp.: The image is underexposed
Variation: Intra-class variations within the image (i.e. unusual representations of labels like unique cars)
More details on evaluation metrics and negative test cases can also be found on the FAQ page.