Skip to main content

Eval report — step-006000

  • entries evaluated: 74
  • full-parse exact match: 0.0000
  • mean token confidence: 0.8653

Per-component F1

tagprecisionrecallf1support
country0.00000.00000.00006
region0.09860.11110.104563
locality0.04230.04170.042072
dependent_locality0.00000.00000.00001
postcode0.00000.00000.000065
subregion0.00000.00000.00000
cedex0.00000.00000.00001

Calibration (confidence bucket → accuracy)

bucketnaccuracy
0.0–0.100.0000
0.1–0.200.0000
0.2–0.350.0000
0.3–0.4360.2222
0.4–0.5560.2143
0.5–0.6730.2055
0.6–0.7640.0938
0.7–0.8940.2128
0.8–0.9950.2316
0.9–1.07770.3591