Eval report — step-050000

  • entries evaluated: 74
  • full-parse exact match: 0.5270
  • mean token confidence: 0.9745

Per-component F1

tag                 precision  recall  f1      support
country             0.0000     0.0000  0.0000  6
region              0.8500     0.8095  0.8293  63
locality            0.6875     0.6111  0.6471  72
dependent_locality  0.0000     0.0000  0.0000  1
postcode            0.8730     0.8462  0.8594  65
subregion           0.0000     0.0000  0.0000  0
cedex               0.0000     0.0000  0.0000  1
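
The f1 column can be cross-checked against the precision and recall columns. The sketch below assumes f1 is the usual harmonic mean of precision and recall, and that an undefined score (precision and recall both zero) is reported as 0, which is consistent with the all-zero rows. The function name `f1_score` and the `ROWS` dictionary are illustrative and not part of the eval tooling.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall.

    Assumption: an undefined score (both inputs zero) is mapped to 0.0,
    matching how the all-zero rows appear in the report.
    """
    if precision + recall == 0.0:
        return 0.0
    return 2.0 * precision * recall / (precision + recall)


# (precision, recall, reported f1) copied from the per-component table.
ROWS = {
    "region": (0.8500, 0.8095, 0.8293),
    "locality": (0.6875, 0.6111, 0.6471),
    "postcode": (0.8730, 0.8462, 0.8594),
    "country": (0.0000, 0.0000, 0.0000),
}

for tag, (p, r, reported) in ROWS.items():
    print(f"{tag:<10} computed={f1_score(p, r):.4f} reported={reported:.4f}")
```

Because the inputs are already rounded to four decimals, a small tolerance is appropriate when comparing computed and reported values.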

Calibration (confidence bucket → accuracy)

bucket   n     accuracy
0.0–0.1  0     0.0000
0.1–0.2  0     0.0000
0.2–0.3  0     0.0000
0.3–0.4  5     0.2000
0.4–0.5  9     0.4444
0.5–0.6  20    0.4000
0.6–0.7  8     0.5000
0.7–0.8  19    0.3684
0.8–0.9  25    0.4000
0.9–1.0  1114  0.8824
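
One way to condense the calibration table into a single number is an expected calibration error (ECE): the count-weighted average gap between each bucket's accuracy and its confidence. The report does not list per-bucket mean confidence, so the sketch below substitutes each bucket's midpoint. That is an assumption, and the result is only a rough approximation (the reported mean token confidence of 0.9745 suggests the top bucket's true mean lies above its 0.95 midpoint). The names `BUCKETS` and `approx_ece` are illustrative.

```python
# (lower, upper, n, accuracy) copied from the calibration table.
BUCKETS = [
    (0.0, 0.1, 0, 0.0000),
    (0.1, 0.2, 0, 0.0000),
    (0.2, 0.3, 0, 0.0000),
    (0.3, 0.4, 5, 0.2000),
    (0.4, 0.5, 9, 0.4444),
    (0.5, 0.6, 20, 0.4000),
    (0.6, 0.7, 8, 0.5000),
    (0.7, 0.8, 19, 0.3684),
    (0.8, 0.9, 25, 0.4000),
    (0.9, 1.0, 1114, 0.8824),
]


def approx_ece(buckets):
    """Count-weighted mean of |accuracy - confidence| over non-empty buckets.

    Assumption: the bucket midpoint stands in for the bucket's mean
    confidence, which the report does not provide.
    """
    total = sum(n for _, _, n, _ in buckets)
    if total == 0:
        return 0.0
    weighted_gap = 0.0
    for lower, upper, n, accuracy in buckets:
        if n == 0:
            continue  # empty buckets contribute nothing
        midpoint = (lower + upper) / 2.0
        weighted_gap += n * abs(accuracy - midpoint)
    return weighted_gap / total


print(f"approximate ECE (midpoint proxy): {approx_ece(BUCKETS):.4f}")
```

If per-item confidences are available, replacing the midpoint with the actual mean confidence of each bucket gives the standard ECE rather than this proxy.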