Eval Matrix Report
Generated: 2026-05-25T15:43:12.991Z Golden rows: 4535
Summaryโ
| Mode | Exact Match | Macro F1 | Empty Parse | Overconf Wrong |
|---|---|---|---|---|
| rule-only | 30.8% | 22.0% | 6.3% | 2.4% |
| neural | 0.0% | 7.8% | 0.3% | 59.3% |
| hybrid | 0.0% | 7.8% | 0.3% | 59.4% |
| hybrid-joint | 10.2% | 17.0% | 0.0% | 0.2% |
rule-onlyโ
Per-component F1โ
| Tag | P | R | F1 | TP | FP | FN |
|---|---|---|---|---|---|---|
| region | 96.9% | 84.1% | 90.1% | 2697 | 86 | 508 |
| house_number | 75.5% | 92.0% | 82.9% | 1602 | 520 | 140 |
| postcode | 98.8% | 65.4% | 78.7% | 1948 | 24 | 1032 |
| street | 72.6% | 71.2% | 71.9% | 2085 | 786 | 843 |
| locality | 83.4% | 57.0% | 67.8% | 1915 | 381 | 1442 |
| venue | 38.6% | 17.7% | 24.3% | 195 | 310 | 906 |
| country | 21.0% | 25.7% | 23.1% | 63 | 237 | 182 |
| unit | 0.9% | 20.0% | 1.7% | 1 | 115 | 4 |
| street_prefix | 0.0% | 0.0% | 0.0% | 0 | 0 | 10 |
| street_suffix | 0.0% | 0.0% | 0.0% | 0 | 0 | 2 |
| po_box | 0.0% | 0.0% | 0.0% | 0 | 0 | 1 |
| unit_designator | 0.0% | 0.0% | 0.0% | 0 | 115 | 0 |
| intersection_a | 0.0% | 0.0% | 0.0% | 0 | 0 | 1 |
| intersection_b | 0.0% | 0.0% | 0.0% | 0 | 0 | 1 |
| dependent_locality | 0.0% | 0.0% | 0.0% | 0 | 0 | 40 |
| level_designator | 0.0% | 0.0% | 0.0% | 0 | 1 | 0 |
| level | 0.0% | 0.0% | 0.0% | 0 | 1 | 0 |
| street_prefix_particle | 0.0% | 0.0% | 0.0% | 0 | 0 | 6 |
| cedex | 0.0% | 0.0% | 0.0% | 0 | 0 | 1 |
| attention | 0.0% | 0.0% | 0.0% | 0 | 0 | 1 |
Calibrationโ
| Bucket | Total | Correct | Accuracy |
|---|---|---|---|
| conf > 0.9 | 946 | 835 | 88.3% |
| conf 0.7โ0.9 | 1330 | 564 | 42.4% |
| conf 0.5โ0.7 | 1199 | 0 | 0.0% |
| conf < 0.5 | 1060 | 0 | 0.0% |
Per-failure-classโ
| Class | Total | Exact Match | Rate |
|---|---|---|---|
| failure/tokenization-trap | 3 | 0 | 0.0% |
| kryptonite/place-shaped-venue | 6 | 0 | 0.0% |
| kryptonite/particle-honorific | 9 | 0 | 0.0% |
| normal | 4503 | 1389 | 30.8% |
| failure/street-locality-collision | 5 | 2 | 40.0% |
| kryptonite/place-name-venue | 10 | 5 | 50.0% |
| failure/ambiguous-locality | 4 | 3 | 75.0% |
| failure/numeric-chaos | 1 | 1 | 100.0% |
| kryptonite/disambiguation | 4 | 4 | 100.0% |
neuralโ
Per-component F1โ
| Tag | P | R | F1 | TP | FP | FN |
|---|---|---|---|---|---|---|
| region | 87.5% | 58.0% | 69.8% | 1860 | 266 | 1345 |
| locality | 45.1% | 28.0% | 34.5% | 939 | 1144 | 2418 |
| country | 26.9% | 26.9% | 26.9% | 66 | 179 | 179 |
| house_number | 11.0% | 0.5% | 1.0% | 9 | 73 | 1733 |
| postcode | 12.1% | 0.2% | 0.5% | 7 | 51 | 2973 |
| street | 12.0% | 0.1% | 0.2% | 3 | 22 | 2925 |
| street_prefix | 0.0% | 0.0% | 0.0% | 0 | 0 | 10 |
| street_suffix | 0.0% | 0.0% | 0.0% | 0 | 0 | 2 |
| po_box | 0.0% | 0.0% | 0.0% | 0 | 0 | 1 |
| unit | 0.0% | 0.0% | 0.0% | 0 | 0 | 5 |
| intersection_a | 0.0% | 0.0% | 0.0% | 0 | 0 | 1 |
| intersection_b | 0.0% | 0.0% | 0.0% | 0 | 0 | 1 |
| venue | 0.0% | 0.0% | 0.0% | 0 | 1 | 1101 |
| dependent_locality | 0.0% | 0.0% | 0.0% | 0 | 17 | 40 |
| street_prefix_particle | 0.0% | 0.0% | 0.0% | 0 | 0 | 6 |
| cedex | 0.0% | 0.0% | 0.0% | 0 | 0 | 1 |
| attention | 0.0% | 0.0% | 0.0% | 0 | 0 | 1 |
Calibrationโ
| Bucket | Total | Correct | Accuracy |
|---|---|---|---|
| conf > 0.9 | 2689 | 0 | 0.0% |
| conf:0.7-0.9 | 1145 | 0 | 0.0% |
| conf:0.5-0.7 | 561 | 0 | 0.0% |
| conf < 0.5 | 140 | 1 | 0.7% |
Per-failure-classโ
| Class | Total | Exact Match | Rate |
|---|---|---|---|
| normal | 4503 | 0 | 0.0% |
| kryptonite/place-name-venue | 10 | 0 | 0.0% |
| failure/street-locality-collision | 5 | 0 | 0.0% |
| failure/tokenization-trap | 3 | 0 | 0.0% |
| kryptonite/place-shaped-venue | 6 | 0 | 0.0% |
| kryptonite/particle-honorific | 9 | 0 | 0.0% |
| failure/numeric-chaos | 1 | 0 | 0.0% |
| failure/ambiguous-locality | 4 | 1 | 25.0% |
| kryptonite/disambiguation | 4 | 1 | 25.0% |
hybridโ
Per-component F1โ
| Tag | P | R | F1 | TP | FP | FN |
|---|---|---|---|---|---|---|
| region | 87.5% | 58.1% | 69.8% | 1861 | 266 | 1344 |
| locality | 45.2% | 28.1% | 34.6% | 942 | 1141 | 2415 |
| country | 26.9% | 26.9% | 26.9% | 66 | 179 | 179 |
| house_number | 11.5% | 0.5% | 1.0% | 9 | 69 | 1733 |
| postcode | 12.3% | 0.2% | 0.5% | 7 | 50 | 2973 |
| street | 12.0% | 0.1% | 0.2% | 3 | 22 | 2925 |
| street_prefix | 0.0% | 0.0% | 0.0% | 0 | 0 | 10 |
| street_suffix | 0.0% | 0.0% | 0.0% | 0 | 0 | 2 |
| po_box | 0.0% | 0.0% | 0.0% | 0 | 0 | 1 |
| unit | 0.0% | 0.0% | 0.0% | 0 | 0 | 5 |
| intersection_a | 0.0% | 0.0% | 0.0% | 0 | 0 | 1 |
| intersection_b | 0.0% | 0.0% | 0.0% | 0 | 0 | 1 |
| venue | 0.0% | 0.0% | 0.0% | 0 | 1 | 1101 |
| dependent_locality | 0.0% | 0.0% | 0.0% | 0 | 17 | 40 |
| street_prefix_particle | 0.0% | 0.0% | 0.0% | 0 | 0 | 6 |
| cedex | 0.0% | 0.0% | 0.0% | 0 | 0 | 1 |
| attention | 0.0% | 0.0% | 0.0% | 0 | 0 | 1 |
Calibrationโ
| Bucket | Total | Correct | Accuracy |
|---|---|---|---|
| conf > 0.9 | 2693 | 0 | 0.0% |
| conf:0.7-0.9 | 1145 | 0 | 0.0% |
| conf:0.5-0.7 | 557 | 0 | 0.0% |
| conf < 0.5 | 140 | 1 | 0.7% |
Per-failure-classโ
| Class | Total | Exact Match | Rate |
|---|---|---|---|
| normal | 4503 | 0 | 0.0% |
| kryptonite/place-name-venue | 10 | 0 | 0.0% |
| failure/street-locality-collision | 5 | 0 | 0.0% |
| failure/tokenization-trap | 3 | 0 | 0.0% |
| kryptonite/place-shaped-venue | 6 | 0 | 0.0% |
| kryptonite/particle-honorific | 9 | 0 | 0.0% |
| failure/numeric-chaos | 1 | 0 | 0.0% |
| failure/ambiguous-locality | 4 | 1 | 25.0% |
| kryptonite/disambiguation | 4 | 1 | 25.0% |
hybrid-jointโ
Per-component F1โ
| Tag | P | R | F1 | TP | FP | FN |
|---|---|---|---|---|---|---|
| house_number | 79.8% | 87.2% | 83.3% | 1519 | 384 | 223 |
| postcode | 87.3% | 70.7% | 78.1% | 2107 | 307 | 873 |
| region | 83.7% | 55.4% | 66.7% | 1775 | 345 | 1430 |
| locality | 54.5% | 57.3% | 55.9% | 1923 | 1604 | 1434 |
| country | 14.6% | 14.3% | 14.5% | 35 | 204 | 210 |
| street | 4.1% | 3.8% | 3.9% | 110 | 2597 | 2818 |
| venue | 2.8% | 2.8% | 2.8% | 31 | 1083 | 1070 |
| street_prefix | 0.0% | 0.0% | 0.0% | 0 | 0 | 10 |
| street_suffix | 0.0% | 0.0% | 0.0% | 0 | 0 | 2 |
| po_box | 0.0% | 0.0% | 0.0% | 0 | 0 | 1 |
| unit | 0.0% | 0.0% | 0.0% | 0 | 0 | 5 |
| intersection_a | 0.0% | 0.0% | 0.0% | 0 | 0 | 1 |
| intersection_b | 0.0% | 0.0% | 0.0% | 0 | 0 | 1 |
| dependent_locality | 0.0% | 0.0% | 0.0% | 0 | 11 | 40 |
| subregion | 0.0% | 0.0% | 0.0% | 0 | 1 | 0 |
| street_prefix_particle | 0.0% | 0.0% | 0.0% | 0 | 0 | 6 |
| cedex | 0.0% | 0.0% | 0.0% | 0 | 0 | 1 |
| attention | 0.0% | 0.0% | 0.0% | 0 | 0 | 1 |
Calibrationโ
| Bucket | Total | Correct | Accuracy |
|---|---|---|---|
| conf > 0.9 | 11 | 0 | 0.0% |
| conf:0.7-0.9 | 862 | 128 | 14.8% |
| conf:0.5-0.7 | 3489 | 326 | 9.3% |
| conf < 0.5 | 173 | 10 | 5.8% |
Per-failure-classโ
| Class | Total | Exact Match | Rate |
|---|---|---|---|
| kryptonite/place-name-venue | 10 | 0 | 0.0% |
| failure/street-locality-collision | 5 | 0 | 0.0% |
| failure/tokenization-trap | 3 | 0 | 0.0% |
| kryptonite/place-shaped-venue | 6 | 0 | 0.0% |
| kryptonite/particle-honorific | 9 | 0 | 0.0% |
| failure/numeric-chaos | 1 | 0 | 0.0% |
| normal | 4503 | 462 | 10.3% |
| failure/ambiguous-locality | 4 | 1 | 25.0% |
| kryptonite/disambiguation | 4 | 2 | 50.0% |