Bertozzi, Claudio (2025) Matching and Conflation of Open Government Data with OpenStreetMap Data. Masters thesis, OST Ostschweizer Fachhochschule.
Abstract
This thesis addresses the absence of a reliable, openly licensed, repeatable workflow for conflating authoritative points of interest (POI) from Swiss Open Government Data (OGD) sources with heterogeneous OpenStreetMap (OSM) data, a gap that increases duplication, staleness, and manual curation cost. The objective was to design and evaluate an auditable end-to-end pipeline (DiffedPlaces) that ingests brand and retailer feeds from ATP together with contemporaneous OSM extracts, generates spatially and semantically blocked candidate pairs for Switzerland (Oct 2024–Aug 2025), and resolves matches via a tunable rule-based scorer and a supervised machine learning (ML) classifier (RandomForestClassifier). Two golden datasets underpinned the evaluation: a brand-focused Aldi Süd Switzerland subset (246 outlets) and a stratified multi-category random sample (200 POI). The evolved ML matcher achieved a precision of 1.0000 and a recall of 0.9957 (F1 0.9978) on the brand subset, and improved F1 on the heterogeneous sample while substantially lowering false positives compared with the tuned rule-based approach. The resulting workflow delivers reproducible, high-precision conflation, reduces audit workload, and provides a transferable governance template for integrating additional authoritative OGD feeds into OSM with transparent quality controls.
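The following is a minimal sketch of what spatial and semantic blocking followed by a rule-based score can look like. It is an illustration only, not the DiffedPlaces implementation. The grid cell size, the field names (`name`, `category`, `lat`, `lon`), the weights, the distance cutoff, and the sample records are all assumptions made for this example, and the supervised classifier stage is left out. The `f1` helper only checks that the reported F1 follows from the reported precision and recall.

```python
import math
from collections import defaultdict
from difflib import SequenceMatcher

CELL_DEG = 0.01  # hypothetical grid cell size in degrees (roughly 1 km in latitude)


def cell(lat, lon):
    """Map a coordinate to a coarse grid cell that serves as the spatial blocking key."""
    return (math.floor(lat / CELL_DEG), math.floor(lon / CELL_DEG))


def candidate_pairs(source_pois, osm_pois):
    """Yield (source, osm) pairs that share a category and lie in the same or an adjacent cell."""
    index = defaultdict(list)
    for poi in osm_pois:
        index[cell(poi["lat"], poi["lon"])].append(poi)
    for src in source_pois:
        cx, cy = cell(src["lat"], src["lon"])
        for dx in (-1, 0, 1):
            for dy in (-1, 0, 1):
                for osm in index.get((cx + dx, cy + dy), ()):
                    if osm["category"] == src["category"]:  # semantic block
                        yield src, osm


def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in metres between two WGS84 coordinates."""
    r = 6371000.0
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    return 2 * r * math.asin(math.sqrt(a))


def rule_score(src, osm, max_dist_m=200.0, w_name=0.6, w_dist=0.4):
    """Weighted mix of name similarity and proximity; weights and cutoff are assumed values."""
    name_sim = SequenceMatcher(None, src["name"].lower(), osm["name"].lower()).ratio()
    dist = haversine_m(src["lat"], src["lon"], osm["lat"], osm["lon"])
    dist_sim = max(0.0, 1.0 - dist / max_dist_m)
    return w_name * name_sim + w_dist * dist_sim


def f1(precision, recall):
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


# Made-up sample records, for illustration only.
source = [{"name": "ALDI SUISSE", "category": "supermarket", "lat": 47.3769, "lon": 8.5417}]
osm = [
    {"name": "Aldi Suisse", "category": "supermarket", "lat": 47.3770, "lon": 8.5418},
    {"name": "Example Bakery", "category": "bakery", "lat": 47.3769, "lon": 8.5417},
    {"name": "Aldi Suisse", "category": "supermarket", "lat": 46.2044, "lon": 6.1432},
]

pairs = list(candidate_pairs(source, osm))
scores = [rule_score(s, o) for s, o in pairs]
reported_f1 = round(f1(1.0000, 0.9957), 4)
```

In this sketch, blocking discards the bakery (different category) and the distant outlet (different grid neighbourhood) before any scoring happens, so only one candidate pair reaches the scorer. A real pipeline would tune the threshold on the score, or feed such similarity features to a classifier, against a golden dataset.
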
| Item Type: | Thesis (Masters) |
|---|---|
| Subjects: | Topics > Internet Technologies and Applications > Monitoring; Topics > Internet Technologies and Applications > Internet of Things (IoT); Area of Application > GIS > OpenStreetMap; Technologies > Databases; Metatags > IFS (Institute for Software) |
| Divisions: | Master of Science in Engineering (MRU Software and Systems) |
| Depositing User: | Stud. I |
| Date Deposited: | 10 Sep 2025 13:18 |
| Last Modified: | 06 Nov 2025 09:51 |
| URI: | https://eprints.ost.ch/id/eprint/1327 |
