Study scope: Validate OSM data (bike lanes, pedestrian streets) in 7 cities (Barcelona, Milan, Warsaw, Ljubljana, Utrecht, Malmö, Paris) for three periods (e.g. 2015, 2019, and 2023).
Reference dataset: Google Street View (GSV).
Sampling: ~60 census tracts per city (stratified: central, peripheral, high-density, low-density).
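The stratified draw above could be sketched as follows. This is only an illustration under stated assumptions: the four strata are treated as disjoint groups, the ~60 tracts are split equally (15 per stratum), and the tract identifiers are hypothetical placeholders. None of these choices is stated in the study design.

```python
import random
from collections import defaultdict

STRATA = ["central", "peripheral", "high-density", "low-density"]

def sample_tracts(tracts, per_stratum=15, seed=0):
    """Draw up to `per_stratum` tracts at random from each stratum.

    tracts: iterable of (tract_id, stratum) pairs.
    Returns a dict mapping stratum -> list of sampled tract ids.
    """
    rng = random.Random(seed)  # fixed seed so the draw is reproducible
    by_stratum = defaultdict(list)
    for tract_id, stratum in tracts:
        by_stratum[stratum].append(tract_id)

    sample = {}
    for stratum, ids in by_stratum.items():
        k = min(per_stratum, len(ids))  # a stratum may hold fewer tracts
        sample[stratum] = rng.sample(sorted(ids), k)
    return sample

# Hypothetical input: 40 candidate tracts per stratum for one city.
tracts = [(f"{s}-{i}", s) for s in STRATA for i in range(40)]
picked = sample_tracts(tracts)
total = sum(len(ids) for ids in picked.values())
print(total)
```

Equal allocation is the simplest option. Proportional allocation (more tracts from larger strata) would be an equally plausible reading of "stratified".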
Image collection: Up to 13,000 GSV images via API, randomly selected within the stratified tracts (GEMOTT).
Validation (MTurk):
5 users classify each image by answering "Yes" or "No" to:
Bike Lane,
Pedestrian Area,
Both.
At least 3 out of 5 users must agree for a classification to be accepted.
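The 3-out-of-5 agreement rule can be sketched as a simple majority check. This is a minimal illustration and not the study's actual pipeline: it assumes each worker's answers are collapsed into one label per image, and the label names (`bike_lane`, `pedestrian_area`, `both`, `none`) are hypothetical.

```python
from collections import Counter

def consensus(votes, min_agree=3):
    """Return the label chosen by at least `min_agree` workers, else None.

    votes: list of labels, one per worker (hypothetical label names).
    With 5 votes and min_agree=3, at most one label can reach the threshold.
    """
    if not votes:
        return None
    label, count = Counter(votes).most_common(1)[0]
    return label if count >= min_agree else None

# Three of five workers agree, so the label is accepted.
agreed = consensus(["bike_lane", "bike_lane", "bike_lane", "both", "none"])

# No label reaches three votes, so the image has no consensus.
split = consensus(["bike_lane", "bike_lane", "pedestrian_area",
                   "pedestrian_area", "both"])
```

Images that return `None` would need a follow-up rule (re-tasking or exclusion). The outline does not say which applies.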
Metrics:
Accuracy: Proportion of correctly mapped features (true positives).
Completeness: Proportion of real-world features present in OSM.
SCI (Spatial Completeness Index): Measures variation in completeness.
Reliability: Metrics determine which cities, intervals, and infrastructure types have reliable data (based on thresholds in prior studies).
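The accuracy and completeness definitions above can be sketched from confusion counts. The formulas are one reading of those definitions, not a confirmed specification: accuracy is taken as the share of OSM-mapped features confirmed in GSV (TP / (TP + FP)), and completeness as the share of GSV-confirmed features present in OSM (TP / (TP + FN)). SCI is omitted because the outline does not give its formula, and the counts in the example are invented for illustration.

```python
def accuracy(tp, fp):
    """Share of OSM-mapped features that the reference imagery confirms."""
    mapped = tp + fp
    return tp / mapped if mapped else float("nan")

def completeness(tp, fn):
    """Share of features seen in the reference imagery that OSM contains."""
    reference = tp + fn
    return tp / reference if reference else float("nan")

# Illustrative counts only (not study results):
# tp = mapped in OSM and confirmed in GSV
# fp = mapped in OSM but not confirmed in GSV
# fn = confirmed in GSV but missing from OSM
tp, fp, fn = 90, 10, 30
acc = accuracy(tp, fp)       # 90 / 100
comp = completeness(tp, fn)  # 90 / 120
```

Computing both values per city, period, and infrastructure type would give the table against which the reliability thresholds from prior studies can be compared.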