Skip to content

SceneEdited: 3D HD Map Updating Benchmark

Last updated: 2026-05-09

Why It Matters

SceneEdited moves beyond "detect a map change" and asks whether a method can update a stale 3D point-cloud map using current image evidence. That is the right benchmark shape for autonomy map operations: a stale map is only useful if the system can produce a geometrically consistent replacement or removal proposal.

For airside mapping, this matters because 2D visual discrepancy alone is not enough. A map update must preserve localization landmarks, remove obsolete geometry, add persistent new infrastructure, and avoid turning transient aircraft/GSE into permanent map points.

Dataset Snapshot

ItemSceneEditedAirside relevance
Task3D point-cloud map updating via image-guided change detectionUpdate stale 3D airport map tiles using current fleet observations
Scale800+ up-to-date scenes, 73 km driving, about 3 km^2 urban areaUseful city-scale proxy; airport aprons need separate capture
Changes23,000+ synthesized object changes across 2,000+ outdated versionsGood stress test for additions/removals but not airport-specific operations
ObjectsRoadside infrastructure, buildings, overpasses, utility polesTransfers to static infrastructure, not aircraft/GSE/FOD semantics
InputsCalibrated RGB images, LiDAR scans, change masksSimilar multi-sensor ingredients to fleet map QA
ToolkitScenePoint-ETK for editing and dataset constructionTemplate for reproducible stale/current map generation
VenueWACV 2026 accepted paperRecent benchmark; maturity should be checked before operational reuse

Benchmark Framing

StageSceneEdited framingAirside equivalent
Up-to-date mapCurrent static point-cloud sceneApproved current airport localization tile
Outdated mapEdited stale point-cloud scenePrevious tile or deliberately staled benchmark tile
Current evidenceGeo-referenced RGB images and LiDARVehicle camera/LiDAR/radar logs with pose quality
Change maskImage/3D change annotationsPixel, point, and map-element labels for changed regions
Update outputRevised 3D point cloudCandidate tile diff with additions/removals
EvaluationGeometry fidelity after updateLocalization impact, static preservation, hazard policy, reviewer burden

Practical Metrics

Metric familyWhat it measuresAirside addition
Chamfer distanceAverage bidirectional point errorReport near docking landmarks and stand markings separately
Hausdorff distanceWorst-case geometric deviationUse to catch protrusions into safety envelopes
Modified Hausdorff / median point distanceRobust error summariesUse for noisy LiDAR and wet-surface artifacts
Addition accuracyQuality of newly reconstructed geometryRequire persistence before static-map promotion
Deletion accuracyQuality of removed obsolete geometryRequire occlusion checks and human review near safety-critical features
Static preservationDamage to unchanged map pointsMust be part of airside acceptance even if not the headline metric
Localization regressionPose residual before/after updateRequired before publishing any map tile

Airside Use Cases

Use caseHow SceneEdited helpsMissing airport evidence
Construction barrier updateTests object removal/addition in 3DAirport barrier classes, cones, closures, NOTAM links
New fixed equipment installationTests insertion into stale 3D mapSponsor-approved permanence and asset IDs
Removed fixed objectTests deletion from old mapMulti-pass absence and occlusion reasoning
Repainted stand routeNeeds image-guided change maskSurface marking semantics, not just object geometry
Aircraft/GSE presentShould not become static map geometryMovable-static layer policy and object taxonomy
FOD detectionShould remain a live hazard alertSmall-object labeling and inspection protocol

Implementation Guidance

  1. Use SceneEdited as a benchmark pattern for "update the map," not only "detect a mismatch."
  2. Reproduce the stale/current/update split for airport tiles: previous approved map, current evidence, and verified target map.
  3. Separate added and deleted regions in evaluation. A method that deletes well but adds poorly is useful for map cleanup, not full maintenance.
  4. Add airside-specific static-preservation metrics near terminal edges, blast fences, docking templates, stop bars, and stand lead-in lines.
  5. Require map hygiene labels before training: permanent static, movable-static, current dynamic, FOD/hazard, artifact, and unknown/review.
  6. Do not use image-only update quality as an operational gate. Airport deployment needs LiDAR/geodetic alignment and localization regression.
  7. Treat synthetic edits as coverage generation; reserve real operational changes for final acceptance.

Source Caveats for Use

CaveatImpact
SceneEdited changes are synthesizedGood for controlled benchmarking; final airport acceptance needs real change captures
Urban objects differ from apron objectsAirport-specific taxonomy and sensors are required
Project page/repo were under active update when checkedPin dataset/toolkit versions for reproducible experiments
Benchmark focuses on 3D geometryDoes not by itself define regulatory, FOD, or route-closure map policy

Sources

Public research notes collected from public sources.