Perception and SLAM Gap Fill 2026-05-09

This report records the 2026-05-09 multi-agent web sweep and second-pass research fill for perception, SLAM, localization, mapping, first-principles foundations, and runtime reliability.

It does not replace the dedicated perception coverage audit, SLAM coverage audit, or knowledge gap backlog. Its purpose is narrower: identify the strongest remaining gaps after the existing May 2026 promotion waves and give enough source-backed detail for the next atomic-file wave.

Agent Workflow

Stage	Agents	Output
Web gap scouts	6	Non-duplicate gap candidates across perception methods, SLAM methods, benchmarks, KB foundations, runtime reliability, and neural-field/world-model boundaries.
Fill-in researchers	5	Source-backed briefs for the highest-leverage gap clusters.
Integration	1	This synthesis report and promotion queue.

The scout pass treated a topic as a stronger gap when it was absent as an atomic file, present only as an audit/watchlist row, or covered only indirectly inside broader overview pages.

Highest-Leverage Gaps

Priority	Gap Cluster	Why It Matters
P0	Resilient open-world occupancy	Occupancy is already covered, but deployable occupancy needs unknown-object scoring, missing-camera robustness, long-range sparse evidence, and open-ended label bootstrapping.
P0	Degraded-assumption SLAM	Current methods now target radar-centric odometry, foundation dense SLAM, UWB/wheel/LiDAR/IMU fusion, dynamic-aware LIO, and multi-agent neural/Gaussian mapping.
P0	Validation datasets for route repeat, drift, OOD, relocalization, and airside movement	The repo has many benchmarks, but needs stronger slice-based protocols for same-route change, continuous ODD drift, compound uncertainty, relocalization, and airport surface trajectories.
P0	Estimator foundations not yet taught directly	Observability/gauge consistency, unknown-correlation fusion, out-of-sequence measurements, and localization integrity are structural prerequisites for trustworthy perception/SLAM safety cases.
P0	Runtime telemetry and diagnostics contracts	Offline validation is not enough; deployed fleets need versioned schemas, diagnostics, localization health gates, and map QA release gates.

Resilient Open-World Occupancy

The current perception library covers camera occupancy, LiDAR-camera occupancy fusion, dynamic occupancy/free-space, Gaussian occupancy, open-world detection, OOD/anomaly benchmarks, and corruption tests. The missing layer is a focused "resilient open-world occupancy" cluster: methods that keep dense occupancy useful when objects are unknown, classes are long-tail, cameras are missing, evidence is sparse at long range, or manual labels are incomplete.

Proposed file	Method	Contribution	Deployment Risk
`30-autonomy-stack/perception/methods/proood.md`	ProOOD	Prototype-guided semantic imputation, tail-class mining, and voxel-level OOD scoring for 3D occupancy.	Prototype bias; OOD score must trigger conservative behavior, not become an uncalibrated class label.
`30-autonomy-stack/perception/methods/m2-occ.md`	M2-Occ	Semantic occupancy under incomplete multi-camera inputs through masked reconstruction and feature memory.	Missing-view reconstruction can hallucinate occupancy or freespace unless confidence is exposed.
`30-autonomy-stack/perception/methods/self-supervised-sparse-sensor-fusion-long-range.md`	Self-supervised sparse sensor fusion	Sparse multimodal temporal encoding for long-range perception from unlabeled camera-LiDAR data.	Long-range sparse features must not replace near-field dense FOD and personnel detection.
`30-autonomy-stack/perception/methods/autoocc.md`	AutoOcc	Open-ended semantic occupancy auto-labeling using vision-language guidance and Gaussian splatting.	Auto-labeling needs human QA, negative examples, and adverse-condition splits before safety use.

Airside transfer is strong because aprons and industrial yards contain non-catalog objects, unusual equipment, temporary clutter, partial camera views, long stopping distances, glare, spray, and domain-specific labels such as chocks, hoses, belt loaders, cones, covers, aircraft gear, and FOD. The runtime contract should expose unknown-voxel risk, missing-view confidence, long-range evidence quality, label provenance, and sensor-health inputs.

Useful P1 follow-ons: LOcc, GroundingOcc, GS-Occ3D, GaussianFlowOcc, and EmbodiedOcc.

Sources:

ProOOD: https://arxiv.org/abs/2604.01081, https://github.com/7uHeng/ProOOD
M2-Occ: https://arxiv.org/abs/2603.09737
Self-Supervised Sparse Sensor Fusion for Long Range Perception: https://openaccess.thecvf.com/content/ICCV2025/html/Palladin_Self-Supervised_Sparse_Sensor_Fusion_for_Long_Range_Perception_ICCV_2025_paper.html, https://arxiv.org/abs/2508.13995
AutoOcc: https://openaccess.thecvf.com/content/ICCV2025/html/Zhou_AutoOcc_Automatic_Open-Ended_Semantic_Occupancy_Annotation_via_Vision-Language_Guided_Gaussian_ICCV_2025_paper.html
GroundingOcc: https://arxiv.org/abs/2508.01197
GS-Occ3D: https://arxiv.org/abs/2507.19451
GaussianFlowOcc: https://arxiv.org/abs/2502.17288
EmbodiedOcc: https://arxiv.org/abs/2412.04380

Degraded-Assumption SLAM

The SLAM library is strong on classical LiDAR, LIO/LIVO, radar SLAM, Gaussian SLAM, collaborative PGO, map cleaning, and alternative sensors. The remaining current gap is not another classic LOAM variant; it is SLAM under degraded assumptions: sparse/weather-friendly radar, uncalibrated foundation-model dense geometry, GNSS-denied multimodal fusion, dynamic-scene LIO, and multi-agent neural/Gaussian map merging.

Proposed file	Method family	Contribution	Deployment Risk
`30-autonomy-stack/localization-mapping/slam-methods/super4dr.md`	Super4DR	4D radar-centric learned odometry and Gaussian map optimization for poor weather/lighting.	Radar sparsity, multipath, weak covariance semantics, preprint maturity.
`30-autonomy-stack/localization-mapping/slam-methods/slam3r-vggt-foundation-slam.md`	SLAM3R / VGGT-SLAM / VGGT-SLAM++	Feed-forward/foundation dense SLAM from RGB, with pointmap/submap alignment and retrieval-assisted optimization.	Camera-only fragility, learned-prior hallucination, weak runtime safety evidence.
`30-autonomy-stack/localization-mapping/slam-methods/cm-liuw-odometry.md`	CM-LIUW-Odometry	LiDAR-IMU-UWB-wheel fusion for GPS-denied, feature-poor, slippery, or tunnel-like environments.	UWB NLOS, wheel slip, lever-arm/calibration errors, mode-switch tuning.
`30-autonomy-stack/localization-mapping/slam-methods/dynamic-aware-lio-btsa.md`	Breaking the Static Assumption	Dynamic-aware LIO using spatio-temporal normal analysis inside registration.	Dynamic filtering can remove useful structure or retain moving clutter if thresholds drift.
`30-autonomy-stack/localization-mapping/slam-methods/multi-agent-neural-gaussian-slam.md`	MAGiC-SLAM / MNE-SLAM	Multi-agent dense neural/Gaussian mapping and map fusion beyond classical collaborative pose graphs.	False map merges, bandwidth limits, dense-map consistency, safety-case immaturity.

For airside autonomy, Super4DR and CM-LIUW are the strongest runtime-adjacent candidates. SLAM3R/VGGT and MAGiC/MNE are more useful for offline mapping, site digitization, inspection, relocalization candidates, and simulation map building than as immediate safety pose sources.

Sources:

Super4DR: https://arxiv.org/abs/2512.09608
SLAM3R: https://openaccess.thecvf.com/content/CVPR2025/papers/Liu_SLAM3R_Real-Time_Dense_Scene_Reconstruction_from_Monocular_RGB_Videos_CVPR_2025_paper.pdf
VGGT-SLAM: https://arxiv.org/abs/2505.12549
VGGT-SLAM++: https://arxiv.org/abs/2604.06830
CM-LIUW-Odometry: https://arxiv.org/abs/2511.01379
Breaking the Static Assumption / BTSA: https://arxiv.org/abs/2510.22313, https://github.com/thisparticle/btsa
MAGiC-SLAM: https://openaccess.thecvf.com/content/CVPR2025/html/Yugay_MAGiC-SLAM_Multi-Agent_Gaussian_Globally_Consistent__SLAM_CVPR_2025_paper.html, https://github.com/VladimirYugay/MAGiC-SLAM
MNE-SLAM: https://openaccess.thecvf.com/content/CVPR2025/papers/Deng_MNE-SLAM_Multi-Agent_Neural_SLAM_for_Mobile_Robots_CVPR_2025_paper.pdf

Benchmark and Dataset Gaps

The benchmark gap is not simply "more datasets." The missing validation pattern is slice-based evidence: same route across conditions, continuous ODD drift, compound weather+OOD uncertainty, global relocalization under appearance change, and airside surface movement forecasting.

Proposed file or update	Benchmark	Measures	Validation Use
`30-autonomy-stack/perception/datasets-benchmarks/ithaca365-repeated-weather-perception.md`	Ithaca365	Repeated traversals across weather, season, time, traffic, cameras, LiDAR, GPS/INS, masks, and boxes.	Validate "same route, different day" perception and localization degradation.
`30-autonomy-stack/perception/datasets-benchmarks/shift-continuous-domain-shift.md`	SHIFT	Continuous shifts in weather, lighting, traffic, and pedestrian density.	Plot degradation curves and ODD cliff points instead of binary clear/adverse tests.
`30-autonomy-stack/perception/datasets-benchmarks/muad-multiple-uncertainties.md`	MUAD	Weather, day/night, OOD, segmentation, depth, object and instance detection.	Test compound uncertainty: normal, OOD-only, weather-only, weather+OOD.
`30-autonomy-stack/localization-mapping/datasets-benchmarks/pit30m-global-localization.md`	Pit30M	Large-scale image/LiDAR localization with season, weather, time, traffic, occlusion metadata.	Separate relocalization metrics from odometry metrics: recall, false-place matches, map age, time to relocalize.
`70-operations-domains/airside/datasets-benchmarks/amelia-airport-surface-forecasting.md`	Amelia / Amelia-42	Airport surface movement trajectories, routing graphs, forecasting, anomaly detection.	Add airside multi-agent risk, topology transfer, route compliance, and runway-incursion-style anomaly protocols.

Secondary audit rows: CODA for corner-case objects, Waymo Open Sim Agents Challenge for stochastic multi-agent simulation, CARLA Leaderboard 2.1 for closed-loop route/infraction scoring, and RobotCar Seasons for long-term visual localization.

Sources:

Ithaca365: https://arxiv.org/abs/2208.01166
SHIFT: https://arxiv.org/abs/2206.08367
MUAD: https://muad-dataset.github.io/, https://arxiv.org/abs/2203.01437
Pit30M: https://arxiv.org/abs/2012.12437
Amelia: https://ameliacmu.github.io/, https://ameliacmu.github.io/amelia-dataset/
CODA: https://coda-dataset.github.io/
Waymo Open Sim Agents Challenge: https://waymo.com/research/the-waymo-open-sim-agents-challenge/
CARLA Leaderboard 2.1: https://leaderboard.carla.org/get_started_v2_1/
RobotCar Seasons / Visual Localization Benchmark: https://www.visuallocalization.net/datasets/

First-Principles KB Gaps

The knowledge base is strong on filters, factor graphs, covariance, information, data association, time synchronization, continuous-time trajectories, and registration math. The structural gaps below are important because they explain why a perception/SLAM system can be numerically impressive but still unsafe or overconfident.

Proposed file	Topic	What It Should Teach	Failure Mode Avoided
`10-knowledge-base/state-estimation/slam-vio-observability-fej-nullspace-consistency.md`	SLAM/VIO observability and consistency	Physical vs linearized observability, gauge modes, FEJ, nullspace projection, NEES/NIS, rank diagnostics.	Estimator invents information in unobservable directions and reports overconfident covariance.
`10-knowledge-base/state-estimation/fusion-unknown-correlations-covariance-intersection.md`	Unknown-correlation fusion	Cross-correlation, data incest, Covariance Intersection, track-to-track fusion, consistency vs conservatism.	Double-counted information in V2X, infrastructure perception, shared maps, or distributed tracking.
`10-knowledge-base/state-estimation/out-of-sequence-measurements-fixed-lag-smoothing.md`	Delayed and out-of-sequence measurements	Measurement time vs arrival time, retrodiction, fixed-lag smoothing, buffering, OOSM filters/factor graphs.	Stale camera/radar/V2X/map updates are fused as current state and corrupt tracks or ego pose.
`10-knowledge-base/state-estimation/localization-integrity-protection-levels-raim.md`	Localization integrity	Integrity vs accuracy, alert limits, protection levels, RAIM/ABAS, hazardous misleading localization.	Low average error hides unbounded risk of undetected wrong pose.

These pages require the KB visual pipeline: each new 10-knowledge-base page needs a curated visual block, SVG asset, and tools/knowledge-base/visual-taxonomy.mjs assignment before tests will pass.

Sources:

Observability-based EKF SLAM consistency: https://journals.sagepub.com/doi/pdf/10.1177/0278364909353640
VINS observability/consistency: https://intra.ece.ucr.edu/~mourikis/papers/Huang2011-IROS.pdf
Covariance Intersection: https://colab.ws/articles/10.1109%2FACC.1997.609105
Distributed fusion under unknown correlation: https://www.mdpi.com/1424-8220/17/11/2472
Dependent-error track-to-track association: https://isif.org/media/multisensor-track-track-association-tracks-dependent-errors
Stone Soup OOSM example: https://stonesoup.readthedocs.io/en/stable/auto_examples/oosm/PF_OOSM_example.html
Bayesian OOSM treatment: https://www.sciencedirect.com/science/article/pii/S156625350300037X
Nonlinear robot localization with OOSM: https://pmc.ncbi.nlm.nih.gov/articles/PMC3376572/
FAA GNSS/RAIM overview: https://www.faa.gov/about/office_org/headquarters_offices/ato/service_units/techops/navservices/gnss/
Integrity monitoring for Kalman-filter localization: https://journals.sagepub.com/doi/10.1177/0278364920960517
Multimodal perception localization integrity: https://www.mdpi.com/1424-8220/20/16/4654

Runtime Reliability and Fleet Operations

The operational gap is a versioned telemetry and diagnostics contract. Offline perception/SLAM validation does not prove the deployed fleet can compare evidence across builds, map versions, calibration packages, airports, and time. The runtime layer must turn sensor health, localization residuals, map QA, diagnostics, and safety actions into versioned, auditable telemetry.

Proposed update	Contract or Gate	Required Content
`50-cloud-fleet/data-platform/perception-slam-fleet-data-contract.md`	Versioned telemetry schema	`schema_url`, `schema_version`, signal namespace, migration ID, software build, map package, calibration, model ID, diagnostics contract version, severity, safety action, raw-log references.
`40-runtime-systems/ros-autoware/autoware-universe-deep-dive.md` or new diagnostics file	ROS/Autoware diagnostics contract	REP-107 style summary/detail split, `DiagnosticArray` naming, `diagnostic_updater` checks, Autoware diagnostics DAG, fleet severity mapping.
`60-safety-validation/runtime-assurance/online-perception-monitoring-odd-enforcement.md`	Localization health gate	Health states: localized, degraded, uncertain, relocalizing, lost, map mismatch, time-sync fault; gates on covariance, NDT score, residual drift, skipped publication, delay.
`50-cloud-fleet/map-operations/hd-map-lifecycle-operations.md`	Map QA release gate	Lanelet2 validation, Autoware/TIER IV validators, semantic diff, signed manifest, tile consistency, pointcloud alignment, cohort/canary plan, rollback target.

The AWS IoT FleetWise dependency should be treated carefully: AWS says FleetWise stops accepting new customers on April 30, 2026, and will receive no new feature development. Existing references can remain, but new schema strategy should not anchor on FleetWise alone.

Sources:

COVESA Vehicle Signal Specification: https://covesa.global/project/vehicle-signal-specification/, https://covesa.github.io/vehicle_signal_specification/introduction/overview/
Eclipse KUKSA: https://projects.eclipse.org/projects/automotive.kuksa
OpenTelemetry schemas: https://opentelemetry.io/docs/specs/otel/schemas/
AWS IoT FleetWise availability change: https://docs.aws.amazon.com/iot-fleetwise/latest/developerguide/iotfleetwise-availability-change.html
ROS REP-107 diagnostics: https://www.ros.org/reps/rep-0107.html
ROS 2 diagnostic_updater: https://docs.ros.org/en/jazzy/p/diagnostic_updater/
Autoware diagnostics API: https://autowarefoundation.github.io/autoware-documentation/latest/design/autoware-interfaces/ad-api/features/diagnostics/
Autoware NDT scan matcher diagnostics: https://autowarefoundation.github.io/autoware_core/pr-602/localization/autoware_ndt_scan_matcher/
Autoware localization interface: https://tier4.github.io/autoware-documentation/latest/design/autoware-interfaces/components/localization/
Lanelet2 validation: https://docs.ros.org/en/rolling/p/lanelet2_validation/
TIER IV Autoware Lanelet2 map validator: https://github.com/tier4/autoware_lanelet2_map_validator
Autoware.Auto Lanelet2 map requirements: https://autowarefoundation.gitlab.io/autoware.auto/AutowareAuto/lanelet2-map-for-autoware-auto.html

Promotion Queue

Use this order for the next atomic-file wave:

Create the four resilient open-world occupancy pages: ProOOD, M2-Occ, self-supervised sparse sensor fusion, and AutoOcc.
Create the three most deployment-relevant SLAM pages: Super4DR, CM-LIUW-Odometry, and BTSA dynamic-aware LIO. Add SLAM3R/VGGT as a foundation-SLAM survey page if time allows.
Add Ithaca365, SHIFT, MUAD, Pit30M, and Amelia as benchmark pages or audit rows, then connect them to validation protocols.
Add the runtime reliability updates before broadening the method library further, because diagnostics and schema contracts are needed to make method evidence operational.
Promote the four first-principles KB pages only with the visual/taxonomy pipeline in the same change.

Do Not Over-Promote Yet

Keep these as watchlist or cross-link items until source maturity, code availability, or validation relevance improves: many 2026 Gaussian occupancy variants, learned radar odometry variants without repeated external evaluation, neural/Gaussian multi-agent SLAM as runtime pose sources, diffusion occupancy completion as a safety primitive, and open-ended VLM labels without human QA or adverse-condition calibration.

SLAM Methods

Methods

Perception and SLAM Gap Fill 2026-05-09 ​

Agent Workflow ​

Highest-Leverage Gaps ​

Resilient Open-World Occupancy ​

Degraded-Assumption SLAM ​

Benchmark and Dataset Gaps ​

First-Principles KB Gaps ​

Runtime Reliability and Fleet Operations ​

Promotion Queue ​

Do Not Over-Promote Yet ​

Perception and SLAM Gap Fill 2026-05-09

Agent Workflow

Highest-Leverage Gaps

Resilient Open-World Occupancy

Degraded-Assumption SLAM

Benchmark and Dataset Gaps

First-Principles KB Gaps

Runtime Reliability and Fleet Operations

Promotion Queue

Do Not Over-Promote Yet