Skip to content

Runtime Monitor Qualification Evidence

Last updated: 2026-05-09

Purpose

Runtime monitors are safety controls only when they are specified, validated, integrated, observed, and maintained. This page defines the evidence needed to qualify monitors that supervise perception, SLAM/localization, timing, calibration, free-space, unknown/OOD objects, maps, and ODD boundaries.

Qualification Levels

LevelMeaningRelease use
MQ0Diagnostic only; logs a signal but has no validated actionCannot support a safety claim
MQ1Thresholded alert with owner and dashboardSupports operations watch only
MQ2Validated alert with replay/fault-injection evidence and runbookSupports canary gates
MQ3On-vehicle action integrated and tested under nominal/fault casesSupports safety control claim
MQ4Field performance monitored with periodic requalification and change controlSupports steady-state production claim

Evidence Model

EvidenceMinimum contents
Monitor requirementHazard, claim, signal, threshold, ODD scope, consumer action
Algorithm specInputs, timing, filtering, state machine, latch/reset, assumptions, known limits
Data validityRequired schemas, units, frames, time source, missing-data handling
Threshold derivationCalibration partition, locked test partition, confidence interval, slice analysis
Fault injectionInjected fault, expected detection, detection latency, false negative result
Integration proofPlanner/runtime/vehicle action test for yellow, red, unknown, missing monitor
Resource budgetCPU/GPU/memory/latency and interference analysis
False alarm analysisRate by ODD slice, suppression policy, operator workload
Field monitoringPrecision/recall proxy, alert outcomes, missed-event review, drift
Change recordVersion, thresholds, schema, evidence IDs, approval

Monitor Classes

MonitorSafety purposeQualification focus
Timing monitorPrevent stale/future/misaligned sensor dataTimestamp sweep, PTP/PHC failure, TF/message-filter reject behavior
Localization integrityPrevent wrong-pose operationResidual-to-error calibration, covariance/protection-level coverage, relocalization loss
Free-space monitorPrevent false traversable spaceProtected-zone false-free-space tests, unknown conservatism
Unknown/OOD monitorPrevent novel object suppressionOpen-world/OOD benchmarks, closed-course unknown fixtures, actionability
Calibration monitorDetect extrinsic/time driftMiscalibration injection, scene degeneracy handling, maintenance workflow
Map disagreement monitorCatch stale/wrong map assumptionsMap tile disagreement, overlay expiry, field ticket closure
Model-runtime monitorPrevent overloaded or incompatible inferenceEngine compatibility, p99 latency, GPU memory, model readiness
ODD monitorKeep operation inside validated domainWeather/site/routing boundary detection and enforced restriction

Qualification Gates

GatePass conditionBlock condition
Q0 traceMonitor maps to hazard, claim, requirement, and actionMonitor exists only as dashboard panel
Q1 input validityInputs include frame/time/provenance/schema and missing-data policyMissing signal can be interpreted as green
Q2 thresholdThreshold derived from independent data and locked before testThreshold tuned on incident or test set without record
Q3 detectionInjected credible faults are detected before unsafe consumer outputFault remains silent until after planner consumes unsafe state
Q4 actionVehicle/planner/runtime response occurs for yellow/red/unknownAlert fires but no safety action occurs where claimed
Q5 robustnessFalse alarm and false negative behavior reviewed by ODD sliceAggregate metric hides high-risk route or weather slice
Q6 observabilityAlert joins evidence IDs and reaches ownerCannot reconstruct monitor decision
Q7 change controlMonitor version and threshold changes go through release processField threshold edited without safety review

Fault-Injection Patterns

FaultExpected monitor behavior
Sensor timestamp offsetTiming state yellow/red before fusion accepts stale/future data
LiDAR-camera yaw perturbationCalibration state red or unknown-object/free-space conservatism increases
LiDAR beam dropoutSensor/perception uncertainty rises and false-free-space remains blocked
Map tile mismatchMap mismatch alert and route hold
Pose covariance under-reportingIntegrity monitor catches residual/protection-level inconsistency
Unknown object in routeOOD/unknown monitor produces obstacle/stop/review action
Model engine mismatchRuntime monitor prevents activation or marks model unavailable

Field Performance Review

CadenceReview item
Every releaseMonitor versions, thresholds, evidence freshness, changed consumers
Weekly during canaryAlert counts, unresolved P0/P1, suppression list, false alarm burden
Monthly steady stateMissed events, incident linkage, drift trends, maintenance correlation
After incidentWhether monitor fired, fired late, action failed, or evidence was insufficient
After ODD/site changeRequalification for affected slices and assumptions

Evidence Artifacts

ArtifactStorage target
Monitor qualification reportSafety evidence store
Threshold manifestRelease artifact registry
Fault-injection logsData lake with bag/MCAP pointers
Monitor-to-action integration testCI/HIL report
Alert/runbook closure recordsIncident system
Field performance dashboard exportFleet SRE evidence
Safety-case trace updateEvidence graph
  • 60-safety-validation/runtime-assurance/runtime-verification-monitoring.md
  • 60-safety-validation/runtime-assurance/online-perception-monitoring-odd-enforcement.md
  • 60-safety-validation/runtime-assurance/simplex-safety-architecture.md
  • 60-safety-validation/verification-validation/sensor-dropout-latency-jitter-stress-protocol.md
  • 60-safety-validation/verification-validation/robustness/perception-slam-corruption-fault-injection-protocol.md
  • 50-cloud-fleet/observability/perception-slam-alert-runbooks.md

Sources

Public research notes collected from public sources.