Skip to content

Time-Sync Fault Injection Protocol

Last updated: 2026-05-09

Purpose

This protocol validates that perception, localization, SLAM, map lookup, and safety monitors remain safe when the vehicle timing fabric degrades. The target faults are PTP grandmaster loss, PHC drift, sensor timestamp mode errors, GNSS time faults, host clock steps, ROS time jumps, and network asymmetry. The goal is not to prove perfect synchronization. The goal is to prove that timing faults are detected, bounded, surfaced to runtime monitors, and prevented from becoming silent wrong-pose or stale-obstacle failures.

This protocol extends the perception-SLAM corruption and fault injection protocol, feeds the timestamp shift sweep protocol, sensor dropout, latency, and jitter stress protocol, SLAM integrity under timing errors, and the SLAM timing health dashboard.

Fault Model

FaultInjection methodSafety concern
PTP grandmaster lossRemove grandmaster, change BMCA priority, block event/general messagesSensors free-run with increasing skew
PTP grandmaster failoverSwitch to backup grandmaster with known offset or different time traceabilitySmall offset step masked as normal operation
PHC driftDisable or bias phc2sys, apply controlled frequency offsetHost and NIC/sensor time diverge slowly
Clock stepStep system time, PHC, or simulated ROS /clock by signed offsetFuture/past transforms, invalid fusion windows
Clock slew/rampApply gradual offset ramp over seconds to minutesHarder-to-detect localization bias
Path asymmetryAdd one-way delay/asymmetry on PTP path or sensor networkPTP reports lock while absolute sensor skew grows
Packet lossDrop PTP sync/follow-up/delay request packetsServo instability and intermittent lock
GNSS time degradationRemove PPS/NMEA, bias GNSS time, force receiver holdoverIMU/GNSS timestamps remain plausible but wrong
Sensor timestamp fallbackForce LiDAR/IMU/camera from PTP/GNSS time to internal oscillator or host receive timeMixed timestamp epochs across modalities
ROS time misuseReplay with /clock disabled/enabled incorrectly or with non-monotonic clockOffline replay passes while runtime semantics fail

Required Setup

ItemRequirement
Isolated timing domainDedicated test network or HIL bench where PTP/GNSS faults cannot affect production systems
Reference clockIndependent reference, such as PPS-disciplined logger, second GNSS receiver, or calibrated time interval counter
Candidate buildFrozen software hash, runtime parameters, QoS profiles, time-sync monitor thresholds
Sensor manifestSensor serials, firmware, timestamp mode, PTP profile, clock source, PPS wiring, driver config
Clock telemetryptp4l, phc2sys, pmc, NIC PHC, system clock, /clock, and sensor diagnostics captured at 1 Hz or faster
Ground truthPose and object truth sufficient to measure timing-induced localization and fusion errors
Safety monitorsRuntime timing monitor, localization integrity monitor, stale-data watchdog, and controlled-stop path enabled
Evidence captureRosbag2/MCAP plus host logs, PTP logs, driver diagnostics, and fault injector timeline

Test Phases

PhaseEnvironmentGoal
T0 static auditConfig and manifestsConfirm every sensor has declared timestamp source, clock domain, and expected epoch
T1 bench timingNetwork timing benchMeasure PTP/PHC monitor response without motion or autonomy risk
T2 offline replayRosbag2/MCAP faulted replayVerify timestamp manipulation, /clock, TF, and fusion behavior deterministically
T3 HIL/SILVehicle compute with sensor/network emulatorExercise drivers, middleware, queues, and watchdogs under realistic load
T4 closed courseLow-speed physical testConfirm detection and safe response under controlled motion
T5 shadow watchSupervised operational routesConfirm production telemetry and alert rates without injecting unsafe faults

Injection Matrix

IDFaultSeverity sweepRequired observation
TS-FI-01PTP grandmaster disconnect1 s, 5 s, 30 s, 5 min holdoverMonitor transitions from locked to holdover/fault; SLAM degrades or holds safely
TS-FI-02PTP grandmaster failover0 ms, 1 ms, 5 ms, 10 ms, 25 ms offset between grandmastersFailover event logged with grandmaster identity and offset estimate
TS-FI-03PHC-to-system drift1, 5, 10, 50 ppm equivalent driftHost receive time and sensor stamp skew alert before fusion becomes unsafe
TS-FI-04Step offset+/-1 ms, +/-5 ms, +/-10 ms, +/-25 ms, +/-50 msFuture/past data is rejected or degraded, not silently fused
TS-FI-05Ramp offset1 ms/min to release threshold, both signsResidual and integrity monitors trend before pose error crosses alert limit
TS-FI-06PTP path asymmetry0.5 ms to release threshold one-way delayAbsolute reference detects skew even if PTP servo appears locked
TS-FI-07Packet loss/jitter1, 5, 10, 25 percent packet loss plus burst lossServo state, offset, and path delay alerts correlate with degradation
TS-FI-08GNSS/PPS dropout10 s, 60 s, 10 min outageGNSS/IMU reports holdover status; localization confidence changes
TS-FI-09Sensor timestamp fallbackPer sensor: LiDAR, camera, radar, IMU/GNSSMixed time-source state is release-blocking unless explicitly supported
TS-FI-10ROS /clock jumpForward/backward jump during replay and HIL simReplay, TF, and message filters fail closed with diagnosable drops

Severity values are starting points. The release plan must set site-specific limits from speed, braking distance, sensor frame rate, fusion window, and localization alert limits.

Metrics

MetricDefinitionUse
PTP state dwell timeTime in listening, slave, master, fault, holdover, or uncalibrated stateDetects unstable or wrong BMCA behavior
Grandmaster identity changesCount and timestamp of GM changesRequired for failover root cause
PHC offsetNIC/sensor PHC offset to selected master or referencePrimary sync health signal
System-to-PHC offsetphc2sys measured offset and frequency correctionDetects host clock divergence
Path delay and asymmetry proxyPTP delay plus external one-way delay reference where availableDetects locked-but-wrong timing
Sensor stamp ageHost receive time minus message header.stampDetects stale/future data and fallback modes
Inter-sensor skewTimestamp difference for physical same-time eventsDirect fusion risk indicator
TF extrapolation failuresFuture/past transform lookup failures by frame pairReveals replay and runtime time contract violations
Message filter dropsDropped, late, out-of-cache, and no-transform messagesMeasures fail-closed behavior in fusion input gates
Localization timing sensitivityPose error, covariance, scan-match residual, innovation, and availability under each faultShows whether time fault becomes wrong pose
Safety action latencyTime from injected timing fault to alert, degraded mode, stop, or route holdMust fit safety budget
Evidence completenessRequired PTP, sensor, ROS, and localization logs presentBlocks unverifiable pass claims

Pass and Block Gates

GatePass conditionBlock condition
TS0 provenanceSensor timestamp source, PTP profile, clock domain, and expected epoch are declared for every sensorUnknown or mixed timestamp modes in release candidate
TS1 nominal lockPTP/PHC offset, path delay, and sensor stamp age remain inside approved envelope in nominal runOffset envelope exceeded without alert
TS2 detectionInjected loss, step, ramp, and fallback faults produce timing health alerts within detection budgetFault remains invisible to diagnostics or fleet telemetry
TS3 fail closedFusion rejects, buffers, degrades, or stops when data is stale, future-dated, or from an invalid clock sourceStale/future data is consumed as current safety input
TS4 localization integrityPose error remains within alert/protection limit, or integrity monitor trips before violationHazardously misleading pose or false nominal covariance
TS5 replay parityOffline replay fault reproduces HIL fault direction and monitor outcomeReplay semantics hide the runtime failure mode
TS6 recoveryAfter fault removal, clocks relock and localization recovers without unexplained map jumpRecovery produces pose discontinuity, wrong map frame, or unlogged relocalization
TS7 evidenceFault timeline, clock telemetry, raw logs, derived metrics, and monitor actions are retainedMissing evidence for any release-critical run

Operational Response

Alert levelTrigger patternRequired response
AdvisoryMinor offset trend, one missed PTP interval, or brief sensor stamp-age warningContinue at nominal behavior, raise fleet watch, attach evidence to session
DegradedSustained offset, PHC drift, packet loss, or sensor fallback inside controllable envelopeReduce speed, widen fusion uncertainty, avoid map publication from session
StopInvalid time source, backwards/large clock jump, stale safety-critical modality, or integrity alertControlled stop or remote-assist handoff; preserve pre/post event logs
QuarantineTiming fault affects map-building, calibration, or release evidenceQuarantine map tile/log partition; exclude from release metrics until reviewed
MaintenanceRepeated PTP unlock, GNSS/PPS fault, sensor oscillator drift, or cable/NIC faultRemove vehicle or sensor rig from service pending timing health check

Evidence Artifacts

ArtifactContents
Timing manifestPTP profile, grandmaster ID, sensor timestamp modes, PHC/system clock mapping
Fault scheduleInjected fault type, severity, start/end time, injector version, random seed if applicable
Clock telemetry bundleptp4l, phc2sys, pmc, NIC PHC, sensor diagnostics, /clock, host receive stamps
Fusion and localization reportTF/message-filter drops, pose error, covariance, residuals, availability, recovery behavior
Safety action reportAlert, degraded mode, stop, remote-assist, route hold, map quarantine timing
Release dispositionPass, pass with ODD restriction, inconclusive, or block with defect IDs

Owner Handoffs

OwnerResponsibility
V&V timing leadFault matrix, severity sweep, pass/block decision
Platform/runtime ownerPTP, PHC, ROS time, diagnostics, and watchdog implementation
Perception/SLAM ownerFusion behavior, localization residuals, map impact analysis
Sensor ownerTimestamp mode, firmware, PPS/PTP wiring, driver diagnostics
Fleet operationsOperational response, maintenance workflow, dashboard acknowledgement
Safety leadTiming alert limits, residual risk acceptance, ODD restrictions

Sources

Public research notes collected from public sources.