Skip to content

Airside Scenario Taxonomy and Edge Case Catalog for Autonomous Ground Vehicles

A systematic classification of operational scenarios, hazards, and edge cases for autonomous vehicles operating on airport airside surfaces. Adapted from ISO 34502 (road vehicle scenario-based safety evaluation) and aligned with SOTIF (ISO 21448) for the unique constraints of apron, taxiway, and service road environments. Intended to support safety case development, test planning, and certification evidence for ISO 3691-4 and anticipated FAA Advisory Circulars.


Table of Contents

  1. Introduction
  2. Scenario Classification Framework
  3. Operational Phase Taxonomy
  4. Environmental Conditions Matrix
  5. Actor Taxonomy
  6. Hazard Catalog (SOTIF-Aligned)
  7. Edge Cases and Long-Tail Scenarios
  8. Sensor Coverage Analysis per Scenario
  9. Scenario Frequency and Risk Matrix
  10. Testing Strategy per Scenario
  11. Regulatory Mapping
  12. References

1. Introduction

1.1 Why a Formal Scenario Taxonomy

Airport ramp operations produce approximately 27,000 accidents and incidents annually worldwide, resulting in an estimated 243,000 injuries and at least USD 10 billion in costs (Flight Safety Foundation GAP, IATA Ground Ops Safety). Ground Support Equipment is responsible for 61% of aircraft ground damage (IATA Ground Damage Database), and aircraft-GSE collisions cost the industry USD 5 billion per year -- a figure projected to double to USD 10 billion by 2035 unless preventive action is taken (IATA 2022).

An autonomous vehicle entering this environment must demonstrate safety performance that exceeds the human baseline across every plausible scenario. A scenario taxonomy provides the systematic foundation for three critical activities:

  1. Safety case construction. ISO 3691-4 (Clause 5) requires verification of safety functions against an identified hazard list. A comprehensive scenario catalog ensures that the hazard list covers all reasonably foreseeable situations, including rare-but-severe edge cases that fall outside routine operating experience.

  2. Test planning and coverage measurement. Without a structured scenario space, testing is ad hoc and coverage gaps are invisible. The taxonomy defines what must be tested, how many variations exist within each scenario class, and what constitutes sufficient evidence of safe behavior.

  3. Certification evidence. Anticipated regulatory frameworks -- FAA Advisory Circular (predicted ~2028-2029), EASA AMC (~2028), and ISO/SAE autonomous GSE standards (~2029-2030) -- will require evidence that the AV has been evaluated against a defined set of scenarios. Establishing the taxonomy now positions a manufacturer to provide this evidence on the timeline regulators expect.

1.2 Scope

This taxonomy covers autonomous vehicles operating on airport airside surfaces including aprons, service roads, and designated taxiway crossings. It does not cover runway operations (autonomous vehicles should never operate on active runways) or public-road transit between airports.

Vehicle types in scope: autonomous baggage tractors/dolly tugs (reference airside AV stack autonomous baggage/cargo tug, UISEE tractors, EasyMile/TLD TractEasy EZTow), autonomous cargo transporters, and autonomous tow tractors. The taxonomy is extensible to autonomous pushback tugs, autonomous refueling vehicles, and autonomous personnel movers if those vehicle types enter development.

1.3 Relationship to Other Documents

DocumentRelationship
../safety-case/failure-modes-analysis.mdFailure taxonomy for perception, world model, and planning subsystems
../../70-operations-domains/airside/safety/ground-crew-pedestrian-safety.mdDetailed analysis of personnel hazard H1
../standards-certification/iso-3691-4-deep-dive.mdStandard requirements mapped against this taxonomy
../safety-case/safety-incidents-lessons.mdReal-world AV incidents informing edge case identification
../runtime-assurance/simplex-safety-architecture.mdDual-stack architecture providing hazard mitigation
../../70-operations-domains/airside/operations/fod-and-jetblast.mdDetection methods for hazards H4 and H5
../../70-operations-domains/airside/operations/turnaround-prediction.mdTurnaround phase model underlying Phase 3 scenarios

2. Scenario Classification Framework

2.1 ISO 34502 Adaptation for Airside

ISO 34502:2022 defines a three-tier scenario abstraction hierarchy for road vehicles. We adapt this hierarchy for the airside operational design domain (ODD):

Functional Scenario (natural language, qualitative)
  "AV transits from depot to apron in light rain at night"

    ↓ parameterize

Logical Scenario (parameter ranges, distributions)
  "AV at 10-15 km/h on service road, rain rate 2-10 mm/hr,
   ambient light 0.5-5 lux, 0-3 oncoming GSE vehicles,
   surface friction coefficient 0.5-0.7"

    ↓ instantiate

Concrete Scenario (specific values, executable)
  "AV at 12.3 km/h on service road R7, rain rate 4.2 mm/hr,
   ambient light 1.8 lux, 2 oncoming belt loaders at positions
   (x1,y1) and (x2,y2), friction 0.62"

This mirrors ISO 34502's "recognize-judge-operate" decomposition but replaces road-specific elements (lanes, traffic signals, highway merge) with airside-specific elements (stand assignments, turnaround phases, jet blast zones, VDGS guidance).

2.2 Scenario Dimensions

Each scenario is characterized along six orthogonal dimensions:

DimensionAirside-Specific Elements
Operational phaseTransit, approach, turnaround support, pushback support, return (Section 3)
EnvironmentWeather, lighting, surface condition, visibility, temperature (Section 4)
ActorsAircraft state, GSE types, personnel roles, wildlife, emergency vehicles (Section 5)
InfrastructureStand type (contact/remote), taxiway width, service road geometry, AV zone markings
EventsPlanned (turnaround sequence), unplanned (emergency, equipment failure, spill)
Ego stateLoaded/unloaded, speed, dolly train length, battery level, sensor health

2.3 Scenario Combinatorics

The total scenario space is the Cartesian product of all dimension values. With conservative estimates:

DimensionDistinct Values
Operational phase5
Weather6
Lighting4
Surface5
Actor configurations~50
Infrastructure variants~10
Event types~20
Ego states~8

Total functional scenario combinations: ~4.8 million. This is intractable for exhaustive physical testing but manageable for simulation-based evaluation with statistical sampling. The taxonomy's purpose is to identify which regions of this space contain the highest risk and therefore require the most testing attention.

2.4 SOTIF Integration

ISO 21448 (SOTIF) defines two categories of unsafe behavior:

  • Known unsafe scenarios (Area 2): Identified triggering conditions where the system may fail. These must be mitigated to acceptable residual risk.
  • Unknown unsafe scenarios (Area 3): Not yet identified but potentially hazardous. These must be reduced through systematic exploration (scenario generation, field testing, fleet data).

The hazard catalog (Section 6) maps each hazard to its triggering conditions, placing it in Area 2. Section 7 (edge cases) targets Area 3 -- scenarios that have not been observed in operational data but are physically plausible and must be tested in simulation.


3. Operational Phase Taxonomy

3.1 Phase 1: Transit (Depot to Apron)

AttributeValue
DescriptionAV travels from maintenance/charging depot to assigned apron area via service roads and taxiway crossings
Typical duration3-15 minutes depending on airport layout
Speed range10-25 km/h (service road limit typically 25-30 km/h)
Actors presentOther GSE vehicles, occasional ground crew, possibly aircraft taxiing on crossing taxiways
Risk levelMedium -- higher speed, fewer actors, but taxiway crossings are high-consequence
Key hazardsTaxiway incursion (H6), collision with oncoming GSE (H3), FOD on service road (H5)

Critical sub-scenarios:

  • Taxiway crossing: AV must yield to taxiing aircraft with absolute priority. Detection range must exceed aircraft approach speed x reaction time (aircraft taxi speed 15-30 kt = 8-15 m/s; at 200 m detection range, 13-25 s reaction window).
  • Intersection with other GSE at unmarked service road junctions.
  • Transition from depot (indoor/covered) to outdoor environment -- sudden lighting and GPS availability change.

3.2 Phase 2: Approach (Navigating to Assigned Stand)

AttributeValue
DescriptionAV navigates from service road to assigned aircraft stand, entering congested apron area
Typical duration1-5 minutes
Speed range5-15 km/h (apron speed limit typically 15-30 km/h, with 10-15 km/h near stands)
Actors presentParked aircraft, active GSE at adjacent stands, ground crew, marshallers, passengers (bus transfer)
Risk levelHigh -- increasing congestion, proximity to aircraft, narrow clearances
Key hazardsAircraft wingtip clearance violation (H8), collision with GSE at adjacent stands (H3), struck personnel (H1)

Critical sub-scenarios:

  • Navigating between parked aircraft with clearances as small as 7.5 m (25 ft) between wingtips.
  • Arriving at a stand where the turnaround for the previous aircraft is still in progress.
  • Stand reassignment while en route -- AV must re-plan to a different stand.
  • Following VDGS lead-in lines painted on the apron surface (may be degraded or obscured by water/oil).

3.3 Phase 3: Turnaround Support (Operating Around Aircraft)

AttributeValue
DescriptionAV positions at stand, loads/unloads baggage containers or dollies, operates alongside 8-15 other GSE units
Typical duration25-90 minutes depending on aircraft type and turnaround schedule
Speed range0-5 km/h (positioning), stationary during load/unload
Actors presentMaximum density: ground crew (6-20 personnel), belt loaders, container loaders, catering trucks, fuel truck, GPU, PCA unit, lavatory service, potable water truck, possibly passenger bus
Risk levelVery High -- maximum congestion, personnel in blind spots, dynamic environment
Key hazardsPersonnel collision (H1), GSE collision (H3), jet blast during engine start (H4), FOD creation (H5), aircraft clearance (H8)

Critical sub-scenarios:

  • Positioning under aircraft fuselage with <1 m clearance to cargo door sill.
  • Ground crew walking between AV and belt loader during active loading.
  • Adjacent stand pushback commencing while AV is stationary at its own stand.
  • Engine start at own stand (pilot initiates engine spool-up while GSE is still clearing).
  • Night turnaround with mixed lighting: bright flood lights from one direction, deep shadows from another.

3.4 Phase 4: Pushback Support (Coordinating with Pushback Operations)

AttributeValue
DescriptionAV clears the stand area before/during aircraft pushback, or operates in coordination with pushback tug
Typical duration3-10 minutes
Speed range0-10 km/h (clearing), 0 km/h (waiting at safe position)
Actors presentPushback tug, wing walkers, headset-connected ground crew, aircraft with engines starting
Risk levelVery High -- aircraft in motion on the stand, jet blast increasing, time pressure to clear
Key hazardsJet blast exposure (H4), collision with pushback aircraft (H2), personnel collision during evacuation (H1)

Critical sub-scenarios:

  • AV must vacate stand within 2-3 minutes of pushback clearance announcement.
  • Simultaneous pushback at adjacent stand -- the pushed-back aircraft's tail sweeps into AV's planned egress path.
  • Engine start during pushback: CFM56 idle thrust blast reaches 35 kt at 30 m behind aircraft. Larger engines (GE90, Trent XWB) produce higher exhaust velocities.
  • Communication failure: AV does not receive pushback notification and is still on stand when pushback begins.

3.5 Phase 5: Return (Apron to Depot)

AttributeValue
DescriptionAV returns to depot for charging, maintenance, or end-of-shift parking
Typical duration3-15 minutes
Speed range10-25 km/h
Actors presentSimilar to Phase 1 but may include shift-change pedestrian traffic
Risk levelMedium -- similar to transit but potentially at end of battery charge or in adverse weather that has developed
Key hazardsLow battery causing reduced speed or emergency stop (ego failure), nighttime return with reduced visibility, taxiway incursion (H6)

Critical sub-scenarios:

  • Battery critically low -- AV must reach depot before shutdown, but must not sacrifice safety for urgency.
  • Return in worsening weather (fog rolled in during shift).
  • End-of-shift fatigue parallel: sensor degradation from accumulated dirt/water on LiDAR windows over shift duration.

3.6 Phase Summary

PhaseDurationSpeed (km/h)Actor DensityRisk LevelPrimary Hazard
Transit3-15 min10-25LowMediumH6 (incursion)
Approach1-5 min5-15MediumHighH8 (wingtip)
Turnaround25-90 min0-5Very HighVery HighH1 (personnel)
Pushback3-10 min0-10HighVery HighH4 (jet blast)
Return3-15 min10-25Low-MediumMediumH6 (incursion)

4. Environmental Conditions Matrix

4.1 Weather Conditions

ConditionVisibility ImpactSurface ImpactSensor ImpactFrequency
ClearNoneDryNominal50-70% of ops
Light rain (<5 mm/hr)SlightWet, reflectiveLiDAR: 5-15% point loss; Camera: rain drops on lens10-20%
Heavy rain (>10 mm/hr)ModerateStanding water, hydroplaning riskLiDAR: 30-50% point loss; Camera: severely degraded2-5%
Fog (vis <500 m)SevereWetLiDAR: range reduced 30-60%; Camera: contrast loss; Radar: unaffected3-8%
Snow/iceModerate-SevereSlippery, markings obscuredLiDAR: lens accumulation; Camera: contrast loss; All: calibration drift from thermal cycling1-10% (varies by latitude)
De-icing operationsSevere (local)Chemical spray, glycol on surfaceLiDAR: lens contamination in spray zone (sudden point count drop to near-zero); Camera: droplet blur2-5% (winter)
Sandstorm/dustSevereAbrasive, reduced frictionLiDAR: rapid degradation; Camera: pitting; Radar: unaffected<1% (arid airports)

4.2 Lighting Conditions

ConditionIlluminationSensor ImpactFrequency
Day (bright)>10,000 luxCamera: possible saturation from apron reflection; LiDAR: solar interference in near-IR band40-50%
Day (overcast)1,000-10,000 luxCamera: good; LiDAR: nominal15-25%
Dusk/dawn1-1,000 luxCamera: rapid dynamic range change; Auto-exposure hunting5-10%
Night0.5-50 lux (apron lit)Camera: noise, reduced resolution; AEB failure rate for hi-vis vest: 84-88%; LiDAR: unaffected; Thermal: optimal25-35%
MixedHighly variableCamera: extreme dynamic range (bright flood lights + deep shadows); HDR neededCommon at night

4.3 Surface Conditions

ConditionFriction CoefficientImpact on AVDetection Method
Dry concrete0.7-0.9Nominal brakingDefault assumption
Wet concrete0.4-0.720-40% longer braking distanceRain sensor, camera surface analysis
Icy/frosted0.1-0.33-7x longer braking distanceTemperature sensor, friction estimation from wheel slip
Oil/fuel spill0.05-0.15Near-zero braking traction, fire hazardCamera (color/sheen detection), infrastructure notification
Painted markings0.5-0.7 (wet: 0.3-0.5)Reduced traction on wet paintMap layer (known marking locations)
Expansion jointsVariableBump, possible dolly derailmentMap layer, LiDAR elevation change
Rubber deposits0.3-0.6Reduced tractionMap layer (near runway thresholds)

4.4 Visibility Modifiers

ModifierMechanismAffected SensorsMitigation
Jet blast shimmerRefractive index variation from hot exhaustCamera: image distortion, blurThermal camera detects exhaust boundary; radar unaffected
De-icing spray cloudGlycol/water aerosolLiDAR: backscatter, lens contamination; Camera: obscuredRadar sees through; halt and wait
Heat hazeGround-level thermal convectionCamera: image wobble at long rangeLiDAR unaffected; use radar at range
Glare (low sun)Direct sunlight into cameraCamera: blooming, flareSun position prediction; visor/filter; LiDAR unaffected
Aircraft lightingStrobe, nav, landing lightsCamera: local overexposureHDR processing; temporal filtering

4.5 Temperature Effects on Sensors

Temperature RangeEffect
-20C to -10CLiDAR heater power draw increases; battery capacity reduced 20-40%; rubber seals may stiffen; lubricant viscosity increases
-10C to 0CIce formation on sensor windows possible; condensation on cold-soak sensors moving to warm environment
0C to 35CNominal operating range for most sensors
35C to 45CGPU/SoC thermal throttling begins on Orin above ~40C ambient (depending on enclosure); LiDAR may derate
45C to 55CSustained operation at thermal limits; increased failure rate; apron surface temperature can reach 70C+ (burns, tire degradation)

5. Actor Taxonomy

5.1 Aircraft States

StateDescriptionHazard to AVDetection Cues
Parked, engines offChocked, GPU connected, doors openStatic obstacle (wingtip, landing gear, open cargo doors)Size, shape, position in map
Parked, APU runningPre-departure, exhaust from APUMild exhaust (APU outlet typically at tail), noise maskingThermal signature, sound level
Parked, engines startingPilot initiating engine startIngestion zone (15 ft / 4.6 m forward of intake); growing exhaust blastRotating beacon ON = engine start signal; thermal bloom
Taxiing (arrival)Moving to stand at 5-15 ktCollision risk, jet blast, ingestion zoneMoving target, size, ADS-B if available
Taxiing (departure)Moving from stand to taxiwayCollision risk, increasing thrust/blastMoving target, departing trajectory
PushbackMoving backwards under tug controlTail sweeps arc, unpredictable from AV perspectivePushback tug visible, wing walkers present

5.2 GSE Vehicle Types and Behavior Patterns

GSE TypeTypical SpeedBehavior PatternAV-Relevant Hazard
Baggage tractor + dolly train10-25 km/hLong articulated train (up to 4-5 dollies), wide turning radius, often driven aggressively under time pressureLong tail swing; limited rearward visibility; dolly derailment drops baggage (FOD)
Belt loader5-15 km/hApproaches aircraft, extends conveyor to cargo door, operates at stand for 10-20 minExtends/retracts: changing footprint; personnel walk around both sides
Container/pallet loader (high-loader)5-10 km/hLarge vehicle, elevating platform to main-deck cargo doorVery large blind zones; personnel underneath during ULD transfer
Catering truck10-20 km/hElevating box body to aircraft door; operates 10-15 minElevated body obscures camera view of surroundings; personnel between truck and aircraft
Fuel truck/hydrant dispenser5-15 km/hApproaches aircraft underwing fuel panel; connects hosesFuel spill zone around hose connections; NO VEHICLES within 6 m during fueling (safety zone)
Pushback tug5-10 km/hConnects to nose gear, pushes aircraft backwardVery large combined vehicle; aircraft tail sweeps wide arc
GPU (ground power unit)5-15 km/hParks near nose gear, connects cableStationary obstacle with trailing cables; trip/snag hazard
PCA (pre-conditioned air)5-15 km/hParks near aircraft, connects ductingLarge duct creates temporary obstacle; often left in position
Lavatory service truck10-15 km/hServices aft of aircraft; hose connectionsWaste spill hazard; often operates in low-traffic areas
Potable water truck10-15 km/hServices forward of aircraftSpill hazard; small vehicle, may be in AV path
Passenger stairs5-10 km/hPositions at aircraft door (remote stands)Large footprint; passengers descending stairs near ground level
Passenger bus10-30 km/hTransports passengers between terminal and remote standsLarge vehicle; passengers disembarking at stand level near AV
De-icing truck10-15 km/hElevated boom spraying glycol; operates pre-departureSpray cloud degrades sensors; glycol on surface reduces friction
Follow-me car15-30 km/hGuides aircraft on apronMay change speed/direction unexpectedly; AV should not follow
Maintenance vehicle10-20 km/hVarious sizes; may tow equipmentUnpredictable stopping; equipment may extend beyond vehicle footprint

5.3 Personnel Types

Personnel TypeTypical BehaviorVisibilityDetection Challenge
Ground crew (ramp agents)Moving between GSE and aircraft, carrying items, bending, crouchingHi-vis vest (84-88% AEB failure rate at night per research data)Crouching/bending reduces height; occluded by GSE; fast direction changes
MarshallersStanding in front of aircraft, using wands/paddlesHi-vis, reflective wandsGenerally visible; but AV must not approach marshalling zone
Wing walkersWalking alongside aircraft wingtips during pushbackHi-visMoving with aircraft; AV must maintain clearance from pushback sweep zone
Supervisors/managersWalking between stands, sometimes without hi-visVariableMay not wear hi-vis; may cross AV path unexpectedly
Passengers (bus transfer)Groups of 30-150, walking between bus and aircraft stairsCivilian clothing, no hi-visLarge group, unpredictable individual movement, children, mobility-impaired
Fuel handlersNear fuel truck and underwing panelHi-vis + flame-retardantLocated in fuel safety zone; AV must avoid this zone entirely
Maintenance engineersUnder aircraft, on ladders, inside engine cowlingsVariableMay be partially occluded by aircraft; may emerge unexpectedly from under fuselage

5.4 Wildlife

TypeFrequencyHazard to AVDetection
Birds (gulls, starlings)Common at coastal/inland airportsFOD risk if struck; bird remains on surface create FOD for aircraftCamera: good; LiDAR: marginal for small birds
Rabbits/haresCommon at many European airportsSmall, fast-moving; potential collisionLiDAR: detectable >15 m; Radar: possible
FoxesOccasional, mostly nightMedium-sized; may freeze in headlightsLiDAR: good; Thermal: excellent
Large animals (deer)Rare on apron (perimeter fence breach)Significant collision riskAll sensors: good

5.5 Emergency Vehicles

VehiclePriorityAV Response Required
ARFF (Aircraft Rescue and Fire Fighting)Absolute priorityImmediate stop and clear path; do not resume until all-clear
AmbulanceHigh priorityYield, stop if in path
Airport operations vehicleContext-dependentYield if lights/sirens active
Police/securityContext-dependentYield if lights/sirens active

Emergency vehicles may travel at 60-80 km/h on apron during response -- far exceeding normal GSE speeds. The AV must detect and yield at maximum range.


6. Hazard Catalog (SOTIF-Aligned)

6.1 Hazard Summary Table

IDHazardSeverityFrequency (per 1M ops)ASIL-like RatingPrimary Phase
H1Collision with personnelCatastrophic (S3)0.47 fatalities/M departures (NTSB)DTurnaround, Pushback
H2Collision with aircraftCritical (S2-S3)6.2 per 10,000 departuresC-DApproach, Turnaround
H3Collision with other GSESerious (S2)~15 per 10,000 departuresB-CAll phases
H4Jet blast exposureCritical-Catastrophic~0.5 per 10,000 departuresCPushback, Turnaround
H5FOD creationModerate-Critical (S1-S2)Context-dependentBTransit, Turnaround
H6Runway/taxiway incursionCatastrophic (S3)32 per 1M operations (FAA FY2023)DTransit, Return
H7Fuel spill zone entryCritical-CatastrophicRareCTurnaround
H8Aircraft wingtip clearance violationCritical (S2)~3 per 10,000 departuresCApproach, Turnaround

6.2 H1: Collision with Personnel

Description: AV strikes a ground crew member, passenger, or other person on the apron.

Triggering conditions:

  • Personnel in AV blind spot (behind dolly train, under vehicle)
  • Personnel crouching or bending below LiDAR scan plane
  • Personnel emerging from behind GSE or aircraft structure
  • Personnel wearing dark clothing at night (hi-vis failure: 84-88% AEB failure rate)
  • Personnel making sudden direction change
  • Personnel distracted by noise, time pressure, or personal device

Severity: Fatal or life-changing injury. NTSB data: 26% of ground crew accidents are fatal. Average cost per fatality: USD 5-12 million (legal settlements, regulatory penalties, operational disruption).

Frequency estimate: NTSB historical rate: 0.47 struck-by events per million departures (1983-2004). FAA data: 11 fatal struck-by injuries since 1985. The US GAO reported 6 fatal ramp accidents per year in the US alone (2007).

Sensor coverage:

SensorEffectivenessLimitation
LiDARGood (>15 m)Misses crouching personnel <0.5 m; sparse points at range on thin limbs
CameraGood (day)84-88% AEB failure rate at night for hi-vis personnel
RadarFairLow resolution for person-sized targets; poor classification
ThermalExcellent (night)Cost; integration complexity; resolution lower than camera

Mitigation:

  • 360-degree LiDAR coverage (4-8 sensors, reference airside AV stack current config)
  • Thermal cameras for night operations (FLIR Boson 640, 640x512 @ 60 Hz)
  • Conservative speed limits near personnel: max 5 km/h within 5 m of any detected person
  • Emergency stop distance < 0.5 m at 5 km/h (requires <0.36 s total reaction time)
  • Under-vehicle sensing (ultrasonic or short-range radar) to detect personnel in crush zone
  • V2I notification of personnel presence from infrastructure cameras

Residual risk: Personnel in complete sensor shadow (behind aircraft landing gear, between stacked containers). Mitigated by infrastructure perception and operational procedures (exclusion zones).

6.3 H2: Collision with Aircraft

Description: AV contacts aircraft fuselage, landing gear, engine nacelle, or other aircraft structure.

Triggering conditions:

  • Localization error (GPS multipath near terminal buildings: 2-10 m error)
  • Path planning error approaching stand
  • Dolly train tail swing contacting aircraft during positioning
  • Undetected aircraft position change (pushback initiated without notification)
  • Wind gust moving lightweight AV toward aircraft

Severity: Aircraft ground damage average cost: USD 75,000-150,000 per incident (IATA). Range: USD 250,000 (minor dent) to USD 139 million+ (structural damage requiring major repair or write-off). Engine damage from GSE collision can reach USD 35 million per engine.

Frequency estimate: 22,400 aircraft ground damage incidents per year worldwide (IATA, based on 0.6-0.75 per 1,000 departures, 32 million departures in 2019). GSE causes 61% of these.

Sensor coverage:

SensorEffectivenessLimitation
LiDARExcellentAircraft are large, high-reflectivity targets
CameraGoodNight/glare may reduce detection quality
UltrasonicGood (close range)Only effective ❤️ m; useful for final positioning

Mitigation:

  • HD map with precise aircraft stand geometry and expected aircraft envelope per type
  • Ultrasonic proximity sensors for final approach to stand (❤️ m)
  • Dolly train articulation monitoring (prevent tail swing exceeding envelope)
  • Real-time aircraft position from ADS-B/MLAT (infrastructure feed)
  • Minimum 3 m clearance from aircraft unless in designated approach corridor

Residual risk: Aircraft position differs from expected (wrong gate assignment, aircraft parked off-center). Mitigated by LiDAR-based aircraft detection and real-time position estimation.

6.4 H3: Collision with Other GSE

Description: AV collides with another ground support equipment vehicle.

Triggering conditions:

  • Other GSE vehicle makes unpredictable maneuver (sudden reverse, U-turn)
  • GSE vehicle approaches from behind (AV rear blind spot if dolly train present)
  • GSE vehicle obscured by aircraft or other large equipment
  • GSE vehicle exceeds apron speed limit (tracked at up to 60% above speed limit per ICAO observations)
  • Multiple GSE converging at stand simultaneously during turnaround

Severity: Vehicle damage USD 5,000-100,000. Personnel injury possible if occupant ejected or pinned. Cargo damage/delay costs.

Frequency estimate: More frequent than aircraft damage -- estimated 15-20 GSE-to-GSE incidents per 10,000 departures based on industry surveys.

Sensor coverage: LiDAR and camera provide good detection of GSE-sized objects. Radar effective at all ranges. Primary challenge is prediction of GSE behavior, not detection.

Mitigation:

  • 360-degree obstacle detection with minimum 50 m range
  • GSE trajectory prediction (constant velocity model as baseline; learned model for improved prediction)
  • Cooperative awareness via V2V if available (future: all GSE broadcasting position)
  • Conservative right-of-way: AV always yields to human-driven GSE

Residual risk: GSE reversing into AV at high speed from close range (reaction time insufficient). Mitigated by rear-facing sensors and audible/visual warning systems.

6.5 H4: Jet Blast Exposure

Description: AV enters jet blast zone and is displaced, overturned, or damaged by engine exhaust.

Triggering conditions:

  • Aircraft engine start while AV is within blast zone
  • AV path crosses behind taxiing aircraft
  • Wind direction changes, redirecting blast zone toward AV
  • AV does not receive pushback/departure notification

Severity: Large jet engines (CF6, GE90, Trent XWB) produce exhaust velocities exceeding 100 kt (185 km/h) at 60 m behind the aircraft at 40% N1 (ground idle). At takeoff thrust, hazardous blast extends to 365 m (1,200 ft). A baggage tractor weighing 3-5 tonnes can be displaced or overturned. Personnel exposure is potentially fatal.

Jet blast zone dimensions by engine class:

Engine ClassExampleGround Idle (35 kt zone)Breakaway (50 kt zone)Takeoff (100 kt zone)
Small turbofanCFM56-5B30 m behind45 m behind150 m behind
Medium turbofanPW1100G35 m behind55 m behind180 m behind
Large turbofanGE90-115B50 m behind75 m behind250 m behind
Very large turbofanTrent XWB55 m behind80 m behind280 m behind

Engine intake ingestion zone: 4.6 m (15 ft) in front of engine intake. Personnel within this zone risk ingestion -- two fatal engine ingestion incidents occurred in 2022-2023 (Montgomery, Alabama and San Antonio, Texas). AV must never enter the intake zone.

Sensor coverage:

SensorEffectivenessLimitation
Thermal cameraExcellentOnly passive sensor that can visualize jet exhaust boundaries
LiDARPoorJet exhaust is invisible to LiDAR; vibration from blast degrades readings
CameraPoorExhaust shimmer visible but not reliably detectable
RadarFairMay detect velocity changes in blast zone particles
IMU/accelerometerGood (reactive)Detects blast force after exposure begins -- too late for avoidance

Mitigation:

  • Aircraft state monitoring via ADS-B, MLAT, or airport A-CDM feed (engine start notification)
  • Rotating beacon detection (beacon ON = engines running or about to start)
  • Thermal camera for exhaust plume boundary detection
  • Predictive blast zone overlay on occupancy grid based on aircraft type and engine state
  • Hard geofence: AV may not enter computed blast zone under any circumstances
  • Wind direction and speed integration to model blast deflection

Residual risk: Unannounced engine start (pilot deviation from procedure). Mitigated by rotating beacon detection and thermal sensing.

6.6 H5: FOD Creation

Description: AV drops cargo, loses vehicle parts, or creates debris that becomes Foreign Object Debris hazardous to aircraft.

Triggering conditions:

  • Baggage/cargo shifts on dolly during transit or turn
  • Dolly latch failure allows container to slide off
  • Vehicle component detaches (wheel cover, mirror, antenna)
  • AV drives over and scatters existing FOD (amplification)

Severity: FOD costs the aviation industry USD 4 billion annually (Boeing estimate). A single damaged engine fan blade can cost USD 50,000+. 55% of all FOD is discovered in stand/apron areas. Catastrophic engine failure from FOD ingestion can cause hull loss.

Frequency estimate: Context-dependent. Higher risk with loaded dollies, on turns, and on rough surfaces.

Sensor coverage: Rear-facing cameras can detect dropped items; accelerometers detect dolly latch failure. Self-monitoring of vehicle integrity via BIT (built-in test).

Mitigation:

  • Dolly latch status monitoring (sensor on latch mechanism)
  • Rear-facing camera with dropped-object detection
  • Pre-trip vehicle inspection (automated visual check)
  • FOD detection on path ahead (LiDAR anomaly detection, see fod-and-jetblast.md)
  • Speed limits on turns when loaded

Residual risk: Small items (single baggage tag, plastic wrap) not detectable by vehicle sensors. Mitigated by periodic FOD sweeps and infrastructure FOD detection systems.

6.7 H6: Runway/Taxiway Incursion

Description: AV enters an active runway or taxiway without authorization, creating collision risk with aircraft.

Triggering conditions:

  • Localization failure places AV on wrong side of hold-short line
  • Path planning error routes AV across taxiway without clearance
  • AV follows another GSE vehicle that has clearance (unauthorized following)
  • Map error: hold-short line position incorrect in HD map
  • GPS multipath/spoofing near terminal places AV on taxiway

Severity: Catastrophic. Aircraft-vehicle collision on taxiway or runway at aircraft taxi speed (15-30 kt) would likely result in fatalities and major aircraft damage. FAA FY2023: 32 incursions per 1 million operations. Vehicle/Pedestrian Deviations (VPDs) are one of three incursion categories. Category A (most severe) incursions involve immediate collision risk.

Frequency estimate: FAA FY2024: 9 serious (Category A/B) runway incursions total (all causes), down 59% from 22 in FY2023. VPDs represent approximately 19% of all runway incursions. For airside AVs, taxiway crossings are the primary risk -- no AV should ever operate on a runway.

Sensor coverage:

SensorEffectivenessLimitation
GPS/GNSSGood (open areas)Multipath near buildings; 2-10 m error possible
LiDAR SLAMGoodProvides independent position check
Camera (sign/marking recognition)Good (day)Hold-short markings may be worn or obscured
ADS-B/MLAT (infrastructure)ExcellentProvides aircraft positions; AV can check for conflict

Mitigation:

  • Multi-source localization: GPS + LiDAR SLAM + wheel odometry (GTSAM fusion, current reference airside AV stack approach)
  • Geofence hard limits: AV physically cannot cross hold-short lines without explicit clearance signal from airport system
  • Hold-short line detection via camera and map cross-reference
  • ADS-B/MLAT aircraft position feed: AV checks for aircraft on taxiway before crossing
  • If localization confidence drops below threshold, AV stops immediately

Residual risk: Simultaneous localization failure across all sources (GPS + SLAM + odometry). Probability: extremely low (~10^-9 per hour with independent failure modes). Mitigated by immediate stop on any single-source disagreement.

6.8 H7: Fuel Spill Zone Entry

Description: AV enters an area where jet fuel (Jet A/A-1) has spilled, creating fire/explosion risk from vehicle electrical systems or friction sparks.

Triggering conditions:

  • Fuel hose leak or disconnection during refueling
  • Wing vent spill (>95% of fuel spills at Phoenix Sky Harbor)
  • Fuel truck collision spilling fuel on apron
  • AV not informed of fuel spill location

Severity: Jet fuel (Jet A) flash point: 38C (100F). Below this temperature, spilled fuel does not easily ignite. Above this temperature (common on hot aprons), vapor-air mixture is flammable. A vehicle's electrical system, ESD, or friction spark could ignite vapors. Consequence: fire/explosion, potential aircraft destruction (>USD 100 million), personnel casualties.

Sensor coverage:

SensorEffectivenessLimitation
CameraFairCan detect fuel sheen on wet surface in good lighting
LiDARPoorCannot detect liquid fuel on ground
Olfactory sensorPossibleHydrocarbon gas detector could detect vapors
Infrastructure notificationBestAirport fuel management system reports spill location

Mitigation:

  • Fuel safety zone enforcement: AV maintains minimum 6 m from any active fueling operation
  • Infrastructure notification: fuel management system broadcasts spill alerts
  • Fuel vapor detector on vehicle (hydrocarbon sensor)
  • Camera-based fuel sheen detection (research stage)
  • If fuel detected or reported, AV stops and routes around

Residual risk: Undetected fuel leak in AV's path. Mitigated by intrinsically safe vehicle electrical design (ATEX/IECEx compliance for vehicles operating in refueling zones).

6.9 H8: Aircraft Wingtip Clearance Violation

Description: AV or its dolly train passes too close to aircraft wingtip, risking contact.

Triggering conditions:

  • AV navigating between aircraft at adjacent stands with <7.5 m (25 ft) wingtip separation
  • Dolly train tail swing on turn exceeds expected envelope
  • Aircraft parked off-center from stand centerline (reduces clearance)
  • Wrong aircraft type at stand (larger than expected, e.g., A321neo XLR instead of A320)

Severity: Wingtip damage: USD 250,000-2,000,000 depending on extent. Composite wingtip repair can take 2-4 weeks, causing extended aircraft out-of-service costs.

Frequency estimate: Wingtip strikes are a subset of the 22,400 annual ground damage incidents. Pushback-related wingtip strikes are specifically documented: multiple incidents of simultaneous pushback from adjacent stands causing tail-to-wingtip contact (SKYbrary records: B737+B737 collision Jan 2018; B767+A320 collision Aug 2017; B767+B737 collision Mar 2018; B787+A350 wingtip-to-stabilizer contact Apr 2024 at Heathrow).

Sensor coverage: LiDAR provides precise 3D measurement of wingtip position. Camera provides visual confirmation. Ultrasonic for close-range (❤️ m) clearance monitoring.

Mitigation:

  • HD map contains stand geometry with aircraft type-specific clearance envelopes
  • Real-time aircraft type identification (from AMS/AODB feed or LiDAR-based classification)
  • Minimum 3 m clearance from nearest aircraft surface enforced by planner
  • Dolly train articulation model: compute swept path including all dollies before committing to maneuver
  • LiDAR-based wingtip position detection overrides map-based assumptions

Residual risk: Aircraft parked with gear failure (lower than expected, wing drooping). Extremely rare; mitigated by LiDAR-based real-time clearance measurement.


7. Edge Cases and Long-Tail Scenarios

7.1 Overview

Edge cases are scenarios that are individually rare (frequency <10^-4 per operating hour) but collectively represent the dominant residual risk after common hazards are mitigated. Per the Pareto principle inverted: the last 1% of scenarios may account for 50% of the remaining safety risk.

7.2 Top 20 Edge Cases Ranked by Risk

RankEdge CaseFrequency EstimateSeverityRisk Score (F x S)Test Method
1Simultaneous pushback at adjacent stands with tail sweep into AV path10^-3/hr at busy standsAircraft damage + personnel riskVery HighSimulation + physical (empty aircraft)
2Engine start while AV is within blast zone, no prior notification10^-4/hrVehicle overturn, personnel injuryVery HighSimulation only
3Passenger (child) breaks away from group, runs into AV path10^-4/hr at remote standsFatality riskVery HighSimulation + controlled physical (dummy)
4Fuel spill ignition near AV (fire on apron)10^-5/hrFire/explosion, catastrophicVery HighSimulation only
5Bird strike debris field: aircraft hits flock on approach, scatters debris across apron10^-5/hrFOD damage to AV and other aircraftHighSimulation only
6Medical emergency: ground crew collapses in AV path10^-4/hrFatality if run overHighSimulation + dummy test
7Aircraft tire blowout during taxi near AV10^-5/hrDebris projectile, AV damageHighSimulation only
8Marshaller giving incorrect/contradictory signals10^-3/hrAV enters wrong zone, collisionHighSimulation + physical (staged)
9GPS spoofing/interference causing localization jump10^-5/hrTaxiway incursionHighInjection test (simulated GPS)
10De-icing spray coats all LiDAR and camera lenses simultaneously10^-4/hr (winter)Complete sensor blindnessHighControlled spray test
11Two AV units assigned to same stand simultaneously10^-4/hrAV-AV collisionMedium-HighFleet simulation + physical test
12Power failure at night: apron lights off, AV loses infrastructure feeds10^-5/hrNavigation in darkness, no V2IMedium-HighControlled blackout test
13Heavy cargo shifts during transit, changing AV center of gravity10^-3/hrRollover on turn, loss of steeringMedium-HighPhysical test with instrumented load
14Standing water pool on apron reflecting LiDAR (phantom ground plane)10^-3/hr (rain)False obstacle / navigation errorMediumPhysical test in rain
15Jet fuel vapor cloud drifts over AV (fuel truck overflows nearby)10^-5/hrVapor ignition risk from AV electronicsMediumSimulation only
16Aircraft door opens unexpectedly (maintenance, passenger attempt)10^-4/hrChanging aircraft envelope, personnel emergingMediumSimulation
17Construction zone on apron: barriers, excavation, new routing10^-3/hrMap mismatch, unexpected obstaclesMediumPhysical test with barrier setup
18Wildlife incursion: large bird (goose/heron) on apron10^-3/hrCollision, FOD if AV strikes birdMediumSimulation + decoy test
19Dolly wheel failure (flat tire/axle break) during transit10^-4/hrDolly drags, creates FOD, AV pulls to one sideMediumPhysical test with controlled failure
20Cybersecurity: spoofed command to AV via compromised airport network10^-6/hrAV diverted to unsafe areaMediumPenetration testing, see ../cybersecurity/cybersecurity-airside-av.md

7.3 Scenario Details for Highest-Risk Edge Cases

Edge Case 1: Simultaneous adjacent pushback. At busy airports, adjacent-stand pushback is common. The pushback aircraft's tail describes an arc that sweeps into the adjacent stand area. If an AV is at the adjacent stand, it has limited time (30-60 s) to detect the pushback and either evacuate or hold position within a safe zone. Real incidents: B767+A320 at Delhi (Aug 2017), B767+B737 (Mar 2018), B787+A350 at Heathrow (Apr 2024).

Edge Case 2: Unannounced engine start. Standard procedure requires all GSE to clear the safety zone before engine start. However, procedural deviations occur. The rotating beacon (anti-collision light) should activate before engine start, providing a visual cue. Detection: camera-based beacon detection; thermal camera detects intake temperature change; infrastructure A-CDM feed provides departure sequence.

Edge Case 3: Child passenger on apron. At remote stands, passengers walk from bus to aircraft stairs. A child breaking away from parents moves unpredictably, is smaller than adult personnel, and may not be wearing any visible clothing. Detection: camera-based pedestrian detection with child classification; thermal camera detects regardless of clothing.

7.4 Aviation Safety Database Sources for Frequency Estimation

DatabaseCoverageAccess
IATA Ground Damage Database180+ airlines, standardized damage reportsIATA members only
FAA Runway Incursion DatabaseAll US towered airportsPublic (faa.gov)
ASRS (Aviation Safety Reporting System)Voluntary reports, USPublic (asrs.arc.nasa.gov)
EASA Annual Safety ReviewEuropean operationsPublic (easa.europa.eu)
SKYbrary Accident DatabaseGlobal, curated by EurocontrolPublic (skybrary.aero)
NTSB Aviation Accident DatabaseUS accident investigationsPublic (ntsb.gov)
UK AAIB ReportsUK accident investigationsPublic (gov.uk/aaib-reports)

7.5 Simulation-Only Scenarios

The following scenarios cannot be safely tested on a live airfield and must be validated exclusively in simulation:

  1. Fuel fire/explosion on apron
  2. Aircraft tire blowout debris
  3. Bird strike debris field
  4. Full power engine blast exposure
  5. Multi-vehicle pile-up on service road
  6. Cybersecurity attack compromising AV fleet
  7. Catastrophic weather event (microburst, hailstorm)
  8. Aircraft structural failure (landing gear collapse at stand)

These scenarios require high-fidelity simulation environments (see [../deployment/simulation-validation-strategy.md] if available) with physics-accurate jet blast modeling, fire/fluid simulation, and multi-agent coordination.


8. Sensor Coverage Analysis per Scenario

8.1 Sensor Capability Matrix

ScenarioLiDARCamera (Visible)Camera (Thermal)4D RadarUltrasonicInfrastructure (V2I)
Personnel detection (day)GoodGoodFairFairGood (<3m)Good
Personnel detection (night)GoodPoor (84-88% fail)ExcellentFairGood (<3m)Good
Personnel crouchingMarginalFairGoodPoorGood (<3m)Fair
Aircraft detectionExcellentGoodGoodGoodGood (<3m)Excellent
GSE detectionGoodGoodFairGoodFairGood
Jet blast zoneFailPoor (shimmer only)ExcellentFairFailGood (A-CDM)
FOD detection (>10 cm)Good (<25m)Good (day)PoorFairFailGood (Tarsier/FODetect)
FOD detection (❤️ cm)FailMarginal (close)FailFailFailGood (W-band radar)
Fuel spillFailFair (sheen)FairFailFailGood (fuel system)
Taxiway incursion preventionFair (SLAM)Fair (markings)FailFailFailExcellent (ADS-B/MLAT)
Wingtip clearanceExcellentGoodFairFairExcellent (<3m)Good (stand allocation)
Standing water (reflections)MarginalFairFailGoodFailFail
Heavy rainDegraded (-30-50%)DegradedUnaffectedUnaffectedUnaffectedUnaffected
FogDegraded (-30-60%)Fail (<100m)UnaffectedUnaffectedUnaffectedUnaffected
Snow/ice on sensorsFail (if blocked)Fail (if blocked)Fail (if blocked)UnaffectedUnaffectedUnaffected
Night + rainDegradedFailGoodUnaffectedUnaffectedUnaffected
De-icing sprayFail (contaminated)Fail (contaminated)Fail (contaminated)UnaffectedUnaffectedUnaffected

8.2 Key Findings

  1. No single sensor modality provides adequate coverage across all scenarios. Minimum viable sensor suite: LiDAR + camera + 4D radar + thermal camera + ultrasonic.

  2. 4D radar should be primary (not backup) for airside. It is the only vehicle-mounted sensor immune to rain, fog, de-icing spray, and jet exhaust. Continental ARS548 provides 300 m range, 4D point cloud, and Doppler velocity.

  3. Thermal cameras are essential, not optional. They are the only passive sensor that can detect jet blast boundaries, see personnel at night when hi-vis fails, and operate through fog. Recommended: FLIR Boson 640 (640x512, 60 Hz, USD 3-6K, MIPI CSI-2 for direct Orin integration).

  4. Infrastructure perception (V2I) fills critical gaps. Fuel spill notification, ADS-B aircraft position, A-CDM departure sequence, and FOD detection from fixed sensors all provide information that vehicle-mounted sensors cannot reliably obtain.

  5. The de-icing scenario is the hardest. Glycol spray can simultaneously blind LiDAR, camera, and thermal sensors. Only radar and infrastructure feeds remain operational. The AV must stop immediately when spray is detected and wait for clearing. Sensor window heating and cleaning systems are essential for winter operations.

8.3 Sensor Fusion Requirements per Phase

PhasePrimary SensorsSecondary SensorsInfrastructure Feeds
TransitLiDAR, Camera, RadarThermalGPS, map, ADS-B for taxiway crossings
ApproachLiDAR, Camera, Radar, UltrasonicThermalStand allocation, aircraft type, VDGS
TurnaroundLiDAR, Camera, Thermal, UltrasonicRadarPersonnel tracking, turnaround status, fuel zone
PushbackThermal (jet blast), LiDAR, CameraRadar, UltrasonicA-CDM departure sequence, pushback notification
ReturnLiDAR, Camera, RadarThermalGPS, map

9. Scenario Frequency and Risk Matrix

9.1 FMEA-Style Risk Assessment

Risk Priority Number (RPN) = Severity (S) x Occurrence (O) x Detectability (D)

Scales:

  • Severity (S): 1 = negligible, 2 = minor injury/damage, 5 = serious injury/significant damage, 8 = life-threatening/major damage, 10 = fatality/catastrophic
  • Occurrence (O): 1 = extremely rare (<10^-6/hr), 2 = rare (10^-5/hr), 3 = uncommon (10^-4/hr), 5 = occasional (10^-3/hr), 7 = frequent (10^-2/hr), 10 = expected (>10^-1/hr)
  • Detectability (D): 1 = certain detection, 2 = high detection, 3 = moderate detection, 5 = low detection, 8 = very low detection, 10 = undetectable

9.2 Risk Priority Number Table

RankScenarioSODRPNHazardPhase
1Personnel struck at night during turnaround1055250H1Turnaround
2AV enters jet blast zone, unannounced engine start835120H4Pushback
3AV contacts aircraft fuselage during positioning85280H2Turnaround
4Taxiway incursion, aircraft approaching102360H6Transit
5Personnel crouching in blind spot, AV reverses1035150H1Turnaround
6Adjacent pushback sweeps into AV position83372H2Pushback
7Fuel spill ignites near AV101550H7Turnaround
8AV drops baggage container as FOD55375H5Transit
9AV contacts wingtip navigating between stands83248H8Approach
10De-icing spray blinds all sensors538120H1-H3All (winter)
11Child breaks from passenger group103390H1Turnaround
12AV-AV collision (fleet coordination failure)53230H3All
13Heavy rain degrades LiDAR + camera simultaneously55375H1-H3All
14GPS spoofing/interference82580H6Transit
15GSE reverses into AV from behind55375H3Turnaround

9.3 Risk Reduction Targets

For certification under ISO 3691-4, safety functions must achieve Performance Level d (PLd) or higher for personnel protection functions. This corresponds to:

  • Probability of dangerous failure per hour: 10^-7 to 10^-6
  • Mean time to dangerous failure (MTTFd): high (30-100 years per component)
  • Diagnostic coverage: 99%+
  • Category 3 or 4 architecture (redundant with diagnostic monitoring)

The Simplex architecture (production stack as safety baseline + experimental neural stack) inherently provides Category 3 by offering a verified backup controller. See ../runtime-assurance/simplex-safety-architecture.md.

9.4 Top 10 Highest-Risk Scenarios After Mitigation

After applying the mitigations described in Section 6, the residual risk ranking shifts:

RankScenarioResidual RiskLimiting Factor
1Personnel in complete sensor shadow (behind aircraft gear)MediumNo sensor can see around solid obstructions
2De-icing spray simultaneous sensor blindnessMediumOnly radar remains; limited resolution
3Unannounced engine start in blast zoneMediumRequires infrastructure notification; sensor detection is reactive
4Fuel vapor ignition from AV electronicsLow-MediumRequires ATEX-rated electrical design
5Child running from passenger group at nightLow-MediumThermal camera helps but small fast targets are challenging
6GPS spoofing causing localization failureLow-MediumMulti-source fusion mitigates but coordinated attack possible
7Construction zone with no map updateLow-MediumCamera-based barrier detection helps
8Adjacent pushback with simultaneous engine startLow-MediumCompound event; each component mitigated individually
9Heavy cargo shift causing AV rolloverLowLoad monitoring sensors; speed limits on turns
10Cybersecurity attack on fleet managementLowNetwork segmentation; vehicle-level authentication

10. Testing Strategy per Scenario

10.1 Test Method Classification

MethodApplicable ScenariosAdvantagesLimitations
Physical test (real vehicle, real airport)Common scenarios, nominal operations, sensor performanceHighest fidelity; validates real hardwareExpensive; limited repetitions; cannot test dangerous scenarios
Physical test (closed course)Pedestrian detection (dummy), emergency stop, obstacle avoidanceSafe, repeatable; hardware in loopMay not represent full airport complexity
Hardware-in-the-loop (HiL)Sensor injection, compute timing, actuator responseTests real compute stack; repeatableRequires sensor simulation fidelity
Software-in-the-loop (SiL)All scenarios; especially long-tail edge casesUnlimited scenarios; statistical coverageSimulation fidelity gap; no hardware validation
Replay testingHistorical scenarios from recorded dataUses real sensor data; regression testingCannot test unobserved scenarios
Fleet shadow modeAll scenarios encountered in operationReal-world coverage; no safety risk (shadow does not control)Cannot test response to scenarios; only detection

10.2 Minimum Test Repetitions

Statistical confidence for safety-critical functions requires sufficient test repetitions to demonstrate failure rates below the target probability:

For a target failure rate of p = 10^-6 per hour with 95% confidence:

  • Required test hours without failure: N = -ln(0.05) / p = 3,000,000 hours
  • This is infeasible with physical testing alone (342 years of continuous operation)
  • Therefore: simulation must provide the bulk of evidence, with physical testing validating simulation fidelity

Practical approach (based on TractEasy precedent of 1-6 years per airport approval):

Test TypeMinimum VolumePurpose
Physical on-airport10,000 km or 2,000 hours without safety-critical incidentDemonstrates nominal performance
Physical closed-course500 test runs per critical scenario (e.g., pedestrian emergency stop)Validates reaction time, stopping distance
SiL simulation10,000,000 scenario-km covering full taxonomyStatistical coverage of long-tail scenarios
HiL testing1,000 hours per sensor modality in degraded conditionsValidates degradation detection
Shadow mode50,000 km minimum before autonomous operationValidates perception and planning without risk

10.3 Scenario-to-Test Mapping

Scenario CategoryPhysical (Airport)Physical (Closed)HiLSiLShadowReplay
Nominal transitYesYesYesYesYesYes
Nominal approachYesYesYesYesYesYes
Nominal turnaroundYesLimitedYesYesYesYes
Personnel detection (day)YesYes (dummy)YesYesYesYes
Personnel detection (night)YesYes (dummy)YesYesYesYes
Emergency stopYesYesYesYesN/AYes
Jet blast avoidanceNoLimitedYesYesYesNo
FOD detectionYesYes (placed objects)YesYesYesYes
Taxiway crossingYes (supervised)NoYesYesYesYes
Fuel spill responseNoNoYesYesN/ANo
Rain/fog operationYes (opportunistic)NoYesYesYesYes
De-icing sprayNoYes (controlled)YesYesNoNo
Adjacent pushbackNoNoYesYesYesNo
Engine start blastNoNoYesYesYesNo
Emergency vehicle yieldNoYes (staged)YesYesYesNo
Multi-AV coordinationYes (supervised)YesYesYesYesNo

10.4 Scenario Database Format

For systematic scenario management, we recommend adapting OpenSCENARIO DSL (formerly OpenSCENARIO 2.0) for airside scenarios. OpenSCENARIO DSL provides a domain-specific language for defining abstract, logical, and concrete scenarios with constraint-based parameter variation.

Airside-specific extensions required:

// Airside ODD extensions for OpenSCENARIO DSL

type airside_zone: enum of [apron, service_road, taxiway_crossing, 
                             depot, maintenance_area]

type aircraft_state: enum of [parked_engines_off, parked_apu_on,
                               engines_starting, taxiing_in, 
                               taxiing_out, pushback]

type turnaround_phase: enum of [arrival, unloading, servicing,
                                 loading, departure_prep, pushback]

type gse_type: enum of [baggage_tractor, belt_loader, container_loader,
                         catering_truck, fuel_truck, pushback_tug,
                         gpu, pca, lavatory_truck, water_truck,
                         passenger_stairs, deicing_truck, follow_me]

struct airside_scenario:
    zone: airside_zone
    weather: weather_condition
    lighting: lighting_condition
    aircraft: list of (aircraft_type, aircraft_state)
    gse: list of (gse_type, position, velocity)
    personnel: list of (personnel_type, position, activity)
    ego: ego_vehicle_state
    event: optional airside_event

This format enables automated generation of concrete test scenarios from abstract scenario definitions, with coverage tracking against the taxonomy dimensions defined in Section 2.

10.5 Regression Test Suite Design

The regression test suite should include:

  1. Golden scenarios (fixed): 100 concrete scenarios covering each hazard at least 5 times with varying environmental conditions. These never change and provide a consistent safety baseline across software releases.

  2. Parameterized sweep (automated): For each of the 8 hazard categories, generate 1,000 concrete scenarios by sampling from the logical scenario parameter space. Re-run on every build.

  3. Adversarial scenarios (evolving): Generated by adversarial scenario search (e.g., Bayesian optimization over scenario parameters to find configurations that cause failures). Add discovered failure scenarios to the golden set permanently.

  4. Field-discovered scenarios (growing): Every safety-relevant event from fleet operation (shadow mode or autonomous) is converted to a replay test case and added to the regression suite.


11. Regulatory Mapping

11.1 ISO 3691-4 Scenario Requirements

ISO 3691-4:2023 Annex B provides a hazard list for driverless industrial trucks. Mapping to our taxonomy:

ISO 3691-4 HazardOur TaxonomyCoverage
Collision with persons (Clause 4.3.1)H1Full
Collision with obstacles (Clause 4.3.2)H2, H3, H8Full
Uncontrolled movement (Clause 4.3.3)H4 (blast-induced), H5 (cargo shift)Partial -- standard does not specifically address jet blast
Hazardous zone entry (Clause 4.3.4)H6, H7Partial -- standard addresses general zones, not aviation-specific
Electrical hazard (Clause 4.4)H7 (fuel ignition from electrical)Partial
Environmental conditions (Clause 4.5)Section 4 (full matrix)Full

Gap: ISO 3691-4 was designed for warehouse/factory environments. It does not specifically address:

  • Jet blast hazards (H4) -- unique to airport operations
  • Runway/taxiway incursion (H6) -- unique to airport operations
  • Aircraft-specific clearance requirements (H8)
  • De-icing operations
  • Fuel spill/fire scenarios specific to aviation fuel
  • Wildlife on operating surface
  • Emergency vehicle right-of-way at airport speeds

These gaps must be addressed through supplementary safety analysis and will likely be covered by future aviation-specific standards.

11.2 FAA Advisory Circular (Anticipated ~2028-2029)

Based on FAA CertAlert 24-02 (non-directive, supports controlled testing) and the regulatory trajectory analysis (see ../../80-industry-intel/regulations/regulatory-trajectory-deep-dive.md), the anticipated FAA AC will likely require:

Expected RequirementOur Taxonomy Coverage
Operational Design Domain definitionSection 2 (dimensions), Section 3 (phases), Section 4 (environment)
Hazard analysis per SOTIF principlesSection 6 (full hazard catalog)
Scenario-based testing evidenceSection 10 (test strategy)
Edge case identification and mitigationSection 7 (20 edge cases with mitigations)
Sensor performance envelopeSection 8 (sensor coverage matrix)
Emergency vehicle interactionSection 5.5, Section 7 (emergency vehicle yield)
Runway/taxiway incursion preventionH6 (Section 6.7)

11.3 EASA AMC (Anticipated ~2028)

EASA's approach, influenced by the EU Machinery Regulation 2027 (which mandates third-party assessment for AI-based autonomous vehicles) and EU Product Liability Directive 2024/2853 (software/AI as "products" subject to strict liability), will likely impose additional requirements:

Expected EASA RequirementOur Taxonomy Coverage
AI/ML safety assurance (per EU AI Act)Section 8 (sensor/ML limitations), Section 10 (test coverage)
Third-party conformity assessmentSection 10 (provides assessor with structured evidence)
Continuous safety monitoringSection 10.5 (regression suite, field-discovered scenarios)
Environmental robustness evidenceSection 4 (full environmental matrix)

11.4 ISO/SAE Standards (Anticipated ~2029-2030)

An ISO or SAE standard specifically for autonomous GSE in airport environments is anticipated. This taxonomy is designed to be forward-compatible with such a standard. Key areas that a future standard will need to address:

AreaCurrent GapThis Document's Contribution
Airport-specific ODD definitionNo standard defines airside ODDSection 2 provides dimension framework
Turnaround coordination requirementsNot addressed in any vehicle standardSection 3.3 defines turnaround phase scenarios
Jet blast safetyNot addressed in any vehicle standardSection 6.5 provides blast zone models
Aviation fuel safety zonesAddressed in IATA IGOM but not in vehicle standardsSection 6.8 maps fuel zone requirements
Multi-AV fleet coordinationEmerging topic in ISO 3691-4 revisionSection 7 addresses fleet coordination edge cases

11.5 Regulatory Scenario Coverage Summary

Scenario CategoryISO 3691-4FAA AC (predicted)EASA AMC (predicted)ISO/SAE (predicted)
Personnel collisionRequiredRequiredRequiredRequired
Aircraft collisionPartialRequiredRequiredRequired
GSE collisionRequiredRequiredRequiredRequired
Jet blastNot coveredRequiredRequiredRequired
FOD creationNot coveredLikely requiredLikely requiredRequired
Taxiway incursionNot coveredRequiredRequiredRequired
Fuel spill zoneNot coveredRequiredRequiredRequired
Wingtip clearanceNot coveredRequiredRequiredRequired
Weather degradationGeneralDetailedDetailedDetailed
Night operationsGeneralDetailedDetailedDetailed
Emergency vehicleNot coveredRequiredRequiredRequired
CybersecurityNot coveredLikely requiredRequired (EU CRA)Likely required
AI/ML assuranceNot coveredLikely requiredRequired (EU AI Act)Likely required

12. References

Standards

  1. ISO 34502:2022 -- Road vehicles -- Test scenarios for automated driving systems -- Scenario based safety evaluation framework
  2. ISO 21448:2022 -- Road vehicles -- Safety of the intended functionality (SOTIF)
  3. ISO 3691-4:2023 -- Industrial trucks -- Safety requirements and verification -- Part 4: Driverless industrial trucks and their systems
  4. ISO 12100:2010 -- Safety of machinery -- General principles for design -- Risk assessment and risk reduction
  5. ISO 13849-1:2023 -- Safety of machinery -- Safety-related parts of control systems -- Part 1: General principles for design
  6. IEC 62998-1:2019 -- Safety of machinery -- Safety-related sensors used for the protection of persons

Aviation Safety Data

  1. IATA Ground Operations Safety Program. https://www.iata.org/en/programs/ops-infra/ground-operations/safety/
  2. IATA Ground Damage Database -- ground damage frequency: 0.6-0.75 per 1,000 departures, 22,400 damages/year
  3. Flight Safety Foundation -- Ground Accident Prevention (GAP). https://flightsafety.org/toolkits-resources/past-safety-initiatives/ground-accident-prevention-gap/
  4. FAA Runway Safety Statistics. https://www.faa.gov/airports/runway_safety/statistics
  5. FAA Report to Congress: Injuries and Fatalities of Workers Struck by Vehicles. https://www.faa.gov/sites/faa.gov/files/airports/resources/publications/reports/vehicle_injuries.pdf
  6. EASA Annual Safety Review 2024. https://www.easa.europa.eu/en/document-library/general-publications/annual-safety-review-2024
  7. SKYbrary -- Ground Collision. https://skybrary.aero/articles/ground-collision
  8. SKYbrary -- Wingtip Clearance Hazard. https://skybrary.aero/articles/wingtip-clearance-hazard
  9. SKYbrary -- Pushback. https://skybrary.aero/articles/pushback
  10. NASA ASRS -- Ramp Safety. https://asrs.arc.nasa.gov/publications/directline/dl8_ramp.htm
  11. NASA ASRS -- Ground Jet Blast Hazard. https://asrs.arc.nasa.gov/publications/directline/dl6_blast.htm

Cost Data

  1. IATA: Annual cost of ground damage could reach $10 billion -- https://simpleflying.com/iata-cost-of-ground-damage-aircraft-10-billion/
  2. Boeing FOD cost estimate: $4 billion annually -- https://www.fodcontrol.com/what-is-fod/
  3. Aviation Pros -- The Costs of Ground Damage. https://www.aviationpros.com/aircraft-maintenance-technology/aircraft-technology/maintenance-providers/article/21279424/the-costs-of-ground-damage
  4. Global Aerospace -- Rising Trends in Ground Incidents. https://www.global-aero.com/from-the-hangar-to-the-tarmac-rising-trends-in-ground-incidents/

Incident Reports

  1. NTSB -- Ground crew injuries and fatalities in U.S. commercial aviation, 1983-2004
  2. FAA Runway Incursion Mitigation FY2024 Annual Summary Report. https://www.airporttech.tc.faa.gov/Products/Airport-Safety-Papers-Publications/Airport-Safety-Detail/runway-incursion-mitigation-fiscal-year-2024-annual-summary-report
  3. SKYbrary -- Accident and Serious Incident Reports: GND. https://www.skybrary.aero/index.php/Accident_and_Serious_Incident_Reports:_GND
  4. CBS News -- Airline worker engine ingestion incidents (2022-2023). https://www.cbsnews.com/news/airline-worker-died-sucked-into-plane-engine-ntsb-report/

Jet Blast and FOD

  1. IATA Ground Injury Prevention Program -- Engine Danger Areas (June 2024). https://www.iata.org/contentassets/f135f60f52e9495d9a6bb09aab8e39e7/engine-danger-areas.pdf
  2. SKYbrary -- Jet Efflux Hazard. https://skybrary.aero/articles/jet-efflux-hazard
  3. FAA AC 150/5220-24 -- FOD Detection Equipment
  4. FAA -- Foreign Object Debris Detection System Cost-Benefit Analysis. https://rosap.ntl.bts.gov/view/dot/67541

Airside AV Deployments

  1. Changi Airport autonomous tractor deployment (Jan 2026) -- UISEE. https://www.uisee.com/en/article226-cases1.html
  2. TractEasy (EasyMile/TLD) -- https://tracteasy.com/
  3. EasyMile Safety Report 2023. https://easymile.com/sites/default/files/easymile_safety_report_2023_1.pdf

Scenario and Test Standards

  1. ASAM OpenSCENARIO DSL. https://www.asam.net/standards/detail/openscenario-dsl/
  2. OpenSCENARIO V2.0 Concept Paper. https://releases.asam.net/OpenSCENARIO/2.0-concepts/ASAM_OpenSCENARIO_2-0_Concept_Paper.html

Apron Design and Operations

  1. ICAO Annex 14, Vol I -- Aerodrome Design and Operations
  2. FAA AC 150/5300-13A -- Airport Design
  3. FAA AC 150/5210-20 -- Ground Vehicle Operations on Airports
  4. ICAO Airport Services Manual (Doc 9137) -- Part 8: Airport Operational Services
  5. IATA Airport Handling Manual (AHM), 46th Edition (2026)
  6. IATA Ground Operations Manual (IGOM), 14th Edition (2026)

Public research notes collected from public sources.