Skip to content

Behavior Planning and Maneuver Arbitration

Purpose: Define the tactical planning layer that converts mission goals, maps, traffic rules, operational instructions, V2X, prediction, and fallback policy into maneuvers that a motion planner and controller can execute.

Key Takeaway: Behavior planning is the layer that prevents a route follower from becoming unsafe or operationally naive. It must arbitrate between progress, yielding, stopping, avoiding, rerouting, docking, remote approval, V2X advice, and safe fallback. For airside AVs, this layer must encode airport authority and task state as first-class inputs, not only road-style lane-change and obstacle-avoidance logic.

Research current as of: 2026-05-09


Problem Framing

An autonomous vehicle route says "go to Stand B7." A motion planner can generate a smooth path along a lane or reference line. Neither is enough to decide whether the vehicle should yield to a pushback tug, wait for ramp-control clearance, pass a stopped baggage cart, hold outside a jet-blast polygon, pull over for an emergency vehicle, or abort a docking approach because tracking error is too high.

Behavior planning is the tactical decision layer between route/mission planning and trajectory generation. Its output should be more than a path: it should include maneuver intent, drivable area, stop/yield decisions, approval requirements, and reasoning tags that downstream validation and operators can inspect.

For airside deployments, behavior arbitration has to combine:

  • Static rules: airport driving rules, speed zones, stand boundaries, hold-short rules.
  • Dynamic authority: A-CDM milestones, ramp-control instructions, movement-area clearance, NOTAMs.
  • Scene state: aircraft, GSE, personnel, FOD, de-icing zones, blocked service roads.
  • Prediction and interaction: whether nearby agents are yielding, crossing, docking, or pushing back.
  • System state: localization confidence, V2X health, controller trackability, degraded modes.

Method and Architecture Taxonomy

Tactical Maneuver Set

ManeuverRoad AnalogueAirside / Industrial Extension
Lane / corridor followLane keepingService-road, taxi-lane-adjacent, warehouse aisle following
Stop / holdStop line, red lightHold-short line, stand-access gate, clearance timeout
Yield / proceedIntersection negotiationAircraft always priority, pushback priority, emergency GSE
Avoid / passObstacle avoidancePass parked loader only if stand envelope and task zone allow
Lane change / side shiftLane changeShift within wide apron corridor or around temporary closure
Pull over / clear laneShoulder pull-overClear stand before pushback or emergency route
Pull out / mergeParking departureLeave staging bay, charger, hangar, or stand pocket
Dock / precision approachParkingBelt loader, tug, dolly, charging, aircraft-service-point approach
Reverse / multi-point maneuverParking reverseTight stand repositioning and tug/dolly alignment
Convoy / followCar followingFollow-me vehicle, tug convoy, baggage-cart train
Abort / safe stopMinimum-risk maneuverLoss of clearance, V2X trust failure, controller infeasibility

Arbitration Architectures

ArchitecturePatternStrengthLimit
FSM / scenario-stage-taskExplicit scenario states and transitionsAuditable, deterministic, common in production stacksState explosion and brittle corner cases
Behavior treePrioritized modular behaviorsGood for safety overrides and fallback logicNeeds careful priority design to avoid starvation
Rule engineFeasible behavior set plus resolution policyExplainable and field-tunableRule conflicts and hidden interactions accumulate
Utility / cost arbitrationScore candidate maneuvers by safety, progress, comfort, prioritySmooth tradeoffs and easy rankingHard constraints must be separated from soft costs
Optimization-based integrated behavior/controlSolve maneuver and motion together under rulesFewer interface mismatchesMore complex, harder to certify and debug
Game-theoretic / joint prediction-planningModels mutual influence between agentsBetter at negotiation and dense interactionsCompute and data demand, unclear guarantees
Learned / LLM/VLM behavior plannerUses data and commonsense reasoning for long-tail decisionsFlexible and interpretable when aligned to decision statesNeeds constrained output space and safety monitor
Simplex / dual-stack arbitrationAdvanced behavior source plus conservative baselinePractical path to deploy learned modulesRequires a correct monitor and clear switching semantics

Reference Module Boundary

Autoware's planning architecture is a useful reference decomposition:

text
Mission planner
  -> route
Scenario planner
  -> behavior planner: path, drivable area, turn signals, planning factors
  -> motion planner: time-parameterized trajectory
Validation
  -> safety-checked trajectory
Control
  -> actuator commands

For airside, add an operational authority adapter and a degraded-mode policy:

text
A-CDM / AODB / ramp control / NOTAM / V2X
  -> authority and task-state model
  -> behavior arbitration
  -> maneuver intent + constraints
  -> trajectory generation
  -> validation and command gate

Decision Inputs and Outputs

InterfaceRequired Content
Inputs from route / missiondestination, route corridor, task type, deadline, priority
Inputs from maplanelets/corridors, stand polygons, speed zones, no-go zones, hold lines
Inputs from perception / predictiondynamic objects, tracks, predicted occupancy, occlusion zones, uncertainty
Inputs from operationsflight phase, stand status, clearances, NOTAMs, emergency priority
Inputs from V2Xstatus, intent, cooperative perception, maneuver requests/advice
Inputs from system healthlocalization confidence, controller envelope, DBW health, connectivity
Outputs to motion plannerselected maneuver, path/drivable area, target stop/yield points, speed caps
Outputs to validation / HMIplanning factors, approval requests, reason for stop, fallback state

Evaluation and Deployment Notes

Behavior planning should be evaluated with scenario labels, not only trajectory metrics. A trajectory can be smooth and collision-free while making the wrong tactical choice.

Recommended metrics:

  • Decision correctness: expected maneuver class, stop/yield/proceed decision, clear/hold response.
  • Rule compliance: speed zone, hold-short, stand access, aircraft priority, temporary closure compliance.
  • Safety: collision-free, TTC margin, clearance envelope, no unsafe occlusion entry.
  • Progress: route completion, unnecessary stop count, blocked-zone time, deadlock/livelock rate.
  • Explainability: reason tags match actual decision inputs; operator can diagnose "why stopped."
  • Stability: no rapid oscillation between proceed/stop, avoid/follow, or route alternatives.
  • Fallback: correct transition to degraded speed, safe stop, remote approval, or reroute.

Deployment should start with a conservative module set:

  1. Rule-based lane/corridor following, stop/hold, speed-zone, obstacle avoidance, and clear-stand behaviors.
  2. Reservation-aware behavior for service-road pinch points, connected to the fleet traffic manager.
  3. V2X intent consumption as advisory first, then agreement-seeking once trust and timing are validated.
  4. Learned or LLM/VLM behavior suggestions only inside a constrained action vocabulary and behind a safety monitor.
  5. Shadow-mode behavior comparison against human, production, or conservative baseline decisions before activation.

Behavior modules should publish planning factors for every stop, yield, reroute, and failed maneuver candidate. This is essential for remote operations and safety-case evidence.


Indoor / Outdoor / Airside Fit

DomainFitBehavior-Specific Notes
Indoor warehouse / factoryVery highAisle right-of-way, pedestrian zones, forklift/load constraints, WMS task priority
Outdoor yard / depotVery highTrailer lanes, gate queues, blocked roads, private-road speed zones, teleoperation fallback
Public roadHighLane change, intersection, crosswalk, traffic light, pull-over, emergency vehicle handling
Airside apronVery highStand sequencing, aircraft priority, pushback, jet blast, FOD, clearance and ramp-control state
Movement areaHigh with strict authorityDefault-deny hold-short logic and explicit clearance TTL must dominate progress

Airside behavior planning is closer to industrial operations than public-road driving in one respect: the planner can rely on bounded maps, fleet identity, task schedules, and a controlling authority. It is harder in another respect: the rule source is multi-layered and operationally dynamic.


Failure Modes

Failure ModeSymptomMitigation
Rule conflictPlanner alternates between incompatible modulesPriority hierarchy, hard-vs-soft constraint separation, conflict logging
State explosionFSM cannot cover mixed stand, service-road, and emergency casesBehavior tree or rule engine with modular predicates
Overconservative stopVehicle freezes in dense ramp activityInteraction-aware prediction, courtesy/yield timers, traffic manager reservations
Unsafe assertivenessPlanner proceeds through ambiguous right-of-wayDefault-deny for authority-critical zones, clearance TTL, safety gates
V2X advice overtrustedVehicle follows stale or malicious maneuver adviceTrust scoring, freshness checks, cross-check with onboard perception and map
Learned behavior hallucinationLLM/VLM emits unrecognized or unsafe actionConstrained action/state vocabulary, validator, Simplex fallback
Untrackable maneuverBehavior selects a maneuver beyond controller envelopeFeasibility query to motion planner/controller before approval
Operator opacityRemote operator cannot understand why vehicle stoppedPlanning factors, event replay, reason codes
Local optimum deadlockPairwise yielding blocks a service roadFleet-level reservation and deadlock prevention layer
Airport-specific procedure mismatchBehavior is legal at one airport and wrong at anotherSite-specific rule packs, map overlays, and approval workflow


Sources

Public research notes collected from public sources.