Skip to content

CoInfra

What It Is

CoInfra is a large-scale cooperative infrastructure perception system and dataset.

It focuses on infrastructure-to-infrastructure and vehicle-to-infrastructure perception with real roadside nodes.

The system uses multiple synchronized sensing stations connected through a commercial 5G network.

The dataset includes adverse-weather operation, including rain, heavy snow, and freezing rain.

It is both a system paper and a dataset-backed benchmark.

Core Technical Idea

CoInfra treats roadside infrastructure as a cooperative perception platform.

Each node collects local LiDAR and camera data, performs edge processing, and exchanges information through networked infrastructure.

The core idea is not just feature fusion; it is a deployable cooperative infrastructure stack:

  • Synchronized sensor nodes.
  • Edge compute at the roadside.
  • 5G data movement.
  • HD-map anchoring.
  • Global 3D annotation.
  • Remote monitoring and operation.

The paper argues that vehicle-only perception can have low visibility in critical frames, while cooperative infrastructure can recover more complete scene coverage.

Inputs and Outputs

Inputs:

  • Roadside LiDAR point clouds.
  • Roadside RGB camera images.
  • Vehicle-side sensor data for the V2I subset.
  • HD map information.
  • Sensor extrinsics and global coordinate transforms.
  • Network and synchronization metadata.
  • Weather and scene condition labels.

Outputs:

  • Global 3D object detections.
  • Cooperative infrastructure perception results.
  • Vehicle-infrastructure perception results.
  • Dataset splits for I2I and V2I evaluation.

The annotated classes are road-user classes, not airside operational classes.

Architecture or Benchmark Protocol

CoInfra has two main benchmark modes.

I2I mode:

  • Multiple infrastructure nodes around the same traffic area cooperate.
  • The focus is on global roadside perception under weather and occlusion.

V2I mode:

  • An ego vehicle cooperates with infrastructure nodes.
  • The focus is whether infrastructure improves the vehicle's critical-frame observability.

The reported system includes 14 synchronized infrastructure nodes and an 8-node roundabout subset for the released I2I dataset.

The arXiv v3 abstract reports about 294k LiDAR samples, 589k camera images, and 332k annotated 3D boxes for I2I, plus a V2I subset with about 46k LiDAR samples, 92k images, and 43k annotated boxes.

Training and Evaluation

Training can use single-node, multi-node, or vehicle-infrastructure inputs depending on the task.

Evaluation emphasizes:

  • 3D detection quality.
  • Critical-frame completeness.
  • Performance under adverse weather.
  • Cooperation benefit over vehicle-only or infrastructure-only perception.
  • Robustness to realistic networked deployment constraints.

The paper reports that V2I cooperation can raise critical-frame completeness substantially over vehicle-only perception.

Strengths

  • Real deployed infrastructure, not only a curated perception dataset.
  • Includes adverse weather, a key gap in many cooperative perception benchmarks.
  • Combines sensing, networking, edge compute, and mapping concerns.
  • Contains both I2I and V2I settings.
  • Commercial 5G operation makes communication constraints more realistic.
  • Useful for studying infrastructure perception as a production system.

Failure Modes

  • Roundabout and road-intersection geometry may not transfer cleanly to airport aprons.
  • 5G availability and QoS assumptions may differ on secure airport networks.
  • Weather improves coverage realism but does not cover jet blast, glare from floodlights, or wet-apron reflections around aircraft.
  • Global labels depend on calibration and multi-sensor annotation quality.
  • Cooperative benefit may be overstated if infrastructure nodes have unusually good coverage compared with a constrained airport stand.
  • Public-road class labels do not cover GSE or aircraft.

Airside AV Fit

CoInfra is one of the best matches for airside infrastructure perception.

Airports are private, map-rich, and can justify fixed sensors around high-value conflict zones.

The adverse-weather emphasis is directly relevant to airport operations.

Airside transfer targets:

  • Stand-level cooperative perception.
  • Apron service-road conflict monitoring.
  • Pushback lane observability.
  • Snow, rain, and low-visibility ground movement support.
  • Infrastructure assistance for low-speed autonomous baggage or cargo tractors.

An airside adaptation should measure critical-frame completeness around aircraft and equipment, not just road-user AP.

Implementation Notes

  • Keep network metadata in the data model; do not evaluate perception without latency and packet-loss context.
  • Use surveyed airport coordinates and an HD apron map as the global frame.
  • Mirror the I2I and V2I split: infrastructure-only stand perception first, then vehicle-infrastructure cooperation.
  • Add airport-specific weather and lighting tags: night floodlights, rain glare, snow banks, deicing mist, and low sun.
  • Track blind-zone recovery around aircraft fuselage and jet bridge occlusions.
  • Validate whether private 5G, Wi-Fi 6/7, or wired backhaul is the right communication layer for each stand.

Sources

Public research notes collected from public sources.