CoInfra

What It Is

CoInfra is a large-scale cooperative infrastructure perception system and dataset.

It focuses on infrastructure-to-infrastructure and vehicle-to-infrastructure perception with real roadside nodes.

The system uses multiple synchronized sensing stations connected through a commercial 5G network.

The dataset includes adverse-weather operation, including rain, heavy snow, and freezing rain.

It is both a system paper and a dataset-backed benchmark.

Core Technical Idea

CoInfra treats roadside infrastructure as a cooperative perception platform.

Each node collects local LiDAR and camera data, performs edge processing, and exchanges information through networked infrastructure.

The core idea is not just feature fusion; it is a deployable cooperative infrastructure stack:

Synchronized sensor nodes.
Edge compute at the roadside.
5G data movement.
HD-map anchoring.
Global 3D annotation.
Remote monitoring and operation.

The paper argues that vehicle-only perception can have low visibility in critical frames, while cooperative infrastructure can recover more complete scene coverage.

Inputs and Outputs

Inputs:

Roadside LiDAR point clouds.
Roadside RGB camera images.
Vehicle-side sensor data for the V2I subset.
HD map information.
Sensor extrinsics and global coordinate transforms.
Network and synchronization metadata.
Weather and scene condition labels.

Outputs:

Global 3D object detections.
Cooperative infrastructure perception results.
Vehicle-infrastructure perception results.
Dataset splits for I2I and V2I evaluation.

The annotated classes are road-user classes, not airside operational classes.

Architecture or Benchmark Protocol

CoInfra has two main benchmark modes.

I2I mode:

Multiple infrastructure nodes around the same traffic area cooperate.
The focus is on global roadside perception under weather and occlusion.

V2I mode:

An ego vehicle cooperates with infrastructure nodes.
The focus is whether infrastructure improves the vehicle's critical-frame observability.

The reported system includes 14 synchronized infrastructure nodes and an 8-node roundabout subset for the released I2I dataset.

The arXiv v3 abstract reports about 294k LiDAR samples, 589k camera images, and 332k annotated 3D boxes for I2I, plus a V2I subset with about 46k LiDAR samples, 92k images, and 43k annotated boxes.

Training and Evaluation

Training can use single-node, multi-node, or vehicle-infrastructure inputs depending on the task.

Evaluation emphasizes:

3D detection quality.
Critical-frame completeness.
Performance under adverse weather.
Cooperation benefit over vehicle-only or infrastructure-only perception.
Robustness to realistic networked deployment constraints.

The paper reports that V2I cooperation can raise critical-frame completeness substantially over vehicle-only perception.

Strengths

Real deployed infrastructure, not only a curated perception dataset.
Includes adverse weather, a key gap in many cooperative perception benchmarks.
Combines sensing, networking, edge compute, and mapping concerns.
Contains both I2I and V2I settings.
Commercial 5G operation makes communication constraints more realistic.
Useful for studying infrastructure perception as a production system.

Failure Modes

Roundabout and road-intersection geometry may not transfer cleanly to airport aprons.
5G availability and QoS assumptions may differ on secure airport networks.
Weather improves coverage realism but does not cover jet blast, glare from floodlights, or wet-apron reflections around aircraft.
Global labels depend on calibration and multi-sensor annotation quality.
Cooperative benefit may be overstated if infrastructure nodes have unusually good coverage compared with a constrained airport stand.
Public-road class labels do not cover GSE or aircraft.

Airside AV Fit

CoInfra is one of the best matches for airside infrastructure perception.

Airports are private, map-rich, and can justify fixed sensors around high-value conflict zones.

The adverse-weather emphasis is directly relevant to airport operations.

Airside transfer targets:

Stand-level cooperative perception.
Apron service-road conflict monitoring.
Pushback lane observability.
Snow, rain, and low-visibility ground movement support.
Infrastructure assistance for low-speed autonomous baggage or cargo tractors.

An airside adaptation should measure critical-frame completeness around aircraft and equipment, not just road-user AP.

Implementation Notes

Keep network metadata in the data model; do not evaluate perception without latency and packet-loss context.
Use surveyed airport coordinates and an HD apron map as the global frame.
Mirror the I2I and V2I split: infrastructure-only stand perception first, then vehicle-infrastructure cooperation.
Add airport-specific weather and lighting tags: night floodlights, rain glare, snow banks, deicing mist, and low sun.
Track blind-zone recovery around aircraft fuselage and jet bridge occlusions.
Validate whether private 5G, Wi-Fi 6/7, or wired backhaul is the right communication layer for each stand.

Sources

arXiv paper: https://arxiv.org/abs/2403.10145
Official GitHub: https://github.com/NingMingHao/CoInfra
arXiv PDF: https://arxiv.org/pdf/2403.10145

SLAM Methods

Methods

CoInfra ​

What It Is ​

Core Technical Idea ​

Inputs and Outputs ​

Architecture or Benchmark Protocol ​

Training and Evaluation ​

Strengths ​

Failure Modes ​

Airside AV Fit ​

Implementation Notes ​

Sources ​