GLIM

Executive Summary

GLIM is a modern, open-source 3D range-inertial localization and mapping framework focused on accurate map building with direct scan-matching factors on factor graphs. It is best understood as a complete research and prototyping SLAM/mapping stack, not just an optimizer and not merely a LiDAR odometry front end.

Its distinguishing idea is to replace the usual "estimate pairwise scan matches, then optimize relative pose constraints" pattern with GPU-accelerated registration-error factors that can be optimized directly in GTSAM-style factor graphs. In odometry, GLIM uses fixed-lag smoothing and keyframe-based multi-scan matching. In global mapping, it optimizes submap poses by directly minimizing matching costs between overlapping submaps.

For autonomous vehicles, GLIM is most useful for survey mapping, offline map refinement, research on robust range-inertial SLAM, and prototyping factor-graph localization extensions. It is less likely to be the production online localization stack unchanged, because production AV localization usually needs deterministic latency, map versioning, safety monitors, dynamic-object handling, calibrated covariance outputs, localization against a frozen map, and integration with GNSS, wheel odometry, IMU, perception, and fleet map operations.

What It Is

GLIM is a versatile and extensible range-based 3D localization and mapping framework by Koide, Yokozuka, Oishi, and Banno. The paper title is "GLIM: 3D Range-Inertial Localization and Mapping with GPU-Accelerated Scan Matching Factors."

The official repository describes GLIM as a range-based 3D mapping framework designed for accuracy, ease of use, sensor versatility, and extensibility. It supports many range sensor types, including spinning LiDAR, non-repetitive LiDAR, solid-state LiDAR, RGB-D cameras, and other depth sensors, provided the sensor setup and calibration are handled correctly.

GLIM uses:

GTSAM as the factor-graph optimization backend.
gtsam_points for point-cloud SLAM factors, scan-matching factors, GPU VGICP factors, nearest-neighbor structures, and related optimizers.
Optional CUDA for GPU-accelerated scan matching and global mapping.
ROS 2 integration through glim_ros2.
Offline viewer and map editor tools for manual loop creation, map correction, point removal, point-cloud export, and multi-session merging.
Extension modules and global callback slots for adding constraints or accessing internal mapping state.

Why It Matters

Most practical LiDAR SLAM systems separate the problem into a fast local odometry front end and a global pose-graph backend. That architecture is efficient, but it often compresses scan matching into relative pose measurements with approximate covariances. In difficult cases, such as small-overlap loops, degenerate point clouds, or changing sensor geometry, that approximation can be weak.

GLIM matters because it pushes direct registration cost into the graph itself:

text

Instead of:
  scan matching -> relative pose + covariance -> pose graph optimization

GLIM emphasizes:
  point-cloud registration error factors -> direct graph optimization

This is computationally heavier, but GLIM makes it practical with GPU-accelerated factor evaluation and a system design that keeps odometry, local mapping, and global mapping organized as separate modules.

For AV mapping, this matters because mapping quality is often dominated by accumulated alignment errors, loop-closure quality, and the ability to repair or refine maps offline. GLIM is closer to an advanced mapping workbench than a minimal odometry package.

Core Idea

GLIM formulates 3D range-inertial SLAM as factor-graph optimization with scan-matching factors that directly evaluate point-cloud registration error.

The main technical ideas are:

Direct multi-scan registration: optimize scan or submap alignment by minimizing registration errors, not only precomputed relative-pose residuals.
GPU-accelerated scan-matching factors: evaluate and linearize many point-cloud matching costs efficiently on GPU.
Voxelized GICP-style matching: use distribution-to-distribution point/voxel alignment with voxel-based association.
Fixed-lag smoothing odometry: keep recent states active for a few seconds instead of immediately freezing each state like a filter.
Keyframe-based point-cloud matching: connect the latest frame to selected past keyframes to reduce drift.
Global submap optimization: create submaps, then optimize their poses with global matching cost minimization.
Tightly coupled inertial constraints: use IMU factors in odometry and global mapping to stabilize pose estimation, especially gravity-related degrees of freedom.
Extensibility: add custom constraints, callbacks, visual features, GNSS factors, loop detectors, or other modules through extension points.

Pipeline

Receive range data, IMU data, and optional camera/image inputs.
Preprocess point clouds with downsampling and nearest-neighbor preparation.
Deskew or motion-compensate point clouds using inertial prediction when applicable.
Build an odometry factor graph over recent frames.
Add IMU preintegration factors between consecutive states.
Add GPU scan-matching factors between the latest frame and selected keyframes.
Run fixed-lag smoothing so recent states can still be corrected.
Create local submaps from optimized local frames.
Add submap-level scan-matching factors for global consistency.
Run global trajectory optimization over submaps.
Export trajectories and map data.
Use the offline viewer or map editor for manual loop closure, plane constraints, map cleanup, point removal, PLY export, or multi-session merging.

GLIM separates odometry estimation, local mapping, and global mapping, but the modules are designed as one integrated process for efficiency.

Where It Fits

GLIM fits in the SLAM stack as a full 3D range-inertial mapping framework:

text

Sensors:
  LiDAR / depth / range camera + IMU + optional cameras

Front end:
  direct scan matching, keyframes, fixed-lag smoothing

Backend:
  GTSAM-based factor graphs with gtsam_points scan-matching factors

Map layer:
  submaps, dense point-cloud outputs, offline correction tools

Best use:
  research, mapping, offline refinement, prototyping advanced factor constraints

It should not be confused with GTSAM. GTSAM is a general C++ factor-graph library. GLIM is an application-level SLAM system that uses GTSAM and gtsam_points to solve range-inertial mapping problems.

It should also not be confused with FAST-LIO, LIO-SAM, or KISS-ICP:

FAST-LIO/FAST-LIO2 are fast tightly coupled LiDAR-inertial odometry systems using filtering-style estimation. They are strong online odometry front ends.
LIO-SAM is a factor-graph LiDAR-inertial SLAM system with LOAM-style features, IMU preintegration, GPS factors, and loop closure.
KISS-ICP is a simple LiDAR-only odometry baseline built around point-to-point ICP and a local voxel map.
GLIM is heavier, more globally optimized, and more map-workflow-oriented, with GPU scan-matching factors and offline correction tooling.

Strengths

Strong mapping orientation: submap optimization, loop refinement, manual correction, and multi-session workflows are first-class concerns.
Direct registration costs avoid relying only on approximate relative-pose constraints.
Fixed-lag smoothing can recover from short periods of geometric degeneracy better than purely causal scan-to-model pipelines.
Supports a broad class of range sensors rather than being tied to one mechanical LiDAR pattern.
GPU acceleration makes computationally expensive multi-scan and submap matching practical.
Built on GTSAM, making the graph structure familiar to robotics researchers.
gtsam_points exposes reusable point-cloud factors and optimizers beyond GLIM itself.
Extension modules make it suitable for research on GNSS, visual constraints, loop detection, velocity constraints, calibration checks, or domain-specific factors.
Offline viewer and map editor are valuable for real mapping operations where fully automatic SLAM is not enough.

Limitations

GPU acceleration is central to GLIM's practical value; CPU-only configurations exist but may not deliver the same throughput or mapping quality.
The stack is more complex than minimal odometry systems and has more integration surface: GTSAM, gtsam_points, CUDA, ROS 2, sensor calibration, configuration files, and visualization tools.
Fixed-lag smoothing only protects against degeneration within the optimization window. Long-term range-data degeneracy still needs another source such as camera, radar, wheel odometry, GNSS, or a motion prior.
Direct scan-matching factors can still fail when geometry is dynamic, repetitive, sparse, reflective, or dominated by moving objects.
Offline correction is a strength for mapping, but it also signals that production map generation may require human-in-the-loop QA.
The open-source extension modules are examples or proofs of concept and may carry separate maintenance and license considerations.
Dense maps and submap graphs can become heavy at city or fleet scale; large-scale production mapping may need distributed optimization, tiling, map versioning, and cloud processing around GLIM.
It is not a drop-in safety-certified localization component.

AV Relevance

For autonomous vehicles, GLIM is most relevant in four roles:

Survey mapping and HD-map construction: GLIM's dense point-cloud mapping, submap optimization, manual correction, and multi-session merge tools are useful for building high-quality 3D maps from repeated drives.
Research baseline: It is a strong method to compare against FAST-LIO2, LIO-SAM, KISS-ICP, CT-ICP, Cartographer 3D, and LiDAR bundle-adjustment methods when evaluating mapping accuracy and robustness.
Prototyping factor-graph localization: Because it exposes callback slots and uses GTSAM-style variables, it is a practical environment for adding GNSS, wheel odometry, map priors, camera constraints, loop detectors, or custom vehicle-motion factors.
Offline refinement: GLIM is well suited to post-processing recorded routes, repairing failed loops, removing transient objects, and exporting cleaned maps.

For production AV online localization, GLIM should usually be treated as a component or reference implementation rather than the full answer. A production stack normally needs:

Localization against a frozen, versioned prior map.
Bounded-latency pose output independent of global optimization jumps.
Robust dynamic-object filtering.
Covariance calibration and health metrics.
GNSS/INS/wheel odometry fusion with gating.
Degradation modes and fallback estimators.
Integration with map release, QA, and fleet data pipelines.
Operational monitoring and replayable safety evidence.

A practical AV architecture may use GLIM offline to build maps, then use a leaner online scan-to-map localizer or fusion backend for vehicle operation.

Indoor/Outdoor Notes

Indoor: GLIM is strong for corridors, warehouses, campuses, tunnels, stairs, labs, industrial spaces, and mixed indoor/outdoor trajectories. Its surface-orientation validation and multi-resolution voxel maps are especially relevant indoors where thin walls, opposite wall faces, and tight geometry can cause wrong correspondences.

Outdoor: GLIM is relevant for urban roads, campuses, industrial yards, ports, mines, airports, and large facilities where submap optimization and loop closure matter. Outdoor use benefits from rich structure such as buildings, poles, curbs, walls, vegetation, and parked assets.

Open areas: Large aprons, fields, highways, ramps, and sparse lots can still be geometrically underconstrained. IMU helps, but long-term degeneration needs additional constraints such as RTK-GNSS, wheel odometry, camera/radar factors, map priors, or surveyed control points.

Dynamic scenes: Vehicles, aircraft, pedestrians, baggage carts, forklifts, and temporary equipment can pollute maps. GLIM's map editor helps remove objects offline, but online dynamic-object handling must be solved upstream or through extensions.

Comparison

Method	Main role	Estimation style	Map/global correction	AV interpretation
GLIM	Range-inertial 3D SLAM and mapping framework	Factor graph with direct scan-matching factors	Strong submap/global optimization and offline tools	Excellent mapping/research stack; production localization needs wrapping
GTSAM	Optimization library	Generic factor graphs	Not a SLAM system by itself	Backend building block, not a localization product
`gtsam_points`	Point-cloud SLAM factors and optimizers	GTSAM-compatible scan-matching factors	Supports GLIM and related experiments	Useful library for custom LiDAR/range graph systems
FAST-LIO2	Real-time LiDAR-inertial odometry	Iterated Kalman filtering, scan-to-map	No full mapping workbench by default	Strong online odometry front end
LIO-SAM	LiDAR-inertial SLAM	GTSAM factor graph with feature scan matching	Loop closure and GPS factors	Clear educational/reference architecture
KISS-ICP	LiDAR-only odometry	Simple ICP against local map	No loop closure in core	Strong baseline/fallback odometry
Cartographer 3D	Submap SLAM	Local scan matching plus pose graph	Mature submap loop closure, older stack	Useful reference/offline mapper, less modern as LIO front end

Evaluation

The GLIM paper evaluates robustness under degenerate range data, cross-sensor mapping, Newer College sequences, and NTU VIRAL-style LiDAR/visual-inertial datasets. Reported comparisons include FAST-LIO2, LIO-SAM, LINS, CLINS, DLO, VoxelMap, BALM, SLICT, and visual-LiDAR-inertial methods.

Evaluation dimensions that matter for AV use:

Absolute trajectory error against survey-grade ground truth, RTK/INS, motion capture, or control points.
Relative trajectory error over fixed path lengths.
Drift per kilometer and per minute.
Map consistency after loop closure.
Submap alignment error before and after global optimization.
Runtime per frame and per submap on target hardware.
GPU memory and CPU/GPU utilization.
Failure behavior under long open-road or apron-like degeneracy.
Robustness to dynamic objects and temporary infrastructure.
Repeatability across different vehicle speeds, LiDAR models, weather, and traffic conditions.
Localization impact when using GLIM-generated maps in a separate online localizer.

For airport or industrial-yard AVs, add:

Lateral error against surveyed lane centerlines.
Yaw error during docking or stand approach.
Map cleanliness after removing aircraft, vehicles, and temporary objects.
Cross-session consistency across different days and traffic states.
GNSS-denied or GNSS-multipath performance near buildings, hangars, terminals, and jet bridges.

Implementation Notes

Use the ROS 2 path first for current integration; glim_ros2 is the active ROS integration.
CUDA is optional in the build, but GLIM's headline performance depends on GPU acceleration.
Official docs list Ubuntu 22.04/24.04, CUDA 12.x/13.x options, and NVIDIA Jetson Orin testing.
Build and version alignment matter: GLIM depends on compatible GTSAM and gtsam_points versions.
Keep BUILD_WITH_MARCH_NATIVE conservative unless all dependent libraries are built consistently.
Sensor extrinsics are critical. Range sensor, IMU, and camera transforms must be known and stable.
For ROS 2 configuration, manage config paths carefully; installed package configs may require rebuilds unless using symlink install or external config paths.
GLIM writes dump data and trajectory files that can be inspected and edited offline.
The offline viewer can create explicit loop constraints, plane bundle-adjustment constraints, and export map point clouds.
The map editor can remove selected points or segmented objects, useful for cleaning dynamic objects before map release.
Multi-session merge supports indoor and outdoor registration presets, manual alignment, fine registration, and global matching cost fusion.
Extension callbacks run across odometry, submapping, and global mapping threads, so extension modules must be thread-safe.
For AV deployments, isolate global map optimization from the control pose stream. Global optimization corrections are useful for mapping, but online vehicle control needs a stable localization frame with explicit correction handling.

Practical Recommendation

Use GLIM when the goal is high-quality 3D mapping, offline refinement, multi-session map construction, or research on direct range-inertial factor graphs. It is especially compelling when GPU hardware is available and when map quality matters more than minimal runtime complexity.

Do not treat GLIM as a finished production AV localization stack by itself. For production, use it as a mapping and research component, then build or select a separate online localization layer that localizes against validated maps, fuses vehicle sensors, handles dynamic objects, exposes health metrics, and satisfies latency and safety requirements.

Sources

GLIM official repository. https://github.com/koide3/glim
GLIM documentation. https://koide3.github.io/glim/
GLIM getting started and offline viewer documentation. https://koide3.github.io/glim/quickstart.html
GLIM multi-session merging documentation. https://koide3.github.io/glim/merge.html
GLIM manual object removal documentation. https://koide3.github.io/glim/edit.html
GLIM extension documentation. https://koide3.github.io/glim/extend.html
GLIM API list and related repositories. https://koide3.github.io/glim/api.html
Koide, Yokozuka, Oishi, and Banno, "GLIM: 3D Range-Inertial Localization and Mapping with GPU-Accelerated Scan Matching Factors." https://arxiv.org/abs/2407.10344
gtsam_points official repository. https://github.com/koide3/gtsam_points
GTSAM official site. https://gtsam.org/

SLAM Methods

Methods

GLIM ​

Executive Summary ​

What It Is ​

Why It Matters ​

Core Idea ​

Pipeline ​

Where It Fits ​

Strengths ​

Limitations ​

AV Relevance ​

Indoor/Outdoor Notes ​

Comparison ​

Evaluation ​

Implementation Notes ​

Practical Recommendation ​

Sources ​