Skip to main content

Associating Landmarks from SLAM’s Visual Structure

Published onMay 28, 2024
Associating Landmarks from SLAM’s Visual Structure
·

ABSTRACT

Place recognition is the online task of detecting revisits to previously seen locations and is a key to many navigational systems. In Simultaneous Localization and Mapping, recovering the relative camera pose between recognized visit and revisit (e.g. using bundle adjustment) allows for global map optimization, improving localization accuracy. Visual SLAM recovers structure to estimate camera movement but it is typically not used for visual place recognition. Limited past work which adapted LiDAR place recognition descriptors to SLAM-recovered physical structure found superior robustness to visual effects vs appearance-based VPR, but overall had poorer recall. It was found that LiDAR descriptors’ whole-scan matching assumes excellent 360 degree pointcloud coverage while cameras have limited FoV. We observe that SLAM-tracked points congregate on objects and distinct elements, resulting in sparsity that impacts whole-scan matching. To us this also suggests use of clustering to extract these aggregate congregations as landmarks whose configuration can be matched. Exploring this approach we found that the landmarks generated still vary in detected position, but a far more significant hurdle is that the same landmarks may not be repeatedly clustered each time a scene is visited. This is due to large-scale clustering still being sensitive to instability in the individual SLAM points. This was improved significantly but not sufficiently through visual semantic labeling of the initial 3D points, helping to provide more stable, guided clustering solutions. Still, single missing or “outlier” landmarks are detrimental to successful association between landmark sets. To address this instability in future work we recommend careful selection of salient points from those collected by SLAM, for those which can be expected to be the most stable and repeatably detected. This is expected to provide more stable landmarks than large-scale clustering of detected points which relies on a center-of-mass approach.

Month: May

Year: 2024

Venue: 21st Conference on Robots and Vision

URL: https://crv.pubpub.org/pub/fvr9yv9c


Comments
0
comment
No comments here
Why not start the discussion?