`isaac_ros_visual_mapping`#

Note

This package is only released as a Debian package and is not available in source form.

Overview#

This package includes the core implementations for visual localization and mapping and exposes the built libraries and headers to ROS 2 ecosystem, containing the following libraries:

cuVGL, short for CUDA-accelerated Visual Global Localization, details see: Visual Global Localization.
cuSFM, short for CUDA-accelerated Structure from Motion pipeline.

This is a list of tools provided by the package:

feature_extractor_main: a tool to extract image features from raw images.
generate_bow_vocabulary_main: a tool to create, refine or incrementally build a BoW vocabulary.
generate_bow_index_main: a tool to generate BoW image retrieval index for images.

cuVGL#

This is an example of creating cuVGL and calling localization with images:

using namespace nvidia::isaac;

visual::cuvgl::cuVGL cuvgl;

std::string vgl_map_dir = "<vgl map dir>";
std::string localizer_config_dir = "<localizer config dir>";
std::string vgl_model_dir = "<vgl model dir>";

// Initialize
cuvgl.Init(vgl_map_dir, localizer_config_dir, vgl_model_dir);

// Add camera sensors
std::vector<common::transform::Transform3D> cam_to_vehicles(2);
std::vector<common::sensor::CameraSensor> camera_sensors(2);

cuvgl.AddCamera(0, cam_to_vehicles[0], camera_sensors[0]);
cuvgl.AddCamera(1, cam_to_vehicles[1], camera_sensors[1]);

// Specify the camera id for stereo pairs.
cuvgl.SetStereoCameraParamsIDFromString("0,1");

// Localize with camera images
std::vector<visual::cuvgl::CameraImage> camera_images(2);
common::transform::SE3TransformD vehicle_to_map;
cuvgl.Localize(camera_images, vehicle_to_map);

Tools#

You can use -help to find the command line flags.

feature_extractor_main#

It extracts image feature for raw images in a directory and saves the extracted feature to the output directory. It supports sampling images based on a translation and rotation distance.

feature_extractor_main \
    --input_image_directory <input_image_dir> \
    --output_dir <output_keyframe_dir> \
    --keypoint_creation_config configs/isaac/keypoint_creation_config.pb.txt \
    --feature_type <feature_type>

These are the available options:

input_image_directory: required, the input raw image directory which should include raw images and frames_metadata.pb.txt file for the image metadata. Default value: “”.
output_dir: required, the output directory for the extracted keyframes. Default value: “”.
num_thread: optional, number of threads for feature extraction. Default value: 1.
frames_meta_file: optional, the overridden input metadata file. Default value: “”.

For configuring feature extraction:

feature_type: required, type of feature to extract, available options are sift, aliked, and superpoint.
keypoint_creation_config: optional, in KeypointCreationParams text protobuf. Default value: configs/isaac/keypoint_creation_config.pb.txt.
model_dir: optional, the model directory for the feature extractor. Default value: “”.

For associating synchronized images:

sample_sync_threshold_microseconds: optional, if the value is greater than 0, specifying the max allowable timestamp difference of two keyframes to be considered the same sample. Default value: 0.

For computing stereo camera pairs:

stereo_pair_non_baseline_max_distance: optional, if the value is greater than 0, specifying the max allowable translation distance between two stereo camera pairs to be considered the same stereo pair. Default value: 0.

For sampling images:

min_inter_frame_distance: optional, the translation distance threshold to select the next keyframe. Default value: 0.5 meters.
min_inter_frame_rotation_degrees: optional, the rotation distance threshold to select the next keyframe. Default value: 5 degrees.

For saving debug data:

debug_dir: optional, the output directory for saving debug data. Default value: “”.
debug_image_interval: optional, the number of interval for saving debug images. Default value: 100.

generate_bow_vocabulary_main#

It supports building vocabulary using KMeans or BIRCH (Balanced Iterative Reducing and Clustering using Hierarchies). The recommended way for creating a new vocabulary is to run KMeans clustering, and for incrementally building a vocabulary is to run BIRCH clustering + KMeans refinement.

An example for creating a new vocabulary with KMeans clustering:

generate_bow_vocabulary_main \
    --keyframe_directory <input_keyframe_dir> \
    --output_voc_dir <output_voc_dir> \
    --image_retrieval_config configs/isaac/image_retrieval_config.pb.txt

These are the available options:

output_voc_dir: required, the output directory for saving the vocabulary. Default value: “”.
input_voc_dir: optional, the input directory for loading an existing vocabulary. Default value: “”.
keyframe_directory: required, the input directory for loading keyframe features to build a vocabulary.
image_retrieval_config: required, in ImageRetrievalConfig text protobuf. Default value: configs/isaac/image_retrieval_config.pb.txt.

The binary runs different algorithms depending on the parameters:

input_voc_dir is empty or not specified : create a new vocabulary from keyframe features using KMeans clustering.
input_voc_dir is not empty: incrementally build a vocabulary by adding new keyframe features to the existing vocabulary using BIRCH clustering.

generate_bow_index_main#

It generates image retrieval index for images in the input directory. It first looks up the closest visual words in the vocabulary for all features in the image, generates the global descriptor for the image, and saves it in a lookup table for fast image retrieval.

generate_bow_index_main \
    --keyframe_directory <input_keyframe_dir> \
    --output_map_dir <output_map_dir> \
    --image_retrieval_config configs/isaac/image_retrieval_config.pb.txt

These are the available options:

keyframe_directory: required, the input directory for keyframe features and keyframe metadata file. Default value: “”.
output_map_dir: required, the output directory for saving the image retrieval index. Default value: “”.
image_retrieval_config: required, in ImageRetrievalConfig text protobuf. Default value: configs/isaac/image_retrieval_config.pb.txt.

isaac_ros_visual_mapping#

Overview#

cuVGL#

Tools#

feature_extractor_main#

generate_bow_vocabulary_main#

generate_bow_index_main#

`isaac_ros_visual_mapping`#