Supported Datasets

The EuRoC MAV Dataset

The ETH ASL EuRoC MAV dataset [Burri2016IJRR] is one of the most used datasets in the visual-inertial / simultaneous localization and mapping (SLAM) research literature. The reason for this is the synchronised inertial+camera sensor data and the high quality groundtruth. The dataset contains different sequences of varying difficulty of a Micro Aerial Vehicle (MAV) flying in an indoor room. Monochrome stereo images are collected by a two Aptina MT9V034 global shutter cameras at 20 frames per seconds, while a ADIS16448 MEMS inertial unit provides linear accelerations and angular velocities at a rate of 200 samples per second.

We recommend that most users start testing on this dataset before moving on to the other datasets that our system support or before trying with your own collected data. The machine hall datasets have the MAV being picked up in the beginning and then set down, we normally skip this part, but it should be able to be handled by the filter if SLAM features are enabled. Please take a look at the run_ros_eth.sh script for some reasonable default values (they might still need to be tuned).

@m_class{m-block m-warning}

Groundtruth on V1_01_easy
We have found that the groundtruth on the V1_01_easy dataset is not accurate in its orientation estimate. We have recomputed this by optimizing the inertial and vicon readings in a graph to get the trajectory of the imu (refer to our vicon2gt [Geneva2020TRVICON2GT] project). You can find the output at this link and is what we normally use to evaluate the error on this dataset.

@m_div{m-text-center}

Dataset Name Length (m) Dataset Link Groundtruth Traj. Config
Vicon Room 1 01 58 rosbag, rosbag2 link config
Vicon Room 1 02 76 rosbag , rosbag2 link config
Vicon Room 1 03 79 rosbag, rosbag2 link config
Vicon Room 2 01 37 rosbag, rosbag2 link config
Vicon Room 2 02 83 rosbag, rosbag2 link config
Vicon Room 2 03 86 rosbag, rosbag2 link config
Machine Hall 01 80 rosbag, rosbag2 link config
Machine Hall 02 73 rosbag, rosbag2 link config
Machine Hall 03 131 rosbag, rosbag2 link config
Machine Hall 04 92 rosbag, rosbag2 link config
Machine Hall 05 98 rosbag, rosbag2 link config

@m_enddiv

TUM Visual-Inertial Dataset

The TUM Visual-Inertial Dataset [Schubert2018IROS] is a more recent dataset that was presented to provide a way to evaluate state-of-the-art visual inertial odometry approaches. As compared to the EuRoC MAV datasets, this dataset provides photometric calibration of the cameras which has not been available in any other visual-inertal dataset for researchers. Monochrome stereo images are collected by two IDS uEye UI-3241LE-M-GL global shutter cameras at 20 frames per second, while a Bosch BMI160 inertial unit provides linear accelerations and angular velocities at a rate of 200 samples per second. Not all datasets have groundtruth available throughout the entire trajectory as the motion capture system is limited to the starting and ending room. There are quite a few very challenging outdoor handheld datasets which are a challenging direction for research. Note that we focus on the room datasets as full 6 dof pose collection is available over the total trajectory.

@m_class{m-block m-warning}

Filter Initialization from Standstill
These datasets have very non-static starts, as they are handheld, and the standstill initialization has issues handling this. Thus careful tuning of the imu initialization threshold is typically needed to ensure that the initialized orientation and the zero velocity assumption are valid. Please take a look at the run_ros_tumvi.sh script for some reasonable default values (they might still need to be tuned). One can enable dynamic initialization to avoid this problem via the init_dyn_use configuration value.

@m_div{m-text-center}

Dataset Name Length (m) Dataset Link Groundtruth Traj. Config
room1 147 rosbag link config
room2 142 rosbag link config
room3 136 rosbag link config
room4 69 rosbag link config
room5 132 rosbag link config
room6 67 rosbag link config

@m_enddiv

RPNG AR Table Dataset

The Indoor AR Table Visual-Inertial Datasets [Chen2023ICRA] were collected to demonstrate the impact of estimating long-term planar surfaces within a visual-inertial estimator. An Intel Realsense D4553 with 30Hz RGB-D (depth was not used) and 400Hz BMI055 IMU along with 100Hz OptiTrack poses were recorded in 1-2 minute segments. The groundtruth was recovered using the vicon2gt utility [Geneva2020TRVICON2GT].

@m_div{m-text-center}

Dataset Name Length (m) Dataset Link Size (GB) Groundtruth Traj. Config
table1 56 rosbag 4.77 link config
table2 44 rosbag 5.54 link config
table3 88 rosbag 13.19 link config
table4 91 rosbag 11.49 link config
table5 75 rosbag 11.66 link config
table6 50 rosbag 5.26 link config
table7 63 rosbag 9.02 link config
table8 125 rosbag 16.01 link config

@m_enddiv

RPNG OpenVINS Dataset

In additional the community maintained datasets, we have also released a few datasets. Please cite the OpenVINS paper if you use any of these datasets in your works. Here are the specifics of the sensors that each dataset uses:

  • ArUco Datasets:
    • Core visual-inertial sensor is the VI-Sensor
    • Stereo global shutter images at 20 Hz
    • ADIS16448 IMU at 200 Hz
    • Kalibr calibration file can be found here
  • Ironsides Datasets:
    • Core visual-inertial sensor is the ironsides
    • Has two Reach RTK one subscribed to a base station for corrections
    • Stereo global shutter fisheye images at 20 Hz
    • InvenSense IMU at 200 Hz
    • GPS fixes at 5 Hz (/reach01/tcpfix has corrections from NYSNet)
    • Kalibr calibration file can be found here

@m_class{m-block m-warning}

Monocular Camera
Currently there are issues with running with a monocular camera on the Ironside Neighborhood car datasets. This is likely due to the near-constant velocity and "smoothness" of the trajectory. Please refer to [Lee2020IROS] and [Wu2017ICRA] for details.

Most of these datasets do not have perfect calibration parameters, and some are not time synchronised. Thus, please ensure that you have enabled online calibration of these parameters. Additionally, there is no groundtruth for these datasets, but some do include GPS messages if you wish to compare relative to something.

@m_div{m-text-center}

Dataset Name Length (m) Dataset Link Groundtruth Traj. Config
ArUco Room 01 27 rosbag none config aruco
ArUco Room 02 93 rosbag none config aruco
ArUco Hallway 01 190 rosbag none config aruco
ArUco Hallway 02 105 rosbag none config aruco
Neighborhood 01 2300 rosbag none config ironsides
Neighborhood 02 7400 rosbag none config ironsides

@m_enddiv

UZH-FPV Drone Racing Dataset

The UZH-FPV Drone Racing Dataset [Schubert2018IROS] is a dataset focused on high-speed agressive 6dof motion with very high levels of optical flow as compared to other datasets. A FPV drone racing quadrotor has on board a Qualcomm Snapdragon Flight board which can provide inertial measurement and has two 640x480 grayscale global shutter fisheye camera's attached. The groundtruth is collected with a Leica Nova MS60 laser tracker. There are four total sensor configurations and calibration provides including: indoor forward facing stereo, indoor 45 degree stereo, outdoor forward facing, and outdoor 45 degree. A top speed of 12.8 m/s (28 mph) is reached in the indoor scenarios, and 23.4 m/s (54 mphs) is reached in the outdoor datasets. Each of these datasets is picked up in the beginning and then set down, we normally skip this part, but it should be able to be handled by the filter if SLAM features are enabled. Please take a look at the run_ros_uzhfpv.sh script for some reasonable default values (they might still need to be tuned).

@m_class{m-block m-warning}

Dataset Groundtruthing
Only the Absolute Trajectory Error (ATE) should be used as a metric for this dataset. This is due to inaccurate groundtruth orientation estimates which are explain in their report on the issue. The basic summary is that it is hard to get an accurate orientation information due to the point-based Leica measurements used to groundtruth.

@m_div{m-text-center}

Dataset Name Length (m) Dataset Link Groundtruth Traj. Config
Indoor 5 157 rosbag link config
Indoor 6 204 rosbag link config
Indoor 7 314 rosbag link config
Indoor 9 136 rosbag link config
Indoor 10 129 rosbag link config
Indoor 45deg 2 207 rosbag link config
Indoor 45deg 4 164 rosbag link config
Indoor 45deg 12 112 rosbag link config
Indoor 45deg 13 159 rosbag link config
Indoor 45deg 14 211 rosbag link config

@m_enddiv

KAIST Urban Dataset

The KAIST urban dataset [Jeong2019IJRR] is a dataset focus on autonomous driving and localization in challenging complex urban environments. The dataset was collected in Korea with a vehicle equipped with stereo camera pair, 2d SICK LiDARs, 3d Velodyne LiDAR, Xsens IMU, fiber optic gyro (FoG), wheel encoders, and RKT GPS. The camera is 10 Hz, while the Xsens IMU is 100 Hz sensing rate. A groundtruth "baseline" trajectory is also provided which is the resulting output from fusion of the FoG, RKT GPS, and wheel encoders. We provide processing scripts to generate the calibration and groundtruth from the dataset's formats.

@m_class{m-block m-warning}

Dynamic Environments
A challenging open research question is being able to handle dynamic objects seen from the cameras. By default we rely on our tracking 8 point RANSAC to handle these dynamics objects. In the most of the KAIST datasets the majority of the scene can be taken up by other moving vehicles, thus the performance can suffer. Please be aware of this fact.

We recommend converting the KAIST file format into a ROS bag format. If you are using ROS2 then you should first convert into a ROS1 then convert following the ROS1 to ROS2 Bag Conversion Guide . Follow the instructions on the kaist2bag repository:

git clone https://github.com/irapkaist/irp_sen_msgs.git
git clone https://github.com/rpng/kaist2bag.git

@m_class{m-block m-warning}

Monocular Camera
Currently there are issues with running with a monocular camera on this dataset. This is likely due to the near-constant velocity and "smoothness" of the trajectory. Please refer to [Lee2020IROS] and [Wu2017ICRA] for details.

You can also try to use the file_player to publish live. It is important to disable the "skip stop section" to ensure that we have continuous sensor feeds. Typically we process the datasets at 1.5x rate so we get a ~20 Hz image feed and the datasets can be processed in a more efficient manor.

@m_div{m-text-center}

Dataset Name Length (km) Dataset Link Groundtruth Traj. Example Launch
Urban 28 11.47 download link config
Urban 32 7.30 download link config
Urban 38 11.42 download link config
Urban 39 11.06 download link config

@m_enddiv

KAIST VIO Dataset

The KAIST VIO dataset [Jeon2021RAL] is a dataset of a MAV in an indoor 3.15 x 3.60 x 2.50 meter environment which undergoes various trajectory motions. The camera is intel realsense D435i 25 Hz, while the IMU is 100 Hz sensing rate from the pixelhawk 4 unit. A groundtruth "baseline" trajectory is also provided from a OptiTrack Mocap system at 50 Hz, the bag files have the marker body frame to IMU frame already applied. This topic has been provided in ov_data for convenience sake.

@m_div{m-text-center}

Dataset Name Length (km) Dataset Link Groundtruth Traj. Example Launch
circle 29.99 download link config
circle_fast 64.15 download link config
circle_head 35.05 download link config
infinite 29.35 download link config
infinite_fast 54.24 download link config
infinite_head 37.45 download link config
rotation 7.82 download link config
rotation_fast 14.55 download link config
square 41.94 download link config
square_fast 44.07 download link config
square_head 50.00 download link config

@m_enddiv



ov_core
Author(s): Patrick Geneva , Kevin Eckenhoff , Guoquan Huang
autogenerated on Mon Dec 16 2024 03:06:46