Supported Datasets

The EuRoC MAV Dataset

The ETH ASL EuRoC MAV dataset [4] is one of the most widely used datasets in the visual-inertial and simultaneous localization and mapping (SLAM) research literature, owing to its synchronised inertial and camera data and its high-quality groundtruth. The dataset contains sequences of varying difficulty in which a Micro Aerial Vehicle (MAV) flies indoors. Monochrome stereo images are collected by two Aptina MT9V034 global shutter cameras at 20 frames per second, while an ADIS16448 MEMS inertial unit provides linear accelerations and angular velocities at 200 samples per second.
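
As a rough illustration of what these rates imply, each camera interval should contain about 200 / 20 = 10 inertial samples. The sketch below checks this on synthetic integer-millisecond timestamps. The helper name `imu_between_frames` and the data are made up for illustration and are not part of the dataset or its tooling.

```python
from bisect import bisect_right

def imu_between_frames(imu_times, t0, t1):
    """Return the sorted IMU timestamps that fall in the interval (t0, t1]."""
    lo = bisect_right(imu_times, t0)
    hi = bisect_right(imu_times, t1)
    return imu_times[lo:hi]

# Synthetic timestamps in milliseconds (not real dataset values).
imu_times = list(range(0, 1000, 5))    # 200 Hz -> one sample every 5 ms
cam_times = list(range(0, 1000, 50))   # 20 Hz  -> one frame every 50 ms

# Number of IMU samples between each pair of consecutive camera frames.
counts = [
    len(imu_between_frames(imu_times, a, b))
    for a, b in zip(cam_times, cam_times[1:])
]
print(counts)
```

On real recordings the count per interval may vary slightly because of timestamp jitter or dropped messages.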

We recommend that most users start by testing on this dataset before moving on to the other datasets that our system supports or to their own collected data. In the machine hall datasets, the MAV is picked up at the beginning and then set down. We normally skip this part, but the filter should be able to handle it if SLAM features are enabled. Please take a look at the run_ros_eth.sh script for some reasonable default values (they might still need to be tuned).
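
Skipping the initial pick-up amounts to discarding every measurement recorded before some offset from the start of the sequence. A minimal sketch of that idea follows. The function name `skip_initial`, the message tuples and the 2.0 second offset are hypothetical placeholders, not values taken from the launch scripts.

```python
def skip_initial(stamped, start_offset_s):
    """Drop (timestamp, payload) pairs recorded before first_stamp + offset.

    `stamped` is assumed to be sorted by timestamp (seconds).
    """
    if not stamped:
        return []
    t_start = stamped[0][0] + start_offset_s
    return [(t, m) for t, m in stamped if t >= t_start]

# Ten synthetic messages spaced 0.5 s apart, starting at t = 100 s.
messages = [(100.0 + 0.5 * i, f"msg{i}") for i in range(10)]

# Hypothetical offset; a suitable value depends on the sequence.
kept = skip_initial(messages, start_offset_s=2.0)
print(len(kept), kept[0])
```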

| Dataset Name | Length (m) | Dataset Link | Groundtruth Traj. | Config |
|---|---|---|---|---|
| Vicon Room 1 01 | 58 | rosbag, rosbag2 | link | config |
| Vicon Room 1 02 | 76 | rosbag, rosbag2 | link | config |
| Vicon Room 1 03 | 79 | rosbag, rosbag2 | link | config |
| Vicon Room 2 01 | 37 | rosbag, rosbag2 | link | config |
| Vicon Room 2 02 | 83 | rosbag, rosbag2 | link | config |
| Vicon Room 2 03 | 86 | rosbag, rosbag2 | link | config |
| Machine Hall 01 | 80 | rosbag, rosbag2 | link | config |
| Machine Hall 02 | 73 | rosbag, rosbag2 | link | config |
| Machine Hall 03 | 131 | rosbag, rosbag2 | link | config |
| Machine Hall 04 | 92 | rosbag, rosbag2 | link | config |
| Machine Hall 05 | 98 | rosbag, rosbag2 | link | config |
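
The lengths listed for each sequence can be thought of as the distance travelled along the groundtruth trajectory. The source does not state how these numbers were computed, so the following is only one plausible sketch: summing the Euclidean distances between consecutive groundtruth positions. The function name `trajectory_length` and the sample points are illustrative.

```python
import math

def trajectory_length(positions):
    """Sum of straight-line distances between consecutive 3D positions (metres)."""
    return sum(math.dist(p, q) for p, q in zip(positions, positions[1:]))

# Three made-up positions: a 5 m segment followed by a 12 m segment.
path = [(0.0, 0.0, 0.0), (3.0, 4.0, 0.0), (3.0, 4.0, 12.0)]
total = trajectory_length(path)
print(total)
```

With noisy groundtruth, a sum like this tends to overestimate the true path length, so some smoothing or downsampling may be needed in practice.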

TUM Visual-Inertial Dataset

The TUM Visual-Inertial Dataset [39] is a more recent dataset that was presented to provide a way to evaluate state-of-the-art visual-inertial odometry approaches. Compared to the EuRoC MAV datasets, this dataset provides photometric calibration of the cameras, which had not been available in any other visual-inertial dataset for researchers. Monochrome stereo images are collected by two IDS uEye UI-3241LE-M-GL global shutter cameras at 20 frames per second, while a Bosch BMI160 inertial unit provides linear accelerations and angular velocities at 200 samples per second. Not all datasets have groundtruth available throughout the entire trajectory, as the motion capture system is limited to the starting and ending room. There are also quite a few very challenging outdoor handheld datasets, which remain an open direction for research. Note that we focus on the room datasets, as full 6 DoF poses are available over the entire trajectory.
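
When groundtruth is only available for part of a trajectory, it can help to locate the gaps before running an evaluation. The sketch below flags intervals where consecutive groundtruth timestamps are further apart than a chosen threshold. The function name `groundtruth_gaps`, the timestamps and the 0.5 second threshold are assumptions for illustration only.

```python
def groundtruth_gaps(times, max_gap_s):
    """Return (start, end) pairs where sorted timestamps are more than max_gap_s apart."""
    return [(a, b) for a, b in zip(times, times[1:]) if b - a > max_gap_s]

# Synthetic groundtruth timestamps in seconds, with tracking lost between 0.2 s and 5.0 s.
times = [0.0, 0.1, 0.2, 5.0, 5.1]
gaps = groundtruth_gaps(times, max_gap_s=0.5)
print(gaps)
```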

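| Dataset Name | Length (m) | Dataset Link | Groundtruth Traj. | Config |
|---|---|---|---|---|
| room1 | 147 | rosbag | link | config |
| room2 | 142 | rosbag | link | config |
| room3 | 136 | rosbag | link | config |
| room4 | 69 | rosbag | link | config |
| room5 | 132 | rosbag | link | config |
| room6 | 67 | rosbag | link | config |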

RPNG AR Table Dataset

The Indoor AR Table Visual-Inertial Datasets [7] were collected to demonstrate the impact of estimating long-term planar surfaces within a visual-inertial estimator. Data from an Intel Realsense D455 with 30 Hz RGB-D (depth was not used) and a 400 Hz BMI055 IMU, along with 100 Hz OptiTrack poses, were recorded in 1-2 minute segments. The groundtruth was recovered using the vicon2gt utility [16].

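| Dataset Name | Length (m) | Dataset Link | Size (GB) | Groundtruth Traj. | Config |
|---|---|---|---|---|---|
| table1 | 56 | rosbag | 4.77 | link | config |
| table2 | 44 | rosbag | 5.54 | link | config |
| table3 | 88 | rosbag | 13.19 | link | config |
| table4 | 91 | rosbag | 11.49 | link | config |
| table5 | 75 | rosbag | 11.66 | link | config |
| table6 | 50 | rosbag | 5.26 | link | config |
| table7 | 63 | rosbag | 9.02 | link | config |
| table8 | 125 | rosbag | 16.01 | link | config |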

RPNG OpenVINS Dataset

In addition to the community-maintained datasets, we have also released a few datasets of our own. Please cite the OpenVINS paper if you use any of these datasets in your work. The sensors used by each dataset are as follows:

  • ArUco Datasets:
    • Core visual-inertial sensor is the VI-Sensor
    • Stereo global shutter images at 20 Hz
    • ADIS16448 IMU at 200 Hz
    • Kalibr calibration file can be found here
  • Ironsides Datasets:
    • Core visual-inertial sensor is the ironsides
    • Has two Reach RTK units, one subscribed to a base station for corrections
    • Stereo global shutter fisheye images at 20 Hz
    • InvenSense IMU at 200 Hz
    • GPS fixes at 5 Hz (/reach01/tcpfix has corrections from NYSNet)
    • Kalibr calibration file can be found here

Most of these datasets do not have perfect calibration parameters, and some are not time synchronised. Please therefore ensure that you have enabled online calibration of these parameters. Additionally, there is no groundtruth for these datasets, but some do include GPS messages if you wish to compare against an external reference.
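
To compare an estimated trajectory against GPS fixes, the latitude and longitude first need to be expressed in metres in a local frame. The following is a minimal sketch using an equirectangular (flat-Earth) approximation on a spherical Earth. This approximation is an assumption chosen for brevity and it degrades over long distances, so a proper geodetic conversion is likely preferable for multi-kilometre sequences. The function name `latlon_to_local` and the coordinates are hypothetical.

```python
import math

EARTH_RADIUS_M = 6371000.0  # mean Earth radius (spherical approximation)

def latlon_to_local(lat_deg, lon_deg, lat0_deg, lon0_deg):
    """Approximate east/north offsets in metres of a fix from a reference fix."""
    lat0 = math.radians(lat0_deg)
    d_lat = math.radians(lat_deg - lat0_deg)
    d_lon = math.radians(lon_deg - lon0_deg)
    east = EARTH_RADIUS_M * d_lon * math.cos(lat0)
    north = EARTH_RADIUS_M * d_lat
    return east, north

# Hypothetical fix 0.001 degrees north of a hypothetical reference point.
east, north = latlon_to_local(40.001, -75.0, 40.0, -75.0)
print(east, north)
```

The resulting local frame is generally not aligned with the estimator's own frame, so a trajectory alignment step would still be needed before computing errors.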

| Dataset Name | Length (m) | Dataset Link | Groundtruth Traj. | Config |
|---|---|---|---|---|
| ArUco Room 01 | 27 | rosbag | none | config aruco |
| ArUco Room 02 | 93 | rosbag | none | config aruco |
| ArUco Hallway 01 | 190 | rosbag | none | config aruco |
| ArUco Hallway 02 | 105 | rosbag | none | config aruco |
| Neighborhood 01 | 2300 | rosbag | none | config ironsides |
| Neighborhood 02 | 7400 | rosbag | none | config ironsides |

UZH-FPV Drone Racing Dataset

The UZH-FPV Drone Racing Dataset [39] is a dataset focused on high-speed, aggressive 6 DoF motion with very high levels of optical flow compared to other datasets. An FPV drone racing quadrotor carries a Qualcomm Snapdragon Flight board, which provides inertial measurements, and has two 640x480 grayscale global shutter fisheye cameras attached. The groundtruth is collected with a Leica Nova MS60 laser tracker. Four sensor configurations, with calibrations, are provided: indoor forward facing stereo, indoor 45 degree stereo, outdoor forward facing, and outdoor 45 degree. A top speed of 12.8 m/s (28 mph) is reached in the indoor scenarios, and 23.4 m/s (54 mph) in the outdoor datasets. In each of these datasets, the drone is picked up at the beginning and then set down. We normally skip this part, but the filter should be able to handle it if SLAM features are enabled. Please take a look at the run_ros_uzhfpv.sh script for some reasonable default values (they might still need to be tuned).

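| Dataset Name | Length (m) | Dataset Link | Groundtruth Traj. | Config |
|---|---|---|---|---|
| Indoor 5 | 157 | rosbag | link | config |
| Indoor 6 | 204 | rosbag | link | config |
| Indoor 7 | 314 | rosbag | link | config |
| Indoor 9 | 136 | rosbag | link | config |
| Indoor 10 | 129 | rosbag | link | config |
| Indoor 45deg 2 | 207 | rosbag | link | config |
| Indoor 45deg 4 | 164 | rosbag | link | config |
| Indoor 45deg 12 | 112 | rosbag | link | config |
| Indoor 45deg 13 | 159 | rosbag | link | config |
| Indoor 45deg 14 | 211 | rosbag | link | config |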

KAIST Urban Dataset

The KAIST urban dataset [23] is a dataset focused on autonomous driving and localization in challenging, complex urban environments. The dataset was collected in Korea with a vehicle equipped with a stereo camera pair, 2D SICK LiDARs, 3D Velodyne LiDAR, an Xsens IMU, a fiber optic gyro (FoG), wheel encoders, and RTK GPS. The camera runs at 10 Hz, while the Xsens IMU runs at 100 Hz. A groundtruth "baseline" trajectory is also provided, which is the output of fusing the FoG, RTK GPS, and wheel encoders. We provide processing scripts to generate the calibration and groundtruth from the dataset's formats.

We recommend converting the KAIST file format into a ROS bag format. If you are using ROS2, first convert into a ROS1 bag and then convert that bag following the ROS1 to ROS2 Bag Conversion Guide. Follow the instructions on the kaist2bag repository:

git clone https://github.com/irapkaist/irp_sen_msgs.git
git clone https://github.com/rpng/kaist2bag.git

You can also try to use the file_player to publish live. It is important to disable "skip stop section" to ensure that the sensor feeds are continuous. Typically we process the datasets at 1.5x rate so we get a ~20 Hz image feed and the datasets can be processed in a more efficient manner.

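| Dataset Name | Length (km) | Dataset Link | Groundtruth Traj. | Example Launch |
|---|---|---|---|---|
| Urban 28 | 11.47 | download | link | config |
| Urban 32 | 7.30 | download | link | config |
| Urban 38 | 11.42 | download | link | config |
| Urban 39 | 11.06 | download | link | config |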

KAIST VIO Dataset

The KAIST VIO dataset [22] is a dataset of a MAV undergoing various trajectory motions in an indoor 3.15 x 3.60 x 2.50 meter environment. The camera is an Intel RealSense D435i running at 25 Hz, while the IMU, from the Pixhawk 4 unit, runs at 100 Hz. A groundtruth "baseline" trajectory is also provided from an OptiTrack Mocap system at 50 Hz; the bag files already have the marker body frame to IMU frame transform applied. This topic has been provided in ov_data for convenience.
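
For readers unfamiliar with what applying the marker-to-IMU transform means, the sketch below composes a motion capture pose of the marker body with a fixed marker-to-IMU extrinsic to obtain the IMU pose in the world frame. All numbers are made up, the helper `matmul4` is illustrative, and the convention that `T_A_B` maps points from frame B into frame A is an assumption rather than something stated by the dataset.

```python
def matmul4(a, b):
    """Multiply two 4x4 homogeneous transforms given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]

# Hypothetical mocap pose: marker body rotated 90 degrees about z, located at (1, 2, 3).
T_world_marker = [
    [0.0, -1.0, 0.0, 1.0],
    [1.0,  0.0, 0.0, 2.0],
    [0.0,  0.0, 1.0, 3.0],
    [0.0,  0.0, 0.0, 1.0],
]

# Hypothetical extrinsic: IMU sits 0.1 m along the marker body's x axis, same orientation.
T_marker_imu = [
    [1.0, 0.0, 0.0, 0.1],
    [0.0, 1.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 0.0],
    [0.0, 0.0, 0.0, 1.0],
]

# IMU pose in the world frame, and its position (last column, first three rows).
T_world_imu = matmul4(T_world_marker, T_marker_imu)
imu_position = [row[3] for row in T_world_imu[:3]]
print(imu_position)
```

Because the text states that the bag files already include this transform, no such step should be needed when using the provided groundtruth topic.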

| Dataset Name | Length (m) | Dataset Link | Groundtruth Traj. | Example Launch |
|---|---|---|---|---|
| circle | 29.99 | download | link | config |
| circle_fast | 64.15 | download | link | config |
| circle_head | 35.05 | download | link | config |
| infinite | 29.35 | download | link | config |
| infinite_fast | 54.24 | download | link | config |
| infinite_head | 37.45 | download | link | config |
| rotation | 7.82 | download | link | config |
| rotation_fast | 14.55 | download | link | config |
| square | 41.94 | download | link | config |
| square_fast | 44.07 | download | link | config |
| square_head | 50.00 | download | link | config |