Class CObservation3DRangeScan

Defined in File CObservation3DRangeScan.h

Nested Relationships

Nested Types

Struct CObservation3DRangeScan::unproject_LUT_t

Inheritance Relationships

Base Type

public mrpt::obs::CObservation (Class CObservation)

Class Documentation

class CObservation3DRangeScan : public mrpt::obs::CObservation 

A depth or RGB+D image from a time-of-flight or structured-light sensor.

This kind of observations can carry one or more of these data fields:

3D point cloud (as {x,y,z} float vectors).
Each 3D point has its associated (u,v) pixel coordinates in points3D_idxs_x & points3D_idxs_y (New in MRPT 1.4.0)
2D range image (as a matrix): Each entry in the matrix rangeImage(ROW,COLUMN) contains a distance or a depth, depending on range_is_depth. Ranges are stored as uint16_t for efficiency. The units of ranges are stored separately in rangeUnits.
2D intensity (grayscale or RGB) image (as a mrpt::img::CImage).
2D confidence image (as a mrpt::img::CImage): For each pixel, a 0x00 and a 0xFF mean the lowest and highest confidence levels, respectively.
Semantic labels: Stored as a matrix of bitfields, each bit having a user-defined meaning.
For cameras supporting multiple returns per pixels, different layers of range images are available in the map rangeImageOtherLayers.

The coordinates of the 3D point cloud are in meters with respect to the depth camera origin of coordinates, with the +X axis pointing forward, +Y pointing left-hand and +Z pointing up. By convention, a 3D point with its coordinates set to (0,0,0), will be considered as invalid.

The field CObservation3DRangeScan::relativePoseIntensityWRTDepth describes the change of coordinates from the depth camera to the intensity (RGB or grayscale) camera. In some cameras both cameras coincide, so this pose would be just a rotation (0,0,0,-90deg,0,-90deg). In Kinect-like cameras there is also an offset, as shown in this figure:

../../output_staging/generated/doxygen/xml/CObservation3DRangeScan_figRefSystem.png

In any case, check the field relativePoseIntensityWRTDepth, or the method doDepthAndIntensityCamerasCoincide() to determine if both frames of reference coincide, since even for Kinect cameras both can coincide if the images have been rectified.

The 2D images and matrices are stored as common images, with an up->down rows order and left->right, as usual. Optionally, the intensity and confidence channels can be set to delayed-load images for off-rawlog storage so it saves memory by having loaded in memory just the needed images. See the methods load() and unload(). Due to the intensive storage requirements of this kind of observations, this observation is the only one in MRPT for which it’s recommended to always call “load()” and “unload()” before and after using the observation, ONLY when the observation was read from a rawlog dataset, in order to make sure that all the externally stored data fields are loaded and ready in memory.

Some classes that grab observations of this type are:

There are two sets of calibration parameters:

cameraParams: Intrinsics of the depth camera.
cameraParamsIntensity: Intrinsics of the intensity (RGB) camera.

In some cameras, like SwissRanger, both are the same. It is possible in Kinect to rectify the range images such both cameras seem to coincide and then both sets of camera parameters will be identical.

Range data can be interpreted in two different ways depending on the 3D camera (this field is already set to the correct setting when grabbing observations from an mrpt::hwdrivers sensor):

range_is_depth==true: Kinect-like ranges: entries of rangeImage are distances along the +X (front-facing) axis.
range_is_depth==false: Ranges in rangeImage are actual distances in 3D, i.e. a bit larger than the depth.

The intensity channel may come from different channels in sensors as Kinect. Look at field intensityImageChannel to find out if the image was grabbed from the visible (RGB) or IR channels.

3D point clouds can be generated at any moment after grabbing with CObservation3DRangeScan::unprojectInto(), provided the correct calibration parameters. Note that unprojectInto() will store the point cloud in sensor-centric local coordinates by default, but changing its parameters you can obtain a vehicle-centric, or world-frame point cloud instead.

Examples of how to assign labels to pixels (for object segmentation, semantic information, etc.):

    // Assume obs of type CObservation3DRangeScan::Ptr
    obs->pixelLabels =TPixelLabelInfo::Ptr( new
*CObservation3DRangeScan::TPixelLabelInfo<NUM_BYTES>() );
    obs->pixelLabels->setSize(ROWS,COLS);
    obs->pixelLabels->setLabel(col,row, label_idx);   // label_idxs =
 [0,2^NUM_BYTES-1]
    //...

Note

Since MRPT 1.5.0, external files format can be selected at runtime with CObservation3DRangeScan::EXTERNALS_AS_TEXT

Intensity (RGB) channels

enum TIntensityChannelID

Enum type for intensityImageChannel

Values:

enumerator CH_VISIBLE: Grayscale or RGB visible channel of the camera sensor.

enumerator CH_IR: Infrarred (IR) channel

bool hasIntensityImage = {false}: true means the field intensityImage contains valid data

mrpt::img::CImage intensityImage: If hasIntensityImage=true, a color or gray-level intensity image of the same size than “rangeImage”

TIntensityChannelID intensityImageChannel = {CH_VISIBLE}

The source of the intensityImage; typically the visible channel

See also

TIntensityChannelID

Point cloud

bool hasPoints3D = {false}: true means the field points3D contains valid data.

std::vector<float> points3D_x

If hasPoints3D=true, the (X,Y,Z) coordinates of the 3D point cloud detected by the camera.

See also

resizePoints3DVectors

std::vector<float> points3D_y

std::vector<float> points3D_z

std::vector<uint16_t> points3D_idxs_x: If hasPoints3D=true, the (x,y) pixel coordinates for each (X,Y,Z) point in points3D_x, points3D_y, points3D_z

std::vector<uint16_t> points3D_idxs_y

void resizePoints3DVectors(size_t nPoints): Use this method instead of resizing all three points3D_x, points3D_y & points3D_z to allow the usage of the internal memory pool.

size_t getScanSize() const: Get the size of the scan pointcloud.

Note

Method is added for compatibility with its CObservation2DRangeScan counterpart

Range (depth) image

bool hasRangeImage = {false}: true means the field rangeImage contains valid data

mrpt::math::CMatrix_u16 rangeImage

If hasRangeImage==true, rangeImage(r,c) is a matrix of depth values as captured by the camera, with r the row index and c column index. Matrix element are integers for efficiency of post-processing filters, etc. Zero means no return (i.e. invalid range, no return). To obtain range/depth in meters, multiply this matrix by rangeUnits.

For sensors with more than one return per ray, this matrix holds the range for the “STRONGEST” (normally the closest) return.

std::map<std::string, mrpt::math::CMatrix_u16> rangeImageOtherLayers: Additional layer range/depth images. Text labels are arbitrary and sensor-dependent, e.g. “LAST”, “SECOND”, “3rd”, etc.

float rangeUnits = 0.001f

The conversion factor from integer units in rangeImage and actual distances in meters. Default is 0.001 m, that is 1 millimeter.

See also

rangeImage

bool range_is_depth = {true}: true: Kinect-like ranges: entries of rangeImage are distances along the +X axis (forward); false: Ranges in rangeImage are actual distances in 3D.

void rangeImage_setSize(const int HEIGHT, const int WIDTH): Similar to calling “rangeImage.setSize(H,W)” but this method provides memory pooling to speed-up the memory allocation.

mrpt::img::CImage rangeImage_getAsImage(std::optional<mrpt::img::TColormap> color = std::nullopt, std::optional<float> normMinRange = std::nullopt, std::optional<float> normMaxRange = std::nullopt, const std::optional<std::string> &additionalLayerName = std::nullopt) const

Builds a visualization from the rangeImage. The image is built with the given color map (default: grayscale) and such that the colormap range is mapped to ranges 0 meters to the field “maxRange” in this object, unless overridden with the optional parameters. Note that the usage of optional<> allows any parameter to be left to its default placing std::nullopt.

See also

rangeImageAsImage

Parameters:: additionalLayerName – If empty string or not provided, the main rangeImage will be used; otherwise, the given range image layer.

static mrpt::img::CImage rangeImageAsImage(const mrpt::math::CMatrix_u16 &ranges, float val_min, float val_max, float rangeUnits, std::optional<mrpt::img::TColormap> color = std::nullopt)

Static method to convert a range matrix into an image. If val_max is left to zero, the maximum range in the matrix will be automatically used.

See also

rangeImage_getAsImage

Confidence “channel”

bool hasConfidenceImage = {false}: true means the field confidenceImage contains valid data

mrpt::img::CImage confidenceImage: If hasConfidenceImage=true, an image with the “confidence” value [range 0-255] as estimated by the capture drivers.

Pixel-wise classification labels (for semantic labeling, etc.)

TPixelLabelInfoBase::Ptr pixelLabels: All information about pixel labeling is stored in this (smart pointer to) structure; refer to TPixelLabelInfo for details on the contents User is responsible of creating a new object of the desired data type. It will be automatically (de)serialized no matter its specific type.

inline bool hasPixelLabels() const: Returns true if the field CObservation3DRangeScan::pixelLabels contains a non-NULL smart pointer. To enhance a 3D point cloud with labeling info, just assign an appropriate object to pixelLabels

Sensor parameters

mrpt::img::TCamera cameraParams: Projection parameters of the depth camera.

mrpt::img::TCamera cameraParamsIntensity: Projection parameters of the intensity (graylevel or RGB) camera.

mrpt::poses::CPose3D relativePoseIntensityWRTDepth = {0, 0, 0, -90.0_deg, 0.0_deg, -90.0_deg}

Relative pose of the intensity camera wrt the depth camera (which is the coordinates origin for this observation). In a SwissRanger camera, this will be (0,0,0,-90deg,0,-90deg) since the location of both cameras coincide and there is only a rotation due to the different axes conventions (see picture above). In a Kinect, this will include a small lateral displacement and a rotation, according to the drawing on the top of this page.

See also

doDepthAndIntensityCamerasCoincide

float maxRange = {5.0f}: The maximum range allowed by the device, in meters (e.g. 8.0m, 5.0m,…)

mrpt::poses::CPose3D sensorPose: The 6D pose of the sensor on the robot.

float stdError = {0.01f}: The “sigma” error of the device in meters, used while inserting the scan in an occupancy grid.

bool doDepthAndIntensityCamerasCoincide() const

Return true if relativePoseIntensityWRTDepth equals the pure rotation (0,0,0,-90deg,0,-90deg) (with a small comparison epsilon)

See also

relativePoseIntensityWRTDepth

inline virtual mrpt::poses::CPose3D getSensorPose() const override

A general method to retrieve the sensor pose on the robot. Note that most sensors will return a full (6D) CPose3D, but see the derived classes for more details or special cases.

See also

setSensorPose A general method to retrieve the sensor pose on the robot, returning it by value.

See also

setSensorPose

inline virtual void setSensorPose(const mrpt::poses::CPose3D &newSensorPose) override

A general method to change the sensor pose on the robot. Note that most sensors will use the full (6D) CPose3D, but see the derived classes for more details or special cases.

See also

getSensorPose

Delayed-load manual control methods.

virtual void load_impl() const override

Makes sure all images and other fields which may be externally stored are loaded in memory. Note that for all CImages, calling load() is not required since the images will be automatically loaded upon first access, so load() shouldn’t be needed to be called in normal cases by the user. If all the data were alredy loaded or this object has no externally stored data fields, calling this method has no effects.

See also

unload

virtual void unload() const override

Unload all images, for the case they being delayed-load images stored in external files (otherwise, has no effect).

See also

load

Point cloud external storage functions

inline bool points3D_isExternallyStored() const

inline std::string points3D_getExternalStorageFile() const

void points3D_getExternalStorageFileAbsolutePath(std::string &out_path) const

inline std::string points3D_getExternalStorageFileAbsolutePath() const

void points3D_convertToExternalStorage(const std::string &fileName, const std::string &use_this_base_dir)

Users won’t normally want to call this, it’s only used from internal MRPT programs.

See also

EXTERNALS_AS_TEXT

inline void points3D_overrideExternalStoredFlag(bool isExternal): Users normally won’t need to use this

Range Matrix external storage functions

inline bool rangeImage_isExternallyStored() const

std::string rangeImage_getExternalStorageFile(const std::string &rangeImageLayer) const

void rangeImage_getExternalStorageFileAbsolutePath(std::string &out_path, const std::string &rangeImageLayer) const: rangeImageLayer: Empty for the main rangeImage matrix, otherwise, a key of rangeImageOtherLayers

inline std::string rangeImage_getExternalStorageFileAbsolutePath(const std::string &rangeImageLayer) const

void rangeImage_convertToExternalStorage(const std::string &fileName, const std::string &use_this_base_dir)

Users won’t normally want to call this, it’s only used from internal MRPT programs.

See also

EXTERNALS_AS_TEXT

inline void rangeImage_forceResetExternalStorage(): Forces marking this observation as non-externally stored - it doesn’t anything else apart from resetting the corresponding flag (Users won’t normally want to call this, it’s only used from internal MRPT programs)

Public Functions

CObservation3DRangeScan() = default

~CObservation3DRangeScan() override

template<class POINTMAP> inline void unprojectInto(POINTMAP &dest_pointcloud, const T3DPointsProjectionParams &projectParams = T3DPointsProjectionParams(), const TRangeImageFilterParams &filterParams = TRangeImageFilterParams())

Unprojects the RGB+D image pair into a 3D point cloud (with color if the target map supports it) and optionally at a given 3D pose. The 3D point coordinates are computed from the depth image (rangeImage) and the depth camera camera parameters (cameraParams). There exist two set of formulas for projecting the i’th point, depending on the value of “range_is_depth”. In all formulas below, “rangeImage” is the matrix of ranges and the pixel coordinates are (r,c).

1) [range_is_depth=true] With “range equals depth” or “Kinect-like

depth mode”: the range values are in fact distances along the “+X” axis, not real 3D ranges (this is the way Kinect reports ranges):

x(i) = rangeImage(r,c) * rangeUnits
y(i) = (r_cx - c) * x(i) / r_fx
z(i) = (r_cy - r) * x(i) / r_fy

2) [range_is_depth=false] With “normal ranges”: range means distance in 3D. This must be set when processing data from the SwissRange 3D camera, among others.

Ky = (r_cx - c)/r_fx
Kz = (r_cy - r)/r_fy

x(i) = rangeImage(r,c) * rangeUnits / sqrt( 1 + Ky^2 + Kz^2 )
y(i) = Ky * x(i)
z(i) = Kz * x(i)

The color of each point is determined by projecting the 3D local point into the RGB image using cameraParamsIntensity.

By default the local (sensor-centric) coordinates of points are directly stored into the local map, but if indicated so in takeIntoAccountSensorPoseOnRobot the points are transformed with sensorPose. Furthermore, if provided, those coordinates are transformed with robotPoseInTheWorld

Note

For multi-return sensors, only the layer specified in T3DPointsProjectionParams::layer will be unprojected.

Template Parameters:: POINTMAP – Supported maps are all those covered by mrpt::viz::PointCloudAdapter (mrpt::maps::CPointsMap and derived, mrpt::viz::CPointCloudColoured, PCL point clouds,…)

void convertTo2DScan(mrpt::obs::CObservation2DRangeScan &out_scan2d, const T3DPointsTo2DScanParams &scanParams, const TRangeImageFilterParams &filterParams = TRangeImageFilterParams())

Convert this 3D observation into an “equivalent 2D fake laser scan”, with a configurable vertical FOV.

The result is a 2D laser scan with more “rays” (N) than columns has the 3D observation (W), exactly: N = W * oversampling_ratio. This oversampling is required since laser scans sample the space at evenly-separated angles, while a range camera follows a tangent-like distribution. By oversampling we make sure we don’t leave “gaps” unseen by the virtual “2D laser”.

All obstacles within a frustum are considered and the minimum distance is kept in each direction. The horizontal FOV of the frustum is automatically computed from the intrinsic parameters, but the vertical FOV must be provided by the user, and can be set to be assymetric which may be useful depending on the zone of interest where to look for obstacles.

All spatial transformations are riguorosly taken into account in this class, using the depth camera intrinsic calibration parameters.

The timestamp of the new object is copied from the 3D object. Obviously, a requisite for calling this method is the 3D observation having range data, i.e. hasRangeImage must be true. It’s not needed to have RGB data nor the raw 3D point clouds for this method to work.

If scanParams.use_origin_sensor_pose is true, the points will be projected to 3D and then reprojected as seen from a different sensorPose at the vehicle frame origin. Otherwise (the default), the output 2D observation will share the sensorPose of the input 3D scan (using a more efficient algorithm that avoids trigonometric functions).

Parameters:: out_scan2d – [out] The resulting 2D equivalent scan.

virtual void getDescriptionAsText(std::ostream &o) const override: Build a detailed, multi-line textual description of the observation contents and dump it to the output stream.

Note

If overried by derived classes, call base CObservation::getDescriptionAsText() first to show common information.

Note

This is the text that appears in RawLogViewer when selecting an object in the dataset

void swap(CObservation3DRangeScan &o) noexcept: Very efficient method to swap the contents of two observations.

const unproject_LUT_t &get_unproj_lut() const

Gets (or generates upon first request) the 3D point cloud projection look-up-table for the current depth camera intrinsics & distortion parameters. Returns a const reference to a global variable. Multithread safe.

See also

unprojectInto

Public Static Functions

static void EXTERNALS_AS_TEXT(bool value): Whether external files (3D points, range and confidence) are to be saved as .txt text files (MATLAB compatible) or *.bin binary (faster). Loading always will determine the type by inspecting the file extension.

Note

Default=false

static bool EXTERNALS_AS_TEXT()

static double recoverCameraCalibrationParameters(const CObservation3DRangeScan &in_obs, mrpt::img::TCamera &out_camParams, double camera_offset = 0.01)

A Levenberg-Marquart-based optimizer to recover the calibration parameters of a 3D camera given a range (depth) image and the corresponding 3D point cloud.

Parameters:: camera_offset – The offset (in meters) in the +X direction of the point cloud.
Returns:: The final average reprojection error per pixel (typ <0.05 px)

Protected Attributes

bool m_points3D_external_stored = {false}: If set to true, m_points3D_external_file is valid.

std::string m_points3D_external_file: 3D points are in CImage::getImagesPathBase()+ this file name

bool m_rangeImage_external_stored = {false}: If set to true, m_rangeImage_external_file is valid.

std::string m_rangeImage_external_file: rangeImage is in CImage::getImagesPathBase()+ this file name

struct unproject_LUT_t

Look-up-table struct for unprojectInto()

Public Members

mrpt::aligned_std_vector<float> Kxs: x,y,z: +X pointing forward (depth), +z up These doesn’t account for the sensor pose.

mrpt::aligned_std_vector<float> Kys

mrpt::aligned_std_vector<float> Kzs

mrpt::aligned_std_vector<float> Kxs_rot: x,y,z: in the rotated frame of coordinates of the sensorPose. So, sensed points are directly measured_range(or depth) times this vector, plus the translation (no rotation!) of the sensor pose.

mrpt::aligned_std_vector<float> Kys_rot

mrpt::aligned_std_vector<float> Kzs_rot