README
Model Predictive Path Integral Controller
Overview
This is a predictive controller (local trajectory planner) that implements the Model Predictive Path Integral (MPPI) algorithm to track a path with adaptive collision avoidance. It contains plugin-based critic functions to impact the behavior of the algorithm. It was created by Aleksei Budyakov and adapted & developed for Nav2 by Steve Macenski.
This plugin implements the nav2_core::Controller
interface allowing it to be used across the navigation stack as a local trajectory planner in the controller server’s action server (controller_server
).
This controller is measured to run at 50+ Hz on a modest Intel processor (4th gen i5). See its Configuration Guide Page for additional parameter descriptions.
It works currently with Differential, Omnidirectional, and Ackermann robots.
MPPI Description
The MPPI algorithm is an MPC variant that finds a control velocity for the robot using an iterative approach. Using the previous time step’s best control solution and the robot’s current state, a set of randomly sampled perturbations from a Gaussian distribution are applied. These noised controls are forward simulated to generate a set of trajectories within the robot’s motion model.
Next, these trajectories are scored using a set of plugin-based critic functions to find the best trajectory in the batch. The output scores are used to set the best control with a soft max function.
This process is then repeated a number of times and returns a converged solution. This solution is then used as the basis of the next time step’s initial control.
Features
Predictive MPC trajectory planner
Utilizes plugin-based critics which can be swapped out, tuned, or replaced easily by the user
Highly optimized CPU-only performance using vectorization and tensor operations
Supports a number of common motion models, including Ackermann, Differential-Drive, and Omni-directional
Includes fallback mechanisms to handle soft-failures before escalating to recovery behaviors
High-quality code implementation with Doxygen, high unit test coverage, documentation, and parameter guide
Easily extensible to support modern research variants of MPPI
Comes pre-tuned for good out-of-the-box behavior
Configuration
Controller
Parameter |
Type |
Definition |
---|---|---|
motion_model |
string |
Default: DiffDrive. Type of model [DiffDrive, Omni, Ackermann]. |
critics |
string |
Default: None. Critics (plugins) names |
iteration_count |
int |
Default 1. Iteration count in MPPI algorithm. Recommend to keep as 1 and prefer more batches. |
batch_size |
int |
Default 1000. Count of randomly sampled candidate trajectories |
time_steps |
int |
Default 56. Number of time steps (points) in each sampled trajectory |
model_dt |
double |
Default: 0.05. Time interval (s) between two sampled points in trajectories. |
vx_std |
double |
Default 0.2. Sampling standard deviation for VX |
vy_std |
double |
Default 0.2. Sampling standard deviation for VY |
wz_std |
double |
Default 0.4. Sampling standard deviation for Wz |
vx_max |
double |
Default 0.5. Max VX (m/s) |
vy_max |
double |
Default 0.5. Max VY in either direction, if holonomic. (m/s) |
vx_min |
double |
Default -0.35. Min VX (m/s) |
wz_max |
double |
Default 1.9. Max WZ (rad/s) |
temperature |
double |
Default: 0.3. Selectiveness of trajectories by their costs (The closer this value to 0, the “more” we take in consideration controls with less cost), 0 mean use control with best cost, huge value will lead to just taking mean of all trajectories without cost consideration |
gamma |
double |
Default: 0.015. A trade-off between smoothness (high) and low energy (low). This is a complex parameter that likely won’t need to be changed from the default of |
visualize |
bool |
Default: false. Publish visualization of trajectories, which can slow down the controller significantly. Use only for debugging. |
retry_attempt_limit |
int |
Default 1. Number of attempts to find feasible trajectory on failure for soft-resets before reporting failure. |
reset_period |
double |
Default 1.0. required time of inactivity to reset optimizer (only in Humble due to backport ABI policies) |
regenerate_noises |
bool |
Default false. Whether to regenerate noises each iteration or use single noise distribution computed on initialization and reset. Practically, this is found to work fine since the trajectories are being sampled stochastically from a normal distribution and reduces compute jittering at run-time due to thread wake-ups to resample normal distribution. |
Trajectory Visualizer
Parameter |
Type |
Definition |
---|---|---|
trajectory_step |
int |
Default: 5. The step between trajectories to visualize to downsample candidate trajectory pool. |
time_step |
int |
Default: 3. The step between points on trajectories to visualize to downsample trajectory density. |
Path Handler
Parameter |
Type |
Definition |
---|---|---|
max_robot_pose_search_dist |
double |
Default: Costmap half-size. Max integrated distance ahead of robot pose to search for nearest path point in case of path looping. |
prune_distance |
double |
Default: 1.5. Distance ahead of nearest point on path to robot to prune path to. |
transform_tolerance |
double |
Default: 0.1. Time tolerance for data transformations with TF. |
enforce_path_inversion |
double |
Default: False. If true, it will prune paths containing cusping points for segments changing directions (e.g. path inversions) such that the controller will be forced to change directions at or very near the planner’s requested inversion point. This is targeting Smac Planner users with feasible paths who need their robots to switch directions where specifically requested. |
inversion_xy_tolerance |
double |
Default: 0.2. Cartesian proximity (m) to path inversion point to be considered “achieved” to pass on the rest of the path after path inversion. |
inversion_yaw_tolerance |
double |
Default: 0.4. Angular proximity (radians) to path inversion point to be considered “achieved” to pass on the rest of the path after path inversion. 0.4 rad = 23 deg. |
Ackermann Motion Model
Parameter |
Type |
Definition |
---|---|---|
min_turning_r |
double |
minimum turning radius for ackermann motion model |
Constraint Critic
Parameter |
Type |
Definition |
---|---|---|
cost_weight |
double |
Default 4.0. Weight to apply to critic term. |
cost_power |
int |
Default 1. Power order to apply to term. |
Goal Angle Critic
Parameter |
Type |
Definition |
---|---|---|
cost_weight |
double |
Default 3.0. Weight to apply to critic term. |
cost_power |
int |
Default 1. Power order to apply to term. |
threshold_to_consider |
double |
Default 0.5. Minimal distance between robot and goal above which angle goal cost considered. |
Goal Critic
Parameter |
Type |
Definition |
---|---|---|
cost_weight |
double |
Default 5.0. Weight to apply to critic term. |
cost_power |
int |
Default 1. Power order to apply to term. |
threshold_to_consider |
double |
Default 1.4. Distance between robot and goal above which goal cost starts being considered |
Obstacles Critic
Uses estimated distances from obstacles using cost and inflation parameters to avoid obstacles
Parameter |
Type |
Definition |
---|---|---|
consider_footprint |
bool |
Default: False. Whether to use point cost (if robot is circular or low compute power) or compute SE2 footprint cost. |
critical_weight |
double |
Default 20.0. Weight to apply to critic for near collisions closer than |
repulsion_weight |
double |
Default 1.5. Weight to apply to critic for generally preferring routes in lower cost space. This is separated from the critical term to allow for fine tuning of obstacle behaviors with path alignment for dynamic scenes without impacting actions which may directly lead to near-collisions. This is applied within the |
cost_power |
int |
Default 1. Power order to apply to term. |
collision_cost |
double |
Default 10000.0. Cost to apply to a true collision in a trajectory. |
collision_margin_distance |
double |
Default 0.10. Margin distance from collision to apply severe penalty, similar to footprint inflation. Between 0.05-0.2 is reasonable. |
near_goal_distance |
double |
Default 0.5. Distance near goal to stop applying preferential obstacle term to allow robot to smoothly converge to goal pose in close proximity to obstacles. |
cost_scaling_factor |
double |
Default 10.0. Exponential decay factor across inflation radius. This should be the same as for your inflation layer (Humble only) |
inflation_radius |
double |
Default 0.55. Radius to inflate costmap around lethal obstacles. This should be the same as for your inflation layer (Humble only) |
Cost Critic
Uses inflated costmap cost directly to avoid obstacles
Parameter |
Type |
Definition |
---|---|---|
consider_footprint |
bool |
Default: False. Whether to use point cost (if robot is circular or low compute power) or compute SE2 footprint cost. |
cost_weight |
double |
Default 3.81. Wight to apply to critic to avoid obstacles. |
cost_power |
int |
Default 1. Power order to apply to term. |
collision_cost |
double |
Default 1000000.0. Cost to apply to a true collision in a trajectory. |
critical_cost |
double |
Default 300.0. Cost to apply to a pose with any point in in inflated space to prefer distance from obstacles. |
near_goal_distance |
double |
Default 0.5. Distance near goal to stop applying preferential obstacle term to allow robot to smoothly converge to goal pose in close proximity to obstacles. |
inflation_layer_name |
string |
Default “”. Name of the inflation layer. If empty, it uses the last inflation layer in the costmap. If you have multiple inflation layers, you may want to specify the name of the layer to use. |
Path Align Critic
Parameter |
Type |
Definition |
---|---|---|
cost_weight |
double |
Default 10.0. Weight to apply to critic term. |
cost_power |
int |
Default 1. Power order to apply to term. |
threshold_to_consider |
double |
Default 0.5. Distance between robot and goal above which path align cost stops being considered |
offset_from_furthest |
double |
Default 20. Checks that the candidate trajectories are sufficiently far along their way tracking the path to apply the alignment critic. This ensures that path alignment is only considered when actually tracking the path, preventing awkward initialization motions preventing the robot from leaving the path to achieve the appropriate heading. |
trajectory_point_step |
double |
Default 4. Step of trajectory points to evaluate for path distance to reduce compute time. Between 1-10 is typically reasonable. |
max_path_occupancy_ratio |
double |
Default 0.07 (7%). Maximum proportion of the path that can be occupied before this critic is not considered to allow the obstacle and path follow critics to avoid obstacles while following the path’s intent in presence of dynamic objects in the scene. |
use_path_orientations |
bool |
Default false. Whether to consider path’s orientations in path alignment, which can be useful when paired with feasible smac planners to incentivize directional changes only where/when the smac planner requests them. If you want the robot to deviate and invert directions where the controller sees fit, keep as false. If your plans do not contain orientation information (e.g. navfn), keep as false. |
Note: There is a “Legacy” version of this critic also available with the same parameters of an old formulation pre-October 2023.
Path Angle Critic
Parameter |
Type |
Definition |
---|---|---|
cost_weight |
double |
Default 2.0. Weight to apply to critic term. |
cost_power |
int |
Default 1. Power order to apply to term. |
threshold_to_consider |
double |
Default 0.5. Distance between robot and goal above which path angle cost stops being considered |
offset_from_furthest |
int |
Default 4. Number of path points after furthest one any trajectory achieves to compute path angle relative to. |
max_angle_to_furthest |
double |
Default 1.2. Angular distance between robot and goal above which path angle cost starts being considered |
forward_preference |
bool |
Default true. Whether or not your robot has a preference for which way is forward in motion. Different from if reversing is generally allowed, but if you robot contains no particular preference one way or another. |
Path Follow Critic
Parameter |
Type |
Definition |
---|---|---|
cost_weight |
double |
Default 5.0. Weight to apply to critic term. |
cost_power |
int |
Default 1. Power order to apply to term. |
offset_from_furthest |
int |
Default 6. Number of path points after furthest one any trajectory achieves to drive path tracking relative to. |
threshold_to_consider |
float |
Default 1.4. Distance between robot and goal above which path follow cost stops being considered |
Prefer Forward Critic
Parameter |
Type |
Definition |
---|---|---|
cost_weight |
double |
Default 5.0. Weight to apply to critic term. |
cost_power |
int |
Default 1. Power order to apply to term. |
threshold_to_consider |
double |
Default 0.5. Distance between robot and goal above which prefer forward cost stops being considered |
Twirling Critic
Parameter |
Type |
Definition |
---|---|---|
cost_weight |
double |
Default 10.0. Weight to apply to critic term. |
cost_power |
int |
Default 1. Power order to apply to term. |
Velocity Deadband Critic
Parameter |
Type |
Definition |
---|---|---|
cost_weight |
double |
Default 35.0. Weight to apply to critic term. |
cost_power |
int |
Default 1. Power order to apply to term. |
deadband_velocities |
double[] |
Default [0.0, 0.0, 0.0]. The array of deadband velocities [vx, vz, wz]. A zero array indicates that the critic will take no action. |
XML configuration example
controller_server:
ros__parameters:
controller_frequency: 30.0
FollowPath:
plugin: "nav2_mppi_controller::MPPIController"
time_steps: 56
model_dt: 0.05
batch_size: 2000
vx_std: 0.2
vy_std: 0.2
wz_std: 0.4
vx_max: 0.5
vx_min: -0.35
vy_max: 0.5
wz_max: 1.9
iteration_count: 1
prune_distance: 1.7
transform_tolerance: 0.1
temperature: 0.3
gamma: 0.015
motion_model: "DiffDrive"
visualize: false
TrajectoryVisualizer:
trajectory_step: 5
time_step: 3
AckermannConstraints:
min_turning_r: 0.2
critics: ["ConstraintCritic", "CostCritic", "GoalCritic", "GoalAngleCritic", "PathAlignCritic", "PathFollowCritic", "PathAngleCritic", "PreferForwardCritic"]
ConstraintCritic:
enabled: true
cost_power: 1
cost_weight: 4.0
GoalCritic:
enabled: true
cost_power: 1
cost_weight: 5.0
threshold_to_consider: 1.4
GoalAngleCritic:
enabled: true
cost_power: 1
cost_weight: 3.0
threshold_to_consider: 0.5
PreferForwardCritic:
enabled: true
cost_power: 1
cost_weight: 5.0
threshold_to_consider: 0.5
# ObstaclesCritic:
# enabled: true
# cost_power: 1
# repulsion_weight: 1.5
# critical_weight: 20.0
# consider_footprint: false
# collision_cost: 10000.0
# collision_margin_distance: 0.1
# near_goal_distance: 0.5
CostCritic:
enabled: true
cost_power: 1
cost_weight: 3.81
critical_cost: 300.0
consider_footprint: true
collision_cost: 1000000.0
near_goal_distance: 1.0
PathAlignCritic:
PathAlignCritic:
enabled: true
cost_power: 1
cost_weight: 14.0
max_path_occupancy_ratio: 0.05
trajectory_point_step: 3
threshold_to_consider: 0.5
offset_from_furthest: 20
use_path_orientations: false
PathFollowCritic:
enabled: true
cost_power: 1
cost_weight: 5.0
offset_from_furthest: 5
threshold_to_consider: 1.4
PathAngleCritic:
enabled: true
cost_power: 1
cost_weight: 2.0
offset_from_furthest: 4
threshold_to_consider: 0.5
max_angle_to_furthest: 1.0
forward_preference: true
# VelocityDeadbandCritic:
# enabled: true
# cost_power: 1
# cost_weight: 35.0
# deadband_velocities: [0.05, 0.05, 0.05]
# TwirlingCritic:
# enabled: true
# twirling_cost_power: 1
# twirling_cost_weight: 10.0
Topics
Topic |
Type |
Description |
---|---|---|
|
|
Randomly generated trajectories, including resulting control sequence |
|
|
Part of global plan considered by local planner |
Notes to Users
General Words of Wisdom
The model_dt
parameter generally should be set to the duration of your control frequency. So if your control frequency is 20hz, this should be 0.05
. However, you may also set it lower but not larger.
Visualization of the trajectories using visualize
uses compute resources to back out trajectories for visualization and therefore slows compute time. It is not suggested that this parameter is set to true
during a deployed use, but is a useful debug instrument while tuning the system, but use sparingly. Visualizing 2000 batches @ 56 points at 30 hz is a lot.
The most common parameters you might want to start off changing are the velocity profiles (vx_max
, vx_min
, wz_max
, and vy_max
if holonomic) and the motion_model
to correspond to your vehicle. Its wise to consider the prune_distance
of the path plan in proportion to your maximum velocity and prediction horizon. The only deeper parameter that will likely need to be adjusted for your particular settings is the Obstacle critics’ repulsion_weight
since the tuning of this is proportional to your inflation layer’s radius. Higher radii should correspond to reduced repulsion_weight
due to the penalty formation (e.g. inflation_radius - min_dist_to_obstacle
). If this penalty is too high, the robot will slow significantly when entering cost-space from non-cost space or jitter in narrow corridors. It is noteworthy, but likely not necessary to be changed, that the Obstacle critic may use the full footprint information if consider_footprint = true
, though comes at an increased compute cost.
If you don’t require path following behavior (e.g. just want to follow a goal pose and let the model predictive elements decide the best way to accomplish that), you may easily remove the PathAlign, PathFollow and PathAngle critics.
By default, the controller is tuned and has the capabilities established in the PathAlign/Obstacle critics to generally follow the path closely when no obstacles prevent it, but able to deviate from the path when blocked. See PathAlignCritic::score()
for details, but it is disabled when the local path is blocked so the obstacle critic takes over in that state.
Prediction Horizon, Costmap Sizing, and Offsets
As this is a predictive planner, there is some relationship between maximum speed, prediction times, and costmap size that users should keep in mind while tuning for their application. If a controller server costmap is set to 3.0m in size, that means that with the robot in the center, there is 1.5m of information on either side of the robot. When your prediction horizon (time_steps * model_dt) at maximum speed (vx_max) is larger than this, then your robot will be artificially limited in its maximum speeds and behavior by the costmap limitation. For example, if you predict forward 3 seconds (60 steps @ 0.05s per step) at 0.5m/s maximum speed, the minimum required costmap radius is 1.5m - or 3m total width.
The same applies to the Path Follow and Align offsets from furthest. In the same example if the furthest point we can consider is already at the edge of the costmap, then further offsets are thresholded because they’re unusable. So its important while selecting these parameters to make sure that the theoretical offsets can exist on the costmap settings selected with the maximum prediction horizon and velocities desired. Setting the threshold for consideration in the path follower + goal critics as the same as your prediction horizon can make sure you have clean hand-offs between them, as the path follower will otherwise attempt to slow slightly once it reaches the final goal pose as its marker.
The Path Follow critic cannot drive velocities greater than the projectable distance of that velocity on the available path on the rolling costmap. The Path Align critic offset_from_furthest
represents the number of path points a trajectory passes through while tracking the path. If this is set either absurdly low (e.g. 5) it can trigger when a robot is simply trying to start path tracking causing some suboptimal behaviors and local minima while starting a task. If it is set absurdly high (e.g. 50) relative to the path resolution and costmap size, then the critic may never trigger or only do so when at full-speed. A balance here is wise. A selection of this value to be ~30% of the maximum velocity distance projected is good (e.g. if a planner produces points every 2.5cm, 60 can fit on the 1.5m local costmap radius. If the max speed is 0.5m/s with a 3s prediction time, then 20 points represents 33% of the maximum speed projected over the prediction horizon onto the path). When in doubt, prediction_horizon_s * max_speed / path_resolution / 3.0
is a good baseline.
Obstacle, Inflation Layer, and Path Following
There also exists a relationship between the costmap configurations and the Obstacle critic configurations. If the Obstacle critic is not well tuned with the costmap parameters (inflation radius, scale) it can cause the robot to wobble significantly as it attempts to take finitely lower-cost trajectories with a slightly lower cost in exchange for jerky motion. The default behavior was tuned for small AMRs (e.g. turtlebots or similar), so if using a larger robot, you may want to reduce the repulsion_weight
in kind. It may also perform awkward maneuvers when in free-space to try to maximize time in a small pocket of 0-cost over a more natural motion which involves moving into some low-costed region. Finally, it may generally refuse to go into costed space at all when starting in a free 0-cost space if the gain is set disproportionately higher than the Path Follow scoring to encourage the robot to move along the path. This is due to the critic cost of staying in free space becoming more attractive than entering even lightly costed space in exchange for progression along the task.
Thus, care should be taken to select weights of the obstacle critic in conjunction with the costmap inflation radius and scale so that a robot does not have such issues. How I (Steve, your friendly neighborhood navigator) tuned this was to first create the appropriate obstacle critic behavior desirable in conjunction with the inflation layer parameters. Its worth noting that the Obstacle critic converts the cost into a distance from obstacles, so the nature of the distribution of costs in the inflation isn’t overly significant. However, the inflation radius and the scale will define the cost at the end of the distribution where free-space meets the lowest cost value within the radius. So testing for quality behavior when going over that threshold should be considered.
As you increase or decrease your weights on the Obstacle, you may notice the aforementioned behaviors (e.g. won’t overcome free to non-free threshold). To overcome them, increase the FollowPath critic cost to increase the desire for the trajectory planner to continue moving towards the goal. Make sure to not overshoot this though, keep them balanced. A desirable outcome is smooth motion roughly in the center of spaces without significant close interactions with obstacles. It shouldn’t be perfectly following a path yet nor should the output velocity be wobbling jaggedly.
Once you have your obstacle avoidance behavior tuned and matched with an appropriate path following penalty, tune the Path Align critic to align with the path. If you design exact-path-alignment behavior, its possible to skip the obstacle critic step as highly tuning the system to follow the path will give it less ability to deviate to avoid obstacles (though it’ll slow and stop). Tuning the critic weight for the Obstacle critic high will do the job to avoid near-collisions but the repulsion weight is largely unnecessary to you. For others wanting more dynamic behavior, it can be beneficial to slowly lower the weight on the obstacle critic to give the path alignment critic some more room to work. If your path was generated with a cost-aware planner (like all provided by Nav2) and providing paths sufficiently far from obstacles for your satisfaction, the impact of a slightly reduced Obstacle critic with a Path Alignment critic will do you well. Not over-weighting the path align critic will allow the robot to deviate from the path to get around dynamic obstacles in the scene or other obstacles not previous considered during path planning. It is subjective as to the best behavior for your application, but it has been shown that MPPI can be an exact path tracker and/or avoid dynamic obstacles very fluidly and everywhere in between. The defaults provided are in the generally right regime for a balanced initial trade-off.