Abstract

In this paper, we focus on developing a multi-step uncertainty propagation method for systems with state- and control-dependent uncertainties. System uncertainty creates a mismatch between the actual system and its control-oriented model. Often, these uncertainties are state- and control-dependent, such as modeling error. This uncertainty propagates over time and results in significant errors over a given time horizon, which can disrupt the operation of safety-critical systems. Stochastic predictive control methods can ensure that the system stays within the safe region with a given probability, but requires prediction of the future state distributions of the system over the horizon. Predicting the future state distribution of systems with state- and control-dependent uncertainty is a difficult task. Existing methods only focus on modeling the current or one-step uncertainty, while the uncertainty propagation model over a horizon is generally over-approximated. Hence, we present a multi-step Gaussian process regression method to learn the uncertainty propagation model for systems with state- and control-dependent uncertainties. We also perform a case study on vehicle lateral control problems, where we learn the vehicle’s error propagation model during lane changes. Simulation results show the efficacy of our proposed method.

1 Introduction

Safety is one of the major concerns of the current transportation system, which has propelled research in controller design for automated vehicles (AVs). Here, every AV needs to evaluate its own control solution while avoiding the vehicles in its surrounding. AVs are generally controlled in a receding horizon fashion, which allows them to utilize information about the surrounding environment. Generally, the overall control architecture involves a high-level sampling-based trajectory planner and a low-level controller that follows the high-level waypoints [1,2]. It is imperative for any motion planning and motion control algorithm to ensure the vehicle’s safety by making sure the generated trajectories do not lead to a collision in the presence of uncertainty. To ensure safety, the decision-making algorithms should ensure that the probability with which the state distributions violate the constraints (e.g., collision avoidance) is below a given threshold. Predicting the future state probability distributions over a decision-making horizon is extremely difficult in the presence of nonlinearity in system dynamics and state- and control-dependent uncertainties, such as modeling uncertainties.

Uncertainties, including state- and control-dependent uncertainties, are common in vehicular systems. The source of these uncertainties includes uncertainty in the system parameters, state- and control-dependent noise and uncertainty, modeling error, approximation error, or other sources. These uncertainties create a mismatch between the actual system and the control-oriented model. For predictive control methods, this mismatch propagates over time and results in a significant error in long-horizon predictions. Here, we refer to the collective effect of these sources as a “mismatch” or “error.” This error can impede the operation of safety-critical systems and should be carefully quantified and included in any safety considerations. Hence, in this paper, we focus on developing an approach to model the distribution of the error over a horizon in the presence of state- and control-dependent uncertainties.

Gaussian process regression (GPR) [3,4] is a popular approach for learning the state- and control-dependent uncertainties. GPR is a flexible non-parametric model used to represent functions. It assumes that function values at any set of inputs are jointly distributed according to a multivariate normal distribution, with the mean and covariance determined by the training data. To improve computational efficiency, sparse GP approximations [5,6] have also been developed. However, these methods focus on learning the uncertainty in the current time instant or the one-step error/uncertainty, which influences the state distribution in the next time instant, but not the propagation of uncertainty over a given finite horizon. Since stochastic predictive control methods (such as stochastic model predictive control) require the error distribution over a horizon, learning one-step uncertainty is not sufficient. Even for linear systems with state- and control-dependent uncertainties, the future error distribution cannot be analytically obtained. Hence, existing methods tend to approximate it. In Ref. [7], the authors employed experimental data to find a learning-based extended kinematic model, which includes the same states as a dynamic model, and then used GP models to calculate the discrepancies between the two. However, the discrepancy is only considered in the next time-step, and the propagation of this discrepancy over time (as is pertinent to model-based controllers) is not studied. In Ref. [8], the authors modeled the entire dynamics of the vehicle with GP and incorporated the model in model predictive control (MPC). This approach disregards any information we have from first principle modeling and replaces the physics of the vehicle with black-box models. Furthermore, as the authors have mentioned, such models are not suitable for safety-constrained control, and further approximations and simplifications were done to reduce complexity and computational burden.

Some recent methods have approximated the error propagation model by linearizing system dynamics and propagating the one-step uncertainty over the horizon [3,4,9]. This can add approximation error in the propagation model. Other existing decision-making algorithms make many hard assumptions, such as linear dynamics with Gaussian uncertainty [10,11], or are overly conservative when dealing with these issues, since they consider worst-case situations and over-approximate the state distributions [1215]. Such over-approximations reduce the solution space and decrease AV performance (e.g., increased time to reach the goal or high energy usage) and can even lead to infeasible solutions [16].

To address the research gaps, in this paper, an effort has been made to develop an approach to learn the state- and control-dependent system uncertainty and its propagation of a finite horizon. We develop a multi-stage GPR approach, where we not only learn the uncertainty in the current time instant but also its propagation over a given horizon. The main contributions of the paper include the development of a new multi-step GPR-based method to learn the uncertainty propagation model for systems with state- and control-dependent uncertainties. The contribution also includes a case study considering an AV lane change problem.

2 Problem Description

Consider a physical system that can be modeled as
(1)
where x(·) is the state variable, u(·) is the input to the system, θ is a set of parameters in the physical system (e.g., mass or the moment of inertia), and d and w are random (potentially Gaussian) noises. In this context, the term “actual system” is used to represent either the physical system or an accurate physics-based model of the system.
For control purposes, a simplified model is often required that can be easily used in optimization and prediction stages. An example of such a simplified model can be represented as
(2)
and is referred to as the “simple model” (though it is not required to be simple in any sense, nor is it even required to be linear). Any model that is suitable for the problem at hand would qualify as a simple model. This model may be based on first principle modeling, model simplification methods (such as linearization, model order reduction, etc.), or input–output data. The simple model may also depend on a set of parameters θ′, which is an estimation of θ. There may be a mismatch between the actual system and the simple model, i.e., ε(k)=x(k)x(k), and this mismatch can propagate over time and result in a significant error in long-horizon predictions. In this paper, we focus on developing an approach to model the distribution of the error over a horizon H in the presence of state- and control-dependent uncertainty, i.e., model ρ(τ), τ ∈ [k, k + H] so that ε(τ)ρ(τ).

2.1 Lateral Vehicle Control.

In this paper, we consider a “bicycle” model of the vehicle with two degrees-of-freedom: the vehicle’s lateral position y and yaw angle Ψ. The lateral position is measured along the vehicle’s lateral axis to point O, which is the center of rotation. The yaw angle is measured with respect to the global X axis. The longitudinal velocity of the vehicle at the center of gravity is denoted by Vx. Applying Newton’s second law along the y axis, we obtain the equation of motion: m(y¨+Ψ˙Vx)=Fyf+Fyr. The moment about the z axis for the yaw dynamic can be expressed as follows: IzΨ¨=fFyfrFyr, where ℓf and ℓr are the distances of the front and rear tires from the center of gravity. We assume linear tire forces given by
(3)
where Cαf and Cαr are the cornering stiffness of each front and rear tire, δ is the front wheel steering angle, and θvf and θvr are the front tire and rear tire velocity angles. These angles can be found using the following approximations:
(4)
See Ref. [17] for details on derivation of the model.
The kinematic equations governing the lateral motion of the vehicle are given as
(5)
where X and Y are the global coordinates, ψ is the yaw angle (the orientation of the vehicle with respect to the global X axis), and δ and a, the two inputs to the system, are steering angle and acceleration.

The velocity V is an external variable and can be assumed to be a time-varying function or can be obtained from a longitudinal model.

This model will be utilized later to create potential trajectories for lane changes.

3 Approach

To address the challenge of mismatch between the actual system and the simple model, this paper employs GPR [3,4] to provide a novel method for estimating tight bounds on the mismatch error in a prediction horizon. To train the GPR model, the hyperparameters of the model must be estimated. These hyperparameters control the smoothness of the function and the amount of noise in the data. By training the GPR model on the data from the actual system and the simple model, the mismatch error can be modeled and quantified. This provides a valuable tool for controlling the actual system with greater accuracy and safety, as the error can be accurately predicted and accounted for in the control strategy.

The method presented in this paper consists of three steps:

  • Step 1 Generate a suitable set of inputs for the system.

  • Step 2 Apply the input signals to the simple model and actual system, starting from the same initial condition.

  • Step 3 Record the mismatch between the two as a function of input and initial condition.

  • Step 4 Train the GPR models on the mismatch error.

Here, we will explain every step in detail.

3.1 Input Sequence Generation.

To ensure accurate data collection, it is important to note that errors or mismatches are often dependent on both state and input. If a one-step prediction of the error is the sole objective, then a randomly generated set of initial conditions and input signals from an admissible set would suffice. However, for longer prediction horizons, especially in the case of predictive control, generating the necessary data becomes cumbersome. Even for a simple system with only ten input values, a ten-step prediction horizon would require 1010 input combinations, which is unfeasible to collect or train models on. Therefore, selecting input signals strategically to avoid overblowing the input domain is crucial.

Although it may appear sensible for the model to encompass every conceivable input combination, in reality, only a smaller subset of combinations is valid. For instance, in the lane change case study, gradually turning the steering to the right and then straightening it is the realistic steering combination for shifting to a lane on the right. Therefore, unrealistic input combinations that switch between left and right or high and low values should be avoided by utilizing knowledge of the system to obtain practical input combinations.

To generate a realistic set of inputs, we utilized the kinematic model of the system described in (5), where the inputs are the steering angle and acceleration. To generate an appropriate input signal, we make use of the differential flatness property of this model, as explained in Refs. [18,19]. Specifically, this kinematic model is differentially flat with respect to the flat output σ=[σ1σ2]=[xy], and the following output transformations can be used to determine a suitable set of inputs.
(6)
(7)
(8)
To generate a feasible and realistic trajectory for σ or [xy], we define a fifth-order polynomial for y and assume an affine function for x, with coefficients that are determined based on several conditions. These conditions include the initial and target lanes, speeds, and the desired angle of the car, which should be aligned with the road. A straight line can generate a path that connects the starting point and finishing points but cannot have any velocity variations or constraints. A third-order polynomial, with four constants, can guarantee starting and finishing points and velocities, but the direction of the car would depend on other variables and cannot be independently set. A fifth order is the lowest order of polynomials that can simultaneously determine position, velocity, and orientation of the car at start and finish points on the trajectory independently. By imposing these constraints on the trajectory, we can ensure that it is both feasible and realistic.
Considering y(t) = a0t5 + a1t4 + a2t3 + a3t2 + a4t + a5, the coefficients are obtained from the following conditions:
(9a)
(9b)
(9c)
where (9a) imposes a lane change on the system, so y0 is the location of the current lane and yf is the desired lane. Equation (9b) ensures that the car has no lateral velocity at the beginning and the end of the lane change, ensuring that we stay in the lane. The last equation (9c) ensures that the lateral speed remains unchanged. However, it should be noted that the longitudinal speed is assumed to remain constant during the lane change, resulting in the equation for x(t) to be given by
(10)
To generate a wide range of input signals from (8), we can sweep over various combinations of initial and final lateral positions (or more precisely, lateral change and direction), longitudinal speed, and final time.

3.2 Data Generation and Collection.

Suppose that we have explored a set of possible lane shifts, lane change times, and longitudinal speeds, resulting in nu trajectories and corresponding feasible input sequences. However, it is important to note that when these input sequences are applied to the actual system or the simplified model, they may not result in exactly the planned trajectory. Instead, they serve as a starting point for generating potentially realistic input signals. For each input signal ui(·) generated, we obtain nt different responses from the actual system, denoted as xji() to observe the effects of randomness in the system model. Let the length of this response be denoted by ni. For each time instant k=0,,niH, we use xji(k) as the initial condition and ui(k : k + H) as the input signal for the simple model to obtain the response zji(vx,xij(k),ui(k:k+H)).

The error, or the mismatch, is given as
(11)
for t=k+1,,k+h and h=1,,H. Therefore, for every step of the prediction horizon, we have collected a sample data set of errors.
In the lane change control problem, we have x=[yy˙ΨΨ˙] and x=[yy˙ΨΨ˙], so at time-step k, one-step prediction error would be ε(k+1), two step prediction error would be ε(k+1), and similarly, l step prediction would be ε(k+l), which is given as
(12)
in which x(k) = x′(k) (meaning the prediction, or the simple model, uses the current state of the actual system as a starting point).

Once the error data are collected for various state and input sequences, the data are randomly split between a training and a validation set. For every prediction horizon h, we train a GPR model with the input being, at every sample, [vxxij(k)ui(k:k+h)]. This would result in a total of H Gaussian process models for every step of the prediction, denoted as Gh(·). In practice, these models can be trained separately for every state of the system.

4 Simulation Results

To validate our proposed approach, we consider the actual system is governed by (4), where parameters are given as (refer to Refs. [20,21]
(13)
The elements of matrix A are also perturbed by Gaussian uncertainty and then discretized with sampling time dt = 0.5 s. A Gaussian measurement noise is also added to x(k) at every step. The simple control-oriented model is based on the same dynamic equations; however, the parameter values differ by 5%. It is also assumed that there is a measurement uncertainty in accessing Vx, so the value of Vx used in the simple model is also randomly perturbed.

Figure 1 depicts the effects of the perturbations in the simple model’s response and five trajectories generated by the actual model (using the same inputs and initial conditions). The trajectories are generated for lane changes of Δy = 3.2 and Δy = 6.4 for lane changes to the right and left. Longitudinal speed is considered to belong in {15, 20, 25, 30} and tf is 6, 7, and 8 s.

Fig. 1
Response from the simple model as well as five responses from the actual system
Fig. 1
Response from the simple model as well as five responses from the actual system
Close modal

After generating the data and splitting the training and test sets, the GPR models are trained to predict the error. The results for one-step prediction are given in Fig. 2 (error in y), Fig. 3 (error in y˙), Fig. 4 (error in Ψ), and Fig. 5 (error in Ψ˙). Prediction results for eight-step prediction (equivalent to 4 s) are given in Fig. 6 (error in y), Fig. 7 (error in y˙), Fig. 8 (error in Ψ), and Fig. 9 (error in Ψ˙). It can be seen that even though for a considerably longer prediction horizon (compared to the whole lane change that can be done in 12–16 steps), we still have tight margins for our prediction. It is also worth noting that, even though the example provided here had a linear model as the starting point, the presented method does not rely on linearity or any other assumptions on the system, nor does it rely on any simplification, making it a superior method of prediction, especially for predictive-based control.

Fig. 2
One-step error prediction results for y(k)
Fig. 2
One-step error prediction results for y(k)
Close modal
Fig. 3
One-step error prediction results for y˙(k)
Fig. 3
One-step error prediction results for y˙(k)
Close modal
Fig. 4
One-step prediction results for Ψ(k)
Fig. 4
One-step prediction results for Ψ(k)
Close modal
Fig. 5
One-step prediction results for Ψ˙(k)
Fig. 5
One-step prediction results for Ψ˙(k)
Close modal
Fig. 6
Eight-step prediction results for y(k)
Fig. 6
Eight-step prediction results for y(k)
Close modal
Fig. 7
Eight-step prediction results for y˙(k)
Fig. 7
Eight-step prediction results for y˙(k)
Close modal
Fig. 8
Eight-step prediction results for Ψ(k)
Fig. 8
Eight-step prediction results for Ψ(k)
Close modal
Fig. 9
Eight-step prediction results for Ψ˙(k)
Fig. 9
Eight-step prediction results for Ψ˙(k)
Close modal

Figure 10 compares the efficacy of our method to that of Ref. [9] for predictions in y(t). The first point in the set data was used as the initial condition. Then, for the propagation method, the subsequent predictions were computed and the results were compared to the actual data. We can see that our method can maintain a high level of accuracy, while the propagation method suffers as the prediction horizon increases.

Fig. 10
Prediction efficacy comparison
Fig. 10
Prediction efficacy comparison
Close modal

All simulations are done on a personal laptop. The computational burden of running the controller is low and can be done in real-time. The training of the GP requires more computational overhead, but as it is done offline and in advance, this does not pose a challenge to the applicability of the method. The training is done in python.

5 Conclusion

This paper presents a multi-step Gaussian process regression method for learning uncertainty propagation error in systems with state- and control-dependent uncertainties. The technique has been applied to the vehicle lateral control problem, where the error propagation model was learned during lane changes. Simulation results demonstrate the effectiveness of the proposed method in accurately predicting future state distributions and ensuring the safe operation of safety-critical systems. The presented method provides a significant improvement over existing approaches that only model current or one-step uncertainty and over-approximate the uncertainty propagation model over a horizon. The results suggest that the proposed approach has the potential to be applied to a wide range of safety-critical systems to improve their safety and performance.

Acknowledgment

This material is based upon work supported by the National Science Foundation (Grant No. 2130718).

Conflict of Interest

There are no conflicts of interest.

Data Availability Statement

The datasets generated and supporting the findings of this article are obtainable from the corresponding author upon reasonable request.

References

1.
Guanetti
,
J.
,
Kim
,
Y.
, and
Borrelli
,
F.
,
2018
, “
Control of Connected and Automated Vehicles: State of the Art and Future Challenges
,”
Annu. Rev. Control.
,
45
.
2.
Paden
,
B.
,
Čáp
,
M.
,
Zheng Yong
,
S.
,
Yershov
,
D.
, and
Frazzoli
,
E.
,
2016
, “
A Survey of Motion Planning and Control Techniques for Self-driving Urban Vehicles
,”
IEEE Trans. Intell. Veh.
,
1
(
1
), pp.
33
55
.
3.
Hewing
,
L.
,
Liniger
,
A.
, and
Zeilinger
,
M. N.
,
2018
, “
Cautious NMPC With Gaussian Process Dynamics for Autonomous Miniature Race Cars
,”
European Control Conference
,
Limassol, Cyprus
,
June 12–15
, pp.
1341
1348
.
4.
Hewing
,
L.
,
Kabzan
,
J.
, and
Zeilinger
,
M. N.
,
2019
, “
Cautious Model Predictive Control Using Gaussian Process Regression
,”
IEEE Trans. Control Syst. Technol.
,
28
(
6
), pp.
2736
2743
.
5.
Pan
,
Y.
,
Yan
,
X.
,
Theodorou
,
E. A.
, and
Boots
,
B.
,
2017
, “
Prediction Under Uncertainty in Sparse Spectrum Gaussian Processes With Applications to Filtering and Control
,”
International Conference on Machine Learning
,
Sydney Australia
,
Aug. 6–11
,
PMLR
, pp.
2760
2768
.
6.
Lázaro-Gredilla
,
M.
,
Quinonero-Candela
,
J.
,
Edward Rasmussen
,
C.
, and
Figueiras-Vidal
,
A. R.
,
2010
, “
Sparse Spectrum Gaussian Process Regression
,”
J. Mach. Learn. Res.
,
11
(
1
), pp.
1865
1881
.
7.
Ning
,
J.
, and
Behl
,
M.
,
2023
, “
Vehicle Dynamics Modeling for Autonomous Racing Using Gaussian Processes
,” arXiv:2306.03405.1809.04509
8.
Broderick
,
D. J.
,
2012
, “Dynamic Gaussian Process Models for Model Predictive Control of Vehicle Roll,” PhD thesis,
Auburn University
,
Auburn, AL
.
9.
Hewing
,
L.
,
Kabzan
,
J.
, and
Zeilinger
,
M. N.
,
2020
, “
Cautious Model Predictive Control Using Gaussian Process Regression
,”
IEEE Trans. Control Syst. Technol.
,
28
(
6
), pp.
2736
2743
.
10.
Gao
,
Y.
,
Gray
,
A.
,
Eric Tseng
,
H.
, and
Borrelli
,
F.
,
2014
, “
A Tube-Based Robust Nonlinear Predictive Control Approach to Semiautonomous Ground Vehicles
,”
Veh. Syst. Dyn.
,
52
(
6
), pp.
802
823
.
11.
Carson, III
,
J. M.
,
Açıkmeşe
,
B.
,
Murray
,
R. M.
, and
MacMartin
,
D. G.
,
2013
, “
A Robust Model Predictive Control Algorithm Augmented With a Reactive Safety Mode
,”
Automatica
,
49
(
5
), pp.
1251
1260
.
12.
Mayne
,
D. Q.
,
Kerrigan
,
E. C.
, and
Falugi
,
P.
,
2011
, “
Robust Model Predictive Control: Advantages and Disadvantages of Tube-Based Methods
,”
IFAC Proc. Vol.
,
44
(
1
), pp.
191
196
.
13.
Limon
,
D.
,
Alvarado
,
I.
,
Alamo
,
T.
, and
Camacho
,
E. F.
,
2010
, “
Robust Tube-Based MPC for Tracking of Constrained Linear Systems With Additive Disturbances
,”
J. Process Control
,
20
(
3
), pp.
248
260
.
14.
Paulson
,
J. A.
, and
Mesba
,
A.
,
2018
, “
Nonlinear Model Predictive Control With Explicit Backoffs for Stochastic Systems Under Arbitrary Uncertainty
,”
IFAC-PapersOnLine
,
51
(
20
), pp.
523
534
.
15.
Kousik
,
S.
,
Vaskov
,
S.
,
Johnson-Roberson
,
F. B. M.
, and
Vasudevan
,
R.
,
2020
, “
Bridging the Gap Between Safety and Real-Time Performance in Receding-Horizon Trajectory Design for Mobile Robots
,”
Int. J. Robot. Res.
,
39
(
12
), pp.
1419
1469
.
16.
Lewis Chiang
,
H.-T.
,
HomChaudhuri
,
B.
,
Vinod
,
A. P.
,
Oishi
,
M.
, and
Tapia
,
L.
,
2017
, “
Dynamic Risk Tolerance: Motion Planning by Balancing Short-Term and Long-Term Stochastic Dynamic Predictions
,”
International Conference on Robotics and Automation
,
Marina Bay Sands, Singapore
,
May 29–June 3
, pp.
3762
3769
.
17.
Rajamani
,
R.
,
2012
,
Vehicle Dynamics and Control
(
Mechanical Engineering Series
),
Springer US
,
Boston, MA
.
18.
van Nieuwstadt
,
M.
,
Rathinam
,
M.
, and
Murray
,
R. M.
,
1998
, “
Differential Flatness and Absolute Equivalence of Nonlinear Control Systems
,”
SIAM J. Control Optim.
,
36
(
4
), pp.
1225
1239
.
19.
Wang
,
M.
,
Wang
,
Z.
,
Paudel
,
S.
, and
Schwager
,
M.
,
2018
, “
Safe Distributed Lane Change Maneuvers for Multiple Autonomous Vehicles Using Buffered Input Cells
,”
IEEE International Conference on Robotics and Automation
,
Brisbane, Australia
,
May 21–25
, pp.
1
7
.
20.
Haddad
,
S.
,
Halder
,
A.
, and
Singh
,
B.
,
2022
, “
Density-Based Stochastic Reachability Computation for Occupancy Prediction in Automated Driving
,”
IEEE Trans. Control Syst. Technol.
,
30
(
6
), pp.
2406
2419
.
21.
Bakker
,
E.
,
Pacejka
,
H. B.
, and
Lidner
,
L.
,
1989
, “
A New Tire Model With an Application in Vehicle Dynamics Studies
,”
SAE Trans.
,
98
(
1
), pp.
101
113
.