Paper Title
Learning Discrete-Time Uncertain Nonlinear Systems with Probabilistic Safety and Stability Constraints. pdf
Authors
Iman Salehi, Tyler Taplin, and Ashwin Dani
Date
October 2022
Published in
IEEE Open Journal of Control Systems
A block diagram that illustrates the information flow and procedure for the discrete-time dynamical system model learning method
Abstract: This paper presents a discrete-time dynamical system model learning method from demonstration while providing probabilistic guarantees on the safety and stability of the learned model. The controlled dynamic model of a discrete-time system with a zero-mean Gaussian process noise is approximated using an Extreme Learning Machine (ELM) whose parameters are learned subject to chance constraints derived using a discrete-time control barrier function and discrete-time control Lyapunov function in the presence of the ELM reconstruction error. To estimate the ELM parameters a quadratically constrained quadratic program (QCQP) is developed subject to the constraints that are only required to be evaluated at sampled points. Simulations validate that the system model learned using the proposed method can reproduce the demonstrations inside a prescribed safe set while converging to the desired goal location starting from various different initial conditions inside the safe set. Furthermore, it is shown that the learned model can adapt to changes in goal location during reproductions without violating the stability and safety constraints.
Illustration of the ELM model learning using probabilistic safety and stability constraints on joint positions data of a 2 degrees-of-freedom planar robot.
Evolution of the joint angles for the planar robot simulation using the learned Extreme Learning Machine parameters
Model BendedLine from the LASA dataset, learned with probabilistic safety and stability constraints for ellipse set.
Trajectories of the wheeled mobile robot learned using a constrained learning method subject to probabilistic safety and stability constraints. The boundary of the safe set is shown in a dark gray ellipsoid, and random samples are selected from a more extensive set shown in the light gray ellipsoid.