Nonlinear Feedback Control

In nonlinear control problems, we have a system

dxdt=f(x,u).\frac{d^{}\boldsymbol{x}}{dt^{}} = f(\boldsymbol{x}, \boldsymbol{u}).

x\boldsymbol{x} is the state of the system, and u\boldsymbol{u} is the input to the system. Note that for simplicity, the system is time-invariant. Further assume, without loss of generality, that f(0,0)=0f(0, 0) = 0. The goal of nonlinear feedback control is to find a state feedback law α(x)\alpha(\boldsymbol{x}) such that the equilibrium point xe=0\boldsymbol{x}_e = 0 is globally asymptotically stable for the closed loop system

dxdt=f(x,α(x)).\frac{d^{}\boldsymbol{x}}{dt^{}} = f(\boldsymbol{x}, \alpha(\boldsymbol{x})).

Sometimes, the control impacts the state evolution in an affine manner.

Definition 52

A control affine system is given by the differential equation

dxdt=f(x)+G(x)u\frac{d^{}\boldsymbol{x}}{dt^{}} = f(\boldsymbol{x}) + G(\boldsymbol{x})\boldsymbol{u}

where G(x)G(\boldsymbol{x}) is a matrix dependent on the state vector x\boldsymbol{x}.

When designing controllers, there is a wide variety of techniques we can use. Some simple techniques involve canceling out various types of nonlinearities in the system using the input. Here are some examples.

  1. Set u\boldsymbol{u} such that it cancels out nonlinear terms and adds a stable linear term, effectively making the nonlinear system behave linear in the closed loop.

  2. Set u\boldsymbol{u} to cancel destabilitizing nonlinear terms and add a stable linear term, so the stable nonlinearities help the input drive the system to equilibrium.

  3. Set u\boldsymbol{u} to cancel destabilizing nonlinear terms, so the nonlinear system dynamics drive the system to equilibrium.

  4. Set u\boldsymbol{u} to dominate destabilizing terms so they have a minimal impact on the overall system behavior.

While these techniques can work, there are also more principled ways of designing controllers to satisfy different criteria, particularly for the case of control affine systems.

Control Lyapunov Functions

If we can find an α(x)\alpha(\boldsymbol{x}) that makes the origin globally asymptotically stable, then the converse Lyapunov theorem says that we can find a corresponding Lyapunov function for the system.

x0,dVdt<0    (xV)f(x,α(x))<0    u s.t (xV)f(x,u)<0infu(xV)f(x,u)<0\begin{aligned} \forall \boldsymbol{x} \neq 0, \frac{d^{}V}{dt^{}} < 0 &\implies (\nabla_{\boldsymbol{x}}V)^\top f(\boldsymbol{x}, \alpha(\boldsymbol{x})) < 0 \\ &\implies \exists \boldsymbol{u} \text{ s.t } (\nabla_{\boldsymbol{x}}V)^\top f(\boldsymbol{x}, \boldsymbol{u}) < 0\\ & \Leftrightarrow \inf_{\boldsymbol{u}} (\nabla_{\boldsymbol{x}}V)^\top f(\boldsymbol{x}, \boldsymbol{u}) < 0\end{aligned}

This result motivates the following definition.

Definition 53

A continuously differentiable, PDF, radially unbounded V:RnRV: \mathbb{R}^n \to \mathbb{R} is a Control Lyapunov Function for the system dxdt=f(x,u)\frac{d^{}\boldsymbol{x}}{dt^{}} = f(\boldsymbol{x}, \boldsymbol{u}) if

x0,infu(xV)f(x,u)<0\forall \boldsymbol{x} \neq 0, \inf_{\boldsymbol{u}} (\nabla_{\boldsymbol{x}}V)^\top f(\boldsymbol{x}, \boldsymbol{u}) < 0

Once we have a control lyapunov function, we can prove that it is possible to find a state feedback law that will make the origin globally asymptotically stable.

Theorem 27

Suppose ff is Lipschitz and VV is a control Lyapunov function, then there exists a smooth function α\alpha such that the origin is a globally asympototically stable equilibrium point of dxdt=f(x,α(x))\frac{d^{}\boldsymbol{x}}{dt^{}} = f(\boldsymbol{x}, \alpha(\boldsymbol{x})).

Suppose that we have a control affine system, and we want to construct a control lyapunov function for the system.

infuVxf(x,u)=infuLfV+iLgiVui<0\inf_{\boldsymbol{u}} \frac{\partial V}{\partial \boldsymbol{x}}f(\boldsymbol{x}, \boldsymbol{u}) = \inf_{\boldsymbol{u}} L_f V + \sum_i L_{g_i} V u_i < 0

Here, each gi(x)g_i(\boldsymbol{x}) is a column of G(x)G(\boldsymbol{x}). If i, LgiV=0\forall i,\ L_{g_i} V = 0, then Definition 53 is satisfied so long as LfV<0L_f V < 0.

Theorem 28

A function VV is a control lyapunov function for a control affine system

dxdt=f(x)+igi(x)ui\frac{d^{}\boldsymbol{x}}{dt^{}} = f(\boldsymbol{x}) + \sum_i g_i(\boldsymbol{x}) u_i

if

LgiVi    LfV0L_{g_i} V\, \forall i \implies L_f V \leq 0

Notice that the condition in Theorem 28 is essentially saying that the l2l_2-norm of the vector composed of the LgiVL_{g_i} V is equal to 0. The choice of CLF is important because different CLFs have different properties when used to derive controllers.

Definition 54

A CLF V(x)V(\boldsymbol{x}) satisfies the small control property if ϵ>0\forall \epsilon > 0, δ>0\exists \delta > 0 such that xBδ(0)\boldsymbol{x}\in B_\delta(0), then if x0,uBϵ(0)\boldsymbol{x}\neq0, \exists \boldsymbol{u}\in B_\epsilon(0) satisfying

dVdt=LfV+LGVTu<0.\frac{d^{}V}{dt^{}} = L_fV + L_GV^T\boldsymbol{u} < 0.

The small control property means that CLF will lead to a controller which has a small value that does not get too large when close to the equilibrium.

Given a control lyapunov function for a control affine system V(x,u)V(\boldsymbol{x}, \boldsymbol{u}), we can devise a controller which stabilizes the system. In particular, we need

dVdt(x,u)=LfV(x)+LGV(x)u0.\frac{dV}{dt}(x, u) = L_f V(\boldsymbol{x}) + L_GV(\boldsymbol{x})^\top \boldsymbol{u} \leq 0.

Hence, let

u={0,if LfV<0,(LGVLGV)1(LfVLGV),if LfV>0.\boldsymbol{u} = \begin{cases} 0, & \text{if } L_f V < 0,\\ (L_GVL_GV^\top)^{-1}(-L_fV L_GV^\top), & \text{if } L_f V > 0. \end{cases}

When the plant dynamics are naturally stabilizing, this controller exerts no control effort. When the plant dynamics are not naturally stabilizing, then the controller applies some control to stabilize the system. We can show that this is a minimum norm controller as it solves the optimization problem

minuus.tLfV+LGVu0.\begin{aligned} \min &\quad \boldsymbol{u}^\top \boldsymbol{u}\\ \text{s.t} &\quad L_fV + L_GV^\top \boldsymbol{u} \leq 0. \end{aligned}

Another type of controller is known as the Sontag controller.

Theorem 29

Suppose V:RnRV:\mathbb{R}^n\to\mathbb{R} is a CLF for SISO control affine system dxdt=f(x)+g(x)u\frac{d^{}\boldsymbol{x}}{dt^{}} = f(\boldsymbol{x}) + g(\boldsymbol{x})u where f(0)=0f(0) = 0 where ff and gg are Lipschitz. Then the Sontag feedback control law is given by

αS(x)={LfV(LfV)2+(LgV)4LgV,if LgV0,0,else.\alpha_S(\boldsymbol{x}) = \begin{cases} \frac{-L_fV - \sqrt{(L_fV)^2 + (L_gV)^4}}{L_gV}, & \text{if } L_gV \neq 0,\\ 0, & \text{else.} \end{cases}

makes the origin globally asymptotically stable. Moreover, αS(x)\alpha_S(\boldsymbol{x}) is continuous everywhere except x=0\boldsymbol{x} = 0, is continuous at x=0\boldsymbol{x}=0 if VV satisfies the small control property, and if V(x)V(\boldsymbol{x}) is K+1K+1 times continuously differentiable and f(x),g(x)f(\boldsymbol{x}), g(\boldsymbol{x}) are KK times continously differentiable x0\forall \boldsymbol{x}\neq0, then αS\alpha_S is KKtimes continuously differentiable.

Feedback Linearization

Since we have a large number of tools which allow us to control linear systems, it would be ideal if we could somehow leverage those tools for nonlinear control. Feedback linearization is the process of finding a feedback control law u=α(x)\boldsymbol{u} = \alpha(\boldsymbol{x}) such that under a nonlinear change of coordinates z=Φ(x)z = \Phi(\boldsymbol{x}), the system dxdt=f(x,α(x))\frac{d^{}\boldsymbol{x}}{dt^{}} = f(\boldsymbol{x}, \alpha(\boldsymbol{x})) behaves like a linear system dzdt=Az\frac{d^{}\boldsymbol{z}}{dt^{}} = A\boldsymbol{z}. When the system is control-affine, there are well-established results which help us do this.

SISO Case

Suppose we have a SISO control-affine system

dxdt=f(x)+g(x)uy=h(x)\begin{aligned} \frac{d^{}\boldsymbol{x}}{dt^{}} &= f(\boldsymbol{x}) + g(\boldsymbol{x})u\\ y &= h(\boldsymbol{x}) \end{aligned}

Definition 55

A SISO control affine system with an equilibrium point xe\boldsymbol{x}_e has strict relative degree γ\gamma if in a neighborhood UU around the equilibirum point, LgLfγ1h(x)L_gL_f^{\gamma-1}h(\boldsymbol{x}) is bounded away from 0 and

i=0,,γ2, LgLfγ1h(x)=0\forall i=0,\cdots,\gamma-2,\ L_gL_f^{\gamma - 1}h(\boldsymbol{x}) = 0

To understand relative degree, suppose we differentiate yy once

dydt=Lfh+Lghu.\frac{d^{}y}{dt^{}} = L_fh + L_gh u.

If xU, Lgh(x)=0\forall \boldsymbol{x}\in U,\ L_gh(\boldsymbol{x}) = 0 where UU is some region around the equilibrium, then

xU, dydt=Lfh(x).\forall \boldsymbol{x}\in U,\ \frac{d^{}y}{dt^{}} = L_fh(\boldsymbol{x}).

If we differentiate again, then

xU, d2ydt2=Lf2h(x)+LgLfh(x).\forall \boldsymbol{x}\in U,\ \frac{d^{2}y}{dt^{2}} = L_f^2h(\boldsymbol{x}) + L_gL_fh(\boldsymbol{x}).

Suppose that xU, LgLfh(x)=0\forall \boldsymbol{x}\in U,\ L_gL_fh(\boldsymbol{x}) = 0, then we can differentiate again. At some point, after γ\gamma differentiations, we will get

xU, dγydtγ=Lfγh(x)+LgLfγ1h(x)u.\forall \boldsymbol{x}\in U,\ \frac{d^{\gamma}y}{dt^{\gamma}} = L_f^\gamma h(\boldsymbol{x}) + L_gL_f^{\gamma-1}h(\boldsymbol{x})u.

Therefore, the relative degree of the system is essentially telling us which derivative of the output that we can control. By sequentially taking derivatives, we are essentially looking at the system

y=h(x)dydt=Lfh(x)d2ydt2=Lf2h(x)dγydtγ=Lfγh(x)+LgLfγ1h(x)\begin{aligned} y &= h(\boldsymbol{x})\\ \frac{d^{}y}{dt^{}} &= L_fh(\boldsymbol{x})\\ \frac{d^{2}y}{dt^{2}} &= L_f^2h(\boldsymbol{x})\\ &\vdots\\ \frac{d^{\gamma}y}{dt^{\gamma}} &= L_f^\gamma h(\boldsymbol{x}) + L_gL_f^{\gamma-1}h(\boldsymbol{x}) \end{aligned}

Suppose i=0,,γ1,\forall i=0,\cdots,\gamma-1, we let ξi(x)=diydti\xi_i(\boldsymbol{x}) = \frac{d^{i}y}{dt^{i}}. These are γ\gamma linearly independent coordinates. Since the distribution

Δ(x)=span{g(x)}\Delta(\boldsymbol{x}) = \text{span}\{g(\boldsymbol{x})\}

is involutive, it is integrable, and so there must be n1n-1 functions ηi\eta_i such that

xU, (xηi)g(x)=0.\forall \boldsymbol{x}\in U,\ (\nabla_{\boldsymbol{x}}\eta_i)^\top g(\boldsymbol{x}) = 0.

We can now choose nγn-\gamma of them which are linearly independent of the ξi\xi_i and linearly independent with each other, and this forms a change of coordinates

[ξ1ξγη1ηnγ]=Φ(x)=[h(x)Lfh(x)Lfγ1η1ηnγ].\begin{bmatrix} \xi_1\\ \vdots\\ \xi_{\gamma}\\ \eta_1\\ \vdots\\ \eta_{n-\gamma} \end{bmatrix} = \Phi(\boldsymbol{x}) = \begin{bmatrix} h(\boldsymbol{x})\\ L_fh(\boldsymbol{x})\\ \vdots\\ L_f^{\gamma-1}\\ \eta_1\\ \vdots\\ \eta_{n-\gamma} \end{bmatrix}.

This change of coordinates allows us to put the system into a canonical form.

Definition 56

The normal form of a SISO control affine system is given by

dξ1dt=ξ2dξ2dt=ξ3dξγdt=b(ξ,η)+a(η,ξ)udηdt=q(ξ,η),y=η1\begin{aligned} \frac{d^{}\xi_1}{dt^{}} &= \xi_2\\ \frac{d^{}\xi_2}{dt^{}} &= \xi_3\\ &\vdots\\ \frac{d^{}\xi_{\gamma}}{dt^{}} &= b(\boldsymbol{\xi}, \boldsymbol{\eta}) + a(\boldsymbol{\eta}, \boldsymbol{\xi})u\\ \frac{d^{}\boldsymbol{\eta}}{dt^{}} &= q(\boldsymbol{\xi}, \boldsymbol{\eta}),\\ y &= \eta_1 \end{aligned}

When the original system is given by dxdt=f(x)+g(x)u\frac{d^{}\boldsymbol{x}}{dt^{}} = f(\boldsymbol{x})+g(\boldsymbol{x})u, then

b(ξ,η)=Lfγh(Φ1(ξ,η))a(ξ,η)=LgLfγ1h(Φ1(ξ,η))b(\boldsymbol{\xi},\boldsymbol{\eta}) = L_f^\gamma h(\Phi^{-1}(\boldsymbol{\xi}, \boldsymbol{\eta})) \qquad a(\boldsymbol{\xi}, \boldsymbol{\eta}) = L_gL_f^{\gamma-1} h(\Phi^{-1}(\boldsymbol{\xi}, \boldsymbol{\eta}))

With this parameterization, it is quite easy to see how we can make our system behave linearly. In particular, choose

u=1a(ξ,η)(b(ξ,η)+v)u = \frac{1}{a(\boldsymbol{\xi}, \boldsymbol{\eta})}\left(-b(\boldsymbol{\xi}, \boldsymbol{\eta}) + v\right)

where vv is some control input. Then the system becomes

dξ1dt=ξ2dξ2dt=ξ3dξγdt=vdηdt=q(ξ,η)y=η1\begin{aligned} \frac{d^{}\xi_1}{dt^{}} &= \xi_2\\ \frac{d^{}\xi_2}{dt^{}} &= \xi_3\\ &\vdots\\ \frac{d^{}\xi_{\gamma}}{dt^{}} &= v\\ \frac{d^{}\boldsymbol{\eta}}{dt^{}} &= q(\boldsymbol{\xi}, \boldsymbol{\eta})\\ y &= \eta_1 \end{aligned}

, which is a linear system. Therefore, we can design a linear feedback controller v=α(x)v = \alpha(\boldsymbol{x}) where we have all of the tools of linear control at our disposal. However, notice that the ηi\eta_i cannot be impacted by the control effort. These are known as the internal dynamics of the system. When ξ=0\boldsymbol{\xi} = 0, then

dηdt=q(0,η)\frac{d^{}\boldsymbol{\eta}}{dt^{}} = q(0, \boldsymbol{\eta})

are known as the Zero Dynamics of the system. Zero dynamics for a system can be dangerous because if they are unstable, then the system could be blowing up. When γ=n\gamma = n, then there are no zero dynamics. When this happens, we say the system is Full State Linearizable. Fortunately, there are necessary and sufficient conditions which guarantee full state linearization.

Theorem 30

There exists a function hh such that a control affine system dxdt=f(x)+g(x)u\frac{d^{}\boldsymbol{x}}{dt^{}} = f(\boldsymbol{x}) + g(\boldsymbol{x})u has relative degree nn at x0\boldsymbol{x}_0 if and only if

[g(x)adfn2g(x)adfn1g(x)]\begin{bmatrix} g(\boldsymbol{x}) & \cdots & \text{ad}_f^{n-2}g(\boldsymbol{x}) & \text{ad}_f^{n-1}g(\boldsymbol{x}) \end{bmatrix}

has rank nn and

[g(x)adfn3g(x)adfn2g(x)]\begin{bmatrix} g(\boldsymbol{x}) & \cdots & \text{ad}_f^{n-3}g(\boldsymbol{x}) & \text{ad}_f^{n-2}g(\boldsymbol{x}) \end{bmatrix}

has rank n1n-1 and is involutive in the neighborhood of x0\boldsymbol{x}_0. The hh is chosen to satisfy

(xh)[g(x)adfn3g(x)adfn2g(x)]=0(\nabla_{\boldsymbol{x}}h)^\top \begin{bmatrix} g(\boldsymbol{x}) & \cdots & \text{ad}_f^{n-3}g(\boldsymbol{x}) & \text{ad}_f^{n-2}g(\boldsymbol{x}) \end{bmatrix} = 0

MIMO Case

Suppose instead we have a MIMO control affine system where

dxdt=f(x)+G(x)uy=h(x)\begin{aligned} \frac{d^{}\boldsymbol{x}}{dt^{}} &= f(\boldsymbol{x}) + G(\boldsymbol{x})\boldsymbol{u}\\ \boldsymbol{y} &= h(\boldsymbol{x}) \end{aligned}

We will assume that the number of outputs is equal to the number of inputs (i.e y,uRm\boldsymbol{y}, \boldsymbol{u} \in \mathbb{R}^m). To linearize the system, we can take the same idea of relative degree from the SISO case and apply it to the MIMO case. Define γj\gamma_j to be the lowest derivative of yjy_j which is impacted by at least one input.

[dγ1y1dtγ1dγmymdtγm]=[Lfγ1h1(x)Lfγmhm(x)]+A(x)u,A(x)=[Lg1Lfγ11h1(x)LgmLfγ11h1(x)Lg1Lfγm1hm(x)LgmLfγm1hm(x)]\begin{bmatrix} \frac{d^{\gamma_1}y_1}{dt^{\gamma_1}}\\ \vdots\\ \frac{d^{\gamma_m}y_m}{dt^{\gamma_m}} \end{bmatrix} = \begin{bmatrix} L_f^{\gamma_1}h_1(\boldsymbol{x})\\ \vdots\\ L_f^{\gamma_m}h_m(\boldsymbol{x})\\ \end{bmatrix} + A(\boldsymbol{x})\boldsymbol{u}, \qquad A(\boldsymbol{x}) = \begin{bmatrix} L_{g_1}L_f^{\gamma_1-1}h_1(\boldsymbol{x}) & \cdots & L_{g_m}L_f^{\gamma_1-1}h_1(\boldsymbol{x})\\ \vdots & \ddots & \vdots\\ L_{g_1}L_f^{\gamma_m-1}h_m(\boldsymbol{x}) & \cdots & L_{g_m}L_f^{\gamma_m-1}h_m(\boldsymbol{x}) \end{bmatrix}

Definition 57

A square control affine system has a vector relative degree (γ1,,γm)(\gamma_1, \cdots, \gamma_m) at x0U\boldsymbol{x}_0\in U if A(x0)A(\boldsymbol{x}_0) is nonsingular and

1im, 1jm, 0kγj2, xU, LgiLfkhj(x)=0\forall 1\leq i\leq m,\ 1\leq j \leq m,\ 0 \leq k \leq \gamma_j-2,\ \forall \boldsymbol{x}\in U,\ L_{g_i}L_f^kh_j(\boldsymbol{x}) = 0

\label{thm:vector-relative-degree}

As before, we can assign diyjdti=ξij\frac{d^{i}y_j}{dt^{i}} = \xi_i^j as a partial change of coordinates and then choose linearly independent η\boldsymbol{\eta}.

Definition 58

The normal form of a square MIMO system is given by

dηdt=q(ξ,η)+p(ξ,η)udξijdt=ξi+1j,j, i<γj1dξγj1jdt=bj(ξ,η)+aj(ξ,η)u\begin{aligned} \frac{d^{}\boldsymbol{\eta}}{dt^{}} &= q(\boldsymbol{\xi},\boldsymbol{\eta}) + p(\boldsymbol{\xi}, \boldsymbol{\eta})\boldsymbol{u}\\ \frac{d^{}\xi_i^j}{dt^{}} &= \xi_{i+1}^j, &\quad \forall j,\ \forall i < \gamma_j-1\\ \frac{d^{}\xi_{\gamma_j-1}^j}{dt^{}} &= b^j(\boldsymbol{\xi},\boldsymbol{\eta}) + \boldsymbol{a}^j(\boldsymbol{\xi},\boldsymbol{\eta})^\top\boldsymbol{u} \end{aligned}

As before the dηdt\frac{d^{}\boldsymbol{\eta}}{dt^{}} represent the internal dynamics of the system that are not impacted by the control. As with the linear case, we can design a controller

u=A1(x)([Lfγ1h1(x)Lfγmhm(x)]+v)\boldsymbol{u} = A^{-1}(\boldsymbol{x})\left( \begin{bmatrix} L_f^{\gamma_1}h_1(\boldsymbol{x})\\ \vdots\\ L_f^{\gamma_m}h_m(\boldsymbol{x})\\ \end{bmatrix} + \boldsymbol{v}\right)

which renders the system linear. We can now choose v\boldsymbol{v} where each entry of v\boldsymbol{v} controls a different output. For this reason, we call A(x)A(\boldsymbol{x}) the decoupling matrix. As in the SISO case, unless jγj=n\sum_j \gamma_j = n, there are zero dynamics to the system.

Theorem 31

A control affine square system dxdt=f(x)+G(x)u\frac{d^{}\boldsymbol{x}}{dt^{}} = f(\boldsymbol{x}) + G(\boldsymbol{x})\boldsymbol{u} has vector relative degree jγj=n\sum_j \gamma_j = n if and only if Δi\Delta_i is involutive for all in2i \leq n-2, Δi\Delta_i has constant rank for all 1in11 \leq i \leq n-1 and Δn1\Delta_{n-1} has rank nn where

Δ0(x)=span{g1(x),,gm(x)}Δi(x)=span{adfkgi(x)  0ki,1jm},1in1\begin{aligned} \Delta_0(\boldsymbol{x}) &= \text{span}\{g_1(\boldsymbol{x}),\cdots,g_m(\boldsymbol{x})\}\\ \Delta_i(\boldsymbol{x}) &= \text{span}\{\text{ad}_f^kg_i(\boldsymbol{x})\ |\ \forall 0 \leq k \leq i, 1 \leq j \leq m\}, &\quad \forall 1 \leq i \leq n-1 \end{aligned}

Dynamic Extension

Sometimes, we can use full-state linearization even if hh does not satisfy the conditions in Theorem 30. We do this by adding additional states to the system and corresponding pseudo-control inputs which help control these states. Sometimes, this can be done in a way which makes the extended system full-state linearizable.

Sliding Mode Control

In sliding mode control, we design a controller

u=β(x)sgn(s)u = \beta(\boldsymbol{x})\text{sgn}(s)

where s(x)s(\boldsymbol{x}) describes a manifold called the “sliding manifold”. Sliding mode controllers have two states

  1. Reaching Mode

  2. Sliding Mode

During the reaching mode, the controller drives the state towards the sliding manifold s(x)=0s(\boldsymbol{x}) = 0. We choose ss such that on the manifold, when the system is in sliding mode, the system naturally converges asymptotically to the equilibrium. If s(x)=0s(\boldsymbol{x}) = 0 is an invariant manifold, then the system will smoothly travel along the manifold to equilibrium. If the sliding manifold is not invariant, then the state will chatter around the manifold towards equilibrium as the controller continuously drives it back to the manifold once it leaves. To choose ss, we need to find a CLF V(s)V(s) which converges to 00 in finite time when applying the sliding mode controller.

Backstepping

Definition 59

A system expressed in strict feedback form is given by

dxdt=f0(x)+g0(x)ξ1dξ1dt=f1(x,ξ1)+g1(x,ξ1)ξ2dξkdt=fk(x,ξ1,,ξk)+gk(x,ξ1,,ξk)u\begin{aligned} \frac{d^{}\boldsymbol{x}}{dt^{}} &= f_0(\boldsymbol{x}) + g_0(\boldsymbol{x})\xi_1\\ \frac{d^{}\xi_1}{dt^{}} &= f_1(\boldsymbol{x}, \xi_1) + g_1(\boldsymbol{x}, \xi_1)\xi_2\\ &\vdots\\ \frac{d^{}\xi_k}{dt^{}} &= f_k(\boldsymbol{x}, \xi_1,\cdots,\xi_k) + g_k(\boldsymbol{x}, \xi_1,\cdots,\xi_k)u\\ \end{aligned}

When systems are expressed in this way, we have a convenient method of designing controllers.

Theorem 32 (Backstepping Lemma)

Suppose there is a continuously differentiable u=α(x)u=\alpha(\boldsymbol{x}) and a CLF V(x)V(\boldsymbol{x}) such that

LfV+LgVαWL_fV + L_gV\alpha \leq -W

where WW is a positive semi-definite function for the system dxdt=f(x)+g(x)u\frac{d^{}\boldsymbol{x}}{dt^{}} = f(\boldsymbol{x})+g(\boldsymbol{x})u. Then for the system

dxdt=f(x)+g(x)ξdξdt=u\begin{aligned} \frac{d^{}\boldsymbol{x}}{dt^{}} &= f(\boldsymbol{x}) + g(\boldsymbol{x})\xi\\ \frac{d^{}\xi}{dt^{}} &= u \end{aligned}

the function

Va(x,ξ)=V(x)+12(ξα(x))2V_a(\boldsymbol{x},\xi) = V(\boldsymbol{x}) + \frac{1}{2}(\xi - \alpha(\boldsymbol{x}))^2

is a valid CLF and the control input

u=c(ξα(x))+(xα)(f(x)+g(x)ξ)(xV)g(x),c>0u = -c(\xi-\alpha(\boldsymbol{x})) + (\nabla_{\boldsymbol{x}}\alpha)^\top(f(\boldsymbol{x}) + g(\boldsymbol{x})\xi) - (\nabla_{\boldsymbol{x}}V)^\top g(\boldsymbol{x}), \qquad c > 0

is a stabilizing controller.

If we apply Theorem 32 to a system expressed in strict feedback form, then we can recursively define controllers until we arrive at a controller for the full system.

Last updated