Sunday, October 11, 2009

state observer

In control theory, a state observer is a system that models a real system in order to provide an estimate of its internal state, given measurements of the input and output of the real system. It is typically a computer-implemented mathematical model.
Knowing the system state is necessary to solve many control theory problems; for example, stabilizing a system using state feedback. In most practical cases, the physical state of the system cannot be determined by direct observation. Instead, indirect effects of the internal state are observed by way of the system outputs. A simple example is that of vehicles in a tunnel: the rates and velocities at which vehicles enter and leave the tunnel can be observed directly, but the exact state inside the tunnel can only be estimated. If a system is observable, it is possible to fully reconstruct the system state from its output measurements using the state observer.

Typical observer model

The state of a physical discrete-time system is assumed to satisfy
\mathbf{x}(k+1) = A \mathbf{x}(k) + B \mathbf{u}(k) \mathbf{y}(k) = C \mathbf{x}(k) + D \mathbf{u}(k)
where, at time k, \mathbf{x}(k) is the plant's state; \mathbf{u}(k) is its inputs; and \mathbf{y}(k) is its outputs. These equations simply say that the plant's current outputs and its future state are both determined solely by its current state and the current inputs. (Although these equations are expressed in terms of discrete time steps, very similar equations hold for continuous systems). If this system is observable then the output of the plant, \mathbf{y}(k), can be used to steer the state of the state observer.
The observer model of the physical system is then typically derived from the above equations. Additional terms may be included in order to ensure that, on receiving successive measured values of the plant's inputs and outputs, the model's state converges to that of the plant. In particular, the output of the observer may be subtracted from the output of the plant and then multiplied by a matrix L; this is then added to the equations for the state of the observer to produce a so-called Luenberger observer, defined by the equations below. Note that the variables of a state observer are commonly denoted by a "hat": \mathbf{\hat{x}}(k) and \mathbf{\hat{y}}(k) to distinguish them from the variables of the equations satisfied by the physical system.
\mathbf{\hat{x}}(k+1) = A \mathbf{\hat{x}}(k) + L \left[\mathbf{y}(k) - \mathbf{\hat{y}}(k)\right] + B \mathbf{u}(k) \mathbf{\hat{y}}(k) = C \mathbf{\hat{x}}(k) + D \mathbf{u}(k)
The observer is called asymptotically stable if the observer error \mathbf{e}(k) = \mathbf{\hat{x}}(k) - \mathbf{x}(k) converges to zero when  k \rightarrow \infty . For a Luenberger observer, the observer error satisfies  \mathbf{e}(k+1) = (A - LC) \mathbf{e}(k). The Luenberger observer for this discrete-time system is therefore asymptotically stable when the matrix ALC has all the eigenvalues inside the unit circle.
For control purposes the output of the observer system is fed back to the input of both the observer and the plant through the gains matrix K.
\mathbf{u(k)}= -K \mathbf{\hat{x}}(k)
The observer equations then become:
\mathbf{\hat{x}}(k+1) = A \mathbf{\hat{x}}(k) + L \left(\mathbf{y}(k) - \mathbf{\hat{y}}(k)\right) - B K \mathbf{\hat{x}}(k) \mathbf{\hat{y}}(k) = C \mathbf{\hat{x}}(k) - D K \mathbf{\hat{x}}(k)
or, more simply,
\mathbf{\hat{x}}(k+1) = \left(A - B K \right) \mathbf{\hat{x}}(k) + L \left(\mathbf{y}(k) - \mathbf{\hat{y}}(k)\right) \mathbf{\hat{y}}(k) = \left(C - D K\right) \mathbf{\hat{x}}(k)
Due to the separation principle we know that we can choose K and L independently without harm to the overall stability of the systems. As a rule of thumb, the poles of the observer ALC are usually chosen to converge 10 times faster than the poles of the system ABK.

Continuous-time case

The previous example was for an observer implemented in a discrete-time LTI system. However, the process is similar for the continuous-time case; the observer gains L are chosen to make the continuous-time error dynamics converge to zero asymptotically (i.e., when ALC is a Hurwitz matrix).
For a continuous-time linear system
\mathbf{\dot{x}} = A \mathbf{x}+ B \mathbf{u},
\mathbf{y} = C \mathbf{x}, ,
where \mathbf{x} \in \mathbb{R}^n, \mathbf{u} \in \mathbb{R}^m ,\mathbf{y} \in \mathbb{R}^r, the observer looks similar to discrete-time case described above:
\mathbf{\dot{\hat{x}}} = A \mathbf{\hat{x}}+ B \mathbf{u} + L \left(\mathbf{y} - C \mathbf{\hat{x}}\right) .
The observer error \mathbf{e}=\mathbf{\hat{x}}-\mathbf{x} satisfies the equation
 \mathbf{\dot{e}} = (A - LC) \mathbf{e}.
The eigenvalues of the matix ALC can be made arbitrary by appropriate choice of the observer gain L when the pair [A,C] is observable, i.e. observability condition holds. In particular, it can be made Hurwitz, so the observer error e(t) \rightarrow 0 when t \rightarrow \infty.

Peaking and other observer methods

When the observer gain L is high, the linear Luenberger observer converges to the system states very quickly. However, high observer gain leads to a peaking phenomenon in which initial estimator error can be prohibitively large (i.e., impractical or unsafe to use).[1] As a consequence, nonlinear high gain observer methods are available that converge quickly without the peaking phenomenon. For example, sliding mode control can be used to design an observer that brings one estimated state's error to zero in finite time even in the presence of measurement error; the other states have error that behaves similarly to the error in a Luenberger observer after peaking has subsided. Sliding mode observers also have attractive noise resilience properties that are similar to a Kalman filter.[2][3]

State observers for nonlinear systems

Sliding mode observers can be designed for the non-linear systems as well. For simplicity, first consider the no-input non-linear system:
\dot{\mathbf{x}} = f(\mathbf{x})
where \mathbf{x} \in \mathbb{R}^n. Also assume that there is a measurable output \mathbf{y} \in \mathbb{R} given by
\mathbf{y} = h(\mathbf{x}).
There are several non-approximate approaches for designing an observer. The two observers given below also apply to the case when the system has an input. That is,
\dot{\mathbf{x}} = f(\mathbf{x}) + B(\mathbf{x}) \mathbf{u},
\mathbf{y} = h(\mathbf{x}), .

Linearizable error dynamics

One suggested by Kerner and Isidori[4] and Krener and Respondek[5] can be applied in a situation when there exists a linearizing transformation (i.e., a diffeomorphism, like the one used in feedback linearization) \mathbf{z}=\Phi(\mathbf{x}) such that in new variables the system equations read
\dot{\mathbf{z}} = A \mathbf{z}+ \phi(\mathbf{y}),
\mathbf{y} = C\mathbf{z}.
The Luenberger observer is then designed as
\dot{\hat{\mathbf{z}}} = A \hat{\mathbf{z}}+ \phi(\mathbf{y}) - L \left(C \hat{\mathbf{z}}-\mathbf{y} \right) .
The observer error for the transformed variable \mathbf{e}=\mathbf{\hat{z}}-\mathbf{z} satisfies the same equation as in classical linear case.
 \mathbf{\dot{e}} = (A - LC) \mathbf{e}.
As shown by Gauthier, Hammouri, and Othman[6] and Hammouri and Kinnaert,[7] if there exists transformation \mathbf{z}=\Phi(\mathbf{x}) such that the system can be transformed into the form
\dot{\mathbf{z}} = A(u(t)) \mathbf{z}+ \phi(\mathbf{y},u(t) ),
\mathbf{y} = C\mathbf{z},
then the observer is designed as
\mathbf{\dot{\hat{z}}} = A(u(t)) \mathbf{\hat{z}}+ \phi(\mathbf{y},u(t) ) - L(t) \left(C \mathbf{\hat{z}}-\mathbf{y} \right) ,
where L(t) is a time-varying observer gain.

Sliding mode observer

As discussed for the linear case above, the peaking phenomenon present in Luenberger observers justifies the use of a sliding mode observer. The sliding mode observer uses non-linear high-gain feedback to drive estimated states to a hypersurface where there is no difference between the estimated output and the measured output. The non-linear gain used in the observer is typically implemented with a scaled switching function, like the signum (i.e., sign) of the estimated–measured output error. Hence, due to this high-gain feedback, the vector field of the observer has a crease in it so that observer trajectories slide along a curve where the estimated output matches the measured output exactly. So, if the system is observable from its output, the observer states will all be driven to the actual system states. Additionally, by using the sign of the error to drive the sliding mode observer, the observer trajectories become insensitive to many forms of noise. Hence, some sliding mode observers have attractive properties similar to the Kalman filter but with simpler implementation.[2][3]
As suggested by Drakunov,[8] a sliding mode observer can also be designed for a class of non-linear systems. Such an observer can be written in terms of original variable estimate \mathbf{\hat{x}} and has the form
 \mathbf{\dot{\hat{x}}} = \left [ \frac{\partial H(\mathbf{\hat{x}})}{\partial \mathbf{x}}
\right]^{-1} M(\mathbf{\hat{x}}) \, \operatorname{sign}( V(t) - H(\mathbf{\hat{x}}) )
where:
  • The \operatorname{sign}(\mathord{\cdot}) vector extends the scalar signum function to n dimensions. That is,
\operatorname{sign}(\mathbf{z}) = \begin{bmatrix}
\operatorname{sign}(z_1)\\
\operatorname{sign}(z_2)\\
\vdots\\
\operatorname{sign}(z_i)\\
\vdots\\
\operatorname{sign}(z_n)
\end{bmatrix}
for the vector \mathbf{z} \in \mathbb{R}^n.
  • The vector H(\mathbf{x}) has components that are the output function h(\mathbf{x}) and its repeated Lie derivatives. In particular,
H(\mathbf{x}) \triangleq
\begin{bmatrix}
h_1(\mathbf{x})\\
h_2(\mathbf{x})\\
h_3(\mathbf{x})\\
\vdots\\
h_n(\mathbf{x})
\end{bmatrix}
\triangleq
\begin{bmatrix}
h(\mathbf{x})\\
L_{f}h(\mathbf{x})\\
L_{f}^2 h(\mathbf{x})\\
\vdots\\
L_{f}^{n-1}h(\mathbf{x})
\end{bmatrix}
where L^i_f h is the ith Lie derivative of output function h along the vector field f (i.e., along \mathbf{x} trajectories of the non-linear system). In the special case where the system has no input or has a relative degree of n, H(\mathbf{x}(t)) is a collection of the output \mathbf{y}(t)=h(\mathbf{x}(t)) and its n − 1 derivatives. Because the inverse of the Jacobian linearization of H(\mathbf{x}) must exist for this observer to be well defined, the transformation H(\mathbf{x}) is guaranteed to be a local diffeomorphism.
M(\hat{\mathbf{x}}) \triangleq
\operatorname{diag}( m_1(\hat{\mathbf{x}}), m_2(\hat{\mathbf{x}}), \ldots, m_n(\hat{\mathbf{x}}) )
=
\begin{bmatrix}
m_1(\hat{\mathbf{x}}) & & & & & \\
& m_2(\hat{\mathbf{x}}) & & & & \\
& & \ddots & & & \\
& & & m_i(\hat{\mathbf{x}}) & &\\
& & & & \ddots &\\
& & & & & m_n(\hat{\mathbf{x}})
\end{bmatrix}
where, for each i \in \{1,2,\dots,n\}, element m_i(\hat{\mathbf{x}}) > 0 and suitably large to ensure reachability of the sliding mode.
  • The observer vector V(t) is such that
V(t)
\triangleq
\begin{bmatrix}v_{1}(t)\\
v_2(t)\\
v_3(t)\\
\vdots\\
v_i(t)\\
\vdots\\
v_{n}(t)
\end{bmatrix}
\triangleq
\begin{bmatrix}
\mathbf{y}(t)\\
\{ m_1(\hat{\mathbf{x}}) \operatorname{sign}( v_1(t) - h_1(\hat{\mathbf{x}}(t)) ) \}_{\text{eq}}\\
\{ m_2(\hat{\mathbf{x}}) \operatorname{sign}( v_2(t) - h_2(\hat{\mathbf{x}}(t)) ) \}_{\text{eq}}\\
\vdots\\
\{ m_{i-1}(\hat{\mathbf{x}}) \operatorname{sign}( v_{i-1}(t) - h_{i-1}(\hat{\mathbf{x}}(t)) ) \}_{\text{eq}}\\
\vdots\\
\{ m_{n-1}(\hat{\mathbf{x}}) \operatorname{sign}( v_{n-1}(t) - h_{n-1}(\hat{\mathbf{x}}(t)) ) \}_{\text{eq}}
\end{bmatrix}
where \operatorname{sign}(\mathord{\cdot}) here is the normal signum function defined for scalars, and \{ \ldots \}_{\text{eq}} denotes an "equivalent value operator" of a discontinuous function in sliding mode.
The idea can be briefly explained as follows. According to the theory of sliding modes, in order to describe the system behavior, once sliding mode starts, the function \operatorname{sign}( v_{i}(t)\!-\! h_{i}(\hat{\mathbf{x}}(t)) ) should be replaced by equivalent values (see equivalent control in the theory of sliding modes). In practice, it switches (chatters) with high frequency with slow component being equal to the equivalent value. Applying appropriate lowpass filter to get rid of the high frequency component on can obtain the value of the equivalent control, which contains more information about the state of the estimated system. The observer described above uses this method several times to obtain the state of the nonlinear system ideally in finite time.
The modified observation error can be written in the transformed states \mathbf{e}=H(\mathbf{x})-H(\mathbf{\hat{x}}). In particular,
\begin{align}
\dot{\mathbf{e}}
&=
\frac{\operatorname{d}}{\operatorname{d}t} H(\mathbf{x})
-
\frac{\operatorname{d}}{\operatorname{d}t} H(\hat{\mathbf{x}})\\
&=
\frac{\operatorname{d}}{\operatorname{d}t} H(\mathbf{x})
-
M(\hat{\mathbf{x}}) \, \operatorname{sign}( V(t) - H(\hat{\mathbf{x}}(t)) ),
\end{align}
and so
\begin{align}
\begin{bmatrix}
\dot{\mathbf{e}}_1\\
\dot{\mathbf{e}}_2\\
\vdots\\
\dot{\mathbf{e}}_i\\
\vdots\\
\dot{\mathbf{e}}_{n-1}\\
\dot{\mathbf{e}}_n
\end{bmatrix}
&=
\mathord{\overbrace{
\begin{bmatrix}
\dot{h}_1(\mathbf{x})\\
\dot{h}_2(\mathbf{x})\\
\vdots\\
\dot{h}_i(\mathbf{x})\\
\vdots\\
\dot{h}_{n-1}(\mathbf{x})\\
\dot{h}_n(\mathbf{x})
\end{bmatrix}
}^{\tfrac{\operatorname{d}}{\operatorname{d}t} H(\mathbf{x})}}
-
\mathord{\overbrace{
M(\hat{\mathbf{x}}) \, \operatorname{sign}( V(t) - H(\hat{\mathbf{x}}(t)) )
}^{\tfrac{\operatorname{d}}{\operatorname{d}t} H(\mathbf{\hat{x}})}}
=
\begin{bmatrix}
h_2(\mathbf{x})\\
h_3(\mathbf{x})\\
\vdots\\
h_{i+1}(\mathbf{x})\\
\vdots\\
h_n(\mathbf{x})\\
L_f^n h(\mathbf{x})
\end{bmatrix}
-
\begin{bmatrix}
m_1 \operatorname{sign}( v_1(t) - h_1(\hat{\mathbf{x}}(t)) )\\
m_2 \operatorname{sign}( v_2(t) - h_2(\hat{\mathbf{x}}(t)) )\\
\vdots\\
m_i \operatorname{sign}( v_i(t) - h_i(\hat{\mathbf{x}}(t)) )\\
\vdots\\
m_{n-1} \operatorname{sign}( v_{n-1}(t) - h_{n-1}(\hat{\mathbf{x}}(t)) )\\
m_n \operatorname{sign}( v_n(t) - h_n(\hat{\mathbf{x}}(t)) )
\end{bmatrix}\\
&=
\begin{bmatrix}
h_2(\mathbf{x}) - m_1(\hat{\mathbf{x}}) \operatorname{sign}( \mathord{\overbrace{ \mathord{\overbrace{v_1(t)}^{v_1(t) = y(t) = h_1(\mathbf{x})}} - h_1(\hat{\mathbf{x}}(t)) }^{\mathbf{e}_1}} )\\
h_3(\mathbf{x}) - m_2(\hat{\mathbf{x}}) \operatorname{sign}( v_2(t) - h_2(\hat{\mathbf{x}}(t)) )\\
\vdots\\
h_{i+1}(\mathbf{x}) - m_i(\hat{\mathbf{x}}) \operatorname{sign}( v_i(t) - h_i(\hat{\mathbf{x}}(t)) )\\
\vdots\\
h_n(\mathbf{x}) - m_{n-1}(\hat{\mathbf{x}}) \operatorname{sign}( v_{n-1}(t) - h_{n-1}(\hat{\mathbf{x}}(t)) )\\
L_f^n h(\mathbf{x}) - m_n(\hat{\mathbf{x}}) \operatorname{sign}( v_n(t) - h_n(\hat{\mathbf{x}}(t)) )
\end{bmatrix}.
\end{align}
So:
  1. As long as m_1(\mathbf{\hat{x}}) \geq |h_2(\mathbf{x}(t))|, the first row of the error dynamics, \dot{\mathbf{e}}_1 = h_2(\hat{\mathbf{x}}) - m_1(\hat{\mathbf{x}}) \operatorname{sign}( \mathbf{e}_1 ), will meet sufficient conditions to enter the e1 = 0 sliding mode in finite time.
  2. Along the e1 = 0 surface, the corresponding v_2(t) = \{m_1(\hat{\mathbf{x}}) \operatorname{sign}( \mathbf{e}_1 )\}_{\text{eq}} equivalent control will be equal to h_2(\mathbf{x}), and so v_2(t) - h_2(\hat{\mathbf{x}}) = h_2(\mathbf{x}) - h_2(\hat{\mathbf{x}}) = \mathbf{e}_2. Hence, so long as m_2(\mathbf{\hat{x}}) \geq |h_3(\mathbf{x}(t))|, the second row of the error dynamics, \dot{\mathbf{e}}_2 = h_3(\hat{\mathbf{x}}) - m_2(\hat{\mathbf{x}}) \operatorname{sign}( \mathbf{e}_2 ), will enter the e2 = 0 sliding mode in finite time.
  3. Along the ei = 0 surface, the corresponding v_{i+1}(t) = \{\ldots\}_{\text{eq}} equivalent control will be equal to h_{i+1}(\mathbf{x}). Hence, so long as m_{i+1}(\mathbf{\hat{x}}) \geq |h_{i+2}(\mathbf{x}(t))|, the (i + 1)th row of the error dynamics, \dot{\mathbf{e}}_{i+1} = h_{i+2}(\hat{\mathbf{x}}) - m_{i+1}(\hat{\mathbf{x}}) \operatorname{sign}( \mathbf{e}_{i+1} ), will enter the ei + 1 = 0 sliding mode in finite time.
So, for sufficiently large mi gains, all observer estimated states reach the actual states in finite time. In fact, increasing mi allows for convergence in any desired finite time so long as each |h_i(\mathbf{x}(0))| function can be bounded with certainty. Hence, the requirement that the map H:\mathbb{R}^n \rightarrow \mathbb{R}^n is a diffeomorphism (i.e., that its Jacobian linearization is invertible) asserts that convergence of the estimated output implies convergence of the estimated state. That is, the requirement is an observability condition.
In the case of the sliding mode observer for the system with the input, additional conditions are needed for the observation error to be independent of the input. For example, that
 \frac{\partial H(\mathbf{x})}{\partial \mathbf{x}} B(\mathbf{x})
does not depend on time. The observer is then
\dot{\mathbf{\hat{x}}} = \left[ \frac{\partial H(\mathbf{\hat{x}})}{\partial \mathbf{x}}
\right]^{-1} M(\mathbf{\hat{x}}) \operatorname{sign}(V(t) - H(\mathbf{\hat{x}}))+B(\mathbf{\hat{x}})u.

No comments:

Post a Comment