Uncertain AoI in stochastic optimal control of constrained LTI systems

Jannik Hahn; Olaf Stursberg

doi:10.1515/auto-2021-0125

Article

Uncertain AoI in stochastic optimal control of constrained LTI systems

Jannik Hahn

M. Sc. Jannik Hahn is research assistant with the Control and System Theory Group, Department of Electrical Engineering and Computer Science, Universität Kassel. His research activities are focussed on robust and stochastic distributed model predictive control.
and Olaf Stursberg

Prof. Dr.-Ing. Olaf Stursberg is Full Professor and Head of the Control and System Theory Group in the Department of Electrical Engineering and Computer Science at University of Kassel. His main research areas include methods for optimal and predictive control of networked and hierarchical systems, techniques for analysis and design of hybrid dynamic systems, and the control of stochastic, uncertain and learning systems in different domains of application.

Published/Copyright: March 25, 2022

Published by

Become an author with De Gruyter Brill

Submit Manuscript Author Information Explore this Subject

From the journal at - Automatisierungstechnik Volume 70 Issue 4

Abstract

This paper addresses finite-time horizon optimal control of control structures with shared communication network. To cope with the uncertainties, induced by network imperfections and exogenous disturbances at the same time, an optimization-based control scheme is proposed. It uses a disturbance feedback and the Age of Information (AoI), a receiver-based measure of communication delays, as central aspects. The disturbance feedback is an extension of the control law used for balanced stochastic optimal control. Balanced optimality is understood as a compromise between minimizing expected deviations from the reference and the minimization of the uncertainty of future states. Time-varying state constraints as well as time-invariant input constraints are considered, and the controllers are synthesized by semi-definite programming.

Zusammenfassung

Dieser Beitrag befasst sich mit der optimalen Regelung von Regelkreisen mit gemeinsam genutztem Kommunikationspfad. Um Unsicherheiten zu minimieren, die durch eine nicht ideale Kommunikation und gleichzeitig auf das System wirkenden Störgrößen hervorgerufen werden, wird eine optimierungsbasiertes Regelung vorgestellt. Die Regelung basiert auf einer Störungsrückführung sowie dem Alter von Informationen, einer Empfänger-basierten Messgröße für Kommunikationsverzögerungen. Die vorgestellte Störungsrückführung ist eine Erweiterung des Regelgesetzes, das zur ausgewogenen stochastischen optimalen Regelung genutzt wird. Ausgewogen bezeichnet hierbei den Kompromiss zwischen erwarteter Abweichung zu einem Referenzsignal und der Unsicherheit über zukünftiger Zustände. Das vorgestellte Konzept berücksichtigt zeitvariable Zustands- sowie zeitinvariante Eingangsbeschränkungen und die Reglersynthese basiert auf der Lösung eines semidefiniten Programms.

Keywords: age of information; communication delay; networked control systems; optimal control; stochastic control

Schlagwörter: Alter von Informationen; Kommunikationsverzögerung; vernetzte Regelungssysteme; stochastische Regelung; optimale Regelung

Funding source: Deutsche Forschungsgemeinschaft

Award Identifier / Grant number: SPP 1914

Funding statement: Partial financial support by the German Research Foundation (DFG) within the research priority program SPP 1914: Cyberphysical Networking is gratefully acknowledged.

About the authors

Jannik Hahn

M. Sc. Jannik Hahn is research assistant with the Control and System Theory Group, Department of Electrical Engineering and Computer Science, Universität Kassel. His research activities are focussed on robust and stochastic distributed model predictive control.

Olaf Stursberg

Prof. Dr.-Ing. Olaf Stursberg is Full Professor and Head of the Control and System Theory Group in the Department of Electrical Engineering and Computer Science at University of Kassel. His main research areas include methods for optimal and predictive control of networked and hierarchical systems, techniques for analysis and design of hybrid dynamic systems, and the control of stochastic, uncertain and learning systems in different domains of application.

Appendix A

Proof of Lemma 1.

First, consider the system dynamics (3) and the control law (7), for the case of available state information:

x k + 1 = f ( x k , u k , w k ) , u k = κ k ( x 0 , … , x k )

with general functions:

(17) f : R n x × R n u × R n w → R n x , κ k : R n x × … × R n x ︸ k + 1 → R n u .

The recursive state equation can be reformulated to a series of functions f k and κ k , where control laws are rewritten to κ ˜ k , e. g., for k ∈ { 0 , 1 }:

u 0 = κ 0 ( x 0 ) , x 1 = f ( x 0 , u 0 , w 0 ) = f ( x 0 , κ 0 ( x 0 ) , w 0 ) = : f 1 ( x 0 , w 0 ) , u 1 = κ 1 ( x 0 , x 1 ) = κ 1 ( x 0 , f 1 ( x 0 , w 0 ) ) = : κ ˜ 1 ( x 0 , w 0 ) ,

or for arbitrary k ∈ N ≥ 0:

(18) u k = κ ˜ k ( x 0 , w 0 , … , w k − 1 ) , x k = f k ( x 0 , w 0 , … , w k − 1 ) ,

or for linear functions:

κ ˜ k = a 0 ( x ) · x 0 + a 0 ( w ) · w 0 + … + a k − 1 ( w ) · w k − 1 , f k = b 0 ( x ) · x 0 + b 0 ( w ) · w 0 + … + b k − 1 ( w ) · w k − 1

with a i and b i [8].

Secondly, if a k ≥ 0, (7) is used in combination with (8). Thus, with the last certain information x l → l : = k − a k , it holds that all states x l + 1 up to x k are not available to the controller:

x 0 , … , x l , ︸ available x l + 1 , … , x k ︸ not available .

With (7) and (8), it holds that:

(19) u k = κ k ( x 0 , … , x l , x l + 1 | l , … , x k | l ) , x k | l = f k , l ( x l , u l , … , u k − 1 )

with functions κ k defined in (17) and:

f k , l : R n x × R n u × … × R n u ︸ a k → R n x .

In (19), the states with index up to l can be expressed by the functions given in (18). For the first of the remaining states/inputs in the function arguments, it follows with (18) that:

x l + 1 | l = f l + 1 , l ( x l , u l ) = : f ˜ l + 1 , l ( x 0 , w 0 , … , w l − 1 ) , u l + 1 = κ l + 1 ( x 0 , x l , x l + 1 | l ) = : κ ˜ l + 1 ( x 0 , w 0 , … , w l − 1 ) .

Recursively for k > l, the states:

x k | l = f ˜ k , l ( x 0 , w 0 , … , w l − 1 ) ,

and the inputs:

(20) u k = κ ˜ k ( x 0 , w 0 , … , w l − 1 )

are obtained. Again, with linear functions f k , l and κ k , the control law κ ˜ k in (20) is a linear function of its arguments, thus (again with a set of parameters a i ) one can write:

(21) u k = a 0 ( x ) · x 0 + a 0 ( w ) · w 0 + … + a l − 1 ( w ) · w l − 1 .

Eventually, (21) feeds back all disturbances w r with r ≤ l − 1 = k − a k − 1 ⇔ r < k − a k . With V k : = a 0 ( x ) and M k , r : = a r ( w ) , (21) equals the disturbance feedback in Lemma 1 for time k. □

Derivation of Equation (12). Each probability p k , r is given according to (10) with respect to the probability for the AoI (6):

p k , r = E 𝟙 k , r = P ( r < k − a ( k ) ) = P ( a ( k ) ≤ k − r − 1 ) = P ( a ( k ) = 0 ) + … + P ( a ( k ) = k − r − 1 ) = ∑ l = 0 k − r − 1 μ k [ l ] .

□

Proof of Proposition 1.

According to Proposition 1 the co-domain of λ k is given by:

λ k ∈ max ( δ u , δ x ) , 1 ,

such that

γ k ( u ) = δ u λ k ∈ δ u , 1 , γ k ( x ) ≥ δ x λ k ∈ δ x , 1

hold. Now recall (14), and let P ( > β ) ⊆ P ( Ω ) denotes a subset of indicator matrices P ( i ) , for which all entries satisfy the inequality 𝟙 k , r ( i ) ≥ 𝟙 k , r ( β ) ∀ k , r ∈ N H . Then the following holds true with θ denoting the behavior of the communication network according to P ( θ ) ∈ P ( Ω ) :

P ( x k ∈ X k ) = ∑ i = 1 o P ( x k ( i ) ∈ X k | θ = i ) = ∑ i = 1 o P θ = i · P ( x k ( i ) ∈ X k ) ≤ ∏ t = 1 k − 1 λ t · P ( x k ( β ) ∈ X k ) ≤ δ x ,

if all control laws u k ( i ) , for which P ( i ) ∈ P ( > β ) holds, are truncated to u k ( β ) . In here, i denotes one run of the Markov process over the whole trajectory (though i is independent of k), which justifies the independence of probabilities. Therefore, the necessary condition results with (15) to:

P ( x k ( β ) ∈ X k ) ≤ δ x ∏ t = 1 k − 1 λ t = γ k ( x ) .

This is at least satisfied, if the γ k ( x ) -confidence ellipsoid of the state x k ( β ) lies within the admissible state set, i. e.:

(22) X k ( γ , β ) ⊆ X k .

Following the same steps of the proof to [8, Proposition 5], condition (22) is satisfied if the LMI L x k + 1 [ j ] ≽ 0 given in (16) is satisfied with the tailored likelihood γ k ( x ) for each half-space j ∈ { 1 , … , n X k + 1 } of X k + 1 , where n X k + 1 denotes the number of half-spaces.

The reasoning for the input follows analogously, but with λ k instead of its product. □

References

1. Hespanha, J. P., P. Naghshtabrizi and Y. Xu. 2007. A survey of recent results in networked control systems. Proceedings of the IEEE 95(1): 138–162.10.1109/JPROC.2006.887288Search in Google Scholar

2. Antonelli, G. 2013. Interconnected dynamic systems: An overview on distributed control. IEEE Control Systems Magazine 33(1): 76–88.10.1109/MCS.2012.2225929Search in Google Scholar

3. Klugel, M., M. Mamduhi, O. Ayan, et al. 2020. Joint cross-layer optimization in real-time networked control systems. IEEE TCNS 7(4): 1903–1915.10.1109/TCNS.2020.3011847Search in Google Scholar

4. Hahn, J., R. Schoeffauer, G. Wunder, et al. 2018. Distributed MPC with prediction of time-varying communication delay. IFAC-PapersOnLine 51(23): 224–229.10.1016/j.ifacol.2018.12.039Search in Google Scholar

5. van Hessem, D. H. and O. H. Bosgra. 2002. A conic reformulation of model predictive control including bounded and stochastic disturbances under state and input constraints. In: 41st CDC, vol. 4. IEEE, pp. 4643–4648.Search in Google Scholar

6. Oldewurtel, F., C. N. Jones and M. Morari. 2008. A tractable approximation of chance constrained stochastic MPC based on affine disturbance feedback. In: 47th CDC. IEEE, pp. 4731–4736.10.1109/CDC.2008.4738806Search in Google Scholar

7. Asselborn, L. and O. Stursberg. 2015. Probabilistic control of uncertain linear systems using stochastic reachability. IFAC-PapersOnLine 48(14): 167–173.10.1016/j.ifacol.2015.09.452Search in Google Scholar

8. Hahn, J. and O. Stursberg, 2020. Balanced stochastic optimal control of uncertain linear systems with constraints. IFAC-PapersOnLine 53(2): 7172–7178. 21th WC.10.1016/j.ifacol.2020.12.535Search in Google Scholar

9. Wu, D., J. Wu, S. Chen, et al. 2010. Stability of networked control systems with polytopic uncertainty and buffer constraint. TAC 55(5): 1202–1208.10.1109/TAC.2010.2042232Search in Google Scholar

10. Zhivoglyadov, P. V. and R. H. Middleton. 2003. Networked control design for linear systems. Automatica 39(4): 743–750.10.1016/S0005-1098(02)00306-0Search in Google Scholar

11. Hahn, J. and O. Stursberg. 2021. Constrained stochastic predictive control of linear systems with uncertain communication. at-Automatisierungstechnik 69(9): 771–781.10.1515/auto-2021-0033Search in Google Scholar

12. Schoeffauer, R. and G. Wunder. 2018. Predictive network control and throughput sub-optimality of max weight. In: European Conf. on Networks and Communications. IEEE, pp. 1–6.10.1109/EuCNC.2018.8442615Search in Google Scholar

13. Hahn, J. and O. Stursberg. 2019. Robust distributed MPC for disturbed affine systems using predictions of time-varying communication. In: 18th ECC. IEEE, pp. 56–62.10.23919/ECC.2019.8795773Search in Google Scholar

14. Borrelli, F. 2003. Constrained optimal control of linear and hybrid systems, vol. 290. Springer.Search in Google Scholar

15. Goulart, P. J., E. C. Kerrigan and J. M. Maciejowski. 2006. Optimization over state feedback policies for robust control with constraints. Automatica 42(4): 523–533.10.1016/j.automatica.2005.08.023Search in Google Scholar

16. Groß, D. and O. Stursberg. 2014. Distributed predictive control of communicating and constrained systems. ZAMM – Journal of Applied Mathematics and Mechanics 94(4): 303–316.10.1002/zamm.201100166Search in Google Scholar

Received: 2021-09-01

Accepted: 2022-01-04

Published Online: 2022-03-25

Published in Print: 2022-04-26

You are currently not able to access this content.

Articles in the same Issue

https://doi.org/10.1515/auto-2021-0125

Keywords for this article

age of information; communication delay; networked control systems; optimal control; stochastic control