Abstract
Based on singularly perturbed bilinear quadratic problems, this paper proposes to decompose the full-order system into two subsystems of a slow-time and fast-time scale. Utilizing the fixed point iterative algorithm to solve cross-coupled algebraic Riccati equations, equilibrium strategies of the two subsystems can be obtained, and further the composite strategy of the original full-order system. It was proved that such a composite strategy formed an o(ε) (near) Stackelberg equilibrium, and a numerical result of the algorithm was presented in the end.
1 Introduction
Dynamic game theory has been studied widely over the past decades, and the non-cooperative game theory of linear quadratic systems has been studied intensively in many papers. For example, Cruz. Jr et al. obtained the open-loop Stackelberg strategy in non-zero sum games[1]; in [2], Basar summarized the non-cooperative game theory in linear quadratic systems; in [3], Medanic developed necessary conditions for closed-loop Stackelberg strategies in linear quadratic problems and presented an algorithm for numerical solutions of two-level Stackelberg problems; Mizukami investigated the linear quadratic closed-loop Stackelberg game for the descriptor system and constructed the incentive strategies in [4]. For singularly perturbed systems, in [5], Khalil and Kokotovic discussed the well-posedness of singularly perturbed Nash games and illustrated the impact of the feedback information available to players on the well-posedness of the game; Xu and Mizukami presented a unified approach to achieve the composite approximation of the full-order linear feedback saddle-point solution[6]; Mukaidani proposed a new algorithm for solving cross-coupled algebraic Riccati equations of singularly perturbed Nash games in [7], further applied the algorithm in obtaining the linear quadratic infinite horizon Nash game for general multiparameter singularly pertubed systems[8], studied the computation of the linear closed-loop Stackelberg strategies with small singular perturbation parameter in [9], and investigated the linear closed-loop Stackelberg strategy of the singularly perturbed stochastic systems with state dependent noise[10].
However, game theories of singularly perturbed bilinear systems are seldom discussed, while singularly perturbed bilinear systems are a quite proper and essential description tool in describing many practical systems such as neutron level control problem in a fission reactor, DC-motor, induction motor drives[11], and in financial engineering problems, Black-Scholes Option Pricing Model, Aoki’s two sector macroeconomic growth model, Chander and Tokao’s non-linear input-output model can all be extended to singularly perturbed bilinear models in [12–15].
The structure of this paper is organized as follows. In Section 2, the problem of the differential Stackelberg equilibrium strategy for a singularly perturbed bilinear time-invariant system is presented. Sections 3 and 4 are concerned with the decomposition of the full-order system into two subsystems, and the composition strategy of the original full-order system. A simple numerical example is solved in Section 5. Section 6 contains the conclusion.
2 Problem Statement
Consider a time-invariant singularly perturbed bilinear system:
with initial condition
where x1(t)∈ Rn1, x2(t) ∈ Rn2 are respectively slow and fast state variable, x(t) = [x1(t), x2(t)]T ∈ Rn are state vector with n1 + n2 = n, u(t) ∈ Rm and v(t) ∈ Rl are respectively the control inputs of Player 1 and Player 2, the small singular perturbation parameter ε > 0 represents small time constants, inertias, masses, etc., and A11, A12, A21, A22, B11, B12, B21, B22, Ms, Mf, Ns, Nf are constant matrices of appropriate dimensions, with
The cost function for each player is defined by
where
It is assumed that the decision-maker denoted by Player 1 is the leader, and Player 2 is the follower. Under the assumption that both players employ strategies u := u(x, t), v := v(x, t), a strategy set (u*, v*) is called a Stackelberg strategy if for any admissible strategy set (u, v), the following conditions hold[10].
where
and
3 Decomposition of Slow and Fast Systems
Let
then (1) can be written as:
Neglecting the fast modes is equivalent to assuming that they are infinitely fast, that is letting ε = 0. Without the fast modes the system (5) reduces to
Assuming that A22 is nonsingular, we have
where 
Then we can obtain the quadratic cost function for the slow subsystem
where 
Theorem 1
Suppose that the following cross-coupled algebraic Riccati equations has solutions p1sand p2s
Then, the Stackelberg equilibrium solution
Proof
The Hamiltonian His corresponding to the system (7) and performance index (8) is
where λi ∈ Rn1 × 1 is the Langrangian multiplier. □
Given arbitrary us, the corresponding vs is obtained by minimizing J2s with respect to vs. Then, the optimal control is given by
Then the cost J1s can be obtained, and we can further obtain
then
where
For 
In [8], Mukaidani proposed a fixed-point iterative algorithm for solving cross-coupled algebraic Riccati equations (9).
Assumption 1
The triplet 
Under Assumption 1, the positive semidefinite solutions of cross-coupled algebraic Riccati equations (9) exist. It is obtained by performing the fixed-point algorithm:
where 
The proof can be seen in [8].
In the fast subsystem, we assume that the slow variables are constant in the boundary layer. Redefining the fast variables x2f = x2 − x2s, and the fast controls uf = u − us, vf = v − vs, the fast subsystem is formulated as:
Then we can obtain the quadratic cost function for the fast subsystem
Assumption 2
The triplet 
Theorem 2
Under Assumption 2, suppose that the following cross-coupled algebraic Riccati equations has solutions p1fand p2f
Then, the Stackelberg equilibrium solution
Proof
we can get the Stackelberg equilibrium solution 
then
where λif ∈ Rn2 × 1 is the Langrangian multiplier. Then
where p1f, p2f satisfy the cross-coupled algebraic Riccati equations (17). □
Similarly, under Assumption 2, the positive semidefinite solutions of cross-coupled algebraic Riccati equations (17) exist, and can be obtained by performing the fixed-point algorithm:
where 
4 Composite Strategy
The composite Stackelberg strategy pair of the full-order singularly perturbed system (1) is constructed as follows[16]:
With x1 replacing x1s, x2 replacing x2s + x2f, for 
where
Theorem 3
The composite strategy pair constitutes an o(ε) (near) Stackelberg equilibrium of the full-order game, that is,
Proof
The feedback system (5) can be written as
Introducing the Chang transformation and its inverse
while the transformation equations are given by
we get
where S is the system matrix of (24),
If 
where x1s(0), x2f(0) are given by (7a), (15). If in addition (A11 + 
5 A Numerical Example
In order to demonstrate the efficiency of the proposed decomposition method, we have run a simple numerical example. All matrices are chosen randomly, which are given by
and a quadratic cost function
where
The simulation result is presented in Figure 1.

Simulation curves of the composite Stackelberg strategy (uc, vc)
6 Conclusions
Many real systems possess the structure of the singularly perturbed bilinear control systems such as motor drives, robust control, multi-sector input-output analysis and option pricing. In this paper, we have studied the Stackelberg games for singularly perturbed bilinear systems. And we propose to decompose the full-order system into two subsystems of a slow-time and fast-time scale. Utilizing the fixed point iterative algorithm to solve cross-coupled algebraic Riccati equations, equilibrium strategies of the two subsystems can be obtained, and further the composite strategy of the original full-order system. It has been proved that such a composite strategy formed an o(ε) (near) Stackelberg equilibrium, and a numerical example in the end has demonstrated the efficiency of the algorithm. The conclusion obtained in this paper could be applied to deal with many practical industry engineering and financial engineering problems.
References
[1] Simaan M, Cruz Jr J B. On the Stackelberg strategy in non-zero sum games. Journal of Optimization Theory and Applications, 1973, 11(5): 533–555.10.1007/BF00935665Suche in Google Scholar
[2] Basar T, Olsder G T. Dynamic non-cooperative game theory. Academic Press, New York, 1991.Suche in Google Scholar
[3] Medanic J. Closed-loop Stackelberg strategies in linear-quadratic problems. IEEE Transactions on Automatic Control, 1978, 23(4): 632–637.10.1109/TAC.1978.1101788Suche in Google Scholar
[4] Mizukami K, Xu H. Closed-loop stackelberg strategies for linear-quadratic descriptor systems. Journal of Optimization Theory and Applications, 1992, 74: 151–170.10.1007/BF00939897Suche in Google Scholar
[5] Khalil H K, Kokotovic P V. Feedback and well-posedness of singularly perturbed Nash games. IEEE Transactions on Automatic Control, 1979, 24(5): 699–708.10.1109/CDC.1978.268107Suche in Google Scholar
[6] Xu H, Mizukami K. Infinite-horizon differential games of singularly perturbed systems: A unified approach. Automatica, 1997, 33(2): 273–276.10.1016/S0005-1098(96)00173-2Suche in Google Scholar
[7] Mukaidani H, Xu H, Mizukami K. A new algorithm for solving cross-coupled algebraic Riccati equations of singularly perturbed Nash games. Proceedings of the 39th IEEE Conference on Decision and Control, Sydney, Australia, 2000: 3648–3653.Suche in Google Scholar
[8] Mukaidani H. A computational efficient numerical algorithm for solving cross-coupled algebraic Riccati equation and its application to multimodeling systems. Proceedings of the 2006 American Control Conference, Minneapolis, Minnesota, USA, 2006: 725–730.10.1109/ACC.2006.1655442Suche in Google Scholar
[9] Mukaidani H. Efficient numerical procedures for solving closed-loop Stackelberg strategies with small singular perturbation parameter. Applied Mathematics and Computation, 2007, 188: 1173–1183.10.1016/j.amc.2006.10.068Suche in Google Scholar
[10] Mukaidani H, Unno M, Yamamoto T. Stackelberg strategies for singularly perturbed stochastic systems. Proceedings of the 2013 European Control Conference, Zurich, Switzerland, 2013: 730–735.10.23919/ECC.2013.6669247Suche in Google Scholar
[11] Aganovic Z, Gajic Z. Linear optimal control of bilinear systems with applications to singularly perturbations and weak coupling. Springer-Verlag, London, 1995.10.1007/3-540-19976-4Suche in Google Scholar
[12] Mceneaney W. A robust control framework for option pricing. Mathematics of Operations Research, 1997, 22(1): 203–221.10.1287/moor.22.1.202Suche in Google Scholar
[13] Aoki M. Some examples of dynamic bilinear models in economics. Lecture Notes in Economics and Mathematical Systems, Springer-Verlag, Berlin Heidelberg, 1975: 163–169.10.1007/978-3-642-47457-6_9Suche in Google Scholar
[14] Chander P. The nonlinear input output model. Journal of Economic Theory, 1983, 30: 219–229.10.1016/0022-0531(83)90105-9Suche in Google Scholar
[15] Tokao F. Nonlinear Leontief model in abstract spaces. Journal of Mathematical Economics, 1986, 15: 151–156.10.1016/0304-4068(86)90006-6Suche in Google Scholar
[16] Kim B S, Lim M T. Composite control for singularly perturbed bilinear systems via successive Galerkin approximation. IEEE Proceedings of Control Theory, 2003, 150(5): 483–488.10.1049/ip-cta:20030814Suche in Google Scholar
© 2015 Walter de Gruyter GmbH, Berlin/Boston
Artikel in diesem Heft
- A Dynamic Formation Procedure of Information Flow Networks
 - A Study on the Optimal Portfolio Strategies Under Inflation
 - A Dynamic Model of Procurement Risk Element Transmission in Construction Projects
 - Can Scientific and Technological Talent Aggregation Accelerate Economic Growth? An Empirical Study
 - Composite Stackelberg Strategy for Singularly Perturbed Bilinear Quadratic Systems
 - Bearing Fault Diagnosis Using Orthogonal Matching Pursuit with Pulse Atoms Based on Vibration Model
 - Dynamics of a Nonlinear Business Cycle Model Under Poisson White Noise Excitation
 - The GM(1,1) Model Optimized by Using Translation Transformation Method and Its Application of Rural Residents’ Consumption in China
 
Artikel in diesem Heft
- A Dynamic Formation Procedure of Information Flow Networks
 - A Study on the Optimal Portfolio Strategies Under Inflation
 - A Dynamic Model of Procurement Risk Element Transmission in Construction Projects
 - Can Scientific and Technological Talent Aggregation Accelerate Economic Growth? An Empirical Study
 - Composite Stackelberg Strategy for Singularly Perturbed Bilinear Quadratic Systems
 - Bearing Fault Diagnosis Using Orthogonal Matching Pursuit with Pulse Atoms Based on Vibration Model
 - Dynamics of a Nonlinear Business Cycle Model Under Poisson White Noise Excitation
 - The GM(1,1) Model Optimized by Using Translation Transformation Method and Its Application of Rural Residents’ Consumption in China