The Stochastic Cash Balance Problem with Fixed Costs: The Risk-averse Case

Shuren Liu; Pei Tang

doi:10.1515/JSSI-2014-0520

40% Rabatt

auf Fachbücher bei De Gruyter Brill *

Artikel Öffentlich zugänglich

The Stochastic Cash Balance Problem with Fixed Costs: The Risk-averse Case

Shuren Liu und Pei Tang

Veröffentlicht/Copyright: 25. Dezember 2014

Veröffentlicht von

Veröffentlichen auch Sie bei De Gruyter Brill

Informationen für Autor*innen

Aus der Zeitschrift Journal of Systems Science and Information Band 2 Heft 6

Abstract

This paper discusses multi-period stochastic cash balance problem with fixed costs when the decision maker is risk averse. By using the consumption model introduced by Chen et al, we characterize the structure of the optimal policy for the stochastic cash balance problem under the general increasing concave utility function and exponential utility function, respectively. We show that the structure of the optimal policy for a decision maker with exponential utility function is almost identical to the structure of the optimal risk-neutral operations policy. Furthermore, we extend the results for the exponential utility function to the ambiguity aversion case.

Keywords: stochastic cash balance problem; risk aversion; ambiguity aversion; stochastic dynamic programming

1 Introduction

The stochastic cash balance problem is an optimization problem faced by a firm, which has to decide how much cash to hold in order to meet its transaction requirement for a given planing horizon with multiple periods. Arrow et al[1] point out that the similarity between the motives of inventories of goods and those for keeping cash balances. In contrast to the usual inventory problem, the stochastic cash balance problem with a case where the cash level (i.e., the checking account level) during the period can either increase or decrease, depending on whether the income is larger or smaller than the expenses during that period. It also allows the decision maker to change the cash level in any direction at the beginning of each period. He can increase the checking account level by withdrawing money from his savings, or decrease it by transferring money to his savings. Therefore, the stochastic cash balance problem can be regarded as a special type of inventory control problems, where the customer demands may be positive or negative and the decision maker can increase or decrease it. Hence, we will use the term inventory level instead of cash level, and also use the terms “order” or “return” to indicate the increase or decrease of the cash levels. At the beginning of each period, the firm may decide to replenish the inventory or return excess stock. Both the ordering cost and the return cost may include a fixed component and a variable component which is proportional to the transaction amount. A holding or penalty cost is charged depending on whether the inventory level is positive or negative. The objective of the firm is to find an ordering or return policy so as to minimize the total expected cost, or equivalently, maximize the total expected profit over the entire planning horizon. Of course, this focus on optimizing expected profit or cost is appropriate for a risk-neutral decision maker, i.e., a firm that is insensitive to profit variations.

The stochastic cash balance problem received considerable amount of attention in the 1960s. Eppen and Fama[2], Whisler[3] consider a cash balance model with independent and identical distribution discrete demands with finite support and without fixed costs. They show the existence of order-up-to and return-down-to levels in the finite and infinite horizon models with discounted cost criterion. Feinberg and Lewis[4] justify the average cost case with the general demand distribution and study the problems with borrowing and lending options and no fixed costs, for which they establish the optimality of simple four-threshold policies. Girgis[5] investigates finite and infinite horizon discounted cost problems with continuous demand when there are fixed costs for increasing or decreasing demand (but not both). Neave[6] studies finite horizon problems with continuous demand when both transactions have fixed costs. However, Chen and Simchi-Levi[7] and Ye and Duenyas[8] notice that some of the claims in [6] are not proved. By using the notion of a (K, Q)-convex function introduced by [8], Chen and Simchi-Levi[7] describe the structural properties of optimal solutions of finite horizon cash balance problems when both transactions have fixed costs. Feinberg and Lewis[9] show that structural results stated by [7] indeed hold for finite horizon cash balance problems with discounted criteria and extend the results to the average cost per unit time criteria.

All the papers referenced above assume that the decision makers are risk-neutral. However, many are willing to tradeoff lower expected profit for downside protection against possible losses. Note that traditional stochastic cash balance models fall short of meeting the needs of risk-averse planners. For instance, traditional stochastic cash balance models do not suggest mechanism to reduce the chance of unfavorable profit levels. Thus, it is important to incorporate the notions of risk aversion in the stochastic cash balance problem.

A parallel stream of research studies risk-averse inventory models. Many of the risk-averse inventory models consider single period newsvendor type of models (see, for example, Chen, Xu and Zhang[10], Eeckhoudt, Gollier and Schlesinger[11], Lau[12], Wu, Zhu and Teunter[13]). Bouakiz and Sobel[14] characterize the inventory control strategy so as to minimize the expected utility of the net present value of costs over a finite planing or an infinite horizon. Assuming linear ordering cost, they prove that a base stock policy is optimal. Chen et al[15] propose a general framework to incorporate risk aversion into multi-period inventory models as well as multi-period models that coordinate inventory and pricing strategies. In both cases, they distinguish between models with fixed ordering costs and models with no fixed ordering costs. They show that the structure of the optimal policy for a decision maker with exponential utility function is almost identical to the structure of the optimal risk-neutral inventory (and pricing) policies. These structural results are extended to models in which the decision maker has access to a (partially) complete financial market and can hedge his operational risk through trading financial securities.

On the other hand, a decision maker may not know the exact demand distributions and have to estimate them from limited historical data. In this case, the decision maker is ambiguous about the probability distribution. Recently, Nilim and EI Ghaoui[16] study robust solutions to Markov decision problems with uncertain transition matrices. They propose the general idea on the ambiguity averse models, that is, the decision maker choose his policies assuming that nature is adversarial, choosing probability distributions from an ambiguity set to minimize the decision maker’s expected utility. Chen and Sun[17] adopt the robust dynamic programming modelling framework introduced by [16] to ambiguity and risk averse inventory and pricing models. They show that the optimal control policies share similar structure properties as Chen et al[15] for the finite horizon case and extend Chen et al[15] to including ambiguity aversion and considering infinite horizon models.

In this paper, we propose a framework for incorporating risk aversion in stochastic cash balance problem. We characterize the structure of the optimal policy on the risk-averse stochastic cash balance problem by using the consumption model introduced by Chen et al[15]. We show that the structure of the optimal policy for a decision maker with exponential utility function is almost identical to the structure of the optimal risk-neutral operations policy. Furthermore, we extend the results for the exponential utility function to the ambiguity aversion case.

The paper is organized as follows. In Section 2, we propose a model to incorporate risk aversion in the stochastic cash balance problem. In Sections 3 and 4, we focus on characterizing the structure of the optimal policies under the general increasing concave utility function and exponential utility function, respectively. In Section 5, we extend the results for the exponential utility function to the ambiguity aversion case. Finally, Section 6 is concluding section.

2 The basic model

Consider a risk-averse firm facing stochastic demand that has to make ordering or return decisions over a finite planning horizon with a total of T periods.

At the beginning of each period, an ordering or return decision is made. Let x_t be the inventory level at the beginning of period t before a decision is made and y_t be the inventory level at the beginning of period t after an ordering or return decision is made. Lead time for the ordering or return transaction is assumed to be zero. The transaction cost is denoted by c(x_t, y_t), which is calculated as follows:

c(xt,yt)=K+k(yt−xt),ifyt>xt,0,ifyt=xt,Q+q(xt−yt),ifyt<xt,

where K ≥ 0, Q ≥ 0, k + q ≥ 0. Note that the assumption that k + q ≥ 0 implies that the unit refund is no more than the unit ordering cost.

For t = 1, 2, ⋯, T, let p_t be per unit “sale price” of product in period t and D_t(ϵ_t) (here ϵ_t is a random variable) be “stochastic demand” in period t, which consists of obligations paid less funds received (note that the demand in a period can be negative, which corresponds to receiving more funds than were paid out that period). Furthermore, demands in different periods are independent of each other. Unsatisfied demand is backlogged. Therefore, the inventory level carried over from period t to the next period, x_t+1, may be positive or negative. A cost h_t(x_t+1) is incurred at the end of period t which represents holding cost when x_t+1 > 0 and shortage cost if x_t+1 < 0. For technical reasons, we assume that function h_t(x) is convex and lim|x|→∞ht(x)=∞. Further, similar to Assumption 1 in Chen and Simchi-Levi[7], it is assumed that there are finite numbers x_t ≤ y_t ≤ v_t ≤ z_t such that (h_t(y_t) − h_t(x_t))/(y_t − x_t) < −k and (h_t(z_t) − h_t(v_t))/(z_t − v_t) > q.

To study the stochastic cash balance problem with fixed costs under risk aversion, we adopt the consumption model under uncertainty introduced by Chen et al[15]. The general idea is to directly model consumption, saving and borrowing decisions as well as inventory decisions for the stochastic cash balance problem. Specifically, assume that the decision maker has access to a financial market for borrowing and lending with a risk-free saving and borrowing interest rate r_f, or equivalently, the discount factor is γ=11+rf. At the beginning of period t, assume that the decision maker has initial wealth w_t and chooses an operations policy (order or return) that affects his income cash flow. At the end of period t, that is, after the uncertainty of this period has been resolved, the decision maker observes his current wealth level w_t + P_t and decides his consumption level f_t for the period, where P_t is the income generated at period t. Note that the income at period t is

Pt¯(xt,yt;ϵt)=−Kδ(yt−xt)−Qδ(xt−yt)−k(yt−xt)+−q(yt−xt)−+ptDt(ϵt)−ht(yt−Dt(ϵt)),

where x⁺ = max{x, 0}, x⁻ = min{x, 0},

δ(x)=1,ifx>0,0,otherwise.

The remaining wealth, w_t + P_t − f_t, is then saved (or borrowed, if negative) for the next period, i.e., w_t+1 = (1 + r_f)(w_t + P_t − f_t), or, equivalently, f_t = w_t − γw_t+1 + P_t. The decision maker’s objective is to maximize his expected utility of the consumption flow E[Π(f₁, ⋯, f_T)] over the planing horizon 1,⋯, T. Moreover, at the last period T, we assume the decision maker consumes everything, which corresponds to w_T+1 = 0.

According to the consumption model, the decision maker’s problem is to find the inventory level y_t and decide the initial wealth level w_t (or equivalently, the consumption level f_t) for the following optimization problem.

maxE[Π(f1,f2,⋯,fT)]s.t.xt+1=yt−Dt(ϵt),ft=wt−γwt+1+Pt¯(xt,yt;ϵt),wT+1=0.(1)

When the utility function Π(f₁, f₂, ⋯, f_T) takes the linear form Π(f₁, f₂, ⋯, f_T) = ∑t=1Tγt−1ft, the consumption model reduces to the traditional risk-neutral stochastic cash balance problem. In this case, we denote V_t(x) to be the profit-to-go function at the beginning of period t with the initial inventory level x. A natural dynamic program for the risk-neutral stochastic cash balance problem is as follows:

Vt(x)=maxy{−Kδ(y−x)−Qδ(x−y)−k(y−x)+−q(y−x)−+pDt(ϵt)−ht(y−Dt(ϵt))+γVt+1(y−Dt(ϵt))}=max{Ht(x),maxy>xHt(y)−K−k(y−x),maxy<xHt(y)−Q−q(x−y)},

with boundary condition V_T+1(x) = 0, where H_t(x) = E{pD_t(ϵ_t) − h_t(x − D_t(ϵ_t)) + γV_t+1(x − D_t(ϵ_t))}. Without loss of generality, we assume that k ≥ Q. Define Lt∈arg⁡maxx{Ht(x)−kx},lt=min{x|Ht(x)−kx=Ht(Lt)−kLt−K},lt′=min{x|Ht(x)−kx=Ht(Lt)−kLt−(K−Q)},Ut∈arg⁡maxx{Ht(x)+qx},ut=max{x|Ht(x)+qx=Ht(Ut)+qUt−Q},ut′=min{x|Ht(x)+qx=Ht(Ut)+qUt−(K−Q)}.

From Lemma 3 in Chen and Simchi-Levi[7], we have Lt≤Ut,lt′≤ut′. Moreover, ut′≤Ut due to K ≥ Q ≥ 0. Therefore, the above parameters satisfy the following relationship: l_t ≤ lt′≤Lt≤Ut≤ut,lt′≤ut′≤Ut≤ut.

Notice that these critical points have explicit implications in the stochastic cash balance problem. By definition, l_t is the largest value below which one always orders; lt′ is the smallest value above which one never orders; u_t is the smallest value above which one always returns; ut′ is the largest value below which one never returns. In particular, we call {lt,lt′}and{ut,ut′} the pairs of order- and return-associated critical points, respectively.

To provide a characterization of the optimal policy, Chen and Simchi-Levi use the following concept of (K, Q)-convexity, which is introduced by Ye and Duenyas[8].

Definition 1

A real-valued function is called (K, Q)-convex for K, Q ≥ 0, if for anyx₀, x₁withx₀ ≤ x₁, and λ ∈ [0, 1],

f((1−λ)x0+λx1)≤(1−λ)f(x0)+λf(x1)+λK+(1−λ)Q−min{λ,1−λ}min{K,Q}.

A function f is called (K, Q)-concave if − f is (K, Q)-convex.

See Lemmas 1 and 2 in [7] for the properties of the (K, Q)-convex function.

Note that the (K, 0)-convexity is exactly the K-convexity introduced by Scarf[18] for the classical stochastic inventory control problem with fixed ordering costs. Moreover, the (K, K)-convexity is the symmetric K-convexity, a concept introduced and applied in Chen and Simchi-Levi[19] to analyze a joint inventory control and pricing problem with fixed ordering costs and a general demand distributions.

Similar to the proof of Theorems 3.1 and 3.2 in [7], we have the following main results for the traditional risk-neutral stochastic cash balance problem.

Theorem 1

Assume that K ≥ Q > 0. The profit-to-go functionsV_t(x) andH_t(x) are (K, Q)-concave and the optimal inventory levely_t(x) after a decision is made satisfies

yt(x)=Lt,ifx≤lt,∈{x,Lt},ifx∈(lt,lt′),x,ifx∈(lt′,ut′),∈[lt′,x],ifx∈(ut′,ut),Ut,ifx≥ut(2)

The results for the case Q ≥ K > 0 follow from a symmetric argument.

On a special case of the stochastic cash balance problem where K = Q > 0, we have

Theorem 2

Assume that K = Q. The profit-to-go functions V_t(x) and H_t(x) are symmetric K-concave and the optimal inventory level y_t(x) after a decision is made satisfies

yt(x)=Lt,ifx≤lt,∈{x,Lt},ifx∈(lt,lt+Lt2),x,ifx∈[lt+Lt2,ut+Ut2],∈{x,Ut},ifx∈(ut+Ut2,ut),Ut,ifx≥ut(3)

3 Additive increasing concave utility model

In this section, we focus on the additive general increasing concave utility function. In this case, the objective function of (1) becomes Π(f1,⋯,fT)=∑t=1Tπt(ft), where the function π_t(⋅) is increasing and concave. That is, the utility of the consumption flow is the summation of the utility from the consumption in each period. According to the sequence of events as described before, the optimization model (1) can be solved by the following dynamic programming recursion.

Vt(x,w)=maxyE[Wt¯(x,w,y;ϵt)](4)

where

Wt¯(x,w,y;ϵt)=maxw′{πt(w−γw′+Pt¯(x,y;ϵt))+Vt+1(y−Dt(ϵt),w′)}(5)

with boundary conditions V_T(x, w) = π_T(w + P_T(x, y;ϵ_T)), V_T+1(x, 0) = 0. In contrast to risk-neutral stochastic cash problem, here the state variable is two-dimensional, i.e, the current inventory level x and the wealth level w.

Instead of working with the dynamic program (4) ∼ (5), we find that it is more convenient to work with an equivalent formulation. If y≥x,letΠt′(x,w)=Vt(x,w−kx), and the modified income in period t be Pt′(y;ϵt)=(γk−k)y+(p−γk)Dt(ϵt)−ht(yt−Dt(ϵt)). In this case, the dynamic program (4)∼(5) becomes

Πt′(x,w)=maxy≥xE[Wt′(x,w,y;ϵt)](6)

where

Wt′(x,w,y;ϵt)=maxz′{πt(w−γz′−Kδ(y−x)+Pt′(y;ϵt))+Πt+1′(y−Dt(ϵt),z′)}(7)

If y≤x,letΠt″(x,w)=Vt(x,w+qx), and the modified income in period t be Pt″(y;ϵt) = (q − γq)y + (γq − p)D_t(ϵ_t) − h_t(y_t − D_t(ϵ_t)). In this case, the dynamic program (4)∼(5) becomes

Πt″(x,w)=maxy≤xE[Wt″(x,w,y;ϵt)](8)

where

Wt″(x,w,y;ϵt)=maxz″{πt(w−γz″−Qδ(x−y)+Pt″(y;ϵt))+Πt+1″(y−Dt(ϵt),z″)}(9)

Therefore, The dynamic program (4)∼(5) becomes

max{Πt′(x,w),Πt″(x,w)}(10)

Lemma 1

Assume thatK = 0. In this case, Πt′(x,w)is jointly concave inxand w for any periodt. Furthermore, a wealth dependent base stock policy with the base stock level L_t(w) is optimal.

Proof

We prove the lemma by induction. Obviously, ΠT+1′(x,w) is jointly concave in x and w. Assume that Πt+1′(x,w) is jointly concave in x and w. Note that Pt′(y;ϵt) is concave in y for any realization of ϵ_t. Thus,

Wt′(w,y;ϵt)=maxz′{πt(w−γz′+Pt′(y;ϵt))+Πt+1′(y−Dt(ϵt),z′)}

is jointly concave in (w, y), which further implies that E[Wt′(w,y;ϵt)] is jointly concave in (w, y).

Let L_t(w) be an optimal solution for the problem maxy≥xE[Wt′(w,y;ϵt)]. Since E[Wt′(w,y;ϵt)] is concave in y for any fixed w, it is optimal to order up to L_t(w) when x < L_t(w) and not to order otherwise. That is to say, a wealth dependent base stock policy is optimal. Further, according to the properties of the concave function, it is easy to show Πt′(x,w) is jointly concave in x and w. Hence, the lemma follows by induction.

Similar to Lemma 1, we have

Lemma 2

Assume that Q = 0. In this case, Πt′(x,w)is jointly concave inxand w for any periodt. Furthermore, a wealth dependent base return policy with the base return level U_t(w) is optimal.

Note that we have L_t(w)≤ U_t(w). Otherwise, there exists a x such that U_t(w) ≤ x ≤ L_t(w). By Lemma 1, it is optimal to order up to L_t(w) when x ≤ L_t(w); By Lemma 2, it is optimal to reduce down to U_t(w) when x ≥ U_t(w). This is a contradiction.

Due to Lemmas 1 and 2, we have

Theorem 3

Assume thatK = Q = 0, the optimal inventory levelytw(x)after a decision is made satisfies

ytw(x)=Lt(w),ifx≤Lt(w),x,ifx∈(Lt(w),Ut(w)),Ut(w),ifx≥Ut(w)(11)

Recall that in the case of risk-neutral decision maker, Eppen and Fama[2] and Whisler[3] study a special case of the stochastic cash balance problem where K = Q = 0. They show that in period t, there exist two parameters L_t and U_t with L_t ≤ U_t, such that the optimal inventory level y_t(x) after a decision is made satisfies

yt(x)=Lt,ifx≤Lt,x,ifx∈(Lt,Ut),Ut,ifx≥Ut.

However, Theorem 3 implies that the optimal policy for the additive increasing concave utility model is different. Indeed, in the risk-averse case, two parameters in the optimal policy depend on the wealth, measured by the position of the risk-free financial security.

4 Additive exponential utility function

In this section, we focus on a special case — the exponential utility function π_t(f) = −αte−fβt with parameters α_t, β_t > 0, where β_t is the risk tolerance factor, α_t reflects the decision maker’s attitude towards the utility obtained from different periods.

According to Chen et al[15], for a risk tolerance parameter R, denote the “certainty equivalent” operator with respect to a random variable ξ to be CEξR[ξ]=−Rln⁡E[e−ξR], which represents the amount of money a decision maker feels indifferent to a gamble with random payoff ξ. We also consider the “effective risk tolerance” per period defined as Rt=∑τ=tTγτ−tβτ. Further, we can obtain the expression R_t(1 + r_f) = (1 + r_f)β_t + R_t+1.

The next lemma states that we are able to separately make the operations decisions without considering the wealth/consumption decisions.

Lemma 3

The optimal operations decisions are independent of the wealth/consumption decisions under additive exponential utility function.

Proof

We prove the lemma by induction. First, let P_t(y_t; ϵ_t) := p_tD_t(ϵ_t) − h_t(y_t − D_t(ϵ_t)) in the profit function P_t(x_t, y_t; ϵ_t) = −Kδ(y_t − x_t) − Qδ(x_t − y_t) − k(y_t − x_t)⁺ − q(y_t − x_t)⁻ + p_tD_t(ϵ_t) − h_t(y_t − D_t(ϵ_t)) of period t.

For t = T, we have

VT(x,w)=maxyE[−αTe−(w−Kδ(y−x)−Qδ(x−y)−k(y−x)+−q(y−x)−+PT(y;ϵT))βT]=αTe−wβTmaxy−eKδ(y−x)+Qδ(x−y)+k(y−x)++q(y−x)−βTE[e−PT(y;ϵT)βT].

Let GT(x)=maxy{−Kδ(y−x)−Qδ(x−y)−k(y−x)+−q(y−x)−+CEϵTβT[PT(y;ϵT)]}, then

maxy−eKδ(y−x)+Qδ(x−y)+k(y−x)++q(y−x)−βTE[e−PT(y;ϵt)βT]=−e−GT(x)βT.Thus,VT(x,w)=−αTe−(GT(x)+w)RT.

Suppose that the lemma is true for some t + 1, i.e., Vt+1(x,w)=−At+1e−(Gt+1(x)+w)Rt+1 for some constant A_t+1 > 0. We have

Vt(x,w)=maxyE[maxw′{−αte−(w−γw′−Kδ(y−x)−Qδ(x−y)−k(y−x)+−q(y−x)−+Pt(y;ϵt))βt−At+1e−(Gt+1(y−Dt(ϵt))+w′)Rt+1}].

For any given y, the first order optimality condition with respect to w′ is

1βtαte−(w−γw′)βteKδ(y−x)+Qδ(x−y)+k(y−x)++q(y−x)−−Pt(y;ϵt)βt=1γRt+1At+1e−w′Rt+1e−Gt+1(y−Dt(ϵt))Rt+1(12)

equivalently,

ln⁡αtβt−w−γw′βt+Kδ(y−x)+Qδ(x−y)+k(y−x)++q(y−x)−−Pt(y;ϵt)βt=ln⁡At+1γRt+1−w′Rt+1−Gt+1(y−Dt(ϵt))Rt+1.

Thus, at state (x, w), for any given y and the realization of the current period uncertainty ϵ_t, the optimal banking decision w∗′ is

w∗′=−βtRtGt+1(y−Dt(ϵt))+Rt+1Rt(−Kδ(y−x)−Qδ(x−y)−k(y−x)+−q(y−x)−+Pt(y;ϵt))+Rt+1Rtw+Rt+1βtRtln⁡At+1βtγαtRt+1,

which implies that the optimal consumption decision in period t is

ft′=βtRt[w+(−Kδ(y−x)−Qδ(x−y)−k(y−x)+−q(y−x)−+Pt(y;ϵt))+γGt+1(y−Dt(ϵt))]−γRt+1βtRtln⁡At+1βtγαtRt+1.

Furthermore, by Eq (12), we have

Vt(x,w)=RtγRt+1At+1maxyE[−e(w∗′+Gt+1(y−Dt(ϵt))Rt+1]=Ate−wRtmaxyE[−e−γGt+1(y−Dt(ϵt))−Kδ(y−x)−Qδ(x−y)−k(y−x)+−q(y−x)−+Pt(y;ϵt))Rt],

where At=RtγRt+1At+1(At+1βtγαtRt+1)−βtRt>0. Let

Gt(x)=maxy{−Kδ(y−x)−Qδ(x−y)−k(y−x)+−q(y−x)−−Rtln⁡E[e{−1Rt[Pt(y;ϵt)+γGt+1(y−Dt(ϵt))]}]}=maxy{−Kδ(y−x)−Qδ(x−y)−k(y−x)+−q(y−x)−+CEϵtRt[Pt(y;ϵt)+γGt+1(y−Dt(ϵt))]}(13)

then Vt(x,w)=−Ate−(Gt(x)+w)Rt. Hence, the lemma follows by induction.

Therefore, by Lemma 3, the stochastic cash balance problem under additive exponential utility function reduces to the optimal problem (13) with boundary condition G_T+1(x) = 0.

To present our main result for the problem with K > 0 and Q > 0, we need the following proposition.

Proposition 1

If a function f(x, ξ) is (K, Q)-concave inxfor any realization ofξ, then for anyR > 0 the function

g(x)=CEξR[f(x,ξ)]

is also (K, Q)-concave.

Proof

Let M(x) = E[exp(f(x, ξ)]. For any x₀, x₁ with x₀≤ x₁ and λ ∈ [0,1], x_λ = (1 − λ)x₀ + λx₁, We have

M(xλ)≤E[exp⁡((1−λ)f(x0,ξ)+λf(x1,ξ)+λK+(1−λ)Q−min{λ,1−λ}min{K,Q})]=exp⁡(λK)exp⁡((1−λ)Q)exp⁡(−min{λ,1−λ}min{K,Q})E[exp⁡((1−λ)f(x0,ξ))exp⁡(λf(x1,ξ))]≤exp⁡(λK)exp⁡((1−λ)Q)exp⁡(−min{λ,1−λ}min{K,Q})E[exp⁡(f(x0,ξ)]1−λE[exp⁡(f(x1,ξ))]λ=M(x0)1−λM(x1)λexp⁡(λK)exp⁡((1−λ)Q)exp⁡(−min{λ,1−λ}min{K,Q}),

where the first inequality holds since f(⋅) is (K, Q)-convex and the second inequality follows from the Hölder inequality with 1p=1−λand1q=λ.

Note that Proposition 1 also holds for K-concave and symmetric-K-concave function since K-concave and symmetric-K-concave are both special cases of (K, Q)-concave function.

We can now present the optimal policy for the risk-averse stochastic case balance problem with additive exponential utility function. Without loss of generality, we assume that K ≥ Q ≥ 0.

Let

Hte(x)=CEϵtRt[Pt(x;ϵt)+γGt+1(x−Dt(ϵt))].

Define Lte∈arg⁡maxx{Hte(x)−kx},lte=min{x|Hte(x)−kx=Hte(Lte)−kLte−K},lt′e=min{x|Hte(x)−kx=Hte(Lte)−kLte−(K−Q)},Ute∈arg⁡maxx{Hte(x)+qx},ute=max{x|Hte(x)+qx=Hte(Ute)+qUte−Q},ut′e=min{x|Hte(x)+qx=Hte(Ute)+qUte−(K−Q)}.

Then, with Proposition 1, similar to Theorems 1 and 2, we have the following main results for the additive exponential utility model with K > 0 and Q > 0.

Theorem 4

Assume thatK ≥ Q > 0. G_t(x) andHte(x)are (K, Q)-concave and the optimal inventory levelyte(x)after a decision is made satisfies

yte(x)=Lte,ifx≤lte,∈{x,Lte},ifx∈(lte,lt′e),x,ifx∈(lt′e,ut′e),∈[lt′e,x],ifx∈(ut′e,ute),Ute,ifx≥ute(14)

The results for the case Q ≥ K > 0 follow from a symmetric argument.

On a special case of the stochastic cash balance problem where K = Q > 0, we have

Theorem 5

Assume thatK = Q. G_t(x) andHte(x)are symmetric K-concave and the optimal inventory levelyte(x)after a decision is made satisfies

yte(x)=Lte,ifx≤lte,∈{x,Lte},ifx∈(lte,lte+Lte2),x,ifx∈[lte+Lte2,ute+Ute2],∈{x,Ute},ifx∈(ute+Ute2,ute),Ute,ifx≥ute(15)

5 Additive exponential utility function with ambiguity aversion

In this section, we introduce the finite horizon ambiguity averse model under exponential utility function. Specially, assume that the decision maker does not know the exact probability distribution for the random variable ϵ_t. Rather, the decision maker is only aware of a set of probability distributions to which the probability distribution of ϵ_t belongs. According to Chen and Sun[17], in period t, the decision maker choose his policies assuming that nature is adversarial, choosing probability distributions g_{ϵ_t} from an ambiguity set Ω_t to minimize the decision maker’s expected utility. Thus, similar to (4)∼(5), a dynamic program for the risk and ambiguity averse stochastic cash balance problem is as follows:

Vt(x,w)=maxymingϵt∈ΩtEgϵt[Wt¯(x,w,y;ϵt)](16)

where

Wt¯(x,w,y;ϵt)=maxw′{πt(w−γw′+Pt¯(x,y;ϵt))+Vt+1(y−Dt(ϵt),w′)}(17)

with the boundary condition V_T(x, w) = π_T(w + P_T(x, y; ϵ_t)), V_T+1(x, 0)=0.

According to [17], we adopt the “general certainty equivalent” operator ϕ(⋅) defined on a function g(⋅) of an ambiguous uncertainty ξ, i.e, ΦΩR[ϕ(ξ)]=mingϵ∈Ω−Rln⁡Egϵ[e−ϕ(ξ)R]. Note that ΦΩR=CEξR when Ω is a singleton. Obviously, the operator ΦΩR generalizes the certainty equivalent operator CEξR in Section 4.

Assume that πt(f)=−αte−fβt, and the ambiguity sets satisfy certain technical conditions so that the minimization in the general certainty equivalent operator can always be attained. Similar to the proof of Lemma 3, the stochastic cash balance problem in the ambiguity and risk averse model (16)∼(17) can be calculated through the following dynamic programming

Gt(x)=maxy{−Kδ(y−x)−Qδ(x−y)−k(y−x)+−q(y−x)−+ΦΩtRt[Pt(y;ϵt)+γGt+1(y−Dt(ϵt))]}(18)

with boundary condition G_T+1(x) = 0.

To obtain the structure on the optimal policies, we need the following result, which implies the minimum envelope of (K, Q)-concave functions is still (K, Q)-concave.

Proposition 2

If f(x, v) is (K, Q)-convex inxfor any v, then g(x) = max_vf(x, v) is also (K, Q)-convex.

Proof

For any x₀ ≤ x₁ and λ ∈ [0, 1], x_λ = (1 − λ)x₀ + λ x₁, we have

g(xλ)=maxvf((1−λ)x0+λx1,v)≤maxv[(1−λ)f(x0,v)+λf(x1,v)+λK+(1−λ)Q−min{λ,1−λ}min{K,Q}]≤maxv[(1−λ)f(x0,v)]+maxv[λf(x1,v)]+λK+(1−λ)Q−min{λ,1−λ}min{K,Q}=(1−λ)g(x0)+λg(x1)+λK+(1−λ)Q−min{λ,1−λ}min{K,Q}.

Note that Proposition 2 also holds for K-concave and symmetric-K-concave since K-concave function and symmetric-K-concave are both special cases of (K, Q)-concave function.

Then, combined with Proposition 2, similar to the proof of the exponential utility function case, it is easy to see that Theorems 4 and 5 hold for the stochastic cash balance problem under the exponential utility function with ambiguity aversion.

6 Conclusions

In this paper, we propose a framework for incorporating risk aversion in stochastic cash balance problem. We characterize the structure of the optimal policy on the risk-averse stochastic cash balance problem according to the consumption model. We show that the structure of the optimal policy for a decision maker with exponential utility function is almost identical to the structure of the optimal risk-neutral operations policy. Furthermore, we extend the results for the exponential utility function to the ambiguity aversion case.

Supported by National Natural Science Foundation of China (Grant No. 11301445)

References

[1] Arrow K J, Karlin S, Scarf H. Studies in the mathematical theory of inventory and production. Stanford University Press, Stanford, California, 1958.Suche in Google Scholar

[2] Eppen G D, Fama E F. Cash balance and simple dynamic portofolio problems with proportional costs. International Economics Review, 1969, 10: 119–133.10.2307/2525547Suche in Google Scholar

[3] Whisler W D. A stochastic inventory model for rented equipment. Management Science, 1967, 13: 640–647.10.1287/mnsc.13.9.640Suche in Google Scholar

[4] Feinberg E L, Lewis M E. Optimality of four-threshold policies in inventory systems with customer returns and borrowing/storage options. Probability in the Enginering and Informatin Sciences, 2005, 19(1): 45–71.10.1017/S0269964805050047Suche in Google Scholar

[5] Girgis N M. Optimal cash balance levels. Management Science, 1968, 15(3): 130–140.10.1287/mnsc.15.3.130Suche in Google Scholar

[6] Neave E H. The stochastic cash balance probem with fixed costs for increases and decreases. Management Science, 1970, 16(7): 472–490.10.1287/mnsc.16.7.472Suche in Google Scholar

[7] Chen X, Simchi-Levi D. A new approach for the stochastic cash balance problem with fixed cost. Probability in the Engineering and Informational Science, 2009, 23(4): 545–562.10.1017/S0269964809000242Suche in Google Scholar

[8] Ye Q, Duenyas I. Optimal capacity investment decision with two-sided fixed capacity adjustment costs. Operations Research, 2007, 55(2): 272–283.10.1287/opre.1060.0386Suche in Google Scholar

[9] Feinberg E L, Lewis M E. Optimality inequalities for average cost Markov decision processes and the stochastic cash balance problem. Mathematics of Operations Research, 2007, 32(4): 769–783.10.1287/moor.1070.0269Suche in Google Scholar

[10] Chen Y H, Xu M H, Zhang Z G. A risk-averse newsvendor model under CVaR criterion. Operations Research, 2009, 57(4): 1040–1044.10.1287/opre.1080.0603Suche in Google Scholar

[11] Eeckhoudt L, Gollier C, Schlesinger H. The risk-averse (and prudent) newsboy. Management Science, 1995, 41(5): 786–794.10.1287/mnsc.41.5.786Suche in Google Scholar

[12] Lau H S. The newsboy problem under alternative optimization objective. Journal of the Operational Research Society, 1980, 31(6): 525–535.10.1057/jors.1980.96Suche in Google Scholar

[13] Wu M, Zhu S X, Teunter R H. Newsvendor problem with random shortage cost under a risk criterion. International Journal of Production and Economics, 2013, 145: 790–798.10.1016/j.ijpe.2013.06.007Suche in Google Scholar

[14] Bouakiz M, Sobel M J. Inventory control with an exponential utility criterion. Operations Research, 1992, 40(3): 603–608.10.1287/opre.40.3.603Suche in Google Scholar

[15] Chen X, Sim M, Simchi-Levi D, et al. Risk aversion in inventory management. Operations Research, 2007, 55(5): 828–842.10.1287/opre.1070.0429Suche in Google Scholar

[16] Nilim A, EI Ghaoui L. Robust solutions to Markov decision problems with uncertain transition matrices. Operations Research, 2005, 53(5): 780–798.10.1287/opre.1050.0216Suche in Google Scholar

[17] Chen X, Sun P. Optimal structural policies for ambiguity and risk averse inventory and pricing models. SIAM Journal on Control and Optimization, 2012, 50(1): 133–146.10.1137/100791488Suche in Google Scholar

[18] Scarf H. The optimality of (s, S) policies in the dynamic inventory problem. Proceedings of the 1st Stanford Symposium on Mathematical Methods in the Social Sciences, Stanford University Press, Stanford, California, 1960.Suche in Google Scholar

[19] Chen X, Simchi-Levi D. Coordinating inventory control and pricing strategies with random demand and fixed ordering cost: The finite horizon case. Operations Research, 2004, 52(6): 887–896.10.1287/opre.1040.0127Suche in Google Scholar

Received: 2014-4-1

Accepted: 2014-9-1

Published Online: 2014-12-25

Artikel in diesem Heft

https://doi.org/10.1515/JSSI-2014-0520

Schlagwörter für diesen Artikel

stochastic cash balance problem; risk aversion; ambiguity aversion; stochastic dynamic programming