Startseite The Stochastic Cash Balance Problem with Fixed Costs: The Risk-averse Case
Artikel Öffentlich zugänglich

The Stochastic Cash Balance Problem with Fixed Costs: The Risk-averse Case

  • Shuren Liu EMAIL logo und Pei Tang
Veröffentlicht/Copyright: 25. Dezember 2014
Veröffentlichen auch Sie bei De Gruyter Brill

Abstract

This paper discusses multi-period stochastic cash balance problem with fixed costs when the decision maker is risk averse. By using the consumption model introduced by Chen et al, we characterize the structure of the optimal policy for the stochastic cash balance problem under the general increasing concave utility function and exponential utility function, respectively. We show that the structure of the optimal policy for a decision maker with exponential utility function is almost identical to the structure of the optimal risk-neutral operations policy. Furthermore, we extend the results for the exponential utility function to the ambiguity aversion case.

1 Introduction

The stochastic cash balance problem is an optimization problem faced by a firm, which has to decide how much cash to hold in order to meet its transaction requirement for a given planing horizon with multiple periods. Arrow et al[1] point out that the similarity between the motives of inventories of goods and those for keeping cash balances. In contrast to the usual inventory problem, the stochastic cash balance problem with a case where the cash level (i.e., the checking account level) during the period can either increase or decrease, depending on whether the income is larger or smaller than the expenses during that period. It also allows the decision maker to change the cash level in any direction at the beginning of each period. He can increase the checking account level by withdrawing money from his savings, or decrease it by transferring money to his savings. Therefore, the stochastic cash balance problem can be regarded as a special type of inventory control problems, where the customer demands may be positive or negative and the decision maker can increase or decrease it. Hence, we will use the term inventory level instead of cash level, and also use the terms “order” or “return” to indicate the increase or decrease of the cash levels. At the beginning of each period, the firm may decide to replenish the inventory or return excess stock. Both the ordering cost and the return cost may include a fixed component and a variable component which is proportional to the transaction amount. A holding or penalty cost is charged depending on whether the inventory level is positive or negative. The objective of the firm is to find an ordering or return policy so as to minimize the total expected cost, or equivalently, maximize the total expected profit over the entire planning horizon. Of course, this focus on optimizing expected profit or cost is appropriate for a risk-neutral decision maker, i.e., a firm that is insensitive to profit variations.

The stochastic cash balance problem received considerable amount of attention in the 1960s. Eppen and Fama[2], Whisler[3] consider a cash balance model with independent and identical distribution discrete demands with finite support and without fixed costs. They show the existence of order-up-to and return-down-to levels in the finite and infinite horizon models with discounted cost criterion. Feinberg and Lewis[4] justify the average cost case with the general demand distribution and study the problems with borrowing and lending options and no fixed costs, for which they establish the optimality of simple four-threshold policies. Girgis[5] investigates finite and infinite horizon discounted cost problems with continuous demand when there are fixed costs for increasing or decreasing demand (but not both). Neave[6] studies finite horizon problems with continuous demand when both transactions have fixed costs. However, Chen and Simchi-Levi[7] and Ye and Duenyas[8] notice that some of the claims in [6] are not proved. By using the notion of a (K, Q)-convex function introduced by [8], Chen and Simchi-Levi[7] describe the structural properties of optimal solutions of finite horizon cash balance problems when both transactions have fixed costs. Feinberg and Lewis[9] show that structural results stated by [7] indeed hold for finite horizon cash balance problems with discounted criteria and extend the results to the average cost per unit time criteria.

All the papers referenced above assume that the decision makers are risk-neutral. However, many are willing to tradeoff lower expected profit for downside protection against possible losses. Note that traditional stochastic cash balance models fall short of meeting the needs of risk-averse planners. For instance, traditional stochastic cash balance models do not suggest mechanism to reduce the chance of unfavorable profit levels. Thus, it is important to incorporate the notions of risk aversion in the stochastic cash balance problem.

A parallel stream of research studies risk-averse inventory models. Many of the risk-averse inventory models consider single period newsvendor type of models (see, for example, Chen, Xu and Zhang[10], Eeckhoudt, Gollier and Schlesinger[11], Lau[12], Wu, Zhu and Teunter[13]). Bouakiz and Sobel[14] characterize the inventory control strategy so as to minimize the expected utility of the net present value of costs over a finite planing or an infinite horizon. Assuming linear ordering cost, they prove that a base stock policy is optimal. Chen et al[15] propose a general framework to incorporate risk aversion into multi-period inventory models as well as multi-period models that coordinate inventory and pricing strategies. In both cases, they distinguish between models with fixed ordering costs and models with no fixed ordering costs. They show that the structure of the optimal policy for a decision maker with exponential utility function is almost identical to the structure of the optimal risk-neutral inventory (and pricing) policies. These structural results are extended to models in which the decision maker has access to a (partially) complete financial market and can hedge his operational risk through trading financial securities.

On the other hand, a decision maker may not know the exact demand distributions and have to estimate them from limited historical data. In this case, the decision maker is ambiguous about the probability distribution. Recently, Nilim and EI Ghaoui[16] study robust solutions to Markov decision problems with uncertain transition matrices. They propose the general idea on the ambiguity averse models, that is, the decision maker choose his policies assuming that nature is adversarial, choosing probability distributions from an ambiguity set to minimize the decision maker’s expected utility. Chen and Sun[17] adopt the robust dynamic programming modelling framework introduced by [16] to ambiguity and risk averse inventory and pricing models. They show that the optimal control policies share similar structure properties as Chen et al[15] for the finite horizon case and extend Chen et al[15] to including ambiguity aversion and considering infinite horizon models.

In this paper, we propose a framework for incorporating risk aversion in stochastic cash balance problem. We characterize the structure of the optimal policy on the risk-averse stochastic cash balance problem by using the consumption model introduced by Chen et al[15]. We show that the structure of the optimal policy for a decision maker with exponential utility function is almost identical to the structure of the optimal risk-neutral operations policy. Furthermore, we extend the results for the exponential utility function to the ambiguity aversion case.

The paper is organized as follows. In Section 2, we propose a model to incorporate risk aversion in the stochastic cash balance problem. In Sections 3 and 4, we focus on characterizing the structure of the optimal policies under the general increasing concave utility function and exponential utility function, respectively. In Section 5, we extend the results for the exponential utility function to the ambiguity aversion case. Finally, Section 6 is concluding section.

2 The basic model

Consider a risk-averse firm facing stochastic demand that has to make ordering or return decisions over a finite planning horizon with a total of T periods.

At the beginning of each period, an ordering or return decision is made. Let xt be the inventory level at the beginning of period t before a decision is made and yt be the inventory level at the beginning of period t after an ordering or return decision is made. Lead time for the ordering or return transaction is assumed to be zero. The transaction cost is denoted by c(xt, yt), which is calculated as follows:

c(xt,yt)=K+k(ytxt),ifyt>xt,0,ifyt=xt,Q+q(xtyt),ifyt<xt,

where K ≥ 0, Q ≥ 0, k + q ≥ 0. Note that the assumption that k + q ≥ 0 implies that the unit refund is no more than the unit ordering cost.

For t = 1, 2, ⋯, T, let pt be per unit “sale price” of product in period t and Dtt) (here ϵt is a random variable) be “stochastic demand” in period t, which consists of obligations paid less funds received (note that the demand in a period can be negative, which corresponds to receiving more funds than were paid out that period). Furthermore, demands in different periods are independent of each other. Unsatisfied demand is backlogged. Therefore, the inventory level carried over from period t to the next period, xt+1, may be positive or negative. A cost ht(xt+1) is incurred at the end of period t which represents holding cost when xt+1 > 0 and shortage cost if xt+1 < 0. For technical reasons, we assume that function ht(x) is convex and lim|x|ht(x)=. Further, similar to Assumption 1 in Chen and Simchi-Levi[7], it is assumed that there are finite numbers xtytvtzt such that (ht(yt) − ht(xt))/(ytxt) < −k and (ht(zt) − ht(vt))/(ztvt) > q.

To study the stochastic cash balance problem with fixed costs under risk aversion, we adopt the consumption model under uncertainty introduced by Chen et al[15]. The general idea is to directly model consumption, saving and borrowing decisions as well as inventory decisions for the stochastic cash balance problem. Specifically, assume that the decision maker has access to a financial market for borrowing and lending with a risk-free saving and borrowing interest rate rf, or equivalently, the discount factor is γ=11+rf. At the beginning of period t, assume that the decision maker has initial wealth wt and chooses an operations policy (order or return) that affects his income cash flow. At the end of period t, that is, after the uncertainty of this period has been resolved, the decision maker observes his current wealth level wt + Pt and decides his consumption level ft for the period, where Pt is the income generated at period t. Note that the income at period t is

Pt¯(xt,yt;ϵt)=Kδ(ytxt)Qδ(xtyt)k(ytxt)+q(ytxt)+ptDt(ϵt)ht(ytDt(ϵt)),

where x+ = max{x, 0}, x = min{x, 0},

δ(x)=1,ifx>0,0,otherwise.

The remaining wealth, wt + Ptft, is then saved (or borrowed, if negative) for the next period, i.e., wt+1 = (1 + rf)(wt + Ptft), or, equivalently, ft = wtγwt+1 + Pt. The decision maker’s objective is to maximize his expected utility of the consumption flow E[Π(f1, ⋯, fT)] over the planing horizon 1,⋯, T. Moreover, at the last period T, we assume the decision maker consumes everything, which corresponds to wT+1 = 0.

According to the consumption model, the decision maker’s problem is to find the inventory level yt and decide the initial wealth level wt (or equivalently, the consumption level ft) for the following optimization problem.

maxE[Π(f1,f2,,fT)]s.t.xt+1=ytDt(ϵt),ft=wtγwt+1+Pt¯(xt,yt;ϵt),wT+1=0.(1)

When the utility function Π(f1, f2, ⋯, fT) takes the linear form Π(f1, f2, ⋯, fT) = t=1Tγt1ft, the consumption model reduces to the traditional risk-neutral stochastic cash balance problem. In this case, we denote Vt(x) to be the profit-to-go function at the beginning of period t with the initial inventory level x. A natural dynamic program for the risk-neutral stochastic cash balance problem is as follows:

Vt(x)=maxy{Kδ(yx)Qδ(xy)k(yx)+q(yx)+pDt(ϵt)ht(yDt(ϵt))+γVt+1(yDt(ϵt))}=max{Ht(x),maxy>xHt(y)Kk(yx),maxy<xHt(y)Qq(xy)},

with boundary condition VT+1(x) = 0, where Ht(x) = E{pDtt) − ht(xDtt)) + γVt+1(xDtt))}. Without loss of generality, we assume that kQ. Define Ltargmaxx{Ht(x)kx},lt=min{x|Ht(x)kx=Ht(Lt)kLtK},lt=min{x|Ht(x)kx=Ht(Lt)kLt(KQ)},Utargmaxx{Ht(x)+qx},ut=max{x|Ht(x)+qx=Ht(Ut)+qUtQ},ut=min{x|Ht(x)+qx=Ht(Ut)+qUt(KQ)}.

From Lemma 3 in Chen and Simchi-Levi[7], we have LtUt,ltut. Moreover, utUt due to KQ ≥ 0. Therefore, the above parameters satisfy the following relationship: ltltLtUtut,ltutUtut.

Notice that these critical points have explicit implications in the stochastic cash balance problem. By definition, lt is the largest value below which one always orders; lt is the smallest value above which one never orders; ut is the smallest value above which one always returns; ut is the largest value below which one never returns. In particular, we call {lt,lt}and{ut,ut} the pairs of order- and return-associated critical points, respectively.

To provide a characterization of the optimal policy, Chen and Simchi-Levi use the following concept of (K, Q)-convexity, which is introduced by Ye and Duenyas[8].

Definition 1

A real-valued function is called (K, Q)-convex for K, Q ≥ 0, if for anyx0, x1withx0x1, and λ ∈ [0, 1],

f((1λ)x0+λx1)(1λ)f(x0)+λf(x1)+λK+(1λ)Qmin{λ,1λ}min{K,Q}.

A function f is called (K, Q)-concave iff is (K, Q)-convex.

See Lemmas 1 and 2 in [7] for the properties of the (K, Q)-convex function.

Note that the (K, 0)-convexity is exactly the K-convexity introduced by Scarf[18] for the classical stochastic inventory control problem with fixed ordering costs. Moreover, the (K, K)-convexity is the symmetric K-convexity, a concept introduced and applied in Chen and Simchi-Levi[19] to analyze a joint inventory control and pricing problem with fixed ordering costs and a general demand distributions.

Similar to the proof of Theorems 3.1 and 3.2 in [7], we have the following main results for the traditional risk-neutral stochastic cash balance problem.

Theorem 1

Assume that KQ > 0. The profit-to-go functionsVt(x) andHt(x) are (K, Q)-concave and the optimal inventory levelyt(x) after a decision is made satisfies

yt(x)=Lt,ifxlt,{x,Lt},ifx(lt,lt),x,ifx(lt,ut),[lt,x],ifx(ut,ut),Ut,ifxut(2)

The results for the case QK > 0 follow from a symmetric argument.

On a special case of the stochastic cash balance problem where K = Q > 0, we have

Theorem 2

Assume that K = Q. The profit-to-go functions Vt(x) and Ht(x) are symmetric K-concave and the optimal inventory level yt(x) after a decision is made satisfies

yt(x)=Lt,ifxlt,{x,Lt},ifx(lt,lt+Lt2),x,ifx[lt+Lt2,ut+Ut2],{x,Ut},ifx(ut+Ut2,ut),Ut,ifxut(3)

3 Additive increasing concave utility model

In this section, we focus on the additive general increasing concave utility function. In this case, the objective function of (1) becomes Π(f1,,fT)=t=1Tπt(ft), where the function πt(⋅) is increasing and concave. That is, the utility of the consumption flow is the summation of the utility from the consumption in each period. According to the sequence of events as described before, the optimization model (1) can be solved by the following dynamic programming recursion.

Vt(x,w)=maxyE[Wt¯(x,w,y;ϵt)](4)

where

Wt¯(x,w,y;ϵt)=maxw{πt(wγw+Pt¯(x,y;ϵt))+Vt+1(yDt(ϵt),w)}(5)

with boundary conditions VT(x, w) = πT(w + PT(x, yT)), VT+1(x, 0) = 0. In contrast to risk-neutral stochastic cash problem, here the state variable is two-dimensional, i.e, the current inventory level x and the wealth level w.

Instead of working with the dynamic program (4) ∼ (5), we find that it is more convenient to work with an equivalent formulation. If yx,letΠt(x,w)=Vt(x,wkx), and the modified income in period t be Pt(y;ϵt)=(γkk)y+(pγk)Dt(ϵt)ht(ytDt(ϵt)). In this case, the dynamic program (4)∼(5) becomes

Πt(x,w)=maxyxE[Wt(x,w,y;ϵt)](6)

where

Wt(x,w,y;ϵt)=maxz{πt(wγzKδ(yx)+Pt(y;ϵt))+Πt+1(yDt(ϵt),z)}(7)

If yx,letΠt(x,w)=Vt(x,w+qx), and the modified income in period t be Pt(y;ϵt) = (qγq)y + (γqp)Dtt) − ht(ytDtt)). In this case, the dynamic program (4)∼(5) becomes

Πt(x,w)=maxyxE[Wt(x,w,y;ϵt)](8)

where

Wt(x,w,y;ϵt)=maxz{πt(wγzQδ(xy)+Pt(y;ϵt))+Πt+1(yDt(ϵt),z)}(9)

Therefore, The dynamic program (4)∼(5) becomes

max{Πt(x,w),Πt(x,w)}(10)

Lemma 1

Assume thatK = 0. In this case, Πt(x,w)is jointly concave inxand w for any periodt. Furthermore, a wealth dependent base stock policy with the base stock level Lt(w) is optimal.

Proof

We prove the lemma by induction. Obviously, ΠT+1(x,w) is jointly concave in x and w. Assume that Πt+1(x,w) is jointly concave in x and w. Note that Pt(y;ϵt) is concave in y for any realization of ϵt. Thus,

Wt(w,y;ϵt)=maxz{πt(wγz+Pt(y;ϵt))+Πt+1(yDt(ϵt),z)}

is jointly concave in (w, y), which further implies that E[Wt(w,y;ϵt)] is jointly concave in (w, y).

Let Lt(w) be an optimal solution for the problem maxyxE[Wt(w,y;ϵt)]. Since E[Wt(w,y;ϵt)] is concave in y for any fixed w, it is optimal to order up to Lt(w) when x < Lt(w) and not to order otherwise. That is to say, a wealth dependent base stock policy is optimal. Further, according to the properties of the concave function, it is easy to show Πt(x,w) is jointly concave in x and w. Hence, the lemma follows by induction.

Similar to Lemma 1, we have

Lemma 2

Assume that Q = 0. In this case, Πt(x,w)is jointly concave inxand w for any periodt. Furthermore, a wealth dependent base return policy with the base return level Ut(w) is optimal.

Note that we have Lt(w)≤ Ut(w). Otherwise, there exists a x such that Ut(w) ≤ xLt(w). By Lemma 1, it is optimal to order up to Lt(w) when xLt(w); By Lemma 2, it is optimal to reduce down to Ut(w) when xUt(w). This is a contradiction.

Due to Lemmas 1 and 2, we have

Theorem 3

Assume thatK = Q = 0, the optimal inventory levelytw(x)after a decision is made satisfies

ytw(x)=Lt(w),ifxLt(w),x,ifx(Lt(w),Ut(w)),Ut(w),ifxUt(w)(11)

Recall that in the case of risk-neutral decision maker, Eppen and Fama[2] and Whisler[3] study a special case of the stochastic cash balance problem where K = Q = 0. They show that in period t, there exist two parameters Lt and Ut with LtUt, such that the optimal inventory level yt(x) after a decision is made satisfies

yt(x)=Lt,ifxLt,x,ifx(Lt,Ut),Ut,ifxUt.

However, Theorem 3 implies that the optimal policy for the additive increasing concave utility model is different. Indeed, in the risk-averse case, two parameters in the optimal policy depend on the wealth, measured by the position of the risk-free financial security.

4 Additive exponential utility function

In this section, we focus on a special case — the exponential utility function πt(f) = αtefβt with parameters αt, βt > 0, where βt is the risk tolerance factor, αt reflects the decision maker’s attitude towards the utility obtained from different periods.

According to Chen et al[15], for a risk tolerance parameter R, denote the “certainty equivalent” operator with respect to a random variable ξ to be CEξR[ξ]=RlnE[eξR], which represents the amount of money a decision maker feels indifferent to a gamble with random payoff ξ. We also consider the “effective risk tolerance” per period defined as Rt=τ=tTγτtβτ. Further, we can obtain the expression Rt(1 + rf) = (1 + rf)βt + Rt+1.

The next lemma states that we are able to separately make the operations decisions without considering the wealth/consumption decisions.

Lemma 3

The optimal operations decisions are independent of the wealth/consumption decisions under additive exponential utility function.

Proof

We prove the lemma by induction. First, let Pt(yt; ϵt) := ptDtt) − ht(ytDtt)) in the profit function Pt(xt, yt; ϵt) = −Kδ(ytxt) − Qδ(xtyt) − k(ytxt)+q(ytxt) + ptDtt) − ht(ytDtt)) of period t.

For t = T, we have

VT(x,w)=maxyE[αTe(wKδ(yx)Qδ(xy)k(yx)+q(yx)+PT(y;ϵT))βT]=αTewβTmaxyeKδ(yx)+Qδ(xy)+k(yx)++q(yx)βTE[ePT(y;ϵT)βT].

Let GT(x)=maxy{Kδ(yx)Qδ(xy)k(yx)+q(yx)+CEϵTβT[PT(y;ϵT)]}, then

maxyeKδ(yx)+Qδ(xy)+k(yx)++q(yx)βTE[ePT(y;ϵt)βT]=eGT(x)βT.Thus,VT(x,w)=αTe(GT(x)+w)RT.

Suppose that the lemma is true for some t + 1, i.e., Vt+1(x,w)=At+1e(Gt+1(x)+w)Rt+1 for some constant At+1 > 0. We have

Vt(x,w)=maxyE[maxw{αte(wγwKδ(yx)Qδ(xy)k(yx)+q(yx)+Pt(y;ϵt))βtAt+1e(Gt+1(yDt(ϵt))+w)Rt+1}].

For any given y, the first order optimality condition with respect to w′ is

1βtαte(wγw)βteKδ(yx)+Qδ(xy)+k(yx)++q(yx)Pt(y;ϵt)βt=1γRt+1At+1ewRt+1eGt+1(yDt(ϵt))Rt+1(12)

equivalently,

lnαtβtwγwβt+Kδ(yx)+Qδ(xy)+k(yx)++q(yx)Pt(y;ϵt)βt=lnAt+1γRt+1wRt+1Gt+1(yDt(ϵt))Rt+1.

Thus, at state (x, w), for any given y and the realization of the current period uncertainty ϵt, the optimal banking decision w is

w=βtRtGt+1(yDt(ϵt))+Rt+1Rt(Kδ(yx)Qδ(xy)k(yx)+q(yx)+Pt(y;ϵt))+Rt+1Rtw+Rt+1βtRtlnAt+1βtγαtRt+1,

which implies that the optimal consumption decision in period t is

ft=βtRt[w+(Kδ(yx)Qδ(xy)k(yx)+q(yx)+Pt(y;ϵt))+γGt+1(yDt(ϵt))]γRt+1βtRtlnAt+1βtγαtRt+1.

Furthermore, by Eq (12), we have

Vt(x,w)=RtγRt+1At+1maxyE[e(w+Gt+1(yDt(ϵt))Rt+1]=AtewRtmaxyE[eγGt+1(yDt(ϵt))Kδ(yx)Qδ(xy)k(yx)+q(yx)+Pt(y;ϵt))Rt],

where At=RtγRt+1At+1(At+1βtγαtRt+1)βtRt>0. Let

Gt(x)=maxy{Kδ(yx)Qδ(xy)k(yx)+q(yx)RtlnE[e{1Rt[Pt(y;ϵt)+γGt+1(yDt(ϵt))]}]}=maxy{Kδ(yx)Qδ(xy)k(yx)+q(yx)+CEϵtRt[Pt(y;ϵt)+γGt+1(yDt(ϵt))]}(13)

then Vt(x,w)=Ate(Gt(x)+w)Rt. Hence, the lemma follows by induction.

Therefore, by Lemma 3, the stochastic cash balance problem under additive exponential utility function reduces to the optimal problem (13) with boundary condition GT+1(x) = 0.

To present our main result for the problem with K > 0 and Q > 0, we need the following proposition.

Proposition 1

If a function f(x, ξ) is (K, Q)-concave inxfor any realization ofξ, then for anyR > 0 the function

g(x)=CEξR[f(x,ξ)]

is also (K, Q)-concave.

Proof

Let M(x) = E[exp(f(x, ξ)]. For any x0, x1 with x0x1 and λ ∈ [0,1], xλ = (1 − λ)x0 + λx1, We have

M(xλ)E[exp((1λ)f(x0,ξ)+λf(x1,ξ)+λK+(1λ)Qmin{λ,1λ}min{K,Q})]=exp(λK)exp((1λ)Q)exp(min{λ,1λ}min{K,Q})E[exp((1λ)f(x0,ξ))exp(λf(x1,ξ))]exp(λK)exp((1λ)Q)exp(min{λ,1λ}min{K,Q})E[exp(f(x0,ξ)]1λE[exp(f(x1,ξ))]λ=M(x0)1λM(x1)λexp(λK)exp((1λ)Q)exp(min{λ,1λ}min{K,Q}),

where the first inequality holds since f(⋅) is (K, Q)-convex and the second inequality follows from the Hölder inequality with 1p=1λand1q=λ.

Note that Proposition 1 also holds for K-concave and symmetric-K-concave function since K-concave and symmetric-K-concave are both special cases of (K, Q)-concave function.

We can now present the optimal policy for the risk-averse stochastic case balance problem with additive exponential utility function. Without loss of generality, we assume that KQ ≥ 0.

Let

Hte(x)=CEϵtRt[Pt(x;ϵt)+γGt+1(xDt(ϵt))].

Define Lteargmaxx{Hte(x)kx},lte=min{x|Hte(x)kx=Hte(Lte)kLteK},lte=min{x|Hte(x)kx=Hte(Lte)kLte(KQ)},Uteargmaxx{Hte(x)+qx},ute=max{x|Hte(x)+qx=Hte(Ute)+qUteQ},ute=min{x|Hte(x)+qx=Hte(Ute)+qUte(KQ)}.

Then, with Proposition 1, similar to Theorems 1 and 2, we have the following main results for the additive exponential utility model with K > 0 and Q > 0.

Theorem 4

Assume thatKQ > 0. Gt(x) andHte(x)are (K, Q)-concave and the optimal inventory levelyte(x)after a decision is made satisfies

yte(x)=Lte,ifxlte,{x,Lte},ifx(lte,lte),x,ifx(lte,ute),[lte,x],ifx(ute,ute),Ute,ifxute(14)

The results for the case QK > 0 follow from a symmetric argument.

On a special case of the stochastic cash balance problem where K = Q > 0, we have

Theorem 5

Assume thatK = Q. Gt(x) andHte(x)are symmetric K-concave and the optimal inventory levelyte(x)after a decision is made satisfies

yte(x)=Lte,ifxlte,{x,Lte},ifx(lte,lte+Lte2),x,ifx[lte+Lte2,ute+Ute2],{x,Ute},ifx(ute+Ute2,ute),Ute,ifxute(15)

5 Additive exponential utility function with ambiguity aversion

In this section, we introduce the finite horizon ambiguity averse model under exponential utility function. Specially, assume that the decision maker does not know the exact probability distribution for the random variable ϵt. Rather, the decision maker is only aware of a set of probability distributions to which the probability distribution of ϵt belongs. According to Chen and Sun[17], in period t, the decision maker choose his policies assuming that nature is adversarial, choosing probability distributions gϵt from an ambiguity set Ωt to minimize the decision maker’s expected utility. Thus, similar to (4)∼(5), a dynamic program for the risk and ambiguity averse stochastic cash balance problem is as follows:

Vt(x,w)=maxymingϵtΩtEgϵt[Wt¯(x,w,y;ϵt)](16)

where

Wt¯(x,w,y;ϵt)=maxw{πt(wγw+Pt¯(x,y;ϵt))+Vt+1(yDt(ϵt),w)}(17)

with the boundary condition VT(x, w) = πT(w + PT(x, y; ϵt)), VT+1(x, 0)=0.

According to [17], we adopt the “general certainty equivalent” operator ϕ(⋅) defined on a function g(⋅) of an ambiguous uncertainty ξ, i.e, ΦΩR[ϕ(ξ)]=mingϵΩRlnEgϵ[eϕ(ξ)R]. Note that ΦΩR=CEξR when Ω is a singleton. Obviously, the operator ΦΩR generalizes the certainty equivalent operator CEξR in Section 4.

Assume that πt(f)=αtefβt, and the ambiguity sets satisfy certain technical conditions so that the minimization in the general certainty equivalent operator can always be attained. Similar to the proof of Lemma 3, the stochastic cash balance problem in the ambiguity and risk averse model (16)∼(17) can be calculated through the following dynamic programming

Gt(x)=maxy{Kδ(yx)Qδ(xy)k(yx)+q(yx)+ΦΩtRt[Pt(y;ϵt)+γGt+1(yDt(ϵt))]}(18)

with boundary condition GT+1(x) = 0.

To obtain the structure on the optimal policies, we need the following result, which implies the minimum envelope of (K, Q)-concave functions is still (K, Q)-concave.

Proposition 2

If f(x, v) is (K, Q)-convex inxfor any v, then g(x) = maxvf(x, v) is also (K, Q)-convex.

Proof

For any x0x1 and λ ∈ [0, 1], xλ = (1 − λ)x0 + λ x1, we have

g(xλ)=maxvf((1λ)x0+λx1,v)maxv[(1λ)f(x0,v)+λf(x1,v)+λK+(1λ)Qmin{λ,1λ}min{K,Q}]maxv[(1λ)f(x0,v)]+maxv[λf(x1,v)]+λK+(1λ)Qmin{λ,1λ}min{K,Q}=(1λ)g(x0)+λg(x1)+λK+(1λ)Qmin{λ,1λ}min{K,Q}.

Note that Proposition 2 also holds for K-concave and symmetric-K-concave since K-concave function and symmetric-K-concave are both special cases of (K, Q)-concave function.

Then, combined with Proposition 2, similar to the proof of the exponential utility function case, it is easy to see that Theorems 4 and 5 hold for the stochastic cash balance problem under the exponential utility function with ambiguity aversion.

6 Conclusions

In this paper, we propose a framework for incorporating risk aversion in stochastic cash balance problem. We characterize the structure of the optimal policy on the risk-averse stochastic cash balance problem according to the consumption model. We show that the structure of the optimal policy for a decision maker with exponential utility function is almost identical to the structure of the optimal risk-neutral operations policy. Furthermore, we extend the results for the exponential utility function to the ambiguity aversion case.


Supported by National Natural Science Foundation of China (Grant No. 11301445)


References

[1] Arrow K J, Karlin S, Scarf H. Studies in the mathematical theory of inventory and production. Stanford University Press, Stanford, California, 1958.Suche in Google Scholar

[2] Eppen G D, Fama E F. Cash balance and simple dynamic portofolio problems with proportional costs. International Economics Review, 1969, 10: 119–133.10.2307/2525547Suche in Google Scholar

[3] Whisler W D. A stochastic inventory model for rented equipment. Management Science, 1967, 13: 640–647.10.1287/mnsc.13.9.640Suche in Google Scholar

[4] Feinberg E L, Lewis M E. Optimality of four-threshold policies in inventory systems with customer returns and borrowing/storage options. Probability in the Enginering and Informatin Sciences, 2005, 19(1): 45–71.10.1017/S0269964805050047Suche in Google Scholar

[5] Girgis N M. Optimal cash balance levels. Management Science, 1968, 15(3): 130–140.10.1287/mnsc.15.3.130Suche in Google Scholar

[6] Neave E H. The stochastic cash balance probem with fixed costs for increases and decreases. Management Science, 1970, 16(7): 472–490.10.1287/mnsc.16.7.472Suche in Google Scholar

[7] Chen X, Simchi-Levi D. A new approach for the stochastic cash balance problem with fixed cost. Probability in the Engineering and Informational Science, 2009, 23(4): 545–562.10.1017/S0269964809000242Suche in Google Scholar

[8] Ye Q, Duenyas I. Optimal capacity investment decision with two-sided fixed capacity adjustment costs. Operations Research, 2007, 55(2): 272–283.10.1287/opre.1060.0386Suche in Google Scholar

[9] Feinberg E L, Lewis M E. Optimality inequalities for average cost Markov decision processes and the stochastic cash balance problem. Mathematics of Operations Research, 2007, 32(4): 769–783.10.1287/moor.1070.0269Suche in Google Scholar

[10] Chen Y H, Xu M H, Zhang Z G. A risk-averse newsvendor model under CVaR criterion. Operations Research, 2009, 57(4): 1040–1044.10.1287/opre.1080.0603Suche in Google Scholar

[11] Eeckhoudt L, Gollier C, Schlesinger H. The risk-averse (and prudent) newsboy. Management Science, 1995, 41(5): 786–794.10.1287/mnsc.41.5.786Suche in Google Scholar

[12] Lau H S. The newsboy problem under alternative optimization objective. Journal of the Operational Research Society, 1980, 31(6): 525–535.10.1057/jors.1980.96Suche in Google Scholar

[13] Wu M, Zhu S X, Teunter R H. Newsvendor problem with random shortage cost under a risk criterion. International Journal of Production and Economics, 2013, 145: 790–798.10.1016/j.ijpe.2013.06.007Suche in Google Scholar

[14] Bouakiz M, Sobel M J. Inventory control with an exponential utility criterion. Operations Research, 1992, 40(3): 603–608.10.1287/opre.40.3.603Suche in Google Scholar

[15] Chen X, Sim M, Simchi-Levi D, et al. Risk aversion in inventory management. Operations Research, 2007, 55(5): 828–842.10.1287/opre.1070.0429Suche in Google Scholar

[16] Nilim A, EI Ghaoui L. Robust solutions to Markov decision problems with uncertain transition matrices. Operations Research, 2005, 53(5): 780–798.10.1287/opre.1050.0216Suche in Google Scholar

[17] Chen X, Sun P. Optimal structural policies for ambiguity and risk averse inventory and pricing models. SIAM Journal on Control and Optimization, 2012, 50(1): 133–146.10.1137/100791488Suche in Google Scholar

[18] Scarf H. The optimality of (s, S) policies in the dynamic inventory problem. Proceedings of the 1st Stanford Symposium on Mathematical Methods in the Social Sciences, Stanford University Press, Stanford, California, 1960.Suche in Google Scholar

[19] Chen X, Simchi-Levi D. Coordinating inventory control and pricing strategies with random demand and fixed ordering cost: The finite horizon case. Operations Research, 2004, 52(6): 887–896.10.1287/opre.1040.0127Suche in Google Scholar

Received: 2014-4-1
Accepted: 2014-9-1
Published Online: 2014-12-25

© 2014 Walter de Gruyter GmbH, Berlin/Boston

Heruntergeladen am 20.11.2025 von https://www.degruyterbrill.com/document/doi/10.1515/JSSI-2014-0520/html
Button zum nach oben scrollen