Building Reputation in a War of Attrition Game: Hawkish or Dovish Stance?

Selçuk Özyurt

doi:10.1515/bejte-2015-0093

Enjoy 40% off

academic books on De Gruyter Brill *

Article Publicly Available

Building Reputation in a War of Attrition Game: Hawkish or Dovish Stance?

Selçuk Özyurt

Published/Copyright: May 25, 2016

Published by

Become an author with De Gruyter Brill

Submit Manuscript Author Information Explore this Subject

From the journal The B.E. Journal of Theoretical Economics Volume 16 Issue 2

Abstract

This paper examines a two-player war of attrition game in continuous-time, where (1) fighting (i. e., escalating the conflict) is costless for a player unless he quits, (2) at any point in time, each player can attack to his opponent and finalize the game with a costly war, (3) there is two-sided uncertainty regarding the players’ resolve, and (4) each player can choose his tone/stance (either hawkish or dovish) at the beginning of the game, which affects his quitting cost. The results imply that choosing hawkish (dovish) regime is optimal if and only if the benefit-cost ratio of the dispute is sufficiently high (low). If hawkish tone is going to give a player upper hand in a dispute, then choosing a more aggressive tone does not increase his payoff. However, choosing a more dovish tone increases a player’s payoff whenever dovish regime is optimal.

Keywords: war of attrition game; continuous time games; reputation; dispute resolution

JEL: C72; D74; D82; D83

In politics… never retreat, never retract… never admit a mistake.
Napoleon Bonaparte

1 Introduction

The current paper generalizes the results of Ozyurt (2014) and extends its workhorse war of attrition model by adding an initial stage where the players can endogenously choose their escalation cost parameters. What is novel about the game is that (1) escalating the conflict (i. e., fighting) is costless for a player provided that he never retreats, (2) at any point in time, each player can choose one of three possible actions; escalate (wait), back down (quit), and attack, (3) there is two-sided uncertainty regarding the players’ commitment (i. e., resolve), and finally (4) each player can choose his stance, tone, or regime (either hawkish or dovish) at the beginning of the game, which affects the cost he will suffer when he quits. Sending strong public threats to the opponent, pledging himself to a certain course of action or to some bold, future act can be interpreted as choosing hawkish tone in a dispute because these actions raise public’s expectation, and so cause bigger embarrassment in case of a failure of meeting these expectations. ^[1]

An international dispute between two states and a labor dispute between a firm and a union are two examples that would fit to the setup provided above. Mimicking the resolved type by escalating the dispute is potentially a costly action for a rational player because quitting after escalating the dispute for a while is costly. Attacking is also a costly action. However, both of these actions have signaling values, and so, they may provide a leverage to a player during the war of attrition game. A player can build his reputation by mimicking the resolved type. Higher reputation for resolve intimidates the rival and forces him to play a (mixed) strategy in which he quits with a greater probability. Clearly, a player’s ex ante payoff increases as his opponent quits with a greater probability. On the other hand, the threat of a costly war introduces some sort of deadline effect on reputation building. A rational player does not quit beyond time, at which his escalation costs exceed his war cost, and so, low war cost or greater sensitivity to escalation costs implies a shorter deadline. By escalating the conflict, a rational player signals that he is committed to attack (with a positive probability) at some time after his deadline if the game ever reaches to this time. Therefore, if a player’s deadline is shorter and if he escalates the dispute, then his rival will have to build his reputation faster, which means that his rival will play a strategy such that he quits with a greater probability.

By choosing his regime, a player can control how fast he can build his reputation on his resolve and how short his deadline for attacking will be. Dovish stance gives the ability of building reputation at a faster rate. ^[2] Hawkish regime, on the other hand, shortens the players’ deadline for attacking. Therefore, a rational player will either choose the dovish regime and build his reputation faster or the hawkish regime and commit to a shorter deadline. Our results show that choosing the hawkish regime (and so committing to a shorter deadline) is optimal when the benefit-cost ratio of the dispute is sufficiently high. Otherwise, choosing the dovish regime (i. e., building reputation at a faster rate) is optimal. These results were speculated in Ozyurt (2014), but formally proved in the current paper. In addition, the current results show that having more hawkish tone never benefits a player if hawkish regime is already the optimal one. However, choosing more dovish tone increases a player’s payoff whenever the dovish regime is optimal. Furthermore, hawkish tone may be advantageous in a dispute only if the players have the option of finishing the dispute with a costly war, such as starting a combat in international disputes or strike in labor disputes.

War of attrition game is first proposed by Maynard Smith (1974) and is useful for the study of a wide variety of conflict situations. ^[3] This paper is primarily related to the reputation and bargaining literature initiated by Myerson (1991). Myerson investigates the impacts of one-sided reputation building on bilateral negotiations. Abreu and Gul (2000), Kambe (1999), Compte and Jehiel (2002), Atakan and Ekmekci (2014), and Ozyurt (2014, 2015) consider two-sided versions of it. The structure of the war of attrition game in this paper is similar to the one studied in Fearon (1994). However, there are two fundamental differences between the two: Fearon assumes that (1) players are known to be flexible, but there is some uncertainty regarding the players’ war costs, and (2) the players’ escalation costs are exogenous.

Ozyurt (2014) is the most related work to the current paper. It challenges conventional wisdom – in international relations literature – that the ability of generating higher escalation (i. e., audience) costs is an advantage for a leader of a state. In particular, Ozyurt (2014) shows that lower escalation costs may give a player upper hand in a crisis depending on (1) the benefit-cost ratio of the dispute, (2) initial probability of resolve, and (3) how fast states generate escalation costs with time. The current paper takes a step further, generalizes the results of Ozyurt (2014) for a larger class of escalation cost functions, and extends its workhorse model by adding an initial stage where the players can endogenously choose their escalation cost parameters. This extension allows the possibility of studying not only the optimal type (hawkish vs. dovish) but also the optimal intensity of regime choice in a dispute.

Section 2 explains the details of the three-stage, infinite-horizon, continuous-time war of attrition game. Section 3 characterizes the third-stage equilibrium strategies. Section 4 presents the main results: I study four particular examples and examine how players’ regime choices change with the characteristics of the dispute. Finally, Section 5 concludes.

2 The War of Attrition Game

2.1 Timing

Two players, 1 and 2, are in dispute over a prize worth vi>0 for each player i∈{1,2}. The dispute (war of attrition game) is a three-stage infinite-horizon, continuous-time game. All stages start and the first two stages end at time 0. In stage 1, players simultaneously choose and announce their messages, regimes, or tones mi∈{h,d}:=M for i=1,2. Each player can choose either the hawkish(h) or the dovish(d) regime.

A player knows that he will never be forced to commit to a hardline policy, but is uncertain about his opponent. Therefore, each player believes that nature sends one of two messages {r,f} to his opponent in stage 2. A player who receives the message r, resolved, is constrained to follow a hardline policy; he never quits and never attacks. If a player receives the message f, flexible, then he will continue to play the game with no commitment. The players share the same belief that player i receives the message r with probability zi where zi∈(0,1). Therefore, the probability of being resolved is independent of the chosen messages.

Upon the beginning of the third stage (still at time 0), the players begin to play a war of attrition game, where at all times t≥0 before the dispute ends, each player can choose to escalate (wait), quit (back down), or attack. The dispute ends when one or both players attack or quit.

2.2 Payoffs

If either player attacks before the other quits, then the dispute ends with “war” and each player i receives the (net expected) payoff −wi<0. It indicates all the risks and gains resulting from war. The case −wi<0 indicates that staying away from the dispute is more desirable for a player than attacking once the player is involved in it. If player i picks message mi∈M and quits at time t≥0 before the other quits or attacks, then its opponent j receives the prize while player i suffers escalation costs equal to cimi(t), a continuous and strictly increasing function of the amount of escalation. I assume that 0≤cimi(t), cimi(0)<vi, and cid(t)≤cih(t) for all i and t. The last condition implies that the hawkish stance leads to a (weakly) higher escalation costs.

If one player chooses to attack at time t and the other chooses to quit or attack at the same time, then both flexible players receive their war payoffs −wi. However, if both quit at time t, then flexible player i receives vi2−cimi(t) given that i’s regime choice was mi in the first stage. These particular assumptions are not crucial because in equilibrium, simultaneous concessions or attacks occur with probability 0. Finally, if players escalate the conflict indefinitely, each flexible player gets a payoff that is strictly less than his war payoff. ^[4] Call this war of attrition game where all the parameters are common knowledge G.

2.3 Strategies

The only source of uncertainty is the players’ actual types, which matters only in the third stage of the game. In the first stage, all players are flexible (rational) in the sense that they choose their strategies, given their beliefs, to maximize their expected payoffs. At the beginning of the game each player knows that his future self will be flexible. However, each player is uncertain about his opponent’s future self: following the second stage, the opponent is either flexible or resolved.

The strategy of the resolved type is simple: never quit and never attack during the war of attrition game. Resolved types’ payoffs are ignored because they are redundant for the analysis. What motivates and interests the subsequent analyses are the optimal (i. e., equilibrium) strategies of the flexible types. Equilibrium strategies of the third stage of the game are in mixed strategies, and so they are intricate objects. For pedagogical reasons, I first present what pure strategies may look like and the payoffs that they correspond to. Then I provide the general description for the strategies and payoffs.

A pure strategy for player i in the third stage of the game G is a tuple (Qi(mˉ),Ai(mˉ))∈R+2 for each (m1,m2):=mˉ∈M, where Qi(mˉ) and Ai(mˉ) denote the times at which player i quits and attacks, respectively. Therefore, the strategies of the resolved type of player i are Qi(mˉ)=Ai(mˉ)=∞. Suppose that players choose (m1,m2)=mˉ∈M in stage 1 and players i and j, for i,j=1,2 and j≠i, are flexible.

If Ai(mˉ)≤min{Aj(mˉ),Qi(mˉ),Qj(mˉ)}, then the payoffs to players i and j are −wi<0 and −wj<0, respectively.
If Qi(mˉ)<min{Ai(mˉ),Aj(mˉ),Qj(mˉ)}, then the payoffs to players i and j are −cimi(Qi(mˉ))<0 and vj>0, respectively.
If Qi(mˉ)=Qj(mˉ)<min{Ai(mˉ),Aj(mˉ)}, then the payoffs to players i and j are vi2−cimi(Qi(mˉ)) and vj2−cjmj(Qj(mˉ)), respectively.
If Qi(mˉ)=Qj(mˉ)=Ai(mˉ)=Aj(mˉ)=∞, then the payoffs to players i and j are some −M∈R−, where max{wi,wj}<M.

More formally, a mixed strategy of flexible player i in the first stage is a function μi:M→[0,1], where μi(m) denotes the probability that flexible player i chooses the message m∈M. Note that μi(h)+μi(d)=1 for i=1,2.

Player i’s third stage (mixed) strategy, {Qimˉ,Aimˉ}mˉ∈M2, has two parts. For each message profile mˉ=(m1,m2)∈M2, a right-continuous distribution function Qimˉ(t):R+∪{∞}→[0,1] represents the probability that player i quits by time t (inclusive). Similarly, for each message profile mˉ, a continuous distribution function Aimˉ(t):R+∪{∞}→[0,1] represents the probability that player i attacks by time t (inclusive). If player i’s strategy, following the message profile mˉ, is Qimˉ(t), then flexible player i’s strategy is Qimˉ(t)/(1−zi) because the resolved type never quits. The same arguments hold for Aimˉ(t). Note that Qimˉ(t)+Aimˉ(t)≤1 for all 0≤t and limt→∞Aimˉ(t)≤1−zi for all i and mˉ.

Given a strategy profile σ=μi,{Qimˉ,Aimˉ}mˉ∈M2i=12, in the subgame following the message realization mˉ, flexible player i’s expected payoff of quitting at time t is

[1]Uiq(t,σ,mˉ)=viQjmˉ(t)−wiAjmˉ(t)+1−Qjmˉ(t)−Ajmˉ(t)−cimi(t)+vi2−cimi(t)Qjmˉ(t)−Qjmˉ(t−),

with Qjmˉ(t−)=limy↑tQjmˉ(y). Similarly, flexible player i’s expected payoff of attacking at time t is

[2]Uia(t,σ,mˉ)=viQjmˉ(t)−wi[1−Qjmˉ(t)].

Therefore, flexible player i’s expected payoff in the subgame following the message realization mˉ is

[3]Ui(σ,mˉ)=∫t∈[0,∞)Uiq(t,σ,mˉ)1−zidQi(t)+∫t∈[0,∞)Uia(t,σ,mˉ)1−zidAi(t)

Finally, flexible player i’s expected payoff in the game is

[4]Ui(σ)=∑mˉ∈M2μ(mˉ)Ui(σ,mˉ)

where for all mˉ=(m1,m2)∈M2, μ(mˉ)=μ1(m1)μ2(m2).

2.4 Discussion

Escalation costs can be interpreted as the players’ opportunity costs of quitting. That is, it is the total value of the opportunities that are missed or not used effectively by backing down from the hardline policy. Public commitments that players make during a dispute are expected to serve an agenda the players may form. This agenda may include items that are not directly related to the prize (e. g., increasing audiences’ support and the likelihood of winning upcoming elections, building reputation for future negotiations, discouraging potential rivals, and preventing future disputes etc.). Therefore, a player’s expected value of following a hardline policy may increase with time because as time passes, the player is more likely to convince his audiences about his resolve, and persuaded audiences would increase the likelihood of the successful execution of his agenda. As a result, if the expected benefit of following the hardline policy increases in time, then the opportunity cost of backing down is expected to increase in time as well. Alternatively, escalation costs may represent a player’s psychological disutility – due to social disapproval or embarrassment – of quitting. ^[5]

A player may commit not to back down in a dispute for reasons other than the prospects of the dispute. For example, a leader may be resolved because of his firm belief that backing down, and thus giving up for the prize, is simply a decision that will not be ratified by his supporters. Therefore, the positive priors (i. e., zi’s) can be interpreted as the players’ initial beliefs on the existence of such motives that may force their opponents to be resolved. Resolved types closely resemble the commitment types that are first defined by Myerson (1991) (r-insisting types) and studied further by Abreu and Gul (2000) and Kambe (1999). These strategy types are first used in games by Kreps and Wilson (1982) and Milgrom and Roberts (1982), where commitments are modeled as behavioral types that exist in society so that rational players can mimic them if it is optimal to do so. In a bilateral negotiation, commitment types always demand a particular share and accept an offer if and only if it weakly exceeds that share. ^[6]

3 Equilibrium for the Third Stage of the War of Attrition Game

Because most of the arguments in this section are direct generalization of the analysis of Ozyurt (2014), readers are advised to refer to this paper for a more detailed discussion. All the proofs are deferred to Appendix.

Lemma 1

If a strategy profileσ=μi,{Qimˉ,Aimˉ}mˉ∈M2i=12constitutes a sequential equilibrium of the game G, then for eachmˉ=(m1,m2)∈M2, there exists a finite numbertmˉ∗>0such that, fori=1,2, the following conditions hold:

Qimˉ(t)=1−[1−Qimˉ(0)][vj+cjmj(0)]vj+cjmj(t)for allt≤tmˉ∗,
Q1mˉ(0)Q2mˉ(0)=0,
Qimˉ(tmˉ∗)=1−ziiftmˉ∗satisfiescimi(tmˉ∗)<wi,
cimi(tmˉ∗)≤wi, and
Aimˉ(t)=0for allt<tmˉ∗andlimk→∞Aimˉ(k)=1−zi−Qimˉ(tmˉ∗).

In equilibrium, if player i believes that j will never quit after time t and cimi(t)<wi, then flexible player i will immediately quit at that time. There are two critical thresholds beyond which j never quits; the time that player j’s reputation reaches 1 and the time that j’s escalation costs reaches his war cost wj. Let κjmˉ satisfy cj(κjmˉ)=wj and τjmˉ solve Qjmˉ(τjmˉ)=1−zj. Thus, player i never backs down after time tjmˉ=minκjmˉ,τjmˉ. Similar arguments hold for player j. If t1mˉ<t2mˉ holds and the dispute escalates until time t1mˉ, then flexible player 2 ends the game at this time for sure because he knows that player 1 will never quit beyond this point. As a result, the war of attrition game will end no later than time tmˉ∗=min{t1mˉ,t2mˉ}. The next result formally proves these arguments.

Lemma 2

If a strategy profileσ=μi,{Qimˉ,Aimˉ}mˉ∈M2i=12constitutes a sequential equilibrium of the war of attrition game G, then for anymˉ∈M2andi,j∈{1,2}withi≠j,

flexible player i never quits after timetjmˉ=minκjmˉ,τjmˉ, where
1. κjmˉsolvescjmj(κjmˉ)=wj, and
2. τjmˉsolvescimi(τjmˉ)=vi(1−zj)+cimi(0)zj, and
the game G ends by timetmˉ∗=min{t1mˉ,t2mˉ}.

Lemma 3

Suppose that a strategy profileσ=μi,{Qimˉ,Aimˉ}mˉ∈M2i=12constitutes a sequential equilibrium of the war of attrition game G. For anymˉ=(m1,m2)∈M2, iftimˉ>tjmˉ, then

Qjmˉ(t)=1−vi+cimi(0)vi+cimi(t)andQimˉ(t)=1−[1−Qimˉ(0)][vj+cjmj(0)]vj+cjmj(t),

where

Qimˉ(0)=1−zi(vj+cjmj(tjmˉ))vj+cjmj(0)

Definition 1

For any strategy profileσ=μi,{Qimˉ,Aimˉ}mˉ∈M2i=12that constitutes a sequential equilibrium of the war of attrition game G, playeriis called

advantaged in the subgame followingm¯ifQjmˉ(0)is strictly positive, and
advantaged in equilibriumσif he is advantaged on the equilibrium path, that is, he is advantaged in all the subgames following allmˉ’s that are on the equilibrium path.

We know from Lemma 1 that in equilibrium, the equality Q1mˉ(0)Q2mˉ(0)=0 must hold for any mˉ∈M2. Therefore, if timˉ>tjmˉ holds, then player i (and only player i) quits with a positive probability at time 0 in the subgame following mˉ. Thus, at most one player can be advantaged in equilibrium. The following result immediately follows from the last arguments.

Corollary 1

In any sequential equilibrium of the war of attrition game G, playeriis advantaged in the subgame followingmˉif and only if the parameters of the game satisfytjmˉ>timˉ.

4 Type (Hawkish vs. Dovish) and Intensity of the Tone

This section aims to understand when the players prefer to choose the hawkish or the dovish stance and in what extent. For a sharper presentation of the underlying dynamics, I will analyze various variations of the war of attrition game with additional structure on the parameters.

Suppose that a strategy profile σ=μi,{Qimˉ,Aimˉ}mˉ∈M2i=12 constitutes a sequential equilibrium of the war of attrition game G. For any message profile mˉ, players’ strategies entail quitting at any time before tmˉ∗. Therefore, player i’s expected payoff of quitting at any time t where 0<t≤tmˉ∗ has a fixed value, and thus player i’s expected payoff in the subgame following the message realization mˉ must be equal to this fixed value.

More formally, we see by eq. [1] and Lemma 1 that for any small ϵ>0, flexible player i’s expected payoff of quitting at any time t≤tmˉ∗ is

Uiq(t,σ,mˉ)=viQjmˉ(ϵ)−wiAjmˉ(ϵ)+[−cimi(ϵ)]1−Qjmˉ(ϵ)−Ajmˉ(ϵ)+vi2−cimi(ϵ)[Qjmˉ(ϵ)−Qjmˉ(ϵ−)]=viQjmˉ(ϵ)−cimi(ϵ)[1−Qjmˉ(ϵ)].

Taking ϵ→0 thus yields

[5]Uiq(t,σ,mˉ)=viQjmˉ(0)−cimi(0)[1−Qjmˉ(0)]

Similarly, we see by eq. [1] and Lemma 1 that player i’s expected payoff of quitting at time tmˉ∗ is equal to

Uiq(tmˉ∗,σ,mˉ)=viQjmˉ(tmˉ∗)−cimi(tmˉ∗)[1−Qjmˉ(tmˉ∗)],

and one can easily check that it’s value is equal to eq. [5]. Finally, eq. [2] and Lemma 1 imply that player i’s expected payoff of attacking at any time t≥tmˉ∗ is

Uia(tmˉ∗,σ,mˉ)=viQjmˉ(tmˉ∗)−wi[1−Qjmˉ(tmˉ∗)].

By the fourth condition of Lemma 1, we have cimi(tmˉ∗)≤wi. That is, for any t≤tmˉ∗, Uia(t,σ,mˉ)≤Uiq(t,σ,mˉ). Thus, flexible player i’s expected payoff in the subgame following the message realization mˉ is given by eq. [5].

For the rest of this section, I will consider linear escalation costs functions. In particular, for any message m∈M and player i, I suppose that cim(t)=cimt with cim>0. Therefore, flexible player i’s continuation payoff, following the subgame where message profile is mˉ, is simply

Uimˉ=viQjmˉ(0),

which is equal to vi1−zj1+citmˉ∗vi when player i is advantaged in the subgame following mˉ, or 0 otherwise. We know from Corollary 1 that player i is advantaged in the subgame following mˉ if and only if timˉ<tjmˉ. According to Lemma 2, for any mˉ=(m1,m2)∈M2, the war of attrition game ends at time tmˉ∗=min{t1mˉ,t2mˉ}, where

t1mˉ=minw1c1m1,v2(1−z1)z1c2m2,t2mˉ=minw2c2m2,v1(1−z2)z2c1m1.

4.1 One-Sided Communication Under the Absence of War

Let the game G1 be the same as the game G with only two differences: (1) in the first stage, only player 1 can choose his stance and (2) in the third stage, players cannot choose to attack. Furthermore, I suppose that c1d<c2<c1h. Therefore, the game G1 will end at time tm∗=min{t1m,t2m}, where t1m=v2(1−z1)z1c2 and t2m=v1(1−z2)z2c1m.

Moreover, player 1’s expected payoff is

[6]U1m=v11−z21+v2(1−z1)v1z1c1mc2

if player 1 is advantaged in the subgame following m (i. e., t1m<t2m). Note that U1m is a decreasing function of c1m, and so player 1 can increase his game payoff by choosing a message that generates smaller escalation costs. Clearly, if player 1 can never be advantaged (i. e., t2m≤t1m for all m∈M), then player 1’s equilibrium payoff will be 0 regardless of his message.

Proposition 1

Ifc1d<v1z1(1−z2)v2z2(1−z1)c2, then there exists a unique sequential equilibrium of the gameG1. In this equilibrium, player 1 is advantaged and chooses the dovish regime in the first stage.

If two players are similar in the sense that vi=v and zi=z for i=1,2, then the above inequality would become c1d<c2, which is automatically satisfied by our assumption. Therefore, we could conclude that if two players are playing the war of attrition game under the absence of war and if only player 1 can choose his regime, then player 1 prefers to choose the dovish regime such that he keeps his escalation costs minimal.

4.2 One-Sided Communication Under the Shadow of War

Let G2 be the same as the game G with only one difference: in the first stage, only player 1 can choose his regime. Similar to the previous section, I will continue to assume that c1d<c2<c1h. Furthermore, let vi=v, wi=w, and zi=z hold for each i. According to Lemma 2, the equilibrium horizon of the game is a function of the players’ war cost w. In particular, for each m∈M, the game G2 will end at time tm∗=min{t1m,t2m}, where t1m=minwc1m,v(1−z)zc2 and t2m=minwc2,v(1−z)zc1m. As a result, in equilibrium, player 1 should choose the hawkish regime when the benefit-cost ratio of the game is higher than the relative likelihood of a player being the resolved type. The next result formally proves this point.

Proposition 2

Ifz1−z<vw, then there exists a unique sequential equilibrium of the gameG2, in which player 1 is advantaged and choosing the hawkish regime in the first stage. However, if the reverse of this inequality holds, then there is a unique sequential equilibrium of the gameG2, in which player 1 is advantaged but choosing the dovish regime in the first stage.

If only player 1 has had the option of attacking, and so player 2 can never attack, then there would exist a unique equilibrium where player 1 was advantaged and choosing the dovish regime when vw<z1−z. However, there would exist a continuum of equilibrium – in all of which player 1 is advantaged – when z1−z<vw because player 1 would be indifferent between the dovish and the hawkish regimes.

These two subsections show that player 1, who is the only player that can choose his escalation costs in the game, is the advantaged player in equilibrium, and this conclusion is correct whether or not players have the option to attack. However, the advantaged player’s regime varies depending on the benefit-cost ratio of the dispute and the relative likelihood of a player being the resolved type.

4.3 Two-Sided Communication

In this section, I consider the war of attrition game G, where both players can choose a regime. For sharper results, I will suppose that c2d<c1d<c2h<c1h, vi=v, wi=w, and zi=z for each i.

Proposition 3

Ifz1−z<vw, then in any sequential equilibrium of the game G, player 1

is advantaged and choosing the hawkish regime in the first stage, and
attacks (at some time afterwc1h) with probabilityvv+w−z.

However, ifvw<z1−z, then in any sequential equilibrium of the game G, player 2 is advantaged and choosing the dovish regime in the first stage.

4.4 Continuous Message Space and Intensity

Similar to the previous section, I suppose that both players can choose escalation costs coefficients (ci), and vi=v, wi=w, and zi=z hold for each i. Furthermore, I suppose that message space is continuous, and so for i=1,2, ci∈[cmin,cmax] with 0<cmin<cmax<∞. The lower (or the upper) bound represents the most dovish (hawkish) tone a player can pick.

Proposition 4

Suppose thatcj∈[cmin,cmax]forj∈{1,2}. Then the best response correspondence for the flexible type of playeri∈{1,2}withi≠jis

[7]BRi(cj)={(cj,cmax], ifcj<cmaxandz1+z<vw,[cmin,cmax] ifcj=cmaxandz1+z<vw,{cmin},ifcmin<cjandz1+z>vw[cmin,cmax] ifcmin=cjandz1+z>vw.

Therefore, more dovish tone increases a player’s payoff as long as the dovish regime is optimal and the opponent has not chosen the most dovish regime cmin. On the other hand, a more hawkish tone never benefits a player if it is the optimal regime for the player. Nevertheless, in all sequential equilibrium of the war of attrition game G, at least one of the players will certainly choose the most hawkish regime cmax whenever z1+z<vw.

5 Concluding Remarks

This paper examines a two-player war of attrition game in continuous-time. A player can build his reputation by mimicking the resolved type and escalating the dispute. Higher reputation for resolve intimidates the rival and forces him to play a mixed strategy in which he quits with a greater probability. By choosing his regime at the beginning of the game, players can control how fast they will build their reputation. The dovish regime gives the ability of building reputation at a faster rate. The threat of a costly war introduces some sort of deadline effect for reputation building. A flexible player does not quit beyond a time at which his escalation costs exceed his war cost. Therefore, by choosing the hawkish regime, players commit to attack with a positive probability after this time, and so to a shorter time horizon for reputation building.

Our results show that choosing the hawkish regime (and so imposing a shorter deadline) is optimal when the benefit-cost ratio of the dispute is higher than the relative likelihood of a player being the resolved type. Otherwise, choosing the dovish regime (i. e., building reputation at a faster rate) is the optimal action. These results were speculated in Ozyurt (2014), but formally proved in the current paper. In addition, the current paper shows that having more hawkish tone never benefits a player if hawkish regime is already the optimal one. However, choosing more dovish tone increases a player’s payoff when the dovish regime is optimal. Furthermore, hawkish tone is not advantageous in a dispute if the players do not have the option of finishing the dispute with a costly war. Although a linear functional form for the escalation costs is an important restriction, it does not alter the qualitative nature of our results.

Funding statement: This research was supported by the Marie Curie International Reintegration Grant (# 256486) within the European Community Framework Programme.

Appendix

Proof of Lemma 1:

Letσ=μi,{Qimˉ,Aimˉ}mˉ∈M2i=12be a sequential equilibrium of the gameG. For anymˉ∈M2andi∈{1,2}, letκimˉ=inf{t≥0|Qimˉ(t)=limk→∞Qimˉ(k)}. The optimality of equilibrium implies that playeridoes not quit beyond timet, satisfyingci(t)=wi. Therefore, κimˉis finite. The proofs of the following arguments(i)−(v) directly follow from the proof of Proposition 1 in Ozyurt (2014). Therefore, I skip the details.

κ1mˉ=κ2mˉ: A flexible player does not delay quitting once he knows that his opponent will never quit.
If Qimˉ jumps at t, then Qjmˉ is constant at some ϵ-neighborhood of t.
If Qimˉ is constant between (t′,t′′), then so is Qjmˉ.
There is no interval (t′,t′′) with t′′<κ1m¯ on which Q1mˉ and Q2mˉ are constant.
If t′<t′′<κ1m¯, then Qim¯(t′)<Qim¯(t′′) for i=1,2.

From (i)−(v), it follows that Q1mˉ and Q2mˉ must be continuous and strictly increasing on [0,tmˉ∗], where tmˉ∗=κ1mˉ=κ2mˉ. That is, flexible players are indifferent between quitting at time t<tmˉ∗ and waiting for an infinitesimal period Δ and then quit at time t+Δ<tmˉ∗. But then, player i must be quitting with a constant hazard rate λi=cj'(t)vj+cj(t), and thus Qimˉ(t)=1−[1−Qimˉ(0)][vj+cjmj(0)]vj+cjmj(t) for all t≤t∗. By (ii), both Q1mˉ(0) and Q2mˉ(0) cannot be positive, implying that Q1mˉ(0)Q2mˉ(0)=0.

Because Qimˉ is strictly increasing on [0,tmˉ∗], it must be the case that ci(tmˉ∗)≤wi for each i. Therefore, Aimˉ(t)=0 for all t<tmˉ∗. Since escalating forever is costlier than attacking and Qimˉ(t)+Aimˉ(t)≤1−zi for all t≥0, we must have limk→∞Aimˉ(k)=1−zi−Qimˉ(tmˉ∗) for i=1,2. Finally, probability that player i quits by time tmˉ∗ conditional on the event that he chooses mj is less than (or equal to) 1, implying that Qimˉ(t∗)≤1−zi.

Finally, since a flexible player will not delay quitting once he knows his opponent will never quit, and will not attack before time tmˉ∗, we can conclude that Qimˉ(t∗)=1−zi for i=1,2 if tmˉ∗ satisfies cimi(tmˉ∗)<wi. However, when cimi(tmˉ∗)=wi, player i will be indifferent between quitting at time tmˉ∗ and attacking at (or after) time tmˉ∗. Therefore, we must have Qimˉ(t∗)<1−zi if cimi(tmˉ∗)=wi.

Proof of Lemma 2:

The discussions that follow Lemma 1 prove the first part (i. e., 1), so I will not repeat them here. Regarding the equality in1−ii, note thatτjmˉsolvesQjmˉ(τjmˉ)=1−zjunder the assumptionQjmˉ(0)=0. Moreover, those discussions already prove that the game will end no later thantmˉ∗. I will complete the proof by showing that the game G will continue until timetmˉ∗with a positive probability. Suppose that the game ends prior to timetˆwith certainty, wheretˆ<tmˉ∗. According to the equilibrium strategies given by Lemma 1, Qimˉ(0)>0must hold fori=1,2because otherwise the conditionQimˉ(tˆ)=1−ziwill not hold for eachi(note that iftˆ<timˉ, thenci(tˆ)<wi). However, havingQ1mˉ(0)>0andQ2mˉ(0)>0simultaneously contradicts the optimality of the equilibrium (recall the condition 2 of Lemma 1). Hence, in equilibrium, given that flexible players randomize the timing of quitting, escalation continues until timetmˉ∗with some positive probability and stops at this time with certainty.

Proof of Lemma 3:

Sincetimˉ>tjmˉ, we havetmˉ∗=tjmˉ. Therefore, τimˉ,κimˉ>tmˉ∗. Recall thatτimˉis the time satisfyingQimˉ(τimˉ)=1−ziifQimˉ(0)=0. Since the game ends before timeτimˉandκimˉ>tmˉ∗holds, we must haveQimˉ(tmˉ∗)=1−zi. Hence, we haveQimˉ(0)>0. According to the second condition of Lemma 1, we should also haveQ1mˉ(0)Q2mˉ(0)=0. The last condition withQimˉ(0)>0implies thatQjmˉ(0)=0. However, Qjmˉ(0)=0impliesQjmˉ(t)=1−vi+cimi(0)vi+cimi(t). Moreover, sinceQimˉ(tjmˉ)=1−[1−Qimˉ(0)][vj+cjmj(0)]vj+cjmj(tjmˉ)=1−zi, we haveQi(0)=1−zi(vj+cjmj(tjmˉ))vj+cjmj(0). This completes the proof.

Proof of Proposition 1:

Consider the following strategies: In the first stage, player 1 chooses the dovish regime with probability 1 (i. e., μ1(d)=1). In the second stage, following the dovish regime, players’ strategies are given by Lemma 3, that is, Q1d(t)=1−v2v2+c2t and Q2d(t)=1−[1−Q2d(0)]v1v1+c1dt for all t≤td∗=v2(1−z1)z1c2, where Q2d(0)=1−z2(v1+c1dtd∗)v1. Similarly, Lemma 3 characterizes the players’ equilibrium strategies for the third stage when player 1 chooses the hawkish regime in the first period.

Given Qjm, flexible player i is indifferent between quitting at time t′ and waiting for some time and then quitting at time t′′, where 0≤t′<t′′≤tm*, where m∈M. Hence, any mixed strategy on the support [0,tm∗], in particular, Qim is optimal for player i. Finally, because player 1’s payoff decreases with c1m, μ1(d)=1 is also optimal. Hence, these strategies indeed form equilibrium. Uniqueness of the equilibrium is implied by the fact that the payoff of player 1 is strictly higher under dovish regime (see eq. [6]) and by the third stage equilibrium strategies that are characterized in Section 3.

Proof of Proposition 2:

Similar to the strategies given in the proof of Proposition 1, Lemma 1–3 characterize the equilibrium strategies of the third stage of the game G2. In order to find player 1’s first stage equilibrium strategy, consider first the case where player 1 chooses the dovish regime. In this case, t1d=minwc1d,v(1−z)c2z and t2d=minwc2,v(1−z)c1dz. First note that wc2<wc1d and v(1−z)c2z<v(1−z)c1dz. Therefore, according to Corollary 1, if z1−z<vw, then player 2 is advantaged. If the inequality is reversed, then player 1 is advantaged.

On the other hand, if player 1 chooses the hawkish regime, then t1h=minwc1h,v(1−z)c2z and t2h=minwc2,v(1−z)c1hz. Since wc1h<wc2 and v(1−z)c1hz<v(1−z)c2z, Corollary 1 implies that player 1 is advantaged if and only if wc1h<v(1−z)c1hz, equivalently if and only if z1−z<vw.

Thus, if z1−z<vw, then player 1’s optimal strategy is to choose the hawkish regime, and if vw<z1−z, then player 1’s optimal strategy is to choose the dovish regime. Finally, the uniqueness is implied by player 1’s first stage strategies and by the the third stage equilibrium strategies that are characterized in Section 3.

Proof of Proposition 3:

The equilibrium strategies in the third stage of the game are already characterized in Section 3. Recall that for any mˉ=(m1,m2)∈M2, the war of attrition game ends at time tmˉ∗=min{t1mˉ,t2mˉ}, where t1mˉ=minw1c1m1,v2(1−z1)z1c2m2 and t2mˉ=minw2c2m2,v1(1−z2)z2c1m1. In order to characterize the first stage equilibrium strategies, we need to consider four cases.

Case 1: Both players choose the dovish regime: Since wc1d<wc2d and v(1−z)c1dz<v(1−z)c2dz, we have

tdd∗=wc1d (i. e., player 1 is advantaged) whenever wc1d<v(1−z)c1dz, which is equivalent to z1−z<vw, and
tdd∗=v(1−z)c1dz (i. e., player 2 is advantaged) whenever v(1−z)c1dz<wc1d (i. e., vw<z1−z).

Case 2: Player 1 and 2 choose the hawkish and the dovish regime, respectively: Since wc1h<wc2d and v(1−z)c1hz<v(1−z)c2dz, we have

thd∗=wc1h (i. e., player 1 is advantaged) whenever wc1h<v(1−z)c1hz (i. e., z1−z<vw), and
thd∗=v(1−z)c1hz (i. e., player 2 is advantaged) whenever v(1−z)c1hz<wc1h (i. e., vw<z1−z).

Case 3: Player 1 and 2 choose the dovish and the hawkish regime, respectively: Since wc2h<wc1d and v(1−z)c2hz<v(1−z)c1dz, we have

tdh∗=wc2h (i. e., player 2 is advantaged) whenever wc2h<v(1−z)c2hz (i. e., z1−z<vw), and
tdh∗=v(1−z)c2hz (i. e., player 1 is advantaged) whenever v(1−z)c2hz<wc2h (i. e., vw<z1−z).

Case 4: Both players choose the hawkish regime: Since wc1h<wc2h and v(1−z)c1hz<v(1−z)c2hz, we have

thh∗=wc1h (i. e., player 1 is advantaged) whenever wc1h<v(1−z)c1hz (i. e., z1−z<vw), and
thh∗=v(1−z)c1hz (i. e., player 2 is advantaged) whenever v(1−z)c1hz<wc1h (i. e., vw<z1−z).

First suppose that z1−z<vw. When player 1 chooses the dovish regime, player 2’s best response is to choose hawkish regime (by cases 1 and 3). However, when player 2 takes the hawkish regime, player 1’s best response is to choose the hawkish regime as well (by cases 3 and 4). Hence, if benefit-cost ratio of the dispute exceeds the relative likelihood of a player being the resolved type, then there exists no sequential equilibrium in which player 1 takes a dovish regime. However, when player 1 takes a hawkish regime, player 2 is indifferent between the hawkish and dovish messages simply because player 1 is advantaged in either case (see cases 2 and 4). Because hawkish regime is a best response for player 1 and the game ends at time w/c1h regardless of player 2’s regime, we can conclude that there is a continuum of equilibrium, in which player 1 is advantaged and choosing the hawkish regime, and the horizon of the game is w/c1h. By Lemma 1, the probability of attacking is limk→∞A1hh(k)=1−z−Q1hh(w/c1h)=vv+w−z.

Now suppose that vw<z1−z. When player 2 chooses the hawkish message, player 1’s best response is to choose the dovish message (by cases 3 and 4). However, when 1 chooses the dovish regime, player 2’s best response is to choose the dovish regime, not the hawkish regime (by cases 1 and 3). Hence, if the relative likelihood of a player being the resolved type exceeds the benefit-cost ratio of the dispute, then there exists no sequential equilibrium in which player 2 takes a hawkish regime. However, when player 2 chooses the dovish regime, both dovish and hawkish messages are best response for player 1 (by cases 1 and 2) because player 2 is advantaged in each case, and thus, player 1’s payoff is 0. Because the dovish regime is a best response for player 2 regardless of player 1’s regime, we can conclude that there is a continuum of equilibrium, in which player 2 is advantaged and choosing the dovish regime. In these equilibria, the game ends at times c(1−z)c1dz and c(1−z)c1hz, when player 1’s realized message is d and h, respectively. Therefore, in all equilibria, both players’ reputations simultaneously converge to one at the end of the game, so that no player attacks with a positive probability.

Proof of Proposition 4:

Suppose that cmin<cj<cmax. Proposition 3 implies that if z1+z<vw, then player i can be advantaged by choosing a hawkish tone ci such that cj<ci. In that case, the horizon of the game (t∗) is equal to wci (see the proof of Proposition 3). By substituting this value in player i’s payoff function Ui=v1−z1+cit∗v we get Ui=v1−z1+wv, which is independent of ci. Thus, player i’s expected payoff does not chance with ci as long as cj<ci.

On the other hand, if z1+z>vw, then player i can be advantaged by choosing a dovish regime ci such that ci<cj. In this case, the horizon of the game (t∗) is equal to v(1−z)cjz (see the proof of Proposition 3). By substituting this value in player i’s payoff function we get Ui=v1−z1+ci(1−z)cjz, which decreases with ci. Thus, player i’s best response to cj is picking the most dovish tone that is less than cj, which is cmin. Verifying the cases where cj takes the boundary values is very similar, and so omitted.

References

Abreu, D., and F. Gul. 2000. “Bargaining and Reputation.” Econometrica 68 (1):85–117.10.1111/1468-0262.00094Search in Google Scholar

Abreu, D., and R. Sethi. 2003. “Evolutionary Stability in a Reputational Model of Bargaining.” Games and Economic Behavior 44 (2):195–216.10.1016/S0899-8256(03)00029-0Search in Google Scholar

Atakan, A. E., and M. Ekmekci. 2014. “Bargaining and Reputation in Search Markets.” The Review of Economic Studies 81 (1):1–29.10.1093/restud/rdt021Search in Google Scholar

Bishop, D. T., C. Cannings, and J. Maynard Smith. 1978. “The War of Attrition with Random Rewards.” Journal of Theoretical Biology 74 (3):377–88.10.1016/0022-5193(78)90220-5Search in Google Scholar

Bliss, C., and B. Nalebuff. 1984. “Dragon-Slaying and Ballroom Dancing: The Private Supply of a Public Good.” Journal of Public Economics 25 (1):1–12.10.1016/0047-2727(84)90041-0Search in Google Scholar

Bulow, J., and P. Klemperer. 1999. “The Generalized War of Attrition.” American Economic Review 89 (1):175–89.10.3386/w5872Search in Google Scholar

Chatterjee, K., and L. Samuelson. 1987. “Bargaining with Two-Sided Incomplete Information: An Infinite Horizon Model with Alternating Offers.” The Review of Economic Studies 54 (2):175–92.10.2307/2297510Search in Google Scholar

Compte, O., and P. Jehiel. 2002. “On the Role of Outside Options in Bargaining with Obstinate Parties.” Econometrica 70 (4):1477–517.10.1111/1468-0262.00339Search in Google Scholar

Fearon, J. D. 1994. “Domestic Political Audiences and the Escalation of International Disputes.” American Political Science Review 88 (3):577–92.10.2307/2944796Search in Google Scholar

Fudenberg, D., R. Gilbert, J. Stiglitz, and J. Tirole. 1983. “Preemption, Leapfrogging and Competition in Patent Races.” European Economic Review 22 (1):3–31.10.1016/0014-2921(83)90087-9Search in Google Scholar

Fudenberg, D., and J. Tirole. 1986. “A Theory of Exit in Duopoly.” Econometrica 54 (4):943–60.10.2307/1912845Search in Google Scholar

Ghemawat, P., and B. Nalebuff. 1985. “Exit.” The RAND Journal of Economics 16 (2):184–94.10.2307/2555409Search in Google Scholar

Guisinger, A., and A. Smith. 2002. “Honest Threats the Interaction of Reputation and Political Institutions in International Crises.” Journal of Conflict Resolution 46 (2):175–200.10.1177/0022002702046002001Search in Google Scholar

Hendricks, K., A. Weiss, and C. Wilson. 1988. “The War of Attrition in Continuous Time with Complete Information.” International Economic Review 29 (4):663–80.10.2307/2526827Search in Google Scholar

Kambe, S. 1999. “Bargaining with Imperfect Commitment.” Games and Economic Behavior 28 (2):217–37.10.1006/game.1998.0700Search in Google Scholar

Kreps, D. M., and R. Wilson. 1982. “Reputation and Imperfect Information.” Journal of Economic Theory 27 (2):253–79.10.1016/0022-0531(82)90030-8Search in Google Scholar

Maynard Smith, J. 1974. “The Theory of Games and the Evolution of Animal Conflicts.” Journal of Theoretical Biology 47 (1):209–21.10.1016/0022-5193(74)90110-6Search in Google Scholar

Milgrom, P., and J. Roberts. 1982. “Predation, Reputation, and Entry Deterrence.” Journal of Economic Theory 27 (2):280–312.10.1016/0022-0531(82)90031-XSearch in Google Scholar

Milgrom, P. R., and R. J. Weber. 1985. “Distributional Strategies for Games with Incomplete Information.” Mathematics of Operations Research 10 (4):619–32.10.1287/moor.10.4.619Search in Google Scholar

Myerson, R. B. 1991. Game Theory: Analysis of Conflict. Cambridge, Massachusetts: Harvard University Press.Search in Google Scholar

Nalebuff, B., and J. Riley. 1985. “Asymmetric Equilibria in the War of Attrition.” Journal of Theoretical Biology 113 (3):517–27.10.1016/S0022-5193(85)80036-9Search in Google Scholar

Ordover, J. A., and A. Rubinstein. 1986. “A Sequential Concession Game with Asymmetric Information.” The Quarterly Journal of Economics 101 (4):879–88.10.2307/1884183Search in Google Scholar

Osborne, M. J. 1985. “The Role of Risk Aversion in a Simple Bargaining Model.” Game Theoretic Models of Bargaining, 181–213.10.1017/CBO9780511528309.010Search in Google Scholar

Ozyurt, S. 2014. “Audience Costs and Reputation in Crisis Bargaining.” Games and Economic Behavior 88:250–9.10.1016/j.geb.2014.09.008Search in Google Scholar

Ozyurt, S. 2015. “Searching for a Bargain: Power of Strategic Commitment.” American Economic Journal: Microeconomics 7 (1):320–53.Search in Google Scholar

Ponsati, C., and J. Sakovics. 1995. “The War of Attrition with Incomplete Information.” Mathematical Social Sciences 29 (3):239–54.10.1016/0165-4896(94)00771-YSearch in Google Scholar

Riley, J. G. 1980. “Strong Evolutionary Equilibrium and the War of Attrition.” Journal of Theoretical Biology 82 (3):383–400.10.1016/0022-5193(80)90244-1Search in Google Scholar

Sartori, A. E. 2002. “The Might of the Pen: A Reputational Theory of Communication in International Disputes.” International Organization 56 (1):121–49.10.1162/002081802753485151Search in Google Scholar

Smith, A. 1998. “International Crises and Domestic Politics.” American Political Science Review 92 (3):623–38.10.2307/2585485Search in Google Scholar

Snyder, J., and E. D. Borghard. 2011. “The Cost of Empty Threats: A Penny, Not a Pound.” American Political Science Review 105:437–56.10.1017/S000305541100027XSearch in Google Scholar

Trachtenberg, M. 2012. “Audience Costs: An Historical Analysis.” Security Studies 21:3–42.10.1080/09636412.2012.650590Search in Google Scholar

Published Online: 2016-5-25

Published in Print: 2016-6-1

Articles in the same Issue

https://doi.org/10.1515/bejte-2015-0093

Keywords for this article

war of attrition game; continuous time games; reputation; dispute resolution