Heterogeneous Effects of Informational Nudges on Pro-social Behavior

Jiayi Bao; Benjamin Ho

doi:10.1515/bejeap-2014-0125


Published/Copyright: April 28, 2015


From the journal The B.E. Journal of Economic Analysis & Policy Volume 15 Issue 4

Abstract

Numerous experimental studies of informational nudges both in the lab and the field have demonstrated not just that informational nudges are effective policy tools for influencing behavior, but also that nudges have heterogeneous impacts that differ depending on the characteristics of the person involved and the situation. We adapt Andreoni’s theory of warm-glow impure altruism to account for how altruism motives respond differently depending on the disposition of the person and the situation. The model explains both positive spillovers (moral cleansing) and negative spillovers (moral licensing) for behavioral interventions, showing that targeting of informational campaigns depends on the complementarity between people’s traits and the intervention’s content. More importantly, the design of economic incentives (like Pigouvian taxes) to shift economic behavior should depend on both the distribution of social preferences in the population and the use of behavioral interventions.

Keywords: nudge; heterogeneity; moral licensing; moral cleansing; impure altruism; warm glow

1 Introduction

Over the past decade, informational nudges, a type of information provision intended to alter social behavior, have become increasingly prominent in the literature (Allcott and Mullainathan, 2010; Thaler and Sunstein, 2008; Cialdini et al. 2006). Their real-world relevance has also been recognized in several domains. In the marketplace, research has shown that peer comparisons can induce pro-social behavior in energy consumption (Ayres, Raseman, and Shih 2013). Nudge, a New York Times bestseller written by Richard Thaler and Cass Sunstein (2008), introduced and advocated the concept of “nudges,” a type of government policy that helps people make better choices without limiting their freedom to choose. These policies have seen widespread adoption: the Behavioral Insights Team, launched in the UK in 2010, has successfully nudged tax debtors to pay more of the taxes they owe by mailing them letters describing the behavior of others,^[1] while the White House kicked off several federal projects in the second half of 2013 to nudge Americans to save more money for retirement, to curb energy use, and to trim energy costs.^[2]

Despite their increasing popularity, evidence for the success of programs using informational nudges to induce pro-social behavior has been mixed (Schultz et al. 2007). In particular, making the most of these policies requires understanding the heterogeneity in responses across subpopulations and across intervention designs. One literature focuses on transient situational factors that affect the short-term response to a nudge, such as the wording of the message: for example, whether the message is injunctive, “this is what you ought to do,” or descriptive, “this is what others are doing” (Ferraro and Price, 2013; Merritt, Effron, and Monin 2010; Cialdini et al. 2006). Another literature focuses on long-run dispositional differences across groups, such as differences in social responsibility, group identity, or prior behavior (Akerlof and Kennedy, 2013; Allcott, 2011; Beshears et al. 2011). The heterogeneity in impact can lead to unintended spillovers. For instance, a nudge toward good behavior in one domain may license individuals to behave badly in a different domain (Ho et al. 2015). Even successful nudges, such as the letters sent to tax debtors in the UK, illustrate the limitations of the nudge approach. The UK policy focused only on a subset of the worst offenders, and was effective only when the message carefully selected the comparison group (Hallsworth et al., 2014).

Therefore, despite the popularity of nudges in popular discourse, important questions remain. How should we account for the heterogeneous impact of nudges across different populations or different interventions both in terms of designing behavioral policies and also in terms of evaluating their welfare impact? More importantly, as the magnitude of change observed in most field studies is small – for example, 1–2% savings in the case of electricity (Allcott, 2011), or less than 1% in the case of water savings (Ferraro and Price, 2013) – do the benefits of the nudge outweigh the feelings of guilt they induce? Also, as nudges by themselves seem to have small effects, would nudge-based policies work better in conjunction with traditional economic incentives like Pigouvian taxation?

In this paper, we organize what we know from the empirical literature into a coherent framework to think about the heterogeneity of nudges. The framework allows us to address questions about how behavioral interventions interact with traditional economic policies and to assess impacts on social welfare. We focus on the interaction between two dimensions of heterogeneity, the effect of situational nudges and individual disposition, developing a model that considers short-term situation and long-run disposition as two separate channels for inducing the warm-glow or cold-prickle associated with impure altruism. While some of the past literature we have noted does acknowledge the heterogeneity, there has been no systematic study of how different kinds of nudges interact with disposition. Existing research focuses on each nudge intervention independently. Also, while the distinction between disposition and situation is a fundamental one in psychology, it is a relatively underutilized conceptual framework within the realm of economics.

We parameterize the situational impure altruism induced by nudges and the dispositional impure altruism inherent to the individual using cross-derivatives, and predict behavior and welfare based on this relationship. By organizing the existing literature in this way, we hope to provide guidance for the empirical research needed to properly assess the welfare of nudge-based policies.

The main policy context we are interested in is informational message campaigns targeted toward reducing consumption of goods that involve negative externalities: e.g., reducing consumption of gasoline in order to mitigate environmental damage. Therefore, the main disposition we will focus on is whether people are more or less inclined to have altruistic preferences toward curtailing their own consumption for the good of the public. Since the natural policy in this context is Pigouvian taxation, we will consider the interactions between our two dimensions of impure altruism and optimal tax policy, although the model applies to any traditional economic incentive scheme designed to change consumption behavior. Even in domains where changes to tax policy are politically infeasible, design of behavioral interventions should depend on the existing tax environment.

The main findings of the paper include the following:

The effect of pro-social nudges on welfare can be either positive or negative.
The observed heterogeneity across behavioral interventions can be explained by the cross-derivative between situational and dispositional effects.
The optimal tax is a function of disposition and situational nudges.
1. Targeted taxes on certain subpopulations may be welfare enhancing.
2. Optimal economic policy accounts for interactions with situational nudges.
3. The inefficiency of a tax policy that does not take into account impure altruism and situational nudges depends on the complementarity between situational altruism and dispositional altruism.

The paper proceeds as follows. We first review the empirical literature on informational nudges and the explanations for their heterogeneous effects. We compare our model with past experimental evidence and similar theories before we introduce our model of impure altruism. We then present the main propositions about the effects of impure altruism on individual behavior, individual utility, and social welfare, and then examine the role of the interaction between situation and disposition. Next, we consider taxation as either an alternative or a supplement to nudges for social welfare enhancement and develop a model to explore how optimal taxes can depend on situational factors. The implications for policy are discussed. Finally, some extensions of the model are considered for future research.

2 Literature Review

The growing use of informational nudges is a result of the emerging behavioral economics and psychological literatures. Bergstrom, Blume, and Varian (1986) introduce the idea of impure altruism and acknowledge its role in driving individual choices. Andreoni (1990, 1995) extends the previous analysis and incorporates impure altruism into a more general model, reasoning that nudges work by inducing the feeling of a cold-prickle when creating negative social externalities. Ayres, Raseman, and Shih (2013) find in two large field experiments partnered with private companies, Positive Energy and oPower, that by providing feedback to customers on home electricity and natural gas usage with a focus on peer comparisons, utilities can reduce energy consumption at a low cost. Berger and Rand (2008) also acknowledge the power of peer information provision in redressing heavy drinking in universities through laboratory and field experiments.

However, informational nudges are not always effective and their effects heavily rely on the type of people targeted and the framing of the message. While some nudges do effectively reduce production of a negative social externality, others have no impact or may even be counterproductive. Cialdini, Kallgren, and Reno (1991) explain the heterogeneous effects by arguing that injunctive nudges, which guide behavior by shifting the perception of how most others would approve or disapprove of a person’s conduct, are more likely to lead to beneficial social conduct than the descriptive types, which guide behavior through the perception of how most others would behave. Cialdini et al. (2006) show how some nudges backfire. In Arizona’s Petrified Forest National Park, while the rate of petrified wood theft decreased when people were given an injunctive message about what behaviors were not considered acceptable, another treatment using a descriptive message noting that others routinely violated the prohibition proved to be counterproductive. Merritt, Effron, and Monin (2010) explain how one act of pro-social behavior can lead to more pro-social behavior when it is seen as a commitment to the cause, but can lead to more antisocial behavior if it makes people feel like they have done “enough.” Ferraro and Price (2013) revisit a field experiment about water consumption and find that messages based on social norms and social comparisons are more effective than instructive descriptions for reducing water usage.

The type of nudge is not the only source of heterogeneity in terms of effectiveness. Different groups of people are often affected differently by the same informational messaging for two general reasons. The first reason is that different people begin from different status quos. Allcott (2011) documents heterogeneous effects in an evaluation of oPower’s Home Energy Report letters, which compare residential utility customers’ electricity usage to that of their neighbors. Residents using less than the average increased their energy usage, while those using more decreased usage; mean energy usage was therefore largely unchanged. Akerlof and Kennedy (2013) further point out that campaigns deploying messages describing public levels of drug and alcohol use, recycling, littering, etc., have often had little success in heightening adoption of pro-social behaviors. They attribute the ineffectiveness of informational nudges to their heterogeneous impact on people whose conduct is above social norms (socially desirable) and people whose conduct is below social norms (socially undesirable). The second source of heterogeneity comes from differences in individual disposition for pro-sociality due to factors such as group identity, perception of moral obligations, and individual altruism. Beshears et al. (2011) find that peer information increased retirement savings of nonunionized recipients but decreased savings of unionized recipients, attributing the difference to differences in norms between the union and nonunion workers. Jordan, Mullen, and Murnighan (2011) find that people who were asked to recall their immoral behavior reported greater participation in moral activities than people who recalled moral behavior. Ho et al. (2015) find in paired lab and field studies of green electricity purchases that informational nudges have a stronger effect on those who were intrinsically inclined to be pro-social.

Our paper considers situation and disposition as two ways to induce the warm-glow or cold-prickle associated with impure altruism and parameterizes their effects into two temporal dimensions. Formally, disposition is a tendency by an individual to act in a specified way and is stable over time. We denote the effect of disposition by θ and call it the parameter for dispositional altruism. A person with high θ is more likely to be pro-social whereas a person with low θ is less likely. A nudge, on the other hand, has a situational effect that changes each period. We denote the effect of nudges by ω and call it situational altruism. In the case of informational nudges, someone who receives a nudge and thereafter perceives his behavior to be more socially desirable than the average (above the norm) will experience low ω. Someone who perceives his personal behavior to be less socially desirable than the average (below the norm) will experience high ω.

Our theory shows how the heterogeneous effect of behavioral interventions depends on a third factor: the interaction between situation and disposition. We examine the individual psychic cost function for the relationship between situation and disposition. Situation and disposition are complements when people that are more altruistic are more receptive to information that confirms their prior inclinations. Situation and disposition are substitutes when a situational nudge causes resentment in those most likely to give.

Our findings are consistent with the self-completion theory and moral regulation model in psychology. According to the self-completion theory of Jordan, Mullen, and Murnighan (2011), “recalling one’s (im)moral behavior will lead to compensatory rather than consistent moral action as a way of completing the moral self.” Sachdeva, Iliev, and Medin (2009) explain the compensatory moral action differently and argue that moral or immoral behavior can result from an internal balancing of moral self-worth and the cost inherent in altruistic behavior. When moral identity is threatened, moral behavior is a means to regain some lost self-worth (moral cleansing); however, affirming a moral identity leads people to feel licensed to act immorally (moral licensing). An experiment conducted by Darlington and Macker (1966) shows that participants who were led to believe that they had harmed another person were more likely to subsequently engage in altruistic behavior such as donating blood to a local hospital, indicating guilt-induced moral cleansing. On the other hand, moral licensing can happen when good deeds of those who are above the norm establish moral credits that can be “withdrawn” to “purchase” the right to do bad deeds with impunity (Merritt, Effron, and Monin 2010). Monin and Miller (2001) note that licensing operates by providing a temporary boost in self-concept, and that an initial altruistic intent that boosts self-concept can liberate people to choose more indulgent options. Khan and Dhar (2006) and Ayal and Gino (2011) also note that people may feel licensed to refrain from good behavior when they have amassed a surplus of moral currency.

Formally, our model shares much with that of Andreoni (1990), as we both aim to extend the analysis of Bergstrom, Blume, and Varian (1986) by decomposing their original idea of impure altruism. Andreoni’s model characterizes people as either pure altruists whose preferences only depend on the total consumption of a public good or as egoists whose preferences only depend on the private consumption of the public good, with impure altruism combining the two. We add to Andreoni’s idea of egoism and impure altruism by focusing on how impure altruism may arise from two distinct sources, i.e., situation and disposition, modeling the effect of situation and disposition through a psychic cost function. Moreover, just as Andreoni (1995) notices the behavioral asymmetry between the warm-glow of doing something good and the cold-prickle of doing something bad, we uncover a different type of behavioral asymmetry, one based not on the consequences of the behavior but on the disposition of people and the interaction between situation and disposition.

We also endogenize our measure of impure altruism by allowing the situational effects of a nudge to vary with the individual’s own choices. A common behavioral policy intervention is to tell individuals how their behavior compares to the behavior of others. However, if implemented on a large scale, the “behavior of others” also depends on the intervention. Rotemberg (1994) examines how altruism may arise endogenously among a small set of strategically related individuals. Bowles (1998) discusses how markets and other economic institutions may give rise to endogenous preferences by changing the exogenous determinants in a cultural equilibrium. Casadesus-Masanell (2004) studies how principals may utilize motivational schemes that rely on social influences such as norms, ethical standards, and altruism to foster intrinsic motivation and trustworthy behavior. Our interest differs from theirs in that we are less interested in how these preferences arise. Instead we are interested in how the government needs to account for these preferences when considering behavioral policy interventions. We formalize such endogeneity by deriving the optimal message that policy makers should send regarding the behavior of others and show that the optimal message should take into account the population’s heterogeneity as well as the complementarity between dispositional and situational altruisms.

We discussed above examples of heterogeneity in response to such messages in trials involving messages about water usage (Ferraro and Price, 2013), electricity usage (Allcott, 2011), or carbon footprint (Ho et al. 2015). Other studies in the lab and field shed light on the mechanisms through which information about the behavior of others influences behavior. For example, Azmat and Iriberri (2010) examine the effect of a natural experiment where high school students learn their relative rank. They find the strongest effect of such information on the tails of the distribution and argue that such information works because people have competitive preferences that make especially high or low rankings more meaningful.

A different type of heterogeneity has been demonstrated in field data for cashiers (Mas and Moretti, 2009) and envelope stuffers (Falk and Ichino, 2006) who observe the performance of peers. Both find that working next to higher performing peers induces higher performance, but primarily for low-performing workers. Mas and Moretti (2009) consider the alternative mechanisms of social pressure, contagious enthusiasm, and knowledge spillovers, and argue that social pressure to conform to high performance is the driving mechanism.

Masclet et al. (2003) identify a different source of potential heterogeneity that operates through the mechanism of social pressure. They show that nonmonetary punishments, i.e., the verbal expression of disapproval, will induce some people to contribute to a public good. Individuals of different dispositions would likely respond differently to such nonmonetary sanctions. A similar nonmonetary effect can be seen in Ball et al. (2001) and Eckel, Fatas, and Wilson (2010), which show that people respond to arbitrarily assigned status in both market and public goods games. Knowing how you rank in some meaningless task domain affects behavior in other domains.

The main limitation to all of these studies is that even those that consider heterogeneity tend to focus on a single intervention of fixed size. This paper highlights the need to consider how the heterogeneity in the response to a nudge varies as the size of the nudge varies. This paper studies such variation in a general model that accounts both for economic mechanisms such as social disapproval, peer comparisons, and status and also for more psychological mechanisms like moral regulation and guilt.

Our paper further explores the policy implications of behavioral nudges in light of optimal tax treatments. Our tax model builds on Sandmo’s (1975) model for optimal commodity tax, but with a damage that comes from both the externality and a psychic cost. Johansson (1997) and Diamond (2006) also incorporate warm-glow preferences into optimal tax calculations, but unlike Diamond, we focus on the cold-prickle from the creation of a negative externality rather than warm-glow, and additionally explore the interaction effects between taxes and impure altruism.

3 Main Model and Propositions

3.1 Model

Assume that there is a group of $I$ individuals who are impurely altruistic, with $I \geq 2$ and $I \in \mathbb{N}^{*}$. The $i$th individual chooses a quantity of $g_i$ units of a good with a negative externality on the whole group.

We first evaluate the individual’s decision problem. Each individual chooses $g^{*}$ to maximize the following individual utility function:

$$u_i = V(g_i) - N(G) - c(g_i, \theta, \omega) \tag{1}$$

where $V(g_i)$ is the value of $g_i$ units of the good to the individual, $N(G)$ is the per-person cost due to the negative externality of the total consumption of the good, and $G$ is the total number of units consumed by everyone, i.e., $G = \sum_{i=1}^{I} g_i$. $\frac{dV}{dg}$ is the marginal benefit for each additional unit of good consumed, and $\frac{d^2 V}{dg^2}$ measures the change in the marginal benefit for each additional unit of good consumed. $\frac{dN}{dG}$ is the marginal cost for each individual for each additional unit of good consumed by the group, and $\frac{d^2 N}{dG^2}$ is the change in the marginal cost for each individual for each additional unit of good consumed by the group.

The cost function, $c(g, \theta, \omega)$, is based on the units of good consumed ($g$), the individual’s level of dispositional altruism ($\theta$), and the situational altruism induced by nudges ($\omega$). Assuming $\theta$ and $\omega$ are both exogenous, $\frac{\partial c}{\partial g}$, $\frac{\partial c}{\partial \theta}$, and $\frac{\partial c}{\partial \omega}$ are, respectively, the marginal cost of consumption conditional on psychological factors, the marginal psychic costs due to disposition, and the marginal psychic costs induced by nudges. The second derivative, $\frac{\partial^2 c}{\partial g^2}$, is the change in the marginal cost of the good for each additional unit of good consumed. The cross-derivatives, $\frac{\partial^2 c}{\partial \theta \partial g}$ and $\frac{\partial^2 c}{\partial \omega \partial g}$, are our proxies for the interaction between consumption and impure altruism. Specifically, $\frac{\partial^2 c}{\partial \theta \partial g}$ and $\frac{\partial^2 c}{\partial \omega \partial g}$ measure how the marginal cost of consumption changes, respectively, with people’s dispositional and situational altruism. We use the concepts of impure altruism and culpability interchangeably in the model: as in Andreoni (1995), the warm-glow of altruism you get from reducing consumption is equal to the cold-prickle of guilt you feel when you increase consumption.

Finally, we examine the socially optimal level of consumption, $G^{**}$, which is found by maximizing the sum of utilities for the population, $U(G)$:

$$U(G) = I\,V\!\left(\frac{G}{I}\right) - I\,N(G) - I\,c\!\left(\frac{G}{I}, \theta, \omega\right). \tag{2}$$
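As an illustrative sketch of how (1) and (2) separate the private and social choices, the first-order conditions can be written out. This assumes interior solutions, differentiable $V$, $N$, and $c$, and a symmetric outcome in which every individual consumes $G/I$; it is a reading aid, not a substitute for the formal proofs.

```latex
% Individual problem (1): one more unit of g_i raises G by one unit.
\frac{\partial u_i}{\partial g_i}
  = \frac{dV}{dg}(g^{*}) - \frac{dN}{dG}(G^{*})
    - \frac{\partial c}{\partial g}(g^{*}, \theta, \omega) = 0 .

% Social problem (2): differentiate U(G) with respect to G, with g = G/I.
\frac{dU}{dG}
  = \frac{dV}{dg}\!\left(\tfrac{G^{**}}{I}\right) - I\,\frac{dN}{dG}(G^{**})
    - \frac{\partial c}{\partial g}\!\left(\tfrac{G^{**}}{I}, \theta, \omega\right) = 0 .
```

Under these assumptions the two conditions differ only in the weight on the marginal externality: the individual counts $\frac{dN}{dG}$ once, whereas the planner counts it $I$ times.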

3.2 Assumptions

The main model and its propositions are based on the following assumptions:

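(A1) All individuals have the same utility function.
(A2) $\frac{dN}{dG} > 0$ and $\frac{d^2 N}{dG^2} \geq 0$.
(A3) $V(g)$ is concave and increasing in $g$, that is, $\frac{dV}{dg} > 0$ and $\frac{d^2 V}{dg^2} < 0$.
(A4) The impure altruism or culpability variables, $\theta$ and $\omega$, are exogenous, that is, they are determined externally and do not change as $g$ changes.
(A5) $\frac{\partial c}{\partial \theta} > 0$, $\frac{\partial c}{\partial \omega} > 0$, and $\frac{\partial c}{\partial g} > 0$. Purchase and transaction costs are subsumed by the cost function along with the psychic costs.
(A6) $\frac{\partial^2 c}{\partial g^2} \geq 0$.
(A7) Interaction between consumption and culpability: $\frac{\partial^2 c}{\partial \theta \partial g} > 0$ and $\frac{\partial^2 c}{\partial \omega \partial g} > 0$.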

(A1)–(A7) are typical concavity assumptions. (A7) argues that the cold-prickle of culpability increases in strength for larger deviations.

3.3 Propositions

Propositions 1 through 4 reproduce the standard tragedy of the commons results. We describe them briefly here but leave the details to the Appendix. Proposition 1 says that the individually rational choice of consumption of the antisocial good, $g^{*}$ and $G^{*}$, is greater than the socially optimal level of consumption, $g^{**}$ and $G^{**}$. Proposition 2 says that consumption of the antisocial good is decreasing in dispositional or situational altruism. We interpret high situational altruism, $\omega$, as the effect of moral cleansing. A person who is made to feel situationally guilty will act more altruistically to offset that guilt. We interpret low situational altruism as the effect of moral licensing. A subject who feels they have done something moral will consume more of the polluting good, $g^{*}$.
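Propositions 1 and 2 can be illustrated with a small numerical sketch. The functional forms and parameter values below (quadratic `V`, linear `N`, a psychic cost that is linear in `g`, and the symbols `a`, `b`, `n`, `p`, `kappa`) are assumptions chosen for tractability, not forms taken from the paper.

```python
# Illustrative functional forms (assumptions, not taken from the paper):
#   V(g) = a*g - (b/2)*g**2          concave, increasing for g < a/b
#   N(G) = n*G                       linear per-person externality cost
#   c(g, theta, omega) = g * (p + theta + omega + kappa*theta*omega)

def unit_cost(theta, omega, p=1.0, kappa=0.5):
    """Per-unit cost: price p plus psychic cost from disposition and situation."""
    return p + theta + omega + kappa * theta * omega

def g_private(theta, omega, a=10.0, b=1.0, n=0.5):
    """Individually rational g*: solves a - b*g - n - unit_cost = 0."""
    return (a - n - unit_cost(theta, omega)) / b

def g_social(theta, omega, group_size=5, a=10.0, b=1.0, n=0.5):
    """Per-person social optimum g**: the planner counts the externality I times."""
    return (a - group_size * n - unit_cost(theta, omega)) / b

if __name__ == "__main__":
    for theta, omega in [(1.0, 1.0), (2.0, 1.0), (1.0, 2.0)]:
        print(theta, omega, g_private(theta, omega), g_social(theta, omega))
```

In this sketch the private choice exceeds the per-person social optimum for any group of two or more, and raising either `theta` or `omega` lowers the private choice, which is the pattern the two propositions describe.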

Propositions 3 and 4 evaluate the effect of impure altruism on utility and welfare. Proposition 3 says that if the marginal disutility of increased altruism is greater (less) than the marginal utility from mitigating the externality, then altruism is welfare decreasing (increasing). In other words, a higher level of either situational or dispositional guilt potentially reduces individual utility. Increased guilt increases psychic cost but reduces consumption of the dirty good. The net effect on welfare depends on the relative magnitudes of the two. Proposition 4 looks at social welfare loss, the gap between first best and equilibrium consumption as defined in Proposition 1, and decomposes the effect of altruism on welfare loss into two parts. The first part captures welfare gains due to change in marginal psychic costs. The second part captures welfare loss due to change in marginal social externality at the equilibrium outcome. The net effect of impure altruism on social welfare loss depends on the relative magnitude of these two parts.
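The ambiguity in Proposition 3 can be seen in a short numerical sketch. The functional forms, parameter values, and function names below are illustrative assumptions, not the paper's specification; the example only shows that the sign of the utility effect of a nudge can go either way.

```python
# Illustrative functional forms (assumptions, not taken from the paper):
#   V(g) = a*g - (b/2)*g**2,  N(G) = n*G,
#   c(g, theta, omega) = g * (p + theta + omega + kappa*theta*omega)

def unit_cost(theta, omega, p=1.0, kappa=0.5):
    """Per-unit cost: price p plus psychic cost from disposition and situation."""
    return p + theta + omega + kappa * theta * omega

def equilibrium_utility(theta, omega, group_size, a=10.0, b=1.0, n=0.5):
    """Utility of one person when every group member chooses the private optimum g*."""
    g = (a - n - unit_cost(theta, omega)) / b      # private first-order condition
    value = a * g - 0.5 * b * g ** 2               # V(g*)
    externality = n * group_size * g               # N(G*) with G* = I * g*
    cost = g * unit_cost(theta, omega)             # c(g*, theta, omega)
    return value - externality - cost

def utility_change(theta, omega, group_size, step=0.1):
    """Change in equilibrium utility when a nudge raises omega by `step`."""
    return (equilibrium_utility(theta, omega + step, group_size)
            - equilibrium_utility(theta, omega, group_size))

if __name__ == "__main__":
    for group_size in (5, 20):
        print(group_size, utility_change(theta=1.0, omega=1.0, group_size=group_size))
```

With these assumed numbers the nudge lowers utility in the group of 5, where the added psychic cost outweighs the avoided externality, and raises it in the group of 20, where the externality avoided by everyone cutting back dominates.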

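The heterogeneous effects of nudges are determined by the relationship between $\theta$ and $\omega$, specifically, the cross-derivatives $\frac{\partial^2 c}{\partial \omega \partial \theta}$ and $\frac{\partial^2 c}{\partial \theta \partial \omega}$. The cross-derivatives indicate how the marginal psychic costs of additional guilt of one type change with the level of guilt of another type. From here, we assume $c^{*}$ is continuous, so $\frac{\partial^2 c}{\partial \omega \partial \theta} = \frac{\partial^2 c}{\partial \theta \partial \omega}$. If $\frac{\partial^2 c}{\partial \omega \partial \theta} = \frac{\partial^2 c}{\partial \theta \partial \omega} > 0$, we say $\theta$ and $\omega$ are complements. If $\frac{\partial^2 c}{\partial \omega \partial \theta} = \frac{\partial^2 c}{\partial \theta \partial \omega} < 0$, we say $\theta$ and $\omega$ are substitutes. We need to make one additional assumption before moving on to Proposition 5:

(A8) The psychic cost function is linear in $g$, that is, $\frac{\partial^2 c}{\partial g^2} = 0$. In this case, the cost function can be written as $c(g, \theta, \omega) = g \cdot c'(\theta, \omega)$.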

Intuitively it means that marginal psychic cost remains constant as consumption increases (i.e., there is no “income” effect for psychic costs).

Proposition 5:

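Interaction between $\theta$ and $\omega$. Assuming (A2), (A3), (A6), and (A8), the impacts of situation and disposition on consumption are complements (substitutes) if and only if their impacts on psychic costs are complements (substitutes):

$$\operatorname{sign}\!\left(\frac{d^2 g^{*}}{d\omega\, d\theta}\right) = -\operatorname{sign}\!\left(\frac{\partial^2 c}{\partial \omega\, \partial \theta}\right).$$

Symmetrically, we have

$$\operatorname{sign}\!\left(\frac{d^2 g^{*}}{d\theta\, d\omega}\right) = -\operatorname{sign}\!\left(\frac{\partial^2 c}{\partial \theta\, \partial \omega}\right).$$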

(See Appendix A.5 Proof of Proposition 5.)

This proposition effectively says:

When situation and disposition are substitutes:
1. ω has a larger effect on g if θ is smaller.
2. That is, situational guilt has a larger effect on people inclined to be bad (versus people inclined to be good).
3. θ has a larger effect on g if ω is smaller.
4. That is, disposition has a larger effect on people who experience low situational guilt (versus people who experience high situational guilt).
When situation and disposition are complements:
1. ω has a larger effect on g if θ is larger.
2. That is, situational guilt has a larger effect on people inclined to be good (versus people inclined to be bad).
3. θ has a larger effect on g if ω is larger.
4. That is, disposition has a larger effect on people who experience high situational guilt (versus people who experience low situational guilt).

Therefore,

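If $\frac{\partial^2 c}{\partial \omega \partial \theta} > 0$ ($\theta$ and $\omega$ are complements), then $\frac{d^2 g^{*}}{d\omega\, d\theta} < 0$.
If $\frac{\partial^2 c}{\partial \omega \partial \theta} < 0$ ($\theta$ and $\omega$ are substitutes), then $\frac{d^2 g^{*}}{d\omega\, d\theta} > 0$.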
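The sign relationship in Proposition 5 can be checked numerically in a simple special case. The sketch below assumes a quadratic `V`, a linear `N`, and a psychic cost `g * (p + theta + omega + kappa*theta*omega)`, so that the cross-derivative of the psychic cost has the sign of the hypothetical parameter `kappa`. These forms are chosen so that third-derivative terms vanish; they are illustrative assumptions, not the paper's specification.

```python
def g_star(theta, omega, kappa, a=10.0, b=1.0, n=0.5, p=1.0):
    """Private optimum under the assumed forms: a - b*g - n - unit_cost = 0."""
    unit_cost = p + theta + omega + kappa * theta * omega
    return (a - n - unit_cost) / b

def cross_effect(kappa, theta=1.0, omega=1.0, h=1e-2):
    """Central finite-difference estimate of d^2 g* / (d omega d theta)."""
    def f(t, w):
        return g_star(t, w, kappa)
    return (f(theta + h, omega + h) - f(theta + h, omega - h)
            - f(theta - h, omega + h) + f(theta - h, omega - h)) / (4 * h * h)

if __name__ == "__main__":
    for kappa in (0.5, -0.2):   # kappa > 0: complements; kappa < 0: substitutes
        print(kappa, cross_effect(kappa))
```

In this special case a positive `kappa` (complements) yields a negative cross effect on consumption and a negative `kappa` (substitutes) yields a positive one, matching the two cases listed above.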

Proposition 5 raises the question: which types of situational nudges are complements and which types are substitutes? We can certainly speculate based on the mechanisms behind such nudges in the literature. For example, there is some evidence that nudges that appeal to social pressure would be more effective on those who have a disposition to be altruistic (e.g., Ho et al. 2015). Alternatively, nudges that threaten one’s status may be counterproductive for those of an altruistic disposition. Peer effects for work performance seem to be a substitute for those with an intrinsic disposition toward good performance (Falk and Ichino, 2006; Mas and Moretti, 2009). However, to our knowledge there has been no systematic study of the complementarity between nudges and dispositional altruism.

The following section offers some guidance on how one might answer the complementarity question by re-examining existing data for one type of situational intervention, the peer information nudge that tells people how their behavior compares to others.

3.4 Endogenous Situational Altruism

Up until now we have considered only cases where situation and disposition were exogenous. To understand the specific case of peer information nudges where people are targeted with a descriptive message telling them about the behavior of their peers, we consider the case where $\omega$ changes with $g$, that is, my situational guilt increases the more I choose to consume. We introduce $\hat{g}$ as an informational nudge where $\omega$ is a function of $g$ and $\hat{g}$. Think of $\hat{g}$ as the informational message selected by the nudge designer that reads “other people like you chose to consume $\hat{g}$ units of the good.” Clearly, such messages only work if this message, $\hat{g}$, differs from the person’s prior belief about the consumption of others. We denote the prior beliefs regarding the consumption of others by $\hat{g}_0$. Even a truth-telling policy maker has considerable discretion in choosing $\hat{g}$, because the term “others like you” could refer to many different comparison groups (e.g., people in your neighborhood, or in your state, or in your age group) (Hallsworth et al., 2014).

The key assumption for our model of endogenous altruism is that my desire to curtail consumption declines as I cut my own consumption, $\frac{\partial \omega}{\partial g} > 0$, but increases if I think others are cutting their consumption, $\frac{\partial \omega}{\partial \hat{g}} < 0$.

Proposition 6:

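The Case of Endogenous $\omega$. Assuming (A2), (A3), (A5), (A6), and (A7), if situational altruism depends on $g$ such that $\frac{\partial \omega}{\partial g} > 0$, $\frac{\partial^2 \omega}{\partial g^2} > 0$, and $\frac{\partial \omega}{\partial \hat{g}} < 0$, then

(i) $\frac{dg^{*}}{d\hat{g}} > 0$, and (ii) $\frac{dg^{*}}{d\theta} < 0$.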

(See Appendix A.6 for Proof of Proposition 6.)
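A numerical sketch can make the comparative statics in Proposition 6 concrete. It assumes an exponential form for situational altruism, `omega(g, g_hat) = exp(S*(g - g_hat))`, which is increasing and convex in `g` and decreasing in `g_hat`, together with a quadratic `V`, a linear `N`, and a psychic cost linear in `g`. All functional forms, constants, and function names here are illustrative assumptions.

```python
import math

# Assumed parameters (hypothetical, for illustration only).
A, B = 10.0, 1.0      # V(g) = A*g - (B/2)*g**2
N_MARG = 0.5          # N(G) = N_MARG * G
P, KAPPA = 1.0, 0.5   # c = g * (P + theta + omega + KAPPA*theta*omega)
S = 0.2               # sensitivity of omega to the gap g - g_hat

def omega(g, g_hat):
    """Situational altruism: rises with own consumption, falls with the message."""
    return math.exp(S * (g - g_hat))

def marginal_net_benefit(g, g_hat, theta):
    """Derivative of individual utility with respect to g, with omega endogenous."""
    w = omega(g, g_hat)
    # d/dg of g * (P + theta + w + KAPPA*theta*w), using dw/dg = S * w
    marginal_cost = P + theta + (1 + KAPPA * theta) * w * (1 + S * g)
    return (A - B * g) - N_MARG - marginal_cost

def g_star(g_hat, theta, tol=1e-10):
    """Bisection on [0, A/B]; the marginal net benefit is decreasing in g."""
    lo, hi = 0.0, A / B
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if marginal_net_benefit(mid, g_hat, theta) > 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

if __name__ == "__main__":
    print(g_star(5.0, 1.0), g_star(6.0, 1.0), g_star(5.0, 2.0))
```

Under these assumptions, a message reporting higher peer consumption raises the chosen consumption, and a more altruistic disposition lowers it, in line with parts (i) and (ii).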

Proposition 6 reproduces our main results from Propositions 1–4 for the case of endogenous ω. In other words, when there is a positive complementarity between situational culpability and consumption, informational nudges influence social behavior by providing a reference point.

We use these results to derive the optimal message $\hat{g}$ so that individuals are nudged to choose the consumption level $g$ that is optimal for society. In line with the discussion of exogenous impure altruism in Section 3.3, we know that individuals choose $g^{*}(\hat{g}_0, \theta)$, resulting in total consumption of $I \cdot g^{*}(\hat{g}_0, \theta)$, while a social planner would choose $G^{**}(\hat{g}_0, \theta) < I \cdot g^{*}(\hat{g}_0, \theta)$. Note that as specified by (4), $G^{**}$ does not account for the psychic costs of receiving a social nudge. While this assumption is plausible given that current policy rarely considers psychic costs, it is not innocuous. We will see later that as in Propositions 3 and 4, impure altruism makes the welfare implications of nudges less straightforward. For now, we maintain the assumption that the social planner, knowing the individual demand function $g^{*}(\cdot, \theta)$ and disposition, $\theta$, chooses an optimal message $\hat{g}_{opt}$ such that

$$I \cdot g^{*}(\hat{g}_{opt}, \theta) = G^{**}(\hat{g}_0, \theta).$$

Proposition 7:

Optimal Message under Endogenous $\omega$. In addition to the assumptions in Proposition 6, assuming (A6) and some technical conditions, a solution $\hat{g}=\hat{g}_{opt}$ to the following equation exists:

$$I\cdot g^*(\hat{g},\theta)=G^{**}(\hat{g}_0,\theta).$$

Moreover, the optimal message depends on the dispositional altruism of the population:

$$\frac{d\hat{g}_{opt}}{d\theta}\neq 0.$$

(See Appendix A.7 for Proof of Proposition 7.)

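The optimal message is chosen such that each individual maximizes

$$u_i=V(g_i)-N(G)-c\big(g_i,\theta,\omega(g_i,\hat{g}_{opt})\big), \tag{3}$$

and the social planner also maximizes total social utility (excluding psychic costs) given $\hat{g}_0$ (we can think of $\hat{g}_0$ as the true average consumption in the population),

$$U_G=I\,V\!\left(\tfrac{G}{I}\right)-I\,N(G)-I\,c\!\left(\tfrac{G}{I},\theta,\omega\!\left(\tfrac{G}{I},\hat{g}_0\right)\right). \tag{4}$$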
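To make the construction of $\hat{g}_{opt}$ concrete, the following sketch solves $g^*(\hat{g},\theta)=G^{**}(\hat{g}_0,\theta)/I$ numerically. It is a minimal illustration under assumed functional forms ($V(g)=2\sqrt{g}$, $N(G)=nG$, $c=\theta\,\omega\,g$, $\omega(g,\hat{g})=e^{g-\hat{g}}$); the constants `N_SLOPE` and `POP` and all function names are hypothetical and not taken from the paper.

```python
import math

N_SLOPE = 0.05   # assumed marginal externality per unit of G
POP = 10         # assumed population size I

def bisect(f, lo, hi, iters=200):
    """Root of a function that is positive at lo and negative at hi."""
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if f(mid) > 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def marginal_guilt(g, g_hat, theta):
    # d/dg of the assumed psychic cost c = theta * g * exp(g - g_hat)
    return theta * math.exp(g - g_hat) * (1.0 + g)

def individual_choice(g_hat, theta):
    # The individual internalizes only his own marginal externality N_SLOPE.
    return bisect(lambda g: 1.0 / math.sqrt(g) - N_SLOPE
                  - marginal_guilt(g, g_hat, theta), 1e-12, 50.0)

def planner_choice(g_hat0, theta):
    # Per-capita planner choice g** = G**/I: internalizes POP * N_SLOPE.
    return bisect(lambda g: 1.0 / math.sqrt(g) - POP * N_SLOPE
                  - marginal_guilt(g, g_hat0, theta), 1e-12, 50.0)

def optimal_message(g_hat0, theta):
    """Message g_hat_opt with individual_choice(g_hat_opt) = planner_choice."""
    target = planner_choice(g_hat0, theta)
    # g* is increasing in the message, so target - g* is decreasing in it.
    return bisect(lambda m: target - individual_choice(m, theta),
                  g_hat0 - 10.0, g_hat0, iters=100)

prior = 1.0
for theta in (0.5, 1.0):
    print(theta, optimal_message(prior, theta))
```

In this example the planner reports a message below the prior belief $\hat{g}_0$, and the reported value changes with $\theta$, which illustrates (but does not prove) the dependence stated in Proposition 7.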

One caveat is that when the optimal message is chosen as such, the psychic cost function complicates the welfare implications. In a standard public good model without psychic costs, nudging individual consumption toward the best social choice has clear welfare improving consequences. However, when there is impure altruism, a message that induces a cold-prickle would increase psychic costs and thus reduce individual utility. As in Propositions 3 and 4, the welfare loss depends on the degree of complementarity between situational and dispositional altruism.

In essence, the heterogeneous effects of informational nudges hinge on three factors: situational altruism induced by nudges (ω), long-run dispositional altruism (θ), and the relationship between the two. Differences in ω or θ alone for different groups of people can explain many cases of heterogeneity in consumption, but the net outcome is less obvious when they are inconsistent (e.g., ω high and θ low). Also, whether situation and disposition are viewed as complements or substitutes affects the effectiveness of nudges for certain types of people. A nudge that induces situational guilt has a larger effect on people inclined to be bad (versus good) if people view situation and disposition as substitutes, but the nudge has a larger effect on people inclined to be good (versus bad) if people view situation and disposition as complements. Modeling ω as either exogenous or endogenous does not affect our conclusions. From the standpoint of policy interventions with the goal of enhancing the effectiveness of informational nudges, if a nudge can be designed so that people view it as a substitute for disposition, then it may be used to target people inclined to be bad. If a nudge can be designed so that people view it as a complement to disposition, then it may be used to target people inclined to be good.
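The targeting logic summarized above can be illustrated with a closed-form example. The sketch assumes $V(g)=ag-g^2/2$, $N(G)=nG$, and a psychic cost $c=g\cdot(\theta+\omega+\kappa\,\theta\,\omega)$, where the hypothetical parameter $\kappa$ sets the sign of the cross-partial between situation and disposition ($\kappa<0$ for substitutes, $\kappa>0$ for complements). None of these forms or values come from the paper.

```python
def consumption(theta, omega, kappa, a=5.0, n=0.5):
    """Closed-form g* for the assumed forms V(g) = a*g - g**2/2,
    N(G) = n*G, c = g * (theta + omega + kappa*theta*omega)."""
    psychic_slope = theta + omega + kappa * theta * omega
    return max(a - n - psychic_slope, 0.0)

def nudge_effect(theta, kappa, omega_before=0.5, omega_after=1.0):
    """Drop in consumption when a nudge raises situational guilt omega."""
    return (consumption(theta, omega_before, kappa)
            - consumption(theta, omega_after, kappa))

for label, kappa in [("substitutes", -0.5), ("complements", 0.5)]:
    low = nudge_effect(theta=0.2, kappa=kappa)    # person inclined to be "bad"
    high = nudge_effect(theta=1.0, kappa=kappa)   # person inclined to be "good"
    print(f"{label}: low-theta drop = {low:.3f}, high-theta drop = {high:.3f}")
```

With these assumed numbers the nudge reduces consumption more for the low-$\theta$ type when $\kappa<0$ and more for the high-$\theta$ type when $\kappa>0$.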

In light of the results suggested by our model, additional or alternative measures should be considered when informational provision is less effective for certain groups of people. Some other forms of prevalent situational nudges include commitment devices, default options, implementation intentions, and exploitation of nonlinear demand curves (Allcott and Mullainathan, 2010). The identification of the relationship between situation and disposition, however, can be demanding. While such inferences may be possible by reanalyzing existing data, more research is necessary to avoid potentially adverse consequences of nudge-based policies.

4 The Role of Government: A Model of Taxation

As noted in the general model in Section 3, social welfare loss due to choices made by self-centered individuals may serve as the rationale behind governmental intervention. One possible way to alter the choice of consumption is through the use of informational nudges, as discussed in the previous sections. Such interventions are less conventional but are favored by many researchers and policy makers over commands, requirements, and prohibitions (Thaler and Sunstein, 2008). Nevertheless, more conventional governmental interventions have been in place for a longer time and are still worth studying. This section is devoted to one specific type of traditional intervention, taxation, and aims to provide insights into the optimal tax in the presence of dispositional and situational pro-sociality.

We first consider the first best policy, where government chooses both taxes and nudges optimally. However, policy makers often see nudges as a way to influence behavior when taxes are politically infeasible. In such cases, we hope our model serves as a reminder that nudges are not designed in a vacuum, and should be designed to complement existing tax policies.

4.1 Impure Altruism and Taxation

To maximize total social welfare, the government can impose a Pigouvian corrective tax^[3] of $T$ on the purchase of each unit of the good to affect the choice of $g$. For simplicity, we focus this discussion on a combined measure of impure altruism, $\alpha(\theta,\omega)$. Let the density function of this combined measure be $\phi(\alpha)$, $\alpha\in[\underline{\alpha},\bar{\alpha}]$. Assumptions are adjusted to focus on $\alpha$:

All the individuals have the same utility function.
$\frac{dN}{dG}>0$ and $\frac{d^2N}{dG^2}\ge 0$.
$V(g)$ is concave and increasing in $g$, so $\frac{dV}{dg}>0$ and $\frac{d^2V}{dg^2}<0$.
The impure altruism/culpability variable, $\alpha$, is exogenous; that is, it is determined externally and does not change as $g$ changes.
$\frac{\partial c}{\partial\alpha}>0$, $\frac{\partial c}{\partial g}>0$, and purchase costs are subsumed by the cost function.
$\frac{\partial^2c}{\partial g^2}\ge 0$.
Interaction between consumption and culpability: $\frac{\partial^2c}{\partial\alpha\,\partial g}>0$.

The relationship between the following propositions denoted in terms of $\alpha$ and the disposition and situation parameters $\theta$ and $\omega$ follows in a straightforward way through application of the chain rule. We assume $\frac{\partial\alpha}{\partial\theta}>0$ and $\frac{\partial\alpha}{\partial\omega}>0$, and that the interaction between situation and disposition is again governed by the complementarity parameter $\frac{\partial^2\alpha}{\partial\theta\,\partial\omega}$. We return to the three-way interaction between tax rates, situation, and disposition in Section 4.4.

Assume the government knows the distribution of $\alpha$. Moreover, the government also knows the response function of people, $g^*=h(\alpha,T)$, based on the tax $T$ set by the government for each unit of consumption. People take $T$ as exogenous, and $g^*=h(\alpha,T)$ is the solution to the utility maximization problem faced by each individual:

$$\max_{g}\; s=V(g)-N(G)-c(\alpha,g)-gT=u-gT.$$

Total social welfare is

$$\Pi=s_1+\cdots+s_I+GT=u_1+\cdots+u_I-(g_1+\cdots+g_I)T+GT=\sum_{i=1}^{I}u_i=\sum_{i=1}^{I}\left[V(g_i)-N(G)-c(\alpha,g_i)\right].$$

Since $\alpha$ is a random variable, the government maximizes the expected value of $\Pi$:

$$\max_{T}\; E[\Pi]=\sum_{i=1}^{I}E\left[V(g_i^*)-N(G^*)-c(\alpha,g_i^*)\right].$$
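The government's problem can be sketched numerically as a search over $T$ given a response function $h(\alpha,T)$. The example below is a simplified benchmark, not the paper's general model: it assumes $V(g)=2\sqrt{g}$, a linear externality $N(G)=nG$, a linear psychic cost $c(\alpha,g)=\alpha g$, a discrete distribution of $\alpha$, and approximates $G^*$ by $I\cdot E[g^*]$. All names and numbers are hypothetical.

```python
import math

def response(alpha, tax, n):
    """g* = h(alpha, T) from the FOC 1/sqrt(g) - n - alpha - T = 0,
    under the assumed forms V(g) = 2*sqrt(g), N(G) = n*G, c = alpha*g."""
    return 1.0 / (n + alpha + tax) ** 2

def expected_welfare(tax, types, n, pop):
    """Per-capita E[V(g*) - N(G*) - c(alpha, g*)].

    `types` is a list of (alpha, probability) pairs. G* is approximated
    by pop * E[g*], i.e. population shares equal the probabilities.
    """
    mean_g = sum(p * response(a, tax, n) for a, p in types)
    private = sum(p * (2.0 * math.sqrt(response(a, tax, n))
                       - a * response(a, tax, n)) for a, p in types)
    return private - n * pop * mean_g

def optimal_tax(types, n=0.05, pop=10, t_max=2.0, steps=2000):
    """Grid search for the welfare-maximizing per-unit tax."""
    grid = [t_max * k / steps for k in range(steps + 1)]
    return max(grid, key=lambda t: expected_welfare(t, types, n, pop))

mostly_low = [(0.2, 0.9), (1.0, 0.1)]
mostly_high = [(0.2, 0.1), (1.0, 0.9)]
print(optimal_tax(mostly_low), optimal_tax(mostly_high))
```

In this linear special case the search returns the uninternalized marginal damage $n(I-1)$ for either distribution, because the marginal externality does not vary with aggregate consumption. The propositions that follow concern more general settings in which the optimal tax can respond to the distribution of $\alpha$.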

4.2 Stochastic Dominance of Impure Altruism and Effect on Taxation

This section explores the stochastic dominance of impure altruism and the corresponding effect on government’s tax decision. First-order stochastic dominance (FSD) transformations refer to stochastically larger impure altruism in the society and second-order stochastic dominance (SSD) transformations refer to stochastically less volatile impure altruism in the society.

We now make pragmatic adjustments to the utility function of each individual by taking into consideration a budget constraint faced by individuals, b, and a real price of the good, x. Both b and x are positive, and we assume x<b. So each individual maximizes the following problem:

$$\max_{g}\; s=V(g)-N(G)-c(\alpha,g)-Tg-xg,$$

subject to

$$g(x+T)\le b,\qquad g\ge 0.$$

These adjustments do not change $\Pi$, so the government faces the same problem:

$$\max_{T\ge 0}\; E[\Pi(T,\alpha)]$$

where, as we recall, $\Pi=\sum_{i=1}^{I}u_i$. Following Ormiston (1992), we have the next two propositions.

Proposition 8:

First-Order Stochastic Dominance. By Lemma 8.1 (see Appendix A.8), an interior solution $T^*$ exists for FSD. $T^*$ increases (decreases) for all FSD transformations of $\alpha$ if $u_{T\alpha}(T,\alpha)\ge 0$ ($u_{T\alpha}(T,\alpha)\le 0$) everywhere.

(See Appendix A.9 for Proof of Proposition 8.)

Therefore,

The government will increase tax when impure altruism is stochastically larger in the society when $\frac{d^2g^*}{dT\,d\alpha}\ge 0$, i.e., when tax and impure altruism are complements.
The government will decrease tax when impure altruism is stochastically larger in the society when $\frac{d^2g^*}{dT\,d\alpha}\le 0$, i.e., when tax and impure altruism are substitutes.
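Whether tax and impure altruism count as complements or substitutes in this sense can be read off the sign of the cross-partial of the response function. The sketch below estimates $\frac{d^2g^*}{d\alpha\,dT}$ by central finite differences for one assumed response function, $h(\alpha,T)=1/(n+\alpha+T)^2$ (which follows from the hypothetical forms $V(g)=2\sqrt{g}$, $N(G)=nG$, $c=\alpha g$). The function names and parameter values are illustrative assumptions.

```python
def response(alpha, tax, n=0.05):
    """Assumed response function g* = h(alpha, T) = 1/(n + alpha + T)**2."""
    return 1.0 / (n + alpha + tax) ** 2

def cross_partial(h, alpha, tax, eps=1e-4):
    """Central finite-difference estimate of d^2 g* / (d alpha dT)."""
    return (h(alpha + eps, tax + eps) - h(alpha + eps, tax - eps)
            - h(alpha - eps, tax + eps) + h(alpha - eps, tax - eps)) / (4.0 * eps ** 2)

def classify(h, alpha, tax):
    """Label the tax/altruism interaction by the sign of the cross-partial."""
    value = cross_partial(h, alpha, tax)
    if value > 0:
        return "complements"
    if value < 0:
        return "substitutes"
    return "neither"

label = classify(response, alpha=0.5, tax=0.45)
print(label, cross_partial(response, 0.5, 0.45))
```

For this particular response function the cross-partial is $6/(n+\alpha+T)^4>0$, so the example falls in the "complements" case. Other functional forms can give the opposite sign.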

Proposition 9:

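Second-Order Stochastic Dominance. $T^*$ increases (decreases) for all SSD transformations of $\alpha$ if $u_{T\alpha}(T,\alpha)\ge 0$ and $u_{T\alpha\alpha}(T,\alpha)\le 0$ ($u_{T\alpha}(T,\alpha)\le 0$ and $u_{T\alpha\alpha}(T,\alpha)\ge 0$) everywhere.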

(See Appendix A.10 for Proof of Proposition 9.)

Therefore,

The government will increase tax when impure altruism is stochastically less volatile provided that $\frac{d^2g^*}{d\alpha\,dT}\ge 0$ and $\frac{d}{d\alpha}\left(\frac{d^2u}{d\alpha\,dT}\right)\le 0$, i.e., when tax and impure altruism are complements and $\frac{d}{d\alpha}\left(\frac{d^2u}{d\alpha\,dT}\right)=T\cdot\frac{d}{d\alpha}\left(\frac{d^2g^*}{d\alpha\,dT}\right)\le 0$.
The government will decrease tax when impure altruism is stochastically less volatile provided that $\frac{d^2g^*}{d\alpha\,dT}\le 0$ and $\frac{d}{d\alpha}\left(\frac{d^2u}{d\alpha\,dT}\right)\ge 0$, i.e., when tax and impure altruism are substitutes and $\frac{d}{d\alpha}\left(\frac{d^2u}{d\alpha\,dT}\right)=T\cdot\frac{d}{d\alpha}\left(\frac{d^2g^*}{d\alpha\,dT}\right)\ge 0$.

It is worth noting that the complementarity discussed in these propositions differs from the one discussed in Section 3. Whereas before we were interested in how dispositional altruism and situational altruism interact, Propositions 8 and 9 describe the complementarity between impure altruism and taxes. We discuss the relationship between all three of these (taxes, dispositional altruism, and situational altruism) in Section 4.4.

4.3 Taxation under Binary Types

To better understand the intuition of how government should set taxes in relation to altruistic preferences, consider the government’s taxation decision for a more specific functional form where there are only two types of $\alpha$. Assume $\alpha\in\{\alpha_H,\alpha_L\}$, where $\alpha_H$ denotes high impure altruism and $\alpha_L$ denotes low impure altruism. More specifically,

$$\alpha=\begin{cases}\alpha_H, & \text{with a probability of } p\\ \alpha_L, & \text{with a probability of } 1-p\end{cases}$$

where $p\in(0,1)$.

Proposition 10:

Taxation under Binary Types of Impure Altruism. Suppose the assumptions needed for Propositions 8 and 9 also hold. Then

$$\operatorname{sign}\left(\frac{dT^*}{dp}\right)=\operatorname{sign}\left(\frac{d^2g^*}{d\alpha\,dT}\right).$$

(See Appendix A.11 for Proof of Proposition 10.)

Therefore, if $\frac{d^2g^*}{d\alpha\,dT}>0$, that is, tax and impure altruism are complements, then the government will tax more if people are more likely to be impurely altruistic. If $\frac{d^2g^*}{d\alpha\,dT}<0$, that is, tax and impure altruism are substitutes, then the government will tax less if people are more likely to be impurely altruistic. This proposition is consistent with the results in Section 4.2, since the binary assumption for impure altruism is just a specific case of the distribution of $\alpha$.

So far, our discussion about the taxation implications has focused on the single parameter α that captures the randomness in impure altruism among the population. As mentioned earlier, α is defined as a function of both situation and disposition. In the next section, we decompose α to account for the individual effect of θ and ω.

4.4 Taxation under Impure Altruism Interactions

In this section, we assume that the government takes into consideration both dispositional altruism $\theta$ and situational altruism $\omega$ when determining the corrective tax, $T$. $\theta$ differs for each person in the economy with a density function of $\varphi(\theta)$, $\theta\in[\underline{\theta},\bar{\theta}]$. $\omega$ also differs for each person in the economy with a density function of $\psi(\omega)$, $\omega\in[\underline{\omega},\bar{\omega}]$. We assume that $\theta$ and $\omega$ are independently distributed. Assumptions are as (A1)–(A7) in Section 3.2. We make an additional assumption analogous to (A8) in the derivation of Proposition 5:

(A8′) The psychic cost function is $c(\theta,\omega,g)=\tilde{c}(\theta,\omega)\cdot\hat{c}(g)$, where $\frac{\partial\tilde{c}}{\partial\theta}>0$, $\frac{\partial\tilde{c}}{\partial\omega}>0$, $\frac{\partial\hat{c}}{\partial g}>0$, and $\frac{\partial^2\hat{c}}{\partial g^2}\ge 0$.

Assume the government knows the distributions of $\theta$ and $\omega$. Moreover, the government also knows the response function, $g^*=h(\theta,\omega,T)$, based on the tax $T$ set by the government for each unit of consumption. People take $T$ as exogenous, and $g^*=h(\theta,\omega,T)$ is the solution to the utility maximization problem faced by each individual:

$$\max_{g}\; s=V(g)-N(G)-c(\theta,\omega,g)-gT=u-gT.$$

Total social welfare is

$$\Pi=s_1+\cdots+s_I+GT=\sum_{i=1}^{I}u_i.$$

The government maximizes the expected social welfare:

$$\max_{T}\; E[\Pi]=\sum_{i=1}^{I}E\left[V(g_i^*)-N(G^*)-c(\theta,\omega,g_i^*)\right].$$

Proposition 11:

Effect of Impure Altruism on Optimal Tax. Assuming (A2), (A3), (A6), and (A8′), we have

$$\frac{dT^*}{d\theta}\ge 0 \quad\text{and}\quad \frac{dT^*}{d\omega}\ge 0.$$

The equalities hold if and only if $\frac{\partial^2\hat{c}}{\partial g^2}=0$.

(See Appendix A.12 for Proof of Proposition 11.)

While our earlier results about the stochastic dominance of impure altruism suggest that the effect on taxation depends on the relationship between the tax rate and the degree of altruism, here the additional assumption (A8′) allows us to determine the direction of the effect. When the psychic cost function is separable in terms of the impure altruism component and the consumption component, we derive that the optimal tax is nondecreasing in one type of impure altruism conditional on the other type.

The next proposition allows us to better understand the optimal tax level in light of two types of interaction. The relationship between each person’s individual situation and disposition based on one’s own psychic cost functions is referred to as individual substitutes (complements). The relationship between the population’s situation and disposition and the choice of tax rate is referred to as policy substitutes (complements).

Proposition 12:

Optimal Tax under Interacting Impure Altruism. Assuming (A2), (A3), (A6), and (A8′), we have that the optimal tax depends on the relationship between situation and disposition:

a. When the impacts of situation and disposition on psychic costs are individual substitutes, then the impacts of situation and disposition on optimal tax are policy complements:

$$\frac{d^2\tilde{c}}{d\omega\,d\theta}<0\;\Rightarrow\;\frac{d^2T^*}{d\omega\,d\theta}>0.$$

b. When the impacts of situation and disposition on optimal tax are policy substitutes, we need the impacts of situation and disposition on psychic costs to be sufficiently individually complementary:

$$\frac{d^2T^*}{d\omega\,d\theta}<0 \;\text{ if and only if }\; \frac{d^2\tilde{c}}{d\omega\,d\theta}>0 \text{ and large enough.}$$

(See Appendix A.13 for Proof of Proposition 12.)

This proposition effectively says:

(i) When situation and disposition are individual substitutes:
1. $\omega$ has a larger effect on $T^*$ if $\theta$ is larger. Situation has a larger effect on the optimal tax when people are inclined to be good. The optimal dispersion of tax rates for subpopulations experiencing different levels of situational guilt is larger when the population disposition is more altruistic. If tax rates cannot vary by situational guilt, the inefficiency becomes larger when the population has a more altruistic disposition.
2. $\theta$ has a larger effect on $T^*$ if $\omega$ is larger. Disposition has a larger effect on the optimal tax when people experience high situational guilt. The optimal dispersion of tax rates for subpopulations with different dispositions for altruism is larger when the experienced situational guilt is higher. If tax rates cannot vary by disposition, then the inefficiency becomes larger when the population is more situationally guilty.

(ii) When situational and dispositional altruism are individual complements, the interaction effect on $T^*$ is ambiguous. However, for situation and disposition sufficiently complementary, we have a symmetric result to (i):
1. $\omega$ has a larger effect on $T^*$ if $\theta$ is smaller. Situation has a larger effect on the optimal tax when people are inclined to be bad. The optimal tax dispersion for subpopulations experiencing different levels of situational guilt is larger when the population disposition is lower. If tax rates cannot vary by situational guilt, the inefficiency becomes larger when the population has a less altruistic disposition.
2. $\theta$ has a larger effect on $T^*$ if $\omega$ is smaller. Disposition has a larger effect on the optimal tax when people experience low situational guilt. The optimal tax dispersion for subpopulations with different dispositions is larger when the experienced situational guilt is lower. If tax rates cannot vary by disposition, the inefficiency becomes larger when the population is less situationally guilty.

5 Discussion: Implications for Policy

As mentioned in the introduction, local authorities and central governments in the UK, the USA, and France have been avidly engaged in the use of informational nudges to increase tax payments, to encourage more retirement savings, and to improve energy efficiency, among other goals. However, Akerlof and Kennedy (2013) note limited success over the past 15 years of campaigns using messages of social comparison in areas including drug and alcohol use, recycling, and littering. Our model shows that informational nudges may have very limited impact for three reasons. First, a nudge is less effective when it fails to induce high situational altruism. Cialdini et al. (2006) point out that informational nudges, when purely descriptive, may not be effective in mitigating socially disapproved conduct and may even worsen the problem. To address this counterproductive licensing effect, policy makers should pay attention to how social messages are conveyed.

Second, our model shows that nudges may have very limited impact if low dispositional altruism is dominant in the targeted population. Hence, it is crucial for the government to understand the disposition of certain groups of people when designing public programs to induce pro-social behavior through nudges. Moreover, better understanding of people’s disposition will allow the government to target certain subpopulations and use its resources more efficiently.

Third, even when a nudge successfully induces high situational altruism and when people’s disposition can be identified, the effectiveness of nudges still depends on the relationship between nudges and disposition. While existing work on mechanisms behind situational nudges can provide guidance on this relationship, experiments should be reanalyzed and redesigned to test theories about how certain subpopulations’ situation and disposition interact, and how different types of nudges target different subpopulations. If a nudge can be designed so that people view it as a substitute for disposition, then it may be used to target people with a bad disposition. If a nudge can be designed so that people view it as a complement to disposition, then that nudge should be used to target people inclined to be good.

Some types of dispositional impure altruism include conformity, moral obligation, group identity, etc. Some examples of informational nudges that can induce situational altruism include the use of heuristics (anchoring, availability, and representativeness), framing, emotion arousing, feedback, depiction of social norms, and peer comparison (Thaler and Sunstein, 2008). Others are based on social disapproval, status, or social pressure (Kandel and Lazear, 1992). While our model suggests that pro-social behavior following a nudge is affected by the relationship between disposition and situation, we have not addressed how to identify whether disposition and situation are complements or substitutes in a specific real-life case. However, our model provides a foundation for testing hypotheses based on field experiments. For instance, if situation and disposition are complements, then according to our model, a nudge that induces situational guilt will be more effective for those with high disposition. Costa and Kahn (2010) find in a field experiment that environmentalists are more responsive to green nudges than the average person, implying that situation and disposition are very likely complements in this case.

This paper also considers taxes as an alternative or supplemental tool of public intervention for inducing socially optimal behavior and shows that the optimal taxes depend on the relationship between the tax and impure altruism. Consequently, two important implications arise. First, if we consider the effect of taxes on dispositional altruism, then identifying subpopulations with different distributions of $\theta$ and targeting them with different optimal taxes can be welfare enhancing. Alternatively, if we consider the effect of taxes on situational altruism induced by nudges, then the optimal taxes should be responsive to the behavioral policies being implemented. Also, while in principle it would be possible to implement a tax system that induces truthful revelation of private information about one’s disposition for altruism along the lines of Mirrlees (1971), in practice such a tax policy would likely require large information rents and would be unlikely to be adopted. Our tax results, therefore, also provide guidance regarding the size of the welfare loss when tax policy cannot account for the altruism of the population. In our framework, it is also worth noting that given a choice between a tax and a nudge, the tax is generally preferable. Assuming the government revenue is efficiently allocated, the tax costlessly reduces the externality, while the nudge imposes additional psychic costs.

6 Conclusions

Ideally, informational nudges can be used as tools to induce pro-social behavior at the individual level and to reduce negative externalities at the public level. The ultimate question is whether or when the nudges are effective. When nudges fail to induce high situational altruism, they are less effective or can even have a counterproductive moral licensing effect. But even when nudges do induce high situational altruism, the pro-social impact of a nudge can still be foiled if the intervention targets low-disposition individuals. Furthermore, the interaction between situational nudges and disposition serves as an additional explanation for the heterogeneous effects. An effective nudge has a larger impact on people inclined to be bad if people view situation and disposition as substitutes, but the nudge has a larger impact on people inclined to be good if people view situation and disposition as complements. Our conclusions can also be extended to environments with multiple goods or multiple domains.

From a government standpoint, taxes serve as the standard tool used to enhance social welfare given externalities. We show that the optimal tax level is dependent on the distribution of preferences for altruism (both dispositional and situational) as well as the interdependence between people’s altruistic preferences. We show that limiting the tax base by targeting taxes on certain subpopulations may be welfare enhancing.

Finally, the relationship between situation and disposition is unobservable and relatively unexplored. So is the relationship between taxes and impure altruism. This paper aims to fill the theoretic gap in the literature for the policy implications of nudges and to provide a framework for future empirical research uncovering these relationships.

Acknowledgments

We thank seminar and conference participants at Harvard and Vassar for their helpful comments, as well as funding from the Mr. and Mrs. Noah Barnhart, Jr. Fund. In particular, we thank Evsen Turkay and Paul Ruud for very helpful feedback.

Appendix

A.1 Proposition 1 and Proof

Proposition 1:

Tragedy of the Commons. Assuming (A2), (A3), and (A6), the individually rational choices of consumption of the antisocial good, $g^*$ and $G^*$, are greater than the socially optimal levels of consumption, $g^{**}$ and $G^{**}$:

$$G^*>G^{**}\quad\text{and}\quad g^*>g^{**}.$$

Proof:

Each individual faces the maximization problem below:

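$$\max_{g_i}\; u_i=V(g_i)-N(G)-c(g_i,\theta,\omega) \tag{EQ1}$$

where $u_i$ is the utility of the $i$th individual and $g_i$ is the consumption level chosen by that individual. The first-order condition is

$$\frac{dV(g_i)}{dg_i}-\frac{dN(G)}{dg_i}-\frac{dc(g_i,\theta,\omega)}{dg_i}=0.$$

If $(g_1^*,g_2^*,\ldots,g_I^*)$ is a Nash equilibrium, then $g_i^*$ maximizes (EQ1) given that the others choose $(g_1^*,\ldots,g_{i-1}^*,g_{i+1}^*,\ldots,g_I^*)$. Because of the symmetry of the equilibrium, we have $g_1^*=\cdots=g_i^*=\cdots=g_I^*=g^*$. The total units of the good consumed are $G^*=Ig_i^*=Ig^*$ and total social utility is $U_G^*=Iu_i^*$. We have

$$\frac{dV(g^*)}{dg}-\frac{dN(G^*)}{dg_i}-\frac{dc(g^*,\theta,\omega)}{dg}=0. \tag{EQ2}$$

However, the socially optimal level of consumption, $G^{**}=Ig^{**}$, is found by maximizing the total utility of the society, $U_G$, through solving the problem below:

$$\max_{G}\; U_G=I\,V\!\left(\tfrac{G}{I}\right)-I\,N(G)-I\,c\!\left(\tfrac{G}{I},\theta,\omega\right)$$

and the first-order condition is

$$I\,\frac{dV\!\left(\tfrac{G}{I}\right)}{dG}-I\,\frac{dN(G)}{dG}-I\,\frac{dc\!\left(\tfrac{G}{I},\theta,\omega\right)}{dG}=0.$$

So we have

$$\frac{dV(g^{**})}{dg}-\frac{dN(G^{**})}{dg}-\frac{dc(g^{**},\theta,\omega)}{dg}=0. \tag{EQ3}$$

Now we compare (EQ2) and (EQ3). We have

$$\frac{dV(g^*)}{dg}-\frac{dN(G^*)}{dg}-\frac{dc(g^*,\theta,\omega)}{dg}=\frac{dV(g^{**})}{dg}-\frac{dN(G^{**})}{dg}-\frac{dc(g^{**},\theta,\omega)}{dg}.$$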

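Suppose $G^*\le G^{**}$ and $g^*\le g^{**}$. Then by (A3) and (A6), we have $\frac{dV(g^*)}{dg}\ge\frac{dV(g^{**})}{dg}$ and $\frac{dc(g^*,\theta,\omega)}{dg}\le\frac{dc(g^{**},\theta,\omega)}{dg}$, so we must have $\frac{dN(G^*)}{dg}>\frac{dN(G^{**})}{dg}$. By (A2), $\frac{dN(G^*)}{dG}\le\frac{dN(G^{**})}{dG}$, so we have $\frac{dN(G^*)}{dg}\le\frac{dN(G^{**})}{I\cdot dg}$. This implies that $I\cdot\frac{dN(G^*)}{dg}\le\frac{dN(G^{**})}{dg}<\frac{dN(G^*)}{dg}$. Since $\frac{dN(G^*)}{dg}>0$, when $I\ge 1$ it cannot be that $I\cdot\frac{dN(G^*)}{dg}<\frac{dN(G^*)}{dg}$, which is a contradiction. Therefore, $G^*>G^{**}$ and $g^*>g^{**}$.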

If every individual aims to maximize his or her own utility by choosing g privately instead of coordinating and choosing G collectively, the group will choose a higher level of total amount of the good and cause more negative externality. Then,

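$$U_G(G^{**})>U_G(G^*)=Iu_i=I\,V(g_i)-I\,N(G)-I\,c(g_i,\theta,\omega)$$

$$\Rightarrow\;\frac{U_G(G^{**})}{I}>\frac{U_G(G^*)}{I}\;\Rightarrow\;u_i^{**}>u_i^*.$$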
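A small numerical instance may help fix ideas for Proposition 1. The sketch assumes $V(g)=2\sqrt{g}$, $N(G)=nG$, and $c=\theta\,\omega\,g$, so that both first-order conditions have closed-form solutions. These functional forms, the parameter values, and the function names are hypothetical choices for illustration only.

```python
import math

def nash_consumption(n, theta, omega):
    # Individual FOC: 1/sqrt(g) - n - theta*omega = 0
    return 1.0 / (n + theta * omega) ** 2

def planner_consumption(n, pop, theta, omega):
    # Planner FOC per capita: 1/sqrt(g) - pop*n - theta*omega = 0
    return 1.0 / (pop * n + theta * omega) ** 2

def per_capita_utility(g, n, pop, theta, omega):
    # Symmetric profile, so G = pop * g
    return 2.0 * math.sqrt(g) - n * pop * g - theta * omega * g

n, pop, theta, omega = 0.05, 10, 0.5, 1.0
g_star = nash_consumption(n, theta, omega)
g_opt = planner_consumption(n, pop, theta, omega)
u_star = per_capita_utility(g_star, n, pop, theta, omega)
u_opt = per_capita_utility(g_opt, n, pop, theta, omega)
print(g_star, g_opt, u_star, u_opt)
```

With these numbers the privately chosen consumption exceeds the planner's level and per-capita utility is lower at the Nash outcome, matching the direction of the inequalities above.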

A.2 Proposition 2 and Proof

Proposition 2:

Effect of Impure Altruism/Culpability on Consumption. Assuming (A2), (A3), (A6), and (A7), consumption of the antisocial good is decreasing in disposition and situation:

$$\frac{dg^*}{d\theta}<0\quad\text{and}\quad\frac{dg^*}{d\omega}<0.$$

In other words, an individual with high disposition or situational culpability will choose a smaller $g$ than people with low disposition or situational culpability.^[4]

Proof:

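According to the Implicit Function Theorem and by (A2), (A3), (A6), and (A7),

$$\frac{dg^*}{d\theta}=\frac{\dfrac{\partial^2c}{\partial\theta\,\partial g}}{\dfrac{d^2V}{dg^2}-\dfrac{d^2N}{dg^2}-\dfrac{d^2c}{dg^2}}<0,$$

$$\frac{dg^*}{d\omega}=\frac{\dfrac{\partial^2c}{\partial\omega\,\partial g}}{\dfrac{d^2V}{dg^2}-\dfrac{d^2N}{dg^2}-\dfrac{d^2c}{dg^2}}<0.$$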
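The implicit-function expression for $dg^*/d\theta$ can be compared with a direct finite-difference estimate. The sketch below does this for assumed forms $V(g)=2\sqrt{g}$, $N(G)=nG$, and $c=\theta\,\omega\,g^2/2$; the forms, parameter values, and function names are hypothetical and serve only as a consistency check.

```python
import math

def equilibrium(theta, omega, n=0.05):
    """Bisection on the assumed FOC 1/sqrt(g) - n - theta*omega*g = 0."""
    lo, hi = 1e-12, 100.0
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if 1.0 / math.sqrt(mid) - n - theta * omega * mid > 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def slope_ift(theta, omega, n=0.05):
    """dg*/dtheta from the implicit-function formula."""
    g = equilibrium(theta, omega, n)
    numerator = omega * g                                  # d^2c / (dtheta dg)
    denominator = -0.5 * g ** -1.5 - 0.0 - theta * omega   # V'' - N'' - c_gg
    return numerator / denominator

def slope_fd(theta, omega, n=0.05, eps=1e-5):
    """dg*/dtheta from a central finite difference of the solved g*."""
    return (equilibrium(theta + eps, omega, n)
            - equilibrium(theta - eps, omega, n)) / (2.0 * eps)

print(slope_ift(0.5, 1.0), slope_fd(0.5, 1.0))
```

Both estimates are negative and agree closely in this example, as the proposition's sign result would suggest.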

A.3 Proposition 3 and Proof

Proposition 3:

Effect of Impure Altruism/Culpability on Utility. Assuming (A2), (A3), (A5), and (A7), if the marginal disutility of increased altruism is greater (less) than the marginal utility from mitigating the externality, then altruism is welfare decreasing (increasing).

Proof:

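By the Envelope Theorem, we have

$$\begin{aligned}\frac{\partial\hat{u}}{\partial\theta}=\frac{d\hat{u}^*}{d\theta}&=\frac{d}{d\theta}\left[V(g^*)-c(g^*,\theta,\omega)\right]\\&=\frac{dV}{dg}\frac{dg^*}{d\theta}-\frac{\partial c}{\partial g}\frac{dg^*}{d\theta}-\frac{\partial c}{\partial\theta}\\&=\left.\left[\left(\frac{dV}{dg}-\frac{\partial c}{\partial g}\right)\frac{dg}{d\theta}-\frac{\partial c}{\partial\theta}\right]\right|_{g^*}\\&=\left.\left[\left(\frac{dV}{dg}-\frac{dc}{dg}\right)\frac{dg}{d\theta}-\frac{\partial c}{\partial\theta}\right]\right|_{g^*}\\&=\left.\left(-\frac{\partial c}{\partial\theta}\right)\right|_{g^*}<0\end{aligned}$$

as $\frac{\partial c}{\partial g}=\frac{dc}{dg}$, $\left.\left(\frac{dV}{dg}-\frac{dc}{dg}\right)\right|_{g^*}=0$ (by the first-order condition), and $\frac{\partial c}{\partial\theta}>0$ by our assumption (A5).

Similarly,

$$\frac{\partial\hat{u}}{\partial\omega}<0.$$

The total social externality is $iN(G^*)$. We have

$$\frac{dN(G^*)}{d\theta}=\left.\frac{dN}{dG}\frac{dG}{dg}\frac{dg}{d\theta}\right|_{g^*}=\left.\frac{dN}{dG}\frac{dg}{d\theta}\right|_{g^*}<0$$

as $\frac{dN}{dG}>0$ by assumption (A2) and $\frac{dg^*}{d\theta}<0$ by Proposition 2.

Similarly,

$$\frac{dN(G^*)}{d\omega}<0.$$

To study the effect of impure altruism on total social utility at $g^*$, we examine

$$\left.\frac{\partial(iu)}{\partial\theta}\right|_{g^*}=\left.i\,\frac{\partial\hat{u}}{\partial\theta}\right|_{g^*}-\left.i\,\frac{dN}{d\theta}\right|_{g^*}$$

and

$$\left.\frac{\partial(iu)}{\partial\omega}\right|_{g^*}=\left.i\,\frac{\partial\hat{u}}{\partial\omega}\right|_{g^*}-\left.i\,\frac{dN}{d\omega}\right|_{g^*}.$$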

A.4 Proposition 4 and Proof

Proposition 4:

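Effect of Impure Altruism/Culpability on Social Welfare Loss. Assuming (A2), (A3), (A5), and (A7), the change in welfare loss due to dispositional and situational altruism is

$$\frac{\partial WL}{\partial\theta}=\underbrace{i\left[\frac{\partial c(g^*,\theta,\omega)}{\partial\theta}-\frac{\partial c(g^{**},\theta,\omega)}{\partial\theta}\right]}_{\text{Part I},\;>0}+\underbrace{i\,\frac{dN(G^*)}{d\theta}}_{\text{Part II},\;<0},$$

and

$$\frac{\partial WL}{\partial\omega}=\underbrace{i\left[\frac{\partial c(g^*,\theta,\omega)}{\partial\omega}-\frac{\partial c(g^{**},\theta,\omega)}{\partial\omega}\right]}_{\text{Part I},\;>0}+\underbrace{i\,\frac{dN(G^*)}{d\omega}}_{\text{Part II},\;<0}.$$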

Social welfare loss represents the gap between first best and equilibrium consumption as defined in Proposition 1. Part I represents how the difference in the marginal psychic costs changes with θ or ω and this part is positive. Part II represents how the marginal social externality at g∗ (or equivalently, G∗) changes with θ or ω and this part is negative. The net effect of impure altruism on social welfare loss depends on the relative magnitude of Part I and Part II.

Proof:

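$$\begin{aligned}\frac{\partial WL}{\partial\omega}&=\frac{\partial U^{**}}{\partial\omega}-\frac{\partial U^{*}}{\partial\omega}\\&=\frac{\partial U_G(G^{**})}{\partial\omega}-\frac{\partial}{\partial\omega}\left[i\,\hat{u}(g^*)-i\,N(G^*)\right]\\&=\frac{\partial}{\partial\omega}\left[i\,V\!\left(\tfrac{G^{**}}{i}\right)-i\,N(G^{**})-i\,c\!\left(\tfrac{G^{**}}{i},\theta,\omega\right)\right]-i\,\frac{\partial}{\partial\omega}\left[V(g^*)-c(g^*,\theta,\omega)\right]+i\,\frac{\partial N(G^*)}{\partial\omega}\\&=i\left(-\frac{\partial c(g^{**},\theta,\omega)}{\partial\omega}\right)-i\left(-\frac{\partial c(g^{*},\theta,\omega)}{\partial\omega}\right)+i\,\frac{dN(G^*)}{d\omega}\\&=\underbrace{i\left[\frac{\partial c(g^*,\theta,\omega)}{\partial\omega}-\frac{\partial c(g^{**},\theta,\omega)}{\partial\omega}\right]}_{\text{Part I},\;>0}+\underbrace{i\,\frac{dN(G^*)}{d\omega}}_{\text{Part II},\;\le 0}\end{aligned}$$

by assumptions (A2) and (A7), $g^*>g^{**}$ (Proposition 1), and $\frac{dN(G^*)}{d\omega}<0$ (Proof of Proposition 3). The sign of $\frac{dWL}{d\omega}$ can be either positive or negative, depending on the relative magnitude of Part I and Part II.

Similarly,

$$\frac{\partial WL}{\partial\theta}=\underbrace{i\left[\frac{\partial c(g^*,\theta,\omega)}{\partial\theta}-\frac{\partial c(g^{**},\theta,\omega)}{\partial\theta}\right]}_{\text{Part I},\;>0}+\underbrace{i\,\frac{dN(G^*)}{d\theta}}_{\text{Part II},\;<0}.$$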

A.5 Proof of Proposition 5

Taking the cross-derivative, we have

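$$\begin{aligned}\frac{d}{d\omega}\left(\frac{dg^*}{d\theta}\right)&=\left.\frac{d}{d\omega}\left(\frac{\dfrac{\partial^2c}{\partial\theta\,\partial g}}{\dfrac{d^2V}{dg^2}-\dfrac{d^2N}{dg^2}-\dfrac{d^2c}{dg^2}}\right)\right|_{g^*}\\&=\left.\frac{\left(\dfrac{d^2V}{dg^2}-\dfrac{d^2N}{dg^2}-\dfrac{d^2c}{dg^2}\right)\cdot\dfrac{d}{d\omega}\left(\dfrac{\partial^2c}{\partial\theta\,\partial g}\right)+\dfrac{\partial^2c}{\partial\theta\,\partial g}\cdot\dfrac{d}{d\omega}\left(\dfrac{\partial^2c}{\partial g^2}\right)}{\left(\dfrac{d^2V}{dg^2}-\dfrac{d^2N}{dg^2}-\dfrac{d^2c}{dg^2}\right)^2}\right|_{g^*}\end{aligned}$$

Based on (A2), (A3), (A6), and (A8),

$$\begin{aligned}\frac{d}{d\omega}\left(\frac{dg^*}{d\theta}\right)&=\left.\frac{d}{d\omega}\left(\frac{\dfrac{\partial^2c}{\partial\theta\,\partial g}}{\dfrac{d^2V}{dg^2}-\dfrac{d^2N}{dg^2}}\right)\right|_{g^*}\\&=\left.\frac{\left(\dfrac{d^2V}{dg^2}-\dfrac{d^2N}{dg^2}\right)\cdot\dfrac{d}{d\omega}\left(\dfrac{\partial^2c}{\partial\theta\,\partial g}\right)-\dfrac{\partial^2c}{\partial\theta\,\partial g}\cdot\dfrac{d}{d\omega}\left(\dfrac{d^2V}{dg^2}-\dfrac{d^2N}{dg^2}\right)}{\left(\dfrac{d^2V}{dg^2}-\dfrac{d^2N}{dg^2}\right)^2}\right|_{g^*}\\&=\left.\frac{\left(\dfrac{d^2V}{dg^2}-\dfrac{d^2N}{dg^2}\right)\cdot\dfrac{d}{d\omega}\left(\dfrac{\partial^2c}{\partial\theta\,\partial g}\right)}{\left(\dfrac{d^2V}{dg^2}-\dfrac{d^2N}{dg^2}\right)^2}\right|_{g^*}.\end{aligned}$$

Since $\frac{d^2V}{dg^2}-\frac{d^2N}{dg^2}<0$, the sign of $\frac{d}{d\omega}\left(\frac{dg^*}{d\theta}\right)$ is opposite to that of $\left.\frac{d}{d\omega}\left(\frac{\partial^2c}{\partial\theta\,\partial g}\right)\right|_{g^*}$.

By (A8), we have $c(g,\theta,\omega)=g\cdot c'(\theta,\omega)$. Thus,

$$\frac{\partial^2c}{\partial\theta\,\partial g}=\frac{\partial^2\left[g\,c'(\theta,\omega)\right]}{\partial\theta\,\partial g}=\frac{\partial}{\partial\theta}c'(\theta,\omega)=\frac{\partial c'}{\partial\theta}.$$

Then

$$\begin{aligned}\left.\frac{d}{d\omega}\left(\frac{\partial^2c}{\partial\theta\,\partial g}\right)\right|_{g^*}&=\left.\left[\frac{\partial}{\partial\omega}\left(\frac{\partial^2c}{\partial\theta\,\partial g}\right)+\frac{\partial}{\partial g}\left(\frac{\partial^2c}{\partial\theta\,\partial g}\right)\frac{\partial g}{\partial\omega}\right]\right|_{g^*}\\&=\left.\left[\frac{\partial^2c'}{\partial\omega\,\partial\theta}+\frac{\partial^2c'}{\partial g\,\partial\theta}\frac{\partial g}{\partial\omega}\right]\right|_{g^*}=\left.\frac{\partial^2c'}{\partial\omega\,\partial\theta}\right|_{g^*}\end{aligned}$$

as $\frac{\partial^2c'}{\partial g\,\partial\theta}=0$. We also have $\frac{\partial^2c}{\partial\omega\,\partial\theta}=\frac{\partial^2\left[g\,c'(\theta,\omega)\right]}{\partial\omega\,\partial\theta}=g\,\frac{\partial^2c'}{\partial\omega\,\partial\theta}$, so the sign of $\frac{\partial^2c}{\partial\omega\,\partial\theta}$ is the same as that of $\frac{\partial^2c'}{\partial\omega\,\partial\theta}$,

and thus the sign of $\left.\frac{d}{d\omega}\left(\frac{\partial^2c}{\partial\theta\,\partial g}\right)\right|_{g^*}$ is the same as that of $\frac{\partial^2c}{\partial\omega\,\partial\theta}$.

Therefore, the sign of $\frac{d}{d\omega}\left(\frac{dg^*}{d\theta}\right)$ is opposite to that of $\frac{\partial^2c}{\partial\omega\,\partial\theta}$.

Similarly, the sign of $\frac{d}{d\theta}\left(\frac{dg^*}{d\omega}\right)$ is opposite to that of $\frac{\partial^2c}{\partial\theta\,\partial\omega}$.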
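The sign relation derived in this proof can be checked numerically for a cost of the form $c=g\cdot c'(\theta,\omega)$. The sketch below assumes a quadratic $V(g)=ag-g^2/2$ (so that $d^2V/dg^2$ is constant), a linear $N(G)=nG$, and a CES-type aggregator $c'(\theta,\omega)=(\theta^\rho+\omega^\rho)^{1/\rho}$, whose cross-partial is positive for $\rho<1$ and negative for $\rho>1$. These forms, the parameter $\rho$, and the function names are assumptions made only for illustration.

```python
def unit_cost(theta, omega, rho):
    """Assumed CES aggregator c'(theta, omega)."""
    return (theta ** rho + omega ** rho) ** (1.0 / rho)

def consumption(theta, omega, rho, a=5.0, n=0.5):
    """g* from the FOC a - g - n - c'(theta, omega) = 0 (interior case)."""
    return a - n - unit_cost(theta, omega, rho)

def mixed_partial(f, theta, omega, eps=1e-3):
    """Central finite-difference estimate of d^2 f / (dtheta domega)."""
    return (f(theta + eps, omega + eps) - f(theta + eps, omega - eps)
            - f(theta - eps, omega + eps) + f(theta - eps, omega - eps)) / (4.0 * eps ** 2)

for rho in (0.5, 2.0):
    cost_cross = mixed_partial(lambda t, w: unit_cost(t, w, rho), 1.0, 1.0)
    g_cross = mixed_partial(lambda t, w: consumption(t, w, rho), 1.0, 1.0)
    print(f"rho={rho}: cross-partial of c' = {cost_cross:.4f}, of g* = {g_cross:.4f}")
```

In both cases the estimated cross-partial of $g^*$ has the opposite sign to the cross-partial of $c'$, consistent with the conclusion of the proof under these assumed forms.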

A.6 Proof of Proposition 6

In the case of endogenous ω, the utility maximization problem is

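$$\max_{g_i}\; u_i=V(g_i)-N(G)-c\big(g_i,\omega(g_i,\hat{g}_i),\theta\big).$$

$$\text{F.O.C.}\;\Rightarrow\;\frac{du_i}{dg_i}=\frac{dV}{dg}-\frac{dN}{dg}-\frac{dc}{dg}=\frac{dV}{dg}-\frac{dN}{dg}-\frac{\partial c}{\partial\omega}\frac{\partial\omega}{\partial g}-\frac{\partial c}{\partial g}=0$$

In addition to (A2), (A3), (A5), and (A7), assume that $\frac{\partial\omega}{\partial g}>0$, $\frac{\partial^2\omega}{\partial g^2}>0$, and $\frac{\partial\omega}{\partial\hat{g}}<0$, and that the psychic cost function is continuous so $\frac{\partial^2c}{\partial g\,\partial\omega}=\frac{\partial^2c}{\partial\omega\,\partial g}>0$ and $\frac{\partial^2c}{\partial g\,\partial\theta}=\frac{\partial^2c}{\partial\theta\,\partial g}>0$.

According to the Implicit Function Theorem, we have

$$\frac{dg^*}{d\hat{g}}=\frac{\dfrac{\partial}{\partial\hat{g}}\left(\dfrac{\partial c}{\partial\omega}\dfrac{\partial\omega}{\partial g}\right)+\dfrac{\partial}{\partial\hat{g}}\left(\dfrac{\partial c}{\partial g}\right)}{\dfrac{d^2V}{dg^2}-\dfrac{\partial}{\partial g}\left(\dfrac{dN}{dG}\right)-\dfrac{\partial}{\partial g}\left(\dfrac{\partial c}{\partial\omega}\dfrac{\partial\omega}{\partial g}\right)-\dfrac{\partial^2c}{\partial g^2}}>0.$$

Since $\frac{d^2V}{dg^2}<0$, $\frac{\partial}{\partial g}\left(\frac{dN}{dG}\right)\ge 0$, and $\frac{\partial^2c}{\partial g^2}\ge 0$, the sign of the denominator is determined by the sign of $\frac{\partial}{\partial g}\left(\frac{\partial c}{\partial\omega}\frac{\partial\omega}{\partial g}\right)=\frac{\partial\omega}{\partial g}\frac{\partial^2c}{\partial g\,\partial\omega}+\frac{\partial c}{\partial\omega}\frac{\partial^2\omega}{\partial g^2}$. Since $\frac{\partial c}{\partial\omega}>0$, $\frac{\partial^2\omega}{\partial g^2}>0$, $\frac{\partial\omega}{\partial g}>0$, and $\frac{\partial^2c}{\partial g\,\partial\omega}>0$, we have $\frac{\partial}{\partial g}\left(\frac{\partial c}{\partial\omega}\frac{\partial\omega}{\partial g}\right)>0$. Therefore, the sign of the denominator is negative. The sign of the numerator is also negative since $\frac{\partial}{\partial\hat{g}}\left(\frac{\partial c}{\partial\omega}\frac{\partial\omega}{\partial g}\right)+\frac{\partial}{\partial\hat{g}}\left(\frac{\partial c}{\partial g}\right)=2\frac{\partial}{\partial\hat{g}}\left(\frac{\partial c}{\partial g}\right)=2\frac{\partial}{\partial\omega}\left(\frac{\partial c}{\partial g}\right)\cdot\frac{\partial\omega}{\partial\hat{g}}<0$. Thus,

$$\frac{dg^*}{d\hat{g}}>0.$$

According to the Implicit Function Theorem, we have

$$\frac{dg^*}{d\theta}=\frac{\dfrac{\partial}{\partial\theta}\left(\dfrac{\partial c}{\partial\omega}\dfrac{\partial\omega}{\partial g}\right)+\dfrac{\partial}{\partial\theta}\left(\dfrac{\partial c}{\partial g}\right)}{\dfrac{d^2V}{dg^2}-\dfrac{\partial}{\partial g}\left(\dfrac{dN}{dG}\right)-\dfrac{\partial}{\partial g}\left(\dfrac{\partial c}{\partial\omega}\dfrac{\partial\omega}{\partial g}\right)-\dfrac{\partial^2c}{\partial g^2}}.$$

As before, the sign of the denominator is negative. Now we discuss the sign of the numerator. Since $\frac{\partial}{\partial\theta}\left(\frac{\partial c}{\partial\omega}\frac{\partial\omega}{\partial g}\right)+\frac{\partial}{\partial\theta}\left(\frac{\partial c}{\partial g}\right)=2\frac{\partial}{\partial\theta}\left(\frac{\partial c}{\partial g}\right)>0$, we have

$$\frac{dg^*}{d\theta}<0.$$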

A.7 Proof of Proposition 7

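We first justify the existence claim. Let $\hat{g}_0$ and $\theta$ be given. Note that by Proposition 6, $\frac{dg^*}{d\hat{g}}>0$. We assume that we can find $\hat{g}'$ small enough such that $I\cdot g^*(\hat{g}',\theta)<G^{**}(\hat{g}_0,\theta)$. Since $I\cdot g^*(\hat{g}_0,\theta)>G^{**}(\hat{g}_0,\theta)$, under some technical conditions there exists $\hat{g}_{opt}$ between $\hat{g}'$ and $\hat{g}_0$ such that $\hat{g}=\hat{g}_{opt}$ solves the following equation:

$$I\cdot g^*(\hat{g},\theta)=G^{**}(\hat{g}_0,\theta),$$

or equivalently,

$$g^*(\hat{g},\theta)-\frac{1}{I}G^{**}(\hat{g}_0,\theta)=0.$$

To show the rest of the proposition, note that by the Implicit Function Theorem, we have

$$\frac{d\hat{g}_{opt}}{d\theta}=-\frac{\dfrac{dg^*(\hat{g},\theta)}{d\theta}-\dfrac{1}{I}\dfrac{dG^{**}(\hat{g}_0,\theta)}{d\theta}}{\dfrac{dg^*(\hat{g},\theta)}{d\hat{g}}}=\frac{\dfrac{dg^{**}}{d\theta}-\dfrac{dg^*}{d\theta}}{\dfrac{dg^*}{d\hat{g}}}.$$

By Proposition 6, $\frac{dg^*}{d\hat{g}}>0$. So the sign of $\frac{d\hat{g}_{opt}}{d\theta}$ is the same as that of $\frac{dg^{**}}{d\theta}-\frac{dg^*}{d\theta}$. Since $\frac{dg^*}{d\theta}=\left.\frac{\partial^2c/\partial\theta\,\partial g}{d^2V/dg^2-d^2N/dg^2-d^2c/dg^2}\right|_{g^*}$ and $\frac{dg^{**}}{d\theta}=\left.\frac{\partial^2c/\partial\theta\,\partial g}{d^2V/dg^2-d^2N/dg^2-d^2c/dg^2}\right|_{g^{**}}$, we have

$$\frac{d\hat{g}_{opt}}{d\theta}\neq 0\;\Leftrightarrow\;\frac{\partial}{\partial g}\left(\frac{\partial^2c}{\partial\theta\,\partial g}\right)\neq 0.$$

Therefore, as long as we assume $\frac{\partial}{\partial g}\left(\frac{\partial^2c}{\partial\theta\,\partial g}\right)\neq 0$, the optimal message depends on the dispositional altruism $\theta$.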

A.8 Lemma 8.1: Existence of Interior Solution T∗ for FSD

In the case of the more realistic modifications of the maximization problem (a budget constraint and a real price), we have

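$$\frac{d^2u}{dT^2}=\frac{d^2g^*}{dT^2}\cdot T+\frac{dg^*}{dT}=\frac{dg^*}{dT}<0.$$

So $\Pi(T,\alpha)$ is strictly concave in $T$ since $\Pi=\frac{1}{I}\sum_{i=1}^{I}u_i$. Therefore, the existence of an interior solution $T^*$ that solves $E[u_T(T,\alpha)]=0$ and $E[u_{TT}(T,\alpha)]<0$ can be assumed and FSD works.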

A.9 Proof of Proposition 8

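Note that $T^*$ increases (decreases) for all FSD transformations of the random variable $\alpha$ if $u_{T\alpha}(T,\alpha)\ge 0$ ($u_{T\alpha}(T,\alpha)\le 0$) everywhere. We want to show that $u_{T\alpha}(T,\alpha)=\frac{d^2u}{d\alpha\,dT}\ge 0$ is equivalent to $\frac{d^2g^*}{d\alpha\,dT}\ge 0$.

In the case of the more realistic modifications of the maximization problem, we have

$$\begin{aligned}\frac{d^2u}{d\alpha\,dT}&=\frac{d}{d\alpha}\left[\frac{dV}{dT}-\frac{dN}{dT}-\frac{dc}{dT}-x\frac{dg}{dT}\right]\\&=\frac{d}{d\alpha}\left[\frac{dV}{dg^*}\frac{dg^*}{dT}-\frac{dN}{dg^*}\frac{dg^*}{dT}-\frac{dc}{dg^*}\frac{dg^*}{dT}-x\frac{dg}{dT}\right]\\&=\frac{d^2g^*}{d\alpha\,dT}\left[\frac{dV}{dg^*}-\frac{dN}{dg^*}-\frac{dc}{dg^*}\right]+\frac{dg^*}{dT}\frac{d}{d\alpha}\left[\frac{dV}{dg^*}-\frac{dN}{dg^*}-\frac{dc}{dg^*}\right]-x\frac{d}{d\alpha}\left(\frac{dg}{dT}\right).\end{aligned}$$

Since $g^*$ is the solution to the individual utility maximization problem, that is, $g^*$ is the solution to $\frac{dV}{dg}-\frac{dN}{dg}-\frac{dc}{dg}-T-x=0$, we have

$$\frac{dV}{dg^*}-\frac{dN}{dg^*}-\frac{dc}{dg^*}=T+x.$$

Then

$$\frac{d^2u}{d\alpha\,dT}=\frac{d^2g^*}{d\alpha\,dT}(T+x)+\frac{dg^*}{dT}\frac{dT}{d\alpha}-\frac{d^2g^*}{d\alpha\,dT}\,x=\frac{d^2g^*}{d\alpha\,dT}\,T.$$

With $T\ge 0$, the sign of $\frac{d^2u}{d\alpha\,dT}$ is the same as that of $\frac{d^2g^*}{d\alpha\,dT}$.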

A.10 Proof of Proposition 9

Refer to A.9 Proof of Proposition 8 to see that the sign of $\frac{d^2u}{d\alpha\,dT}$ is the same as that of $\frac{d^2g^*}{d\alpha\,dT}$.

A.11 Proof of Proposition 10

Under the binary assumption for α, the government faces the following utility problem:

maxT≥0EΠT=∑1IEui=∑1Ipu1+1−pu2,s.t.0≤T≤x.

This is equivalent to

maxTEπ=pu1+1−pu2,s.t.0≤T≤x,

where

u1=uαH,T=Vg∗−NG∗−cαH,g∗−xg∗,u2=uαL,T=Vg∗−NG∗−cαL,g∗−xg∗,

and g∗=hα,T is the solution to the following individual utility maximization problem based on the level of taxation:

maxgs=Vg−NG−cα,g−Tg−xg,

subject to

gx+T≤b,g≥0.

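Write $L(g,\lambda_1,\lambda_2) = s(g) - \lambda_1\left[g(x+T) - b\right] - \lambda_2(-g)$. In the case of inequality constraints, we solve the Kuhn–Tucker conditions in addition to the inequalities $g(x+T) \le b$ and $g \ge 0$. The Kuhn–Tucker conditions for a maximum consist of the first-order condition

$$\frac{dL}{dg} = 0 \;\Rightarrow\; \frac{ds}{dg} - \lambda_1(x+T) + \lambda_2 = 0 \;\Rightarrow\; \frac{ds}{dg} = \lambda_1(x+T) - \lambda_2$$

and the complementary slackness conditions, which are given by

$$\lambda_1 \ge 0 \text{ and } \lambda_1 = 0 \text{ whenever } g(x+T) < b, \qquad \lambda_2 \ge 0 \text{ and } \lambda_2 = 0 \text{ whenever } g > 0.$$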
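The Kuhn–Tucker conditions for the individual's problem can be checked case by case. The sketch below assumes a hypothetical concave quadratic objective $s(g) = mg - \tfrac{1}{2}qg^2 - (T+x)g$, where `m` and `q` are illustrative parameters that do not appear in the paper. A multiplier is positive only when its constraint binds.

```python
def marginal_s(g, T, x, m, q):
    """ds/dg for the assumed objective s(g) = m*g - 0.5*q*g**2 - (T + x)*g."""
    return m - q * g - (T + x)


def solve_individual(T, x=2.0, b=20.0, m=9.0, q=1.5):
    """Return (g, lam1, lam2) for max s(g) s.t. g*(x+T) <= b and g >= 0.

    Because s is strictly concave (q > 0), the constrained optimum is the
    unconstrained optimum clipped to the feasible interval [0, b/(x+T)].
    """
    if x + T <= 0 or b <= 0 or q <= 0:
        raise ValueError("requires x + T > 0, b > 0 and q > 0")
    g_free = (m - T - x) / q          # solves ds/dg = 0
    g_cap = b / (x + T)               # largest g the budget allows
    g = min(max(g_free, 0.0), g_cap)
    lam1 = lam2 = 0.0
    if g_free > g_cap:                # budget binds: ds/dg = lam1*(x+T)
        lam1 = marginal_s(g, T, x, m, q) / (x + T)
    elif g_free < 0.0:                # non-negativity binds: ds/dg = -lam2
        lam2 = -marginal_s(g, T, x, m, q)
    return g, lam1, lam2


def kuhn_tucker_holds(T, x=2.0, b=20.0, m=9.0, q=1.5, tol=1e-9):
    """Check stationarity, feasibility, sign and complementary slackness."""
    g, lam1, lam2 = solve_individual(T, x, b, m, q)
    stationarity = abs(marginal_s(g, T, x, m, q) - (lam1 * (x + T) - lam2)) < tol
    feasible = g >= 0.0 and g * (x + T) <= b + tol
    signs = lam1 >= 0.0 and lam2 >= 0.0
    slackness = abs(lam1 * (g * (x + T) - b)) < tol and abs(lam2 * g) < tol
    return stationarity and feasible and signs and slackness


interior = solve_individual(T=1.0, b=20.0)      # neither constraint binds
budget_bound = solve_individual(T=0.0, b=6.0)   # budget constraint binds
zero_corner = solve_individual(T=8.0, b=20.0)   # g = 0 binds
```

The three calls cover an interior solution, a binding budget, and a zero-consumption corner; `kuhn_tucker_holds` can be applied to any other tax level under the same assumed objective.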

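Similarly, write $L'(T,\delta_1,\delta_2) = p\,u_1 + (1-p)\,u_2 - \delta_1(T - x) - \delta_2(-T)$, and solve the Kuhn–Tucker conditions in addition to the inequality constraints:

$$\text{F.O.C.} \;\Rightarrow\; \frac{dE[\pi]}{dT} = p\,\frac{du_1}{dT} + (1-p)\,\frac{du_2}{dT} - \delta_1 + \delta_2 = F(T) = 0,$$

and

$$\delta_1 \ge 0 \text{ and } \delta_1 = 0 \text{ whenever } T < x, \qquad \delta_2 \ge 0 \text{ and } \delta_2 = 0 \text{ whenever } T > 0.$$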

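According to the Implicit Function Theorem,

$$\frac{dT^*}{dp} = -\frac{\partial F/\partial p}{\partial F/\partial T} = -\frac{\dfrac{du_1}{dT} - \dfrac{du_2}{dT}}{p\,\dfrac{d^2u_1}{dT^2} + (1-p)\,\dfrac{d^2u_2}{dT^2}} = (*).$$

The denominator of $(*)$ has the same sign as $\frac{d^2u}{dT^2}$, and the sign of the numerator of $(*)$ is determined by $\frac{d^2u}{d\alpha\,dT}$. If $\frac{d^2u}{d\alpha\,dT} > 0$, then $\frac{du_1}{dT} > \frac{du_2}{dT}$ and the numerator of $(*)$ is positive, and vice versa. So

$$\text{the sign of } \frac{dT^*}{dp} \text{ is opposite to that of } \frac{d^2u/(d\alpha\,dT)}{d^2u/dT^2}.$$

Now $\frac{d^2u}{dT^2} = \frac{d^2g^*}{dT^2}\cdot T + \frac{dg^*}{dT} = \frac{dg^*}{dT} < 0$, so the sign of $\frac{dT^*}{dp}$ is the same as the sign of $\frac{d^2u}{d\theta\,dT}$, which is the same as the sign of $\frac{d^2g^*}{d\theta\,dT}$, as we have shown in the proof of Proposition 8.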

A.12 Proof of Proposition 11

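The government maximizes the expected value of $\Pi$:

$$\max_{T}\; E[\Pi] = \sum_{i=1}^{I} E\left[V(g_i^*) - N(G^*) - c(\theta,\omega,g_i^*)\right].$$

Then, by the first-order condition, the following holds at $T = T^*$:

$$\frac{dE[\Pi]}{dT} = I\cdot E\left[V_h h_T - N_h h_T - c_h h_T\right] = 0.$$

By the Implicit Function Theorem,

$$\frac{dT^*}{d\theta} = -\frac{E\left[h_T\cdot\left(V_{h\theta} - N_{h\theta} - c_{h\theta}\right) + h_{T\theta}\cdot\left(V_h - N_h - c_h\right)\right]}{E\left[h_T\cdot\left(V_{hT} - N_{hT} - c_{hT}\right) + h_{TT}\cdot\left(V_h - N_h - c_h\right)\right]}.$$

Since $\frac{dg^*}{dT} = \dfrac{1}{\frac{d^2V}{dg^2} - \frac{d^2N}{dg^2} - \frac{d^2c}{dg^2}} < 0$, we have $h_T < 0$ and $h_{TT} = 0$.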

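Assuming $I \ge 2$, note that the first-order condition of the individual utility maximization problem gives

$$V_h - N_h - c_h = T \ge 0,$$

and taking derivatives with respect to $T$ and $\theta$ gives

$$V_{hT} - N_{hT} - c_{hT} = 1, \qquad V_{h\theta} - N_{h\theta} - c_{h\theta} = 0.$$

Also,

$$h_{T\theta} = \left(\frac{d^2V}{dg^2} - \frac{d^2N}{dg^2} - \frac{d^2c}{dg^2}\right)^{-2}\cdot\frac{d}{d\theta}\left(\frac{d^2c}{dg^2}\right) = \left(\frac{d^2V}{dg^2} - \frac{d^2N}{dg^2} - \frac{d^2c}{dg^2}\right)^{-2}\cdot\frac{d\tilde{c}}{d\theta}\cdot\frac{d^2\hat{c}}{dg^2} \ge 0.$$

Thus, we have

$$\frac{dT^*}{d\theta} = -\frac{E[h_{T\theta}]\cdot T}{E[h_T]} \ge 0.$$

A similar proof shows that $\frac{dT^*}{d\omega} \ge 0$. The equalities hold if and only if $\frac{\partial^2\hat{c}}{\partial g^2} = 0$.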
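The expression for $h_{T\theta}$ follows from differentiating the individual's first-order condition twice. The steps below are a sketch under stated assumptions: $D$ is shorthand introduced here (it does not appear in the paper), $V$ and $N$ are assumed not to depend on $\theta$, $D$ is assumed not to vary with $g$ (which is what $h_{TT} = 0$ requires), and the cost is assumed to be separable as $c = \tilde{c}(\theta,\omega)\,\hat{c}(g)$, as the final equality in the text suggests.

```latex
% D is shorthand introduced for this sketch only.
\begin{aligned}
D &\equiv \frac{d^2V}{dg^2} - \frac{d^2N}{dg^2} - \frac{d^2c}{dg^2} < 0,\\
V_h - N_h - c_h = T
  \;&\Rightarrow\; D\,h_T = 1
  \;\Rightarrow\; h_T = \frac{1}{D} < 0,\\
h_{T\theta} &= \frac{d}{d\theta}\left(\frac{1}{D}\right)
  = -\frac{1}{D^2}\,\frac{dD}{d\theta}
  = \frac{1}{D^2}\,\frac{d}{d\theta}\left(\frac{d^2c}{dg^2}\right)
  = \frac{1}{D^2}\,\frac{d\tilde{c}}{d\theta}\,\frac{d^2\hat{c}}{dg^2}.
\end{aligned}
```

Because $D^{-2} > 0$, the sign of $h_{T\theta}$ is the sign of $\frac{d\tilde{c}}{d\theta}\cdot\frac{d^2\hat{c}}{dg^2}$, which the text takes to be non-negative.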

A.13 Proof of Proposition 12

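$$\frac{d^2T^*}{d\omega\,d\theta} = -\frac{E[h_{T\theta\omega}]\cdot T\cdot E[h_T] - E[h_{T\omega}]\,E[h_{T\theta}]\cdot T}{E[h_T]^2} = \frac{T\cdot\left(E[h_{T\omega}]\,E[h_{T\theta}] - E[h_{T\theta\omega}]\,E[h_T]\right)}{E[h_T]^2}.$$

So the sign of $\frac{d^2T^*}{d\omega\,d\theta}$ is the same as the sign of

$$T\cdot\Big[\underbrace{E[h_{T\omega}]\,E[h_{T\theta}]}_{\ge 0} - E[h_{T\theta\omega}]\,\underbrace{E[h_T]}_{<0}\Big].$$

Note that $h_{T\theta} \ge 0$. Similarly, we can derive $h_{T\omega} \ge 0$. Also, $h_T < 0$. Thus, the sign of $\frac{d^2T^*}{d\omega\,d\theta}$ depends on the sign of $E[h_{T\theta\omega}]$. The sign of $E[h_{T\theta\omega}]$ is in turn determined by the sign of $\frac{d^2\tilde{c}}{d\omega\,d\theta}$:

$$
\begin{aligned}
h_{T\theta\omega} &= \frac{d}{d\omega}\left(\frac{d^2g^*}{d\theta\,dT}\right)
= \frac{d}{d\omega}\left[\frac{\frac{d}{d\theta}\left(\frac{d^2c}{dg^2}\right)}{\left(\frac{d^2V}{dg^2} - \frac{d^2N}{dg^2} - \frac{d^2c}{dg^2}\right)^2}\right]\\[4pt]
&= \frac{\left(\frac{d^2V}{dg^2} - \frac{d^2N}{dg^2} - \frac{d^2c}{dg^2}\right)^2\frac{d^2}{d\omega\,d\theta}\left(\frac{d^2c}{dg^2}\right) - \frac{d}{d\theta}\left(\frac{d^2c}{dg^2}\right)\cdot 2\left(\frac{d^2V}{dg^2} - \frac{d^2N}{dg^2} - \frac{d^2c}{dg^2}\right)\cdot\left(-\frac{d}{d\omega}\frac{d^2c}{dg^2}\right)}{\left(\frac{d^2V}{dg^2} - \frac{d^2N}{dg^2} - \frac{d^2c}{dg^2}\right)^4}\\[4pt]
&= \frac{\left(\frac{d^2V}{dg^2} - \frac{d^2N}{dg^2} - \frac{d^2c}{dg^2}\right)^2\frac{d^2\tilde{c}}{d\omega\,d\theta}\cdot\frac{d^2\hat{c}}{dg^2} + 2\cdot\frac{d}{d\theta}\left(\frac{d^2c}{dg^2}\right)\cdot\frac{d}{d\omega}\left(\frac{d^2c}{dg^2}\right)\cdot\left(\frac{d^2V}{dg^2} - \frac{d^2N}{dg^2} - \frac{d^2c}{dg^2}\right)}{\left(\frac{d^2V}{dg^2} - \frac{d^2N}{dg^2} - \frac{d^2c}{dg^2}\right)^4}.
\end{aligned}
$$

Since $\frac{d^2\hat{c}}{dg^2} \ge 0$, $\frac{d}{d\theta}\left(\frac{d^2c}{dg^2}\right) = \frac{d\tilde{c}}{d\theta}\cdot\frac{d^2\hat{c}}{dg^2} \ge 0$, and $\frac{d}{d\omega}\left(\frac{d^2c}{dg^2}\right) = \frac{d\tilde{c}}{d\omega}\cdot\frac{d^2\hat{c}}{dg^2} \ge 0$, the sign of $h_{T\theta\omega}$ is opposite to that of

$$\frac{d^2\hat{c}}{dg^2}\,\underbrace{\left(\frac{d^2V}{dg^2} - \frac{d^2N}{dg^2} - \frac{d^2c}{dg^2}\right)}_{<0}\,\frac{d^2\tilde{c}}{d\omega\,d\theta} + \underbrace{2\cdot\frac{d}{d\theta}\left(\frac{d^2c}{dg^2}\right)\cdot\frac{d}{d\omega}\left(\frac{d^2c}{dg^2}\right)}_{\ge 0}.$$

Thus, the sign of $h_{T\theta\omega}$ is determined by $\frac{d^2\tilde{c}}{d\omega\,d\theta}$. If $\frac{d^2\tilde{c}}{d\omega\,d\theta} < 0$, then $h_{T\theta\omega} > 0$ and, as a result, $\frac{d^2T^*}{d\omega\,d\theta} > 0$. If $\frac{d^2\tilde{c}}{d\omega\,d\theta} > 0$, then the sign of $h_{T\theta\omega}$ is ambiguous. For $\frac{d^2\tilde{c}}{d\omega\,d\theta}$ large enough, $h_{T\theta\omega}$ can become negative enough to make $\frac{d^2T^*}{d\omega\,d\theta} < 0$.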

References

Akerlof, K., and C. Kennedy. “Nudging toward a Healthy Natural Environment: How Behavior Change Research Can Inform Conservation.” June 10, 2013. Accessed November 13, 2014. http://climatechangecommunication.org/sites/default/files/reports/NudgesforConservation_GMU_061013.pdf

Allcott, H. 2011. “Social Norms and Energy Conservation.” Journal of Public Economics 95 (9):1082–95. doi:10.1016/j.jpubeco.2011.03.003

Allcott, H., and S. Mullainathan. 2010. “Behavioral Science and Energy Policy.” Science 327 (5970):1204–5. doi:10.1126/science.1180775

Andreoni, J. 1990. “Impure Altruism and Donations to Public Goods: A Theory of Warm-Glow Giving.” The Economic Journal 100 (401):464–77. doi:10.2307/2234133

Andreoni, J. 1995. “Warm-Glow Versus Cold-Prickle: The Effects of Positive and Negative Framing on Cooperation in Experiments.” The Quarterly Journal of Economics 110 (1):1–21. doi:10.2307/2118508

Ayal, S., and F. Gino. 2011. “Honest Rationales for Dishonest Behavior.” In The Social Psychology of Morality: Exploring the Causes of Good and Evil, edited by M. Mikulincer and P. R. Shaver. Washington, DC: American Psychological Association. doi:10.1037/13091-008

Ayres, I., S. Raseman, and A. Shih. 2013. “Evidence From Two Large Field Experiments That Peer Comparison Feedback Can Reduce Residential Energy Usage.” Journal of Law, Economics, and Organization 29 (5):992–1022. doi:10.1093/jleo/ews020

Azmat, G., and N. Iriberri. 2010. “The Importance of Relative Performance Feedback Information: Evidence from a Natural Experiment Using High School Students.” Journal of Public Economics 94 (7):435–52. doi:10.1016/j.jpubeco.2010.04.001

Ball, S., C. Eckel, P. J. Grossman, and W. Zame. 2001. “Status in Markets.” Quarterly Journal of Economics 116 (1):161–88. doi:10.1162/003355301556374

Berger, J., and L. Rand. 2008. “Shifting Signals to Help Health: Using Identity Signaling to Reduce Risky Health Behaviors.” Journal of Consumer Research 35 (3):509–18. doi:10.1086/587632

Bergstrom, T., L. Blume, and H. Varian. 1986. “On the Private Provision of Public Goods.” Journal of Public Economics 29 (1):25–49. doi:10.1016/0047-2727(86)90024-1

Beshears, J., J. J. Choi, D. Laibson, B. C. Madrian, and K. L. Milkman. The effect of providing peer information on retirement savings decisions. No. w17345. National Bureau of Economic Research, 2011. doi:10.3386/w17345

Bowles, S. 1998. “Endogenous Preferences: The Cultural Consequences of Markets and Other Economic Institutions.” Journal of Economic Literature 36 (1):75–111.

Casadesus-Masanell, R. 2004. “Trust in Agency.” Journal of Economics & Management Strategy 13 (3):375–404. doi:10.1111/j.1430-9134.2004.00016.x

Cialdini, R. B., L. J. Demaine, B. J. Sagarin, D. W. Barrett, K. Rhoads, and P. L. Winter. 2006. “Managing Social Norms for Persuasive Impact.” Social Influence 1 (1):3–15. doi:10.1080/15534510500181459

Cialdini, R. B., C. A. Kallgren, and R. R. Reno. 1991. “A Focus Theory of Normative Conduct: A Theoretical Refinement and Reevaluation of the Role of Norms in Human Behavior.” Advances in Experimental Social Psychology 24 (20):1–243. doi:10.1016/S0065-2601(08)60330-5

Costa, D. L., and M. E. Kahn. Why has California’s residential electricity consumption been so flat since the 1980s? A microeconometric approach. No. w15978. National Bureau of Economic Research, 2010. doi:10.3386/w15978

Darlington, R. B., and C. E. Macker. 1966. “Displacement of Guilt-Produced Altruistic Behavior.” Journal of Personality and Social Psychology 4 (4):442. doi:10.1037/h0023743

Diamond, P. 2006. “Optimal Tax Treatment of Private Contributions for Public Goods with and without Warm Glow Preferences.” Journal of Public Economics 90 (4):897–919. doi:10.1016/j.jpubeco.2005.06.001

Eckel, C. C., E. Fatas, and R. Wilson. 2010. “Cooperation and Status in Organizations.” Journal of Public Economic Theory 12 (4):737–62. doi:10.1111/j.1467-9779.2010.01472.x

Falk, A., and A. Ichino. 2006. “Clean Evidence of Peer Effects.” Journal of Labor Economics 24 (1):39–57. doi:10.1086/497818

Ferraro, P. J., and M. K. Price. 2013. “Using Nonpecuniary Strategies to Influence Behavior: Evidence from a Large-Scale Field Experiment.” Review of Economics and Statistics 95 (1):64–73. doi:10.1162/REST_a_00344

Hallsworth, M., et al. The behavioralist as tax collector: Using natural field experiments to enhance tax compliance. No. w20007. National Bureau of Economic Research, 2014. doi:10.3386/w20007

Ho, B., G. Poe, J. Taber, and A. Bento. 2015. “The Effects of Moral Licensing and Moral Cleansing in Contingent Valuation and Laboratory Experiments on the Demand to Reduce Externalities.” Environmental and Resource Economics 1–24. http://link.springer.com/article/10.1007/s10640-014-9872-y. doi:10.1007/s10640-014-9872-y

Johansson, O. 1997. “Optimal Pigovian Taxes under Altruism.” Land Economics 73 (3):297–308. doi:10.2307/3147169

Jordan, J., E. Mullen, and J. K. Murnighan. 2011. “Striving for the Moral Self: The Effects of Recalling Past Moral Actions on Future Moral Behavior.” Personality and Social Psychology Bulletin 37 (5):701–13. doi:10.1177/0146167211400208

Kandel, E., and E. P. Lazear. 1992. “Peer Pressure and Partnerships.” Journal of Political Economy 100 (4):801–17. doi:10.1086/261840

Khan, U., and R. Dhar. 2006. “Licensing Effect in Consumer Choice.” Journal of Marketing Research 43 (2):259–66. doi:10.1509/jmkr.43.2.259

Mas, A., and E. Moretti. 2009. “Peers at Work.” American Economic Review 99 (1):112–45. doi:10.1257/aer.99.1.112

Masclet, D., C. Noussair, S. Tucker, and M.-C. Villeval. 2003. “Monetary and Nonmonetary Punishment in the Voluntary Contributions Mechanism.” American Economic Review 93 (1):366–80. doi:10.1257/000282803321455359

Merritt, A. C., D. A. Effron, and B. Monin. 2010. “Moral Self-Licensing: When Being Good Frees Us to Be Bad.” Social and Personality Psychology Compass 4 (5):344–57. doi:10.1111/j.1751-9004.2010.00263.x

Mirrlees, J. A. 1971. “An Exploration in the Theory of Optimum Income Taxation.” The Review of Economic Studies 38 (2):175–208. doi:10.2307/2296779

Monin, B., and D. T. Miller. 2001. “Moral Credentials and the Expression of Prejudice.” Journal of Personality and Social Psychology 81 (1):33. doi:10.1037/0022-3514.81.1.33

Ormiston, M. B. 1992. “First and Second Degree Transformations and Comparative Statics under Uncertainty.” International Economic Review 33 (1):33–44. doi:10.2307/2526981

Rotemberg, J. J. 1994. “Human Relations in the Workplace.” Journal of Political Economy 102 (4):684–717. doi:10.1086/261951

Sachdeva, S., R. Iliev, and D. L. Medin. 2009. “Sinning Saints and Saintly Sinners: The Paradox of Moral Self-Regulation.” Psychological Science 20 (4):523–8. doi:10.1111/j.1467-9280.2009.02326.x

Sandmo, A. 1975. “Optimal Taxation in the Presence of Externalities.” The Swedish Journal of Economics 77 (1):86–98. doi:10.2307/3439329

Schultz, P. W., J. M. Nolan, R. B. Cialdini, N. J. Goldstein, and V. Griskevicius. 2007. “The Constructive, Destructive, and Reconstructive Power of Social Norms.” Psychological Science 18 (5):429–34. doi:10.1111/j.1467-9280.2007.01917.x

Thaler, R. H., and C. R. Sunstein. 2008. Nudge: Improving Decisions about Health, Wealth, and Happiness. New Haven & London: Yale University Press.

Published Online: 2015-04-28

Published in Print: 2015-10-01

https://doi.org/10.1515/bejeap-2014-0125
