A Signaling Theory of the Online Consumer Review Policy

Jeong-Yoo Kim; Wei Xu

doi:10.1515/rne-2023-0041

Article Open Access

A Signaling Theory of the Online Consumer Review Policy

Jeong-Yoo Kim and Wei Xu

Published/Copyright: December 18, 2023

Published by

Become an author with De Gruyter Brill

Submit Manuscript Author Information Explore this Subject

From the journal Review of Network Economics Volume 22 Issue 2

Abstract

In this paper, we consider a two-period model of an experience good with a seller (informed player) and a consumer (uninformed player) in each period. In the model, we examine the seller’s decision about offering refunds for online reviews of his products, and each period consumer’s purchasing decision together with the first period consumer’s reviewing decision. Our main interest is whether a high-quality product seller offers a high amount or a low amount of cashback for an online review. We show that a lenient cashback policy for a review can be a signal of high quality of the product. Intuitively, a high-quality seller can offer a higher amount of cashback to a consumer who reviews, whether the review is positive or negative. This separation is possible mainly due to a difference in the second-period profits across types. We also briefly discuss the effect of the conditional review policy fostering fake reviews.

Keywords: experience good; online consumer review; quality; signal; fake review; word-of-mouth (WOM) communication

JEL Classification: D82; L15

1 Introduction

If you do build a great experience, customers tell each other about that. Word of mouth is very powerful. Jeff Bezos

The online consumer review system is a new channel to communicate information about products with growing popularity. Since amazon.com started to offer consumers a chance to post their reviews on products on its website in 1995,^[1] an increasing number of online sellers including circuitcity.com, goodguys.com, computer4sure.com, wine.com followed with similar strategies. Its popularity and significant impact on buying decisions have been extended from online sellers to offline sellers, as found in yelp.com.^[2] Yelp.com has been one of the most popular review sites for local businesses including restaurants, bars, beauty salons, doctors, and dentists, since the website started in 2004.^[3] According to one study, 91 % of people trust online reviews as much as personal recommendations. A study by Northwestern University’s Spiegel Research Center (2021) reports that nearly 95 % of shoppers read online reviews before making a purchase.

With its booming development, however, some problems also emerged. As more consumers rely on information from review sites, sellers become more inclined to manipulate the review outcomes. For example, Yelp has been accused of using unfair practices to raise revenue from the businesses that are reviewed on its site – e.g. by presenting more negative review information for companies that do not purchase its advertising services or by prominently featuring advertisements of the competitors of such non-paying companies or conversely by excluding negative reviews from companies’ overall rating for the reason that they are not recommended. Its reliability has also been affected by the submission of fake reviews by external users, such as false positive reviews submitted by a company to promote its own business or false negative reviews submitted by competing companies.^[4] Overall, it is reported that 20 % of Yelp reviews are fake. A similar practice can be found in e-commerce platform sellers who guide buyers to praise and improve the praise rate through the way of “cashback for positive reviews”, so as to attract more consumers to buy their products. The strategy of “cashback on favorable comments” for buyers through e-commerce platforms enables sellers to obtain higher profits at least in the short term.

The fundamental reason for a seller to use a consumer review policy is that consumers are not perfectly informed of the quality of the product it sells before they make purchasing decisions. Such a product is called an experience good in the literature in economics. The literature on experience goods usually assumes that consumers can learn the quality of the product by word of mouth in the second period,^[5] and focuses only on the information transmission mechanism in the first period. Although many economists follow this approach and treat word-of-mouth communication exogenously, it is true that the speed and scope of information diffusion by word-of-mouth communications can be determined endogenously by the seller’s strategic decision. A consumer’s review is just a kind of word-of-mouth communication that can be verified due to its written form for the purpose of rewarding.

In this paper, we examine the decision of a seller selling experience goods to affect word-of-mouth communication of consumers through review systems.^[6] We consider a two-period model with a seller (informed player) and a consumer (uninformed player) in each period in which a seller offers a refund amount for reviews of his products and a consumer in each period decides whether to purchase the experience good and whether to submit a review for the product, although we assume that the review option is not available to the final (second)-period consumer. We are mainly interested in whether a high-quality product seller offers a high amount or a low amount of cashback for a review. We show that a seller of a high-quality product offers a lenient cashback policy for a review, that is, if a seller pays a high amount of cashback to a consumer who submits a review, it is a signal of high quality of the product. Intuitively, a high-quality seller can offer a higher amount of cashback to a consumer who reviews, whether the review is positive or negative, mainly due to higher second-period profits he can get than a low-quality seller.

It is widespread phenomena that retailers run a pay-per-review scheme. Consumers receive some payment or a product in return for a review. There are quite a few review platforms that connect retailers and customers, for example, www.ilovetoreview.com.^[7] Amazon offers a $15 Amazon gift card in exchange for writing a review on Amazon. Also, in the Amazon’s incentive program “Vine Club,” a selected group of customers receive pre-release products free of charge in exchange for a review.^[8]

According to the classification by Nelson (1970), experience goods are those whose quality is learned only after purchase, whereas search goods are those whose quality can be ascertained by consumers before purchasing. Darby and Karni (1973) extended Nelson’s taxonomy into credence goods which are defined as those whose quality cannot be observed even after it is purchased. Because the quality of experience and credence goods is unobservable before purchasing, consumers must make purchase decisions without certainty about product quality. Therefore, consumers have a clear motive to seek information that might help identify high-quality goods. Accordingly, sellers of experience or credence goods also have a clear motive to find ways to signal high quality. Among others, price is a widely used signaling device (Bagwell and Riordan 1991; Judd and Riordan 1994; Wolinsky 1983), while strategic advertising another signaling device as a signaling device (Nelson 1974; Schmalensee 1978). Milgrom and Roberts (1986), Hertzendorf (1993) and Fluet and Garella (2002) considered price and advertising as a jointly chosen pair of signaling devices. Berg, Kim, and Seon (2021) identified conditions under which the seller’s refund policy could be used as a signaling device for sellers of credence goods.

This literature on experience goods identifies two conflicting effects: the so-called Schmalensee and Nelson effect. The Schmalensee effect occurs whenever low-quality sellers have a stronger incentive than high-quality sellers to attract consumers because low-quality sellers’ lower costs of production generate greater profit in the future. The Schmalensee effect occurs only when the cost of producing high-quality products is greater than the cost of producing low-quality products. In our model, there is no difference in production costs across types of sellers, but offering a refund for a review has the effect of endogenizing different costs of sales (more accurately, different future revenues from sales) across seller types, because of their different probabilities of good reviews that can generate more sales in the future. On the other hand, the Nelson effect occurs whenever high-quality sellers have a stronger incentive than low-quality sellers to attract consumers because high-quality goods generate more repeat purchases by consumers who experienced the product or more sales through word-of-mouth communication. Therefore, the information transmission mechanism in our paper is closer to the Nelson effect in the sense that a high refund amount lowers the effective price to induce more consumers to buy in the next period by word-of-mouth communication (review). However, we do not make the naive assumption that word-of-mouth communication is exogenously made by the consumers who purchased in the first period, rather investigate the endogenous decision that leads to word-or-mouth communication. That is, our main concern in this paper is to determine the refund amount that optimizes the second-period word-of-mouth communication at the expense of the first-period profit by allowing some additional refund cost.

Two main features of this paper that are distinguished from the existing signaling literature deserve receiving special attention. First, we examine two separate endogenous information transmission mechanisms both to the first-period consumer and to the second-period consumer at the same time, whereas most of the existing literature focuses only on the information transmission mechanism to the first-period consumer. In particular, in our model, the signal to the second-period consumer is not entirely generated by the seller but jointly created by the seller and the first-period consumer who leaves a review after getting partially informed of the product quality. In fact, the endogenous learning outcome in the second period does affect the seller’s decision in the first period, because purchasing a product of high quality in the first period is more likely to generate a good review thereby inducing more purchases in the second period and accordingly the high-type seller’s second-period reward from a refund is larger. Second, a refund, which is used as a signal of quality, is not socially wasteful in this paper, whereas advertising in Milgrom and Roberts (1986) and education in Spence (1973) are dissipative. In this paper, a seller’s refund is paid to a consumer, so it is just a monetary transfer, not socially wasteful, although too much costly refund may be wasteful to the seller. It has two implications. While any separating equilibrium involving a positive amount of advertising cannot be a reasonable equilibrium in Milgrom and Roberts (1986), a positive amount of refund is possible in a reasonable separating equilibrium passing the Intuitive Criterion because a price and a refund rate have tradeoffs so neither is dissipative in our paper in the sense that a refund increases the demand at the expense of increasing the cost. Another implication is on social welfare. If the production cost is assumed to be zero, a larger amount of refund, which is more costly to a seller, is socially better, because it can generate more sales, whereas too much advertising in Milgrom and Roberts (1986) and too much education in Spence (1973) are socially inefficient.

Literature on consumer reviews in marketing suggests that consumer reviews have become important for consumer purchase decisions and product sales. For example, Chevalier and Mayzlin (2006) found that online book reviews have a significant effect on its sales. Liu (2006) also found that consumer reviews at the Yahoo Movies website has a considerable impact on the revenue. Chen and Xie (2008) argued that online consumer review can help consumers identify the products that best match their preferences. By using a formal model of consumer reviews, they showed that consumer reviews can help a horizontal match between a seller and a consumer. This is contrasted with our model in which a review can give information of the quality to consumers who all prefer high quality to low quality. So, our model is vertical, whereas theirs is horizontal. They also distinguish consumer reviews with traditional offline word-of mouth (WOM) communication. They argue that WOM communication is limited to local social networks, whereas the impact of consumer reviews can reach further. Our paper complements the marketing literature on WOM communication and the consumer reviews by endogenizing WOM communication or consumer review decisions through the seller’s incentive scheme. Chen, Zhang, and Liu (2019) analyzed the effect of the review system of crowdfunding and showed that the crowdfunding platform with the review system makes more profits by providing more information to consumers. In their model, however, none of the crowdfunding platform’s refund decision, its signaling effect, and the reviewing decision of consumers are analyzed. Dorner, Giamattei, and Greiff (2020) also considered the incentive scheme for online product reviewers. They experimentally considered two different incentive schemes; a pay-per-review scheme as in our model and a tournament scheme in which the reviewer who wrote the most helpful review receives a bonus payment. However, they did not provide a formal model to compare the outcomes under the two schemes. Liu (2017) also considered a two-period (the advance selling period and the spot selling period) signaling model similar to ours. In her model, however, reviewing is not a strategic variable of consumers although she discusses negative review effect, nor is the cashback amount a signaling strategy of the seller. Instead, a seller uses two-dimensional signals which are the price and the credibility of e-commerce platform. Li, Tadelis, and Zhou (2020) is closest to ours. They also argue that rewarding buyers for leaving ratings, which they call Rebate-for-Feedback (RFF), can signal high quality of the seller. However, they did not provide a formal signaling model. Furthermore, they assert that “the monetary value of the RFF is not what supports the separation of good versus bad types, but instead it is the quality of feedback that a buyer will leave”, but the assertion is misleading.^[9] Note that it is not the case in separating equilibria we identify. The monetary value of the RFF enables the first-period consumer to tell good types from bad types, while the feedback that the first-period consumer leaves enables the second-period consumer to distinguish good types and bad types probabilistically.

There are also some studies on credibility of reviews and fake reviews. Since reviews are essentially cheap talk,^[10] the review messages may not be credible. In fact, there is significant evidence that fake reviews are widespread and that there exist markets for fake reviews.^[11] For example, Luca and Zervas (2016) find that roughly 16 % of restaurant reviews on Yelp are filtered, and these reviews tend to be more extreme (favorable or unfavorable) than other reviews, and the prevalence of suspicious reviews has grown significantly over time. Also, they find that a restaurant is more likely to commit review fraud when its reputation is weak, i.e. when it has few reviews or it has recently received bad reviews. Schuckert, Liu, and Law (2016) suggest an objective method for ascertaining whether an online review posted on a particular e-commerce platform (TripAdvisor) is unauthentic or suspicious. Relatedly, Jin, Yang, and Hosanagar (2022) consider the brushing strategy of online merchants which is to place fake orders of their own products on major e-commerce platforms and write good reviews about their fake orders (to boost their rankings in search results). We also consider the possibility that sellers manipulate reviews by paying only for positive reviews^[12] thereby distorting the review outcomes by false reviews.^[13] Section 5 briefly discusses this issue.

The article is organized as follows. In Section 2, we introduce a basic model. In Section 3, we analyze the model under complete information as a benchmark case, and in Section 4, we provide the analysis under incomplete information. In Section 5, we discuss the robustness of our results. Concluding remarks follow in Section 6. Proofs are provided in the Appendix.

2 Model

We consider a two-period model of consumer product review. There is a seller and a representative consumer in each period. The seller offers an experience good the quality of which is not known to the consumer. For simplicity, we assume that there are only two quality types, H or L (H > L > 0), with quality type denoted as q ∈ {H, L}. Thus, the quality of the good is private information of the firm. The prior probability that q = H is denoted λ ∈ (0, 1), which is common knowledge.

If a consumer purchases the good of quality q, then he gets a noisy signal s for the quality after consuming it. We assume that if the true quality is q = H, he receives

s = H with probability α L with probability 1 − α ,

and if q = L, he receives

s = L with probability α H with probability 1 − α ,

where α > 1 2 . As α → 1, the consumer gets a more accurate signal.

Let θ be the measure of a consumer’s quality sensitivity which is to tell how much a consumer cares about quality. We assume that θ is uniformly distributed over the unit interval [0,1].

After consuming the good, a consumer may write a review on his experience of the product by choosing one of the two messages “satisfactory (H)” or “unsatisfactory (L)”. It may be used as a reference of the second-period consumer who is also a priori uninformed of the quality of the product. To encourage the first-period consumer to write a review on the product, the firm pays back a refund r to the consumer who completes writing a review. We assume that reviewing incurs some cost k( > 0). A consumer may or may not write an honest report but we assume that lying by writing a dishonest review incurs some cost.^[14]

The seller of either type (q) faces the same marginal cost of production, which is assumed to be constant and equal to c ≥ 0. We also assume that c < L < H.^[15]

Interaction between the seller and a consumer in each period proceeds in two periods. In the first period, the firm chooses r, assuming that p is exogenously given. Following the firm’s choice of r, a consumer then updates the belief about the quality of the good, conditional on r, and then decides whether to purchase the good based on the belief. If he purchases and observes a noisy signal s about q, he decides whether to review and whether to write an honest review if he reviews, based on his observation. In the second period, another consumer makes a purchasing decision based on the review of the first-period consumer. Because there is no third-period consumer, the seller does not offer cashback to the second-period consumer (i.e. r ₂ = 0), and accordingly, the second-period consumer does not review. We assume that the discount factor is δ ∈ (0, 1). Also, we assume that the cashback amount in each period is observable only to the consumer in the period.^[16]

3 Complete Information

In this section, as a benchmark case, we consider the case of complete information in which consumers are fully informed about the quality of the product q.

We will use backward induction due to the sequential nature of the model. In the second period, the seller does not offer a refund for a review (r = 0) and the consumer does not review, either, due to k > 0. The utility function of the second-period consumer is given by

U ( p , q ) = θ q − p .

So, given the price p, the consumer buys the product if θ ≥ p q and does not buy it otherwise. If we denote by π(r, q; p) the seller’s per-period profit as a function of the refund amount r, the second-period equilibrium profit is

π ( 0 , p , q ) = ( p − c ) D ( p , q ) ,

where D ( p , q ) = 1 − p q . This is partly because the second-period consumer’s profit does not depend on the first-period consumer’s review in this complete information case. Note that D 1 ≡ ∂ D ∂ p < 0 and D 2 ≡ ∂ D ∂ q > 0 . This profit is maximized when the seller could charge the monopoly price p m = q + c 2 where r = H, L, and the resulting monopoly profit would be π m ( q ) = 1 q q − c 2 2 , although p is not a choice variable of the seller in this model.

The first-period consumer makes two choices; purchasing decision and reviewing decision. If he writes a review on the quality of the product he purchased, his net valuation is r − k if he makes an honest review. Due to the positive lying cost, a consumer always make an honest review if he reviews. Then, a consumer’s expected net surplus from his purchase is:

(1) U = θ q − p + V ,

where V = max{r − k, 0} is his net valuation in the reviewing stage. If he reviews, V = r − k and if he does not, V = 0. If the consumer’s reservation utility is zero, then he purchases the good if and only if U = θq − p + V ≥ 0 or, equivalently, if θ ≥ p − V q , which gives the consumer demand function conditional on q: D ( p , q ) = 1 − p − V q .

In the first period, if the seller pays a positive amount of cashback to a consumer who submits a review, then the actual (effective) price that a consumer faces is p ̃ = p + k − r , as long as r ≥ k so that the consumer reviews. Thus, a positive refund amount has the effect of lowering the effective price, thereby increasing demand. Note that a consumer may purchase the good even if θq − p < 0 if he expects a positive amount of cashback for his review. Despite the increased demand, however, the resulting profit cannot be greater than the monopoly profit π ^m(q).

To see this, if the cashback amount is r ≥ k, then a consumer chooses to purchase if the utility of a consumer is U = θq − p + r − k ≥ 0 or, equivalently, if θ ≥ p ̃ q . Therefore, the seller’s first-period profit when he chooses both r and p (rather than taking p as given) is modified as follows:

(2) π ( r , p , q ) = ( p − c − r ) D ( p ̃ , q ) = ( p ̃ − k − c ) D ( p ̃ , q ) ≡ π ̃ , if r ≥ k π m if r < k .

We will call the case that r ≥ k a reviewing regime and the case that r < k a non-reviewing regime. Note that π ̃ ≡ ( p ̃ − k − c ) D ( p ̃ , q ) < π m for any p ̃ if k > 0. A positive amount of cashback has two effects. It increases the demand by dropping the effective price, while it increases the cost. Overall, in the reviewing regime, the cost-increasing effect is not completely offset by the demand-increasing effect due to the extra cost k that the consumer should bear for a review, and as a result, the first-period profit is lower when r ≥ k than when r < k. Is it possible, though, that the second-period profit is higher when r ≥ k than when r < k? Unfortunately, it is not possible, because the second-period consumer does not rely on the review in the complete information case in which he already knows the true quality q. Accordingly, a choice of r (whether r ≥ k or r < k) in the first period would not affect the second-period profit in the complete information case. Hence, the seller will choose r = 0 in equilibrium for any q. Intuitively, since a first-period consumer’s review does not give any additional information to a second-period consumer in the case of complete information, the seller will not have an incentive to induce a consumer to review in a costly way by choosing r > 0 even in the two-period model. It is clear that a review and a positive amount of cashback can occur in equilibrium only if there is incomplete information about the quality of the product.

4 Incomplete Information

If consumers do not have complete information about the quality of the seller, then the refund amount that the seller chooses may signal the quality of the product due to H- and L-type sellers’ different signaling costs (or different future benefits). Paying a large amount of refund for a review in the first period is more deserving to a high-quality seller because it helps to sell more in the first period thereby inducing more sales in the second period by a better review signal. This enables separation.

We will use the weak Perfect Bayesian Equilibrium (wPBE) as our basic equilibrium concept to solve the incomplete information game. Let (r _H, r _L) represent a pair of equilibrium cashback amount offered by the H- and L-type seller, respectively. An equilibrium may be separating (r _H ≠ r _L) or pooling (r _H = r _L = r _λ). Also, we will be interested only in the reviewing equilibrium which involves a review at least for one type of seller.

Let λ ̂ 1 ( r ) denote the first-period consumer’s posterior belief that q = H after observing the seller’s choice of r. Then λ ̂ 1 ( r H ) = 1 and λ ̂ 1 ( r L ) = 0 in a separating equilibrium, and λ ̂ 1 ( r λ ) = λ in a pooling equilibrium.

4.1 Separating Equilibria

We first consider separating equilibria. Since the private information of the seller is revealed with probability one in a separating equilibrium, we will denote the consumer’s belief about the seller’s type as q ^e (for expected type).

The second-period consumer updates his belief after observing whether a review is available (i.e. whether the first-period consumer bought it given the assumption that r ≥ k so that a consumer reviews whenever he buys) and after observing the signal from the review (whether s = H or L). Let λ ̂ 2 , b and λ ̂ 2 , s be the posterior beliefs respectively. We will consider two cases. If a second-period consumer updates his belief only based on s with ignoring the first observation, we will call him a naive consumer. Then, the resulting posterior belief updated by the naive consumer will be denoted simply by λ ̂ 2 . If he updates his belief twice, he will be called rational consumer.

In this section, we only consider the case that the second-period consumer is so naive that he updates his belief solely based on the review outcome without making any inference from other observations. The analysis for the case of a rational consumer will be provided in Section 5.1.

Let Π(r, q, q ^e) be the discounted sum of the seller’s first period profit and the second period profit when the seller whose actual type is q is perceived to be a type of q ^e after it chooses the cashback amount r. Under the review regime (r ≥ k), the second-period demand is determined by the review signal s, whereas the first-period demand is determined by the quality that the first-period consumer perceives from the signal r (q ^e). Let D ( r , q e ) = 1 − p − r + k q e be the first-period demand function. After observing s, the second-period consumer updates his posterior belief to

(3) λ ̂ 2 ( s = H ) = λ α λ α + ( 1 − λ ) ( 1 − α ) = λ λ + ( 1 − λ ) 1 − α α > λ ,

(4) λ ̂ 2 ( s = L ) = λ ( 1 − α ) λ ( 1 − α ) + ( 1 − λ ) α = λ λ + ( 1 − λ ) α 1 − α < λ .

Note that λ ̂ 2 ( H ) > λ > λ ̂ 2 ( L ) , since 1 − α α < 1 . By using them, we can compute

(5) Π ( r , H , q e ) = ( p − c − r ) D ( r , q e ) + δ D ( r , q e ) ( p − c ) α 1 − p E ( q | H ) + ( 1 − α ) 1 − p E ( q | L ) + δ ( 1 − D ( r , q e ) ) ( p − c ) 1 − p M ,

(6) Π ( r , L , q e ) = ( p − c − r ) D ( r , q e ) + δ D ( r , q e ) ( p − c ) ( 1 − α ) 1 − p E ( q | H ) + α 1 − p E ( q | L ) + δ ( 1 − D ( r , q e ) ) ( p − c ) 1 − p M ,

where E ( q | H ) = λ ̂ 2 ( H ) H + ( 1 − λ ̂ 2 ( H ) ) L , E ( q | L ) = λ ̂ 2 ( L ) H + ( 1 − λ ̂ 2 ( L ) ) L , and M ≡ λH + (1 − λ)L are expected qualities conditional on s = H, s = L, and unconditional expected quality, respectively. It is clear that E(q|H) > M > E(q|L) because λ ̂ 2 ( H ) > λ > λ ̂ 2 ( L ) .

Equation (5) can be explained as follows. The first term is the first-period profit of a high-type seller. In the second period, there are three possibilities. If the first-period consumer purchases the good and writes a review saying “H” (or “L” respectively), the second-period consumer believes that q ^e = H (or q ^e = L, respectively). If he does not purchases the good so no review is available, the second-period consumer maintains his prior belief λ, i.e. q ^e = λH + (1 − λ)L ≡ M. Equation (6) can be similarly interpreted.

Let Π*(q, q ^e) = max_rΠ(r, q, q ^e) and r*(q, q ^e) = arg max_rΠ(r, q, q ^e). Also, let ρ ≡ p − c be the margin. Then, it is easy to prove the following lemma from (5) and (6).

Lemma 1

(i) There exists α 0 > 1 2 such that for any α ∈ ( 1 2 , α 0 ) , Π*(H, H) > Π*(H, L), Π*(L, H) > Π*(L, L). (ii) For any α > 1 2 , r*(H, H) > r*(L, L).

If q = H, the second-period average demand when the first-period consumer purchases and reviews is higher than the second-period demand when no review is available. This is intuitive, because the review is more likely to be good. Similarly, if q = L, the second-period average demand when a review is available is lower than the second-period demand when a review is unavailable, because the review is more likely to be bad. So, in the first case, the seller will offer a higher amount of cashback r to induce the first-period consumer to buy, whereas he will offer a lower amount of cashback to discourage the first-period consumer from purchasing the good or providing a review. It is noteworthy that if α is too large, a low type may earn a lower profit by pretending to be a high type, because being perceived as a high type increases the probability of the first-period sales and thus decreases the probability of the second-period sales due to the likelihood of negative reviews. So, there is a tradeoff between the first-period sales and the second-period sales for a low type. If α is low, the first-period effect dominates the second-period effect. In this case, the low type will want to be perceived as a high type.

If we define the first best solution of H type and L type by r ^f(H) and r ^f(L) respectively, Lemma 1 (ii) implies that r ^f(H) > r ^f(L) since r ^f(H) ≡ r*(H, H) and r ^f(L) ≡ r*(L, L).

To avoid the trivial case, we assume that:

(7) Π ( r f ( H ) , L , H ) > Π ( r f ( L ) , L , L ) .

This assumption implies that a low-quality seller prefers imitating a high-type seller’s cashback amount if the high-quality seller pays r ^f(H) for a review, rather than choosing a different cashback amount r ^f(L) which is his first-best cashback amount. This condition will be called [DC] condition.^[17] If we define α ₁ by α making (7) binding, i.e. satisfying Π(r ^f(H), L, H; α ₁) = Π(r ^f(L), L, L; α ₁), then [DC] condition implies that for any α ∈ 1 2 , α 1 , [DC] condition is satisfied. As α gets larger, a low type’s profit when he imitates a high type gets lower, because the second-period consumer is less likely to buy the product. This means that Π(r ^f(H), L, H; α) < Π(r ^f(L), L, L; α) for α > α ₁. In this case, a low type has no incentive to pretend to be a high type, so he will chooses r = r ^f(L) and [DC] condition is not satisfied. Hence, no costly signaling by a high type would be necessary.

To characterize the whole set of separating equilibria, we will assume the most pessimistic off-the-equilibrium belief λ ̂ 1 ( r ) = 0 for any off-the-equilibrium refund rate r, which gives the most severe punishment when the seller deviates from an equilibrium refund amount. The following lemma turns out to be useful for characterizing the set of all possible separating equilibria.

Lemma 2

In any separating equilibrium, r _L = r ^f(L).

This lemma implies that a low type’s equilibrium refund amount will not be distorted from r ^f(L). This is clear because, if r _L ≠ r ^f(L), then a low-type seller would prefer deviating to r ^f(L) insofar as its type is revealed in equilibrium.

The high type’s equilibrium refund amount r _H must satisfy the following two incentive compatibility conditions:

[ICL1] Π ( r L , L , L ) ≥ Π ( r H , L , H ) ,

[ICH1] Π ( r H , H , H ) ≥ Π ( r , H , L ) , ∀ r ≠ r H .

The first inequality, which is labeled as [ICL1], is the L type’s incentive compatibility condition. It requires that r _H is too costly for a low-type seller to imitate (i.e. it ensures that the low-type seller does not find it profitable to imitate the high type’s refund amount r _H.) On the other hand, [ICH1], which is the H type’s incentive compatibility condition, requires that r _H is not so costly for a high-type seller that it would prefer deviating from r _H to be perceived as a low type.

Figure 1 shows the region of r _H that satisfies both incentive compatibility conditions. In Figure 1, r ̲ L and r ̄ L are values of r that satisfy [ICL1] with equality, and r ̲ H and r ̄ H are values of r that satisfy [ICH1] with equality. Figure 1a illustrates the possibility that a high-quality good is signaled by a high cashback amount. In contrast, Figure 1b illustrates both possibilities: that a high-quality good may be signaled either by a high refund amount or by a low refund amount. If we invoke the stronger equilibrium concept of Cho and Kreps’ (1987) Intuitive Criterion (abbreviated as “C–K”), however, then high quality is signaled only by a high refund amount.

Figure 1:

Separating equilibria.

We say that a weak perfect Bayesian equilibrium (r _H, r _L) fails to pass the C–K Intuitive Criterion if there exists a cashback amount r( ≠ r _H, r _L) such that:

(8) ( CK − i ) Π ( r L , L , L ) ≥ Π ( r , L , q e ) , ∀ q e = L , H ,

(9) ( CK − ii ) Π ( r H , H , H ) < Π ( r , H , H ) .

Inequality (8) implies that an off-the-equilibrium r is equilibrium-dominated for type L. Inequality (9) implies that whenever the first-period consumer believes that the refund amount r was chosen by H (for whom r is not equilibrium-dominated), then H would have an incentive to deviate to r from r _H. If there exists a refund amount (r) that satisfies these two conditions, then (r _H, r _L) cannot pass the C–K Intuitive Criterion because H would have an incentive to deviate from the equilibrium. Applying C–K Intuitive Criterion, we obtain the following proposition.

Proposition 1

In the case of a naive consumer, there is some small ρ ̄ ( > 0 ) such that for any ρ ≤ ρ ̄ , there exist α ̲ ( > 1 2 ) and α ̄ such that for any α ∈ ( α ̲ , α ̄ ) , ( r L , r H ) = ( r f ( L ) , r ̄ L ) is the unique separating equilibrium that satisfies the C–K Intuitive Criterion. In this equilibrium, r _L < r _H.

Proposition 1 implies that a high refund amount for a review signals the seller’s high quality. Although the seller’s quality is not signaled to the second-period consumer due to his limited ability to observe, it is possible that he can get some signal from the review of the first-period consumer. The assumption that α > 1 2 makes it less costly (or more profitable) for a high-quality type to pay a consumer back for a review because it induces a higher expected profit in the second period, due to the higher probability that the review outcome is good. Therefore, a low type cannot imitate a high type’s generous payback.

Proposition 1 also says that the high-type seller does not need to use any signal that costs more than r ̄ L , which is the well-known “Riley outcome.” The Riley outcome r ̄ L is the most efficient separating equilibrium because it incurs the least signaling cost. To see why applying the Intuitive Criterion can eliminate all the separating equilibria except the most efficient one ( r ̄ L ) for any equilibrium refund amount of the high type, r H ∈ ( r ̄ L , r ̄ H ] , consider the following off-the-equilibrium message r ′ ∈ ( r ̄ L , r H ) as depicted in Figure 2. In Figure 2, it is easy to see that Π ( r f ( L ) , L , L ) = Π ( r ̄ L , L , H ) > Π ( r ′ , L , L ) , Π ( r ′ , L , H ) , which implies that r′ is equilibrium-dominated for low-quality types. Once the consumer observes r′ and therefore eliminates the possibility that the seller is a low type, it follows that Π(r _H, H, H) < Π(r′, H, H), since r f ( H ) < r ̄ L < r ′ < r H . Therefore, C–K Intuitive Criterion requires that λ ̂ 1 ( r ′ ) = 1 . This result implies that the equilibrium rate for a high type, r H ∈ ( r ̄ L , r ̄ H ] , fails to pass the Intuitive Criterion.

Figure 2:

C–K Intuitive Criterion.

Also, the Nelson effect is confirmed by this result. Since a higher amount of reward implies a lower effective price, this proposition suggests that a high-quality seller chooses a lower price, because it can generate more repeat purchases.

This result has some implication on social welfare. Although it is well known that a higher level of education than the Riley outcome in the job market signaling model of Spence (1973) is socially inefficient, it is not the case in our model, because a refund, which is a monetary transfer between a seller and a consumer, is not socially wasteful although it may be wasteful to the seller. If we assume that c = 0, as a high-type seller pays a larger refund amount to the first-period consumer, the effective price gets lower and this increases the first-period sale, and as a result, the second-period sale is also likely to be increased. This clearly improves social efficiency.

4.2 Pooling Equilibria

We now consider pooling equilibria. Let r _λ be a pooling equilibrium cashback amount. Under the most pessimistic belief, r _λ must satisfy the following two incentive compatibility conditions:

[ICL2] Π ( r λ , L , M ) ≥ Π ( r , L , L ) , ∀ r ≠ r λ ,

[ICH2] Π ( r λ , H , M ) ≥ Π ( r , H , L ) , ∀ r ≠ r λ .

The first inequality is the L type’s incentive compatibility condition which requires that an L type prefers being pooled by choosing r _λ rather than giving up looking better by deviating to r ^f(L). That is, r _λ should not be too far from r ^f(L). On the other hand, the second inequality is the H type’s incentive compatibility condition requiring that an H type also prefers being pooled by choosing r _λ rather than deviating from it and being perceived to be a low type, implying that r _λ should not be too far from r*(H, L).

The set of r _λ satisfying the two inequalities, [ICL2] and [ICH2], is illustrated in Figure 3, as [ r ̲ L λ , r ̄ L λ ] and [ r ̲ H λ , r ̄ H λ ] , respectively. Let E p = [ r ̲ L λ , r ̄ L λ ] ∩ [ r ̲ H λ , r ̄ H λ ] be the set of pooling equilibrium cashback amounts. It is characterized in the following proposition.

Figure 3:

Pooling equilibria.

Proposition 2

(i) For any λ ∈ (0, 1), there exist α ̃ ( λ ) ∈ 1 2 , 1 such that for any α ≤ α ̃ ( λ ) , E _p ≠ ∅. (ii) For any α > 1 2 , there exists λ ̲ ( α ) > 0 such that for any λ ≤ λ ̲ ( α ) , E _p =∅.

In pooling equilibria described above, the cashback amount cannot convey any information about the quality of the product and thus the review outcome is the only source from which consumers can get information about the quality.

We may resort to a more refined equilibrium concept such as Intuitive Criterion just as in the case of separating equilibria, if our equilibrium concept wPBE based on the most pessimistic belief appears to be too weak. However, it is not difficult to see that C–K Intuitive Criterion does not eliminate all the pooling equilibria, although it can eliminate some.

For example, in Figure 3, r _λ = r ^f(L) can be eliminated by CK Intuitive Criterion. If one takes an off-the-equilibrium message r ″ = r ̂ + ϵ , it clearly satisfies (CK-i), because Π(r ^f(L), L, M) > Π(r″, L, q ^e) for any q ^e = H, L. It is also clear that (CK-ii) is satisfied because Π(r ^f(L), H, M) < Π(r″, H, H). The intuitive reason why some pooling equilibria can be eliminated by Intuitive Criterion is that there exists a deviant message that cannot be profitable to a low-type seller but increases only the profit of a high-type seller.

On the other hand, Figure 3 also shows the possibility of a pooling equilibrium that survives the C–K Intuitive Criterion. Eliminating an equilibrium message r λ = r ̄ L λ requires that there exist an off-the-equilibrium message r′ that is equilibrium dominated for an L type, which would be possible if r ′ > r ̄ L . However, for any r ′ > r ̄ L , Π ( r ̄ L λ , H , M ) > Π ( r ′ , H , H ) . This means that none of such an off-the-equilibrium message r ′ > r ̄ L satisfies (CK-ii), implying that the original equilibrium passes the Intuitive Criterion. Intuitively, an equilibrium passes the Intuitive Criterion if there exists no deviant message that is not profitable to a low type but profitable only to a high type. When we consider the pooling equilibrium message r ̄ L λ in Figure 3, a deviation of a low type cannot be profitable if the deviant refund rate is too large, i.e. r ≥ r ̄ L , but those messages are not profitable to a high-type seller, either.

5 Discussion

Our analysis hitherto has been made for a simple model set up in Section 2. In this section, we will examine the robustness of our results by altering some assumptions.

5.1 Rational Consumer

While a naive second-period consumer updates his posterior belief only based on the review outcome, a consumer could be more rational by using more information. Since a high-quality product is sold more than a low-quality product and a consumer who purchases always submits a review in equilibrium, observing a review by a previous consumer can signal that it is more likely to be a high quality.

If the second-period consumer can utilize this information in addition to the review outcome, he will update his posterior belief twice. Let y ∈ Y = {0, 1} indicate the purchasing decision of the first-period consumer (or availability of a review by the first-period consumer) where y = 1 indicates “a review is available” and y = 0 indicates “a review is not available”. Also, let λ ̂ 2 , b ( y ) be the posterior belief of the second-period consumer updated from the prior belief λ based on his observation of y, and λ ̂ 2 , s ( s ) be his posterior belief updated from λ ̂ 2 , b ( y ) based on his observation of s. Note that λ ̂ 2 , b ( y ) is not updated from λ ̂ 1 which was updated by the first-period consumer based on his observation of r, because the second-period consumer is assumed that he cannot observe r. Then, the posterior beliefs are calculated as follows: If a review is available,

(10) λ ̂ 2 , b ( 1 ) = λ D ( r H , H ) λ D ( r H , H ) + ( 1 − λ ) D ( r L , L ) = λ λ + ( 1 − λ ) D ( r L , L ) D ( r H , H )

(11) > λ

and if a review is not available,

(12) λ ̂ 2 , b ( 0 ) = λ ( 1 − D ( r H , H ) ) λ ( 1 − D ( r H , H ) ) + ( 1 − λ ) ( 1 − D ( r L , L ) ) = λ λ + ( 1 − λ ) 1 − D ( r L , L ) 1 − D ( r H , H )

(13) < λ .

It is easy to see that λ ̂ 2 , b ( 1 ) > λ > λ ̂ 2 , b ( 0 ) because D(r _H, H) > D(r _L, L), assuming that r _H > r _L which will be shortly proved.

If a signal s is observed from a review, he updates the posterior belief one more time to

(14) λ ̂ 2 , s ( H ) = λ ̂ 2 , b ( 1 ) α λ ̂ 2 , b ( 1 ) α + ( 1 − λ ̂ 2 , b ( 1 ) ) ( 1 − α ) > λ ̂ 2 ( H ) ,

(15) λ ̂ 2 , s ( L ) = λ ̂ 2 , b ( 1 ) ( 1 − α ) λ ̂ 2 , b ( 1 ) ( 1 − α ) + ( 1 − λ ̂ 2 , b ( 1 ) ) α > λ ̂ 2 ( L ) .

Lemma 3

There exists α ̂ ∈ 1 2 , 1 such that for any α ≥ α ̂ , E(q|H, y = 1) > E(q|y = 0) > E(q|L, y = 1).

This is mainly because λ ̂ 2 , s ( H ) > λ ̂ 2 , b ( 0 ) > λ ̂ 2 , s ( L ) if α is accurate enough. The intuition goes as follows. It is obvious that the posterior belief that q = H is higher when the second-period rational consumer receives the review report s = H than when he receives no report. On the other hand, if the consumer receives the review s = L, it is both better news and worse news than when he receives no review. First, the information that the first-period consumer purchased the good and wrote a review updates the belief upwards to λ ̂ 2 , b ( 1 ) ( > λ ̂ 2 ( 0 ) ) . Second, the information that s = L updates the belief downwards to λ ̂ 2 , s ( L ) ( < λ ̂ 2 , b ( 1 ) ) . Therefore, it is unclear whether λ ̂ 2 , s ( L ) > λ ̂ 2 , b ( 0 ) or λ ̂ 2 , s ( L ) < λ ̂ 2 , b ( 0 ) . However, if the signal s is accurate enough, the second effect dominates the first effect; hence, λ ̂ 2 , s ( L ) < λ ̂ 2 , b ( 0 ) < λ ̂ 2 , s ( H ) . The counterpart of Proposition 1 for a rational consumer follows directly from this lemma.

Proposition 3

In the case of a rational consumer, there is some small ρ ̄ ′ ( > 0 ) such that for any ρ ≤ ρ ̄ ′ , there exist α ̲ ′ ( > 1 2 ) and α ̄ ′ such that for any α ∈ ( α ̲ ′ , α ̄ ′ ) ≠ ∅ , r L ′ , r H ′ = ( r f ′ ( L ) , r ̄ L ′ ) is the unique separating equilibrium that satisfies the C–K Intuitive Criterion. In this equilibrium, r L ′ < r H ′ .

Since Lemma 1 and Lemma 2 still hold with a minor modification of replacing λ ₂(s) by λ ̂ 2 , s ( s ) , it is straightforward that Proposition 1 remains unaffected qualitatively in the case of a rational consumer. The counterpart of Proposition 2 for a rational consumer can be similarly derived from Lemma 1, so we omit it.

5.2 Endogenizing Price

To highlight the role of the refund rate as a quality signal, we have, so far, assumed that price is exogenously given. In this section, we assume that in the first period, the seller chooses the price that will prevail in both periods together with the refund rate.

In this modified game, there may be a separating equilibrium in which the seller uses a separating price strategy as well as a separating refund policy. In the equilibrium, however, the review of the first-period consumer does not convey better information to the second-period consumer than the separating price, because the second-period consumer can infer the true quality of the product from observing the price. Naturally, it is our main interest to see whether there is a separating equilibrium in which the type of a seller is revealed only by the refund rate in the first period. Since our previous analysis was based on the implicit assumption that a pooling equilibrium price exists, the analysis of this section can provide a justification for it.

We can rewrite the profit function of a high-type seller by using Equation (5) as

(16) Π ( p , r , H , q e ) = ( p − c − r ) 1 − p + k − r q e + δ ( p − c ) 1 − p + k − r q e 1 − α p E ( q | H ) − ( 1 − α ) p E ( q | L ) + δ ( p − c ) p + k − r q e 1 − p M = ( p − c − r ) D + δ ( p − c ) ( 1 − p Q H ) ,

where D = 1 − p + k − r q e and Q H = D α E ( q | H ) + 1 − α E ( q | L ) + 1 − D M . Here, D is the first-period demand, Q _H is, roughly speaking, the inverse of expected quality given the information available in the second period when the true quality is H, and thus 1 − pQ _H is the second-period demand. We can denote the price and the refund rate maximizing (16) by p*(H, q ^e) and r*(H, q ^e).

On the other hand, the profit function of a low-type seller can be rewritten by using Equation (6) as

(17) Π ( p , r , L , q e ) = ( p − c − r ) 1 − p + k − r q e + δ ( p − c ) 1 − p + k − r q e 1 − ( 1 − α ) p E ( q | H ) − α p E ( q | L ) + δ ( p − c ) p + k − r q e 1 − p M = ( p − c − r ) D + δ ( p − c ) ( 1 − p Q L ) ,

where Q L = D 1 − α E ( q | H ) + α E ( q | L ) + 1 − D M . We can denote the price and the refund rate maximizing (17) p*(L, q ^e) and r*(L, q ^e).

By abusing notation, let (p _λ, r _H) and (p _λ, r _L) be the separating strategy pairs of types H and L. Under the most pessimistic belief, the incentive compatibility conditions are as follows.

(18) Π ( p λ , r H , H , H ) ≥ Π ( p , r , H , L ) , ∀ p , ∀ r ,

(19) Π ( p λ , r L , L , L ) ≥ Π ( p λ , r H , L , H ) .

There can be many equilibrium strategies that satisfy both (18) and (19). Figure 4 illustrates the set of the equilibrium refund rates of the high type r _H with the pooling price p _λ. In the blue region, the low type cannot imitate the high type; i.e. the high-type equilibrium pairs in the blue region satisfy (19). Similarly, in the green circular region, the high type cannot deviate from its equilibrium strategies. The set of (r, p) indicated in red, which is the intersection of the blue region and the green region that satisfy both incentive compatibility conditions, is the high-type’s separating equilibrium pairs of the price and refund rate. The intersection of the red set and the horizontal line at p = p _λ are the set of separating equilibria (p _λ, r _H) we are interested in.

Figure 4:

Separating equilibria with a pooling price and separating refund rates.

We can also show that point B is the only separating equilibrium that passes the Intuitive Criterion. For example, consider point B′ inside the equilibrium set. Take any point B″ between B and B′. Then, it is not difficult to see that it is equilibrium dominated for a low type. Note that the iso-profit curve (of true type L and perceived type H) passing through its equilibrium point A = (p _λ, r _L) yields a higher profit than the iso-profit curve possing through B″. Then, it is obvious that if the low type is perceived to be a low type, its profit is even lower when it chooses point B″. On the other hand, it is clear from the figure that the iso-profit curve of the high type passing through point B″ yields a higher profit than point B′. Therefore, such a deviation must be believed to come from a high-type seller, implying that the original equilibrium point B′ fails to pass the Intuitive Criterion. Similar arguments can show that point B is the unique separating equilibrium that survives the Intuitive Criterion.

Although our main interest is in separating equilibria, there can be also pooling equilibria. In a pooling equilibrium, a low type successfully imitates a high type, while the high type gives up preventing the low type from imitating itself because it is too costly. Thus, we will consider the undistorted price and refund rate of the high-type seller when he is perceived to be M = λH + (1 − λ)L.

Under incomplete information, both types of seller use the same price and refund rate. Let (p _λ, r _λ) be the pooling equilibrium pairs of the price and refund rates of types H and L where p _λ = p*(H, M) and r _λ = r*(H, M). Under the most pessimistic belief, i.e. λ ̂ ( p , r ) = 0 for any (p, r) ≠ (p _λ, r _λ), it must satisfy the following incentive compatibility conditions.

(20) Π ( p λ , r λ , H , M ) ≥ Π ( p , r , H , L ) , ∀ p , ∀ r ,

(21) Π ( p λ , r λ , L , M ) ≥ Π ( p * ( L , L ) , r * ( L , L ) , L , L ) .

Both inequalities require that neither type of seller has an incentive to deviate from the pooling price and refund rate.

It is clear that Inequality (20) is always satisfied, that is, the high-type seller has no incentive to deviate from (p _λ, r _λ), because it is the optimal strategy of a high type when q ^e = M and any deviation from it simply leads into the belief that the deviant seller is a low quality type.

Let us consider the incentive compatibility of the low type. We start from the case that δ = 0. Then, it is also clear that (21) is trivially satisfied, because (p*(H, M), r*(H, M)) maximizes Π ( p , r , H , M ) = Π ( p , r , L , M ) = ( p − c − r ) 1 − p + k − r M . In fact, any pair (p, r) such that p − r = p ̃ = M − k + c 2 is optimal. Now, we increase δ from zero. If δ > 0, the optimal pair of (p, r) is no longer indeterminate. Now, Π of either type depends both on p and r, not a function of the effective price p ̃ solely. We will assume that the optimal pair of (p, r) is unique for any (q, q ^e) when δ > 0. If δ > 0, we cannot generally compare the relative size of Π(p, r, L, M) and Π(p, r, L, L), because the first-period profit of a low type is clearly higher when the perceived quality is M but the second-period profit of the low-type seller is higher when the perceived quality is L. Intuitively, this is because if the first-period sales volume is high due to its high perceived quality, its second-period demand can be low due to bad reviews. However, if δ is very low, the first-period effect outweighs the second-period effect, implying that Π(p, r, L, M) > Π(p, r, L, L). Also, since Π(p, r, L, M) is continuous with respect to δ, there exists δ ̄ ( > 0 ) such that for any δ ≤ δ ̄ , Π(p*(H, M), r*(H, M), L, M) > Π(p*(L, L), r*(L, L), L, L).

5.3 Conditional Cashback Policy

In this section, we consider an extended model in which the seller can choose a conditional cashback policy of paying only a consumer who writes a positive review on its product.

Under the conditional cashback policy, the first-period consumer’s decisions are significantly affected. Although we argued that a consumer never makes a dishonest review under the unconditional cashback policy due to a positive lying cost, it is possible that if he can get a refund only for a positive review, he may choose “H” message in his review after receiving a signal s = L even with the lying cost. We denote the lying cost by η( > 0).

Consider the first-period consumer’s reviewing decision given a conditional refund policy. It is clear that if he receives s = H, he has no reason to lie so he will review honestly if the benefit exceeds the cost, i.e. r ≥ k. If he receives s = L, he can get paid only when he lies, so he will review by choosing a dishonest message “H” to get a positive amount of cashback. Thus, he will review if r ≥ k + η. The reviewing decision of the first-period consumer given a conditional refund rate can be summarized as follows; (i) if r ≥ k + η, a consumer who purchases reviews for any s = H or L, (ii) if k ≤ r < k + η, only the consumer who receives s = H after purchasing reviews, and (iii) if r < k, a consumer who purchases never reviews for any s = H or L. In the first case that r ≥ k + η, the review message conveys no useful information as in a babbling equilibrium in a cheap talk game, although the review message is not cheap talk in the sense that it has a negative cost of r and the cost differs across messages, i.e. it is payoff-relevant.

Now, we consider the seller’s policy choice. The main advantage of using a conditional refund policy is to induce more positive reviews. It is clear that the advantage is larger for the low-type sellers than for the high-type seller because the low-quality product is more likely to generate a low signal s = L. Thus, it is our first guess that an unconditional refund policy signals high product quality while a conditional refund policy signals low quality. In other words, our main interest is whether there exists a separating equilibrium in which a high-quality seller chooses an unconditional refund policy and a low-quality seller chooses a conditional refund policy in equilibrium.

If a high-type seller can signal its high quality by using an unconditional refund policy itself (rather than a conditional policy), it does not need to involve a costly distortion in its price and refund rate. So, suppose that a high-quality seller charges p ^f(H) and offers r ^f(H) unconditionally while a low-quality seller charges p ^f(L) and offers r ^f(L) conditionally in equilibrium. In this equilibrium, the low-type seller’s total profit is

(22) Π ( L , L ) = ( p f ( L ) − c − r f ( L ) ) 1 − p f ( L ) + k − r f ( L ) L + δ ( p f ( L ) − c ) 1 − p f ( L ) Q L .

If the low-type seller deviates by imitating a high-type seller, its profit is

(23) Π ( L , H ) = ( p f ( H ) − c − r f ( H ) ) 1 − p f ( H ) + k − r f ( H ) H + δ ( p f ( H ) − c ) 1 − p f ( H ) Q H .

Intuitively, if a low type imitates a high type by using the unconditional refund policy with (p ^f(H), r ^f(H)), it can increase the first-period demand by being perceived to be a high type. So, if δ = 0, the low type will always imitate the high type. As δ gets larger, the cost of imitation gets larger in the second period, because a low message is more costly when the first-period sales volume is larger. The following proposition captures this intuition.

Proposition 4

There exists a separating outcome in which a high type seller chooses an unconditional refund policy and a low type seller chooses a conditional refund policy if L ≤ L ̄ for some L ̄ > 0 .

The proof of this proposition in the Appendix shows that it is more difficult for a low type to imitate a high type as the signal gets more accurate (i.e. α gets larger), δ gets larger, and the quality difference gets larger (i.e. H − L gets larger). For example, (38) shows that as δ gets larger, the imitation’s positive effect of increasing the first-period sales is more likely to be outweighed by the second-period negative effect. It is also clear that as the signal gets more accurate, the second-period negative effect gets larger. If the quality difference gets larger, the low type’s second-period negative effect gets much larger.

In this equilibrium, the first-period consumer infers the quality by using the signal of the refund policy and the second-period consumer infers the quality by using the signal of the first-period consumer’s message. Note that a negative review may be more helpful to a seller than a positive review in this separating equilibrium, because it is a signal that the seller is a high type.

6 Conclusion and Caveats

In this paper, we showed that a seller of an experience good can signal the quality of its product by the incentive scheme of paying a high amount of reward for a consumer review. This scheme can help the seller sell more in the first period by lowering the effective price, although whether it can help more second-period consumers buy may depend on the quality of the product. Thus, the seller producing a low-quality good can also use this incentive scheme at least to increase the first-period sales.

We also characterized pooling equilibria as well as separating equilibria, and showed their existence depends on various parameter values. In particular, Proposition 1 and Proposition 2 say that if λ is small, no pooling equilibrium exists and only separating equilibria are possible. This result may deserve to be empirically tested. For example, we may consider the following testing hypotheses; (i) sellers of credence goods are less likely to use a consumer review policy than sellers of experience goods, and (ii) items whose quality is less known are more likely to use a review policy. In the first hypothesis, a credence good can be regarded as generating more noisy signals relative to experience goods. The noisiness of the signals may be measured by how diverse the evaluations of consumers are. The second hypothesis is to examine the relation between consumers’ initial belief and the tendency of using a review policy.

However, we must admit that the analysis is rather restrictive. Above all things, we assumed that the seller is a monopoly in this paper. If there are several sellers competing in the product market, refund rates may be determined from a strategic consideration. It will be interesting to address the issue in the future.

Corresponding author: Jeong-Yoo Kim, Department of Economics, Kyung Hee University, 1 Hoegidong, Dongdaemunku, Seoul 130-701, Korea, E-mail: jyookim@khu.ac.kr

This research was supported by the Ministry of Education of the Republic of Korea and the National Research Foundation of Korea (NRF-2022S1A5A2A0304932311).

Appendix

Proof of Lemma 1

We first prove (ii). The first order conditions for the problem of maximizing Π(r, H, H) and Π(r, L, L) with respect to r are given by

(24) Π r ( r , H , H ) = 1 H ( p − r + k ) + ( p − c − r ) + δ ( p − c ) Δ H = 0 ,

(25) Π r ( r , L , L ) = 1 L ( p − r + k ) + ( p − c − r ) + δ ( p − c ) Δ L = 0 ,

where

(26) Δ H = α 1 − p E ( q | H ) + ( 1 − α ) 1 − p E ( q | L ) − 1 − p M ,

(27) Δ L = ( 1 − α ) 1 − p E ( q | H ) − α 1 − p E ( q | L ) − 1 − p M .

From (26) and (27), we obtain

(28) r * ( H , H ) = 2 p + k − c + δ ( p − c ) Δ H 2 ,

(29) r * ( L , L ) = 2 p + k − c + δ ( p − c ) Δ L 2 .

Since Δ_H > Δ_L as long as α > 1 2 , the result follows.

From (5), we have

Π ( r , H , H ) − Π ( r , H , L ) = ( D ( r , H ) − D ( r , L ) ) [ ( p − c − r ) + δ ( p − c ) Δ H ] .

Note that Δ_H > 0 for any α > 1 2 , because E(q|H) > E(q|L). Since D(r, H) > D(r, L), we have Π(r, H, H) > Π(r, H, L) for any r such that r < p − c. This implies that Π*(H, H) > Π*(H, L).

Similarly, from (6), we have

Π ( r , L , H ) − Π ( r , L , L ) = ( D ( r , H ) − D ( r , L ) ) [ ( p − c − r ) + δ ( p − c ) Δ L ] .

Note that Δ_L < 0 for any α > 1 2 . If α = 1 2 , Δ_L = 0. Due to the continuity, there exists α ₀ such that for any α ∈ ( 1 2 , α 0 ) , Π(r, L, H) > Π(r, L, L). This implies that Π*(L, H) > Π*(L, L) for any α ∈ ( 1 2 , α 0 ) .

Finally, let us compare (5) and (6). It is easy to see that the first terms are the same while the second term in (5) is greater than that in (6) because Δ_H > Δ_L. So, Π(r, H, q ^e) > Π(r, L, q ^e) for any q ^e = H, L. This completes the proof. □

Proof of Proposition 1

The two incentive compatibility conditions given by [ICL1] and [ICH1] imply that the set of r _H that are possible in any separating equilibrium is [ r ̲ L , r ̲ H ] ∪ [ r ̄ L , r ̄ H ] .

By Lemma 1, it remains to show that r H = r ̄ L is the unique r _H that satisfies the C–K Intuitive Criterion.

Claim 1

r ̄ L < r ̄ H , i.e. [ r ̄ L , r ̄ H ] ≠ ∅ .

Proof

We will show that there exists α 1 ( > 1 2 ) such that Π(r ^f(H), L, H; α ₁) = Π(r ^f(L), L, L; α ₁). To prove it, we will consider two extreme cases.

First, if α = 1 2 , Δ_H = Δ_L, so the signal is not informative at all, i.e. the second-period profits of both types are the same. Thus, we have
(30) Π ( r , q , q e ) = ( p − c − r ) D ( r , q e ) + δ ( p − c ) 1 − p M .
This implies that Π(r, q, q ^e) does not depend on q but only on q ^e. From (30), we obtain that Π(r ^f(q), H) > Π(r ^f(q), L) for any q = H, L and r ^f(H) = r ^f(L). Therefore, it is clear that r ̄ L > r f ( H ) , implying that an L type has an incentive to imitate an H type when an H-type seller chooses r ^f(H). So, [DC] condition is satisfied, i.e. Π ( r f ( H ) , L , H ; 1 2 ) > Π ( r f ( L ) , L , L ; 1 2 ) .

Next, if α = 1, we can calculate r ^f(H) from the first order condition (24) as
(31) r f ( H ) = 1 2 [ p + k + δ ( p − c ) ( 1 + Δ H ) ] .
Let r ̄ be the maximum r such that Π(r, L, H) ≥ 0. Then, after tedious algebra, we can find that r ̄ < r f ( H ) if and only if
(32) δ ( p − c ) p 1 L − 1 H − 1 − p M + H 2 > δ ( p − c ) 2 − p L − 1 − p M + p − H 2 + δ ( p − c ) 4 p 1 − p M + 2 − p L H .

Assume that p = c, i.e. ρ = 0. Then, (32) is reduced to H ² > (p − H)², i.e. p < 2H. So, if p ≤ H, it holds. Since r ̲ H > r f ( H ) > ρ > r ̄ L , it implies that [DC] condition is violated if α = 1, i.e. Π(r ^f(H), L, H; 1) < Π(r ^f(L), L, L; 1). By continuity with respect to ρ, this inequality holds for any ρ ∈ ( 0 , ρ ̄ ) for some small ρ ̄ . Also, by continuity with respect to α, there exists α 1 ( > 1 2 ) such that Π(r ^f(H), L, H; α ₁) = Π(r ^f(L), L, L; α ₁).
Note that Δ_H(α) is strictly increasing in α and Δ_L(α) is strictly decreasing in α. Therefore, Π(r ^f(H), L, H; α) is strictly decreasing in α and Π(r ^f(L), L, L; α) is strictly increasing in α. This implies that for any α < α ₁, Π(r ^f(H), L, H; α) > Π(r ^f(L), L, L; α), i.e. [DC] condition is satisfied.
Similarly, we know that r ̄ L ( 1 2 ) = r ̄ H ( 1 2 ) if α = 1 2 and that r ̄ L ( α 1 ) < r ̄ H ( α 1 ) if α = α ₁. Thus, by continuity, there exists α ₂ = α ₁ − ϵ for some ϵ > 0 such that for any α ∈ (α ₂, α ₁), r ̄ L ( α ) < r ̄ H ( α ) .
Also, note that satisfaction of [DC] condition (Π(r ^f(H), L, H) > Π(r ^f(L), L, L)) implies that Π(r, L, H) > Π(r, L, L) for any r. That is, if α < α ₁, α < α ₀. This implies that α ₁ < α ₀.
Take α ̲ = α 2 and α ̄ = min { α 0 , α 1 } = α 1 . Then, for any α ∈ ( α ̲ , α ̄ ) , [DC] condition is satisfied and [ r ̄ L ( α ) , r ̄ H ( α ) ] ≠ ∅ . Therefore, the proof is completed. □

Claim 2

Every r H ∈ ( r ̄ L , r ̄ H ] fails to satisfy the C–K Intuitive Criterion.

Proof

For any r H ∈ ( r ̄ L , r ̄ H ] , take r H ′ = r H − ϵ ∈ ( r ̄ L , r H ) , and observe the following two inequalities:

(33) π ( r f ( L ) , L , L ) ≥ π r H ′ , L , q e , ∀ q e = L , H ,

(34) π ( r H , H , H ) < π r H ′ , H , H ,

since r f ( H ) < r H ′ < r H . Therefore, no r H ∈ ( r ̄ L , r ̄ H ] satisfies the C–K Intuitive Criterion. □

Claim 3

r H = r ̄ L satisfies the C–K Intuitive Criterion.

Proof

If we take any r H ′ > r H , then r H ′ satisfies (CK-i) given by (8) but does not satisfy (CK-ii) given by (9), because r f ( H ) < r H < r H ′ . If we take any r H ′ ∈ ( r ̲ L , r ̄ L ) , then r H ′ does not satisfy (CK-i) by the definitions of r ̲ L and r ̄ L . Finally, if we take any r H ′ ≤ r ̲ L , then r H ′ satisfies (CK-i) but does not satisfy (CK-ii), because r ̲ L and r ̄ L are equidistant from r*(L, H), whereas r*(L, H) < r ^f(H) implies that π r H ′ , H , H ≤ π ( r ̲ L , H , H ) < π ( r ̄ L , H , H ) = π ( r H , H , H ) . □

Claim 4

Every r H ∈ [ r ̲ H , r ̲ L ) fails to satisfy the C–K Intuitive Criterion.

Proof

For any r _H, take r H ′ = r H + ϵ ∈ ( r H , r ̲ L ) . It is clear that r H ′ satisfies (CK-i) because r H ′ < r ̲ L . It also satisfies (CK-ii) because r H < r H ′ < r f ( H ) . □

Claim 5

r H = r ̲ L fails to satisfy the C–K Intuitive Criterion.

Proof

Take r H ′ = r ̄ L . Clearly, r H ′ satisfies (CK-i) by definition of r ̄ L . It also satisfies (CK-ii) for the same reason as in the proof of Claim 2. □

Proof of Proposition 2

We will first prove (ii) and then prove (i).

(ii) By Lemma 1, r ^f(L) = r*(L, L) < r*(H, L) for any α > 1 2 . Now, if λ = 0 so that M = L, both incentive compatibility conditions consist of a single point, i.e. [ r ̲ L λ , r ̄ L λ ] = { r * ( L , L ) } and [ r ̲ H λ , r ̄ H λ ] = { r * ( H , H ) } . Therefore, it is clear that E p = [ r ̲ L λ , r ̄ L λ ] ∩ [ r ̲ H λ , r ̄ H λ ] = ∅ . As λ increases, r ̲ L λ , r ̄ L λ , r ̲ H λ and r ̄ H λ all change continuously, implying that E p = [ r ̲ L λ , r ̄ L λ ] ∩ [ r ̲ H λ , r ̄ H λ ] = ∅ for any λ ≤ λ ̲ for some λ ̲ .

(i) First note that the two incentive compatibility conditions are completely equivalent if α = 1 2 . To see this, note that the second terms of Π(r, H, q ^e) and Π(r, L, q ^e) are the same if α = 1 2 . Since the first terms of Π(r, H, q ^e) and Π(r, L, q ^e) depend only on q ^e, it is clear that Π(r, H, q ^e) and π(r, L, q ^e) are the same. Since Π(r _λ, H, M) = π(r _λ, L, M) and Π(r, H, L) and Π(r, L, L) for all r, the two incentive compatibility conditions are completely equivalent. This implies that [ r ̲ L λ , r ̄ L λ ] = [ r ̲ H λ , r ̄ H λ ] if α = 1 2 , meaning that E _p ≠ ∅. Also, as α increases, r ̲ L λ , r ̄ L λ , r ̲ H λ and r ̄ H λ all change continuously, implying that E p = [ r ̲ L λ , r ̄ L λ ] ∩ [ r ̲ H λ , r ̄ H λ ] ≠ ∅ for any α ≤ λ ̃ for some λ ̃ . □

Proof of Lemma 3

It suffices to show that λ ̂ 2 , s ( H ) > λ ̂ 2 , b ( 0 ) > λ ̂ 2 , s ( L ) if α ≥ α ̂ for some α ̂ . Since it is clear from (10), (11), and (12) that λ ̂ 2 , s ( H ) > λ ̂ 2 , b ( 1 ) > λ ̂ 2 , b ( 0 ) , we only need to show that there exists α ̂ such that for any α ≥ α ̂ , λ ̂ 2 , b ( 0 ) > λ ̂ 2 , s ( L ) . It follows from (11) and (13) that

(35) λ ̂ 2 , b ( 0 ) = 1 1 + 1 − λ λ 1 − D ( r L , L ) 1 − D ( r H , H ) ,

(36) λ ̂ 2 , s ( L ) = 1 1 + 1 − λ ̂ 2 , b ( 1 ) λ ̂ 2 , b ( 1 ) α 1 − α .

It is clear that 1 − λ λ 1 − D ( r L , L ) 1 − D ( r H , H ) < 1 − λ ̂ 2 , b ( 1 ) λ ̂ 2 , b ( 1 ) α 1 − α if α → 1. Therefore, there exists α ̂ such that for any α ≥ α ̂ , λ ̂ 2 , b ( 0 ) > λ ̂ 2 , s ( L ) . □

Proof of Proposition 4

Define Δ = Π(L, H) − Π(L, L). We only need to find L ̄ > 0 such that Δ < 0 for any L ≤ L ̄ . For simplicity, we assume that c = 0. Also, for notational brevity, let p ^f(q) = p _q and r ^f(q) = r _q. As α → 1, we have

(37) Δ = ( p H − r H ) D H + δ p H ( 1 − p H α H ) − [ ( p L − r L ) D L δ p L ( 1 − p L α L ) ) ] ,

where D q = 1 − p q + k − r q q and α q = D q L + 1 − D q M . Note that D _H > D _L and α _H > α _L. Rearranging (37) yields

(38) Δ = [ ( p H − r H ) D H − ( p L − r L ) D L ] + δ p H − p L − p H 2 α H − p L 2 α L .

Since p H 2 α H − p L 2 α L → 1 L p H 2 D H − p L 2 D L → + ∞ as L → 0 due to p H 2 D H > p L 2 D L , it follows that there exists L ̄ > 0 such that Δ < 0 if L ≤ L ̄ . □

References

Bagwell, K., and M. Riordan. 1991. “High and Declining Prices Signal Product Quality.” American Economic Review 81: 224–39.Search in Google Scholar

Berg, N., J.-Y. Kim, and I. Seon. 2021. “A Performance-Based Payment: Signaling the Quality of a Credence Good.” Managerial and Decision Economics 42: 1117–31. https://doi.org/10.1002/mde.3295.Search in Google Scholar

Chen, Y., N. Kartik, and J. Sobel. 2008. “Selecting Cheap-Talk Equilibria.” Econometrica 76: 117–36. https://doi.org/10.1111/j.0012-9682.2008.00819.x.Search in Google Scholar

Chen, Y., R. Zhang, and B. Liu. 2019. “Pricing Decisions on Reward-Based Crowdfunding with Bayesian Review System Facing Strategic Consumers.” Discrete Dynamics in Nature and Society 2019: 1–14. https://doi.org/10.1155/2019/4592690.Search in Google Scholar

Chen, Y., and J. Xie. 2008. “Word-of-Mouth as a New Element of Marketing Communication Mix.” Management Science 54: 477–91. https://doi.org/10.1287/mnsc.1070.0810.Search in Google Scholar

Chevalier, J. A., and D. Mayzlin. 2006. “The Effect of Word of Mouth on Sales: Online Book Reviews.” Journal of Marketing Research 43: 345–54. https://doi.org/10.1509/jmkr.43.3.345.Search in Google Scholar

Cho, I., and D. Kreps. 1987. “Signaling Games and Stable Equilibria.” Quarterly Journal of Economics 102: 179–221. https://doi.org/10.2307/1885060.Search in Google Scholar

Crawford, V., and J. Sobel. 1982. “Strategic Information Transmission.” Econometrica 50: 1431–51. https://doi.org/10.2307/1913390.Search in Google Scholar

Darby, M., and E. Karni. 1973. “Free Competition and the Optimal Amount of Fraud.” Journal of Law and Economics 16: 67–88. https://doi.org/10.1086/466756.Search in Google Scholar

Dorner, V., M. Giamattei, and M. Greiff. 2020. “The Market for Reviews: Strategic Behavior of Online Product Reviewers with Monetary Incentives.” Schmalenbach Business Review 72: 397–435. https://doi.org/10.1007/s41464-020-00094-y.Search in Google Scholar

Fluet, C., and P. Garella. 2002. “Adversiting and Prices as Signals of Quality in a Regime of Price Rivalry.” International Journal of Industrial Organization 20: 907–30. https://doi.org/10.1016/s0167-7187(01)00075-3.Search in Google Scholar

Fradkin, A., E. Grewal, and D. Holtz. 2021. “Reciprocity and Unveiling in Two-Sided Reputation Systems: Evidence from an Experiment on Airbnb.” Marketing Science 40: 1009–216.10.1287/mksc.2021.1311Search in Google Scholar

He, S., B. Hollenbeck, and D. Proserpio. 2022. “The Market for Fake Reviews.” Marketing Science 41: 871–1027. https://doi.org/10.1287/mksc.2022.1353.Search in Google Scholar

Hertzendorf, M. 1993. “I’m Not A High-Quality Firm – But I Play One on TV.” RAND Journal of Economics 24: 236–47. https://doi.org/10.2307/2555760.Search in Google Scholar

Jin, C., L. Yang, and K. Hosanagar. 2022. “To Brush or Not to Brush: Product Rankings, Consumer Search, and Fake Orders.” Information Systems Research 34: 1–21.10.1287/isre.2022.1128Search in Google Scholar

Judd, K., and M. Riordan. 1994. “Price and Quality in a New Product Monopoly.” Review of Economic Studies 61: 773–89. https://doi.org/10.2307/2297918.Search in Google Scholar

Kartik, N. 2009. “Strategic Communication with Lying Costs.” Review of Economic Studies 76: 1359–95. https://doi.org/10.1111/j.1467-937x.2009.00559.x.Search in Google Scholar

Li, L., S. Tadelis, and X. Zhou. 2020. “Buying Reputation as a Signal of Quality: Evidence from an Online Marketplace.” RAND Journal of Economics 51: 965–88. https://doi.org/10.1111/1756-2171.12346.Search in Google Scholar

Liu, X. 2017. “The Signaling Study of Advance Selling Considering the Product Diffusion and Online Reviews Effect.” Journal of Service Science and Management 10: 72–86. https://doi.org/10.4236/jssm.2017.101006.Search in Google Scholar

Liu, Y. 2006. “Word of Mouth for Movies: Its Dynamics and Impact on Box Office Revenue.” Journal of Marketing 70: 74–89. https://doi.org/10.1509/jmkg.70.3.074.Search in Google Scholar

Luca, M., and G. Zervas. 2016. “Fake it Till You Make it: Reputation, Competition, and Yelp Review Fraud.” Management Science 62 (12): 3412–27. https://doi.org/10.1287/mnsc.2015.2304.Search in Google Scholar

Milgrom, P., and J. Roberts. 1986. “Price and Advertising Signals of Product Quality.” Journal of Political Economy 94: 796–821. https://doi.org/10.1086/261408.Search in Google Scholar

Nelson, P. 1970. “Information and Consumer Behavior.” Journal of Political Economy 78: 311–29. https://doi.org/10.1086/259630.Search in Google Scholar

Nelson, P. 1974. “Advertising as Information.” Journal of Political Economy 81: 729–54. https://doi.org/10.1086/260231.Search in Google Scholar

New York Times. 2004. Amazon Glitch Unmasks War of Reviewers, February 14, Al.Search in Google Scholar

Schmalensee, R. 1978. “A Model of Advertising and Product Quality.” Journal of Political Economy 86: 485–503. https://doi.org/10.1086/260683.Search in Google Scholar

Schuckert, M., X. Liu, and R. Law. 2016. “Insights into Suspicious Online Ratings: Direct Evidence from TripAdvisor.” Asia Pacific Journal of Tourism Research 21 (3): 259–72. https://doi.org/10.1080/10941665.2015.1029954.Search in Google Scholar

Spence, M. 1973. “Job Market Signaling.” Quarterly Journal of Economics 87: 355–74. https://doi.org/10.2307/1882010.Search in Google Scholar

Spiegel Research Center. 2021. How Online Reviews Influence Sales. https://spiegel.medill.northwestern.edu.Search in Google Scholar

Tirole, J. 1988. The Theory of Industrial Organization. Cambridge: MIT Press.Search in Google Scholar

Wolinsky, A. 1983. “Price as Signals of Product Quality.” Review of Economic Studies 50: 647–58. https://doi.org/10.2307/2297767.Search in Google Scholar

Xu, H., D. Liu, H. Wang, and A. Stavrou. 2015. “E-Commerce Reputation Manipulation: The Emergence of Reputation-Escalation-as-a-Service.” In Proceedings of 24th World Wide Web Conference (WWW 2015), 1296–306.10.1145/2736277.2741650Search in Google Scholar

Received: 2023-07-15

Accepted: 2023-11-29

Published Online: 2023-12-18

Published in Print: 2023-08-15

This work is licensed under the Creative Commons Attribution 4.0 International License.

Articles in the same Issue

https://doi.org/10.1515/rne-2023-0041

Keywords for this article

experience good; online consumer review; quality; signal; fake review; word-of-mouth (WOM) communication

Creative Commons

BY 4.0