Judicial Torture as a Screening Device

Kong-Pin Chen; Tsung-Sheng Tsai

doi:10.1515/bejte-2014-0023

Enjoy 40% off

academic books on De Gruyter Brill *

Article Publicly Available

Judicial Torture as a Screening Device

Kong-Pin Chen and Tsung-Sheng Tsai

Published/Copyright: February 26, 2015

Published by

Become an author with De Gruyter Brill

Submit Manuscript Author Information Explore this Subject

From the journal The B.E. Journal of Theoretical Economics Volume 15 Issue 2

Abstract

Judicial torture to extract information or to elicit a confession was a common practice in pre-modern societies, both in the east and the west. This paper proposes a positive theory for judicial torture. It is shown that torture reflects the magistrate’s attempt to balance type I and type II errors in the decision-making, by forcing the guilty to confess with a higher probability than the innocent, and thereby decreases the type I error at the cost of the type II error. Moreover, there is a non-monotonic relationship between the superiority of torture and the informativeness of investigation: when investigation is relatively uninformative, an improvement in technology used in the investigation actually lends an advantage to torture so that torture is even more attractive to the magistrates; however, when technological progress reaches a certain threshold, the advantage of torture is weakened, so that a judicial system based on torture becomes inferior to one based on evidence. This result can explain the historical development of the judicial system.

Keywords: torture; type I and type II errors; evidence

1 Introduction

Judicial torture for the purpose of eliciting information was a common practice in pre-modern societies. In the west, it emerged in Greek law and continued in Roman law. Its history can then be traced through the Middle Ages, down to the legal reforms of the eighteenth century and the abolition of torture in criminal legal procedure in the nineteenth century in most parts of Europe (Peters 1985, 5). In China, judicial torture formed an important part of the imperial legal codes from the first Empire (Qin dynasty, 221–207 bc) to the last Qing dynasty (1644–1911) (Shen 1985). Various forms of judicial torture were also widely implemented in Muslim, African and various Asian societies before our time (Lea 1971, 203–8). It is indeed hard to find any pre-modern society that did not rely on torture for the gathering of evidence in judicial proceedings.

The modern man, for whom judicial torture is not only immoral but also irrational, may imagine that it was inflicted exclusively on the accused to obtain a confession. Nothing is further from the truth. First, judicial torture was officially institutionalized, as its practice was explicitly written into many legal codes. Second, torture was explicitly applied not only to the suspect, but also to the witness, and even to the plaintiff. Witnesses were tortured in Greek and Roman cases, and this continued in the medieval period and until the late eighteenth century (Peters 1985, 18, 69). The historical development of judicial torture has indicated that it was an institutionalized practice, rather than a manifestation of a magistrate’s abuse of power.

During the Middle Ages, European courts had increasingly used torture as a legitimate means to extract confessions or to obtain the accomplice’s identity or other information regarding a crime. In 1252, Pope Innocent IV issued a papal bull which authorized the use of torture by inquisitors. The application of torture began to appear in significant numbers from around the thirteenth century onward and peaked in the sixteenth century. However, starting from around the mid-eighteenth century, the practice of judicial torture in the west was gradually replaced by a system which was based on evidence. Its abolition was a long and gradual process that lasted from the mid-eighteenth to the early nineteenth centuries. The conventional historical explanation of this abolition movement relies heavily on the influence of the humanists of the Enlightenment. However, experts on jurisprudence usually find this explanation too loose, and prefer to explain the phenomenon by changes in the judicial system itself. According to John Langbein, a famous legal scholar, the abolition was largely due to the emergence of the new law of proof. During this period, the courts gradually imposed new and less rigorous punishments according to a less strict standard of proof, one of persuasion rather than certainty. Since certainty was no longer the only requirement to put the accused in prison, torture became unnecessary. “Only when confession evidence was no longer necessary to convict the guilty could European law escape its centuries of dependence on judicial torture” (Langbein 1983, 1555–6). This explanation, however, is incomplete because, as noted by Damaška (1978) and Silverman (2001), torture continued to be used well after the change in the law of proof.

In this paper, we propose a positive model for judicial torture based on the theory of information economics to explain its rise, decline and eventual abolishment in the legal system. We explain why under a certain environment a system based on torture might become a prevailing judicial system. Besides the pain inflicted on the accused, there are two consequences of torture. On the one hand, since some innocent suspects are tortured to confession, torture increases the chance that an innocent suspect will be wrongfully convicted (i.e. a type I error). On the other hand, if investigation is of informational value, in the sense that it is more likely to find evidence against the guilty suspects during investigation, then (if the magistrate uses torture as a threat when such evidence is found) the expected cost of denying a crime will be greater for the guilty than the innocent. This reduces the chance that a guilty suspect will be wrongfully released (i.e. a type II error). The goal of the magistrate is to balance the costs of type I and type II errors, as well as the cost of torture such as the pain imposed on the suspect or the witness.

The main argument of this paper is that the attractiveness of torture is crucially determined by the precision of investigation. It will be shown that if little information is revealed during the investigation, the gain in reducing a type II error (by torture) will outweigh the cost of the increase in a type I error. In such a case, torture is the “optimal” choice for the magistrates. However, there is a non-monotonic relationship between the superiority of torture and the precision of investigation: when investigation is relatively uninformative, an improvement in technology used in the investigation can lend an advantage to torture so that torture is even more attractive to the magistrates; however, when technological progress reaches a certain threshold, the advantage of torture is weakened, so that a system solely based on evidence will overtake torture as a better system.

Based on our theory, the rise and decline of judicial torture in European history can be explained in a more satisfactory way if we look at another factor other than the humanitarian considerations or the change in the law of proof: the employment of medical science in law. Between the thirteenth and the sixteenth centuries, the trend of the application of medical knowledge in judicial cases exhibited a stable increase, while at about the same time torture was used more and more intensively and its application reached its peak in the sixteenth century. However, later developments in medical science between the seventeenth and nineteenth centuries, including fingerprinting, physical matching and the use of precise measurements of the human body structure (anthropometry), indicates that, with those technological breakthroughs, the precision of investigation had passed that threshold. Past human activity could now be specified without relying exclusively on the confessions of the guilty or witnesses, often obtained by torture in earlier times (Parker 1983, 432–3; Fullmer 1980, 27). During this period of time, the number of incidents of judicial torture began to decrease substantially in many European regions. The application of modern scientific technology had apparently reached a mature stage in the late nineteenth century when Hans Gross published the first classic in the specialty, Handbuch für Untersuchungsrichter als System der Kriminalistik, in 1893 (published in English in 1907 under the title Criminal Investigation). Thereafter, a system based on evidence finally dominated, and judicial torture was eventually abolished from the statutory law by the end of the nineteenth century in most European countries.

Our model can also explain why it may be optimal for the magistrate to torture the witness or the plaintiff. If the witness or the plaintiff is also tortured, then only those witnesses or plaintiffs who are more sure of the suspect’s crime will come to the court. Thus, the average quality of the cases entering the court will increase. This implies a lower type I error when the magistrate also tortures the suspect in the legal proceedings. If this reduction in the type I error is sufficiently large, then torturing the plaintiff or the witness is indeed the magistrate’s optimal choice. Note that this explanation does not rely on the possibility that the magistrate might be sadistic. On the contrary, this is true even if the torturee’s pain enters negatively into the magistrate’s utility function.

The insight that resources need to be invested in order to overcome informational barriers is a common theme in the theoretical economics literature. For example, workers take costly actions (mainly education) to signal their abilities (e.g. Spence 1973); firms engage in costly predation to signal their toughness against new entrants (e.g. Kreps and Wilson 1982; Milgrom and Roberts 1982) and bargainers use costly delays to signal their time–patience during bargaining (e.g. Rubinstein 1985). Our theory is related to the literature in which a firm might profitably utilize a strategy that raises the cost of all competing firms in the market (Salop 1979). In our model the magistrate uses costly torture to partially resolve the uncertainty regarding the type of the accused.

There is surprisingly scant formal modeling of torture. Wantchekon and Healy (1999) analyze torture as a game with incomplete information between the state (torturer) and the victim. The latter does not know whether the torturer is sadistic (deriving pleasure from torture) or professional (incurring cost when torturing); the former is uncertain of whether the victim is strong or weak. The victim possesses a certain information, and the state decides whether or not to torture to extract that information. They show that in equilibrium there will be a positive probability of torture, even for the professional torturer, as he tries to test the victim’s type. In our model, the magistrate is never sadistic, as he also suffers from torture. Therefore, our model does not explain torture in terms of some torturer being sadistic. Mialon et al. (2012) build up a theoretical model to analyze the effect when a government agency is allowed to torture the suspects when evidence of terrorist involvement is high. They show that allowing torture reduces the agency’s effort to counter terrorism by means other than torture. This reduces the quality of the agency’s information in deciding whether to torture, which in turn prompts them to torture even in cases where evidence is low. Their paper differs from ours in that they are mainly concerned with the possibility of a “slippery slope” in which the agency might be tempted (albeit optimally) to torture the suspects with low evidence when they are only allowed to do so in high evidence cases. In Chen et al. (2009), it is shown that even if torture is blind in distinguishing the innocent and the guilty (i.e. they have the same probability of confession under torture), as long as there is two-sided informational asymmetry regarding the degree of the magistrate’s willingness to torture and the defendant’s willingness to endure, torture can be an equilibrium outcome. ^[1]

The remainder of this article is organized as follows. Section 2 introduces the model. Section 3 discusses the equilibrium outcomes in the evidence-based system and Section 4 analyzes those in the game of torture. Section 5 compares the social welfare under these two systems and discusses how the optimality is affected by the informativeness of investigation. Section 6 extends the basic model to consider the situation where the magistrate can torture the plaintiff or witness. Section 7 reviews the historical development of torture and Section 8 concludes the paper. The details of the proofs are relegated to the Appendices.

2 The model

Suppose that an accused suspect in a certain crime case is brought to the court ruled by a magistrate. The suspect is either guilty (θ=G) or innocent (θ=I), which is his private information. The magistrate needs to reach a verdict on whether he is guilty or not. The prior belief of the magistrate that the suspect is guilty is q, with 0<q<1. That is, Prob(θ=G)=q. ^[2]

We consider two possible types of judicial system: the evidence-based system and the torture system. In the evidence-based system, the magistrate conducts an investigation, which can improve the precision of his information but not resolve the uncertainty he faces. Specifically, the magistrate draws a signal (evidence) from the set S={g,i}. If the suspect is indeed guilty, then with probability qG (resp. 1−qG) the magistrate will draw a piece of evidence s=g (resp. s=i). If the suspect is innocent, then with probability qI (resp. 1−qI) the magistrate draws g (resp. i). We assume that 1>qG>qI>0, i.e. the investigation is informative in the sense that the magistrate is more likely to draw g when he faces a guilty defendant than when he faces an innocent defendant. Let qˆ be the posterior that the defendant is guilty. The magistrate then decides whether or not to convict the suspect based on the posterior. If the defendant is convicted, he is subject to a legal penalty causing a disutility P. Otherwise, he is released, in which case his utility is 0.

Since investigation is imperfect, there are two possible errors that the magistrate can commit. First, he might wrongfully convict a defendant who is actually innocent. We call this a type I error. Second, he might wrongfully release a defendant who is actually guilty. We call this a type II error. Assume that a type I error causes a loss of L1, and that a type II error causes a loss of L2, to the magistrate’s utility. ^[3] The objective of the magistrate is to minimize the value of his expected loss, which is qˆL2 if he releases the accused and (1−qˆ)L1 if he convicts him, by choosing whether to convict the suspect after drawing the evidence.

In the torture system, a confession from the defendant is required for a conviction. In other words, the magistrate cannot convict the suspect if the suspect does not confess. For that purpose the magistrate is allowed to torture. ^[4] The defendant is given an option of whether to confess or not. If he does, the magistrate will not conduct any investigation but will convict him directly. In this case, he is subject to a legal penalty for the crime, which causes a disutility of P. If he does not, the magistrate will go through the same investigation as in the evidence-based system. Based on the evidence drawn, he will decide whether or not to torture the defendant. If he does not, then the defendant will be released, which results in a utility of 0 for the latter. If he tortures, let T be the defendant’s disutility from being tortured. In this case, his total expected disutility will be T+rP, where r is the probability that the defendant will confess when tortured. This setup can be endogenized by viewing torture as a war-of-attrition game between the magistrate and the defendant. If r is the probability that the defendant will concede before the magistrate (in the war of attrition), and T is the expected length it takes for one of the sides to concede, then the expected utility of the defendant, when he is tortured, is exactly T+rP. ^[5]

The magistrate’s (dis)utility is assumed to be p1L1+p2L2+kT, where p1 and p2 are the probabilities of committing type I and II errors, respectively. Note that when a level of torture T is imposed on the accused, it also causes a loss of kT in the magistrate’s utility. ^[6] Therefore, unlike the simple explanation that the magistrate tortures simply because he is sadistic, in our model the magistrate is known by the defendant to dislike torture. The magistrate is benevolent in that his aim is to minimize both the sum of the expected losses caused by the two types of errors and, if he tortures, the defendant’s pain due to torture. Since the magistrate’s decision will affect the defendant’s confession decision and vice versa, this is a game played between the magistrate and the defendant. The torture system is substantially more complicated than the evidence-based system, as it involves the strategic interaction of the two parties.

Before proceeding with the analysis, there are several points regarding our setup that are worth mentioning. First of all, the assumption qI<qG is crucial to our result that the torture system can sometimes be superior to the evidence-based system. Since it is more likely that a guilty evidence will be found when the defendant is guilty, the expected cost of denying a crime will be greater for the guilty than the innocent if the magistrate ever applies torture. Thus, it will be more likely that the guilty will confess when facing a threat of being tortured. By using this “screening device,” the magistrate can be more certain about the defendant’s type during the investigation if he does not confess. This will decrease the chances of a guilty defendant being wron gfully released so that the type II error will be reduced. Such an advantage in information elicitation brought about by torture is the main reason for its superiority.

Secondly, investigation takes place after the initial questioning but before torture. Given this, the optimal strategy of a suspect, if he intends to confess, is actually to deny his crime initially, and to confess immediately at precisely the moment when the magistrate draws a guilty signal and decides to torture. Our specification assumes that the magistrate can make a credible commitment in that if he decides to torture, he commits to torture by an amount T, regardless of whether the suspect confesses before or during torture. This assumption of commitment greatly simplifies the model, as there are only two consequences for a suspect who does not confess: either he is released, or he is tortured and confesses. The tradeoff facing the suspect is indeed simple: if he confesses before investigation, he suffers a disutility P; while if he does not confess, he suffers an extra pain of T but may be released with some probability. This specification is equivalent to the real world plea bargaining, in which the suspect’s penalty is reduced (by an amount of T in our specification) if he chooses to confess before investigation. ^[7]

Finally, we assume that the innocent and the guilty defendants have the same probability of confessing under torture (i.e. r is independent of the type θ). Historical evidence suggests that torture is expected to function in such a way that the accused defendant will reveal details of his crime which no innocent person possesses (and be convicted). ^[8] If the system functions perfectly, then only guilty defendants will confess under torture, as no innocent defendant can offer evidence of the crime. The assumption that torture is blind in distinguishing between the innocent and the guilty actually places a heavier burden on our proof. If it is assumed that the guilty is more likely to confess under torture (i.e. rG>rI), our argument will hold even more generally. That is, we will actually show that a system based on torture might incur a lower social loss than one based on evidence, even if torture has no power in distinguishing between the innocent and the guilty.

To sum up, the events under the evidence-based system proceed in the following order: (1) Nature determines whether the suspect is guilty or innocent (i.e. whether θ=G or θ=I). (2) The magistrate draws on evidence. (3) Based on the evidence, the magistrate decides whether to convict or release the accused. The events under the torture system proceed in the following order: (1) Nature determines whether the suspect is guilty. (2) The suspect decides whether to confess or not. If he confesses, then he is convicted and no further investigation is conducted. In this case, he is subject to a legal penalty of P. If he does not confess, then (3) the magistrate conducts an investigation by drawing on evidence. Based on the evidence, he decides whether to torture the suspect or not. If not, the suspect is released. (4) If the magistrate tortures, then after suffering a disutility of torture T, the defendant confesses with probability r.

3 The evidence-based system

Under the evidence-based system, the magistrate draws on evidence, and decides whether to convict the suspect solely based on the posterior derived from the evidence. If he draws a piece of evidence i, his posterior that the suspect is guilty will be

qi′≡q(1−qG)q(1−qG)+(1−q)(1−qI).

Similarly, if he draws g, the posterior will be

qg′≡qqGqqG+(1−q)qI.

Thus, if the evidence collected is s∈{i,g}, then the expected loss will be (1−qs′)L1 if the magistrate convicts the suspect and is qs′L2 if he releases the suspect. Whether he should convict the suspect or not depends on the evidence he draws and the relative sizes of (1−qs′)L1 and qs′L2. To make our discussion non-trivial, we make the following assumption:

A1. qGqIL2>1−qqL1> 1−qG1−qIL2.

If A1 does not hold, then regardless of the evidence drawn, either he never convicts (if the second inequality fails to hold) or he never releases a suspect (if the first inequality fails to hold). Investigation thus does not have any function. If A1 holds, then it can be easily checked that qg′L2>(1−qg′)L1 and qi′L2<(1−qi′)L1. Thus, the magistrate will convict (release) the suspect if he draws g (i). We therefore have the following lemma:

Lemma 1

Suppose that A1 holds. Then, under the evidence-based system, the magistrate convicts the suspect if and only if he draws g.

4 The game of torture

As a benchmark, we first consider the case where no investigation is conducted. Since the prior belief of the magistrate that the suspect is guilty is q, if the magistrate decides to torture the suspect, the probability of making a type I error is (1−q)r when the suspect is tortured to confess, and the probability of making a type II error is q1−r when he still denies the crime even after being tortured. On the other hand, the probability of making a type II error is q if the suspect is released. Thus, without an investigation, the magistrate’s expected loss when a suspect is tortured is r(1−q)L1+(1−r)qL2+kT and is qL2 if the defendant is released. We make the following assumption:

A2. 1−qqL1+kTrq> L2> kTr.

The first inequality means that under the prior belief q, the expected social loss from torture is greater than that from releasing the suspect. This implies that the magistrate will not torture when no investigation is conducted. If this assumption is not satisfied, since an investigation can only reduce the expected cost under torture, the magistrate will torture the suspect anyway regardless of what signal he receives during investigation, which is, however, an uninteresting case to study. The second inequality is obtained by assuming that r(1−q)L1+(1−r)qL2+kT<qL2 when q=1, which means that if the magistrate believes that the suspect is guilty with probability 1, the loss from the type II error if he is released is always greater than that from torture, so that the magistrate may want to torture the suspect. If this condition is violated, the magistrate will never torture even if he believes that the suspect is guilty with probability 1. In this case, there will never be torture at all, and no suspect will confess. This is again a less interesting situation because torture plays no role.

4.1 The optimal decision of the magistrate

Suppose that after the defendant has denied the crime, and the evidence is collected, the magistrate is considering whether to torture. Note that since the defendant’s type is his private information, his decision will be based on whether he is innocent or guilty, and this decision will in turn play a role in determining the posterior of the magistrate. Assume that the innocent defendant denies the crime with probability νI, and the guilty denies the crime with probability vG. Then the posterior of the magistrate that the suspect is guilty, when he draws g, is

qˆ(θ=G|νG,νI,g)=qνGqGqνGqG+(1−q)νIqI≡qˆg.

On the other hand, the magistrate’s posterior, when he draws i, is

qˆ(θ=G|νG,νI,i)=qνG(1−qG)qνG(1−qG)+(1−q)νI(1−qI)≡qˆi.

It can be easily seen that qˆg>qˆi. Based on this posterior, the expected loss for the magistrate is r(1−qˆs)L1+(1−r)qˆsL2+kT if he tortures the suspect, where s=i, g. If he decides not to torture and releases the suspect, the expected loss is qˆsL2. Therefore, the magistrate tortures the suspect if and only if

[1](1−qˆs)L1+kTr≤qˆsL2.

4.2 Optimal decision of the suspect

The decision of a suspect of type θ is to choose the value of νθ to maximize his expected utility, which is

−νθ(qθxg+(1−qθ)xi)(T+rP)−(1−νθ)P,

where xg and xi are the probabilities that the magistrate will torture after he draws g and i, respectively. Since qˆg>qˆi, according to eq. [1], it is obvious that xg≥xi in equilibrium. The following two lemmas are very useful for solving the equilibria later.

Lemma 2

There does not exist a separating equilibrium. That is, there does not exist an equilibrium in whichνG=0andνI=1.

Proof. Consider a separating equilibrium in which vI=1 and νG=0. Then by Bayes’ rule, it is easy to see that the posterior for the magistrate is qˆi=qˆg=0. That is, he believes that a denying defendant is innocent, regardless of the evidence drawn. Then according to eq. [1], he will not torture the denying defendant (i.e. xi=xg=0). However, if this is the case, then the guilty defendant will surely deviate from vG=0.□

Lemma 2 implies that, in any equilibrium, the pool of denying defendants (if there are any) will comprise both innocent and guilty suspects. An important consequence of this is that both types of denying defendant will face the same posterior of the magistrate, and will be tortured with the same probability. That is, as we have already mentioned, xg and xi are independent of θ.

Lemma 3

In any equilibrium, the guilty defendant will confess with a higher probability than the innocent one; i.e. νI≥νG.

Proof. If a defendant confesses before being tortured, then he is subject to a legal penalty P, regardless of his type. If he denies, then his expected utility is −[qGxg+(1−qG)xi](T+rP) if he is guilty, and −[qIxg+(1−qI)xi](T+rP) if he is innocent. Since qG>qI and xg≥xi, the consequence of denying is more serious for the guilty defendant. As a result, it must be the case that νI≥νG.□

Although simple, Lemma 3 is actually an important result. If, contrary to Lemma 3, vI=vG, then torture will not change the composition of the pool of denying defendants. That is, given any evidence drawn, the probability that the denying defendant is guilty is exactly the same as the magistrate’s prior. In this case, torture will be necessarily inferior to an evidence-based system, as the magistrate can directly convict the defendant, which results in exactly the same type I and type II errors but saves the disutility of pain of torture. The informational advantage brought about by torture can be seen clearly by observing that νI≥νG implies qˆg>qg′ and qi′>qˆi. In other words, compared with the evidence-based system, in the torture system the magistrate is more sure that the suspect is guilty when he draws g and is more sure that the suspect is innocent when he draws i.

Lemma 3 also implies that qˆg>q>qˆi. That is, the magistrate’s posterior that the defendant is guilty, after he draws g (i), is greater (smaller) than the prior. Therefore, since the magistrate will not torture under the prior belief (by A2), he will not torture when he draws i either. Thus, we have the following result:

Lemma 4

If the magistrate draws i, then the suspect is released.

Proof. By the first inequality of A2, we know that 1−qL1+kTr>qL2. Since q>qˆi, it follows that (1−qˆi)L1+kTr>qˆiL2. That is, under the posterior qˆi, the expected loss of the magistrate is greater if he tortures the suspect. This implies that the magistrate will release the suspect if he draws i.□

By Lemma 4, the magistrate will torture with a positive probability only if he draws g. As a result, xi is always 0, and we will use x as a shorthand form of xg. That is, x now denotes the probability of torture when the magistrate draws g.

4.3 The equilibrium

Given the behavior of the suspect derived in the previous section, the magistrate chooses the value of x to minimize his expected loss:

[2]minxWT(x;q,νG,νI)≡x[kT+r(1−qˆg)L1+(1−r)qˆgL2]+(1−x)qˆgL2=x(kT+r(1−qˆg)L1)+(1−rx)qˆgL2.

The first term in eq. [2] is the expected loss from torture (the pain of torture plus the cost of a type I error). The second term is the expected loss from a type II error. The equilibrium for the game of torture can then be characterized by (νI∗,νG∗,x∗) in the following proposition:

Proposition 1

If the magistrate draws i, he will release the suspect. If he draws g, then:

IfqGqI>(1−q)(L1+kT/r)q(L2−kT/r), then:
1. whenPrP+T<qG, thenνI∗=1, νG∗=ν_, andx∗=PqG(rP+T);
2. whenPrP+T=qG, thenνI∗=1, νG∗∈[ν_,1), andx∗=1;
3. whenPrP+T>qG, thenνI∗=νG∗=1, andx∗=1;
  where ν_=qI(1−q)(L1+kT/r)qGq(L2−kT/r).
IfqGqI=(1−q)(L1+kT/r)q(L2−kT/r), then:
1. whenPrP+T<qG, thenνI∗=νG∗=1, andx∗∈0,PqG(rP+T);
2. when PrP+T≥qG, then νI∗=νG∗=1, and x∗∈[0,1].
IfqGqI<(1−q)(L1+kT/r)q(L2−kT/r), thenνI∗=νG∗=1, andx∗=0.

In the case whenPrP+T≤qI, there also exists a pooling equilibrium in whichνI∗=νG∗=0, andx∗∈[PqI(rP+T),1]. This is supported by the posterior belief of the magistrate that any non-confessing suspect is guilty with probabilityq∗∈[L1+kT/rL1+L2,1].

Proof. See Appendix A.□

Broadly speaking, there are two kinds of equilibria. The first is a pooling equilibrium in which both types of defendants confess. This is supported by the magistrate’s (out-of-equilibrium) belief that a non-confessing suspect is highly likely to be guilty. Under this belief, he tortures with so high a probability that both types of suspect confess before investigation. The second, and more interesting, type of equilibrium is one in which the innocent type never confesses (νI∗=1). The magistrate’s probability of torturing, x∗, depends on the denying probability of the guilty, νG∗. The higher its value, the greater his posterior that a denying suspect is guilty and, therefore, the greater the chance that he will torture. Figure 1 plots the non-pooling equilibrium as a function of the parameters PrP+T and (1−q)(L1+kT/r)q(L2−kT/r).

Figure 1:

The non-pooling equilibrium.

The value of the vertical axis, PrP+T=1r+TP, can be viewed as the defendant’s relative disutility of penalty versus torture. The greater its value, the less likely it is that the defendant will confess. Indeed, as we move up vertically from any point in Figure 1, the value of νG increases. This is, however, also accompanied by an increase of x∗, as the posterior of a denying suspect being guilty also rises when νG increases. When νG equals 1, ^[9] the magistrate’s posterior that the suspect is guilty, after he draws g, is so high that he will torture with probability 1. This is the equilibrium outcome in region A of Figure 1. Note that, in this region, all suspects still deny with probability 1 even if they know that they will be tortured with certainty when the magistrate draws g. The reason for this is that P is so large compared to T that they are willing to take the chance of being tortured.

The value on the horizontal axis, (1−q)(L1+kT/r)q(L2−kT/r), is the expected cost of a type I error relative to that of a type II error. As its value increases, the magistrate will be less inclined to torture. This can be seen by moving from any point in Figure 1 horizontally to the right. In this case, although the value of x∗ remains fixed, the value of νG will increase, meaning that the guilty suspect becomes less likely to confess. If the value of (1−q)(L1+kT/r)q(L2−kT/r) is large enough (specifically, larger than qGqI), either because the prior q is low or because the loss of the type I error is large, the magistrate will be unwilling to torture even if he draws g. In this case, x∗=0, and the suspect will never confess. This is the equilibrium outcome in region C of Figure 1. In region B, the value of the legal penalty is so low that the guilty suspect is inclined to deny the guilt. This is, however, tempered by the high probability that the magistrate will torture (since the value of (1−q)(L1+kT/r)q(L2−kT/r) is low). In the equilibrium, he confesses with a probability greater than 0 but less than 1.

Proposition 1 also implies that the magistrate uses a more lenient rule to convict a defendant under the torture system. Recall that under the evidence-based system, the magistrate convicts the suspect if and only if he draws g. Here, it is still the case that the suspect will be released when i is drawn. However, the magistrate will not necessarily torture (and convict) the suspect even if g is drawn.

Note that of all equilibria, the guilty and the innocent act differently only in region B of the non-pooling equilibrium. That is, only in region B does the torture system force the guilty suspect to confess more readily. This indicates that the torture system can incur a lower social loss than an evidence-based system only in region B, a fact that will be proved in Section 5.

5 Evidence vs torture

The central question that the paper intends to answer is which of the two systems incurs a lower expected cost. Torture, by forcing a suspect to confess regardless of whether he is guilty or not, decreases the chance of the type II error at the expense of both the pain imposed on the suspect and an increase in the type I error. As a result, the relative merit of the two systems critically depends on two factors. The first is the relative size of the losses brought about by type I and type II errors. The second is the accuracy of the magistrate’s investigation. In this section, we will first derive a precise condition under which system it incurs a lower loss than the other. Based on this condition, we then show how their relative merit will change in response to an improvement in information revealed in the investigation. Our result is then used to explain certain aspects of the historical evolution of the judicial system.

5.1 Welfare comparison

Under the evidence-based system, with probability q(1−qG)+(1−q)(1−qI) the magistrate will draw i, in which case the defendant will be released. Since there will be no type I error, the expected loss is q(1−qG)L2. If he draws g, which occurs with probability qqG+(1−q)qI, the magistrate will convict the defendant. This results in an expected loss of (1−q)qIL1. The total expected loss under the evidence-based system is thus

[3]WE≡q(1−qG)L2+(1−q)qIL1.

The expected loss of the torture system depends on which equilibrium outcome we consider. For the pooling equilibrium, in which νI∗=νG∗=0 and x∗∈PqI(rP+T),1, the expected loss is WT=(1−q)L1. ^[10] As a result, the difference in the expected loss between the two systems is

[4]ΔW≡WT−WE=(1−q)L1−[(1−q)qIL1+q(1−qG)L2]=(1−q)(1−qI)L1−q(1−qG)L2.

As a result, ΔW≤0 if and only if

[5]1−qG1−qI≥(1−q)L1qL2.

Inequality [5], however, violates assumption A1. This means that if an investigation is of certain informational value in the sense of A1, the evidence-based system always results in a lower expected loss than the pooling equilibrium in the torture system. The reason for this is as follows. In the pooling equilibrium, the defendant always confesses, and the magistrate never needs to investigate. The torture system thus has not utilized the information that would have been revealed had there been an investigation. Given that the investigation is of informational value, the evidence-based system, as one in which the magistrate’s decision is based on evidence, is naturally superior.

For the non-pooling equilibrium, the comparison becomes much more complicated, because the equilibrium outcome depends on parametric configurations. In general, the expected loss under the torture system is

[6]WT≡(1−q)[(1−νI∗)+νI∗qIx∗r]L1+qνG∗[(1−qG)+qG(1−x∗+x∗(1−r))]L2+[νI∗(1−q)qI+νG∗qqG]x∗kT=(1−q)qIx∗rL1+qνG∗(1−qGx∗r)L2+[(1−q)qI+νG∗qqG]x∗kT,

where the equality comes from the fact that νI∗ is always 1 in the non-pooling equilibrium.

In region A of Figure 1 where νI∗=νG∗=x∗=1, WT reduces to (1−q)qIrL1+q(1−qGr)L2+[(1−q)qI+qqG]kT. We then have

[7]ΔW=(1−r)[qqGL2−(1−q)qIL]+[(1−q)qI+qqG]kT>0.

The first term is positive because of assumption A1. Therefore, if the suspect always denies and the magistrate always tortures when he draws g under the torture system, the evidence-based system is superior. The reason for this is as follows. Under the equilibrium in region A, the magistrate will torture a suspect if and only if he draws g. With probability r, the suspect confesses after being tortured, so that the outcomes are exactly the same in these two systems. However, the torture system is inferior by the magnitude of the expected cost of torture. With probability 1−r, the suspect still denies after being tortured, and is released so that torture is a waste. As long as the investigation is informative, the evidence-based system is still superior.

In region C of Figure 1 where νI∗=νG∗=1 and x∗=0, WT reduces to qL2. Thus,

[8]ΔW=qL2−(1−q)qIL1−q(1−qG)L2=qqGL2−(1−q)qIL1.

Therefore, ΔW>0 if and only if

[9](1−q)L1qL2<qGqI,

which always holds by A1. This implies that if the suspect always denies and the magistrate never tortures under the torture system, the evidence-based system incurs a lower social loss. The reason for this is actually very simple. In the case when νI∗=νG∗=1 and x∗=0, a system based on torture actually does nothing. Suspects are routinely summoned to the court and released. Given that the investigation is of informational value (i.e. A1), a system that convicts a suspect based on evidence is naturally superior.

All the three cases above share a common feature: Information that is available through investigation is not utilized in the torture system in an efficient way. Either all types of suspect confess, so that investigation is not needed, or all who deny are released, or the magistrate convicts the suspect in exactly the same fashion as the evidence-based system. Since information offered by investigation is valuable, and since an evidence-based system does utilize this piece of information, the torture system always incurs a greater loss. To summarize, if the magistrate’s decision is not altered by the outcomes of the investigation in a torture system, or the criterion to convict a suspect is the same as in the evidence-based system, a torture system is inferior.

The more interesting case occurs when the equilibrium outcome falls in region B of Figure 1, i.e. when the evidence collected will affect the magistrate’s decision in a way different from the evidence-based system. In this case, the difference in the expected loss is

[10]ΔW=−(1−q)(1−x*r)qIL1+q[ν_(1−qGx*r)−(1−qG)]L2+[(1−q)qI+ν_qqG]x*kT =(1−q)qIx*r(L1+kT/r)−qqGx*rν_(L2−kT/r)−(1−q)qIL1−q(1−qG−ν_)L2 =−[(1−q)qIL1+q(1−qG−ν_)L2],

where the last equality in eq. [10] comes from the fact that the first two terms after the second equality sum to 0. Note that eq. [10] is an increasing function of ν_. Moreover, ΔW<0 when ν_=0 and ΔW>0 when ν_=1. Therefore, there exists a ν0≡(1−q)qIL1qL2+1−qG∈(0,1) such that ΔW>0 if and only if ν_>ν0. However, since ν_=qIqG(1−q)(L1+kT/r)q(L2−kT/r), we know that ΔW>0 if and only if

[11](1−q)(L1+kT/r)q(L2−kT/r)≥ν0qGqI.

Figure 2:

Expected loss comparison between two systems.

We can now summarize the results of our comparison in Figure 2. The curve (1−q)(L1+kT/r)q(L2−kT/r)=qGqIν0 partitions region B in Figure 1 into two parts, B1 and B2. Configurations lying in B1 (B2) result in equilibria having a lower (higher) expected loss in the torture system than the evidence-based system. Therefore, among all possible equilibria, the torture system is superior only when the configurations of the parameters lie in region B1, in which the values of both PrP+T and (1−q)(L1+kT/r)q(L2−kT/r) are lower. In other words, a torture system is superior only when the relative disutility of legal penalty to torture is low, and the expected cost of a type I error relative to that of a type II error is low. Recall that torture imposes two kinds of cost on the society. One is the pain of torture and the other is the increase in the type I error. Thus, only when the type II error imposes a serious cost in the magistrate’s eyes, and the legal penalty is not great enough to be deterring, can the magistrate be justified in resorting to torture. However, if one of the conditions is not met, then torture is more costly, and a system based solely on evidence will be better.

5.2 Improvement in informativeness of investigations

In this section, we investigate how the behavior of the suspect and the magistrate will change in response to an improvement in the information revealed during the investigation. As can be seen in Proposition 1, an important parameter that influences the behavior of both the magistrate and the suspect is qG/qI. Since qG(resp.1−qI) is the probability that the magistrate will draw g(resp.i) when the defendant is guilty (resp. innocent), an increase in qG or a decrease in qI, (and therefore an increase in qG/qI) implies that more precise information is released during the investigation. The value of qG/qI is therefore a measure of the informativeness of the investigation. We will say that the investigation becomes more informative if qG increases or qI decreases, with at least one strictly so.

We will show an interesting result of a non-monotonic relationship between the informativeness of investigation (in terms of qG/qI) and the attractiveness of the torture system (the occurrence of ΔW<0): when little information is revealed during investigation, the torture system performs even better when information is improved; however, as investigation becomes sufficiently informative (i.e. qG/qI has passed some threshold), then further improvement in the information will begin to lend an advantage to the evidence-based system. Eventually, the torture system is dominated.

The reasoning of this result is the following. When the investigation becomes more informative, the outcomes in both systems will be affected. In the evidence-based system, both inequalities in assumption A1 become more likely to hold. Consequently, the magistrate is more likely to release a suspect when he draws i, and more likely to convict him when he draws g. Moreover, the expected losses from a type I error (when he convicts a suspect) and a type II error (when he releases a suspect) both decrease. This can be easily seen from eq. [3]: As the investigation becomes more informative, WE will decrease. In particular, when qG approaches 1 and qI approaches 0, the social loss of the evidence-based system approaches 0.

In the torture system, both qˆg and qˆi will increase as the investigation becomes more informative. Furthermore, the line (1−q)(L1+kT/r)q(L2−kT/r)=qGqI will shift to the right. In this case, region C will retract, and regions A and B will expand. Therefore, the magistrate will be more likely to torture on the one hand, and the guilty suspect will be more likely to confess on the other. Note that as qG approaches 1 and qI approaches 0, ν_ will approach 0 and the probability that the magistrate will torture, x∗, will approach PrP+T. This resembles a separating equilibrium. However, unlike the evidence-based system, what separates the innocent and the guilty is not the evidence, but the fact that the innocent always denies and the guilty always confesses. This relies on the magistrate’s commitment that his probability to torture is bounded away from 0.

It can be shown that qGqIν0 is a decreasing function of qI, meaning that as qI becomes smaller, the dividing curve between B1 and B2 will shift to the right. In other words, the torture system will gain its advantage as qI decreases. Moreover, qGqIν0 is increasing (decreasing) in qG if

[12]qG<(>)qI1−qqL1L2+12.

That is, as the value of qG increases from a small value, qGqIν0 will first increase, and then decrease as the value of qG passes the threshold value on the right-hand side of eq. [12]. Consequently, an increase in qG will favor the torture system when qG is small, and favor the evidence-based system when it is large. Note that the smaller the value of qI, the more likely it is that qG is greater than the right-hand side of eq. [12]. Therefore, at the same time the magistrate becomes more competent in identifying the innocent suspect, an increase in qG is more likely to lend advantage to the evidence-based system. In particular, when qG is sufficiently large, then ΔW≥0 regardless of the value of qI. This can be seen from the following fact: if qG is sufficiently close to 1, then ν0 is close to (1−q)qIL1qL2, so that it is always the case that (1−q)(L1+kT/r)q(L2−kT/r)≥ν0qGqI. In other words, as the magistrate’s ability to identify a guilty suspect through investigation passes a certain threshold, there will be no more room for torture: a system based on evidence is always less costly. When this is true, the evidence-based system will incur a lower social cost in all possible equilibria, and will be unambiguously better and eventually guarantee its domination.

In Section 7, we review the historical development of judicial torture, which appears to be consistent with our theoretical results. First note that although medical expertise had started to be used in judicial cases since the thirteenth century (so that the precision of investigation had improved), there was actually an increase in the use of torture. This is consistent with our prediction that an improvement in the technology of investigation lends advantage to torture, when the investigation is relatively uninformative. Only after the sixteenth century (the beginning of the use of modern forensic science) did the practice of torture start to decline. This is also consistent with our prediction that as the technology of investigation becomes sufficiently advanced, an increase in informativeness will give an advantage to an evidence-based system. Finally, at the end of nineteenth century, when more advanced testing methods (like fingerprints and blood type identification) started to be accepted in court and when the systematic use of scientific investigation for legal use became common, a system based on evidence dominated thereafter.

5.3 An illustrative example

In this section, we use a simple example to demonstrate the non-monotonicity result found in the previous section. Consider the symmetric case where q=1/2 and L1=L2=L. That is, it is believed that the suspect is guilty or innocent with equal probability a priori, and the costs of type I and type II errors are equally important to the magistrate.

We have shown that the torture system is worse than the evidence-based system in the pooling equilibrium and in regions A and C of the non-pooling equilibrium. We only need to discuss region B of the non-pooling equilibrium where the torture system can possibly win. Based on eq. [10], we have

ΔW=−L2(qI+1−qG−ν_)=−L2qI+1−qG−qIqG⋅L+kT/rL−kT/r.

Let ℓ=L+kT/rL−kT/r. Therefore, ΔW<0 if and only if

[13]qG+qGqI−(qG)2qI≡b(qI,qG)>ℓ.

ℓ represents the relative cost between torturing and not torturing the suspect. Obviously, ℓ>1. On the other hand, bqI,qG represents the relative benefit of information elicitation by confession via torturing the suspect. It is easy to see that: (1) b(qI,qG)>1 since qI<qG<1, and b(qI,qG) is concave in qG; (2) b(qI,qG)→1 when qG→qI and qG→1 and (3) b(qI,qG) reaches its peak when qG=1+qI2. The relationship between the net benefit of torture, b−ℓ, and the level of qG (given qI) is drawn in Figure 3.

Figure 3:

The net benefit of torture.

As shown in Figure 3, when qG approaches qI, b−ℓ<0, which means that the torture system loses to the evidence-based system. This is because investigation does not reveal too much information, so that it is very likely that the guilty suspect will deny the crime (i.e. ν_ is close to 1). In this case, torture does not bring much advantage but causes a social cost. On the other hand, when qG approaches 1, again, b−ℓ<0, because investigation has already done a good job in separating the guilty from the innocent, so that torture cannot outperform the evidence-based system. It is when qG is at some intermediate level that the torture system can become better than the evidence-based system. The relationship between b−ℓ and qG is then hump shaped: when qG is small, an increase in qG will even improve the benefit from using torture; however, once it passes the threshold 1+qI2, torture becomes less attractive as qG becomes higher, and eventually becomes inferior to the evidence-based system.

6 Torturing plaintiffs or witnesses

Judicial torture in pre-modern societies was legitimately applied almost universally to suspects, witnesses and plaintiffs for various reasons before the mid-nineteenth century. The accuser or the plaintiff would undergo an ordeal to substantiate his veracity or be tortured when he was unable to make good his accusation. In Roman law, the accuser could be exposed to the lex talionis (law of retaliation) in case he failed to prove the justice of the charge (Lea 1971, 333). In primitive Russian laws, the accuser was obliged to undergo the ordeal of a red-hot iron if he could not substantiate his case with witnesses. Archbishop Hincmar of Rheims of the ninth century required that cases of complaints against priests be supported by seven witnesses, of whom one must be tortured to prove the truth of his companions’ oath, as a wholesome check upon perjury and subornation (Lea 1971, 290).

The same thing could be observed in China. The legal code of the Qing dynasty clearly allowed torture to be used not only on the accused, but also on “secondary suspects and to witnesses as well as to principals” (Bodde and Morris 1967, 97). In fact, the tradition could be traced back to the Tang code of ad 653 that constituted the basis for all subsequent imperial law codes. According to Tang law, if the accused insisted on his innocence even after having suffered the maximum amount of torture allowed by the law, the plaintiff would in turn be tortured (Shen 1985, 511). Moreover, since at least the Tang dynasty, a plaintiff who bypassed the immediate authority and presented his complaints to higher administrative levels was required to undergo torture before the examination of the case (Xue 1998, 636–9), again as a check upon perjury and subornation, or as a measure to discourage such actions. During the early years of the Ming dynasty (1368–1644), frequent complaints brought directly to the capital prompted the government to implement an extremely severe punishment – banishment to the frontier – on plaintiffs, even at times on those who could substantiate their cases, to control the number of such acts (Shen 1985, 1142).

In this section, we analyze the puzzling phenomenon that the plaintiffs or the witnesses are tortured in the judicial procedure. Since the basic logic behind our argument is the same for torturing the plaintiffs and the witnesses, we will consider only the former. Our basic model in Section 2 is modified in the following way. Suppose that, before the magistrate accepts a case, he can torture the plaintiff, ostensibly to “verify” or “test” whether the plaintiff is telling the truth. Once tortured, the disutility for the plaintiff is Tp, and it also yields a loss to the magistrate, kpTp. The events then follow as described in the model mentioned before.

In the previous sections, we have assumed that the prior of the magistrate that the suspect is guilty is q. Here, in order to discuss how torture changes the plaintiff’s incentives to come to the court, and thus the magistrate’s prior, we assume that q is determined by the cases that plaintiffs bring into the court. Specifically, suppose that every plaintiff has a belief regarding how likely the defendant is to be guilty, represented by a probability ρ. The distribution of plaintiffs’ beliefs is given by the density function f(ρ). Hence, the prior of the magistrate that a defendant is guilty, if all plaintiffs go to the court, will be ∫01ρf(ρ)dρ. Thus, what we call the prior of the magistrate in the previous sections is actually the mean value of the beliefs of the plaintiffs who actually enter the court.

Assume that a plaintiff obtains a utility Y>0 when a guilty defendant is convicted. If the guilty suspect is wrongfully released, or the innocent defendant is wrongfully convicted, the plaintiff is assumed to obtain a negative utility −y, y>0. The plaintiff’s utility is 0 if an innocent suspect is released. Therefore, given the probability that the magistrate will torture the plaintiff, φ, the expected utility for the plaintiff with a belief ρ, when he comes to the court, is

[14]ρ[(1−νG∗)+νG∗qGx∗r]Y−ρνG∗(1−qGx∗r)y−(1−ρ)[(1−νI∗)+νI∗qIx∗r]y−φTp.

The first term in eq. [14] is the expected utility that a guilty suspect is convicted. The second and third terms are the expected losses from type II and type I errors, respectively. The fourth term is the expected cost of torture. If, on the other hand, the plaintiff decides not to come into the court, his expected payoff is simply −ρy.

To see how torturing the plaintiff affects the equilibrium of the torture game, we again need to discuss the two types of equilibrium in Proposition 1 separately, i.e. the pooling and the non-pooling equilibria. The details of the analysis are relegated to Appendix B. The key driving force is that, by torturing the plaintiff, the incentive of the plaintiff to come to the court is reduced. Those plaintiffs who are less sure of the defendants’ guilt are deterred. Consequently, the average quality of cases brought to the court (which is represented by q(φ)=∫ρ(φ)1ρf(ρ)1−F(ρ(φ))dρ) is increased in the sense that the suspect has a higher probability of being guilty.

The main intuition for the result that torturing the plaintiff or witness can sometimes be optimal is the following. In the pooling equilibrium, both types of suspect confess. Thus, the expected loss under torture is WT(φ)=(1−q(φ))L1+φkpTp. If the magistrate tortures the plaintiff by increasing the value of φ, q(φ) will be increased so that the expected cost related to the type I error will be reduced. Therefore, when L1 is large enough, the benefit of raising the quality of cases entering into court will outweigh its cost kpTp, so that torturing the plaintiff can be optimal.

In the non-pooling equilibrium, the expected loss is WT(φ)=(1−q(φ))qIx∗rL1+q(φ)νG∗(1−x∗qGr)L2+[(1−q(φ))qI+q(φ)νG∗qG]x∗kT+φkpTp. If the magistrate never tortures the suspect (i.e. x∗=0, or region C in Figure 1), the expected loss will be qL2+φkpTp. Obviously, it is optimal to choose φ∗=0. This implies that if the magistrate decides to torture the plaintiff or the witness, he will torture the suspect as well. On the other hand, if the magistrate decides to torture the suspect with a positive probability (i.e. x∗>0, or regions A and B in Figure 1), by torturing the plaintiff which further raises q, the cost of the type II error may even be reduced since it would be more likely for the guilty suspect to confess (i.e. ∂νG∗∂q≤0). The magistrate thus trades off the loss resulting from torturing the plaintiff with the benefit from reducing both the type I and type II errors. If the benefit surpasses the loss, the magistrate will torture the plaintiff.

We also find that when either Y or y is larger, which means that the plaintiff has a stronger sense of “justice” in that he derives (suffers) a greater utility (disutility) in having a guilty (innocent) suspect convicted, then torture will be less likely to deter him from coming into court. Thus, the screening function of torture is smaller, so that it will be less likely to be used.

More importantly, when the investigation is less informative (i.e. qG/qI is smaller), the screening effect of torture will be stronger. In this case, torturing the plaintiff will occur more frequently. This means that there is a negative relationship between the informativeness of the investigation and the tendency to torture the plaintiff. Moreover, since the magistrate is also more likely to torture the suspect when qG/qI is relatively low, the two practices (torturing the suspect and torturing the plaintiff or the witness) generally go hand in hand.

7 The historical evolution of torture

In this section, we would like to review the historical evolution of torture and show that it is consistent with our theoretical predictions.

European countries do not have exactly the same history of judicial torture, as each has a different judicial tradition and religious culture that greatly affects legal application. However, from the specific cases of England and France, one may construct a picture reflecting the general history of judicial torture in Europe: the application of judicial torture began to soar from around the thirteenth century ^[11] onward and peaked in the sixteenth century. It declined thereafter and was nominally abolished in most European countries toward the end of the eighteenth and early nineteenth centuries. In England, “references to its use (use of judicial torture) in the earlier years are remarkably scanty, but during the fifteenth, sixteenth and seventeenth centuries, the evidence is abundant...(J)udicial torture reached its greatest ecumenicity in the reign of Elizabeth (1533–1603)” (Scott 2003, 88–9). In France, in a southern town such as Toulouse where the tradition of Roman law was strong, judicial torture “was first employed... during the 13th century by the town consuls,” and it “reflected a new methodology and a new epistemology” (Silverman 2001, 5–7). The decline in the actual use of judicial torture in France in general seemed to begin from the mid-sixteenth century to the seventeenth century, depending on the region, and lasted until its nominal abolition in 1788 (Silverman 2001, 18). The pattern corresponds roughly to the one observed in England. By 1850, the movement for the abolition of torture swept most parts of Europe. Furthermore, “by the closing decades of the 19th century it was widely thought that torture was a barbaric practice that belonged to history” (Evans and Morgan 1998, 12–13). Although recent scholarship still casts doubt on the real disappearance of judicial torture in Europe by the end of the nineteenth century, as evidence suggests the persistence of the illegal use of judicial torture in a democratic country like the United States even in the 1920s (Evans and Morgan 1998, 14), such evidence shows that torture was no longer the norm. The decline of judicial torture was therefore a continuous process beginning from the sixteenth century up to the early twentieth century. It was legally and ideologically condemned by the end of the eighteenth century, and, in practice, it may have persisted up to the turn of the twentieth century.

The decline of judicial torture in modern Europe has been the subject of a number of scholarly works in the past few decades. The conventional historical account of the abolition movement placed great weight on the philosophic goals of the Enlightenment, which is considered to be the driving force behind the abolition of torture. ^[12] Langbein proposes a more specific albeit controversial explanation based on changes within the penological history itself. He suggests that judicial torture became increasingly useless when a new law of proof emerged around the sixteenth century that no longer required a strict standard of proof, accompanied by less rigorous punishments, a development he calls the “evidentiary revolution” (Langbein 1977, 1983). Langbein’s thesis was criticized by another law specialist M. Damaška, especially on the significance of the “evidentiary revolution” in the sixteenth century. For Damaška, judicial torture still had a raison d’être in Europe during and after the sixteenth century (Damaška 1978).

More recently, historian Lisa Silverman has proposed another explanation for the final abolition of judicial torture in eighteenth-century France, partially rehabilitating the humanist theory rejected by Langbein. For Silverman, there was a “dramatic paradigm shift” in the way abolitionists of the Enlightenment understood the relation between judicial torture (pain), truth and the body. Bodily pain was no longer believed to be able to produce truth (Silverman 2001). One reason Silverman does not find satisfaction in Langbein’s theory is that long after the “evidentiary revolution,” judicial torture was still practiced in many parts of Europe, a point also raised in Damaška’s review.

This historical evolution of judicial torture in Europe corresponds meaningfully with our model if we look at another factor that might explain in a more satisfactory way the rise and decline of judicial torture: the employment of medical science in law. There are indications of the application of medical expertise in judicial cases in northern Europe from the tenth century onward, and it was explicitly mentioned in Norman law from the early thirteenth century onward. Between the thirteenth and the sixteenth centuries, the trend exhibited a stable increase until the publication in 1562 of the first judicial postmortem in France by Ambroise Paré, although the conventional wisdom is that the first comprehensive work on forensic medicine, De relationibus medicorum libri quatuor, was published by Fortunato Fedele (1550–1630) in 1602. An increasing number of works of the kind were published throughout the seventeenth and eighteenth centuries with the first serial devoted to forensic medicine published in 1755. These early European publications on forensic medicine indicate that the practice of judicial forensic medicine was already common throughout the sixteenth century, if not earlier. Later developments in the science of fingerprinting (from the late seventeenth century), physical matching (late eighteenth century), anthropometry (late nineteenth century) and so on were the intensification of the same trend. More significantly, the systematic application of modern scientific technology in criminal investigations began in the last decades of the nineteenth century. Most notable was the new classification system created by E. Henry in 1896 enabling fingerprints to be easily filed, searched and traced. It was quickly used worldwide and is still used today. ^[13] Henry himself was appointed assistant administrator of Scotland Yard in 1901 in charge of the Criminal Investigation Department, and forced the adoption of fingerprint identification. ^[14] New York State in the United States followed in 1903. ^[15] Most European countries established crime detection laboratories as government or university units in the first quarter of the twentieth century (Parker 1983, 432). In other words, there were two critical points in the development of forensic medicine in criminal investigation: first, the sixteenth century marked the beginning of modern forensic medicine for legal uses (and the “evidentiary revolution”). Second, the end of the nineteenth century marked a paradigm shift in such technology and its systematic application in criminal investigation (Nemec 1968, 5–15; Saferstein 2001, 3; Parker 1983, 430). At the same time, the sixteenth century represents the peak of the legal employment of torture in most parts of Europe. The decline, abolition and the ultimate disappearance of judicial torture in democratic countries would take more than 300 years.

8 Conclusion

This paper applies the economic theory of information to analyze the nature and consequences of judicial torture. The theory also suggests an explanation for the rise and fall of torture in Europe. It is shown that if the magistrate aims to balance type I and type II errors in judgment, and if during the investigation little information is revealed, then a system based on torture functions better than one based on evidence. This mainly comes from the ability of torture to force the guilty defendant to confess with a higher probability than the innocent. It is also shown that the information advantage of a torture system will be more pronounced as the precision of investigation rises from a low level. However, as its precision passes a certain threshold, the advantage of torture decreases relative to a system based on evidence. This may explain an increase in the practice of torture before the sixteenth century on the one hand, and a transition from a torture system to an evidence-based system starting from the mid-eighteenth century in Europe on the other. Furthermore, torturing the plaintiffs might also be an equilibrium outcome, as it helps to screen the cases so that only those with greater merits enter the court. Again, this is more likely to occur when the quality of information revealed during the investigation is low.

There have been two major theories to explain the abolition of torture, one based on humanist concern and the other based on legal change. In this paper we propose an alternative theory based on the progress of technology. We hasten to add that our theory does not mean to replace, or even compete with, the existing explanations. All three factors might contribute to the decline of torture, and in this sense they are complementary to each other. In fact, the humanist theory and the legal theory can both comfortably fit into our framework. The reason why there had been a change in the rule of proof might be precisely because the improvement in the technology of evidence collection has been such that a system based solely on evidence was found to reach more precise verdicts. On the other hand, a change in the attitude of the general public toward torture is equivalent to an increase of L1 in our model, which shifts the curve dividing B1 and B2 to the left, and therefore increases the cost of torture. From this viewpoint, the contribution of this paper might be seen as providing a unifying framework in which the humanist, legal, and technology theories are all important components in explaining the rise and fall of judicial torture.

The paper’s results also have some political implication. Western countries condemn regimes that use torture. To reduce the prevalence of torture in those less-developed countries, the developed countries can provide forensic technology in the form of “judicial development aid,” instead of intervention or sanctions. However, given the non-monotonic result of torture’s effectiveness, we suggest that it is important for the technology to be advanced enough to avoid its adverse effect. ^[16]

Appendix A: Proof of Proposition 1

First note that an innocent suspect will confess before being tortured with a positive probability only if P≤qI(rP+T)x, i.e. the loss of confessing is smaller than that of denying. The similar condition for a guilty suspect is P≤qG(rP+T)x. Also, given the posterior qˆg, the magistrate has a positive probability to torture only if qˆgL2≥(1−qˆg)L1+kTr, or, equivalently, qGqI≥(1−q)νI(L1+kT/r)qνG(L2−kT/r).

We first focus on the equilibrium in which νI∗>0.

Suppose qGqI>(1−q)(L1+kT/r)q(L2−kT/r):

First of all, we only have to consider the case x>0 in equilibrium. The reason is as follows. If x=0, then νI=νG=1, because it is always the case where P>qI(rP+T)x and P>qG(rP+T)x. However, if this were the case, then x=1 due to the assumption qGqI>(1−q)(L1+kT/r)q(L2−kT/r), which is a contradiction. Secondly, if in equilibrium, x∈(0,1), then it must be the case that νI=1 and νG∈(0,1). To see this, recall that we have already shown that νI≥νG. If νI=νG=1, then again it must be the case that x=1, which violates the assumption x∈(0,1). If νI=1, or νI∈(0,1) and νG=0, then x=0, which is also a contradiction.

There are three further cases to consider: (1) If PrP+T<qG, it is not possible that x=1 in equilibrium. The reason is the following. If x=1, since P<qG(rP+T)x, νG=0. It follows that x=0 as long as νI>0, which is a contradiction. Thus, x∈(0,1). Since the magistrate is using a mixed strategy, it must be the case that he is indifferent between torturing and releasing the suspect. That is, it must be the case that qGqI=(1−q)νI(L1+kT/r)qνG(L2−kT/r). As we have already shown, x∈(0,1) also implies νG∈(0,1). Again, the guilty suspect is indifferent between confessing and denying, so P=qG(rP+T)x. Thus, in equilibrium, νI∗=1, νG∗=qI(1−q)(L1+kT/r)qGq(L2−kT/r)≡ν_ and x∗=PqG(rP+T). (2) If qG=PrP+T>qI, then P=qG(rP+T) and P>qI(rP+T). The inequality above implies that νI=1. If x=1, then νG∈(0,1). Since by assumption qGqI>(1−q)(L1+kT/r)qνG(L2−kT/r), it must be that vG>v_. If x∈(0,1), then νG=1, which will result in x=1, a contradiction. Therefore, the equilibrium is νI∗=1, νG∗∈(ν_,1), and x∗=1. (3) If PrP+T>qG, then both types of suspect prefer to deny for any x, so νI=νG=1. It follows that x∗=1.

Suppose qGqI=(1−q)(L1+kT/r)q(L2−kT/r):

First, if x=0, then it must be that νG=νI=1. The fact that qGqI=(1−q)(L1+kT/r)q(L2−kT/r) thus implies that the magistrate is indifferent between torturing and releasing the suspect. That is, x∗=0 and νI∗=νG∗=1 constitute an equilibrium. Next, if x>0, then it must be the case that to torture is at least as good as to release for the magistrate, which means qGqI≥(1−q)νI(L1+kT/r)qνG(L2−kT/r). Since νI≥νG, this is possible only if νI=νG. However, if νI=νG<1, then it must be that P≤qI(rP+T)x and P<qG(rP+T)x (note that qG>qI). This implies that νG=0 and thus, νI=0. This violates the case that we set out to consider (νI∗>0). This means that if x>0, νG=νI=1. There are thus two cases to consider. (1) Let PrP+T≥qG. Then P≥qG(rP+T)x and P>qI(rP+T)x for any x∈[0,1]. Consequently, x∗∈[0,1] and νI∗=νG∗=1 constitutes an equilibrium. (2) Let PrP+T<qG, then in order for νI∗=νG∗=1 to hold, it must be the case that P≥qG(rP+T)x, i.e. x∈[0,PqG(rP+T)]. In other words, νI∗=νG∗=1 and x∗∈[0,PqG(rP+T)] constitutes an equilibrium.

Suppose qGqI<(1−q)(L1+kT/r)q(L2−kT/r):

In this case, since νI≥νG, it must be the case that qGqI<(1−q)νI(L1+kT/r)qνG(L2−kT/r). Therefore, x∗=0, which implies that νI∗=νG∗=1.

So far we have assumed that νI>0. Suppose now νI=0. Then, since νI≥νG, it must be the case that νI=νG=0. In order that neither type of suspect deviates, it is required that P≤qI(rP+T)x and P≤qG(rP+T)x. Since x≤1, this is possible, however, only if qI≥PrP+T. Moreover, it must be that x∈PqI(rP+T),1. Since νI=νG=0, if any suspect denies, the magistrate must form an out-of-equilibrium belief q∗. The value of q∗ can be arbitrary under the sequential equilibrium requirement. However, since the social cost of torture under belief q∗ is r(1−q∗)L1+(1−r)q∗L2+kT, and the social cost of acquittal is q∗L2, to support x∈PqI(rP+T),1 as an equilibrium, it must be (1−q∗)L1+kTr≤q∗L2. In other words, q∗∈L1+kT/rL1+L2,1.

Appendix B: Torturing the plaintiff or the witness

We consider the two types of equilibrium in Proposition 1 separately as follows:

Pooling equilibrium:

In this case, νI∗=νG∗=0 and x∗=1, and eq. [14] becomes ρY−(1−ρ)y−φTp. Therefore, a plaintiff with belief ρ will come into the court if and only if ρY−(1−ρ)y−φTp≥−ρy. Note that a plaintiff with ρ=0 will never go to court, as it implies a lower utility regardless of the value of φ. For a plaintiff with ρ=1, whether he will go to court depends on the value of Y−φTp versus −y. We will make the assumption that Y+y>Tp. ^[17] In this case, there exists a ρ(φ)∈[0,1] so that a plaintiff with belief ρ≥(<)ρ(φ) will (will not) come into the court, where

[15]ρ(φ)=φTp+yY+2y.

Given that only those plaintiffs whose beliefs are greater than ρ(φ) will go to court, the belief of the magistrate that the suspect is guilty is then

[16]q(φ)=∫ρ(φ)1ρf(ρ)1−F(ρ(φ))dρ,

where F(⋅) is the distribution function of ρ. Note that in our previous setup, the event starts at the time when a case has already entered into the court. We will interpret q(φ) as the prior of the magistrate. Since in our previous setup the magistrate does not torture the plaintiff, it corresponds to the case φ=0. Consequently, q(0)=q, that is, the prior q in the previous sections is actually q(0) in the current setup.

After the plaintiff brings the case into court, the magistrate chooses an optimal probability of torturing the plaintiff, φ∗, to minimize the expected loss:

[17]minφWT(φ)=(1−q(φ))L1+φkpTp.

A full characterization of φ∗ is difficult. This is because a change in φ will affect the magistrate’s and suspect’s behavior during the investigation in a complicated way that is hard to capture in a simple fashion. However, we can find a simple sufficient condition under which the magistrate will torture the plaintiff with a positive probability. Differentiating WT with respect to φ, evaluated at the point where φ=0, we have

∂WT∂φ|φ=0=kpTp−L1∂q(φ)∂φ|φ=0=kpTp−L1f(ρ(0))(q−ρ(0))ρ′(0)1−F(ρ(0)).

According to eq. [15], ρ′(0)=TpY+2y. Thus, ∂WT∂φ|φ=0<0 if

[18]L1>kp(Y+2y)[1−F(ρ(0))]f(ρ(0))(q−ρ(0))=(Y+2y)kpH,

where H≡1−F(ρ(0))f(ρ(0))(q−ρ(0)). This means that if L1 is sufficiently large or kp is sufficiently small, increasing the value of φ slightly from 0 will decrease the value of WT. Consequently, at the optimum the magistrate will torture the plaintiff with a strictly positive probability.

Inequality [18] has a clear intuition. Recall that in the pooling equilibrium all types of suspect confess, so that only the type I error is committed. If the cost of the type I error is large enough, then the benefit in terms of raising the quality of cases entering into court (by torturing the plaintiff) outweighs its cost.

Also note that eq. [18] is less likely to hold when either Y or y is large. This is again an intuitive result. If the plaintiff has a strong sense of “justice” in that he derives (suffers) a great utility (disutility) in having a guilty (innocent) suspect convicted, torture will be less likely to deter him from coming into court. The screening function of torture is thus small, and will be less likely to be used.

Non-pooling equilibrium:

In this case, eq. [14] becomes

[19]ρ[(1−νG∗)+νG∗qGx∗r]Y−ρνG∗(1−qGx∗r)y−(1−ρ)qIx∗ry−φTp.

As a result, he will come to the court only if ρ≥ρ(φ), where

[20]ρ(φ)=φTp+qIx∗ry[1−νG∗+νG∗qGx∗r](Y+y)+qIx∗ry.

We can define the magistrate’s prior q(φ) in a way similar to eq. [15]. Again, we assume that q(0)=q to facilitate comparison with the case when the plaintiff is not tortured. The magistrate chooses an optimal probability of torture, φ∗, to minimize the expected loss in this equilibrium:

[21]minφWT(φ)=(1−q(φ))qIx∗rL1+q(φ)νG∗(1−x∗qGr)L2+[(1−q(φ))qI+q(φ)νG∗qG]x∗kT+φkpTp.

In the case where νI∗=νG∗=1 and x∗=0 (i.e. region C in Figure 1), the expected payoff of the plaintiff when he comes to the court is −ρy−φTp. As the expected payoff of a plaintiff is −ρy if he does not come to court, we know that once φ>0, no plaintiff will be coming to the court. Assuming that the magistrate’s prior is q in this case, the expected loss is qL2+φkpTp. Obviously, the value of φ that minimizes the social welfare loss is 0. Consequently, φ∗=0. We thus have the following result:

Lemma 5

In the equilibrium in which the defendant never confesses and the magistrate never tortures the suspect, the magistrate will not torture the plaintiff either.

In region A of Figure 1 where x∗=νG∗=νI∗=1, we have

[22]∂WT∂φ|φ=0=kpTp+[(1−qGr)L2−qIrL1+(qG−qI)kT] ∂q(φ)∂φ|φ=0,

which is negative if ^[18]

[23]L1+kTr>qGqI(Y+y)+ykpH+L2rqI−qGqIL2−kTr.

Since qGqI>(1−q)(L1+kT/r)q(L2−kT/r) in region A, a sufficient condition for eq. [23] to hold is

[24]L1+kTr>qqGqI(Y+y)+ykpH+qrqIL2.

The important thing to note is that eq. [24] is more likely to hold if qG/qI is small. That is, torturing the plaintiff is more likely to occur when the investigation is less informative.

In region B of Figure 1 where νG∗<1 and x∗<1,

[25]∂WT∂φφ=0|=kpTp+{TrP+Tν_L2−qIPqG(rP+T)(rL1+kT)+PrP+Tν_kT}∂q(φ)∂φ|φ=0+q(φ)[(1−x*qGr)L2+qGx*kT]∂νG*∂q∂q(φ)∂φ|φ=0

Since ∂νG∗∂q≤0, we know that ∂WT∂φ|φ=0<0 if the first two terms in eq. [25] are negative. Substituting ν_=qI(1−q)(L1+kT/r)qGq(L2−kT/r) and ∂q(φ)∂φ|φ=0=ρ′(0)H into eq. [25], we know that eq. [25] is negative if

[26]L1+kTr>qGqI⋅qrP+TrPL2−kTrkpHq−(1−q)TrPL2−kTrρ′(0).

Condition [26] is more likely to hold if kp is small, or ∂q∂φ|φ=0 is large (i.e. when ρ′(0) is large or when H is small). That kp needs to be small is easy to understand. A large value of ∂q∂φ|φ=0 means that the average quality of cases brought into the court can be increased significantly due to a strong deterring effect of torture. Therefore, the probability of committing a type II error can be substantially reduced. Condition [26] is also more likely to hold when Y+y (which enters into eq. [26] via ρ′(0)) is relatively small. The reason is similar to the pooling equilibrium case. If the plaintiff derives much utility from convicting the guilty, or cares much about the type II error, he will bring in cases even facing torture. In this case, the function of torture as a screening device will be small.

Again, eq. [26] is less likely to hold when qG/qI is large. If information revealed during investigation is relatively precise, then the chance of committing either type of error is low. In this case, there is no need to torture the plaintiff to reduce the cost of either type of error.

Acknowledgments

The authors thank the editor Burkhard Schipper and a referee for their helpful comments that led to significant improvements in the paper. The authors also thank Angela Ki-Che Leung for her valuable suggestions.

References

Bodde, R., and C. Morris. 1967. Law in Imperial China: Exemplified by 190 Ch’ing (Qing) Dynasty Cases. Cambridge, MA: Harvard University Press.10.4159/harvard.9780674733213Search in Google Scholar

Chen, K.-P., C.-F. Chou and T.-S. Tsai. 2009. ‘‘Judicial Torture as a War of Attrition.” mimeo.10.2139/ssrn.1412066Search in Google Scholar

“Confession.” 2005. West’s Encyclopedia of American Law. From Encyclopedia.com. Accessed December 8, 2014. http://www.encyclopedia.com/doc/1G2-3437701048.htmlSearch in Google Scholar

Damaška, M. 1978. “‘The Death of Torture.” The Yale Law Journal 87:860–84.10.2307/795611Search in Google Scholar

Evans, M. D., and R. Morgan. 1998. Preventing Torture. A Study of the European Convention for the Prevention of Torture and Inhuman or Degrading Treatment or Punishment. New York: Oxford University Press.Search in Google Scholar

Fullmer, J. 1980. “Technology, Chemistry, and the Law in Early 19th-Century England.” Technology and Culture 21:1–28.10.2307/3103985Search in Google Scholar

Gross, H. 1907. “Criminal Investigation: A Practical Textbook for Magistrates, Police Officers and Lawyers.” In Translated and Adapted from the System Der Kriminalistik of Hans Grossk, edited by J. Adam and J. C. Adam. London: Specialist Press.Search in Google Scholar

Kreps, D., and R. Wilson. 1982. “Reputation and Imperfect Information.” Journal of Economic Theory 27:253–79.10.1016/0022-0531(82)90030-8Search in Google Scholar

Langbein, J. H. 1974. Prosecuting Crime in the Renaissance: England, Germany, France. Cambridge, MA: Harvard University Press.10.4159/harvard.9780674184251Search in Google Scholar

Langbein, J. H. 1977. Torture and the Law of Proof: Europe and England in the Ancien Régime. Chicago: University of Chicago Press.10.7208/chicago/9780226922614.001.0001Search in Google Scholar

Langbein, J. H. 1978. “Torture and Plea Bargaining.” The University of Chicago Law Review 46:3–22.10.4324/9781315189437-11Search in Google Scholar

Langbein, J. H. 1983. “‘Torture.” In Encyclopedia of Crime and Justice, edited by S. H. Kadish, 1552–6. New York: The Free Press.Search in Google Scholar

Lea, H. 1971. Superstition and Force. New York: Haskell House Publishers.Search in Google Scholar

Mialon, H., S. Mialon, and M. Stinchcombe. 2012. “Torture in Counterterrorism: Agency Incentives and Slippery Slopes.” Journal of Public Economics 96:33–41.10.1016/j.jpubeco.2011.07.011Search in Google Scholar

Milgrom, P., and J. Roberts. 1982. “Predation, Reputation, and Entry Deterrence.” Journal of Economic Theory 27:280–312.10.1016/0022-0531(82)90031-XSearch in Google Scholar

Nemec, J. 1968. Highlights in Medicolegal Relations. Bethesda, MD: National Library of Medicine.Search in Google Scholar

Parker, B. 1983. “Criminalistics.” In Encyclopedia of Crime and Justice, edited by S. H. Kadish, 429–38. New York: The Free Press.Search in Google Scholar

Peters, E. 1985. Torture. New York: Basil Blackwell.Search in Google Scholar

Rubinstein, A. 1985. “A Bargaining Model with Incomplete Information about Time Preferences.” Econometrica 53:1151–72.10.2307/1911016Search in Google Scholar

Saferstein, R. 2001. Criminalistics. An Introduction to Forensic Science. Upper Saddle River, NJ: Prentice Hall.Search in Google Scholar

Salop, S. C. 1979. “Strategic Entry Deterrence.” American Economic Review, Papers and Proceedings 69:335–8.Search in Google Scholar

Sanchirico, C. W. 2000. “Games, Information, and Evidence Production: With Application to English Legal History.” American Law and Economics Review 2:342–80.10.1093/aler/2.2.342Search in Google Scholar

Sanchirico, C. W. 2001. “Relying on the Information of Interested – and Potentially Dishonest – Parties.” American Law and Economics Review 3:320–57.10.1093/aler/3.2.320Search in Google Scholar

Scott, G. R. 2003. The History of Torture throughout the Ages. London: Kegan Paul.Search in Google Scholar

Seidmann, D. 2005. “The Effects of a Right to Silence.” Review of Economic Studies 72:593–614.10.4324/9780203930885.ch12Search in Google Scholar

Seidmann, D., and A. Stein. 2000. “The Right to Silence Helps the Innocent: A Game-Theoretic Analysis of the Fifth Amendment Privileges.” Harvard Law Review 114:430–510.10.2307/1342573Search in Google Scholar

Shen, J. 1985. Lidai Xingfa Kao (A Study on Criminal Laws throughout History). Beijing: Zhonghua shuju.Search in Google Scholar

Silverman, L. 2001. Tortured Subjects. Pain, Truth, and the Body in Early Modern France. Chicago: University of Chicago Press.10.7208/chicago/9780226757520.001.0001Search in Google Scholar

Spence, A. M. 1973. “Job Market Signalling.” Quarterly Journal of Economics 87:355–74.10.2307/1882010Search in Google Scholar

Wantchekon, L., and A. Healy. 1999. “The ‘Game’ of Torture.” Journal of Conflict Resolution 43:596–609.10.1177/0022002799043005003Search in Google Scholar

Xue, Y. 1998. Tang-Ming Lue Hebian (The Tang and the Ming Codes). Beijing: Falue chubanshe.Search in Google Scholar

Published Online: 2015-2-26

Published in Print: 2015-7-1

Articles in the same Issue

https://doi.org/10.1515/bejte-2014-0023

Keywords for this article

torture; type I and type II errors; evidence