New Traffic Conflict Measure Based on a Potential Outcome Model

Kentaro Yamada; Manabu Kuroki

doi:10.1515/jci-2018-0001

Article Open Access

New Traffic Conflict Measure Based on a Potential Outcome Model

Kentaro Yamada and Manabu Kuroki

Published/Copyright: January 23, 2019

Published by

Become an author with De Gruyter Brill

Submit Manuscript Author Information Explore this Subject

From the journal Journal of Causal Inference Volume 7 Issue 1

Abstract

A key issue in the analysis of traffic accidents is to quantify the effectiveness of a given evasive action taken by a driver to avoid crashing. Since 1977, the widely accepted definition for this effectiveness measure, which is called traffic conflict, has been “the risk of a collision if the driver movement remains unchanged.” Although the definition is expressed counterfactually, the full power of counterfactual analysis was not utilized. In this paper, we propose a counterfactual measure of traffic conflict called Counterfactual Based Conflict (CBC). The CBC is interpreted as the probability that a driver avoided a crash actually by taking an evasive action in the counterfactual situation in which the crash would have occurred if he/she had not taken an evasive action and the crash would not have occurred if he/she had taken an evasive action. The CBC captures realistic aspects of the traffic situation, and lends itself to modern causal analysis. In addition, we provide some of identification conditions for the CBC. Furthermore, we formulate bounds on the CBC when the proposed identification conditions are violated. Finally, through an application of the CBC to the 100-Car Naturalistic Driving Study, we discuss the usefulness and limitations of the proposed measure.

Keywords: traffic conflict; crash-to-conflict ratio; counterfactual conditional; potential outcome model; structural causal model

1 Introduction

A traffic crash occurs when one vehicle collides with another vehicle or with a pedestrian, animal, road debris, or other stationary obstruction such as a tree or utility pole and often leads to severe consequences, including injury, death and property damage [1], [2], [3]. With the aim of preventing such consequences, a great deal of research effort has been expended in developing road safety theory (i. e., why crashes occur?), and identifying factors affecting road safety (i. e., what caused crashes?) [2], [4], [5].

It would be reasonable to observe crashes to conduct successful road safety research, if such observation were possible. However, there are two difficulties in conducting such research. First, crashes occur sporadically, and some of them might not be reported [6]. Second, waiting for such severe consequences in order to prevent crashes is questionable ethically [7]. To overcome these drawbacks, Perkins and Harris [8] developed a traffic conflict technique that analyzes observed situations in which drivers took an evasive action to avoid crashes.

Since the seminal work of Perkins and Harris [8], the concept of traffic conflicts has been refined and discussed by many researchers and practitioners in the field of road safety research [9], [10], [11]. In Oslo in 1977, a conceptual definition of traffic conflicts was proposed in the first workshop on traffic conflicts. The definition, which has been widely accepted by road safety researchers and practitioners since then, is as follows:

“A traffic conflict is an observable situation in which two or more road users approach each other in space and time to such an extent that there is a risk of collision if their movements remain unchanged” [12].

Regarding this definition, Davis et al. [13] pointed out that

“An observed event qualifies as a conflict only if it satisfies a counterfactual test; if the movements had remained unchanged then a collision would probably have resulted,”

and introduced the idea of counterfactual conditionals on the basis of structural causal model [14] into traffic conflict technique [13], [15], [16].

In 1982, Hauer [17] pointed out the importance of crash-to-conflict ratio in the traffic conflict technique. Since then, over the past decades, the research on crash-to-conflict ratio has attracted interest in the field of traffic conflict technique. Recently, Davis et al. [13] proposed a novel type of crash-to-conflict ratio based on the level of initial conditions in traffic situations and clarified some of the properties. We refer to this crash-to-conflict ratio as “Davis’s crash-to-conflict ratio.” Different from the existing traffic conflict techniques, Davis’s crash-to-conflict ratio is formulated based on the conceptual aspect of the potential outcomes in the form of the initial conditions. Although it is assumed that the information on the initial conditions are fully available in Davis et al. [13], considering that crashes occur as a result of complex interactions among drivers, their vehicles, and their road environments [7], it would be reasonable to consider that the initial conditions cannot be observed fully in actual field studies. Thus, it might be unrealistic to evaluate Davis’s crash-to-conflict ratio under such an assumption. Solving this problem requires formulating a new traffic conflict measure.

This paper proposes a new traffic conflict measure, the Counterfactual Based Conflict (CBC). To overcome the drawback of existing traffic conflict techniques, the CBC is formulated based on the potential outcome model, and thus reflects the counterfactual statement. The CBC can be interpreted as the probability that a driver avoided a crash actually by taking an evasive action in the counterfactual situation in which the crash would have occurred if he/she had not taken an evasive action and the crash would not have occurred if he/she had taken an evasive action. In this sense, the CBC evaluates the effectiveness of a given evasive action to avoid crashing, that is, the rate at which drivers in the abovementioned counterfactual situation actually avoided crashes by taking an evasive action. In addition, we provide identification conditions for the CBC. Furthermore, we formulate the bounds on the CBC, that is, the ranges within which the true values of the CBC must lie, when the proposed identification conditions are violated. Finally, through an application of the CBC to the 100-Car Naturalistic Driving Study (hereafter, the 100-car study) [1], we discuss the usefulness and limitations of the proposed measures. The extension of the CBC to a case of multiple evasive actions is also discussed in Appendix.

2 Potential outcome model

In this section, we introduce the potential outcome model [14], [18], [19], [20], to propose our new traffic conflict measure.

Let X and Y be random variables indicating an evasive action of a driver and a collision-related outcome, respectively. For simplicity, regarding a dichotomous evasive action X, x0 indicates that the driver does not take an evasive action, and x1 indicates that the driver takes an evasive action (x0<x1). In this paper, an evasive action is defined as any action of the driver that is aimed at avoiding a severe consequence. Thus, this can include actions that were both successful and unsuccessful, and both abrupt and non-abrupt in avoiding the severe consequence. In some observational studies, it may be difficult to directly judge whether a driver took an evasive action. However, in other cases, such evidence is available from the behavior of a vehicle, for example, the activation of the brake signal or the abrupt change of course. Although the discussion in the main part of this paper is based on a distinction between the presence and absence of an evasive action, in Appendix, we extend the discussion to a case of multiple evasive actions. It is noted that the decision as to whether we should focus on a dichotomous evasive action or multiple evasive actions is dependent on the problem setting of interest. For example, if (i) road safety researchers and practitioners wish to analyze several types of evasive actions or (ii) the assumption of “no multiple versions of treatment” (the definition is to be mentioned later) for a dichotomous evasive action is violated, we should focus on multiple evasive actions.

A collision-related outcome Y is the consequence of an evasive action taken in the event of interest, such as a crash or a near crash defined by physical quantities (e. g., the distance between vehicles and the time required for two vehicles to collide). Thus, Y can be a variable that takes a sampled value over time such as the time to collision (TTC), or a properly selected single value such as the minimum TTC or the time gap. The selection of the collision-related outcome depends on the problem setting of interest. Although it is important to develop a protocol to observe the collision-related outcome, we do not focus on this because the discussion in the paper does not depend on a specific accident occurrence process. For the discussion of the protocol to observe the collision-related outcome, for example, refer to Brown [21], Hayward [22], and Van der Horst and Hogema [23] for the TTC, and refer to Sun and Benekohal [24] for the time gap.

When Y>y for a given y, a smaller value of Y represents a situation in which a driver is closer to a severe consequence of interest. In contrast, when Y≤y, the driver is involved in the severe consequence of interest. For example, letting Y be a variable representing the TTC, if we are interested in whether the driver avoids crashing, then y would be zero. If we wish to clarify the driver’s judgment of braking timing in a severe situation based on the TTC [25], [26], y would be taken as a threshold value that determines whether he/she perceives to be in the severe situation. Although this paper mainly considers that a smaller value of y for Y>y represents a situation in which the driver was closer to a crash (but not occurred) and Y≤y represents a situation in which the crash occurred, the discussion of the paper is not limited to this problem setting.

pr(X=x)=pr(x) and pr(Y=y|X=x)=pr(y|x) indicate a marginal probability of X=x and a conditional probability of Y=y given X=x, respectively. In addition, pr(X=x,Y>y)=pr(x,Y>y) and pr(Y>y) indicate a joint probability of X=x and Y>y and a marginal probability of Y>y, respectively. Similar notation is used for other probabilities.

Regarding the ith of the N drivers, for X=x(x∈{x0,x1}), we use potential outcome Yx(i)=y to denote the counterfactual statement “Y would have a value y, had X been x for the ith driver.” In the potential outcome model, we adopt the “stable unit treatment value assumption” (SUTVA), which combines the “no interference between units” assumption, the “no multiple versions of treatment” assumption, and the “consistency” assumption to evaluate causal quantities. The “no interference between units” assumption means that Yx(i) for the ith driver is not dependent on the set of evasive actions taken by other drivers [18], [27], [28]. When drivers in the study are treated as random samples from the population under consideration, it is reasonable to expect that the assumption would be satisfied [29], and Yx(i) can be treated as the random variables Yx. In this paper, a set of random variables is assumed to be independent and identically distributed (i. i. d.), that is, each random sample is obtained independently from the same joint probability distribution as the others. Then, the probabilities of the potential outcomes can be defined as pr(Yx=y)=pr(yx), which is called a causal risk of X=x on Y=y. The other part of SUTVA concerns the “no multiple versions of treatment” assumption, stating that for each driver there is only a single version of each level of an evasive action. For example, Wu and Jovanis [30] studied an abrupt evasive action as an indication of a traffic conflict and concluded that the abruptness of an evasive action may not be practical for defining/detecting a conflict in some cases. In the framework of the potential outcome model, their finding implies that the assumption of “no multiple versions of treatment” for a dichotomous evasive action is violated and thus a potential outcome with multiple evasive actions should be used in some cases. Based on the “no multiple versions of treatment” assumption, we assume “consistency” [14], [18], [31], [32], and state that Yx(i) is observed if the ith driver has taken an evasive action X=x. The “consistency” assumption is formulated as

X=x⇒Yx=Y.

Note that neither the “no multiple versions of treatment” nor the “consistency” assumption is required in the framework of structural causal models [14]. They are derived from the assumption of autonomous data-generating process. Davis et al. [13] addressed structural causal models that represent collision-related outcomes resulting from interactions between initial conditions and evasive actions. Here, our results can also be derived in the framework of structural causal models, because the potential outcome can be constructed from structural causal models [14].

When a randomized experiment is conducted, X is independent of (Yx1,Yx0). This condition is known as “exogeneity”, or referred to as “full exchangeability” [33]. Under exogeneity, pr(yx) is identifiable and is given by

pr(yx)=pr(y|x).

Here, “identifiable” means that the causal quantities such as pr(yx) can be estimated consistently from a joint distribution of observed variables. When a randomized experiment is difficult to conduct, pr(yx) can still be identified according to conditional ignorability [18], [34], or graphically, the back door criterion [14]. If there exists a set W of covariates such that X is conditionally independent of Yx given W for any x, and pr(x|w)>0 for any x and w, we say that treatment assignment is conditionally ignorable given W, or that W satisfies the conditional ignorability condition. pr(yx) can then be estimated using W and is given by

pr(yx)=Ew{pr(y|x,W)}.

Here, Ew{pr(y|x,W)} represents the expectation of pr(y|x,W) regarding W. In the framework of structural causal models, Pearl [14] provided various kinds of identification conditions for pr(yx), in addition to the conditional ignorability condition.

3 Counterfactual Based Conflict (CBC)

3.1 Definitions and basic properties

Letting U be the set of discrete and continuous variables representing driver conditions that could affect X and Y, both observed and unobserved, Davis et al. [13] described the causal mechanism of a collision-related outcome Y in terms of the directed graph shown in Fig. 1. In Davis et al. [13], a condition U is limited to the running status of vehicles (e. g., speed, deceleration, headway, and driver reaction time). In contrast, in this paper, condition U is not so limited but covers any factor necessary to make Yx a deterministic function of x for the driver. This includes aspects of the wider traffic situation such as the status of a road and a vehicle, and the characteristics of a driver such as gender, age, and health [35]. In the figure, the directed edge from X to Y indicates that X could have an effect on Y. In addition, the absence of a directed edge from Y to X indicates that Y cannot be a cause of X, and the directed path from U to Y through X indicates that some elements of U could have an effect on Y mediated by X. Fig. 1 also provides the graphical representation of the data generating process

(1)Y=gy(X,U,ϵy),X=gx(U,ϵx),U=gu(ϵu),

where ϵx, ϵy, and ϵu are independent random disturbances.

Figure 1

Simple graphical representation of the causal mechanism of a collision-related outcome.

In the situation shown in Fig. 1, irrespective of the complexity of U, the impact of U on Y cannot amount to more than a modification of the functional relationship between X and Y. Thus, for example, letting Y be a dichotomous variable based on a threshold y, there are exactly four functions regarding two dichotomous variables X and Y, and thus the value taken by U selects one of these four functions [14], [36]. Considering these observations, based on the level of an evasive action X, Davis et al. [13] divided driver conditions into the following four types:

u1={u|Yx1(u)>y,Yx0(u)>y}: is the “safe condition” in which a driver would not have crashed regardless of the level of an evasive action. In this paper, we call a driver who satisfies this condition “a safe driver” for simplicity.
u2={u|Yx1(u)>y,Yx0(u)≤y}: is the “normal condition” in which a driver would not have crashed if he/she had taken an evasive action and would have crashed if he/she had not taken an evasive action. We call a driver who satisfies this condition “a normal driver.”
u3={u|Yx1(u)≤y,Yx0(u)≤y}: is the “doomed condition” in which a driver would have crashed regardless of the level of an evasive action. We call a driver who satisfies this condition “a doomed driver.”
u4={u|Yx1(u)≤y,Yx0(u)>y}: is the “incompetent condition” in which a driver would have crashed if he/she had taken an evasive action and would not have crashed if he/she had not taken an evasive action. We call a driver who satisfies this condition “an incompetent driver.”

Then, following Amundsen and Hyden [12] and Davis et al. [13], under the assumption of pr(u2)≠0, we define the traffic conflict measure called the counterfactual based conflict (CBC) motivated by Davis’s crash-to-conflict ratio, pr(Y≤y|uj)(j=1,…,4) [13], as

(2)pr(x1,Y>y|u2).

As seen from Eq. (2), the CBC is the probability that a normal driver took an evasive action and did not crash actually. Here, pr(u2) is also called the probability of necessity and sufficiency (PNS) [14], [37]. The PNS is the probability that a crash would respond to an evasive action both ways and therefore measures both the necessity and sufficiency of the evasive action to avoid crashing.

Davis et al. [13] pointed out two features of the definition of traffic conflicts:

“First, the situation referred to appears to have three components: an initial condition, the actions of the road users, and a collision-related outcome. Second, an observed event qualifies as a conflict only if it satisfies a counterfactual test.”

In Eq. (2), the initial condition is given by u2 through the relationships between an evasive action X and a collision-related outcome Y. In addition, since Yx is the mathematical expression of the counterfactual statement, both observed and counterfactual quantities are included in Eq. (2). Thus, these two features are reflected in the CBC.

According to Davis et al. [13], u1,…,u4 are considered as conflict severities, and also referred to as “response types” in causal inference [14] and epidemiology [38]. Here, it is noted that these conflict severities are related to “principal stratification”, which is a cross-classification of subjects defined by the joint potential values of the collision-related outcome under each of the evasive actions being compared [39], [40]. However, different from the principal stratification approach whose main purpose is to estimate the causal effects within principal strata, we are concerned with evaluating the effectiveness of an evasive action actually taken by a driver to avoid crashing.

For a dichotomous variable X, since pr(x0,Y>y|u2)=0, Eq. (2) is transformed into

(3)pr(x1,Y>y|u2)=pr(x0,Y>y|u2)+pr(x1,Y>y|u2)=pr(Y>y|u2).

Here,

pr(Y≤y|u2)=1−pr(Y>y|u2)

is the probability that a normal driver crashed actually. This is also interpreted as the special case of “the crash-to-conflict ratio for initiating events in set Uj” in Davis et al. [13]. However, as seen from Appendix A, 1−CBC is generally different from Davis’s crash-to-conflict ratio in the case of multiple evasive actions. In addition, since pr(x1,Y≤y|u2)=0, Eq. (2) is transformed into

(4)pr(x1,Y>y|u2)=pr(x1,Y>y|u2)+pr(x1,Y≤y|u2)=pr(x1|u2).

Eqs. (3) and (4) show that the probability that a normal driver took an evasive action actually is equivalent to the probability that he/she did not crash actually, as Davis et al. [13] pointed out.

Guttinger [41] stated,

“For some, the conflict is an event that precedes an evasive action that can be either successful or not (collision). For others, it is the same as a near-miss situation after an evasive action. In this last view, a conflict cannot lead to a collision but is an event parallel with a collision.”

Regarding Guttinger’s statement, the CBC probabilistically evaluates the second interpretation in the sense that it takes an actual evasive action into account. Here, although the word “traffic conflict” implies Guttinger’s first interpretation in some existing studies, this paper uses the word in the sense of Guttinger’s second interpretation that includes both crash and near-miss situations. The term “near-miss” used in Guttinger’s statement is considered to be the same as the concept of “near crash” in this paper.

As for Guttinger’s first interpretation, letting Z be a set of covariates not affected by evasive action X, we can also propose a traffic conflict measure, pr(z|u2). This measure is transformed into

(5)pr(z|u2)=pr(u2|z)pr(u2)pr(z),

which includes the ratio between the PNS, pr(u2) [14], [37] and the conditional PNS given Z, pr(u2|z) [42]. We are not concerned with the evaluation problem of pr(z|u2), because PNS has been discussed in detail by, for example, Kuroki and Cai [42], Cai and Kuroki [43], Pearl [14], [37] and Tian and Pearl [44].

It is worth emphasizing that the use of the CBC becomes significant when we wish to judge the effectiveness of an evasive action. For a crash that would have occurred if a driver had not taken an evasive action, the CBC evaluates the extent to which the crash was avoided by taking an evasive action actually. A higher value of the CBC suggests that an evasive action actually taken by a driver was more effective in avoiding the crash, represented by the given value y. In addition, the CBC is associated with the probability of the crash pr(Y<y) through Proposition 2 of Davis et al. [13]:

(6)pr(Y<y)=∑i=14pr(Y<y|ui)pr(ui)=(1−CBC)pr(u2)+pr(u3)+pr(Y<y|u4)pr(u4).

Therefore, when the value of the CBC is higher, the probability of the crash may be lower. This implies that the use of the CBC with a continuous collision-related outcome enables us to evaluate the effectiveness of an evasive action to avoid a severe consequence at any level.

The CBC is defined based on normal drivers, but we can also focus on other conditions to formulate “CBC-like measures” if necessary. For example, regarding safe drivers, doomed drivers, and incompetent drivers, the “CBC-like measures” that we can consider are

(7)pr(x1,Y>y|u1),

(8)pr(x1,Y>y|u3),

(9)pr(x1,Y>y|u4),

respectively. In addition, if we observe some of traffic conditions Z such as the status of a road or a vehicle, or the characteristics of a driver, we can formulate conditional CBC given Z=z as

(10)pr(x1,Y>y|u2,z),

and the conditional CBC-like measures as

(11)pr(x1,Y>y|u1,z),

(12)pr(x1,Y>y|u3,z),

(13)pr(x1,Y>y|u4,z).

Here, note that Eqs. (8), (9), (12) and (13) provide no information in the sense that they are zero. In this paper, we do not focus on Eqs. (7), (10) and (11) since the related results can be derived by the similar procedure to the CBC.

3.2 Identification conditions

Since the CBC is adapted from Davis et al. [13], when incompetent drivers do not exist in the population of interest, the CBC would be identifiable if we derive the information stated in Davis et al. [13]: (i) a structural model of crash events that supports counterfactual situation, (ii) understanding how evasive actions are achieved over a range of conflict severities, and (iii) knowledge of the relative frequencies of the different conflict severities. However, it may be difficult to derive such information in reality. In addition, the CBC is formulated on the basis of the joint distributions of potential outcomes Yx1 and Yx0, that is, pr(u2) and pr(x1,u2). These joint distributions cannot be expressed through pr(Yx1>y), pr(Yx0≤y), pr(x1,Y>y), and pr(x0,Y≤y), without further causal assumptions. This means that the CBC is not identifiable without further causal assumptions, even if both pr(x,y) and pr(yx)(x∈{x1,x0}) are available. Thus, we discuss the identification problem of the CBC.

Proposition 1.

Assume that the whole population consists only of normal drivers for a given y (i. e.,pr(u2)=1for a given y). Ifpr(x1)is available under the assumption, then the CBC is identifiable and is given by

(14)pr(x1,Y>y|u2)=pr(x1)

for a given y. In addition, ifpr(Y>y)is available under the assumption, then the CBC is also identifiable and is given by

(15)pr(x1,Y>y|u2)=pr(Y>y)

for a given y.

The proof is straightforward from Eqs. (3) and (4).

Perkins and Harris [8] stated that

“Over 20 objective criteria for traffic conflicts (or impending accident situations) have been defined as to specific accident patterns at intersections. Essentially, these traffic conflicts are defined by the occurrence of an evasive action, such as braking or weaving, which are forced on a driver by an impending accident situation or a traffic violation.”

Eq. (14) implies that the concept of traffic conflicts proposed by Perkins and Harris [8] does not contradict the counterfactual statement given at the first workshop on traffic conflicts. In addition, Eq. (15) implies that the concept of traffic conflicts defined by collision-related outcomes such as the TTC is justified when the population under consideration consists only of normal drivers.

From Proposition 1, when both pr(x1) and pr(Y>y) are available, we have

pr(x1)=pr(Y>y).

This property is useful for supporting the hypothesis that the population under consideration includes various types of drivers through the use of statistical data. Consider the statistical hypothesis

H0:pr(x1)=pr(Y>y)vsH1:pr(x1)≠pr(Y>y).

If the null hypothesis H0 is rejected at the given significance level, we can judge at the level that various types of drivers exist in the population under consideration. Here, note that there can exist a situation in which the statistical null hypotheses pr(x1)=pr(Y>y) or pr(x0)=pr(Y>y) are not rejected at the same time. However, this situation does not imply that both pr(u2)=1 and pr(u4)=1 are reasonable. Actually, pr(u2)=1 and pr(u4)=1 cannot occur simultaneously. As seen from these considerations, the statistical hypothetical testing described above can be used to check the necessity condition of pr(u2)=1, but not to check the sufficiency condition. Here, the equivalence between Eqs. (14) and (15) holds for the normal condition defined by a given specific value y. It is noted that a normal driver under a given specific value y may be classified as other type of driver for a different value y′(≠y).

Proposition 2.

Assume thatX=x1is independent of(Yx1,Yx0). Ifpr(x1)is available under the assumption, then the CBC is identifiable and is given by

(16)pr(x1,Y>y|u2)=pr(x1).

The proof is straightforward from Eq. (4).

Proposition 3.

Assume that incompetent drivers do not exist in the population under consideration. Ifpr(Yx1>y|x1),pr(Yx0>y|x1),pr(Yx1>y)andpr(Yx0>y)are available under the assumption, then the CBC is identifiable and is given by

(17)pr(x1,Y>y|u2)=pr(Yx1>y|x1)−pr(Yx0>y|x1)pr(Yx1>y)−pr(Yx0>y)pr(x1).

The proof can be achieved as a special case of Proposition 8 in Appendix A.

We assume that pr(Yx>y|x′) and pr(Yx>y) are available for x and x′(x,x′∈{x0,x1}) through the identification condition of causal risk or an expert’s knowledge of road safety mechanisms. pr(Yx1>y|x1)−pr(Yx0>y|x1) is the causal risk difference among drivers who took an evasive action actually. In addition, pr(Yx1>y)−pr(Yx0>y) is the causal risk difference among drivers in the entire population [45]. Note that a collision-related outcome is used to formulate the CBC, that is, the CBC may be difficult to evaluate in the absence of data on the collision-related outcome. To address this, we suggest the use of proxy variables of the collision-related outcome. Such research designs have been discussed in the context of effect restoration [46], [47], and latent structure analysis [48], [49]. A further suggestion is the use of results from pilot studies or past studies in the main study. Such research designs can be found in the literature on the data fusion problem [50].

3.3 Bounds

As can be seen from the definition, the CBC is not identifiable based on the knowledge of observed data alone, even under the conditional ignorability condition. When the CBC is not identifiable, one possible solution is to derive the closed-form formulas of the bounds on the CBC under the milder conditions.

Proposition 4.

Assume that

(18)pr(x1|u2)≥max{pr(x1|u1),pr(x1|u3)},

(19)pr(u3)≥pr(u1)

hold. If bothpr(x1,Y>y)andpr(x0,Y≤y)are available under the assumptions, then the bounds on the CBC are given by

(20)pr(x1,Y>y)pr(x1,Y>y)+pr(x0,Y≤y)≤pr(x1,Y>y|u2)≤1.

The proof can be achieved as a special case of Proposition 9 in Appendix A.

Intuitively, Eq. (18) implies that drivers who took an evasive action actually in the normal condition are more numerous than both those in the safe condition and those in the doomed condition. In addition, Eq. (19) shows that doomed drivers are more numerous than safe drivers in the population of interest. These assumptions would be reasonable when we focus on a traffic situation such as a potential rear-end collision in which a driver of the following vehicle adequately recognizes the risk of crashing. Note that Eq. (18) or Eq. (19) might not hold for specific values of y. In this case, the lower bound on the CBC is undefined. In this context, the assumption in Eq. (18) may be thought questionable when y is close to zero. However, Eq. (18) would still be reasonable when the main focus is on traffic safety research with a categorized collision-related outcome as, for example, in the 100-car study discussed in Section 4.

Proposition 5.

If bothpr(Yx>y)andpr(x,Y>y)are available forx∈{x1,x0}, then the bounds on the CBC are given by

(21)αl(x0,x1)×pr(x1,Y>y)αl(x0,x1)×pr(x1,Y>y)+βu(x0,x1)×pr(x0,Y≤y)≤pr(x1,Y>y|u2)≤αu(x0,x1)×pr(x1,Y>y)αu(x0,x1)×pr(x1,Y>y)+βl(x0,x1)×pr(x0,Y≤y),

where

(22)αl(x0,x1)=max0,pr(Y>y)−pr(Yx0>y)pr(x1,Y>y),

(23)αu(x0,x1)=min1,pr(Yx0≤y)−pr(x0,Y≤y)pr(x1,Y>y),

(24)βl(x0,x1)=max0,pr(Yx1>y)−pr(Y>y)pr(x0,Y≤y),

(25)βu(x0,x1)=min1,pr(Yx1>y)−pr(x1,Y>y)pr(x0,Y≤y),

respectively.

Proposition 5 can be proved as a special case of Proposition 10 in Appendix A.

4 Application to the 100-car study

4.1 Background

We apply the proposed traffic conflict measure to the data from the 100-car study, as shown in Table 1. The data was collected by the National Highway Traffic Safety Administration, Virginia Department of Transportation, and Virginia Tech Transportation Institute over a period of more than a year, from January 2003 to July 2004, from 100 drivers, to observe their behavior in various traffic situations that arise in daily life. The drivers were recruited through a newspaper advertisement. Thus, the 100-car study was an observational study.

According to Dingus et al. [1], first, events of interest that are candidate collision-related outcomes were selected based on the sensing values of lateral acceleration, longitudinal acceleration, forward TTC, rear TTC, yaw rate, and event button. The thresholds of these values were determined from a sensitivity analysis. Invalid events were then excluded by trained operators, whom Dingus et al. [1] referred to as “data reductionists.” They took account of the following: (a) video data on the driver’s view and behavior, and (b) time plots of the sensing values. They also assigned each valid event (i. e., collision-related outcome) to one of three levels: crash (y0), near crash (y1), and incident (y2). The “crash” category was used if “there is any contact with an object, in which kinetic energy is measurably transferred or dissipated.” The “near crash” category was used if “there is any circumstance that requires a rapid maneuver (“rapid evasive action” in our paper) by the subject vehicle to avoid crashing.” Other events were categorized as an “incident.” Dingus et al. [1] defined a “rapid maneuver” as one involving “subject vehicle braking greater than 0.5g (g is the constant of gravitation) or steering input that results in a lateral acceleration greater than 0.4g to avoid crashing.” In this section, we assume that y0<y1<y2 according to the severity. For details of the 100-car study, refer to Dingus et al. [1].

4.2 Analysis

Based on the data provided in Table 1, we present an analysis of rear-end collisions by the following vehicles. In Table 1, “evasive action (x1)” indicates that a driver took an evasive action such as braking, turning the steering wheel, accelerating, or a combination of these, and “no evasive action (x0)” indicates that the driver did not take any evasive action. In this situation, it would be reasonable to consider that the presence or absence of an evasive action taken by a driver of the following vehicle strongly affects the occurrence or non-occurrence of a crash. Then, the number of drivers in the safe, doomed and incompetent conditions can be assumed to be negligible, that is, pr(Yx1>y0,Yx0=y0)≃1. Making this assumption, from Proposition 1, we have

(26)pr(x1)≃pr(Y>y0).

Note that y0, y1 and y2 are the level taken by the collision-related outcome Y, but they do not mean the potential outcome Yx0 or Yx1.

Table 1

Drivers of the following vehicles: dichotomous evasive action.

	Crash (y0)	Near crash (y1)	Incident (y2)
No evasive action (x0)	7	0	29
Evasive action (x1)	8	380	5754

From Table 1, the unbiased estimates of pr(x1) and pr(Y>y0) are given by prˆ(x1)=0.9942 (standard error: 0.0010) and prˆ(Y>y0)=0.9976 (standard error: 0.0006), respectively, assuming that Table 1 was generated from a multinomial distribution. Thus, since there appears to be no strong reason for pr(x1) and pr(Y>y0) to be significantly different, it would be reasonable to consider that Table 1 was based on normal drivers for a given y0. In this situation, the CBC pr(x1,Y>y0|Yx1>y0,Yx0=y0) can be evaluated using prˆ(x1) and prˆ(Y>y0). In contrast, if we believe that Eq. (26) does not hold, then it is not appropriate to evaluate the CBC using prˆ(x1) or prˆ(Y>y0). In this situation, it would be useful to evaluate the bounds on the CBC. In the 100-car study, since the drivers of the following vehicles are considered to be able to recognize the crash risks adequately, it would be reasonable to assume that both Eqs. (18) and (19) hold for y0. Then, according to Eq. (20), the lower bound on pr(x1,Y>y0|Yx1>y0,Yx0=y0) is given by 0.9989 (standard error: 0.0004). Thus, the CBC takes a value within the range [0.9989,1.0000], indicating that nearly all normal drivers took an evasive action and did not crash actually.

Here, regarding pr(x1,Y>y1|Yx1>y1,Yx0≤y1), since the unbiased estimate of pr(Y>y1) is given by 0.9361 (standard error: 0.0031), one might think that pr(x1) and pr(Y>y1) are different. Therefore, it might be judged that pr(Yx1>y1,Yx0≤y1)≃1 does not hold. Then, as in the case of pr(x1,Y>y0|Yx1>y0,Yx0=y0), when we assume that both Eqs. (18) and (19) hold for y1, the lower bound on the CBC is given by 0.9988 (standard error: 0.0005). Thus, pr(x1,Y>y1|Yx1>y1,Yx0≤y1) takes a value within the range [0.9988,1.0000]. Consequently, it is considered that nearly all normal drivers took an evasive action and did not have a near crash actually.

5 Conclusions

In recent road safety studies, researchers and practitioners have proposed a range of traffic conflict techniques. However, most of these did not consider the counterfactual statement “there is a risk of collision if their movements remain unchanged.” To address this, we have proposed a new traffic conflict measure, the CBC. In contrast with most existing traffic conflict techniques, the CBC is formulated based on the potential outcome model in causal inference; hence, reflecting the counterfactual statement. We also provided identification conditions for the CBC and formulated the bounds on the CBC under particular assumptions when the proposed identification conditions are violated. In addition, through an application to the 100-car study, we showed that the CBC is helpful in reliably evaluating the effectiveness of an evasive action.

Funding source: Japan Society for the Promotion of Science

Award Identifier / Grant number: 15K00060

Funding statement: This work was partially supported by Japan Society for the Promotion of Science (JSPS), Grant Number 15K00060.

Acknowledgment

We would like to thank Professor Christer Hyden of Lund University and Clemens Kaufmann of International Co-operation on Theories and Concepts in Traffic Safety for providing us with important literature. In addition, we would like to appreciate comments from two referees that significantly improved the presentation of the paper. Finally, we thank Professor Judea Pearl of UCLA for his suggestion about clarifying emphasis points of the paper.

Appendix A CBC with multiple evasive actions

The occurrence of a crash depends not only on whether or not a driver takes an evasive action but also on what type of evasive action is taken. To address this issue, we discuss the case in which a driver chooses one of a set of possible evasive actions {x0,…,xp}. In this setting, under the assumption that pr(Yxj>y,Yxk≤y)≠0, Eq. (2) can be extended as follows:

(27)pr(xj,Y>y|Yxj>y,Yxk≤y)

(j,k=0,1,…,p;j≠k). Eq. (27) represents the probability that for a driver who would not have crashed if he/she had taken the evasive action X=xj and would have crashed with the evasive action X=xk, the driver took the evasive action X=xj and avoided crashing actually. In this paper, we consider both Eqs. (2) and (27) as CBCs because Eq. (27) is equivalent to Eq. (2) when j=1 and k=0.

A.1 Identification conditions and bounds

In this section, we provide the identification conditions and bounds for the CBC with multiple evasive actions. The proofs are given in the Appendix D.

Proposition 6.

Assume thatpr(Yxj>y,Yxk≤y)=1holds for a given y. Ifpr(xj)is available under the assumption, then the CBC is identifiable and is given by

(28)pr(xj,Y>y|Yxj>y,Yxk≤y)=pr(xj)

for a given y. Ifpr(xj,Y>y)is available under the assumption, then the CBC is also identifiable and is given by

(29)pr(xj,Y>y|Yxj>y,Yxk≤y)=pr(xj,Y>y)

for a given y.

Proposition 7.

Assume thatX=xjis independent of(Yxj,Yxk). Ifpr(xj)is available under the assumption, then the CBC is identifiable and is given by

(30)pr(xj,Y>y|Yxj>y,Yxk≤y)=pr(xj).

Proposition 8.

Assume thatpr(Yxj≤y,Yxk>y)=0holds for a given y. Ifpr(Yxj>y|xj),pr(Yxk>y|xj),pr(Yxj>y)andpr(Yxk>y)are available under the assumption, then the CBC is identifiable and is given by

(31)pr(xj,Y>y|Yxj>y,Yxk≤y)=pr(Yxj>y|xj)−pr(Yxk>y|xj)pr(Yxj>y)−pr(Yxk>y)pr(xj).

In addition, the following equations hold:

(32)pr(Yxj>y,Yxk>y)=pr(Yxk>y)pr(Yxj>y,Yxk≤y)=pr(Yxj>y)−pr(Yxk>y)pr(Yxj≤y,Yxk≤y)=pr(Yxj≤y).

Proposition 9.

Assume that

(33)pr(xj|Yxj>y,Yxk≤y)≥max{pr(xl|Yxj>y,Yxk≤y),pr(xj|Yxj>y,Yxk>y)}

forl≠j,k,

(34)pr(xk|Yxj≤y,Yxk≤y)≥pr(xk|Yxj>y,Yxk≤y),

and

(35)pr(Yxj≤y,Yxk≤y)≥pr(Yxj>y,Yxk>y).

If bothpr(xj,Y>y)andpr(xk,Y≤y)are available under the assumption, then the bounds on the CBC are given by

(36)pr(xj,Y>y)p×pr(xj,Y>y)+pr(xk,Y≤y)≤pr(xj,Y>y|Yxj>y,Yxk≤y)≤1,

where p stands for the number of evasive action types.

Proposition 10.

If bothpr(Yx>y)andpr(x,Y>y)are available, then the bounds on the CBC are given by

(37)αl(xk,xj)×pr(xj,Y>y)αl(xk,xj)×pr(xj,Y>y)+βu(xk,xj)×pr(xk,Y≤y)+∑l≠j,kpr(xl)≤pr(xj,Y>y|Yxj>y,Yxk≤y)≤αu(xk,xj)×pr(xj,Y>y)αu(xk,xj)×pr(xj,Y>y)+βl(xk,xj)×pr(xk,Y≤y).

Note that the probabilities of causationpr(Yxk≤y|xj,Y>y)andpr(Yxj>y|xk,Y≤y)lie within the ranges [αl(xk,xj),αu(xk,xj)] and [βl(xk,xj),βu(xk,xj)], respectively. Here, according to Tian and Pearl [44],αl(xk,xj),αu(xk,xj),βl(xk,xj), andβu(xk,xj)are derived by replacingx0andx1of Eqs. (22)–(25) withxkandxj, respectively.

Appendix B Application to the 100-car study (CBC with multiple evasive actions)

Here, we apply the results from the CBC with multiple evasive actions in Appendix A to the rear-end collision data from the 100-car study. According to Dingus et al. [1], we classified evasive actions into four types. “Ordinary evasive action (x1)” indicates that a driver only applied the brake. “Aggressive evasive action (x2)” indicates that a driver turned the steering wheel, with or without the ordinary evasive action. “Skilled evasive action (x3)” indicates that a driver accelerated, with or without the aggressive evasive action. “No evasive action (x0)” indicates that a driver did not take any evasive action. Here, we assume that x0<x1<x2<x3. Using the same data aggregated in Table 1, we re-aggregated the data in Table 2 to reflect this classificatory scheme. Note that the total number of drivers shown in Table 2 is slightly different from that shown in Table 1, as we excluded 10 drivers whose evasive actions were unknown.

Table 2

Drivers of the following vehicles: multiple evasive actions.

	Crash (y0)	Near crash (y1)	Incident (y2)
No evasive action (x0)	7	0	29
Ordinary evasive action (x1)	6	265	4930
Aggressive evasive action (x2)	1	111	746
Skilled evasive action (x3)	0	4	69

Based on Table 2, we address whether the following equation from Proposition 6 is reasonable to hold for xj, xk, and yl(j,k=0,…3;j≠k;l=0,1).

(38)pr(xj)≃pr(xj,Y>yl)

Here, in Table 3, under the assumption that Table 2 is generated from a multinomial distribution, we provide some statistics derived from Table 2. In Table 3, the first and second rows show the unbiased estimates and the standard error of the corresponding probabilities, respectively.

Table 3

Unbiased estimates and standard errors.

	prˆ(xj)	prˆ(xj,Y>y0)	prˆ(xj,Y>y1)
No evasive action (x0)	0.0058	0.0047	0.0047
	(0.0010)	(0.0009)	(0.0009)
Ordinary evasive action (x1)	0.8432	0.8423	0.7993
	(0.0046)	(0.0046)	(0.0051)
Aggressive evasive action (x2)	0.1391	0.1389	0.1209
	(0.0044)	(0.0044)	(0.0042)
Skilled evasive action (x3)	0.0118	0.0118	0.0112
	(0.0014)	(0.0014)	(0.0013)

Table 3 indicates that there appears to be no strong reason for pr(xj) and pr(xj,Y>y0) to be significantly different from each other for j=0,1,2,3, but these results do not enable us to determine which of the four evasive actions are to be compared with X=xj to evaluate the CBCs. As stated in Section 3.2, it is not reasonable to consider that pr(Yxj>y0,Yxk=y0)=1 for any j,k=0,1,2,3(j≠k), since they cannot hold simultaneously. From these observations, noting that prˆ(x0,Y>y0) is smaller than the other probabilities, it might be possible to assume that pr(Yx1>y0,Yx2>y0,Yx3>y0,Yx0=y0)≃1. This assumption does not contradict the statistical results derived from Table 3. From this assumption, it would be reasonable to consider that Table 2 was based on drivers who would take Y>y0 if they had taken any of the evasive actions and would take Y=y0 if they had not taken any evasive action. This observation is consistent with the case of a dichotomous evasive action discussed in Section 4.2, and the CBCs, pr(xj,Y>y0|Yxj>y0,Yx0=y0), can then be evaluated using prˆ(xj) and prˆ(xj,Y>y0).

In contrast, since there appear to be significant differences between pr(x1) and pr(x1,Y>y1) and between pr(x2) and pr(x2,Y>y1), it would be difficult to judge that pr(Yx1>y1,Yxk≤y1)=1 for k=0,2,3 or that pr(Yx2>y1,Yxk≤y1)=1 for k=0,1,3. This observation does not contradict the case of a dichotomous evasive action discussed in Section 4.2. In addition, one might believe that the population of interest comprised various types of drivers and thus that Eq. (38) does not hold. Therefore, it is inappropriate to evaluate the CBCs, pr(xj,Y>y1|Yxj>y1,Yxk≤y1), through prˆ(xj) or prˆ(xj,Y>y1) for j=1,2;k=0,1,2,3;j≠k even if the null hypothesis in Eq. (38) is not rejected. In this situation, it would be useful to evaluate the bounds on the CBCs. Here, if it would be reasonable to assume that Eqs. (33), (34), and (35) hold for y0 and y1, then the lower bounds on the CBCs are as given in Table 4. The lower triangular part of Table 4 provide the lower bounds on pr(xj,Y>y1|Yxj>y1,Yxk≤y1); j>k, and the upper triangular part of Table 4 provides the lower bounds on pr(xj,Y>y0|Yxj>y0,Yxk=y0); j>k. In Table 4, the first and second rows provide the estimates and the standard error corresponding to the lower bounds, respectively.

Table 4

Lower bounds on the CBCs.

	No(x0)	Ordinary(x1)	Aggressive(x2)	Skilled(x3)
No(x0)	–	0.3332	0.3324	0.3230
	–	(0.0001)	(0.0003)	(0.0040)
Ordinary(x1)	0.3332	–	0.3326	0.3244
	(0.0001)	–	(0.0003)	(0.0037)
Aggressive(x2)	0.3323	0.2973	–	0.3318
	(0.0004)	(0.0023)	–	(0.0015)
Skilled(x3)	0.3224	0.1444	0.2163	–
	(0.0043)	(0.0111)	(0.0117)	–

The results in Table 4 indicate that the lower bounds on prˆ(xj,Y>y0|Yxj>y0,Yxk=y0) are greater than the lower bounds on prˆ(xj,Y>y1|Yxj>y1,Yxk≤y1) for j,k=0,1,2,3;j>k. In addition, the lower bounds in Table 4 are smaller than those in the case of a dichotomous evasive action, perhaps reflecting the following observation. Intuitively, since the dichotomous evasive action case implies that a driver can choose from a wide range of evasive actions, including braking, turning the steering wheel, accelerating, or a combination of these, the probability of a crash avoidance or a near crash avoidance by taking an evasive action is relatively high. In contrast, when considering a crash or a near crash based on multiple evasive actions, a driver is allowed to take only a designated evasive action, that is, there is no other choice of evasive actions.

Appendix C Numerical examples

In this appendix, we present comparisons of the CBC and pr(Y>y) using numerical examples. pr(Y>y) can be considered as a measure based on existing traffic conflict techniques such as the TTC and the time gap. The TTC and the time gap are empirically known to follow the Weibull distribution [24], [51], [52]. Thus, we assume that (Yx1,Yx0) follows the bivariate Weibull distribution for a given actual evasive action X=xm(m=0,1). Yxn represents a potential outcome associated with a counterfactual evasive action X=xn(n=0,1). The bivariate Weibull distribution of interest here has the following survival function [53], [54]:

(39)pr(Yx0>y0,Yx1>y1|X=xm;γm0,γm1,λm0,λm1,αm)=exp−y0λm0γm0αm+y1λm1γm1αmαmyn>0,γmn>0,λmn>0,0<αm≤1;m,n=0,1.

Here, γmn, λmn, and αm are the shape, scale, and association parameters, respectively.

In accordance with previous empirical studies [24], [51], we assume that the collision-related outcomes follow the Weibull distribution with a shape parameter of approximately 1.0∼2.5. With such a parameter setting, the marginal distribution of Eq. (39) has a unimodal right-skewed shape with a long right tail. In addition, regarding the scale parameters, a smaller value given X=xm can be considered to indicate that a driver would take evasive action X=xn as he/she would be close to a severe situation given X=xm. In contrast, a larger value given X=xm can be considered to indicate that a driver would take evasive action X=xn even if he/she would be far from a severe situation given X=xm. Furthermore, regarding the association parameter, if it takes a smaller value given X=xm, potential outcomes Yx0 and Yx1 largely depend on each other given X=xm. If it takes a larger value given X=xm, the dependence between potential outcomes Yx0 and Yx1 is lower given X=xm. This situation arises when a randomized experiment of X is conducted.

$Figure 2 Comparison between the CBC and pr(Y>y)\mathrm{pr}(Y>y) (1).$

Figure 2

Comparison between the CBC and pr(Y>y) (1).

$Figure 3 Comparison between the CBC and pr(Y>y)\mathrm{pr}(Y>y) (2).$

Figure 3

Comparison between the CBC and pr(Y>y) (2).

Here, although we compare the CBC and pr(Y>y) under an ideal situation in which γmn, λmn, and αm are known, the true values of these parameters would be unknown in an actual road safety study. However, for example, when exogeneity holds in experiments such as the present simulation, Yxm are observed as the values from the distribution of Y in the group of drivers who take evasive action X=xm, (m=0,1), under proper random sampling. In this case, both γmn and λmn can be estimated using, for example, a maximum likelihood method based on the conditional distribution of Y given X=xm. However, since the values of Yx1(i) and Yx0(i) cannot be obtained simultaneously at the same time, the association parameter αm, which remains unknown, would be evaluated using a sensitivity analysis.

First, based on observations from the empirical studies, letting

(γ00,γ01)=(1.10,1.50),(γ10,γ11)=(1.10,2.00),(λ00,λ01)=(1.50,2.50),(λ10,λ11)=(1.50,3.00),α0=α1=0.10,

we show the difference between the CBC and pr(Y>y) in Fig. 2. In the figure, the CBC shows a unimodal function shape, whereas pr(Y>y) shows a simple decreasing function shape for y. Here, when the value of y is zero, the value of the CBC is equal to the value of pr(x1). The CBC takes the maximum value 0.770 at y=2.55 when pr(x1)=0.67, and 0.456 at y=2.55 when pr(x1)=0.33. The values of the CBC depend strongly on the value of pr(x1). These results indicate that the CBC is considered to reflect evasive actions of the drivers. In contrast, unlike the CBC, pr(Y>y) may not reflect evasive actions of the drivers adequately, because the shapes of pr(Y>y) are similar at pr(x1)=0.67 and 0.33.

Next, letting

(γ00,γ01)=(1.50,1.70),(γ10,γ11)=(1.50,2.20),(λ00,λ01)=(λ10,λ11)=(1.50,2.50),α0=0.90,α1=0.10,

Fig. 3 shows the difference between the CBC and pr(Y>y). In the figure, both the CBC and pr(Y>y) are decreasing functions of y, but the slope of the CBC is less steep than that of pr(Y>y). The values of the CBC depend more strongly on the value of pr(x1) compared to pr(Y>y). Thus, the CBC is considered to reflect evasive actions of the drivers, whereas pr(Y>y) may not reflect these actions adequately.

$Figure 4 CBC and bounds (when pr(x1)=0.67\mathrm{pr}({x_{1}})=0.67; the CBC is represented by solid lines, and the upper and lower bounds are represented by dotted lines).$

Figure 4

CBC and bounds (when pr(x1)=0.67; the CBC is represented by solid lines, and the upper and lower bounds are represented by dotted lines).

Finally, Fig. 4, which was obtained using the same parameter settings that were used to generate Fig. 2, shows the function shapes of the CBC and Eq. (20). In the figure, the lower bound on the CBC is a decreasing function of y, and the lower bound is smaller than the CBC. Here, the lower bound is undefined when y is smaller than 1.690 and larger than 5.190 because the proposed assumptions in Proposition 4 do not hold.

Appendix D Proofs

Proofs of Propositions6and7. Note that

(40)pr(xj,Y>y|Yxj>y,Yxk≤y)=pr(xj,Yxj>y|Yxj>y,Yxk≤y)=pr(xj|Yxj>y,Yxk≤y).

Thus, when pr(Yxj>y,Yxk≤y)=1 holds for a given y, we obtain Eqs. (28) and (29) immediately. In addition, when X=xj is independent of (Yxj,Yxk), we obtain Eq. (30).

Proof of Proposition8. When pr(Yxj≤y,Yxk>y)=0 for any y holds, we have

pr(xj,Y>y|Yxj>y,Yxk≤y)=pr(xj,Y>y,Yxk≤y)pr(Yxj>y,Yxk≤y)=pr(xj,Y>y)−pr(xj,Yxk>y)+pr(xj,Y≤y,Yxk>y)pr(Yxj>y)−pr(Yxk>y)+pr(Yxj≤y,Yxk>y)=pr(xj,Y>y)−pr(xj,Yxk>y)pr(Yxj>y)−pr(Yxk>y)=pr(Yxj>y|xj)−pr(Yxk>y|xj)pr(Yxj>y)−pr(Yxk>y)pr(xj).

Proof of Proposition9. First, we show that the following equation can be obtained:

(41)pr(Yxk≤y|xj,Y>y)≥pr(Yxj>y|xk,Y≤y).

To do this, from Eqs. (33) and (34), we derive

pr(xj|Yxj>y,Yxk≤y)pr(xk,Y≤y)−pr(xk|Yxj>y,Yxk≤y)pr(xj,Y>y)=pr(xj|Yxj>y,Yxk≤y){pr(xk,Y≤y,Yxj≤y)+pr(xk,Y≤y,Yxj>y)}−pr(xk|Yxj>y,Yxk≤y){pr(xj,Y>y,Yxk>y)+pr(xj,Y>y,Yxk≤y)}=pr(xj|Yxj>y,Yxk≤y)pr(xk|Y≤y,Yxj≤y)pr(Yxj≤y,Yxk≤y)−pr(xk|Yxj>y,Yxk≤y)pr(xj|Y>y,Yxk>y)pr(Yxj>y,Yxk>y)≥pr(xj|Yxj>y,Yxk≤y)pr(xk|Yxj>y,Yxk≤y)pr(Yxj≤y,Yxk≤y)−pr(xk|Yxj>y,Yxk≤y)pr(xj|Yxj>y,Yxk≤y)pr(Yxj>y,Yxk>y)=pr(xj|Yxj>y,Yxk≤y)pr(xk|Yxj>y,Yxk≤y)×(pr(Yxj≤y,Yxk≤y)−pr(Yxj>y,Yxk>y)).

Then, from Eq (35), we derive

pr(xj|Yxj>y,Yxk≤y)pr(xk,Y≤y)−pr(xk|Yxj>y,Yxk≤y)pr(xj,Y>y)≥0.

Thus, we have

pr(xj,Yxj>y,Yxk≤y)pr(xk,Y≤y)≥pr(xk,Yxj>y,Yxk≤y)pr(xj,Y>y),

which leads to Eq. (41).

Second, we show that the following equation can be obtained:

(42)pr(Yxj>y,Yxk≤y)≤p×pr(Yxk≤y|xj,Y>y)pr(xj,Y>y)+pr(Yxk≤y|xj,Y>y)pr(xk,Y≤y).

To do this, we derive

(43)pr(Yxj>y,Yxk≤y)=∑l=0ppr(xl,Yxj>y,Yxk≤y)=∑l≠j,kpr(xl,Yxj>y,Yxk≤y)+pr(xj,Yxj>y,Yxk≤y)+pr(xk,Yxj>y,Yxk≤y)=∑l≠j,kpr(xl,Yxj>y,Yxk≤y)+pr(Yxk≤y|xj,Y>y)pr(xj,Y>y)+pr(Yxj>y|xk,Y≤y)pr(xk,Y≤y).

From Eq. (33), the first term of Eq. (43) can be transformed into

∑l≠j,kpr(xl,Yxj>y,Yxk≤y)=∑l≠j,kpr(xl|Yxj>y,Yxk≤y)pr(Yxj>y,Yxk≤y)≤∑l≠j,kpr(xj|Yxj>y,Yxk≤y)pr(Yxj>y,Yxk≤y)=(p−1)pr(xj,Yxj>y,Yxk≤y)=(p−1)pr(Yxk≤y|xj,Y>y)pr(xj,Y>y).

And from Eq. (41), the third term of Eq. (43) can be transformed into

pr(Yxj>y|xk,Y≤y)pr(xk,Y≤y)≤pr(Yxk≤y|xj,Y>y)pr(xk,Y≤y).

By combining these equations, we derive Eq. (42). Since the CBC with multiple evasive actions can be transformed into

pr(xj,Y>y|Yxj>y,Yxk≤y)=pr(xj,Y>y,Yxk≤y)pr(Yxj>y,Yxk≤y)≥pr(Yxk≤y|xj,Y>y)pr(xj,Y>y)p×pr(Yxk≤y|xj,Y>y)pr(xj,Y>y)+pr(Yxk≤y|xj,Y>y)pr(xk,Y≤y)=pr(xj,Y>y)p×pr(xj,Y>y)+pr(xk,Y≤y)

from Eq. (42), we have Eq. (36).

Proof of Proposition10. From Eq. (43), the CBC can be transformed as follows:

This leads to Eq. (37) by considering bounds on probabilities of causation (i. e., pr(Yxk≤y|xj,Y>y) and pr(Yxj>y|xk,Y≤y)).

References

1. Dingus TA, Klauer SG, Neale VL, Petersen A, Lee SE, Sudweeks J, Perez MA, Hankey J, Ramsey D, Gupta S, Bucher C, Doerzaph ZR, Jermeland J, Knipling RR. The 100-car naturalistic driving study: phase II – results of the 100-car field experiment. DOT, HS-810 2006;593.10.1037/e624282011-001Search in Google Scholar

2. Traffic Accident Causation in Europe (TRACE) project. All TRACE reports are available for download at http://www.trace-project.org, 2006–2008.Search in Google Scholar

3. Zheng L, Ismail K, Meng X. Traffic conflict techniques for road safety analysis: open questions and some insights. Can J Civ Eng. 2014;41:633–41.10.1139/cjce-2013-0558Search in Google Scholar

4. Theofilatos A, Yannis G. A review of the effect of traffic and weather characteristics on road safety. Accid Anal Prev. 2014;72:244–56.10.1016/j.aap.2014.06.017Search in Google Scholar

5. Wang C, Quddus MA, Ison SG. The effect of traffic and road characteristics on road safety: a review and future research direction. Saf Sci. 2013;57:264–75.10.1016/j.ssci.2013.02.012Search in Google Scholar

6. Parker MR, Zegger CV. Traffic conflict techniques for safety and operations: observers manual. Federal Highway Administration. FHWA–IP–88–027 1989.Search in Google Scholar

7. Chin HC, Quek ST. Measurement of traffic conflicts. Saf Sci. 1997;26:169–85.10.1016/S0925-7535(97)00041-6Search in Google Scholar

8. Perkins SR, Harris JI. Traffic conflict characteristics: accident potential at intersections. General Motors Research Publication. GMR–718 1967.Search in Google Scholar

9. Hyden C. The development of a method for traffic safety evaluation: the Swedish traffic conflict technique. Lund Institute of Technology, Department of Traffic Planning and Engineering, Doctoral Thesis 1987.Search in Google Scholar

10. Migletz DJ, Glauz WD, Bauer KM. Relationships between traffic conflicts and accidents volume 2: final technical report. Federal Highway Administration. FHWA/RD–84/042 1985.Search in Google Scholar

11. Spicer BR. Study of traffic conflicts at six intersections. TRRL Report LR551. 1973.Search in Google Scholar

12. Amundsen FH, Hyden C. Proceedings of the First Workshop on Traffic Conflicts. Oslo; 1977.Search in Google Scholar

13. Davis GA, Hourdos J, Xiong H, Chatterjee I. Outline for a causal model of traffic conflicts and crashes. Accid Anal Prev. 2011;43:1907–19.10.1016/j.aap.2011.05.001Search in Google Scholar

14. Pearl J. Causality: models, reasoning, and inference. 2nd ed. Cambridge University Press; 2009.10.1017/CBO9780511803161Search in Google Scholar

15. Davis GA. Using Bayesian networks to identify the causal effect of speeding in individual vehicle/pedestrian collisions. In: Proceedings of the 17th Conference on Uncertainty in Artificial Intelligence. 2001. p. 105–11.Search in Google Scholar

16. Davis GA. Towards a unified approach to causal analysis in traffic safety using structural causal models. In: Transportation and Traffic Theory in the 21st Century. 2002. p. 247–65.10.1108/9780585474601-013Search in Google Scholar

17. Hauer E. Traffic conflicts and exposure. Accid Anal Prev. 1982;14:359–64.10.1016/0001-4575(82)90014-8Search in Google Scholar

18. Imbens GW, Rubin DB. Causal Inference for Statistics, Social, and Biomedical Sciences: an introduction. Cambridge University Press; 2015.10.1017/CBO9781139025751Search in Google Scholar

19. Rubin DB. Estimating causal effects of treatments in randomized and nonrandomized studies. J Educ Psychol. 1974;66:688–701.10.1037/h0037350Search in Google Scholar

20. Rubin DB. Bayesian inference for causal effects: the role of randomization. Ann Stat. 1978;6:34–58.10.1214/aos/1176344064Search in Google Scholar

21. Brown TL. Adjusted minimum time-to-collision (TTC): a robust approach to evaluating crash scenarios. In: Proceedings of Driving Simulation Conference 2005 North America, Orlando. 2005.Search in Google Scholar

22. Hayward JC. Near-miss determination through use of a scale of danger. Highw Res Rec. 1972;384:24–34.Search in Google Scholar

23. Horst RVD, Hogema J. Time-to-collision and collision avoidance systems. In: Proceedings of the 6th ICTCT Workshop. Salzburg. 1994.Search in Google Scholar

24. Sun D, Benekohal RF. Analysis of work zone gaps and rear-end collision probability. J Transp Stat. 2005;8:71–86.Search in Google Scholar

25. Hiraoka S, Wada T, Tsutsumi S, Doi S. Automatic braking method for collision avoidance and its influence on drivers behaviors. In: Proceedings of the First International Symposium on Future Active Safety Technology toward zero-traffic-accident. Tokyo. 2011.Search in Google Scholar

26. Horst RVD. Time-to-collision as a cue for decision making in braking. Vision in Vehicles III. Elsevier Science; 1991. p. 19–26.Search in Google Scholar

27. Rosenbaum PR. Interference between units in randomized experiments. J Am Stat Assoc. 2007;102:191–200.10.1198/016214506000001112Search in Google Scholar

28. Rubin DB. Which ifs have causal answers: comment on Holland (1986). J Am Stat Assoc. 1986;81:961–2.10.1080/01621459.1986.10478355Search in Google Scholar

29. Wooldridge JM. Econometric Analysis of Cross Section and Panel Data. 2nd ed. The MIT Press; 2010.Search in Google Scholar

30. Wu K, Jovanis PP. Defining and screening crash surrogate events using naturalistic driving data. Accid Anal Prev. 2013;61:10–22.10.1016/j.aap.2012.10.004Search in Google Scholar

31. Robins JM. A new approach to causal inference in mortality studies with a sustained exposure period: application to control of the healthy worker survivor effect. Math Model. 1986;7:1393–512.10.1016/0270-0255(86)90088-6Search in Google Scholar

32. Robins JM. The analysis of randomized and non-randomized AIDS treatment trials using a new approach to causal inference in longitudinal studies. In: Sechrest L, Freeman H, Mulley A, Public US, editors. Health Service Research Methodology: a focus on AIDS. Health Service, National Center for Health Services Research; 1989. p. 113–59.Search in Google Scholar

33. Hernan MA, Robins JM. Causal inference. CRC Press; 2018.Search in Google Scholar

34. Rosenbaum PR, Rubin DB. The central role of propensity score in observational studies for causal effects. Biometrika. 1983;70:41–55.10.21236/ADA114514Search in Google Scholar

35. Wu K. Defining, screening, and testing crash surrogates using naturalistic driving data. The Pennsylvania State University, Department of Civil and Environmental Engineering, PhD Thesis, 2011.Search in Google Scholar

36. Pearl J. Aspects of graphical models connected with causality. In: Proceedings of the 49th Session of the International Statistical Institute. 1993. p. 391–401.Search in Google Scholar

37. Pearl J. Probabilities of causation: three counterfactual interpretations and their identification. Synthese. 1999;121:93–149.10.1023/A:1005233831499Search in Google Scholar

38. Greenland S, Robins JM. Identifiability, exchangeability, and epidemiological confounding. Int J Epidemiol. 1986;15:413–9.10.1093/ije/15.3.413Search in Google Scholar PubMed

39. Frangakis CE, Rubin DB. Principal stratification in causal inference. Biometrics. 2002;58:21–9.10.1111/j.0006-341X.2002.00021.xSearch in Google Scholar

40. Pearl J. Principal stratification: a goal or a tool? Int J Biostat. 2011;7:1–13.10.2202/1557-4679.1322Search in Google Scholar

41. Guttinger VA. Conflict observation in theory and practice. In: International Calibration Study of Traffic Conflict Techniques. Springer; 1984. p. 17–24.10.1007/978-3-642-82109-7_3Search in Google Scholar

42. Kuroki M, Cai Z. Statistical analysis of “probabilities of causation” using covariate information. Scand J Stat. 2011;38:564–77.10.1111/j.1467-9469.2011.00730.xSearch in Google Scholar

43. Cai Z, Kuroki M. Variance estimators for three “probabilities of causation”. Risk Anal. 2005;25:1611–20.10.1111/j.1539-6924.2005.00696.xSearch in Google Scholar

44. Tian J, Pearl J. Probabilities of causation: bounds and identification. Ann Math Artif Intell. 2000;28:287–313.10.1023/A:1018912507879Search in Google Scholar

45. Rothman KJ, Greenland S, Lash TL. Modern Epidemiology. 3rd ed. Lippincott Williams and Wilkins; 2008.Search in Google Scholar

46. Kuroki M, Pearl J. Measurement bias and effect restoration in causal inference. Biometrika. 423;2014:101. 437.10.21236/ADA557455Search in Google Scholar

47. Pearl J. On measurement bias in causal inference. In: Proceedings of the 26th Conference on Uncertainty in Artificial Intelligence. 2010. p. 425–32.Search in Google Scholar

48. Hagenaars JA, McCutcheon AL. Applied Latent Class Analysis. Cambridge University Press; 2002.10.1017/CBO9780511499531Search in Google Scholar

49. Kuroki M. Graphical identifiability criteria for causal effects in studies with an unobserved treatment/response variable. Biometrika. 2007;94:37–47.10.1093/biomet/asm005Search in Google Scholar

50. Bareinboim E, Pearl J. Causal inference and the data-fusion problem. Proc Natl Acad Sci. 2016;113:7345–52.10.1073/pnas.1510507113Search in Google Scholar

51. Chin HC, Quek ST, Cheu RL. Quantitative examination of traffic conflicts. Transp Res Rec. 1992;1376:86–91.Search in Google Scholar

52. Wang Y, Ieda H, Mannering F. Estimating rear-end accident probabilities at signalized intersections: occurrence-mechanism approach. J Transp Eng. 2003;129:377–84.10.1061/(ASCE)0733-947X(2003)129:4(377)Search in Google Scholar

53. Kotz S, Balakrishnan N, Johnson NL. Continuous Multivariate Distributions: Models and Applications, volume 1. 2nd ed. John Wiley and Sons; 2000.10.1002/0471722065Search in Google Scholar

54. Lu JC, Bhattacharyya GK. Some new constructions of bivariate Weibull models. Ann Inst Stat Math. 1990;42:543–59.10.1007/BF00049307Search in Google Scholar

Received: 2018-01-02

Revised: 2018-12-08

Accepted: 2018-12-10

Published Online: 2019-01-23

Published in Print: 2019-04-26

This article is distributed under the terms of the Creative Commons Attribution Non-Commercial License, which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

Articles in the same Issue

https://doi.org/10.1515/jci-2018-0001

Keywords for this article

traffic conflict; crash-to-conflict ratio; counterfactual conditional; potential outcome model; structural causal model

Creative Commons

BY-NC-ND 3.0