Home Mathematics Piecewise Cause-Specific Association Analyses of Multivariate Untied or Tied Competing Risks Data
Article Publicly Available

Piecewise Cause-Specific Association Analyses of Multivariate Untied or Tied Competing Risks Data

  • Hao Wang and Yu Cheng EMAIL logo
Published/Copyright: September 19, 2014

Abstract

In this paper we extend the bivariate hazard ratio to multivariate competing risks data and show that it is equivalent to the cause-specific cross hazard ratio. Two approaches are proposed to estimate these two equivalent association measures. One extends the plug-in estimator, and the other adapts the pseudo-likelihood estimator for bivariate survival data to multivariate competing risks data. The asymptotic properties of the extended estimators are established by using empirical processes techniques. The extended plug-in and pseudo-likelihood estimators have comparable performance with the existing U-statistic when the data have no tied events. However, in many applications, there are tied events in which all the three estimators are found to produce biased results. To our best knowledge, we are not aware of any association analysis for multivariate competing risks data that has considered tied events. Hence we propose a modified U-statistic to specifically handle tied observations. The modified U-statistic clearly outperforms the other estimators when there are rounding errors. All methods are applied to the Cache County Study to examine mother–child and sibship associations in dementia among this aging population, where the event times are rounded to the nearest integers. The modified U performs consistently with our simulation results and provides more reliable results in the presence of tied events.

1 Introduction

Multivariate competing risks data occur often in genetic familial studies and demography as there are usually more than one risk involved. As a motivating example, we consider a study conducted in Cache County, Utah on dementia in an aging population. One goal of this study is to measure the association in dementia onset among family members. For each family member, the researchers recorded his or her onset age of dementia, age at death, or age at termination of the study, whichever occurred first, along with the corresponding cause indicator. This falls into the paradigm of multivariate competing risks, where there is within-subject dependence between the risk of interest and competing risks, as well as between-subject dependence among family members induced by common genetic or environmental factors.

Many researchers have studied association analysis for bivariate survival data with independent right censoring. An overview of their work was given by Hougaard [1]. The previous work, however, cannot be applied directly to multivariate competing risks data because the event of interest may not be observed due to the occurrence of a competing event. One limitation of competing risks data is that the within-subject dependence between the target event and competing events is nonparametrically untestable, and their marginal distributions are nonparametrically nonidentifiable [2, 3]. The nonidentifiability issue raises difficulties in analyzing the association in dementia in the presence of possibly dependent censoring by death, which occurred frequently among this aging population.

Some of the association analyses for bivariate survival data with independent censoring have been recently adapted to bivariate competing risks data Bandeen-Roche and Liang [4] modified Oakes’s [5] cross hazard ratio to the bivariate competing risks setting. They defined a cause-specific association measure θCS(s,t;k,l) which gives the factor by which a child’s risk of having cause k event at time s compares if his/her mother has experienced the cause l event by time t and has not yet experienced any event by time t. To estimate θCS(s,t;k,l), Bandeen-Roche and Liang [4] proposed a U-type estimator based on a local version of Kendall’s τ with inference using standard theory for U-statistics [6]. Cheng and Fine [7] defined a ratio ζ(s,t;k,l) of the bivariate hazard with cause-specific events from both subjects to the products of the bivariate hazards with a single event from each subject as an association measure. They established the equivalence of ζ(s,t;k,l) to the association measure θCS(s,t;k,l) and developed a plug-in estimator with an explicit variance form.

More recently, Shih and Albert [8] examined familial association in breast cancer based on a cross hazard ratio for paired times of first event and an odds ratio for two competing causes (cancer and non-cancer deaths) and estimated the two association measures by adopting the two-stage estimation method [9]. They noted that the cause-specific hazard ratio θCS(s,t;k,l) may be more clinically relevant than their cross hazard ratio for the first event times of paired members. Hence, we focus on the cause-specific association measures θ and ζ which have straightforward interpretations as the two are essentially a local correlation between indicator functions.

Most of the previous works related to θCS or ζ focused on bivariate competing risks data with few exceptions. Cheng et al. [10] adapted the cause-specific cross hazard ratio θCS for bivariate data to sibship data and multiple mother–child pairs and greatly broadened its scope of application. Gorfine and Hsu [11] proposed a flexible frailty model for multivariate competing risks data based on cause-specific hazard functions, which includes the frailty model considered in Bandeen-Roche and Liang [4] and the positive-stable model in Katki et al. [12] as special cases. However, the modeling and estimation approach in Gorfine and Hsu [11] requires the specification of a proper frailty model and the use of the EM algorithm with numerical integration. The focus of this paper is on nonparametric estimation of the cause-specific association measures for multivariate competing risks data within some pre-specified time regions, in line with the work by Cheng et al. [10].

Therefore, we first extend the association measure ζ from bivariate competing risks data to clustered data and develop a plug-in estimator as is done in the bivariate case. Next, we adapt a pseudo-likelihood estimator for bivariate survival data [13, 14] to multivariate competing risks data. The asymptotic properties of the two proposed estimators are established by using empirical processes techniques. The finite performances of the two proposed methods are compared with those of the existing U-estimator [10] for clustered competing risks data. Interestingly, we have found that all the three estimators produce biased results when the data have tied events. Rounding errors are common in many applications as the event times are often rounded to their nearest integers. Somewhat surprisingly, we are not aware of any work that addresses how to handle tied events in evaluating concordant status between two pairs of event times of interest. Therefore, we develop a modified U-estimator that explicitly takes into account the tied events and show that it outperforms the other three estimators for tied data. To our best knowledge, it is the first attempt to address the issue of rounding errors in nonparametric association analyses of bivariate and multivariate competing risks data.

The rest of the paper is organized as follows. We extend the plug-in estimator to multivariate competing risks data in Section 2.2, briefly describe the notation and the method for the existing U-estimator [10] in Section 2.3, and propose the pseudo-likelihood estimator in Section 2.4. In Section 2.5 tied event data are considered, and a modified U-statistic is proposed to specifically handle rounding errors. In Section 2.6 we will discuss some formal tests on whether the associations are constant over time. The extended plug-in estimators and pseudo-maximum-likelihood estimators are compared with the U-statistics for untied data via simulation studies in Sections 3.13.3. In Section 3.4 we investigate the impact of rounding errors on the performance of the three estimators in comparison to the modified U-estimator. All methods are applied to the Cache County Study in Section 4, and some remarks are given in Section 5.

2 Methods

2.1 Multivariate competing risks data

In this paper, we consider clustered competing risks data. Suppose that in each cluster there are M+1 failure times (T0,T1,,TM) and their corresponding cause indicators (0,1,,M) from a mother and her M children, where M is random and is assumed to bear no information on the association. We only consider two causes here j=1,2,j=0,,M, as we are primarily interested in association in cause 1 events and can always group multiple competing events together. In addition, there may also be independent and non-informative censoring times (C0,C1,,CM). Note that the data were collected retrospectively, hence it is reasonable to assume censoring times are independent of event times. This is in contrast with a perspective study where there may be some dependence among censoring times from mothers and children due to their inherent age differences. Hence one observes Yj=min(Tj,Cj), and ηj=jI{TjCj}, for j=0,1,,M, where I is the indicator function. This notation incorporates two types of multivariate competing risks data. One is the sibship data, where pairwise exchangeability can be assumed among siblings, i.e. all siblings have the same marginals and the pairwise joint densities are symmetric. The other consists of multiple mother and child pairs where we do not impose the exchangeability assumption between a mother and her child, but assume exchangeability among all children. The observed data containing n i.i.d. clusters are denoted as {(Yi0,,Yi,mi,ηi0,,ηi,mi,mi),i=1,,n}. The following two association measures are considered for each of the two types of multivariate competing risks data.

2.2 Bivariate cause-specific hazard ratio for clustered data and its plug-in estimator

Cheng and Fine [7] proposed a cause-specific association measure based on bivariate cause-specific hazard functions and demonstrated its equivalence to the conditional cross hazard ratio [4]. We now extend their association measure to the clustered competing risks data. To simplify the notation we focus on association analysis of cause 1 events, though the methods proposed here are applicable to other causes. Analogous to Cheng and Fine [7], for any child–mother pair (Tj,T0),1jM, we define a bivariate cause-specific hazard function for double cause 1 events from both subjects under the exchangeability assumption for all children

λ11(s,t)=lim(Δs,Δt)0PTj[s,s+Δs],j=1,T0[t,t+Δt],0=1|Tjs,T0t/(ΔsΔt),

and define bivariate hazard functions of a single cause 1 event from the child or mother with the other member being at risk as the following:

λ10(s,t)=limΔs0PTj[s,s+Δs],j=1|Tjs,T0t/Δs,and
λ01(s,t)=limΔt0PT0[t,t+Δt],0=1|Tjs,T0t/Δt.

Then the association measure for cause 1 events based on these bivariate hazard functions is

(1)ζCM(s,t;1,1)=λ11(s,t)λ10(s,t)λ01(s,t).

It quantifies the elevated risk that both a child and his/her mother failed at times (s,t), as compared to the product of the risks that only one of them failed at s or t, conditionally on both the child and the mother being at risk at (s,t).

A simple plug-in estimator of ζCM(s,t;1,1) can be constructed analogously to Cheng and Fine [7] based on all child–mother pairs. We first need to extend the event processes to the more complicated family structure with multiple children in a family. Let N11(s,t)=jMI(Yjs,ηj=1,Y0t,η0=1), N10*(s,t)=jMI(Yjs,ηj=1,Y0t), N01*(s,t)=jMI(Yjs,Y0t,η0=1), H*(s,t)=jMI(Yjs,Y0t), where M is the number of children in a family which may vary from family to family. Cheng et al. [10] divided the time scale as follows: 0=τ1,0<τ1,1<τ1,2<<τ1,n1=τ1,0=τ2,0<τ2,1<τ2,2<<τ2,n2=τ2, where τ1 and τ2 are the largest observed event times. These grids are predetermined based on practical interests as well as the feasibility of analyzing the data (i.e. the number of at risk or the number of the events should not be too small in any time grid). Assuming constant association measure ζCM(Ωqr) over the region Ωqr(τ1,q,τ1,q+1)×(τ2,r,τ2,r+1) with 0qn11,0rn21, we have

(s,t)Ωqrw(s,t)EN11(ds,dt)EH(s,t)ζCM(Ωqr)EN10(ds,t)EN01(s,dt)=0,

where E is the expectation operator and w(s,t) is some bounded deterministic function. The above equation is derived by recasting the hazard functions in eq. (1) in terms of the expectations of N11,N10,N01, and H, similar to the one given in Cheng and Fine [7]. That is, EN11(s,t)=E(M)P(Y1s,η1=1,Y0t,η0=1) and the expectations of the other three stochastic processes can also be expressed as the products of E(M) and the probabilities of a single event or at risk. Therefore, ζCM(Ωqr)=EN11(ds,dt)EH(s,t)EN10(ds,t)EN01(s,dt) and the above equation follows automatically.

Define the empirical process PnN11(s,t)=1ni=1nj=1miI(Yijs,Yi0t,ηij=1,ηi0=1), and similarly for PnH(s,t), PnN10(s,t), and PnN01(s,t). Since the number of children M is random, the above formula naturally handles different cluster sizes. Plugging the empirical processes into the above equation yields a plug-in estimator

(2)ζˆCM(Ωqr)=(s,t)Ωqrwˆ(s,t)PnN11(ds,dt)PnH(s,t)(s,t)Ωqrwˆ(s,t)PnN10(ds,t)PnN01(s,dt),

where wˆ is a weight function satisfying (s,t)Ωqrwˆ(s,t)dsdt=1, ||wˆw||0, and n(wˆw)=OP(1). The selection of the weight function wˆ deserves some attention. If the association is constant within the region Ωqr, different weight functions usually would yield comparable estimates of ζCM. In this case, the uniform weight wˆ(s,t)=1(τ1,q+1τ1,q)(τ2,r+1τ2,r) is a convenient choice. However, if the association is time-varying within Ωqr, the weight will play an important role in summarizing the time-varying association. When wˆ(s,t)=1PnN10(ds,t)PnN01(s,dt)/Ωqr1PnN10(ds,t)PnN01(s,dt), we are averaging the point-wise association across Ωqr with equal weight. One may consider other weighting schemes such as putting more weights on early time points at which the number of at risk is large.

The consistency and asymptotic normality of ζˆCM(Ωqr) can be established using empirical processes techniques [15, 16]. Given EM2< we can show that all the above indicator processes are P-Donsker. Since ζˆCM is a compact differential map of empirical processes indexed by Donsker classes, its consistency and asymptotic normality follow. In addition, there is an asymptotically linear representation n1/2(ζˆCMζCM)=n1/2iIi+oP(1), where Ii are influence functions, upon which the variance of ζˆCM can be estimated explicitly. Detailed proofs are given in the section “Asymptotic properties of ζˆCM” in Appendix.

The association measure ζ(s,t;1,1) for bivariate competing risks data can be similarly adapted to sibship data denoted as ζCC(s,t;1,1). To estimate ζCC, we define the empirical processes PnN11#(s,t)=1ni=1n1j,jmiIYijs,ηij=1,Yijt,ηij=1 and similarly PnH#(s,t), PnN10#(s,t), and PnN01#(s,t) based on all within-cluster pairs. The asymptotic property of the plug-in estimator for ζCC can be established using similar arguments to those in the section “Asymptotic properties of ζˆCM” in Appendix except assuming EM4< as a sufficient condition, which is reasonable as the family size M is random but bounded.

2.3 Cause-specific cross hazard ratio for clustered data

For mother and all children data, under the exchangeability assumption for the children, the child–mother cause-specific density was defined as

g(s,t;k,l)=lim(Δs,Δt)0PsTjs+Δs,tT0t+Δt,j=k,0=l/(ΔsΔt),

for k,l=1,2 and j=1,,M. Assuming g is absolutely continuous, a child–mother pair (Tj,T0) has a joint overall survival function

G(s,t)=stk=12l=12g(u,v;k,l)dudv.

Cheng et al. [10] defined a child–mother cause-specific association measure denoted by θCM(s,t;k,l) as follows:

(3)θCM(s,t;k,l)=G(s,t)g(s,t;k,l)sm=12g(u,t;m,l)duth=12g(s,v;k,h)dv,

which shares the same interpretation as the conditional cross hazard ratio proposed by Bandeen-Roche and Liang [4]. That is, the risk of a child experiencing a cause k event by time s would be accelerated for θCM>1 or decelerated for θCM<1 had his mother developed a cause l event by time t as compared to the case that his mother had no event by time t. This extended conditional cross hazard ratio was estimated in a piecewise manner by a U-statistic [10]. We now briefly describe the notation and methods given in Cheng et al. [10]. Similar notation will be used when we develop our modified U-statistic in Section 2.5 that specifically deals with tied events.

Let Yi,a,Yi,b,1a<bmi be the failure times of two distinct members of the ith cluster, and Yj,c,Yj,d,1c<dmj be the failure times from the jth cluster. θCM(s,t;k,l) was assumed to be a constant over each grid Ωqr, 0qn11,0rn21. Cheng et al. [10] defined Y(iajc)=min(Yia,Yjc),η(iajc)=ηiaI(Yia<Yjc)+ηjcI(Yia>Yjc) and similarly Y(ibjd) and η(ibjd) and defined the concordant indicator

ϕij,acbdqr=IYiaYjcYibYjd>0,Y(iajc),Y(ibjd)Ωqr,η(iajc),η(ibjd)=(k,l),

and the discordant indicator

ψij,acbdqr=IYiaYjcYibYjd<0,Y(iajc),Y(ibjd)Ωqr,η(iajc),η(ibjd)=(k,l).

The extended child–mother association measure was estimated by

(4)θˆCMU(Ωqr;k,l)=n211i<jn1ami1cmjϕij,ac00qrn211i<jn1ami1cmjψij,ac00qr

which was the ratio of the number of concordanst pairs to that of discordant pairs, among those pairs where concordance status was determinable, and the times and causes of failure matched those in θCM.

For any 1j<jM in the sibship data, Cheng et al. [10] defined the bivariate cause-specific densities fjj(s,t;k,l)=lim(Δs,Δt)0P(sTjs+Δs,j=k,tTjt+Δt,j=l)/(ΔsΔt), for k,l=1,2 and the overall survival function Sjj(s,t)=stk=12l=12fjj(u,v;k,l)dudv assuming that fjj is absolutely continuous. The subscripts j and j were dropped under the exchangeability assumption among siblings, such that f(s,t;k,l)=f(t,s;l,k), for any pair of causes k,l at any time point (s,t). The sibship cause-specific association measure assuming exchangeability was defined as

(5)θCC(s,t;k,l)=f(s,t;k,l)sm=12f(u,t;m,l)duS(s,t)th=12f(s,v;k,h)dv.

The estimation of the sibship association θCC is more complicated than that of the child–mother association θCM, as under pairwise exchangeability it is meaningful to define concordance and discordance indicators based on two different pairings (Yia,Yjc) with (Yib,Yjd) as well as (Yia,Yjd) with (Yib,Yjc) [17]. Hence θCC is estimated by

(6)θˆCCUΩqr;k,l=n211i<jn1a<bmi1c<dmjϕij,acbdqr+ϕij,adbcqrn211i<jn1a<bmi1c<dmjψij,acbdqr+ψij,adbcqr.

The estimators (4) and (6) are U-statistics whose asymptotic properties can be obtained with U-statistic theory [10].

2.4 Maximum-pseudo (composite)-likelihood estimator

One can show that ζCC is equivalent to θCC defined in eq. (5) and that ζCM is equivalent to θCM defined in eq. (3), following the same argument as shown in Cheng and Fine [7]. Even though the data are multivariate, the association measures are still defined for paired event times, and the argument in Cheng and Fine [7] can be applied directly. Hence the two sets of estimators, the U-statistics and the plug-in estimators, are estimating the same quantities. We now discuss another estimating procedure for θ s adapted from the Clayton’s [13] pseudo-likelihood function for bivariate survival data. The likelihood was also called a composite likelihood as discussed in Lindsay [18]. Ning and Bandeen-Roche [19] recently adapted the pseudo-likelihood to bivariate competing risks data under a regression setting. We now further extend the likelihood to multivariate competing risks data using a stratified method.

We first divide mother–multiple children data into several strata, each containing pairs of mothers and their dth children. Here d ranges from 1 to mmax, where mmaxmaxi=1nmi is the maximum number of children among all families. Without loss of generality, we consider the first stratum of mothers and the corresponding eldest children. Define the cause-specific set A(s,t;k,l) of size a(s,t;k,l) pairs satisfying (1) both members are not censored by time (s,t), and a(s,t;k,l)1; (2) there is at least one type l event that occurred at time t from one of the mothers and no event just prior to time s from her eldest child; and (3) there is at least one type k event that happened at time s from one of the eldest children and no event just prior to time t from the corresponding mother. Let Δ(s,t;k,l)=1 if the types k and l events were from the same pair and 0 otherwise. It can be shown that P{Δ(s,t;k,l)=1|A(s,t;k,l)}=θCS(s,t;k,l)a(s,t;k,l)1+θCS(s,t;k,l), and P{Δ(s,t;k,l)=0||A(s,t;k,l)}=a(s,t;k,l)1a(s,t;k,l)1+θCS(s,t;k,l).

We now construct the pseudo-likelihood function based on the observed mother–multiple children data {(Yi0,,Yi,mi,ηi0,,ηi,mi,mi),i=1,,n} from the Cache County Study. To incorporate the multiple children structure, we adopt a stratified method. Let Dd denote the set containing all the families that have a dth child, where d=1,,mmax. Then mother–child pairs belonging to Dd form a bivariate competing risks data set {(Yid,ηid,Yi0,ηi0),iDd} for which the pseudo-likelihood can be constructed. As discussed before, we assume a constant θCM(Ωqr) over the region Ωqr. To find all (s,t) pairs in Ωqr such that a(s,t;k,l)1 and Δ(s,t;k,l)=1, we count each child–mother pair in Dd as a concordant pair, if two event times from a pair are contained in Ωqr and their causes are (k,l), denoted by the indicator function ϕi,d0qr=I{(Yid,Yi0)Ωqr,(ηid,ηi0)=(k,l),}, iDd. The contribution of such pair to the log-likelihood function is ϕi,d0qrlog(θCMa(Yid,Yi0,k,l)1+θCM). Next, to find (s,t) such that Δ(s,t;k,l)=0, i.e. the events k and l are not from the same pair, we check any two different pairs (Yid,ηid,Yi0,ηi0) and (Yjd,ηjd,Yj0,ηj0) in Dd. If the times of the two pairs are discordant, s will be the minimum of children’s failure times Y(idjd) and t will be the minimum of mothers’ failure times Y(i0j0). The pair of the minimum failure times (Y(idjd),Y(i0j0)) needs to be in Ωqr and the cause indicators of the minimums (η(idjd),η(i0j0)) need to match (k,l). We call two such pairs “discordant pairs” and define the discordant indicator ψij,dd00qr=I{(YidYjd)(Yi0Yj0)<0, (Y(idjd),Y(i0j0))Ωqr,(η(idjd),η(i0j0))=(k,l)}, where i<j and i,jDd. Their contribution to the log-likelihood function is ψij,dd00qrlog(a(Y(idjd),Y(i0j0),k,l)1a(Y(idjd),Y(i0j0),k,l)1+θCM). Combining all the bivariate competing risks data from different strata Dd together, d=1,,mmax, we have the following pseudo-log-likelihood function:

(7)Ln(θCM(Ωqr))=d=1mmaxiDdϕi,d0qrlog{θCMa(Yid,Yi0;k,l)1+θCM}+d=1mmaxi>j&i,jDdψij,dd00qrlog{a(Y(idjd),Y(i0j0);k,l)1a(Y(idjd),Y(i0j0);k,l)1+θCM}.

The maximum-pseudo-likelihood estimator θ˜CML can be obtained by using some standard minimization procedure, e.g. function “nlminb” in R, for the negative log likelihood. As pointed out by Oakes [14], the variance of θ˜CML cannot be obtained from the second derivative of the pseudo-log-likelihood function as ψij,dd00qr may be dependent when θCM(Ωqr)1. Instead, we use some results for Z-estimators in empirical processes [15] to derive the asymptotic property of θ˜CML based on the estimation equation Ψn(θCM)=θCM(2n)L˙n(θCM)=0, where the superscript dot denotes the derivative. The inference will be based on the nonparametric bootstrap method.

The above framework can be further extended to clustered sibship data for which exchangeability can be assumed among all siblings. The adaptation of the maximum-pseudo-likelihood estimation to the sibship data is more complicated as there are two meaningful ways to form a pair, (Yia,Yib) and (Yib,Yia), for any two children a,b from the ith family, under the exchangeability assumption. In addition, there may be too many different combinations of a and b from multiple children families. To simplify the problem, we continue to use our stratified method and denote the set Dab, 1a<bmmax, which consists of all families having ath and bth children. To adapt the exchangeability assumption, we consider the bivariate competing risks set consisting of all child–child pairs {(Yia,ηia,Yib,ηib),iDab} together with its switched set containing all pairs {(Yib,ηib,Yia,ηia),iDab}. The cause-specific set A(s,t,k,l) is naturally extended to this bivariate set and its switched set, with the size of a(s,t,k,l). As before let Δ(s,t;k,l)=1 if cause k event at time s and cause l event at time t were from the same pair and Δ(s,t;k,l)=0 otherwise. We can still show that P{Δ(s,t;k,l)=1|A(s,t;k,l)}=1P{Δ(s,t;k,l)=0|A}=θCC(s,t;k,l)a(s,t;k,l)1+θCC(s,t;k,l).

We now construct the pseudo-log-likelihood function based on the stratum Dab. As before we assume constant sibship association θCC within time region Ωqr. To find all (s,t) pairs in A(s,t;k,l)Ωqr such that Δ(s,t;k,l)=1, we count each concordant sibship pair in the bivariate set and its reversed set and define concordant indicators ϕi,abqr=I{(Yia,Yib)Ωqr,(ηia,ηib)=(k,l)} and ϕi,baqr=I{(Yib,Yia)Ωqr,(ηib,ηia)=(k,l)}, iDab. It is more complicated to find all time points (s,t)Ωqr such that Δ(s,t;k,l)=0, because for any two pairs, i<j, from Dab, we may have four different pairings: (Yia,Yib) with (Yja,Yjb), (Yia,Yib) with (Yjb,Yja), (Yib,Yia) with (Yja,Yjb), and (Yib,Yia) with (Yjb,Yja). For the first pairing we define the discordant indicator ψij,aabbqr=I{(YiaYja)(YibYjb)<0, (Y(iaja),Y(ibjb))Ωqr,(η(iaja),η(ibjb))=(k,l)}. Similarly, we define discordant indicators ψij,abbaqr, ψij,baabqr, and ψij,bbaaqr. Combining all the strata Dab together we have the following pseudo-log-likelihood function:

(8)=1a<bmmaxiDab[φi,abqrlog{θCCa(Yia,Yib;k,l)1+θCC}+φi,baqrlog{θCCa(Yib,Yia;k,l)1+θCC}]+1a<bmmaxi<j&i,jDab[ψij,aabbqrlog{a(Y(iaja),Y(ibjb);k,l)1a(Y(iaja),Y(ibjb);k,l)1+θCC}+ψij,abbaqrlog{a(Y(iajb),Y(ibja);k,l)1a(Y(iajb),Y(ibja);k,l)1+θCC}+ψij,baabqrlog{a(Y(ibja),Y(iajb);k,l)1a(Y(ibja),Y(iajb);k,l)1+θCC}+ψij,bbaaqrlog{a(Y(ibjb),Y(iaja);k,l)1a(Y(ibjb),Y(iaja);k,l)1+θCC}].

The maximum-likelihood estimator θ˜CCL for the constant association measure θCC can be similarly obtained by using the function “nlminb” in R. The consistency and asymptotic properties of θ˜CCL can be proved analogously as for θ˜CML in the section “Asymptotic properties of θ˜CML” in Appendix.

For each extended association measures θCC(ζCC) and θCM(ζCM), we have discussed three sets of estimators. The plug-in estimators ζˆCC and ζˆCM and the U-type estimators discussed in Cheng et al. [10] (θˆCMU,θˆCCU) are computationally straightforward to obtain. The extended maximum-pseudo-likelihood estimators proposed in this paper (θ˜CML,θ˜CCL) do not have explicit variance estimators and hence are more computationally intensive. Asymptotic normality can be established for all the three estimators. Their relative efficiencies are examined by numerical studies in Section 3.

2.5 Modified U-statistic for tied data

In practice, the data are often rounded to the nearest integers, resulting in tied events. The adaptation of the plug-in estimator to the tied data is straightforward. For the U-statistic, we followed Cheng et al. [10] and excluded those pairs where one or both event times were tied. For the pseudo-likelihood function (7), the tied observations do not affect concordant pairs much, since a mother–child pair with double events is still counted as a concordant pair after rounding. However, rounding tends to underestimate the number of discordant pairs. Suppose we have two discordant pairs (Yi0,ηi0,Yi1,ηi1) and (Yj0,ηj0,Yj1,ηj1) from two distinct families i and j, where Yi0Yj0,ηi0=k and Yi1Yj1,ηj1=l. When ties exist, e.g. Yi1=Yj1, and if ηi1=l, then the pairing of these two pairs would not be considered as discordant any longer.

We will show through simulation studies in Section 3.4 that the plug-in estimator tends to underestimate while the other two tend to overestimate the true positive association with the presence of tied data. For the pseudo-likelihood estimator, the ties decrease the number of discordant pairs, and hence increase the estimated strength of association. For the U-statistic, rounding causes the removal of roughly similar amounts of concordant and discordant pairs. When the association is positive, i.e. the numerator is larger than the denominator, subtracting the same positive value from both the numerator and the denominator makes the ratio larger. Therefore, the U-statistic overestimates the positive association when ties are present. The similar arguments can be used to show that the U-statistic results in an estimate that is smaller than the true value when the association is negative (0<θ(ζ)<1) and ties are present. On the other hand, the pseudo-likelihood estimator still overestimates θ, since the number of discordant pairs is getting smaller. However, it is not clear theoretically how rounding errors would affect the plug-in estimator. Our simulation study suggests that the plug-in estimator tends to bias toward independence for both positive and negative associations.

To address potential loss of accuracy in the three estimators for tied data, we propose a modified U-statistic which essentially adds back those removed concordant and discordant pairs due to tied events. When two mother–child pairs have tied observations – for example, when both mothers had the same event time – we consider their corresponding cause indicators. If mother 1 died and mother 2 had dementia at the same age, we assume that mother 1 would have dementia later than mother 2, had mother 1 not died. If child event times were not tied, we could determine concordant status. For those tied pairs that can be equally concordant or discordant, we add half a pair to both concordant and discordant pairs. It becomes more complicated for sibship data under the exchangeability assumption. The section “Formula for corrected U” in Appendix provides details on how to add back all missing concordant and discordant pairs for sibship data.

The idea may be used analogously for the pseudo-likelihood estimator. In the section “Asymptotic properties of θ˜CML” in Appendix, we show that the pseudo-likelihood estimator is a Z-estimator based on the estimating equation which is a U-statistic of order 2 with the kernel function h(Xi,Xj;θ) given in eq. (11). One may let both ϕij,dd00qr and ψij,dd00qr in the numerator of eq. (11) be 0.5 for tied events following the same scheme as in the section “Formula for corrected U” in Appendix. However, it is not clear how to fix the plug-in estimator directly when there are rounding errors. Some random perturbation method may be used by adding simulated small error terms to tied observations. However, multiple imputation is needed which is beyond the scope of this paper.

2.6 Grid selection

The selection of grids plays an important role in capturing the local association structure. If we use too few grid points, the resulting estimates may be too smooth, missing crucial local associations. On the other hand, if there are too many grid points, the number of events would be too few and there would be too much variability in the estimators. In practice, if there is no particular scientific definition of time regions in place, we recommend that we start with some crude divisions and then further refine the grid points. We will carry out formal hypothesis tests on two neighboring associations being the same, based on which we decide whether we should keep the finer grid points. That is, we would like to test the following hypotheses:

H0:ζCM(Ωqr)=ζCM(Ωq+1,r),q=0,1,,n12,and
H0:ζCM(Ωqr)=ζCM(Ωq,r+1),r=0,1,,n22,

where Ωqr(τ1,q,τ1,q+1)×(τ2,r,τ2,r+1) with 0qn11,0rn21. For any pair of two adjacent regions, e.g. Ωqr and Ωq+1,r, we construct a Wald type of statistic based on the difference in the two local associations. That is,

χW2=ζˆCM(Ωqr)ζˆCM(Ωq+1,r)2varζˆCM(Ωqr)+varζˆCM(Ωq+1,r),

where var{ζˆCM(Ωqr)} and var{ζˆCM(Ωq+1,r)} are the variance estimators of ζˆCM(Ωqr) and ζˆCM(Ωq+1,r) computed based on the influence functions given in eq. (10). When the number of events in each grid is not too few, we can compare this test statistic with its asymptotic chi-squared distribution with one degree of freedom. For the two neighboring regions Ωqr and Ωq,r+1, similar tests can be constructed. Due to the equivalence of ζ and θ, the above hypotheses can be equally expressed in terms of θ and be tested based on the U-estimator or the pseudo-likelihood estimator, though for the latter estimator the bootstrap method is used to obtain the variance. The same idea can be applied analogously to the sibship association analysis.

3 Simulation studies

3.1 Constant association

We conducted simulation studies to evaluate small-sample performance of the plug-in estimator ζˆ, the maximum-pseudo-likelihood estimator θ˜L, and the U-statistic θˆU. We first simulated mother–multiple children data by extending Bandeen-Roche and Liang’s [4] frailty model. They modeled cause-specific between-subject dependency by an overall frailty A and proportions of cause allocations B which was between 0 and 1 and independent of A. As the Archimedean copula is associative, we naturally extended their frailty model for the bivariate competing risks to multivariate settings in our simulations. Three different strengths of association were considered. For each scenario, we generated 500 replicates of 500 families with varying family sizes. More details of our simulation are given below.

For each family, we first drew the familywise propensity A from a Gamma (1,1). The number of children for a family M was determined from a multinomial distribution ranging from 1 to 5 with probabilities 0.09, 0.18, 0.24, 0.24, and 0.25 which were estimated from the Cache County Data. Next we generated M+1 failure times from Exp(A) for the mother and her M children. Since the Cache County Study data were collected retrospectively, some previous investigation [20] showed that the marginal cumulative incidence function (CIF) for a mother is similar to that for her eldest child. For the sake of simplicity, we used the same distribution for mother and children in this simulation. Cause indicators were generated from a Bernoulli distribution with probability B for each subject, where B was drawn from a Beta(R1,1R1). Three values of R1, 0.2,0.5, and 0.8, were considered with corresponding theoretical values of θCM(s,t;1,1) being 6.0,3.0, and 2.25 for all s and t. Finally, independent censoring times were generated from Uniform(5,10) for each subject in this family, imposing about 12% censoring.

We first adopted the same frailty model as those used in Bandeen-Roche and Liang [4] and Cheng and Fine [7]. The model assumes the same strength of association between causes and results in constant association over time. Hence the estimation is carried out over the entire observation region with uniform weight. We applied the three methods to each of the 500 simulated data sets. The mean of estimates, mean of standard error (for the plug-in and U-estimators), or bootstrap standard error (for the pseudo-likelihood estimator) using family-based resampling with replacement with 500 replications, empirical standard error, and coverage rate of 95% confidence intervals are summarized in the upper left panel of Table 1. With moderate sample size, light censoring, and 50–80% of cause 1 events, all the three estimators perform well. The estimates are very close to the true values. The model-based or bootstrap standard errors agree well with the empirical standard errors. The coverage rates are close to the nominal level 0.95. There is some bias in the pseudo-likelihood estimate and U-statistic when R1=0.2, since there were fewer cause 1 events. Similarly, we simulated 500 sibship data sets using the same simulation strategy. Hence the child–child association θCC takes on the same values as the mother–child association θCM. Each data set contained 250 families. The results are summarized in upper right panel of Table 1. Since there are 250 families instead of 500 in each sibship data, there is some discrepancy between the estimates and the true value of association when the number of cause 1 events is small (R1=0.2).

Table 1

Simulation results for mother–multiple children data (CM) and sibship data (CC)

R1θ(ζ)CM or θ(ζ)CCSTATζˆCMθ˜CMLθˆCMUζˆCCθ˜CCLθˆCMU
Mother–ChildrenSibship
n=500,CP=12%n=250,CP=12%
0.26EST6.036.166.155.986.236.22
MSE0.820.840.841.111.151.18
ESE0.830.790.861.211.231.31
COV0.930.960.950.900.930.92
0.53EST2.993.013.013.003.043.05
MSE0.240.230.240.320.330.33
ESE0.230.230.230.320.330.33
COV0.950.950.960.950.940.96
0.82.25EST2.242.262.252.242.262.26
MSE0.130.130.130.180.180.18
ESE0.130.130.130.160.160.17
COV0.940.960.940.970.960.97
n=200,CP=35%n=200,CP=35%
0.26EST6.116.496.456.086.416.42
MSE1.602.031.771.511.591.64
ESE1.731.801.941.641.661.81
COV0.940.980.950.910.950.94
0.53EST3.023.083.083.003.073.06
MSE0.460.470.480.430.410.45
ESE0.470.440.490.440.410.45
COV0.950.970.950.940.960.95
0.82.25EST2.222.262.252.232.252.26
MSE0.250.240.260.230.220.24
ESE0.240.230.250.230.220.24
COV0.940.950.940.950.940.95
n=100,CP=35%n=100,CP=35%
0.26EST6.307.157.166.056.796.81
MSE2.254.442.832.032.662.40
ESE2.583.053.392.382.732.93
COV0.890.980.920.870.930.90
0.53EST3.023.173.152.973.103.10
MSE0.640.750.700.600.600.64
ESE0.630.660.700.640.640.69
COV0.930.970.940.900.930.92
0.82.25EST2.252.322.302.232.282.28
MSE0.350.370.370.330.320.34
ESE0.350.340.370.330.330.35
COV0.940.960.950.930.920.95

Next, we evaluated the performances of the three estimators when the number of clusters was smaller and censoring was heavier. We reduced the cluster sizes for mother–children data and sibship data to 200 or 100 families and generated censoring times from an exponential function with mean 4 yielding ~35% censoring. The results are summarized in the middle and bottom panels of Table 1. The bias is more noticeable in the estimates as compared to the true values. Both the empirical standard error and the model-based or bootstrap standard error increase, though they in general agree with each other. The coverage rates are still pretty close to the nominal level 0.95, except when the number of cause 1 event is very small (n=100 and R1=0.2).

3.2 Time-varying association

In practice, familial association may not be constant over each time grid. In this case, θ or ζ can be viewed as a weighted average of time-varying association across all time points within a specific region. It would be interesting to examine how the three estimators perform when the underlying association is actually time-varying and how the selection of grid may affect the results. Hence we simulated mother–multiple children data with time-varying association between the cause 1 events by adopting a gamma frailty model for the bivariate CIF as discussed in Cheng and Fine [21]. More specifically, the bivariate CIF of a child–mother pair F11CM(s,t)=P(Tjs,j=1,T0t,0=1),j=1,,M was connected with the marginal CIF for the child F1C(s)=P(Tjs,j=1) and that for the mother F1M(t)=P(T0t,0=1) through a gamma frailty model. That is,

F11CM(s,t)={F1C(s)1α+F1M(t)1α1}11α,

where we chose α=1.14, the same value as estimated from the Cache County data [21]. For the marginals, we assumed equal distribution and adopted a modified three-parameter logistic model [22] as suggested by the Cache County data which were collected retrospectively. That is,

F1C(s)=F1M(s)=pexp{b(sc)}pexp(bc)1+exp{b(sc},

with p=0.5,b=1, and c=2. The competing events were simulated from a conditional three-parametric logistic model, where p=1, b=2, and c=4; see Cheng and Fine [21, Section 4.1] for details. The independent censoring time was generated from Uniform (4,5) resulting in about 20% censored observations.

We generated 500 clusters of mother and children data, each containing 500 or 100 families, and obtained the mean estimates, the model-based or bootstrap standard errors, the empirical standard errors, and the coverage rates where the true values were computed by averaging the estimates from 200 replications of 2,000 mother–child pairs. We examined the performance of the three estimators over a four-region grid, as well as over the entire observed time region, with uniform weight. The results are summarized in Table 2. The true values for the plug-in estimator and the U-estimator are identical and slightly different from those for the pseudo-likelihood estimator, since the latter weights the time-varying associations at various time points differently. The true values change across time regions. When we employ the entire region, the true values fall somewhere between the smallest and largest associations among all subregions. In general, all the three estimators still have good practical performance, except for the pseudo-likelihood estimator when the cluster size is n=100. The coverage rates for the average association across the entire region are slightly lower than those over smaller regions. It may be explained that the estimators over the entire region have smaller variability, and some slight departure from the true value would lead to lower coverage. When the association is time-varying, the selection of grids plays a more important role in summarizing the strength of association than in the situation of constant association. As suggested by the results from n=100, both the bias and the standard errors of the estimators increase when the sample size is much smaller than 500. To reduce the variability, we may reduce the number of grids, which on the other hand will restrict our ability of examining time-varying association.

Table 2

Simulation results for mother–multiple children data with time-varying association

(s,t)Ωqrs[0,2.5]s[0,2.5]s[2.5,5]s[2.5,5]s[0,5]
t[0,2.5]t[2.5,5]t[0,2.5]t[2.5,5]t[0,5]
n = 500
ζˆCMTRUE1.401.131.131.061.26
EST1.401.131.131.071.27
MSE0.130.140.150.170.08
ESE0.130.140.160.180.08
COV0.930.930.940.950.90
n = 100
ζˆCMTRUE1.401.131.131.061.26
EST1.411.171.161.101.27
MSE0.280.300.320.370.18
ESE0.300.340.360.420.18
COV0.940.920.890.900.94
n = 500
θˆCMUTRUE1.401.131.131.061.26
EST1.401.131.131.071.27
MSE0.130.130.150.170.08
ESE0.130.140.150.180.08
COV0.930.930.940.940.91
n = 100
θˆCMUTRUE1.401.131.131.061.26
EST1.411.171.161.111.28
MSE0.290.310.340.400.19
ESE0.310.350.380.450.16
COV0.940.920.890.890.95
n = 500
θ˜CMLTRUE1.361.121.121.051.19
EST1.361.121.121.061.21
BSE0.120.130.140.160.07
ESE0.120.130.150.160.07
COV0.920.930.940.940.88
n = 100
θ˜CMLTRUE1.361.121.121.051.19
EST1.381.171.151.121.21
BSE0.130.130.150.200.07
ESE0.300.330.350.450.16
COV0.620.590.610.610.60

3.3 Tied observations

Real data are often rounded as in the Cache County Study, resulting in tied observations. To evaluate how the three estimators perform when there are tied events, we rounded simulated mother–children data and sibship data in Section 3.1 to one decimal place containing 100 time points from 0 to 9.9. This is roughly comparable to 53 age points from 55 to 107 in the Cache County Study. The results are summarized in Table 3. The plug-in estimator ζˆ tends to underestimate θCM while the maximum-pseudo-likelihood estimator θ˜L and the U-statistic θˆU tend to overestimate it. We also evaluated the performance of the three estimators for sibship data with tied observations. The results are given in the upper right panels of Table 3. We observed similar trends as in the mother–multiple children data. The plug-in estimators are likely to underestimate the association while the other two are prone to overestimate the association. As a comparison, we also evaluated the performance of the modified U-statistics, denoted by θˆCMMU and θˆCCMU, for the rounded mother–children and sibship data. With the presence of rounding errors, the modified U-estimates clearly outperform the other three, as they are closest to the true values. There is also some efficiency gain over the original U-statistic because the latter loses information by discarding all tied observations. When there are fewer clusters (n=200 or n=100) and higher censoring rate (CP = 35%), we observed more bias in all the four estimators (bottom panel of Table 3). However, the modified U-statistic still clearly outperforms the other three.

Table 3

Simulation results for mother–multiple children data (CM) and sibship data (CC) with tied events

R1θ(ζ)CM or θ(ζ)CCSTATζˆCMθ˜CMLθˆCMUθˆCMMUζˆCCθ˜CCLθˆCCUθˆCCMU
Mother–ChildrenSibship
n=500,CP=12%n=250,CP=12%
0.26EST5.746.476.455.985.666.546.506.12
MSE0.750.910.930.811.011.231.281.14
ESE0.760.890.940.810.991.231.291.14
COV0.900.950.960.960.880.970.960.95
0.53EST2.843.243.152.982.893.323.242.99
MSE0.210.260.260.230.300.360.380.32
ESE0.210.260.270.220.330.400.410.31
COV0.870.880.930.940.890.890.920.94
0.82.25EST2.142.492.372.232.132.492.362.23
MSE0.120.150.150.130.160.200.200.17
ESE0.120.150.150.130.170.210.210.18
COV0.820.670.910.930.840.810.920.94
n=200,CP=35%n=200,CP=35%
0.26EST5.636.726.666.215.716.736.776.20
MSE1.402.141.911.701.341.691.801.56
ESE1.451.790.891.801.451.822.041.64
COV0.870.980.940.920.870.960.950.91
0.53EST2.833.333.233.012.853.363.263.05
MSE0.400.520.540.470.390.470.510.45
ESE0.410.500.550.480.330.490.550.45
COV0.900.960.960.940.890.940.950.93
0.82.25EST2.122.542.402.272.132.552.412.27
MSE0.220.290.300.260.210.260.280.24
ESE0.220.260.290.260.230.290.310.24
COV0.870.900.950.950.860.850.920.95
n=100,CP=35%n=100,CP=35%
0.26EST5.747.457.436.695.657.217.156.42
MSE1.964.243.072.571.832.892.702.26
ESE2.514.014.263.092.022.873.092.75
COV0.840.980.930.890.840.970.940.88
0.53EST2.823.403.303.152.853.433.343.11
MSE0.570.840.790.700.540.700.750.65
ESE0.570.760.800.750.570.730.810.69
COV0.890.970.960.950.890.980.970.93
0.82.25EST2.162.662.492.292.142.612.452.31
MSE0.320.450.440.370.300.380.410.35
ESE0.310.430.430.380.310.410.430.37
COV0.900.940.970.940.880.910.960.93

3.4 Negative association

The proposed plug-in and pseudo-likelihood estimators, as well as the original and modified U-estimators, can be naturally applied to the clustered data where the paired events have a negative association. To evaluate their practical performance under negative association, we simulated 500 or 100 clusters of mother–multiple children data where the mother–child paired event times satisfy a Clayton model with copula parameter α=0.5 and have the same marginal exponential distributions with mean 1. For each subject, the cause indicator was generated from a Beta(0.5,0.5). The other settings are the same as in Section 3.1 expect that there is no independent censoring. The corresponding mother–child association is θCM=0.75, estimated based on 100 replications of 1,000 mother–child pairs. The results are summarized in Table 4. Under this simulation setting, all the three estimators have satisfactory performance. We were also interested in examining the effect of rounding errors on the three types of estimators when there is a negative association. Hence, we rounded the simulated data to the first decimal place and computed the three estimators and the modified U-statistic as listed in the bottom panel of Table 4. Interestingly, when θCM=0.75 and there are rounding errors, the plug-in estimator and pseudo-likelihood estimator overestimated the true value with bias toward independence, while the original U-estimator underestimated the true value over-evaluating the negative association. We also observe some improvements in the coverage rates when the cluster size decreases from 500 to 100, because the estimators are roughly as biased as before but the standard errors increase in the smaller sample. In contrast to the performance of the three estimators, the modified U-estimator produced the most reliable result with the presence of negative association for both cluster sizes.

Table 4

Simulation results for mother–multiple children data (CM) for negative association

θ(ζ)CMStatζˆCMθ˜CMLθˆCMUθˆCMMUθ(ζ)CMStatζˆCMθ˜CMLθˆCMUθˆCMMU
Untied data n = 500
0.75EST0.750.750.750.45EST0.450.450.45
MSE0.060.060.04MSE0.040.030.04
ESE0.060.060.04ESE0.040.030.04
COV0.950.950.95COV0.940.940.94
Untied data n = 100
EST0.780.770.77EST0.480.460.46
MSE0.130.120.13MSE0.090.080.09
ESE0.140.120.15ESE0.090.070.10
COV0.930.950.92COV0.940.970.92
Tied data n = 500
EST0.811.020.700.79EST0.530.870.390.50
MSE0.060.050.060.06MSE0.040.040.040.04
ESE0.060.060.050.06ESE0.050.040.040.04
COV0.8600.820.92COV0.5600.60.80
Tied data n = 100
EST0.820.990.700.79EST0.540.770.400.50
MSE0.130.140.130.14MSE0.090.100.080.08
ESE0.130.140.130.14ESE0.090.090.090.08
COV0.950.660.880.95COV0.900.040.830.91

4 Cache County Study

The data from the Cache County Study on Memory in Aging have been analyzed in previous works. Bandeen-Roche and Liang [4] and Bandeen-Roche and Ning [6] applied their U-type estimator, and Cheng and Fine [7] used a plug-in estimator to investigate the child–mother association in dementia based on 3,635 pairs of mothers and their eldest children. In order to utilize the information on other siblings, Cheng et al. [10] adapted the U-estimator for bivariate competing risks data to multivariate cases, assuming exchangeability among all siblings. In this paper, we were interested in both the child–mother and the sibship associations. Hence we extended the plug-in estimator from bivariate competing risks data and the pseudo-likelihood estimator from a typical bivariate survival setting to more complicated data structures such as mother–multiple children data and sibship data. The two extended estimators were applied to 3,635 families containing mothers and all her children for the mother–child association and to the sibship data containing 4,770 families of siblings for the sibship association. There are ~60% censored cases among mothers and 70% censored cases among children. The number of children in each family ranges from 1 to 14 in the mother–children data with mean and median both around 5. The distribution is similar for sibship data before we excluded single child families from the sibship analysis.

More importantly, the data had tied observations due to ages being rounded to the nearest integers, which has been ignored in the previous works. We computed the modified U-statistic along with the two extended estimators and the original U-statistic. To facilitate comparison with previous work, we examined the strengths of association over the same time regions as reported in Cheng et al. [10]. We first divided the age ranges into three intervals: 70, 71–80, and >80 for both dimensions. For each age region, we computed the plug-in estimate (ζˆCM), the pseudo-likelihood estimate (θ˜CML), the original U-estimate (θˆCMU), and the modified U-estimate (θˆCMMU) of the child–mother association according to eqs (2), (7), (4) and an equation similar to eq. (12) with uniform weight. The model-based standard errors (MSE) were available for the plug-in and U-type estimators and calculated based on their influence functions. We also computed the bootstrap standard errors (BSE) for the four estimators based on 500 bootstrap samples. The results are summarized in the top panel of Table 5. Similarly, for the sibship association, we computed the four estimates ζˆCC, θ˜CCL, θˆCCU, and θˆCCMU. The model-based standard errors were given for the plug-in estimator and U-estimators. The bootstrap standard errors were also calculated for all four estimators based on 500 bootstrap samples, each containing 2,000 clusters of siblings drawn from 4,770 families with replacement. The reported BSEs were adjusted by a factor of 2,0004,770. The results on sibship association are given in the lower panel of Table 5. The same analyses were performed on four regions (>)75×(>)80, and the results are shown in Table 6.

Table 5

Child–mother association and sibship association in the Cache County Study estimated within 3 ×3 time regions Ωqr, number of pairs at risk in each time region (N), number of cause 1 pairs in each time region (N11), estimate (EST), model-based standard error (MSE), bootstrap standard error (BSE), s for child, and t for mother

Child–mother association
(s,t)ΩqrObsζˆCMθ˜CMLθˆCMUθˆCMMU
NN11ESTMSEBSEESTBSEESTMSEBSEESTMSEBSE
s70,t701,338104.031.331.402.571.224.211.421.514.061.361.34
s70,70<t802,171122.270.690.723.131.162.340.730.762.290.700.69
s70,t>803,90771.240.460.481.310.561.300.490.511.250.470.49
70<s80,t701,258184.861.261.304.831.515.141.391.444.931.301.36
70<s80,70<t802,271232.640.580.603.000.842.760.640.662.680.600.58
70<s80,t>803,757152.130.630.641.850.632.270.700.722.150.650.67
s>80,t7065030.960.490.481.830.710.940.490.490.960.490.51
s>80,70<t80961102.440.890.952.550.892.550.981.082.470.920.96
s>80,t>801,71592.401.041.121.880.912.661.191.152.431.081.07
Sibship association
((s,t)ΩqrObsζˆCCθ˜CCLθˆCCUθˆCCMU
NN11ESTMSEBSEESTBSEESTMSEBSEESTMSEBSE
s70,t7022,478263.401.011.083.161.233.421.031.123.441.031.09
s70,70<t8017,525443.350.580.563.870.763.450.610.593.370.590.57
s70,t>805,256272.420.600.602.420.632.560.660.682.440.620.66
70<s80,70<t8021,5001042.950.490.483.160.553.110.530.533.000.510.52
70<s80,t>8010,531562.580.440.402.410.422.800.500.472.610.450.44
s>80,t>809,984601.650.450.421.430.271.810.520.521.670.450.46

The original U-estimates for mother–child associations are identical to those reported in Cheng et al. [10]. However, there are some that differ in the sibship analysis because we are now using not only the pairs whose event times belong to the region of interest but also the pairs whose switched times under exchangeability belong to the region. For mother–child associations, the four estimates are coherent on most regions. The modified U-estimates almost always lie between the plug-in estimates and the original U-estimates except for the region s>80,t70 where there are only three mother–child pairs both having dementia (N11=3). We observe a similar trend for sibship associations in the lower panel of Table 5. This is in agreement with our simulation studies in Section 2.5 where we have found that the plug-in estimator tends to underestimate and the original U-statistic tends to overestimate, while the modified U-statistic is closest to the true positive association. This is the first time that tied events have been taken into account in estimating the constant association over a time region, which produces more reliable results as suggested by our simulation studies.

However, when the number of double cause 1 events N11 is small (i.e. N11=2), there is a noticeable discrepancy between the pseudo-likelihood estimate and the other three. For sibship associations, because of the exchangeability assumption, the estimators are the same over two symmetric regions, for example, s70,70<t80 and 70<s80,t70. We only report one of the symmetric regions. The four sets of estimates are generally close, and the standard errors are smaller than those for mother–child associations. As more sibship pairs were used in the sibship analysis than child–mother pairs in the child–mother analysis, the sibship estimates tend to be more robust than the child–mother estimates. This suggests that it would be important to extend original estimators for bivariate data to clustered data and to incorporate as much information as possible in quantifying mother–child associations and sibship associations when the event is rare.

Table 6

Child–mother association and sibship association in the Cache County Study estimated within 2 ×2 time regions Ωqr, number of pairs at risk in each time region (N), number of cause 1 pairs in each time region (N11), estimate (EST), model-based standard error (MSE), and bootstrap standard error (BSE)

Child–mother association
(s,t)ΩqrObsζˆCMθ˜CMLθˆCMUθˆCMMU
NN11ESTMSEBSEESTBSEESTMSEBSEESTMSEBSE
s75,t805,319413.410.600.593.570.683.550.650.653.440.610.62
s75,t>805,891121.300.390.401.250.401.360.420.441.300.390.45
s>75,t803,330352.400.480.522.450.492.490.530.582.430.490.50
s>75,t>803,488192.710.870.551.800.412.900.990.632.750.900.95
Sibship association
(s,t)ΩqrObsζˆCCθ˜CCLθˆCCUθˆCCMU
NN11ESTMSEBSEESTBSEESTMSEBSEESTMSEBSE
s75,t8060,9221283.280.520.513.170.503.360.550.543.330.530.52
s75,t>809,663442.610.470.522.410.452.760.520.582.630.480.49
s>75,t8033,8931682.940.350.342.770.343.110.390.392.970.360.36
s>75,t>8016,108942.030.340.331.670.252.240.400.402.060.350.34

When we examine the mother–child associations on broader regions, as given in Table 6, the plug-in estimates, the modified U-estimates, and the original U become closer and always follow the same order. A similar trend is observed in the sibship analysis. However, the discrepancy between the likelihood-based estimates and the other three still exists, especially on the region s>75,t>80, for both mother–child and sibship associations, in spite of the fact that the N11 s get bigger in the four-region analysis. One possible explanation is that the association may be time-varying on these broader regions, and the estimated association measure can be thought of as some weighted average over each region. The likelihood-based method weights the time-varying association differently as compared to the plug-in and U-statistics, since the denominator of eq. (11) is in general different from 1.

In this application, we consider and contrast the associations over a 3×3 grid and a 2×2 grid. The selection of endpoints requires careful consideration. In the Cache County Study, the number of family is large. However, the incidence of dementia is relatively low. Cheng et al. [20] reported the cumulative incidence of dementia for a mother or her eldest child is about 0.1 at age 90. Since we are quantifying association in dementia onset among family members, we pay particular attention to N11. One of the 3×3 grids s>80,t70 only contains three double cause 1 events, and the result is likely unreliable. The N11 s over the 2×2 grid are 12 or above in the mother–children analysis, and the analysis tends to be more reliable. On the other hand, the varying association is likely to be smoothed out. Our experience suggests starting with large grids and then refining the analysis depending on the change in association and the number of double cause 1 events in each cell.

5 Remarks

In this paper we discuss three types of nonparametric approaches to estimating two equivalent and easily interpreted association measures for multivariate competing risks data. The pseudo-likelihood estimator lacks an explicit variance estimator and relies on bootstrapping, hence it is most computationally intensive. For example, it took about half a second to obtain the plug-in estimate or the U-estimate as well as the corresponding standard error and about 0.5(500)=250 seconds to obtain the pseudo-likelihood estimate and its bootstrap standard error based on 500 bootstrap samples, for the case of n=100,CP=35%, and R1=0.5 as reported in Table 1, on a PC with intel i7 1.6 GHz CPU. When the sample size increases to 200, the computing time for obtaining the plug-in estimate or the U-estimate and its corresponding standard error increases to be around 2 seconds, and that for the pseudo-likelihood estimate increases to be 5(500)=2,500 seconds. Therefore, we recommend using the plug-in estimator or the U-estimator for their simplicity and good practical performance as illustrated in our simulations, when the assumption of constant association holds within each grid and there are no or few tied events in a practical application.

On the other hand, the pseudo-likelihood estimator has the potential to be most efficient. In this paper we use a stratified approach which always matches up children with the same sibling order. It is possible to check concordance status for two pairs, for example, a mother and her eldest child from one family and another mother and her second child from another family. There are actually many more combinations than those used in eqs (7) and (8). Based on our simulation studies, the stratified pseudo-likelihood estimator has comparable efficiency to the other two estimators. If we were able to use all possible combinations, which of course would be very computationally intensive, we would further improve the efficiency of the pseudo-likelihood estimator. Moreover, the pseudo-likelihood estimator weights the time-varying associations differently from the plug-in estimator and the U-estimator. We hence recommend using both the pseudo-likelihood estimator and the plug-in (or U-) estimator to capture different aspects of the underlying association, when we suspect the association to vary over time but cannot refine the grids due to the limited sample size. In addition, the likelihood-based approach is more adaptive to a regression setting or to a parametric model. For example, one may extend the regression model for bivariate competing risks data [19] to multivariate competing risks data based on the pseudo-likelihood functions in eqs (7) and (8). Recently, Hu et al. [23] used a polynomial function to approximate cross hazard ratio and estimated the unknown parameters based on a pseudo-partial likelihood. Similar parametric approximation may be used for the cause-specific cross hazard ratio, and the pseudo-likelihood functions considered in this paper can again be used as the base of estimation. These will be future research topics.

More importantly, we investigate how rounding errors affect the performance of the three piecewise estimators and propose a modified U-statistic to handle tied multivariate competing risks data. For the modified U-statistic, we simply adjust for concordant and discordant pairs for the tied events. It is also fast to compute the modified U-estimate, e.g. taking about a second to obtain the modified U-estimate and standard error for the case of n=100,CP=35%,R1=0.5 as reported in Table 3. Based on our current simulation studies and real data analysis, we suggest using the modified U-statistic when the data are subject to rounding errors.

It is worth pointing out that our method can also handle other-cause association and across-cause association. However, the estimators may not have satisfactory performance under settings in which it is relatively rare to observe two different types of outcomes for two relatives. This is also true for the primary cause. In the Cache County Study, the onset of dementia is relatively rare. Thus, a large sample is needed to warrant reliable estimation of the familial association in dementia, and it is more crucial to use the information from all children in the estimation. If a large sample is not available, some parametric or semiparametric models such as the frailty model considered in Gorfine and Hsu [11] may be more efficient in estimating the familial association of a rare event among family members.

Appendix

Asymptotic properties of ζˆCM

Recall the extended association measure for mother–children data within region Ωqr

ζCM(Ωqr)=Ωqrw(s,t)EN11(ds,dt)EH(s,t)Ωqrw(s,t)EN10(ds,t)EN01(s,dt),

where N11, N10, N01, and H are the double or single event processes and at-risk process defined over all mother–child pairs, for instance, N11(s,t)=jI(Tjs,j=1,T0t,0=1). We now consider asymptotic properties of the nonparametric plug-in estimator

ζˆCM(Ωqr)=Ωqrwˆ(s,t)PN11(ds,dt)PH(s,t)Ωqrwˆ(s,t)PN10(ds,t)PN01(s,dt),

as given in eq. (6).

Following the arguments in Cheng and Fine [7], we establish consistency and asymptotic normality as below. For simplicity, in the sequel, we omit the integration region Ωqr. Observe that

ζˆCMζCM=Awˆ(s,t)PnN10(ds,t)PnN01(s,dt)×w(s,t)EN10(ds,t)EN01(s,dt),

where

A=w(s,t)EN10(ds,t)EN01(s,dt)wˆ(s,t)PnN11(ds,dt)PnH(s,t)w(s,t)EN11(ds,dt)EH(s,t)wˆ(s,t)PnN10(ds,t)PnN01(s,dt)=w(s,t)EN10(ds,t)EN01(s,dt){wˆ(s,t)PnN11(ds,dt)PnH(s,t)w(s,t)EN11(ds,dt)EH(s,t)}w(s,t)EN11(ds,dt)EH(s,t){wˆ(s,t)PnN10(ds,t)PnN01(s,dt)w(s,t)EN10(ds,t)EN01(s,dt)}.
Consistency ofζˆCM: Since N11(s,t) and H(s,t) are summations of bivariate indicator functions with EM<, we have n(nH*EH*nN11*EN11*)ZH*ZN11*, where ZN11 and ZH are mean zero Gaussian processes, and denotes convergence in distribution. By the functional δ-method,
nPnN11(ds,dt)PnH(s,t)EN11(ds,dt)EH(s,t)
PZN11(ds,dt)EH(s,t)+EN11(ds,dt)ZH(s,t),

where P denotes convergence in probability. Then,

wˆPnN11(ds,dt)PnH(s,t)wEN11(ds,dt)EH(s,t)PnN11(ds,dt)PnH(s,t)EN11(ds,dt)EH(s,t)+wˆw|τ1,q+1τ1,q||τ2,r+1τ2,r|P0,

as n, where |||| denotes supremum over region Ωqr. Similarly, we obtain

wˆPnN10(ds,t)PnN01(s,dt)wEN10(ds,t)EN01(s,dt)P0,

as n. The consistency of ζˆCMqr is immediate.

Asymptotic normality ofn1/2(ζˆCMζCM): By some simple algebra,

nA=wEN10(ds,t)EN01(s,dt){n(wˆw)PnN11(ds,dt)PnH(s,t)+wEN11(ds,dt)n(PnE)H(s,t)+wn(PnE)N11(ds,dt)PnH(s,t)}wEN11(ds,dt)EH(s,t){n(wˆw)PnN10(ds,t)PnN01(s,dt)+wEN10(ds,t)n(PnE)N01(s,dt)+wn(PnE)N10(ds,t)PnN01(s,dt)}.

Note that

(9)wEN10(ds,t)EN01(s,dt)n(wˆw)PnN11(ds,dt)PnH(s,t)wEN11(ds,dt)EH(s,t)n(wˆw)PnN10(ds,t)PnN01(s,dt)=wEN10(ds,t)EN01(s,dt)n(wˆw)EN11(ds,dt)EH(s,t)+oP(1)wEN11(ds,dt)EH(s,t)n(wˆw)EN10(ds,t)EN01(s,dt)+oP(1)=oP(1),

since we assume constant ζ in the integration region. It follows that under time-invariance, n(ζˆCMζCM)=1ni=1nIi+oP(1), where the influence function

(10)Ii=1wEN10(ds,t)EN01(s,dt)[wEN11(ds,dt){jmiIYijs,Yi0tE(mi)PYi1s,Yi0t+w{jmiIYij=s,ηij=1,Yi0=t,ηi0=1E(mi)PYi1=s,ηi1=1,Yi0=t,ηi0=1EH(s,t)wEN11(ds,dt)EH(s,t)wEN10(ds,t)EN01(s,dt)2[wEN10(ds,t)jmiIYijs,Yi0=t,ηi0=1E(mi)PYi1s,Yi0=t,ηi0=1+wjmiIYij=s,ηij=1,Yi0tE(mi)PYi1=s,ηi1=1,Yi0tEN01(s,dt)].

By the asymptotic linearity, and assuming EM2<, we have the normality of n1/2(ζˆCMζCM) via standard central limit theorem.

Asymptotic properties of θ˜CML

Without confusion we simply denote θCM(Ωqr) as θ. Taking the first derivative of Ln in θ and multiplying by θ(2n), we have the following estimating equation:

Ψn(θ)=1n2[1ind=1mmaxI{iDd}ϕi,d0qr{a(Yid,Yi0;k,l)1}aYid,Yi0;k,l1+θ1i<jnd=1mmaxI{i,jDd}ψij,dd00qrθaY(idjd),Y(i0j0);k,l1+θ]=1n2[1i<jnd=1mmaxI{i,jDd}ϕij,dd00qraY(idjd),Y(i0j0);k,l1+θ1i<jnd=1mmaxI{i,jDd}ψij,dd00qrθaY(idjd),Y(i0j0);k,l1+θ=0,]

where ϕij,dd00qr is the concordant indicator defined for U-statistics.

Define Xi=(Yi0,Yi1,,Yimi,ηi0,ηi1,,ηimi,mi),i=1,,n, and denote

(11)hXi,Xj;θ=d=1mmaxI{i,jDd}ϕij,dd00qrψij,dd00qrθaY(idjd),Y(i0j0);k,l1+θ.

Then Ψn(θ) is a U-statistic of order 2 with the kernel function h(Xi,Xj;θ).

Let Ψ(θ)=EΨn(θ) and θ0 be the solution to Ψ(θ)=0. Define Ψˆ(θ)=2ni=1nh1(Xi;θ), where h1(Xi;θ)=Eh(Xi,Xj;θ|Xi)Ψ(θ). Since ϕij,dd00qrψij,dd00qrθa(Y(idjd),Y(i0j0);k,l)1+θ1+θθ, for any θ>0, and mmax is finite for a family study, we have Eh2(Xi,Xj;θ)<. By Theorem 12.3 of van der Vaart [16], n(Ψn(θ)Ψ(θ)Ψˆ(θ))P0 and n(Ψn(θ)Ψ(θ)) converge to a mean zero random variable Z for any θ>0. Note that the map θΨn(θ) is non-increasing. By Lemma 5.10 of van der Vaart [16], we have θ˜CMLθ0 in probability, as n.

Next, we will establish the asymptotic normality of the estimator using a master theorem of Z-estimators. For any θ1,θ2>0, note that |h(Xi,Xj;θ1)h(Xi,Xj;θ2)|=d=1mmaxI{i,jDd}{ϕ+ψ(a1)}(θ2θ1)(a1+θ1)(a1+θ2)mmaxa(Y(idjd),Y(i0j0);k,l)1|θ1θ2|. Therefore,

n(ΨnΨ)(θ˜CML)n(ΨnΨ)(θ0)=nΨˆ(θ˜CML)Ψˆ(θ0)+oP(1)=n1ni=1nh1(Xi;θ˜CML)h1(Xi;θ0)+oP(1)n|θ˜CMLθ0|Emmaxa(Y(idjd),Y(i0j0);k,l)1+oP(1)=oP1+n|θ˜CMLθ0|,

since a(Y(idjd),Y(i0j0);k,l) is the at-risk set of order n. Let Ψ˙(θ0)=E{h1(Xi;θ)θ|θ=θ0}.θ˜CML is chosen such that |Ψn(θ˜CML)|=oP(n1/2). By Theorem 13.4 of Kosorok [15], we have n(θ˜CMLθ0)Ψ˙θ01Z in distribution. The bootstrap validity also follows.

Formula for corrected U

We now define concordant and discordant pairs for tied sibship observations as follows:

ϕ˜ij,acbdqr=IYia=Yjc,Yib>Yjd,(Yia,Yjd)Ωqr,ηia1,ηjc=1,ηjd=1+IYia=Yjc,Yib<Yjd,(Yia,Yib)Ωqr,ηia=1,ηjc1,ηib=1+IYia>Yjc,Yib=Yjd,(Yjc,Yib)Ωqr,ηjc=1,ηib1,ηjd=1+I{Yia<Yjc,Yib=Yjd,(Yia,Yib)Ωqr,ηia=1,ηib=1,ηjd1}+0.5×{IYiaYjc,Yib=Yjd,(Y(iajc),Yib)Ωqr,η(iajc)=1,ηib=ηjd=1IYia=Yjc,YibYjd,(Yia,Y(ibjd))Ωqr,ηia=ηjc=1,η(ibjd)=1I{Yia=Yjc,Yib=Yjd,(Yia,Yib)Ωqr,ηia=ηjc=1,{ηib=1,ηjd1}or{ηib1,ηjd=1}}I{Yia=Yjc,Yib=Yjd,(Yia,Yib)Ωqr,}{ηia=1,ηjc1}or{ηia1,ηjc=1},ηib=ηjd=1}+0.5×IYia=Yjc,Yib=Yjd,(Yia,Yib)Ωqr,ηia=ηjc=ηib=ηjd=1,
ψ˜ij,acbdqr=IYia=Yjc,Yib>Yjd,(Yia,Yjd)Ωqr,ηia=1,ηjc1,ηjd=1+IYia=Yjc,Yib<Yjd,(Yia,Yib)Ωqr,ηia1,ηjc=1,ηib=1+IYia>Yjc,Yib=Yjd,(Yjc,Yib)Ωqr,ηjc=1,ηib=1,ηjd1+IYia<Yjc,Yib=Yjd,(Yia,Yib)Ωqr,ηia=1,ηib1,ηjd=1+0.5×{I{YiaYjc,Yib=Yjd,(Y(iajc),Yib)Ωqr,η(iajc)=1,ηib=ηjd=1IYia=Yjc,YibYjd,(Yia,Y(ibjd))Ωqr,ηia=ηjc=1,η(ibjd)=1I{Yia=Yjc,Yib=Yjd,(Yia,Yib)Ωqr,}ηia=ηjc=1,{ηib=1,ηjd1}or{ηib1,ηjd=1}I{Yia=Yjc,Yib=Yjd,(Yia,Yib)Ωqr,}ηia=1,ηjc1}or{ηia1,ηjc=1},ηib=ηjd=1}+0.5×IYia=Yjc,Yib=Yjd,(Yia,Yib)Ωqr,ηia=ηjc=ηib=ηjd=1.

Similarly we define concordant and discordant pairs for the reversed tied observations under exchangeability ϕ˜ij,adbcqr and ψ˜ij,adbcqr. Then we have the modified U-statistic for the sibship association

(12)θˆCCMU(Ωqr;k,l)=n211i<jn1a<bmi1c<dmjϕij,acbdqr+ϕ˜ij,acbdqr+ϕij,adbcqr+ϕ˜ij,adbcqrn211i<jn1a<bmi1c<dmjψij,acbdqr+ψ˜ij,acbdqr+ψij,adbcqr+ψ˜ij,adbcqr.

Acknowledgments

The authors would like to thank the two referees and Dr. Nancy Pfenning for their constructive comments. We are grateful to the Cache County Study Steering Committee for providing the dementia dataset. This research was supported by the National Science Foundation grants DMS-0906449 and DMS-1207711 to Cheng.

References

1. Hougaard P. Analysis of multivariate survival data. New York: Springer, 2000.10.1007/978-1-4612-1304-8Search in Google Scholar

2. Pruitt RC. Identifiability of bivariate survival curves from censored data. J Am Stat Assoc 1993;88:573–9.10.1080/01621459.1993.10476309Search in Google Scholar

3. Tsiatis A. A nonidentifiability aspect of the problem of competing risks. Proc Natl Acad Sci 1975;72:20–2.10.1073/pnas.72.1.20Search in Google Scholar PubMed PubMed Central

4. Bandeen-Roche K, Liang K. Modelling multivariate failure time associations in the presence of a competing risk. Biometrika 2002;89:299–314.10.1093/biomet/89.2.299Search in Google Scholar

5. Oakes D. Bivariate survival models induced by frailties. J Am Stat Assoc 1989;84:487–93.10.1080/01621459.1989.10478795Search in Google Scholar

6. Bandeen-Roche K, Ning J. Nonparametric estimation of bivariate failure time associations in the presence of a competing risk. Biometrika 2008;95:221–32.10.1093/biomet/asm091Search in Google Scholar PubMed PubMed Central

7. Cheng Y, Fine JP. Nonparametric estimation of cause-specific cross hazard ratio with bivariate competing risks data. Biometrika 2008;95:233–40.10.1093/biomet/asm089Search in Google Scholar

8. Shih JH, Albert PS. Modeling familial association of ages at onset of disease in the presence of competing risk. Biometrics 2010;66:1012–923.10.1111/j.1541-0420.2009.01372.xSearch in Google Scholar PubMed PubMed Central

9. Shih JH, Louis TA. Inferences on the association parameter in copula models for bivariate survival data. Biometrics 1995;51:1384–99.10.2307/2533269Search in Google Scholar

10. Cheng Y, Fine JP, Bandeen-Roche K. Association analyses of clustered competing risks data via cross hazard ratio. Biostatistics 2010;11:82–92.10.1093/biostatistics/kxp039Search in Google Scholar PubMed PubMed Central

11. Gorfine M, Hsu L. Frailty-based competing risks model for multivariate survival data. Biometrics 2011;67:415–26.10.1111/j.1541-0420.2010.01470.xSearch in Google Scholar PubMed PubMed Central

12. Katki HA, Blackford A, Chen S, Parmigiani G. Multiple diseases in carrier probability estimation: accounting for surviving all cancers other than breast and ovary in BRCAPRO. Stat Med 2008;27:4532–48.10.1002/sim.3302Search in Google Scholar PubMed PubMed Central

13. Clayton DG. A model for association in bivariate life tables and its application in epidemiological studies of familial tendency in chronic disease incidence. Biometrika 1978;65:141–52.10.1093/biomet/65.1.141Search in Google Scholar

14. Oakes D. A model for association in bivariate survival data. J R Stat Soc Ser B Methodological 1982;44:414–22.10.1111/j.2517-6161.1982.tb01222.xSearch in Google Scholar

15. Kosorok MR. Introduction to empirical processes and semiparametric inference. New York: Springer, 2007.10.1007/978-0-387-74978-5Search in Google Scholar

16. van der Vaart AW. Asymptotic statistics.Cambridge: Cambridge University Press, 1998.10.1017/CBO9780511802256Search in Google Scholar

17. Fine JP, Jiang H. On association in a copula with time transformations. Biometrika 2000;87:559–71.10.1093/biomet/87.3.559Search in Google Scholar

18. Lindsay BG. Composite likelihood methods. In: Prabhu NU, editor. Statistical inference from stochastic processes. American Mathematical Society, 1988:221–39.10.1090/conm/080/999014Search in Google Scholar

19. Ning J, Bandeen-Roche K. Estimation of time-dependent association for bivariate failure times in the presence of a competing risk. Biometrics 2014;70:10–20.10.1111/biom.12110Search in Google Scholar PubMed PubMed Central

20. Cheng Y, Fine JP, Kosorok MR. Nonparametric analysis of bivariate competing risks data. J Am Stat Assoc 2007;102:1407–16.10.1198/016214507000001157Search in Google Scholar

21. Cheng Y, Fine JP. Cumulative incidence association models for bivariate competing risks data. J R Stat Soc Ser B 2012. DOI: DOI: 10.1111/j.1467-9868.2011.01012.x.DOI: 10.1111/j.1467-9868.2011.01012.xSearch in Google Scholar

22. Cheng Y. Modeling cumulative incidences of dementia and dementia-free death using a novel three-parameter logistic function. Int J Biostat 2009. DOI: DOI: 10.2202/1557-4679.1183.DOI: 10.2202/1557-4679.1183Search in Google Scholar

23. Hu T, Nan B, Lin X, Robins JM. Time-dependent cross ratio estimation for bivariate failure times. Biometrika 2011;98:341–54.10.1093/biomet/asr005Search in Google Scholar PubMed PubMed Central

Published Online: 2014-9-19
Published in Print: 2014-11-1

© 2014 by Walter de Gruyter Berlin / Boston

Downloaded on 31.12.2025 from https://www.degruyterbrill.com/document/doi/10.1515/ijb-2013-0023/html
Scroll to top button