Home Mathematics Classification in Postural Style Based on Stochastic Process Modeling
Article Publicly Available

Classification in Postural Style Based on Stochastic Process Modeling

  • Christophe Denis EMAIL logo
Published/Copyright: July 3, 2014

Abstract

We address the statistical challenge of classifying subjects as hemiplegic, vestibular or normal based on complex trajectories obtained through two experimental protocols designed to evaluate potential deficits in postural control. The classification procedure involves a dimension reduction step where the complex trajectories are summarized by finite-dimensional summary measures based on a stochastic process model for a real-valued trajectory. This allows us to retrieve from the trajectories information relative to their temporal dynamic. A leave-one-out evaluation yields a 79% performance of correct classification for a total of n=70 subjects, with 22 hemiplegic (31%), 16 vestibular (23%) and 32 normal (46%) subjects.

1 Introduction

This article contributes to the study of postural maintenance. Posture is fundamental for physical activity. A deficit in postural maintenance often results in falling, which is particularly hazardous in elderly people. The main objective of the research in postural maintenance is to adapt protocols for functional rehabilitation for people who display deficits in maintaining posture. We focus here on the issue of classifying subjects in terms of postural maintenance. A cohort of 70 subjects has been followed at the center for the study of sensorimotor functioning (CESEM) of Université Paris Descartes. The subjects who did not exhibit deficit in postural maintenance were labeled as normal. The others were hemiplegic and vestibular subjects and labeled accordingly. A hemiplegic subject suffers from a disorder of his proprioceptive system, which pertains to the sense of position, location, orientation of the body and its parts. A vestibular subject suffers from a deficit of his vestibular system, which is the system composed by the inner ear and the vestibular nerve and contributes to the sense of balance.

Each subject completed two experimental protocols designed to evaluate his/her potential deficits in maintaining posture. Each protocol is divided into three phases: a first phase of 15 s with no postural perturbation, a second phase of 35 s with postural perturbation followed by a last phase of 20 s without postural perturbation. During each protocol, measurements of the center-of-pressure of each foot are performed at discrete times, which results in temporal trajectories. Our objective is to classify subjects as hemiplegic, vestibular and normal based on these trajectories. This is a significantly more difficult extension of the problem considered by Chambaz and Denis [1], where only hemiplegic and normal subjects were to be classified (particularly because classifying into three classes is more difficult than classifying into two classes), and thus another step in the direction of clustering subjects in terms of their postural style. We refer to the bibliography in Chambaz and Denis [1] for a review on the analysis of postural control.

Our present study is related to the topic of functional data classification. There is a sizeable literature dedicated to this topic. A variety of methods have been proposed, relying for instance on linear discriminant analysis [2], principal component analysis [3], or a functional data version of the nearest-neighbors classification rule [4]. We refer to Ramsay and Silverman [5] for a general introduction to functional data analysis. Our approach here involves a dimension reduction step of our complex trajectories based on change points estimation and inference on a stochastic process model. Although the various statistical methods involved in the dimension reduction step were known, their combination for the sake of classifying in terms of postural style (three classes) is very orignal and performs quite well. This allows us to better use the data at hand than in Chambaz and Denis [1] in the sense that we manage to retrieve from the trajectories information relative to their temporal dynamic. In contrast, Chambaz and Denis [1] retrieve static information from the trajectories in the sense that the dimension reduction step relies on comparisons of basic statistics (such as the mean value of a segment of the trajectory, see Section 5.2) computed on arbitrarily chosen time intervals starting or ending where the perturbation phase starts or ends.

The dataset and its modeling are introduced in Section 2. We present our inference procedure in Section 3. We carry it out on real and simulated data, and summarize the results in the latter section. The classification procedure is presented in Section 4. The results of its application to the real dataset are exposed and commented on in Section 5, which concludes on some perspectives for future research.

2 Data and modeling

The dataset at hand is described in Section 2.1. We introduce and motivate its modeling in Section 2.2.

2.1 Original dataset

The dataset was collected at the center for the study of sensorimotor functioning (CESEM) of Université Paris Descartes. It is composed of a cohort of n=70 subjects. Among the 70 subjects, 22 are hemiplegic (due to cerebrovascular accidents), 16 are vestibular, while the 32 remaining subjects are identified as normal based on an initial medical evaluation.

Each subject completed two protocols designed to evaluate potential deficits in maintaining posture. These protocols have been identified as the most informative among four similar protocols for classifying hemiplegic versus normal subjects in the earlier study [1]. Both protocols are divided into three phases: a first phase (lasting 15 s) with no postural perturbation, a second phase (lasting 35 s) with postural perturbations (either some muscular perturbations or a combination of muscular and visual perturbations), and a third phase (lasting 20 s) with no postural perturbation. We expect that the subject’s postural sway changes around the beginning and the end of the perturbation phase (around 15 and 50 s).

For each protocol, the center-of-maximal-pressure exerted by each foot on a force-platform is recorded at equispaced discrete times. Thus each protocol results in a trajectory (Li,Ri)iN, where (Li,Ri) is the observation at time ti=iδ for i=1,,N=2,800 and δ=0.025 s. For each i=1,,N, Li=(Li1,Li2)2 and Ri=(Ri1,Ri2)2 respectively correspond to the left and right foot (Table 1).

Table 1

Specifics of the two protocols considered in this study. A protocol is divided into three phases: a first phase with no postural perturbation is followed by a second phase with perturbations, which is itself followed by a third phase without perturbation. Different types of perturbations are considered. In protocol 1, the brain’s processing of proprioceptive information (through muscular stimulation) is perturbed, whereas in protocol 2, the processing by the brain of both visual information (subjects must close their eyes) and proprioceptive information (through muscular stimulation) is perturbed

Protocol1st phase (015 s)2nd phase (1550 s)3rd phase (5070 s)
1No perturbationMuscular stimulationNo perturbation
2Eyes closed
Muscular stimulation

Following Chambaz and Denis [1], we derive from (Li,Ri)iN the one-dimensional trajectory C1:N (sometimes we will denote a n-tuple (x1,,xn) by x1:n) characterized by

Ci=Li+Ri2γ2,

where γ is defined as the component-wise median value of (12(Li+Ri))i400 over the ten first seconds of the protocol. The process C1:N provides a relevant description of the sway of the body during the course of the protocol. In Figure 1, we display the trajectories C1:N corresponding to protocol 1 (left plot) and protocol 2 (right plot) as completed by a single hemiplegic subject. Figure 1 confirms the intuition that the subject’s postural sway is not necessarily instantaneously affected by the start and end of perturbations.

Figure 1 Two trajectories C1:N$${C_{1:N}}$$ respectively corresponding to protocol 1 (left) and protocol 2 (right) undergone by a single hemiplegic subject. Vertical lines correspond to the estimated change points (τˆ1δ,τˆ2δ)$$({\hat \tau _1}\delta, {\hat \tau _2}\delta)$$ as obtained based on the two (real) observed trajectories
Figure 1

Two trajectories C1:N respectively corresponding to protocol 1 (left) and protocol 2 (right) undergone by a single hemiplegic subject. Vertical lines correspond to the estimated change points (τˆ1δ,τˆ2δ) as obtained based on the two (real) observed trajectories

2.2 Data modeling

We model the trajectory C1:N as the observation at discrete times of a stochastic process (C(t))t[δ,Nδ] characterized by a stochastic differential equation.

We model the effects of the perturbations by possible changes in the volatility and drift functions at two unknown change points. In order to account for the fact that subjects react differently, we also assume that the change points differ among subjects. We denote by (T1,T2) the change points of a trajectory, with T0=τ0δ=δ<T1=τ1δ<T2=τ2δ<T3=70 (τ0=1, τ1 and τ2 are integers). We assume that there exist two functions a, b, and two parameters (ϕ1,ϕ2,ϕ3) and (σ1,σ2,σ3) such that, for all t[Tk1,Tk) and k=1,2,3,

{dC(t)=a(C(t),ϕk)dt+b(C(t),σk)dW(t)C(Tk1)=Cτk1,

where (W(t))t[δ,Nδ] is a standard Wiener process.

We now specify the parametric forms of the volatility and drift functions a and b. Because Figure 1 indicates that the variance of C1:N is not constant over time and because, in addition, the process C1:N takes positive values, we decide to rely on the classical Cox–Ingersoll–Ross (CIR) process [6], by setting a(x,ϕ=(λ,μ))=λ(μx) and b(x,σ)=σx (for all x0).

In summary, we model the trajectory C1:N as Ci=C(iδ)=C(ti) the observation at discrete times of the stochastic process (C(t))t[δ,Nδ] being characterized, for t[Tk1,Tk)(k=1,2,3), by

(1){dC(t)=λk(μkC(t))dt+σkC(t)dW(t)C(Tk1)=Cτk1,

where the initial condition C(Tk1) is the observation Cτk1 at time Tk1, λk, and μk and σk are positive constants. It is known that if 2μkλk/σk21 then the process remains positive and admits a stationary distribution, namely a Gamma distribution with shape parameter 2μkλk/σk2 and scale parameter σk2/2λk [6]. We summarize all parameters as a single vector (τ1,τ2,θ1,θ2,θ3) with θk=(λk,μk,σk) for k=1,2,3. We assume that the changes, induced by the perturbation phase, necessarily affect the drift parameter through a change inμ. In other words, we assume that μ1μ2 and μ2μ3. Changes can also affect the other parameters.

3 Inference

In this section, we address inference about the parameter (τ1,τ2,θ1,θ2,θ3). We first describe the estimating procedure in Section 3.1, then present results from its application on the motivating dataset in Section 3.2. Finally, we summarize a simulation study performed to evaluate its performance in Section 3.3.

3.1 A two-step estimating procedure

We define a two-step estimating procedure: we first estimate the sequence of change points, and then estimate (θ1,θ2,θ3) on each interval characterized by the previously estimated change points.

A great variety of methods have been proposed for the estimation of change points (among many others [7911]. We choose to adopt a popular approach originally proposed by Bai [7]. The approach relies on a least-squares criterion and aims at detecting change points which affect the mean of a linear process. The estimator (τˆ1,τˆ2) of the sequence of change points (τ1,τ2) is defined as:

(τˆ1,τˆ2)=argmin(τ1,τ2)1Nk=13i=τk1τk1CiCˉk2,

where Cˉk is the arithmetic mean of Cτk1:τk1. This makes sense because if (C(t))t[Tk1,Tk) reaches a stationary regime then the random variables Cτk,,Cτk1 are identically distributed from a stationary distribution whose mean parameter is μk.

Fix k{1,2,3}. On each interval [Tˆk1,Tˆk)=[τˆ1δ,τˆ2δ), we estimate θk by minimizing a contrast function based on the log-likelihood of the approximated discrete-time Euler–Maruyama scheme (see Kessler [12], for instance). The latter scheme with step size δ guarantees that, for i=τˆk1,,τˆk1,

(2)Ci+1(1δλk)Ci+δλkμk+σkδCiηi+1,

where (ηi)τˆk1<iτˆk is a sequence of independent random variables with standard Gaussian distribution. By eq. (2), it is convenient to consider the following equivalent parametrization: θk=(θ1,k,θ2,k,θ3,k), with θ1,k=(1δλk), θ2,k=δλkμk and θ3,k=σkδ. The estimator θˆk of parameter θk is defined as:

θ^k=argminθk+3i=τ^k1τ^k1(Ci+1θ1,kCiθ2,k)2Ciθ3,k2+(τ^kτ^k1)log(θ3,k2),

which actually yields closed-form expressions:

θˆ1,k=τˆkτˆk1Ci+1CiCi+11Ciτˆkτˆk12Ci1Ci,
θˆ2,k=Ci+1θˆ1,kCi(τˆkτˆk1),
θˆ3,k=Ci+1θˆ1,kCiθˆ2,k)2Ciτˆkτˆk1

(the sums in the above expressions range over [τˆk1,τˆk1]).

Under mild conditions, and if the process reaches the stationary regime, then (τˆ1,τˆ2) consistently estimates (τ1,τ2) (see Lavielle [13], Lavielle and Ludeña [14], for instance). Furthermore, if the true change points (τ1,τ2) are known then, under another set of mild conditions, the estimators (θˆ1,θˆ2,θˆ3) consistently estimate (θ1,θ2,θ3) (see Kessler [12], Theorem 1). To the best of our knowledge, there is no satisfactory result in the literature regarding the joint estimation of the change points (τ1,τ2) and the parameter (θ1,θ2,θ3).

3.2 Application to the real dataset

We undertake a simulation study of the properties of the two-step estimating procedure presented in the previous section and summarize its results in the next section. Because we characterize our simulation scheme based on the results obtained when applying the latter procedure to the real dataset, we first present them here.

For each subject and protocol, we estimate (τ1,τ2,θ1,θ2,θ3) from the corresponding (real) observed trajectory. The results pertaining to the estimation of (τ1,τ2) are summarized in Table 2 (the mean and standard deviation of the estimates (τˆ1δ,τˆ2δ) computed over each group of subjects are provided) and illustrated in Figure 1. Results pertaining to the estimation of (θ1,θ2,θ3) are summarized in Table 3 (the mean and standard deviation of the estimates (θˆ1,θˆ2,θˆ3) computed over each group of subjects are provided).

Table 2

For each subject and protocol, we estimate the change points (τ1,τ2). For each group of subjects, we compute over the group the mean and standard deviation (given between parentheses) of the estimates (τˆ1δ,τˆ2δ)

Protocol 1Protocol 2
Hemiplegic subjects18.4 (7.4)43.3 (12.2)16.8 (1.0)44.5 (12.4)
Vestibular subjects20.9 (5.4)49.9 (6.5)19.5 (6.9)50.3 (5.3)
Normal subjects21.4 (8.2)47.5 (10.7)22.4 (8.1)51.4 (2.7)
Table 3

For each subject and each protocol, we first estimate the change points (τˆ1δ,τˆ2δ) and then compute the estimates (θˆ1,θˆ2,θˆ3) of (θ1,θ2,θ3) over each of the three resulting intervals [δ,τˆ1δ), [τˆ1δ,τˆ2δ) and [τˆ2δ,Nδ], where θk=(θ1,k,θ2,k,θ3,k) (k=1,2,3). For each group of subjects and each protocol, we compute over the group and for that protocol the mean and standard deviation (between parentheses) of the estimates (θˆ1,θˆ2,θˆ3)

Protocol 1Protocol 2
k=1k=2k=3k=1k=2k=3
Hemiplegic subjectsθˆ1,k0.97 (0.02)0.98 (0.02)0.98 (0.01)0.97 (0.02)0.98 (0.01)0.97 (0.02)
θˆ2,k0.11 (0.08)0.39 (0.48)0.16 (0.11)0.14 (0.09)0.38 (0.33)0.38 (0.62)
θˆ3,k20.08 (0.05)0.06 (0.06)0.06 (0.04)0.10 (0.06)0.10 (0.10)0.11 (0.09)
k=1k=2k=3k=1k=2k=3
Vestibular subjectsθˆ1,k0.99 (0.01)0.99 (0.01)0.99 (0.01)0.98 (0.02)0.98 (0.01)0.98 (0.01)
θˆ2,k0.06 (0.03)0.23 (0.24)0.09 (0.09)0.14 (0.12)0.56 (0.37)0.19 (0.20)
θˆ3,k20.05 (0.04)0.07 (0.08)0.07 (0.06)0.10 (0.07)0.12 (0.09)0.09 (0.07)
k=1k=2k=3k=1k=2k=3
Normal subjectsθˆ1,k0.99 (0.01)0.99 (0.01)0.99 (0.01)0.98 (0.02)0.99 (0.01)0.98 (0.01)
θˆ2,k0.02 (0.22)0.13 (0.15)0.10 (0.09)0.08 (0.08)0.15 (0.29)0.15 (0.14)
θˆ3,k20.04 (0.04)0.02 (0.03)0.04 (0.05)0.06 (0.05)0.04 (0.04)0.06 (0.06)

Three features of Table 2 are worth commenting on. First, one notes that there is no significant difference across groups of subjects (however, the means of τˆ1δ and τˆ2δ are slightly shifted to the left in the group of hemiplegic subjects relative to the two other groups). Second, the means of τˆ1δ are close to the time of start of perturbations for all groups and protocols. As for the means of τˆ2δ, they are close to the end time in vestibular and normal subjects and both protocols. In hemiplegic subjects, one notes that the standard deviations are quite large and that the mean of τˆ2δ is shifted to the left relative to the end time for perturbations. This is due to the fact that for each protocol, 30% of hemiplegic subjects feature an estimator τˆ2δ close to 30 s.

Third, judging by the standard deviations, all hemiplegic subjects tend to react similarly by adjusting quickly to the perturbations undergone in protocol 2. Likewise, all normal subjects tend to react similarly by adjusting quickly to the end of perturbations undergone in protocol 2. In protocol 1, the large standard deviation associated to the mean value of τˆ2δ computed over the group of normal subjects reflects the fact that 20% of these subjects feature an estimator τˆ2δ close to 30 s.

Regarding Table 3, we consider in turn parameters θ1,k, θ2,k and θ3,k (k=1,2,3). First, the estimates θˆ1,k behave similarly across groups of subjects, protocols and intervals [Tˆk1,Tˆk) for each k=1,2,3. In contrast, for each k=1,2,3, the estimates of θˆ2,k behave quite differently across groups of subjects, protocols and intervals [Tˆk1,Tˆk). This is a promising feature for the sake of classifying subjects by group, which is the problem at stake here. As for the estimates θˆ3,k (k=1,2,3), for any given group of subjects and protocol, they behave quite similarly. However, for protocol 2, it seems that the estimates θˆ3,k (k=1,2,3) in normal subjects behave differently from their counterpart in hemiplegic or vestibular subjects. This is another promising feature.

3.3 Simulation study

We carry out a simulation study to evaluate the performances of the two-step estimating procedure presented in Section 3.1. We directly simulate the trajectory C1:N. Specifically,

  1. we set (T1,T2)=(τ1δ,τ2δ)=(15,50);

  2. we rely on the Euler scheme (2) with step size δ/10 to approximate the sampling of (C(t))t[δ,Nδ] from the distribution characterized by eq. (1) where, for each k{1,2,3}, θk equals the mean of its 22 estimates based on the 22 real trajectories associated to the 22 hemiplegic subjects and protocol 1 (see Section 3.2 and Table 3);

  3. we conclude by sub-sampling (C(t))t[δ,Nδ] to derive C1:N.

We sample B=100 independent copies of C1:N. For each copy, we estimate the parameters (τ1,τ2,θ1,θ2,θ3). The means and standard deviations of the estimated change points and parameters computed over the 100 independent replications are reported in Table 4.

Table 4

For each of the B=100 independently simulated datasets, we derive the estimates (τˆ1δ,τˆ2δ,θˆ1,θˆ2,θˆ3). We report here the true values and means and standard deviations (between parentheses) computed over the B=100 replications

k=1k=2k=3
θˆ1,k0.975 (0.010)0.975 (0.009)0.973 (0.006)
θ1,k0.9700.9800.980
θˆ2,k0.114 (0.033)0.473 (0.121)0.214 (0.076)
θ2,k0.1100.3900.160
θˆ3,k20.078 (0.004)0.058 (0.002)0.060 (0.003)
θ3,k20.0800.0600.060
τˆ1δ16.2 (0.9)τˆ2δ50.8 (2.1)
τ1δ15.0τ2δ50.0

Three comments on Table 4 are in order. First, regarding the estimation of (τ1,τ2), we note that the means of the estimated change points are very close to their respective true values. Moreover, the standard deviations are small. Second, regarding the estimation of (θ1,k,θ3,k) (k=1,2,3), we note that the means of the estimates of θ1,k and θ3,k are very close to their respective true values and that the standard deviations are small. Third, regarding the estimation of θ2,k (k=1,2,3), we emphasize that the means are quite apart from their respective true values. Moreover, the standard deviations are not small. Overall this indicates a poorer estimation of θ2,k (k=1,2,3) than of (θ1,k,θ3,k) (k=1,2,3). This is probably due to the fact that the time intervals [Tk1,Tk) (k=1,2,3) are relatively narrow given the value of δ.

4 Classification

In Section 4.2, we describe our classification procedure of subjects as hemiplegic, vestibular or normal based on their trajectories obtained under the two protocols. It is built upon the previous section. Indeed, it does not rely on the trajectories themselves but rather on their finite-dimensional summary measures whose definition, given in Section 4.1, depends on the results of our two-step estimation procedure.

4.1 Summary measures

Most classification procedures based on trajectories involve a preliminary step of dimension reduction where the high-dimensional trajectories are summarized into a low-dimensional summary measure [4, 5]. Here we build a tailored finite-dimensional summary measure of every trajectory based on the estimates (τˆ1,τˆ2,θˆ1,θˆ2,θˆ3) derived from it via our two-step estimating procedure.

We argue in Section 3.2 that, overall, τˆ1, τˆ2, θˆ2,k and θˆ3,k (k=1,2,3) may be relevant for the sake of classifying subjects as hemiplegic, vestibular or normal. Because the ratio θˆ2,k/(1θˆ1,k)=μˆk is easier to interpret than θˆ2,k (and since θˆ1,k varies very little across subjects and protocols) and because θˆ3,k=σˆkδ, we choose to define our finite-dimensional summary measure as (τˆ1,τˆ2,μˆ1,μˆ2,μˆ3,σˆ1,σˆ2,σˆ3). Hereafter, we denote X1 the latter vector derived from the trajectory associated to protocol 1 and X2 that derived from the trajectory associated to protocol 2.

4.2 Classification procedure

We actually construct two classifiers, ϕˆ1 and ϕˆ2, to classify subjects as hemiplegic, vestibular or normal based on either X1 or X2 for ϕˆ1, and on both (X1,X2) for ϕˆ2.

The generic observed data structure associated to a given subject is (X1,X2,Y), where Y{1,2,3} indicates the subject’s group (with convention Y=1 for hemiplegic, Y=2 for vestibular, and Y=3 for normal). We denote P0 the true distribution of (X1,X2,Y).

Let S be the set of all classifiers based on X=(X1,X2). The misclassification risk associated to SS is R(P0)(S)=EP01{S(X)Y}. Denote by R=minSSR(P0)(S) its minimum. It is achieved at the Bayes classifier SS, characterized by S(X)=argmaxy{1,2,3}P0Y=y|X. For j=1,2, we also introduce the Bayes classifier SjS which relies only on Xj: Sj(X)=argmaxy{1,2,3}P0(Y=y|Xj).

We consider Dn={(Xi,Yi),i=1,,n}, where Xi=(Xi1,Xi2) is the summary measure associated to the subject i and Yi indicates his group. Our objective is to construct, based on Dn, ϕˆ1 and ϕˆ2 which respectively estimate the better classifier among S1 and S2(i.e., argminS1,S2R(P0)(Sj)) and S. We choose to rely on the popular methodology of random forests [15], which have proved powerful in a variety of applications. This is made easy thanks to the

R
package
randomForest
(we used the default tuning). The construction of the estimators Sˆ1 and Sˆ2 of S1 and S2 and that of ϕˆ2 is straightforward. We derive ϕˆ1 from (Sˆ1,Sˆ2) by V-fold cross-validation, with V=10.

5 Application

This section is devoted to the application of our classification procedure to the motivating dataset. In Section 5.1, we apply it exactly as it is described in Section 4. In Section 5.2, we consider a slightly enhanced version with improved performance. We provide concluding remarks in Section 5.3.

5.1 Performance of the classification procedure

We evaluate the performances of the classification procedure by the leave-one-out rule and the 0.632+bootstrap method as described by Efron and Tibshirani [16]. We apply the 0.632 + bootstrap method in order to take into account the fact that the leave-one-out rule may result in overly optimistic error rates. Resorting to the leave-one-out rule is notably motivated by the relatively small sample size of our dataset. The results are reported in Table 5 (second row).

Table 5

Leave-one-out performances and .632 + bootstrap error rates of ϕˆ1 and ϕˆ2 on the real dataset for the sake of classifying subjects as hemiplegic, vestibular or normal. First row: classification procedure of Section 4.2 when imposing (τˆ1δ,τˆ2δ)=(15,50). Second row: classification procedure of Section 4.2. Third row: extended classification procedure of Section 5.2

Performances0.632 + error rate
ϕˆ1ϕˆ2ϕˆ1ϕˆ2
Procedure of Section4.2 with (τˆ1δ,τˆ2δ)=(15,50)61%64%0.400.40
4.274%68%0.350.36
5.276%79%0.280.29

With its leave-one-out performance equal to 74% and its 0.632 + bootstrap error rate equal to 0.35, the best classifier is ϕˆ1, which involves one protocol only. For curiosity, we also evaluate the performances of ϕˆ1 and ϕˆ2 when systematically replacing (τˆ1δ,τˆ2δ) with (15,50) (the start and end times of the perturbation phase). We report the results in Table 5 (first row). Quite satisfactorily, we note that both ϕˆ1 and ϕˆ2 are not as good as the original versions: estimating the change points proves very relevant.

5.2 Performance of an extended classification procedure

It is tempting to extend our classification procedure by simply extending the definition of the summary measure which is at its core. Following Chambaz and Denis [1], we merely substitute (X1,U1) and (X2,U2) for X1 and X2 with Uj (j=1,2) derived from C1:N as Uj=(Cˉ1+Cˉ1,Cˉ2Cˉ1+,Cˉ2+Cˉ2) where

Cˉ1=δ5i[10/δ,15/δ[Ci,Cˉ1+=δ5i]15/δ,20/δ]Ci,
Cˉ2=δ5i[45/δ,50/δ[Ci,Cˉ2+=δ5i]50/δ,55/δ]Ci.

We refer to Chambaz and Denis [1] for a justification. Then we apply the extended classification procedure and report its performances in Table 5 (third row).

The performances achieve by both classifiers ϕˆ1 and ϕˆ2 are slightly better than the best performance obtained in Section 5.1: enriching the summary measures seems to provide relevant additional information for the sake of classifying subjects as hemiplegic, vestibular or normal.

5.3 Perspectives

We address the statistical challenge of classifying subjects as hemiplegic, vestibular or normal based on complex trajectories obtained through two experimental protocols which were designed to evaluate potential deficits in postural control. The classification procedure involves a dimension reduction step where the complex trajectories are summarized by finite-dimensional summary measures based on a stochastic process model for a real-valued trajectory. This allows us to retrieve from the trajectories information relative to their temporal dynamic. A leave-one-out evaluation yields a 79% performance of correct classification for a total of n=70 subjects, with 22 hemiplegic (31%), 16 vestibular (23%) and 32 normal (46%) subjects.

In future work, we will extend the classification procedure by introducing finite-dimensional summary measures based on a stochastic process model for the original trajectories in 4. We will also draw advantage from our good understanding of the classification problem to tackle the closely related statistical challenge of clustering subjects according to postural style.

Acknowledgments

I wish to thank I. Bonan (Service de Médecine Physique et de réadaptation, CHU Rennes) and P-P. Vidal (CESEM, Université Paris Descartes), my PhD advisors A. Chambaz and A. Samson (MAP5, Université Paris Descartes), and my colleague V. Genon-Catalot (MAP5, Université Paris Descartes), for introducing me to this interesting problem and providing the dataset, for their attentive guidance and all the fruitful discussions.

References

1. Chambaz A, Denis C. Classification in postural style. Ann Appl Stat 2012;6(3):977–93.10.1214/12-AOAS542Search in Google Scholar

2. James G, Hastie T. Functional linear discriminant analysis for irregularly sampled curves. J Roy Stat Soc Ser B 2001;63:533–50.10.1111/1467-9868.00297Search in Google Scholar

3. Hall P, Poskitt D, Presnell B. A functionnal data-analytic approach to signal discrimination. Technometrics 2001;43:1–9.10.1198/00401700152404273Search in Google Scholar

4. Biau G, Bunea F, Wegkamp M. Functional classification in Hilbert spaces. IEEE Trans Inform Theory 2005;51:2163–72.10.1109/TIT.2005.847705Search in Google Scholar

5. Ramsay J, Silverman B. Functional data analysis. New York: Springer, 2005.10.1007/b98888Search in Google Scholar

6. Kloeden P, Platen E. Numerical solution of stochastic differential equation, vol. 23. In: Stochastic modelling and applied probability. Berlin Heidelberg: Springer-Verlag, 1992.10.1007/978-3-662-12616-5Search in Google Scholar

7. Bai J. Least squares estimation of a shift in linear processes. J Time Ser Anal 1994;15:453–72.10.1111/j.1467-9892.1994.tb00204.xSearch in Google Scholar

8. Bardet JM, Kengne W, Witenberger O. Multiple breaks detection in general causal time series using penalized quasi-likelihood. Electron J Stat 2012;6:435–77.10.1214/12-EJS680Search in Google Scholar

9. Basseville M, Nikiforov IV. Detection of abrupt changes: theory and application. Prentice Hall Information and System Sciences Series. Englewood Cliffs, NJ: Prentice Hall Inc, 1993. ISBN: 0-13-126780-9.Search in Google Scholar

10. Cuenod C, Favetto B, Genon-Catalot V, Rozenholc Y, Samson A. Parameter estimation and change-point detection from dynamic contrast enhanced MRI data using stochastic differential equations. Math Biosci 2011;233:68–76.10.1016/j.mbs.2011.06.006Search in Google Scholar PubMed

11. Lavielle M. Using penalized contrasts for the change-point problem. Signal Process 2005;85:1501–10.10.1016/j.sigpro.2005.01.012Search in Google Scholar

12. Kessler M. Estimation of an ergodic diffusion from discrete observations. Scand J Statist 1997;24:211–29. ISSN: 0303-6898.10.1111/1467-9469.00059Search in Google Scholar

13. Lavielle M. Detection of multiple changes in a sequence of dependent variables. Stochastic Process Appl 1999;83:79–102. ISSN: 0304-4149.10.1016/S0304-4149(99)00023-XSearch in Google Scholar

14. Lavielle M, Ludeña C. The multiple change-points problem for the spectral distribution. Bernoulli 2000;6:845–69. ISSN: 1350-7265.10.2307/3318759Search in Google Scholar

15. Breiman L. Random forests. Machine Learn 2001;45:5–32.10.1023/A:1010933404324Search in Google Scholar

16. Efron B, Tibshirani R. Improvements on cross-validation: the .632 + bootstrap method. J Am Stat Assoc 1997;92:548–60.Search in Google Scholar

Published Online: 2014-7-3
Published in Print: 2014-11-1

© 2014 by Walter de Gruyter Berlin / Boston

Downloaded on 31.12.2025 from https://www.degruyterbrill.com/document/doi/10.1515/ijb-2012-0017/html
Scroll to top button