Managing variability in the summary and comparison of gait data
- Tom Chau^{1, 2}Email author,
- Scott Young^{1, 2} and
- Sue Redekop^{1}
https://doi.org/10.1186/1743-0003-2-22
© Chau et al; licensee BioMed Central Ltd. 2005
Received: 30 April 2005
Accepted: 29 July 2005
Published: 29 July 2005
Abstract
Variability in quantitative gait data arises from many potential sources, including natural temporal dynamics of neuromotor control, pathologies of the neurological or musculoskeletal systems, the effects of aging, as well as variations in the external environment, assistive devices, instrumentation or data collection methodologies. In light of this variability, unidimensional, cycle-based gait variables such as stride period should be viewed as random variables and prototypical single-cycle kinematic or kinetic curves ought to be considered as random functions of time. Within this framework, we exemplify some practical solutions to a number of commonly encountered analytical challenges in dealing with gait variability. On the topic of univariate gait variables, robust estimation is proposed as a means of coping with contaminated gait data, and the summary of non-normally distributed gait data is demonstrated by way of empirical examples. On the summary of gait curves, we discuss methods to manage undesirable phase variation and non-robust spread estimates. To overcome the limitations of conventional comparisons among curve landmarks or parameters, we propose as a viable alternative, the combination of curve registration, robust estimation, and formal statistical testing of curves as coherent units. On the basis of these discussions, we provide heuristic guidelines for the summary of gait variables and the comparison of gait curves.
Keywords
Introduction
Definition of variability
In quantitative gait analysis, variability is commonly understood to be the fluctuation in the value of a kinematic (e.g. joint angle), kinetic (e.g. ground reaction force), spatio-temporal (e.g. stride interval) or electromyographic measurement. This fluctuation may be observed in repeated measurements over time, across or within individuals or raters, or between different measurement, intervention or health conditions. In this paper, we will focus on the variability in two types of data: unidimensional gait variables and single-cycle, prototypical gait curves, as these are the most common abstractions of spatio-temporal, kinematic and kinetic data, typically collected within a gait laboratory.
Measurement
Many different analytical methods have been proposed for estimating the variability in gait variables. The most widely used measures are those relating to the second moment of the underlying probability distribution of the gait variable of interest. Examples include, standard deviation (e.g., [1–4]), coefficient of variation (e.g., [5–8]) and coefficient of multiple correlation (e.g., [9, 10]). Other less conventional variability measures have also been suggested. For example, Kurz et al. demonstrated an information-theoretic measure of variability, where increased uncertainty in joint range-of-motion (ROM), and hence entropy, reflected augmented variability in joint ROM [11].
For gauging variability among gait curves, some distance-based measures have been put forth, including the mean distance from all curves to the mean curve in raw 3-dimensional spatial data [12], the point-by-point intercurve ranges averaged across the gait cycle [13] and the norm of the difference between coordinate vectors representing upper and lower standard deviation curves in a vector space spanned by a polynomial basis [14]. Instead of reporting a single number, an alternative and popular approach to ascertain curve variability has been to peg prediction bands around a group of curves. Recent research on this topic has demonstrated that bootstrap-derived prediction bands provide higher coverage than conventional standard deviation bands [15–17].
Additionally, various summary statistics, such as the intra-class correlation coefficient [8] and Pearson correlation coefficient [18], for estimating gait measurement reliability, repeatability or reproducibility have been deployed in the assessment of methodological, environmental and instrumentation or device-induced variability. Principal components and multiple correspondence analyses have also been applied in the quantification of variability in both gait variables and curves, as retained variance and inertia, respectively, in low dimensional projections of the original data [19].
Sources of variability
Internal
Internal variability is inherent to a person's neurological, metabolic and musculoskeletal health, and can be further subdivided into natural fluctuations, aging effects and pathological deviations. It is now well known that neurologically healthy gait exhibits natural temporal fluctuations that are governed by strong fractal dynamics [21–23]. The source of these temporal fluctuations may be supraspinal [24] and potentially the result of correlated central pattern generators [25]. One hierarchical synthesis hypothesis purports that these nonlinear dynamics are due to the neurological integration of visual and auditory stimuli, mechanoreception in the soles of the feet, along with vestibular, proprioceptive and kinesthetic (e.g., muscle spindle, Golgi tendon organ and joint afferent) inputs arriving at the brain on different time scales [24, 26]. Internal variability in gait measurements may be altered in the presence of pathological conditions which affect natural bipedal ambulation. For example, muscle spasticity tends to augment within-subject variability of kinematic and time-distance parameters [10] while Parkinson's disease, particularly with freezing gait, leads to inflated stride-to-stride variability [27] and electromyographic (EMG) shape variability and reduced timing variability in the EMG of the gastrocnemius muscle [28]. Similarly, recent studies have reported increased stride-to-stride variability due to Huntington's disease [29], amplified swing time variability due to major depressive and bipolar disorders [30], and heightened step width [31] and stride period [32] variability due to natural aging of the locomotor system.
External
Aside from mechanisms internal to the individual, variability in gait measurements may also arise from various external factors, as shown in Figure 1. For example, influences of the physical environment, such as the type of walking surface [33], the level of ambient lighting in conjunction with type of surface [34] and the presence and inclination of stairs [35] have been shown to affect cadence, step-width, and ground reaction force variability, respectively, in certain groups of individuals. Assistive devices, such as canes or semirigid ankle orthoses may reduce step-time and step-width variability [36] while different footwear (soft or hard) can affect the variability of knee and ankle joint angles, possibly by altering peripheral sensory inputs [14].
Variability may also originate from the nature of the instrumentation employed. This variability is often appraised by way of test-retest reliability studies. Some recent examples include the reproducibility of measurements made with the GAITRite mat [8], 3-dimensional optical motion capture systems [9, 18], triaxial accelerometers [37], insole pressure measurement systems [4], and a global positioning system for step length and frequency recordings [7].
Experimenter error or inconsistencies may also contribute, as an external source, to the observed variability in gait data. Besier et al. contend that the repeatability of kinematic and kinetic models depends on accurate location of anatomical landmarks [38]. Indeed, various studies have confirmed the exaggerated variability in kinematic data due to differences in marker placement between trials [9, 39] and between raters [40]. Finally, analytical manipulations, such as the computation of Euler angles [9] or the estimation of cross-sectional averages [41] may also amplify the apparent variability in gait data.
Clinical significance of variability
The magnitude of variability and its alteration bears significant clinical value, having been linked to the health of many biological systems. Particularly in human locomotion, the loss of natural fractal variability in stride dynamics has been demonstrated in advanced aging [32] and in the presence of neurological pathologies such as Parkinson's disease [42], and amyotrophic lateral sclerosis [42]. In some cases, this fractal variability is correlated to disease severity [32]. Variability may also serve as a useful indicator of the risk of falls [43] and the ability to adapt to changing conditions while walking [44]. Stride-to-stride temporal variability may be useful in studying the developmental stride dynamics in children [45]. Natural variability has been implicated as a protective mechanism against repetitive impact forces during running [14] and possibly a key ingredient for energy efficient and stable gait [46]. Variability is not always informative and useful and in fact may lead to discrepancies in treatment recommendations. For example, due to variability in static range-of-motion and kinematic measurements, Noonan et al. found that different treatments were recommended for 9 out of 11 patients with cerebral palsy, examined at four different medical centres [13].
Dealing with variability
Given the ubiquity and health relevance of variability in gait measurements, it is critical that we summarize and compare gait data in a way that reflects the true nature of their variability. Despite the apparent simplicity of these tasks, if not conducted prudently, the derived results may be misleading, as we will exemplify. In fact, there are to date many open questions relating to the analysis of quantitative gait data, such as the elusive problem of systematically comparing two families of curves.
The objectives of this paper are twofold. First, we aim to review some of the analytical issues commonly encountered in the summary and comparison of gait data variables and curves, as a result of variability. Our second goal is to demonstrate some practical solutions to the selected challenges, using real empirical data. These solutions largely draw upon successful methods reported in the statistics literature. The remainder of the paper addresses these objectives under two major headings, one on gait variables and the other on gait curves. The paper closes with some suggestions for the summary and comparison of gait data and directions for future research on this topic.
Gait random variables
Unidimensional variables which are measured or computed once per gait cycle will be referred to as gait random variables. This category includes spatio-temporal parameters such as stride length, period and frequency, velocity, single and double support times, and step width and length, as well as parameters such as range-of-motion of a particular joint, peak values, and time of occurrence of a peak, which are extracted from kinematic or kinetic curves on a per cycle basis.
Due to variability, univariate gait measures and parameters derived thereof should be regarded as stochastic rather than deterministic variables [47, 48]. In this random variable framework, a one-dimensional gait variable is represented as X and governed by an underlying, unknown probability distribution function F_{ X }, or density function . A realization of this random variable is written in lower case as x.
Inflated variability and non-robust estimation
It has been recently demonstrated that typical location and spread estimators used in quantitative gait data analysis, i.e. mean and variance, are highly susceptible to small quantities of contaminant data [48]. Indeed, a few spurious or atypical measurements can unduly inflate non-robust estimates of gait variability. The challenge in the summary of highly variable univariate gait data lies in reporting location and spread, faithful to the underlying data distribution and minimally influenced by extraordinary observations.
Here, we focus on the issue of inflated variability and non-robust estimation by examining four different spread estimators, applied to stride period data from a child with spastic diplegic cerebral palsy. As stated above, the coefficient of variation and standard deviation are routinely employed in the summary of gait variables. Given a sample of N observations of a gait variable X, i.e., {x_{1},..., x_{ N }}, the coefficient of variation is defined as,
IQR(X) = x_{0.75} - x_{0.25} (2)
where x_{0.75} and x_{0.25} are the 75% and 25% quantiles. The q-quantile is defined as where as usual, F_{ X }is the probability distribution of X. Equivalently, the q-quantile is the value, x_{ q }, of the random variable where . That is, q × 100 percent of the random variable values lie below x_{ q }. We also introduce the median absolute deviation [49],
MAD(X) = med (|X - med(X)|) (3)
We note immediately that the spread estimates in the presence of outliers are higher. The standard deviation and coefficient of variation change the most, dropping 42 and 36 percent in value, respectively, upon outlier removal. This observation is particularly important in the comparison of gait variables, as inflated variability estimates will diminish the probability of detecting significant differences when they do in fact exist. In contrast, the interquartile range and median absolute deviation, only change by 21 and 11%, respectively. We see that these latter estimates are more statistically stable, in that they are not as greatly influenced by the presence of extreme observations.
To more fully comprehend estimator robustness or lack thereof, the field of robust statistics offers a valuable tool called influence functions, which as the name implies, summarizes the influence of local contaminations on estimated values. Their use in gait analysis was first introduced in the context of stride frequency estimation [48].
We first introduce the concept of a functional, which can be understood as a real-valued function on a vector space of probability distributions [50]. In the present context, functionals allow us to think of an estimator as a function of a probability distribution. For example, for the interquartile range, the functional is simply, .
Let the mixture distribution F_{z, ε}describe data governed by distribution F but contaminated by a sample z, with probability ε. The influence function at the contamination z is defined as
where T(·) is the functional for the estimator of interest. The influence function for a particular estimator measures the incremental change in the estimator, in the presence of large samples, due to a contamination at z. Clearly, if the impact of this contaminant on the estimated value is minimal, then the estimator is locally robust at z. Influence functions can be analytically derived for a variety of common gait estimators (see for example, [48]), including those mentioned above. For the sake of analytical simplicity and practical convenience, we will instead use finite sample sensitivity curves, SC(z), which can be defined as,
SC(z) = (N + 1){T(x_{1},..., x_{ N }, z) - T(x_{1},..., x_{ N })} (5)
We observe that both standard deviation and coefficient of variation have quadratic sensitivity curves with vertices close to the sample mean. In other words, as contaminants take on extreme low or high values, the estimated values are unbounded. Clearly, these two estimators are not robust, explaining their high sensitivity to the outliers in the stride period data. In contrast, both the interquartile range and median absolute deviation have bounded sensitivity curves, in the form of step functions. The median absolute deviation is actually not sensitive to contaminant values above 1.1 seconds whereas the interquartile range has a constant sensitivity to contaminant values over 1.6. Since most of the outliers in the stride period data were well above the mean, this difference explains the considerably lower sensitivity of the median absolute deviation to outlier influence.
From this example, we appreciate that estimators of gait variable spread (i.e. variability) should be selected with prudence. The popular but non-robust variability measures of standard deviation and coefficient of variation both have 0 breakdown points [51], meaning that only a single extreme value is required to drive the estimators to infinity. Indeed, as seen in Figure 2, the presence of a small fraction of outliers can unduly inflate our estimates of gait variability. Outlier management [52], with methods such as outlier factors [53] or frequent itemsets [54], represents one possible strategy to reduce unwanted variability when using these non-robust estimators. Apart from the addition of a computational step, this strategy introduces the undesirable effects of outlier smearing and masking [55], which need to be carefully addressed.
In contrast, outliers need not be explicitly identified with robust estimation, hence circumventing the above complications and abbreviating computation. The interquartile range and median absolute deviation, have breakdown points of 0.25 and 0.5, respectively [51]. Practically, this means that these estimators will remain stable (bounded) until the proportion of outliers reaches 25% and 50% of the sample size, respectively. To circumvent explicit outlier detection and its associated issues altogether, and in the presence of noisy data, which often result from spatio-temporal recordings and parameterizations of kinematic and kinetic curves, robust estimators may thus be preferable in the summary of gait variables.
Non-gaussian distributions
Even in the absence of outliers, univariate gait data may not adhere to a simple, unimodal gaussian distribution. In fact, distributions of gait measurements and derived parameters may be naturally skewed, leptokurtic or multimodal [56]. Neglecting these possibilities, we may summarize gait data with location and spread values which do not reflect the underlying data distribution.
Semi-parametric estimation
Summary of bimodal ROM data
Mixture distribution | k-means clustering | Normal distribution | |
---|---|---|---|
Mode # 1 | 37.7 ± 2.4 | 37.7 ± 2.6 | 40.4 ± 5.1 |
Mode # 2 | 49.1 ± 3.5 | 47.7 ± 3.0 | - |
Mixing proportion (mode I/mode 2) | 0.71/0.29 | 0.73/0.27 | - |
Critical value (lower) | 33.35 | 32.96 | 30.40 |
Critical value (upper) | 53.89 | 51.70 | 50.40 |
where W_{ i }is a scalar such that ∑_{ i }W_{ i }= 1 to preserve probability axioms, N_{ C }is the number of clusters or modes and is a gaussian density with mean μ_{ i }and variance . The fitting of (6) is known as semi-parametric estimation as we do not assume a particular parametric form for the data distribution per se, but do assume that it can modeled by a mixture of gaussians. In the present case, N_{ C }= 2 and we can use a simple optimization approach to determine the parameters of the mixture. In particular, we determined the parameter vector [W_{1}, W_{2}, μ_{1}, σ_{1}, μ_{2}, σ_{2}] to minimize the objective function , where n_{ j }is the number of points within an interval of length Δ around x_{ j }and N is the number of points in the sample. The latter term in the objective function is a crude probability density estimate [59]. As seen in Table 1, the results of fitting this bimodal mixture yields similar results to those obtained from clustering.
What are the implications of naively summarizing these data with a unimodal normal distribution? First of all, the probabilities of observing range-of-motion values between 35 and 39 degrees, where most of the observations occur, would be underestimated. Likewise, ROM values between 39 and 48 degrees, where the data exhibit a dip in observed frequencies, would be grossly overestimated. These discrepancies are labeled as regions B and C in Figure 4. More importantly, the discrepancies in the tails of the distributions, regions A and D, suggest that statistical comparisons with other data, say pathological ROM, would likely yield inconsistent conclusions, depending on whether the mixture or simple distribution was assumed. Indeed, as seen in Table 1 the lower critical value of the simple normal distribution for a 5% significance level is too low. This could lead to exagerrated Type II errors. Similarly, the upper critical value is not high enough, potentially leading to many false positive (Type I) errors.
The above example depicts bimodal data. However, the mixture distribution method can be applied to arbitrary non-normal data distributions, regardless of the underlying modality. Fitting such distributions can be accomplished by the well-established expectation-maximization algorithm [60]. For a comprehensive review of other semi-parametric and non-parametric estimation methods, see for example [59].
Parametric estimation
where a is the shape parameter, b is the scale parameter and Γ(·) is the gamma function. The gamma distribution fits are plotted as solid lines in Figure 5.
Statistical comparison of stride periods under different distributional assumptions
Child | No. strides | Gaussian distribution | Gamma distribution | ||
---|---|---|---|---|---|
u _{ Z } | σ _{ Z } | a | b | ||
1 | 24 | 1.36 | 0.158 | 79.19 | 0.0171 |
2 | 23 | 1.74 | 0.734 | 7.513 | 0.232 |
p = 0.31 | p = 0.036 |
In brief, the issue of non-normal distributions of measured gait variables or derived parameters, may lead to inaccurate reports of population means and variability and error-prone statistical testing. In fact, as the last example has shown, different distributional assumptions may lead to different statistical conclusions. Without a priori knowledge about the form of the distribution, one possible solution is to use a general mixture distribution to summarize the gait data. When we have some a priori knowledge about the underlying distribution, we can simply summarize the data using a known non-gaussian distribution, such as the gamma distribution exemplified above for the right-skewed stride period data. In either case, it is generally advisable to routinely check for significant departure from normality using such tests for normality as Pearson's Chi-square [64] or Lilliefors [57].
We remark that mixture models typically have a larger number of parameters than simple unimodal models. As a general rule-of-thumb, one should thus consider that mixture models generally require more data points for their estimation [59]. In particular, note that in any hypothesis test, the requisite sample size is dependent on the anticipated effect size, the desired level of significance and the specified level of statistical power [65]. For specific guidelines and methodology relating to sample size determination, the reader is referred to literature on sample size considerations in general hypothesis testing [66], normality testing [67], and other distributional testing [68].
Single-cycle gait curves
Kinematic, kinetic and metabolic data are often presented in the form of single-cycle curves, representing a time-varying value over one complete gait cycle. Time is often normalized such that the data vary over percentages of the gait cycle rather than absolute time. Examples include curves for joint angles, moments and powers, ground reaction forces, and potential and kinetic energy. Due to variability from stride-to-stride, these measurements do not generate a single curve, but a family of curves, each one slightly different from the other. We will consider a family of gait curves as realizations of a random function [69–71]. Let X_{ j }(t) denote a discrete time function, i.e. a gait curve, where for convenience and without loss of generality, t is a positive integer and t = 1,..., 100. We further assume that the differences among curves at each point in time are independently normally distributed. Each sample curve, X_{ j }(t), can thus be represented as [70],
X_{ j }(t) = f(t) + ε_{ j }(t) j = 1,..., N t = 1,..., 100 (8)
where f(t) is the true underlying mean function, ε_{ j }(t) ~ (0, σ_{ j }(t)^{2}) are independent, normally distributed, gaussian random variables with variance σ_{ j }(t)^{2} and N is the number of curves observed. With this formulation in mind, we now address four prevalent challenges in analyzing gait curves, namely, undesired phase variation, robust estimation of spread, the difficulty with landmark analysis and lastly, the comparison of curves as whole objects rather than as disconnected points.
Phase variation
It has been recognized that within a sample of single-cycle gait curves, there is both amplitude and phase variation [71–73]. Typically, when we describe variability in gait curves, we refer to amplitude variability. However, unchecked phase variation, that is the temporal misalignment of curves, can often lead to inflated amplitude variability estimates [72, 73]. Computing cross-sectional averages over a family of malaligned gait curves can lead to the cancellation of critical shape characteristics and landmarks [74]. This issue presents a significant challenge when summarizing a series of curves for clinical interpretation and treatment planning. On the one hand, the presentation of a large number of different curves can be overwhelmingly difficult to assimilate. On the other hand, a prototypical average curve which does not reflect the features of the individual curves is equally uninformative.
Curve registration [71] is loosely the process of temporally aligning a set of curves. More precisely, it is the alignment of curves by minimizing discrepancies from an iteratively estimated sample mean or by allineating specific curve landmarks. Sadeghi et al. demonstrated the use of curve registration, particularly to reduce intersubject variability in angular displacement, moment and power curves [72, 73]. Additionally, they reported that curve characteristics, namely, first and second derivatives and harmonic content were preserved while peak hip angular displacement and power increased upon registration [72]. This latter finding confirms that averaging unregistered curves may eliminate useful information.
Judging by the few gait papers employing curve registration, the method appears largely unknown among the quantitative gait analysis community. Here, we briefly outline the the global registration criterion method [71, 75].
Since each gait curve is a discrete set of points, it is useful to estimate a smooth sample function for each observed sample curve. Given the periodic nature of gait curves, the Fourier transform provides an adequate functional representation of each curve. The basic principle is then to repeatedly align a set of sample functions to an iteratively estimated mean function. The agreement between a sample function and the mean function can be measured by a sum-of-squared error criterion. The goal of registration is to find a set of temporal shift functions such that the evaluation of each sample function at the transformed temporal values minimizes the sum-of-squared error criterion. The sample mean is re-estimated at each iteration with the current set of time-warped curves. As an optimization problem, the curve registration procedure is the iterative minimization of the sum-of-squared criterion J,
where N is the number of sample curves, T is the time interval of relevance, w_{ i }(·) is the time-warping function and is the iteratively estimated mean based on the current time-warped curves X_{ i }(w_{ i }(s)). For greater methodological details, the reader is referred to [71, 72, 75]. This global registration criterion method is only one of several possibilities for curve alignment. Related methods which are applicable to gait data include dynamic time warping based on identified curve landmarks [41] and latency corrected ensemble averaging [28].
The right column of Figure 6 indicates that the differences in the mean and standard deviation curves before and after registration are non-trivial, with maximum changes of +15% and -51%, respectively. The post-registration mean curve not only exhibits heightened but shifted peaks (3 – 5% of the gait cycle). This observation suggests that simple cross-sectional averaging without alignment may not only diminish useful curve features but can also inadvertently misrepresent the temporal position of key landmarks. Inaccurate identification of these landmarks, such as the minimum dorsiflexion at the onset of swing phase in this example, could be problematic when attempting to coordinate spatio-temporal and EMG recordings with kinematic curves. The bottom right graph shows a dramatic decrease in variability after registration, particularly in terminal stance. This finding is in line with the tendency towards variability reduction reported by Sadeghi et al. [72].
While curve registration is useful for mitigating unwanted phase variation in gait curves, there may be instances where phase variability is itself of interest [3]. In such instances, curve registration can still be useful in providing information about the relative temporal phase shifts among curves. Because curve registration actually changes the temporal location of data, it should not be applied in studies concerned with temporal stride dynamic characterizations, such as scaling exponents [21] or Lyapunov exponents [44]. At present, only a few gait studies have applied curve registration to manage undesired phase variability. However, the evidence in those studies, along with the example above, supports further research and exploratory application of curve registration to fully grasp its merits and limitations in quantitative gait data analyses. For now, curve registration appears to be the most viable solution to the challenge of summarizing a family of temporally misaligned gait curves. In the ensuing sections, we will demonstrate how curve registration can be used advantageously, in conjunction with other methods to address other curve summary and comparison challenges.
Robustness of spread estimation
We have already seen that curve registration can mitigate amplitude variability in a family of gait curves. The robust measurement of variability in gait curves is itself a non-trivial challenge. One may need to estimate the variability in a group of curves for the purposes of classifying a new observation as belonging to the same population, or not [15]. Alternatively, knowledge of the variability among curves can help in the statistical comparison of two populations of curves [16], say arising from two different subject groups or pre- and post-intervention.
As in gait variables, the challenge lies in robustly estimating the spread of a sample of gait curves and to avoid fallacious under or overestimation. The intuitive and perhaps most popular way of estimating curve variability is the calculation of the standard deviation across the sample of curves, for each point in the gait cycle. This yields upper, U_{ X }, and lower bands, L_{ X }, around the sample of curves, i.e.
U_{ X }(t) = μ_{ X }(t) + σ_{ X }(t) t = 1,..., 100
L_{ X }(t) = μ_{ X }(t) - σ_{ X }(t) (10)
The basic idea of the bootstrap method is to create a large number of bootstrap subsets by resampling the curves X_{ j }, j = 1,..., N with replacement. For each subset, the bootstrap mean and standard deviation are calculated. One then checks how many of the sample curves are "covered" by the bootstrap standard deviation bands. A curve is considered covered, if its maximum absolute standardized difference from the bootstrap mean is less than the bootstrap constant C. The number of covered curves averaged over all the bootstrap subsets then yields the coverage probability for the given bootstrap constant, C. The upper and lower bootstrap prediction bands can then be written as,
The reader is referred to [15] for details for practical computer implementation of the above procedure.
To further understand the robustness properties of the two spread estimators, we generate sensitivity curves using the 45 knee angle curves introduced in Figure 4. These curves are first registered to minimize unwanted phase variability. In the case of gait curves, the contaminant is not a single point, but an entire curve. For convenience, we choose the following contaminant,
where δ ∈ ℝ and δ_{ min }≤ δ ≤ δ_{ max }. In other words, the contaminant is just a shifted version of the sample mean curve, . For simulating the sensitivity curve, we choose δ_{ min }= -50 and δ_{ max }= 50, recognizing that in practice, we would never observe deviations of this magnitude. This large range does however, gives us a more complete picture of the sensitivity curves. We proceed to define the sensitivity curves for the standard deviation and bootstrap estimates as follows,
We note that, as in the univariate case, the standard deviation exhibits quadratic sensitivity with vertex at the zero deviation curve. This parabolic sensitivity curve indicates that the standard deviation bands are not locally robust to contaminant curves. In contrast, the sensitivity curve for the bootstrap bands is not smooth and quartic in nature. The lack of smoothness is due to the random resampling inherent in the bootstrap method, such that with each contaminant curve, slightly different bootstrap samples are used in estimating the 90% prediction bands. Initially, as the contaminant curve deviates from the mean curve, the sensitivity is negative, meaning that the width of the estimated bands are smaller than those for the uncontaminated data. Indeed, the actual value of the bootstrap constant initially decreases, likely to counter the accompanying sharp increase in the standard deviation bands. In other words, as the standard deviation bands widen, a smaller bootstrap constant is required to cover 90% of the sample curves. However, as the contaminant curve deviates farther from the mean, the slope of standard deviation sensitivity increases in magnitude more slowly. With a smaller change in standard deviation band per unit of deviation of the contaminant curve, the bootstrap constant necessarily increases to maintain 90% coverage. This reasoning accounts for the subsequent increase in the tails of the bootstrap sensitivity curve. Finally, we note that overall, the bootstrap sensitivity curve, although apparently unbounded, traverses a much smaller range than the standard deviation curve. This would suggest that with the kinematic data employed in this example, the bootstrap coverage bands enjoy greater stability than their highly sensitive standard deviation cousins.
In brief, the foregoing discussion further supports the use of bootstrap coverage bands in robustly summarizing the variability within a family of gait curves. Moreover, curve registration and outlier removal can further tighten the location of the prediction bands.
Problems with simple parameterizations
The wavelet transform has been touted as a useful method for uncovering intrinsic trends in data [79, 80]. Hence, it may be possible to extract an underlying low frequency trend from the amputee force curve for the sake of striking a comparison with the able-bodied curve. To this end, we decomposed the mean force curve for the child with amputation using a 4 – level coiflet wavelet transform [81]. We reconstructed the force curve using only the approximation coefficients. The resulting trend line is plotted on the right graph of Figure 9 as the thick dashed line and more closely resembles the expected force profile. Extraction of the extrema yields plausible peak locations at 17% and 44% of the gait cycle and a valley at 30%. These locations are comparable to those for the able-bodied child (peaks at 12% and 44% and valley at 26% of the gait cycle), but suggest a slightly extended loading response phase.
The extraction of the trend line in this example illustrates that in some curves, the desired landmarks may be concealed by the fluctuations of higher frequency signal components and hence may be salvageable. However, even when landmarks are clearly identifiable among curves, they reflect only a very microscopic view of the entire curve. For example, two curves could have identical landmarks, but pronounced differences in shape characteristics. We therefore do not advocate the isolated use of simple parameterizations or landmarks for routine comparison of curves. Rather, the comparison of two sets of curves should be based on the entire curve and not isolated parameterizations. We suggest however, that landmark analysis and simple parameterizations can be meaningful as a post-hoc procedure, i.e. investigating how curves are similar or different, only after statistically significant differences among curves or lack thereof have been established. We therefore suggest to first statistically compare entire gait curves as unified objects, and reserve landmarks for post-hoc analysis. In the following section, we describe how such a statistical test may be carried out.
Comparison of gait curves as coherent entities
If gait curves were strictly deterministic, one could simply define a distance measure between two curves and be done. However, due to stride-to-stride variability, an extension of the univariate statistical test is needed, to determine if one set of curves could have arisen from the same statistical distribution as another. Alternatively, one could test whether the average difference between two sets of curves is approximately zero, within the critical values of an expected distribution of differences. The fundamental challenge is to compare families of gait curves as coherent entities rather than as unconnected, independent points. One way to consider curves as a whole rather than as disjoint points is to give them an appropriate functional representation. One can then compare the functional representations of the curves. Exploiting this principle, Fan and Lin [70] proposed a general method for comparing two sets of discrete time-sampled curves. In their method, the discrete Fourier transform of the standardized difference between the mean curves of two families of curves is computed. Only selected low frequency components of the transform, which encompass the majority of signal energy, are retained. These coefficients are then subjected to the adaptive Neyman test which yields the probability that the two families of curves have similar means. To the best of our knowledge, the adaptive Neyman statistic [69] has not yet been applied in the gait literature for the comparison of empirical gait curves. We therefore outline below, in some detail, the proposed procedure that we have adapted from Fan and Lin [70]. Suppose that we would like to compare two families of gait curves, {X_{ j }(t), j = 1,..., N_{ X }} and {Y_{ j }(t), j = 1,..., N_{ Y }}, with t = 1,..., 100. The null hypothesis is that the difference between the means of the two families of curves is zero. In the random function formulation given by Equation (8), we can write, H_{0} : f_{ X }(t) - f_{ Y }(t) = 0, where f_{ X }(t) and f_{ Y }(t) are the true underlying mean curves. For notational convenience, we will let t = 0,..., T - 1, where we have chosen T = 100 in the previous examples. The main steps of the test are as follows.
1. Compute the sample mean curves, μ_{ X }(t) and μ_{ Y }(t), where and likewise for μ_{ Y }(t)
4. Compute the standardized difference Z(t) between the registered means,
where k = 0,..., T/2, Real(·) and Imag(·) denote the real and imaginary components of the complex Fourier coefficient , respectively, and k denotes the Fourier frequency.
6. Form a new vector of coefficients E, of length T + 1, by pairing real and imaginary coefficients of the complex Fourier coefficients, , as follows,
7. Estimate the adaptive Neyman statistic, T_{ AN }(E) for the vector defined above. This proceeds in two steps.
where Var(E^{2}), is the variance of the square of the elements of E obtained in step 6.
(b) Let K = ln(T ln T). Compute the following final transformed test statistic value [70],
Here, we have explicited indicated that the statistic has been computed for the vector E of Fourier coefficients. Asymptotically, this statistic has an exponential of an exponential distribution [69], that is, P(T_{ AN }≤ x) → exp(-exp(-x)), as T becomes arbitrarily large.
8. Estimate the p-value of the computed test statistic value, T_{ AN }(E), by Monte Carlo simulation of a large number, say 10^{6}, of vectors, Y_{ i }, i = 1,..., 10^{6}, each of length T and whose elements are drawn from a standard normal distribution, i.e. Y_{ i }~ (0, 1), ∀i. The rationale is that when two sets of curves arise from the same random function, the standardized differences of their Fourier coefficients are normally distributed around 0. For each normal vector, Y_{ i }, evaluate T_{ AN }(Y_{ i }) as in step 7 above. When the null hypothesis of no differences is true, the probability of observing an adpative neyman statistic as extreme as T_{ AN }(E) is estimated as,
where H(·) is the heaviside function, where H(x) = 1 only if x > 0 and is 0 otherwise. In the examples below, we simulated 10^{6} such vectors to estimate the probability of observing T_{ AN }.
Note that we have not said anything about the requisite sample sizes for the statistical comparison of gait curves. Clearly, as in unidimensional power analysis [65], the required sample size depends on the effect size, significance level and specified power. To the best of our knowledge, no power-sample size tables have been derived for the adaptive Neyman statistic at the time of writing. For insights on the topic, the interested reader can refer to authoritative works [65, 82] on power analysis in the univariate case. The statistical testing demonstrated here can be extended to compare more than two groups of curves, using high-dimensional analysis of variance [70]. Further, when the standardized difference curve is not smooth, wavelet denoising can be used to identify the frequency bands where the majority of signal energy is concentrated [69]. The adaptive Neyman statistic introduced here is only one of several possibilities for objectively and rigorously testing differences among curves. Other alternatives include an ANOVA test for functional data [83] and functional canonical correlation analysis [84]. The procedure outlined in this section formalizes the comparison of gait curves as coherent entities. The method provides a means of statistically confirming overall similarities and differences that we may detect by visual inspection, but may have difficulty quantifying with conventional time and frequency domain parameterizations.
Recommendations
Future directions
This paper has only skimmed the tip of the iceberg in the discussion and demonstration of several promising analytical approaches for practically addressing variability issues in gait data summary and comparison. The topics of curve registration and bootstrap estimates of curve variability, although not necessarily new to gait data analyses, have been seldom studied and applied in the gait research community. The handful of studies to date on these subjects, have provided strong initial evidence for potentially improving the rigor and objectivity of gait data interpretation. Examples in the present paper lend further credence to these methods. Systematic comparisons of these techniques with conventional parameterizations, summary statistics, and even expert interpretation of gait data, would lead to a greater appreciation of their relative merits and limitations in gait data analyses. For example, would the use of registration and bootstrapping to consolidate gait data improve the consistency of clinical decision-making? Given the propensity for variability inflation in gait data, the topic of robust estimation needs to be studied in greater depth, in terms of contaminant influences and possibly adaptive estimators [49]. Likewise, the rigorous statistical comparison of gait curves as coherent entities rather than uncorrelated sets of points, is a promising area of research in gait variability analyses. This stream of study is only in the embryonic stages but promises to strengthen the comparison of quantitative gait data and to complement its subjective interpretation, a pratice which has been debated in literature [85–87].
Declarations
Acknowledgements
The primary author would like to acknowledge the support of the Canada Research Chairs program, the Natural Sciences and Engineering Research Council, the REMAD Foundation, the Canada Foundation for Innovation, the Ontario Innovation Trust and the Bloorview Research Institute. The authors also acknowledge Jan Andrysek who contributed the amputee data.
Authors’ Affiliations
References
- Owings T, Grabiner M: Step width variability, but not step length variability or step time variability, discriminates young and older adults during treadmill locomotion. Journal of Biomechanics 2004,37(6):935-938. 10.1016/j.jbiomech.2003.11.012View ArticlePubMedGoogle Scholar
- Danion F, Varraine E, Bonnard M, Pailhous J: Stride variability in human gait: the effect of stride frequency and stride length. Gait and Posture 2003, 18: 69-77. 10.1016/S0966-6362(03)00030-4View ArticlePubMedGoogle Scholar
- Kao J, Ringenbach S, Martin P: Gait transitions are not dependent on changes in intralimbcoordination variability. Journal of Motor Behavior 2003,35(3):211-214.View ArticlePubMedGoogle Scholar
- Randolph A, Nelson M, Akkapeddi S, Levin A, Alexandrescu R: Reliability of measurements of pressures applied on teh foot during walking by a computerized insole sensor system. Archives of Physical Medicine and Rehabilitation 2000,81(5):573-578. 10.1016/S0003-9993(00)90037-6View ArticlePubMedGoogle Scholar
- del Olmo M, Cudeiro J: Temporal variability of gait in Parkinson disease: effeccts of a rehabilitation program based on rhythmic sound cues. Parkinsonism and Related Disorders 2005, 11: 25-33. 10.1016/j.parkreldis.2004.09.002View ArticlePubMedGoogle Scholar
- Cavanagh P, Perry J, Ulbrecht J, Derr J, Pammer S: Neuropathic diabetic patients do not have reduced variability of plantar loading durig gait. Gait & Posture 1998,7(3):191-199. 10.1016/S0966-6362(98)00011-3View ArticleGoogle Scholar
- Terrier P, Schutz Y: Variability of gait patterns during unconstrained walking assesed by satellite positioning (GPS). European Journal of Applied Physiology 2003,90(5–6):554-561. 10.1007/s00421-003-0906-3View ArticlePubMedGoogle Scholar
- Menz H, Latt M, Tiedemann A, Kwan M, Lord S: Reliability of the GAITRite(R) walkway system for the quantification of temporo-spatial parameters of gait in young and older people. Gait & Posture 2004, 20: 20-25. 10.1016/S0966-6362(03)00068-7View ArticleGoogle Scholar
- Growney E, Meglan D, Johnson M, Cahalan T, An K: Repeated measures of adult normal walking using a video tracking system. Gait & Posture 1997,6(2):147-162. 10.1016/S0966-6362(97)01114-4View ArticleGoogle Scholar
- Steinwender G, Saraph V, Scheiber S, Zwick E, Uitz C, Hackl K: Intrasubject repeatability of gait analysis data in normal and spastic children. Clinical Biomechanics 2000, 15: 134-139. 10.1016/S0268-0033(99)00057-1View ArticlePubMedGoogle Scholar
- Kurz M, Stergiou N: The aging human neuromuscular system expresses less certainty for selecting joint kinematics during gait. Neuroscience Letters 2003,348(3):155-158. 10.1016/S0304-3940(03)00736-5View ArticlePubMedGoogle Scholar
- Abel R, Rupp M, Sutherland D: Quantifying the variability of a complex motor task specifically studying the gait of dyskinetic CP children. Gait & Posture 2003, 17: 50-58. 10.1016/S0966-6362(02)00054-1View ArticleGoogle Scholar
- Noonan K, Halliday S, Browne R, O'Brien S, Kayes K, Feinberg J: Interobserver variability of gait analysis in patients with cerebral palsy. Journal of Pediatric Orthopaedics 2003,23(3):279-287. 10.1097/00004694-200305000-00001PubMedGoogle Scholar
- Kurz M, Stergiou N: The spanning set indicates that variability during the stance period of running is affected by footwear. Gait & Posture 2003,17(2):132-135. 10.1016/S0966-6362(02)00064-4View ArticleGoogle Scholar
- Lenhoff M, Santer T, Otis J, Peterson M, Williams B, Backus S: Bootstrap prediction and confidence bands: a superior statistical method for analysis of gait data. Gait and Posture 1999, 9: 10-17. 10.1016/S0966-6362(98)00043-5View ArticlePubMedGoogle Scholar
- Duhamel A, Bourriez J, Devos P, Krystkowiak P, Destee A, Derambure P, Defebvre L: Statistical tools for clinical gait analysis. Gait and Posture 2004, 20: 204-212. 10.1016/j.gaitpost.2003.09.010View ArticlePubMedGoogle Scholar
- Murray-Weir M, Root L, Peterson M, Lenhoff M, Wagner C, Marcus P: Proximal femoral varus rotation osteotomy in cerebral palsy: a prospective gait study. Journal of Pediatric Orthopaedics 2003, 23: 321-329. 10.1097/00004694-200305000-00009PubMedGoogle Scholar
- Westhoff B, Hirsch M, Hefter H, an dR Krauspe AW: Test-retest reliability of 3-dimensional computerized gait analysis. Sportverletzung-sportschaden 2004,18(2):76-79. 10.1055/s-2004-813229View ArticlePubMedGoogle Scholar
- Chau T: A review of analytical techniques for gait data: Part I: fuzzy, statistical and fractal methods. Gait & Posture 2001, 13: 49-66. 10.1016/S0966-6362(00)00094-1View ArticleGoogle Scholar
- Schwartz M, Trost J, Wervey R: Measurement and management of errors in quantitative gait data. Gait and Posture 2004, 20: 196-203. 10.1016/j.gaitpost.2003.09.011View ArticlePubMedGoogle Scholar
- Hausdorff J, Peng C, Ladin Z, Wei J, Goldberger A: Is walking a random walk? Evidence of long-range correlations in stride interval of human gait. Journal of Applied Physiology 1995, 78: 349-358.PubMedGoogle Scholar
- West B, Griffin L: Allometric Control, Inverse Power Laws and Human Gait. Chaos, Solitons and Fractals 1999,10(9):1519-1527. 10.1016/S0960-0779(98)00149-0View ArticleGoogle Scholar
- Griffin L, West D, West B: Random Stride Intervals with Memory. Journal of Biological Physics 2000,26(3):185-202. 10.1023/A:1010322406831PubMed CentralView ArticlePubMedGoogle Scholar
- Hausdorff J, Purdon P, Peng C, Ladin Z, Wei J, Goldberger A: Fractal dynamics of human gait: stability of long-range correlations in stride interval fluctuations. Journal of Applied Physiology 1996,80(5):1448-1457.PubMedGoogle Scholar
- West B, Scafetta N: Nonlinear dynamical model of human gait. Physical Review E 2003,67(5):1063-1065. 10.1103/PhysRevE.67.051917View ArticleGoogle Scholar
- Hausdorff J, Peng C: Multiscaled randomness: A possible source of 1/f noise in biology. Physical Review E 1996,54(2):2154-2157. 10.1103/PhysRevE.54.2154View ArticleGoogle Scholar
- Hausdorff J, Schaafsma J, Balash Y, Bartels A, Gurevich T, Giladi N: Impaired regulation of stride variability in Parkinson's disease subjects with freezing gait. Experimental Brain Research 2003,149(2):187-194.PubMedGoogle Scholar
- Miller R, Thaut M, Mclntosh G, Rice R: Components of EMG symmetry and variability in parkinsonian and healthy elderly gait. Electromyography and motor control – electroencelphalography and clinical neurophysiology 1996, 101: 1-7. 10.1016/0013-4694(95)00209-XView ArticleGoogle Scholar
- Hausdorff J, Cudkowicz M, Firtion R, Wei J, Goldberger A: Gait variability and basal ganglia disorders: stride-to-stride variations in gait cycle timing in Parkinson's disease and Huntington's disease. Movement Disorders 1998,13(3):428-437. 10.1002/mds.870130310View ArticlePubMedGoogle Scholar
- Hausdorff J, Peng C, Goldberger A, Stoll A: Gait unsteadiness and fall risk in two affective disorders: a preliminary study. BMC Psychiatry 2004, 4: 39. 10.1186/1471-244X-4-39PubMed CentralView ArticlePubMedGoogle Scholar
- Owings T, Grabiner M: Variability of step kinematics in young and older adults. Gait & Posture 2004, 20: 26-29. 10.1016/S0966-6362(03)00088-2View ArticleGoogle Scholar
- Hausdorff J, Mitchell S, Firtion R, Peng C, Cudkowicz M, Wei J, Goldberger A: Altered fractal dynamics of gait: reduced stride-interval correlations with aging and Huntington's disease. Journal of Applied Physiology 1997, 82: 262-269.PubMedGoogle Scholar
- Menz H, Lord S, Fitzpatrick R: Acceleration patterns of the head and pelvis when walking on level and irregular surfaces. Gait and Posture 2003, 18: 35-46. 10.1016/S0966-6362(02)00159-5View ArticlePubMedGoogle Scholar
- Richardson J, Thies S, Demott T, Ashton-Miller J: Interventions improve gait regularity in patients with peripheral neuropathy while walking on an irregular surface under low light. Journal of the American Geriatrics Society 2004,52(4):510-515. 10.1111/j.1532-5415.2004.52155.xView ArticlePubMedGoogle Scholar
- Stacoff A, Diezi C, Luder G, Stussi E, Quervain IKD: Ground reaction forces on stairs: effects of stair inclination and age. Gait & Posture 2005, 21: 24-38. 10.1016/j.gaitpost.2003.11.003View ArticleGoogle Scholar
- Richardson J, Thies S, DeMott T, Ashton-Miller J: A comparison of gait characteristics between older women with and without peripheral neuropathy in standard and challenging environments. Journal of the American Geriatrics Society 2004,52(9):1532-1537. 10.1111/j.1532-5415.2004.52418.xView ArticlePubMedGoogle Scholar
- Henriksen M, Lund H, Moe-Nilssen R, Bliddal H, Danneskiod-Samsoe B: Test-retest reliability of trunk accelerometric gait analysis. Gait & Posture 2004,19(3):288-297. 10.1016/S0966-6362(03)00069-9View ArticleGoogle Scholar
- Besier T, Sturnieks D, Alderson J, Lloyd D: Repeatability of gait data using a functional hip joint centre and a mean helical knee axis. Journal of Biomechanics 2003,36(8):1159-1168. 10.1016/S0021-9290(03)00087-3View ArticlePubMedGoogle Scholar
- Carson M, Harrington M, Thompson N, O'Connor J, Theologis T: Kinematic analysis of a multi-segment foot model for research and clinical applications: a repeatability analysis. Journal of Biomechanics 2001,34(10):1299-1307. 10.1016/S0021-9290(01)00101-4View ArticlePubMedGoogle Scholar
- Maynard V, Bakheit A, Oldham J, Freeman J: Intra-rater and inter-rater reliability of gait measurements with CODA mpx30 motion analysis system. Gait & Posture 2003, 17: 59-67. 10.1016/S0966-6362(02)00051-6View ArticleGoogle Scholar
- Wang K, Gasser T: Alignment of curves by dynamic time warping. The Annals of Statistics 1997,25(3):1251-1276. 10.1214/aos/1069362747View ArticleGoogle Scholar
- Hausdorff J, Lertratanakul A, Cudkowicz M, Peterson A, Kaliton D, Goldberger A: Dynamic markers of altered gait rhythm in amyotrophic lateral sclerosis. Journal of Applied Physiology 2000, 88: 2045-2053.PubMedGoogle Scholar
- Hausdorff J, Rios D, Edelberg H: Gait variability and fall risk in community-living older adults: a 1-year prospective study. Archives of Physical Medicine and Rehabilitation 2001,82(8):1050-1056. 10.1053/apmr.2001.24893View ArticlePubMedGoogle Scholar
- Buzzi U, Stergiou N, Kurz M, Hageman P, Heidel J: Nonlinear dynamics indicates aging affects variability during gait. Clinical Biomechanics 2003, 18: 435-443. 10.1016/S0268-0033(03)00029-9View ArticlePubMedGoogle Scholar
- Hausdorff J, Zemany L, Peng C, Goldberger A: Maturation of gait dynamics: stride-to-stride variability and its temporal organization in children. Journal of Applied Physiology 1999,86(3):1040-1047.PubMedGoogle Scholar
- Goldberger A, Amaral L, Hausdorff J, Ivanov P, Peng C, Stanley H: Fractal dynamics in physiology: alterations with disease and aging. PNAS 2002,99(Supp l):2466-2472. 10.1073/pnas.012579499PubMed CentralView ArticlePubMedGoogle Scholar
- Stokes V, Thorstensson A, Lanshammar H: From stride period to stride frequency. Gait & Posture 1998, 7: 35-38. 10.1016/S0966-6362(97)00024-6View ArticleGoogle Scholar
- Chau T, Parker K: On the robustness of stride frequency estimation. IEEE Transactions on Biomedical Engineering 2004,51(2):294-303. 10.1109/TBME.2003.820396View ArticlePubMedGoogle Scholar
- Shevlyakov G, Vilchevski N: Robustness in data analysis. Utrecht: VSP; 2002.Google Scholar
- Kreyszig E: Introductory functional analysis. New York: Wiley; 1989.Google Scholar
- Wilcox R: Introduction to robust estimation and hypothesis testing. San Diego: Academic Press; 1997.Google Scholar
- Hodge V, Austin J: A survey of outlier detection methodologies. Artificial Intelligence Review 2004,22(2):85-126.View ArticleGoogle Scholar
- Saltenis V: Outlier detection based on the distribution of distances between data points. Informatica 2004,15(3):399-410.Google Scholar
- He Z, amd JZX, Huang XX, Deng S: A frequent pattern discovery method for outlier detection. Lecture Notes in Computer Science 2004, 3129: 726-732.View ArticleGoogle Scholar
- Tolvi J: Genetic algorithms for outlier detection and variable selection in linear regression models. Soft Computing 2004,8(8):527-533. 10.1007/s00500-003-0310-2View ArticleGoogle Scholar
- Chau T, Rizvi S: Automatic stride interval extraction from long, highly variable noisy gait timing signals. Human Movement Science 2002,21(4):495-514. 10.1016/S0167-9457(02)00125-2View ArticlePubMedGoogle Scholar
- Conover W: Practical nonparametric statistics. third edition. New York: John Wiley & Sons; 1998.Google Scholar
- Duda R, Hart P, Stork D: Pattern Classification. New York: Wiley Interscience; 2000.Google Scholar
- Scott D: Multivariate density estimation. Wiley series in probability and statistics, New York: Wiley Interscience; 1992.View ArticleGoogle Scholar
- McLachlan G, Krishnan T: The EM algorithm and extensions. Wiley series in probability and statistics, New York: John Wiley & Sons; 1997.Google Scholar
- Palisano R, Rosenbaum P, Walter P, Russel D, Wood E, Galuppi B: Development and reliability of a system to classify gross motor function in children with cerebral palsy. Developmental Medicine and Child Neurology 1997, 39: 214-223.View ArticlePubMedGoogle Scholar
- Papoulis A: Probability, random variables and stochastic processes. New York: McGraw Hill; 1991.Google Scholar
- Kazakos D, Papantoni-Kazakos P: Detection and estimation. New York: Computer Science Press; 1990.Google Scholar
- Bendat J, Piersol A: Random data. New York: Wiley; 2000.Google Scholar
- Cohen J: Statistical power analysis for the behavioral sciences. Lawrence Erlbaum Associates; 1988.Google Scholar
- Asraf R, Brewer J: Conducting tests of hypotheses: the need for an adequate sample size. Australian Educational Researcher 2004, 31: 79-94.View ArticleGoogle Scholar
- Baxter M, Beardah C, Westwood S: Sample size and related issues in the analysis of lead isotope data. Journal of Archaeological Science 2000,27(10):973-980. 10.1006/jasc.1999.0546View ArticleGoogle Scholar
- Kundu D, Manglick A: Discriminating between the Weibull and log-normal distributions. Naval Research Logistics 2004,51(6):893-905. 10.1002/nav.20029View ArticleGoogle Scholar
- Fan J: Test of significance based on wavelet thresholding and Neyman's truncation. Journal of the American Statistical Association 1996,91(434):674-688. 10.2307/2291663View ArticleGoogle Scholar
- Fan J, Lin S: Test of significance when data are curves. Journal of the American Statistical Association 1998,93(443):1007-1021. 10.2307/2669845View ArticleGoogle Scholar
- Ramsay J, Silverman B: Functional data analysis. New York: Springer Verlag; 1997.View ArticleGoogle Scholar
- Sadeghi H, Mathieu P, Sadeghi S, Labelle H: Continuous curve registration as an intertrial gait variability reduction technique. IEEE Transactions on Neural Systems and Rehabilitation Engineering 2003, 11: 24-30. 10.1109/TNSRE.2003.810428View ArticlePubMedGoogle Scholar
- Sadeghi H, Allard P, Shafie K, Mathieu P, Sadeghi S, Prince F, Ramsay J: Reduction of gait variability using curve registration. Gait & Posture 2000, 12: 257-264. 10.1016/S0966-6362(00)00085-0View ArticleGoogle Scholar
- Kneip A, Gasser T: Statistical tools to analyze data representing a sample of curves. Annals of Statistics 1992, 20: 1266-1305. 10.1214/aos/1176348769View ArticleGoogle Scholar
- Ramsay J, Li X: Curve registration. Journal of the Royal Statistical Society – Series B 1998,60(2):351-363. 10.1111/1467-9868.00129View ArticleGoogle Scholar
- Olshen R, Biden E, Wyatt M, Sutherland D: Gait analysis and the bootstrap. The Annals of Statistics 1989,17(4):1419-1440. 10.1214/aos/1176347372View ArticleGoogle Scholar
- Sutherland D, Kaufman K, Campbell K, Ambrosini D, Wyatt M: Clinical use of prediction regions for motion analysis. Developmental Medicine and Child Neurology 1996,38(9):773-781.View ArticlePubMedGoogle Scholar
- Perry J: Gait analysis: normal and pathological function. New Jersey: SLACK Inc; 1992.Google Scholar
- Andreas E, Trevino G: Using wavelets to detect trends. Journal of Atmospheric and Oceanic Technology 1997,14(3):554-564. PublisherFullText 10.1175/1520-0426(1997)014<0554:UWTDT>2.0.CO;2View ArticleGoogle Scholar
- Meyer F: Wavelet-based estimation of a semiparametric generalized linear model of fMRI time-series. IEEE Transactions on Medical Imaging 2003,22(3):315-322. 10.1109/TMI.2003.809587View ArticlePubMedGoogle Scholar
- Cooklev T, Nishihara A: Biorthogonal Coiflets. IEEE Transactions on Signal Processing 1999,47(9):2582-2588. 10.1109/78.782213View ArticleGoogle Scholar
- Bausell R, Li Y: Power analysis for experimental research: a practical guide for the biological, medical and social sciences. Cambridge: Cambridge University Press; 2002.View ArticleGoogle Scholar
- Cuevas A, Febrero M, Fraiman R: An anova test for functional data. Computational Statistics and Data Analysis 2004, 47: 111-122. 10.1016/j.csda.2003.10.021View ArticleGoogle Scholar
- Leurgans S, Moyeed R, Silverman B: Canonical correlation analysis when the data are curves. Journal of the Royal Statistical Society B 1993,55(3):725-740.Google Scholar
- Watts HG: Gait laboratory analysis for preoperative decision making in spastic cerebral palsy: Is it all it's cracked up to be? Journal of Pediatric Orthopaedics 1994,14(6):703-4.View ArticlePubMedGoogle Scholar
- Davis RB: Reflections on clinical gait analysis. Journal of Electromyography and Kinesiology 1997,7(4):251-7. 10.1016/S1050-6411(97)00008-4View ArticlePubMedGoogle Scholar
- Sisto SA: The Value of Information Resuling from Instrumented Gait Analysis: The Physical Therapist. In Gait Analysis in the Science of Rehabilitation, Monograph 002. Edited by: Lisa JAD. Department of Veteran Affairs; 1998:76-84.Google Scholar
Copyright
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.