Fractal Objects and Self-Similar Processes

Next: Mapping Real-World Time Series Up: Fractal Analysis Methods Previous: Fractal Analysis Methods

Fractal Objects and Self-Similar Processes

Before describing the metrics we use to quantitatively characterize the fractal properties of heart rate and gait dynamics, we first review the meaning of the term fractal. The concept of a fractal is most often associated with geometrical objects satisfying two criteria: self-similarity and fractional dimensionality. Self-similarity means that an object is composed of sub-units and sub-sub-units on multiple levels that (statistically) resemble the structure of the whole object [7]. Mathematically, this property should hold on all scales. However, in the real world, there are necessarily lower and upper bounds over which such self-similar behavior applies. The second criterion for a fractal object is that it have a fractional dimension. This requirement distinguishes fractals from Euclidean objects, which have integer dimensions. As a simple example, a solid cube is self-similar since it can be divided into sub-units of 8 smaller solid cubes that resemble the large cube, and so on. However, the cube (despite its self-similarity) is not a fractal because it has an (=3) dimension. (Click here for a hands-on experiment about fractal curves.)

The concept of a fractal structure, which lacks a characteristic length scale, can be extended to the analysis of complex temporal processes. However, a challenge in detecting and quantifying self-similar scaling in complex time series is the following: Although time series are usually plotted on a 2-dimensional surface, a time series actually involves two different physical variables. For example, in Figure 1, the horizontal axis represents ``time,'' while the vertical axis represents the value of the variable that changes over time (in this case, heart rate). These two axes have independent physical units, minutes and beats/minute, respectively. (Even in cases where the two axes of a time series have the same units, their intrinsic physical meaning is still different.) This situation is different from that of geometrical curves (such as coastlines and mountain ranges) embedded in a 2-dimensional plane, where both axes represent the same physical variable. To determine if a 2-dimensional curve is self-similar, we can do the following test: (i) take a subset of the object and rescale it to the same size of the original object, using the same magnification factor for both its width and height; and then (ii) compare the statistical properties of the rescaled object with the original object. In contrast, to properly compare a subset of a time series with the original data set, we need two magnification factors (along the horizontal and vertical axes), since these two axes represent different physical variables.

To put the above discussion into mathematical terms: A time-dependent process (or time series) is self-similar if

where means that the statistical properties of both sides of the equation are identical. In other words, a self-similar process, y(t), with a parameter has the identical probability distribution as a properly rescaled process, , i.e., a time series which has been rescaled on the x-axis by a factor a () and on the y-axis by a factor of (). The exponent is called the self-similarity parameter.

In practice, however, it is impossible to determine whether two processes are statistically identical, because this strict criterion requires their having identical distribution functions (including not just the mean and variance, but all higher moments as well). Therefore, one usually approximates this equality with a weaker criterion by examining only the means and variances (first and second moments) of the distribution functions for both sides of Eq. 1.

Figure: Illustration of the concept of self-similarity for a simulated random walk. (a) Two observation windows, with time scales and , are shown for a self-similar time series y(t). (b) Magnification of the smaller window with time scale . Note that the fluctuations in (a) and (b) look similar provided that two different magnification factors, and , are applied on the horizontal and vertical scales, respectively. (c) The probability distribution, P(y), of the variable y for the two windows in (a), where and indicate the standard deviations for these two distribution functions. (d) Log-log plot of the characteristic scales of fluctuations, s, versus the window sizes, n.

Figure 2a shows an example of a self-similar time series. We note that with the appropriate choice of scaling factors on the x- and y-axis, the rescaled time series (Fig. 2b) resembles the original time series (Fig. 2a). The self-similarity parameter defined in Eq. 1 can be calculated by a simple relation

where and are the appropriate magnification factors along the horizontal and vertical direction, respectively.

In practice, we usually do not know the value of the exponent in advance. Instead, we face the challenge of extracting this scaling exponent (if one does exist) from a given time series. To this end it is necessary to study the time series on observation windows with different sizes and adopt the weak criterion of self-similarity defined above to calculate the exponent . The basic idea is illustrated in Fig. 2. Two observation windows (Fig. 2a), window 1 with horizontal size and window 2 with horizontal size , were arbitrarily selected to demonstrate the procedure. The goal is to find the correct magnification factors such that we can rescale window 1 to resemble window 2. It is straightforward to determine the magnification factor along the horizontal direction, . But for the magnification factor along the vertical direction, , we need to determine the vertical characteristic scales of windows 1 and 2. One way to do this is by examining the probability distributions (histograms) of the variable y for these two observation windows (Fig. 2c). A reasonable estimate of the characteristic scales for the vertical heights, i.e., the typical fluctuations of y, can be defined by using the standard deviations of these two histograms, denoted as and , respectively. Thus, we have . Substituting and into Eq. 2, we obtain

This relation is simply the slope of the line that joins these two points, (, ) and (, ), on a log-log plot (Fig. 2d).

In analyzing ``real-world'' time series, we perform the above calculations using the following procedures: (1) For any given size of observation window, the time series is divided into subsets of independent windows of the same size. To obtain a more reliable estimation of the characteristic fluctuation at this window size, we average over all individual values of s obtained from these subsets. (2) We then repeat these calculations, not just for two window sizes (as illustrated above), but for many different window sizes. The exponent is estimated by fitting a line on the log-log plot of s versus n across the relevant range of scales.

Next: Mapping Real-World Time Series Up: Fractal Analysis Methods Previous: Fractal Analysis Methods