Sie sind hier: ICP » R. Hilfer » Publikationen

2.3 Physical Introduction to Fractional Derivatives

[page 46, §1]

2.3.1 Basic Questions

[46.1.1] An introduction to fractional derivatives would be incomplete without an introduction to applications. [46.1.2] In the past fractional calculus has been used predominantly as a convenient calculational tool [76, 89, 26]. [46.1.3] A well known example is Riesz’ interpolation method for solving the wave equation [20]. [46.1.4] In recent times, however, fractional differential equations appear as ‘‘generalizations’’ of more or less fundamental equations of physics [18, 3, 90, 12, 104, 43, 60, 46, 54, 52, 56, 55, 58, 23, 91, 102, 129, 119]. [46.1.5] The idea is that physical phenomena can be described by fractional differential equations. [46.1.6] This practice raises at least two fundamental questions:

  1. [46.1.7] Are mathematical models with fractional derivatives consistent with the fundamental laws and fundamental symmetries of nature ?

  2. [46.1.8] How can the fractional order α of differentiation be observed or how does a fractional derivative emerge from concrete models ?

[46.1.9] Both questions will be addressed here. [46.1.10] The answer to the first question is provided by the theory of fractional time evolutions [43, 47], the answer to the second question by anomalous subdiffusion [60, 46].

2.3.2 Fractional Space

[46.2.1] Fractional derivatives are nonlocal operators. [46.2.2] Nevertheless, numerous authors have proposed fractional differential equations involving fractional spatial derivatives. [46.2.3] Particularly popular are fractional powers of the Laplace operator due to the well known work of Riesz, Feller and Bochner [97, 13, 27]. The nonlocality of fractional spatial derivatives raises serious (largely) unresolved physical problems.

[46.3.1] As an illustration of the problem with spatial fractional derivatives consider the one dimensional potential equation for functions fC2R

d2dx2fx=0,xG(2.136)

on the open interval G=]a,b[ with boundary conditions fa=0,fb=0 with a<b. A solution of this boundary value problem is fx=0 with xG. [46.3.2] This trivial solution remains unchanged as long as the boundary values fa=fb=0 remain unperturbed. [46.3.3] All functions fC2R that vanish on a,b are solutions of the boundary value problem. [46.3.4] In particular, the boundary

[page 47, §0]    specification

fx=0,for xRG(2.137)

and the perturbed boundary specification

fx=gx,for xRG(2.138)

with g0 and suppga,b= have the same trivial solution f=0 in G. [47.0.1] The reason is that d2/dx2 is a local operator.

[47.1.1] Consider now a fractional generalization of (2.136) that arises for example as the stationary limit of (Bochner-Levy) fractional diffusion equations with a fractional Laplace operator [13]. [47.1.2] Such a onedimensional fractional Laplace equation reads

Rαfx=0,(2.139)

where Rα is a Riesz fractional derivative of order 0<α<1. [47.1.3] For the boundary specification (2.137) it has the same trivial solution fx=0 for all xG. [47.1.4] But this solution no longer applies for the perturbed boundary specification (2.138). [47.1.5] In fact, assuming (2.138) for xRG and fx=0 for xG now yields Rαfx0 for all xG. [47.1.6] The exterior RG of the domain G cannot be isolated from the interior of G using classical boundary conditions. [47.1.7] The reason is that Rα is a nonlocal operator.

[47.2.1] Locality in space is a basic and firmly established principle of physics (see e.g. [115, 35]). [47.2.2] Of course, one could argue that relativistic effects are negligible, and that fractional spatial derivatives might arise as an approximate phenomenological model describing an underlying physical reality that obeys spatial locality. [47.2.3] However, spatial fractional derivatives imply not only action at a distance. [47.2.4] As seen above, they imply also that the exterior domain cannot be decoupled from the interior by conventional walls or boundary conditions. [47.2.5] This has far reaching consequences for theory and experiment. [47.2.6] In theory it invalidates all arguments based on surface to volume ratios becoming negligible in the large volume limit. [47.2.7] This includes many concepts and results in thermodynamics and statistical physics that depend on the lower dimensionality of the boundary. [47.2.8] Experimentally it becomes difficult to isolate a system from its environment. [47.2.9] Fractional diffusion would never come to rest inside a vessel with thin rigid walls unless the equilibrium concentration prevails also outside the vessel. [47.2.10] A fractionally viscous fluid at rest inside a container with thin rigid walls would have to start to move when the same fluid starts flowing outside the vessel. [47.2.11] It seems therefore difficult to reconcile nonlocality in space with theory and experiment.

[page 48, §1]

2.3.3 Fractional Time

2.3.3.1 Basic Questions

[48.1.1] Nonlocality in time, unlike space, does not violate basic principles of physics, as long as it respects causality [49, 48, 43, 47, 54]. [48.1.2] In fact, causal nonlocality in time is a common nonequilibrium phenomenon known as history dependence, hysteresis and memory. [48.2.1] Theoretical physics postulates time translation invariance as a fundamental symmetry of nature. [48.2.2] As a consequence energy conservation is fundamental, and the infinitesimal generator of time translations is a first order time derivative. [48.2.3] Replacing integer order time derivatives with fractional time derivatives raises at least three basic questions:

  1. [48.2.4] What replaces time translations as the physical time evolution ?

  2. [48.2.5] Is the nonlocality of fractional time derivatives consistent with the laws of nature ?

  3. [48.2.6] Is the asymmetry of fractional time derivatives consistent with the laws of nature ?

[48.2.7] These questions as well as ergodicity breaking, stationarity, long time limits and temporal coarse grainig were discussed first within ergodic theory [49, 48, 47] and later from a general perspective in [54].

[48.3.1] The third question requires special remarks because irreversibility is a longstanding and controversial subject [71]. [48.3.2] The problem of irreversibility may be formulated briefly in two ways.

Definition 2.20 (The normal irreversibility problem)

[48.4.1] Assume that time is reversible. Explain how and why time irreversible equations arise in physics.

Definition 2.21 (The reversed irreversibility problem)

[48.5.1] Assume that time is irreversible. Explain how and why time reversible equations arise in physics.

[48.6.1] While the normal problem has occupied physicists and mathematicians for more than a century, the reversed problem was apparently first formulated in [59]. [48.6.2] Surprisingly, the reversed irreversibility problem has a clear and quantitiative solution within the theory of fractional time. [48.6.3] The solution is based on the simple postulate that every time evolution of a physical system is irreversible. [48.6.4] It is not possible to repeat an experiment in the past [59]. [48.6.5] This empiricial fact seems to reflect a fundamental law of nature that rivals the law of energy conservation.

[48.7.1] The mathematical concepts corresponding to irreversible time evolutions are operator semigroups and abstract Cauchy problems [15, 93]. [48.7.2] The following brief introduction to fractional time evolutions (sections 2.3.3.22.3.3.8) is in large parts identical to the brief exposition in [59]. [48.7.3] For more details see [54].

[page 49, §1]

2.3.3.2 Time Evolution

[49.1.1] A physical time evolution TΔt:0Δt< is defined as a one-parameter family (with time parameter Δt) of bounded linear time evolution operators TΔt on a Banach space B. [49.1.2] The parameter Δt represents time durations. The one-parameter family fulfills the conditions

TΔt1TΔt2ft0=TΔt1+Δt2ft0(2.140)
t0=ft0(2.141)

for all Δt1,Δt20, t0R and fB. [49.1.3] The elements fB represent time dependent physical observables, i.e. functions on the time axis R. [49.1.4] Note that the argument Δt0 of TΔt has the meaning of a time duration, while tR in ft means a time instant. [49.1.5] Equations (2.140) and (2.141) define a semigroup. [49.1.6] The inverse elements T-Δt are absent. [49.1.7] This reflects the fundamental difference between past and future.

[49.2.1] The linear operator A defined as

Af=s-limΔt0+TΔtf-fΔt(2.142)

with domain

DA=fB:s-limΔt0+TΔtf-fΔtexists(2.143)

is called the infinitesimal generator of the semigroup. [49.2.2] Here s-limf=g is the strong limit and means limf-g=0 in the norm of B as usual.

2.3.3.3 Continuity

[49.3.1] Physical time evolution is continuous. [49.3.2] This requirement is represented mathematically by the assumption that

s-limΔt0TΔtf=f(2.144)

holds for all fB, where s-lim is again the strong limit. [49.3.3] Semigroups of operators satisfying this condition are called strongly continuous or C0-semigroups [15, 93]. [49.3.4] Strong continuity is weaker than uniform continuity and has become recognized as an important continuity concept that covers most applications [2].

2.3.3.4 Homogeneity

[49.4.1] Homogeneity of time means two different requirements: [49.4.2] Firstly, it requires that observations are independent of a particular instant or position in

[page 50, §0]    time. [50.0.1] Secondly, it requires arbitrary divisibility of time durations and self-consistency for the transition between time scales.

[50.1.1] Independence of physical processes from their position on the time axis requires that physical experiments are reproducible if they are ceteris paribus shifted in time. [50.1.2] The first requirement, that the start of an experiment can be shifted, is expressed mathematically as the requirement of invariance under time translations. [50.1.3] As a consequence one demands commutativity of the time evolution with time translations in the form

TτTΔtft0=TΔtTτft0=TΔtft0-τ(2.145)

for all Δt0 und t0,τR. [50.1.4] Here the translation operator Tt is defined by

Tτft0=ft0-τ.(2.146)

[50.1.5] Note that τR is a time shift, not a duration. [50.1.6] It can also be negative. Physical experiments in the past have the same outcome as in the present or in the future. [50.1.7] Outcomes of past experiments can be studied in the present with the help of documents (e.g. a video recording), irrespective of the fact that the experiment cannot be repeated in the past.

[50.2.1] The second requirement of homogeneity is homogeneous divisibility. [50.2.2] The semigroup property (2.140) implies that for Δt>0

TΔtTΔt=TΔtn=TnΔt(2.147)

holds. [50.2.3] Homogeneous divisibility of a physical time evolution requires that there exist rescaling factors Dn for Δt such that with Δt=Δt¯/Dn the limit

limnTnΔt¯/Dn=T¯Δt¯(2.148)

exists und defines a time evolution T¯Δt¯. [50.2.4] The limit n corresponds to two simultaneous limits n,Δt0, and it corresponds to the passage from a microscopic time scale Δt to a macroscopic time scale Δt¯.

2.3.3.5 Causality

[50.3.1] Causality of the physical time evolution requires that the values of the image function gt=TΔtft depend only upon values fs of the original function with time instants s<t.

2.3.3.6 Fractional Time Evolution

[50.4.1] The requirement (2.145) of homogeneity implies that the operators TΔt are convolution operators [128, 114]. Let T be a bounded linear operator on L1R that commutes with time translations, i.e. that fulfills eq. (2.145). [50.4.2] Then there

[page 51, §0]    exists a finite Borel measure μ such that

Tfs=μ*fs=fs-xμdx(2.149)

holds [128],[114, p.26]. [51.0.1] Applying this theorem to physical time evolution operators TΔt yields a convolution semigroup μΔt of measures TΔtft=μΔt*ft

μΔt1*μΔt2=μΔt1+Δt2(2.150)

with Δt1,Δt20. [51.0.2] For Δt=0 the measure μ0 is the Dirac-measure concentrated at 0.

[51.1.1] The requirement of causality implies that the support suppμΔtR+=0, of the semigroup is contained in the positive half axis.

[51.2.1] The convolution semigroups with support in the positive half axis 0, can be characterized completely by Bernstein functions [10]. [51.2.2] An arbitrarily often differentiable function b:0,R with continuous extension to 0, is called Bernstein function if for all x0,

bx0(2.151)
-1ndnbxdxn0(2.152)

holds for all nN. [51.2.3] Bernstein functions are positive, monotonously increasing and concave.

[51.3.1] The characterization is given by the following theorem [10, p.68]. [51.3.2] There exists a one-to-one mapping between the convolution semigroups μt:t0 with support on 0, and the set of Bernstein functions b:0,R [10]. [51.3.3] This mapping is given by

0e-uxμΔtdx=e-Δtbu(2.153)

with Δt>0 and u>0.

[51.4.1] The requirement of homogeneous divisibility further restricts the set of admissible Bernstein functions. [51.4.2] It leaves only those measures μ that can appear as limits

limn,Δt0μΔt**μΔtnfactors=limnμnΔt¯/Dn=μ¯Δt¯.(2.154)

[51.4.3] Such limit measures μ¯ exist if and only if bx=xα with 0<α1 and Dnn1/α holds [32, 11, 54].

[51.5.1] The remaining measures define the class of fractional time evolutions TαΔt that depend only on one parameter, the fractional order α.

[page 52, §1]    [52.1.1] These remaining fractional measures have a density and they can be written as [43, 48, 49, 47, 54]

TαΔtft0=0ft0-shαsΔtdsΔt,(2.155)

where Δt0 and 0<α1. [52.1.2] The density functions hαx are the one-sided stable probability densities [43, 49, 48, 47, 54]. [52.1.3] They have a Mellin transform [131, 103, 45]

Mhαxs=0xs-1hαxdx=1αΓ1-s/αΓ1-s(2.156)

allowing to identify

hα(x)=1αxH1110(1x|0,10,1/α)(2.157)

in terms of H-functions [30, 103, 45, 95].

2.3.3.7 Infinitesimal Generator

[52.2.1] The infinitesimal generators of the fractional semigroups TαΔt

Aαft=-M+αft=-1Γ-α0ft-s-ftsα+1ds(2.158)

are fractional time derivatives of Marchaud-Hadamard type [98, 51]. [52.2.2] This fundamental and general result provides the basis for generalizing physical equations of motion by replacing the integer order time derivative with a fractional time derivative as the generator of time evolution [43, 54].

[52.3.1] For α=1 one finds h1x=δx-1 from eq. (2.158), and the fractional semigroup Tα=1Δt reduces to the conventional translation semigroup T1Δtft0=ft0-Δt. [52.3.2] The special case α=1 occurs more frequently in the limit (2.154) than the cases α<1 in the sense that it has a larger domain of attraction. [52.3.3] The fact that the semigroup T1Δt can often be extended to a group on all of R provides an explanation for the seemingly fundamental reversibility of mechanical laws and equations. [52.3.4] This solves the "reversed irreversibility problem".

2.3.3.8 Remarks

[52.4.1] Homogeneous divisibility formalizes the fact that a verbal statement in the present tense presupposes always a certain time scale for the duration of an

[page 53, §0]    instant. [53.0.1] In this sense the present should not be thought of as a point, but as a short time interval [59, 54, 48].

[53.1.1] Fractional time evolutions seem to be related to the subjective human experience of time. [53.1.2] In physics the time duration is measured by comparison with a periodic reference (clock) process. [53.1.3] Contrary to this, the subjective human experience of time amounts to the comparison with an hour glass, i.e. with a nonperiodic reference. [53.1.4] It seems that a time duration is experienced as ‘‘long’’ if it is comparable to the time interval that has passed since birth. [53.1.5] This phenomenon seems to be reflected in fractional stationary states defined as solutions of the stationarity condition TαΔtft=ft. Fractional stationarity requires a generalization of concepts such as ‘‘stationarity’’ or ‘‘equilibrium’’. [53.1.6] This outlook could be of interest for nonequilibrium and biological systems [43, 49, 48, 47, 54].

[53.2.1] Finally, also the special case α0 challenges philosophical remarks [59]. [53.2.2] In the limit α0 the time evolution operator degenerates into the identity. [53.2.3] This could be expressed verbally by saying that for α=0 ‘‘becoming’’ and ‘‘being’’ coincide. [53.2.4] In this sense the paradoxical limit α0 is reminiscent of the eternity concept known from philosophy.

2.3.4 Identification of α from Models

[53.3.1] Consider now the second basic question of Section 2.3.1: How can the fractional order α be observed in experiment or identified from concrete models. [53.3.2] To the best knowledge of this author there exist two examples where this is possible. [53.3.3] Both are related to diffusion processes. [53.3.4] There does not seem to exist an example of a rigorous identification of α from Hamiltonian models, although it has been suggested that such a relation might exist (see [129]).

2.3.4.1 Bochner-Levy Fractional Diffusion

[53.4.1] The term fractional diffusion can refer either to diffusion with a fractional Laplace operator or to diffusion equations with a fractional time derivative. [53.4.2] Fractional diffusion (or Fokker-Planck) equations with a fractional Laplacian may be called Bochner-Levy diffusion. [53.4.3] The identification of the fractional order α in Bochner-Levy diffusion equations has been known for more than five decades [13, 26, 14]. [53.4.4] For a lucid account see also [27]. [53.4.5] The fractional order α in this case is the index of the underlying stable process [13, 27]. [53.4.6] With few exceptions [77] these developments in the nation of mathematics did, for many years, not find much attention or application in the nation of physics although eminent mathematical physicists such as Mark Kac were thoroughly familiar

[page 54, §0]    with Bochner-Levy diffusion [65]9 (This is a footnote:) 9Also, Herrmann Weyl, who pioneered fractional as well as functional calculus and worked on the foundations of physics, seems not to have applied fractional derivatives to problems in physics.. [54.0.1] A possible reason might be the unresolved problem of locality discussed above. [54.0.2] Bochner himself writes ‘‘Whether this (equation) might have physical interpretation, is not known to us’’ [13, p.370].

2.3.4.2 Montroll-Weiss Fractional Diffusion

[54.1.1] Diffusion equations with a fractional time derivative will be called Montroll-Weiss diffusion although fractional time derivatives do not appear in the original paper [87] and the connection was not discovered until 30 years later [60, 46]. [54.1.2] As shown in Section 2.3.3, the locality problem does not arise. [54.1.3] Montroll-Weiss diffusion is expected to be consistent with all fundamental laws of physics. [54.1.4] The fact that the relation between Montroll-Weiss theory and fractional time derivatives was first established in [60, 46] seems to be widely unknown at present, perhaps because this fact is never mentioned in widely read reviews [82] and popular introductions to the subject [112]10 (This is a footnote:) 10Note that, contrary to [112, p.51], fractional derivatives are never mentioned in [6]. .

[54.2.1] There exist several versions of diffusion equations with fractional time derivatives, and they differ physically or mathematically from each other [127, 104, 54, 82, 130]. [54.2.2] Of interest here will be the fractional diffusion equation for f:Rd×R+R

D0+α,1fr,t=CΔfr,t(2.159)

with a fractional time derivative of order α and type 1. [54.2.3] The Laplace operator is Δ and the fractional diffusion constant is C. [54.2.4] The function fr,t is assumed to obey the initial condition fr,0+=f0δr. [54.2.5] Equation (2.159) was introduced in integral form in [104], but the connection with [87] was not given.

[54.3.1] An alternative to eq. (2.159), introduced in [54, 53], is

D0+α,0fr,t=CΔfr,t(2.160)

with a Riemann-Liouville fractional time derivative D0+α of type 0. [54.3.2] This equation does not describe diffusion of Montroll-Weiss type [53]. [54.3.3] It has therefore been called ‘‘inconsistent’’ in [81, p.3566]. As emphasized in [53] the choice of D0+α in (2.159) is physically and mathematically consistent, but corresponds to a modified initial condition, namely I0+1-αfr,0+=f0δr. [54.3.4] Similarly, fractional diffusion equations with time derivative D0+α,β of order α and type β have been investigated in [54]. [54.3.5] For α=1 they all reduce to the diffusion equation.

[54.4.1] Before discussing how α arises from an underlying continuous time random walk it is of interest to give an overall comparison of ordinary diffusion with

[page 55, §0]    α=1 and fractional diffusion of the form (2.159) with α1. [55.0.1] This is conveniently done using the following table published in [46]. [55.0.2] The first column gives the results for α=1, the second for 0<α<1 and the third for the limit α0. [55.0.3] The first row compares the infinitesimal generators of time evolution Aα. [55.0.4] The second row gives the fundamental solution fk,u in Fourier-Laplace space. [55.0.5] The third row gives fk,t and the fourth fr,t. [55.0.6] In the fifth and sixth row the asymptotic behaviour is collected for r2/tα0 and r2/tα.

Table 2.1: Table from [46].
 
α=10<α<1α0
 
AαddtD~0+α1
fk,uf0u+Ck2f0uα-1uα+Ck2f0u1+Ck2
fk,tf0e-Ctk2f0Eα-Ctk2f01+Ck2
fr,tf0e-r2/4Ct4πCt-d/2f0r2πd/2Hαdr24Ctαf0r1-d2C2πdKd-2drC
r2tα0t-d/2r2-dtαr2/d-d/2
r2tαexp-r24Ctexp-cαr24Ctα12-αexp-rC
 

[55.1.1] In the table Eα,βx denotes the generalized Mittag-Leffler function from eq. (2.130), Kνx is the modified Bessel function [1], d>2, cα=2-ααα/2-α and the shorthand

Hαd(x)=H1220(x|1,αd/2,1,1,1)(2.161)

was used for the H-function H1220. [55.1.2] For information on H-functions see [30, 79, 95, 54].

[page 56, §1]    [56.1.1] The results in the table show that the normal diffusion (α=1) is slowed down for 0<α<1 and comes to a complete halt for α0. [56.1.2] For more discussion of the solution see [46].

2.3.4.3 Continuous Time Random Walks

[56.2.1] The fractional diffusion equation (2.159) can be related rigorously to the microscopic model of Montroll-Weiss continuous time random walks (CTRW’s) [87, 64] in the same way as ordinary diffusion is related to random walks [27]. [56.2.2] The fractional order α can be identified and has a physical meaning related to waiting times in the Montroll-Weiss model. [56.2.3] The relation between fractional time derivatives and CTRW’s was first exposed in [60, 46]. [56.2.4] The relation was established in two steps. First, it was shown in [60] that Montroll-Weiss continuous time random walks with a Mittag-Leffler waiting time density are rigorously equivalent to a fractional master equation. [56.2.5] Then, in [46] this underlying random walk model was connected to the fractional diffusion equation (2.159) in the usual asymptotic sense [109] of long times and large distances11 (This is a footnote:) 11This is emphasized in eqs. (1.8) and (2.1) in [46] that are, of course, asymptotic.. [56.2.6] For additional results see also [50, 54, 53, 57]

[56.3.1] The basic integral equation for separable continuous time random walks describes a random walker in continuous time without correlation between its spatial and temporal behaviour. [56.3.2] It reads [87, 88, 118, 39, 64]

fr,t=δr,0Φt+0tψt-trλr-rfr,tdt,(2.162)

where fr,t denotes the probability density to find the walker at position rRd after time t if it started from r=0 at time t=0. [56.3.3] The function λr is the probability for a displacement by r in each step, and ψt gives the probability density of waiting time intervals between steps. [56.3.4] The transition probabilities obey rλr=1, and Φt=1-0tψtdt is the survival probability at the initial site.

[56.4.1] The fractional master equation introduced in [60] with inital condition fr,0=δr,0 reads

D0+α,1fr,t=rwr-rfr,t(2.163)

with fractional transition rates wr obeying rwr=0. [56.4.2] Note, that eq. (2.162) contains a free function ψt that has no counterpart in eq. (2.163). [56.4.3] The rigorous relation between eq. (2.162) and eq. (2.163), first established in [60], is given by the relation

λk=1+ταwk(2.164)

[page 57, §0]    for the Fourier transformed transition rates wr and probabilities λr, and the choice

ψt=tα-1ταEα,α-tτα(2.165)

for the waiting time density, where τ>0 is a characteristic time constant. [57.0.1] With Eα,α0=1 it follows that

ψttα-1(2.166)

for t0. [57.0.2] From Eα,αxx-2 for x one finds

ψtt-α-1(2.167)

for t. [57.0.3] For α=1 the waiting time density becomes the exponential distribution, and for α0 it approaches 1/t.

[57.1.1] It had been observed already in the early 1970’s that continuous time random walks are equivalent to generalized master equations [66, 9]. [57.1.2] Similarly, the Fourier-Laplace formula

fk,u=uα-1/uα+Ck2(2.168)

for the solution of CTRW’s with algbraic tails of the form (2.167) was well known (see [117, eq.(21), p.402] [110, eq.(23), p.505] [67, eq.(29), p.3083]). [57.1.3] Comparison with row 2 of the table makes the connection between the fractional diffusion equation (2.159) and the CTRW-equation (2.162) evident. [57.1.4] However, this connection with fractional calculus was not made before the appearance of [60, 46]. [57.1.5] In particular, there is no mention of fractional derivatives or fractional calculus in [6].

[57.2.1] The rigorous relation between fractional diffusion and CTRW’s, established in [60, 46] and elaborated in [50, 54, 53, 57], has become a fruitful starting point for subsequent investigations, particularly into fractional Fokker-Planck equations with drift [19, 83, 81, 111, 80, 100, 33, 51, 61, 82, 130, 112].

Acknowledgement: The author thanks Th. Müller and S. Candelaresi for reading the manuscript.