Variational calculus

Introduction

It is not so much the mathematics that intrigues me but the fascinating problems you can solve using it. In this particular scenario I’m referring to a class of problems, for example, finding the shortest curve between two points on a plane or the path of traveling light between two points in the least amount of time. Luckily I’m definitely not the first one to think of these types of problems.

Finding the curve that yields the maximum surface for a given perimeter also known as the Dido problem (Weisstein, n.d.), which dates back to the age before Christ, is one of the first known formulations of a variational problem. This particular problem is considered a special case of an isoperimetric inequality problem, we will discuss this later in more detail(Dacorogna 2014). Great thinkers attempted to prove this isoperimetric inequality but one of the first to more or less succeed is Weierstrass in the 19th century with a formal proof (Siegel 2003).

It are more or less the problems that shape and evolve the mathematics behind them, making it more complete and formal. In the seventeenth century new problems around this topic emerged, Galileo (1638) with the branchistochrone problem1, Fermat’s problem (1662) on geometrical optics2, Newton (1685) with the movement of bodies through inviscid and in-compressible fluids3 and the Euler-Lagrange equations4 in the 1750s (Dacorogna 2014, Goldstine (2012)).

If we fast forward a bit to the nineteenth century we come across the Dirichlet integral5 which is the problem of of finding the function u(x) that minimizes this integral. Hilbert, Lebesgue and Tonelli played an important role in the further developing the Dirichlet integrals. Hilbert created a direct proof that there exists a shortest path between two points on a surface, Lebesgue showed that the area functional is semi-continuous and Tonelli applied the Arzela-Ascoli compactness theorem and Baire’s semi-continuity theorem to variational problems which we still use nowadays(Buttazzo, Giaquinta, and Hildebrandt 1998).

Some of the greatest minds have worked on this problem and now it is your turn. In the next section I’ll show some of the core concepts of variational calculus.

Variational calculus

Now that we have introduced some historic notes on the topic of variational calculus it is time to move on to a more solid definition. Assume that we have a function y(x) and a derivative y(x) then we can define a function that takes these functions as arguments, this so-called functional can look as follows: I=F(x,y,y)dx the objective will be to find extreme points for this function, this can either be maxima, minima or inflection points. We require this function to be twice differentiable because y(x)=0 indicates an extreme (we will use this later on in the derivation) and y(x) is signed and provides us with information about the type of extreme, e.g. a maxima, minima or inflection point. So we want to explorer the vicinity of the y(x) however, this is slightly different then the way we achieve this in standard calculus.

If we want to explorer extrema in standard calculus we can use the Taylor expansion, for example, if we want to find the extrema of a standard function f(x) we can use a Taylor expansion in the vicinity of x=a

f(x)=f(a)+f(a)1!(xa)+f2(a)2!(xa)2+...+f2(a)n!(xa)n

we assume that all of the n-th order derivatives are zero, hence we obtain

f(x)f(a)fn(a)n!(xa)n+O((xa)n) In case fn(a)>0 with n even we have a strict minimum point and for n odd we have a strict maximum point.

For the variational case we do not want to explorer near the vicinity of the value a but near the function y(x), in other words, we want to explorer the paths near the path of function y(x). We define a small variation of the function as δy(x)=y(x)y^(x)

In the next figure we show the function y(x) as original and two variations var1 and var2 using

y^(x)=y(x)+ϵη(x)

Where η(x) is an arbitrary continuous function with η(x0)=η(x1)=0.

0.02.55.07.5-80-60-40-200
var1originalvar2xytype

We can explorer the integral I in its vicinity, by using

I^=F(x,y+δy,y+δy)dx with δy0. Thus, if we perform a similar Taylor expansion at ϵ=0 we get

I=I^+dI^dϵϵ+d2I^dϵ2ϵ22!+O(ϵn) which gives II^=dI^dϵϵ+d2I^dϵ2ϵ22!+O(ϵn) We know that in order to be an extrema the first derivative must be zero hence we get dI^dϵϵ=0 Thus if we plugin the definition of I^ we get

ddϵ|ϵ=0F(x,y^,y^)dx=0 When we move the derivative inside we get a partial derivative F(x,y^,y^)ϵ|ϵ=0dx=0 x is an independent variable, hence we can use the chain rule of partial integration to get [Fy^y^ϵ+Fy^y^ϵ]|ϵ=0dx=0 We will now solve the second part of the chain rule,i.e. y^ϵ and y^ϵ, as a quick refresher the definitions of y^(x) and y^(x) y^(x)=y(x)+ϵη(x) y^(x)=y(x)+ϵη(x) taking the derivative of these functions with respect to ϵ gives y(x)ϵ=η(x) y(x)ϵ=η(x) we can use these derivatives and plug them into the previous function to obtain [Fy^η(x)+Fy^η(x)]|ϵ=0dx=0 the second part, Fy^η(x), of the integral can be computed using integration by parts (abu(x)v(x)dx=[u(x)v(x)]ababu(x)v(x)dx) if we take v(x)=η(x) and u(x)=Fy^ then we get

Fy^ηdx=Fy^ηdx(η)ddx[Fy^]dx using η(x)=η(x)dx we obtain

Fy^[η(x)]η(x)ddx[Fy^]dx which we can compute using the definition of the boundary conditions η(x1)=η(x2)=0 and get Fy^[η(x)]=0. Then we are left with η(x)ddx[Fy^]dx. If we use this in the original equation we get

[Fy^ηddx[Fy^]η]|ϵ=0dx=0 [Fy^ddx[Fy^]]η|ϵ=0dx=0 at ϵ=0

[Fy^ddx[Fy^]]ηdx=0 which is only true if Fy^ddx[Fy^]=0 which is the Euler-Lagrange equation. Solving this equation will provide us with the answer we are looking for. If we can find a function that satisfies this conditions we have found the function that minimizes or maximizes I and solved our original problem.

Conclusion

In this tutorial we provided a short overview of some historic notes on the variational calculus problems and the derviation of the Euler-Lagrange equation. In a follow-up post I’m planning to demonstrate some of the problems you can solve with this math.

References

https://books.google.de/books?id=_iTnBwAAQBAJ&pg=PA8&lpg=PA8&dq=Huygens+fluid+bodies+variational+calculus&source=bl&ots=zfpVNvTRK_&sig=zFoFk-R5WhCEud-6OTpGj2nWvGU&hl=de&sa=X&ved=0ahUKEwj1kc3Q6PzUAhXDuxQKHYwrCiwQ6AEINzAD#v=onepage&q=Huygens%20fluid%20bodies%20variational%20calculus&f=false

Buttazzo, Giuseppe, Mariano Giaquinta, and Stefan Hildebrandt. 1998. One-Dimensional Variational Problems: An Introduction. Vol. 15. Oxford University Press.

Dacorogna, Bernard. 2014. Introduction to the Calculus of Variations. World Scientific Publishing Co Inc.

Goldstine, Herman Heine. 2012. A History of the Calculus of Variations from the 17th Through the 19th Century. Vol. 5. Springer Science & Business Media.

Siegel, Alan. 2003. “A Historical Review of the Isoperimetric Theorem in 2-d, and Its Place in Elementary Plane Geometry.”

Weisstein, Eric W. n.d. “Dido’s Problem.”