Sample Programs

Sample programs for Minimal BASIC will appear here.

Numerical Integration

Introduction

There are two cases in which the value of a definite integral must be computed by numerical methods. One is the calculation of the area under the curve defined by a set of experimental data; the other is the calculation of the definite integral of a mathematical function for which no closed-form antiderivative is known. The former is often the case for response functions in experimental work in science and engineering, while the latter commonly arises in practical investigations in physics, mathematics, and engineering.

In either case, the development of numerical methods for integration, a field that belongs to applied mathematics, rests on one simple idea: if $y(x)=f(x)$ is a real-valued continuous function of $x$ defined in an interval $(a,b)$ (the complex-valued case can be treated analogously, by separating it into its real and imaginary parts), its definite integral,

$\int _{a}^{b}f(x)dx=\sum _{x=a}^{x=b}\lim _{\delta x\rightarrow 0}f(x)\delta x=\lim _{\delta x\rightarrow 0}\sum _{x=a}^{x=b}f(x)\delta x\approx \sum _{x=a}^{x=b}f(x)\Delta x$ ,

can be calculated approximately as the finite sum of the product $f(x)\Delta x$ evaluated at some given points in the interval $(a,b)$ .

In the case of experimental data, the set of points at which the value of the function is measured is usually not regularly distributed (i.e., the points are not equispaced), so the value of the definite integral must be calculated in the form:

$\int _{a}^{b}f(x)dx=\int _{{x_{0}}=a}^{x_{1}}f(x)dx+\int _{x_{1}}^{x_{2}}f(x)dx+...+\int _{x_{n-2}}^{x_{n-1}}f(x)dx+\int _{x_{n-1}}^{x_{n}=b}f(x)dx$ ,

which can be approximated either as:

$\int _{a}^{b}f(x)dx\approx f({x_{0}}=a)({x_{1}}-{x_{0}})+f({x_{1}})({x_{2}}-{x_{1}})+...+f({x_{n-2}})({x_{n-1}}-{x_{n-2}})+f({x_{n-1}})({x_{n}}-{x_{n-1}})$ ,

or as:

$\int _{a}^{b}f(x)dx\approx f({x_{1}})({x_{1}}-{x_{0}})+f({x_{2}})({x_{2}}-{x_{1}})+...+f({x_{n-1}})({x_{n-1}}-{x_{n-2}})+f({x_{n}}=b)({x_{n}}-{x_{n-1}})$ .
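The two sums above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names `left_sum` and `right_sum` and the sample data (unevenly spaced points taken from the ascending function $y=x^{2}$) are hypothetical and do not come from the text.

```python
def left_sum(xs, ys):
    """Sum of f(x_j) * (x_{j+1} - x_j): each subinterval uses its left end."""
    return sum(ys[j] * (xs[j + 1] - xs[j]) for j in range(len(xs) - 1))


def right_sum(xs, ys):
    """Sum of f(x_{j+1}) * (x_{j+1} - x_j): each subinterval uses its right end."""
    return sum(ys[j + 1] * (xs[j + 1] - xs[j]) for j in range(len(xs) - 1))


# Hypothetical, unevenly spaced samples of the ascending function y = x**2.
xs = [0.0, 0.5, 1.5, 2.0]
ys = [x * x for x in xs]

print(left_sum(xs, ys))   # 1.375
print(right_sum(xs, ys))  # 4.375
```

For this ascending data set, the exact integral of $x^{2}$ over $(0,2)$ is $8/3\approx 2.667$ , which lies between the two sums.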

In the first case, the value of the integral is underestimated (overestimated) for monotonically ascending (descending) functions, since the value of $f(x)$ taken in each evaluation is always the lowest (highest) in its subinterval, and hence the sum constitutes an absolute lower (upper) bound to the value of the integral. In the second case, the value of the integral is overestimated (underestimated) for monotonically ascending (descending) functions, since the value of $f(x)$ taken in each evaluation is always the highest (lowest) in its subinterval, and hence the sum constitutes an absolute upper (lower) bound to the value of the integral.

According to the Mean-Value Theorem of Calculus, the value of a definite integral can also be calculated as:

$\int _{a}^{b}f(x)dx=f(\gamma )(b-a)$ ,

for some value $\gamma$ in $(a,b)$ , where $f(\gamma )$ is the mean value of $f(x)$ in $(a,b)$ . Applying the theorem to each subinterval, and estimating each mean value by the average of the two endpoint values, gives a better approximation to the definite integral of a set of experimental data:

$\int _{a}^{b}f(x)dx=\int _{{x_{0}}=a}^{x_{1}}f(x)dx+\int _{x_{1}}^{x_{2}}f(x)dx+...+\int _{x_{n-2}}^{x_{n-1}}f(x)dx+\int _{x_{n-1}}^{x_{n}=b}f(x)dx$ ,

$\int _{a}^{b}f(x)dx=f(\gamma _{(0,1)})({x_{1}}-{x_{0}})+f(\gamma _{(1,2)})({x_{2}}-{x_{1}})+...+f(\gamma _{(n-2,n-1)})({x_{n-1}}-{x_{n-2}})+f(\gamma _{(n-1,n)})({x_{n}}-{x_{n-1}})$ ,

$\int _{a}^{b}f(x)dx\approx {\frac {f({x_{0}})+f({x_{1}})}{2}}({x_{1}}-{x_{0}})+{\frac {f({x_{1}})+f({x_{2}})}{2}}({x_{2}}-{x_{1}})+...+{\frac {f({x_{n-2}})+f({x_{n-1}})}{2}}({x_{n-1}}-{x_{n-2}})+{\frac {f({x_{n-1}})+f({x_{n}})}{2}}({x_{n}}-{x_{n-1}})$ .

For reasons that we shall see later, this is equivalent to assuming a piecewise linear interpolating function between the data points. The value of the integral so calculated is exact for linear functions (i.e., functions whose slope is constant), it is overestimated for functions whose slope increases (i.e., whose second derivative is strictly positive in the considered interval), and it is underestimated for functions whose slope decreases (i.e., whose second derivative is strictly negative in the considered interval). The value so calculated is the arithmetic mean of the lower and upper bounds presented before, and it is generally a better approximation than either of them. If the second derivative of the function (which can be estimated from the experimental data with the help of second or central differences) changes sign between subintervals, the value is expected to be close to the actual value due to the partial cancellation of the errors in the approximation of the mean values.
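The averaged-endpoint formula for unevenly spaced data can be sketched as follows. The function name `trapezoid_data` and the two sample data sets are assumptions made for illustration only.

```python
def trapezoid_data(xs, ys):
    """Sum of the mean of the two endpoint values times the subinterval width."""
    return sum(
        0.5 * (ys[j] + ys[j + 1]) * (xs[j + 1] - xs[j])
        for j in range(len(xs) - 1)
    )


# Hypothetical, unevenly spaced abscissas.
xs = [0.0, 0.5, 1.5, 2.0]

# A linear function, y = 2x + 1, whose exact integral over (0, 2) is 6.
linear = [2.0 * x + 1.0 for x in xs]
print(trapezoid_data(xs, linear))  # 6.0

# A curved function, y = x**2, whose exact integral over (0, 2) is 8/3.
curved = [x * x for x in xs]
print(trapezoid_data(xs, curved))  # 2.875
```

The linear data are integrated exactly, while the result for the curved data can be compared with the exact value $8/3\approx 2.667$ .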

In the case of mathematical functions, there is more information about the function, since it is possible not only to calculate the value of the function at any given point, but also to compute first, second, and higher-order derivatives with any degree of accuracy.

Let us elaborate some mathematical results, going from simple to more elaborate methods:

A central theme in the development of numerical methods, together with the study of stability (i.e., whether a method converges), is the rate of convergence of the method: how many evaluations are needed and how large the error of the approximation is, for non-iterative methods, or how many iterations are needed and how much the error is reduced in each iteration, for iterative methods.

In the case of the study of the stability, and as we have seen before, the value of the definite integral of a function $f(x)$ defined in an interval $(a,b)$ ,

$\int _{a}^{b}f(x)dx=\sum _{x=a}^{x=b}\lim _{\delta x\rightarrow 0}f(x)\delta x=\lim _{\delta x\rightarrow 0}\sum _{x=a}^{x=b}f(x)\delta x\approx \sum _{x=a}^{x=b}f({x_{k}})\Delta {x_{k}}$ ,

can be calculated approximately as the finite sum of the product $f(x)\Delta x$ evaluated at some given points in the interval $(a,b)$ .

In the limit $\Delta x\rightarrow \delta x\rightarrow 0$ , the finite sum tends to the definite integral, and so convergence is assured.
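This convergence can be observed numerically. The sketch below is an illustration only: the function name `riemann_sum`, the choice of the left end of each subinterval as evaluation point, and the test function $exp(x)$ on $(0,1)$ are all assumptions made here.

```python
import math


def riemann_sum(f, a, b, n):
    """Finite sum of f(x) * dx over n equal subintervals, using left ends."""
    dx = (b - a) / n
    return sum(f(a + k * dx) for k in range(n)) * dx


exact = math.e - 1.0  # integral of exp(x) over (0, 1)

# The error shrinks as the number of subintervals grows (dx -> 0).
for n in (10, 100, 1000):
    approx = riemann_sum(math.exp, 0.0, 1.0, n)
    print(n, approx, abs(approx - exact))
```

In this run the error falls roughly in proportion to $\Delta x$ , which is the behaviour expected of a sum that takes a single end point per subinterval.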

In the study of the rate of convergence, one is interested in increasing the accuracy of the approximation while keeping the number of subintervals fixed, at the cost of only a minor increase in computational complexity.

The usual approach is to use a polynomial approximation of the function $f(x)$ in each subinterval, built from the values of the function at several points in the subinterval.

Let us consider the case of equally-spaced points (although this restriction can be easily lifted):

According to the definition,

$\int _{a}^{b}f(x)dx=\sum _{x=a}^{x=b}\lim _{\delta x\rightarrow 0}f(x)\delta x=\lim _{\delta x\rightarrow 0}\sum _{x=a}^{x=b}f(x)\delta x\approx \sum _{x=a}^{x=b}f({x_{k}})\Delta {x_{k}}=\sum _{x=a}^{x=b}f({x_{k}})({x_{j+1}}-{x_{j}})$ ,

where $k=1,2,...,n$ labels the subinterval, ${x_{k}}$ is an arbitrary point in the subinterval $({x_{j}},{x_{j+1}})$ with $j=k-1$ (so that $j=0,1,2,...,n-1$ ), ${x_{0}}=a$ , ${x_{n}}=b$ , and $\Delta {x_{k}}=({x_{j+1}}-{x_{j}})$ .

The Mean-Value Theorem of Calculus tells us that if

$\int _{a}^{b}f(x)dx$

is the definite integral of $f(x)$ in $(a,b)$ , there exists a value $\gamma$ in $(a,b)$ , such that

$\int _{a}^{b}f(x)dx=f(\gamma )(b-a)$ .

Additionally, by definition, if

$\int _{a}^{b}f(x)dx$

is the definite integral of $f(x)$ in $(a,b)$ , it can also be understood as being composed of the individual contributions

$\int _{a}^{b}f(x)dx=\int _{{x_{0}}=a}^{x_{1}}f(x)dx+\int _{x_{1}}^{x_{2}}f(x)dx+...+\int _{x_{n-2}}^{x_{n-1}}f(x)dx+\int _{x_{n-1}}^{x_{n}=b}f(x)dx$

for arbitrary values ${x_{0}}=a,{x_{1}},{x_{2}},...,{x_{n-2}},{x_{n-1}},{x_{n}}=b$ .

Now, applying the Mean-Value Theorem to each individual contribution yields the result:

$\int _{a}^{b}f(x)dx=f(\gamma _{({x_{0}},{x_{1}})})({x_{1}}-{x_{0}})+f(\gamma _{({x_{1}},{x_{2}})})({x_{2}}-{x_{1}})+...+f(\gamma _{({x_{n-2}},{x_{n-1}})})({x_{n-1}}-{x_{n-2}})+f(\gamma _{({x_{n-1}},{x_{n}})})({x_{n}}-{x_{n-1}})$ ,

which is exact.

In the particular case that every subinterval is of equal size, i.e., $({x_{1}}-{x_{0}})=({x_{2}}-{x_{1}})=...=({x_{n-1}}-{x_{n-2}})=({x_{n}}-{x_{n-1}})=\Delta x$ , then the expression reduces to

$\int _{a}^{b}f(x)dx=\left(f(\gamma _{({x_{0}},{x_{1}})})+f(\gamma _{({x_{1}},{x_{2}})})+...+f(\gamma _{({x_{n-2}},{x_{n-1}})})+f(\gamma _{({x_{n-1}},{x_{n}})})\right)\Delta x$ .

In this way, the calculation of the initial definite integral reduces to the calculation of the mean values

$f(\gamma _{({x_{0}},{x_{1}})}),f(\gamma _{({x_{1}},{x_{2}})}),...,f(\gamma _{({x_{n-2}},{x_{n-1}})}),f(\gamma _{({x_{n-1}},{x_{n}})})$ .

In a first approximation, with only one point,

$f(\gamma _{({x_{0}},{x_{1}})})\approx f({x_{i}}_{({x_{0}},{x_{1}})}),f(\gamma _{({x_{1}},{x_{2}})})\approx f({x_{i}}_{({x_{1}},{x_{2}})}),...,f(\gamma _{({x_{n-2}},{x_{n-1}})})\approx f({x_{i}}_{({x_{n-2}},{x_{n-1}})}),f(\gamma _{({x_{n-1}},{x_{n}})})\approx f({x_{i}}_{({x_{n-1}},{x_{n}})})$ ,

with ${x_{i}}$ being the value of $x$ in the middle of each subinterval, which leads to

$\int _{a}^{b}f(x)dx\approx \left(f({x_{i}}_{({x_{0}},{x_{1}})})+f({x_{i}}_{({x_{1}},{x_{2}})})+...+f({x_{i}}_{({x_{n-2}},{x_{n-1}})})+f({x_{i}}_{({x_{n-1}},{x_{n}})})\right)\Delta x$ .
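The one-point formula above can be sketched as follows, assuming $n$ equal subintervals. The function name `midpoint_rule` and its parameters are hypothetical names chosen for this illustration.

```python
import math


def midpoint_rule(f, a, b, n):
    """Sum of f at the middle of each of n equal subintervals, times dx."""
    dx = (b - a) / n
    return sum(f(a + (j + 0.5) * dx) for j in range(n)) * dx


# A single interval (n = 1) reduces to f(midpoint) * (b - a).
print(midpoint_rule(math.exp, 0.0, 1.0, 1))  # exp(0.5), about 1.64872
print(midpoint_rule(math.exp, 0.0, 1.0, 4))
```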

In a second approximation, with only two points,

$f(\gamma _{({x_{0}},{x_{1}})})\approx {\frac {f({x_{0}})+f({x_{1}})}{2}},f(\gamma _{({x_{1}},{x_{2}})})\approx {\frac {f({x_{1}})+f({x_{2}})}{2}},...,f(\gamma _{({x_{n-2}},{x_{n-1}})})\approx {\frac {f({x_{n-2}})+f({x_{n-1}})}{2}},f(\gamma _{({x_{n-1}},{x_{n}})})\approx {\frac {f({x_{n-1}})+f({x_{n}})}{2}}$ ,

with ${x_{n-1}},{x_{n}}$ being the values of $x$ at the beginning and at the end of each subinterval (written here for the last one), which leads to

$\int _{a}^{b}f(x)dx\approx \left(f({x_{0}})+2f({x_{1}})+...+2f({x_{n-1}})+f({x_{n}})\right)\Delta x/2$ .
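A minimal sketch of the two-point formula above, assuming $n$ equal subintervals, is given next. The name `trapezoid_rule` is an assumption of this illustration; note that the interior points carry weight 2 because each one is shared by two neighbouring subintervals.

```python
import math


def trapezoid_rule(f, a, b, n):
    """(f(x_0) + 2 f(x_1) + ... + 2 f(x_{n-1}) + f(x_n)) * dx / 2."""
    dx = (b - a) / n
    interior = sum(f(a + j * dx) for j in range(1, n))
    return (f(a) + 2.0 * interior + f(b)) * dx / 2.0


# A single interval (n = 1) reduces to the mean of the two end values.
print(trapezoid_rule(math.exp, 0.0, 1.0, 1))  # (1 + e) / 2, about 1.85914
print(trapezoid_rule(math.exp, 0.0, 1.0, 4))
```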

In a third approximation, with only three points,

$f(\gamma _{({x_{0}},{x_{1}})})\approx {\frac {f({x_{0}})+f({x_{i}}_{({x_{0}},{x_{1}})})+f({x_{1}})}{3}},f(\gamma _{({x_{1}},{x_{2}})})\approx {\frac {f({x_{1}})+f({x_{i}}_{({x_{1}},{x_{2}})})+f({x_{2}})}{3}},...,f(\gamma _{({x_{n-2}},{x_{n-1}})})\approx {\frac {f({x_{n-2}})+f({x_{i}}_{({x_{n-2}},{x_{n-1}})})+f({x_{n-1}})}{3}},f(\gamma _{({x_{n-1}},{x_{n}})})\approx {\frac {f({x_{n-1}})+f({x_{i}}_{({x_{n-1}},{x_{n}})})+f({x_{n}})}{3}}$ ,

with ${x_{n-1}},{x_{i}},{x_{n}}$ being the values of $x$ at the beginning, in the middle, and at the end of each subinterval (written here for the last one), which leads to

$\int _{a}^{b}f(x)dx\approx \left(f({x_{0}})+f({x_{i}}_{({x_{0}},{x_{1}})})+2f({x_{1}})+f({x_{i}}_{({x_{1}},{x_{2}})})+2f({x_{2}})+...+2f({x_{n-2}})+f({x_{i}}_{({x_{n-2}},{x_{n-1}})})+2f({x_{n-1}})+f({x_{i}}_{({x_{n-1}},{x_{n}})})+f({x_{n}})\right)\Delta x/3$ .
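The three-point formula above can be sketched by summing, for each subinterval, the plain average of the values at its beginning, middle, and end. The name `three_point_average` is hypothetical, and equal subintervals are assumed.

```python
import math


def three_point_average(f, a, b, n):
    """Per subinterval: (f(start) + f(middle) + f(end)) / 3, times dx."""
    dx = (b - a) / n
    total = 0.0
    for j in range(n):
        start = a + j * dx
        total += (f(start) + f(start + 0.5 * dx) + f(start + dx)) / 3.0
    return total * dx


print(three_point_average(math.exp, 0.0, 1.0, 1))
```

Written out over all subintervals, the shared end points appear twice, which is where the weights 2 in the formula come from.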

In a fourth approximation, while still making use of the evaluation of the function $f(x)$ at the beginning, in the middle, and at the end of each subinterval, one can use the result that if $f(\gamma ^{-})$ and $f(\gamma ^{+})$ are two estimates for $f(\gamma )$ , then their arithmetic mean $\left(f(\gamma ^{-})+f(\gamma ^{+})\right)/2$ is another estimate whose error is never larger than the larger of the two individual errors, and is much smaller when the two errors have opposite signs.

So, adding the results for the first and second approximation, and dividing by two,

$f(\gamma _{({x_{n-1}},{x_{n}})})\approx {\frac {f({x_{i}}_{({x_{n-1}},{x_{n}})})+{\frac {f({x_{n-1}})+f({x_{n}})}{2}}}{2}}={\frac {f({x_{n-1}})+2f({x_{i}}_{({x_{n-1}},{x_{n}})})+f({x_{n}})}{4}}$ ,

which leads to the result

$\int _{a}^{b}f(x)dx\approx \left(f({x_{0}})+2f({x_{i}}_{({x_{0}},{x_{1}})})+2f({x_{1}})+2f({x_{i}}_{({x_{1}},{x_{2}})})+2f({x_{2}})+...+2f({x_{n-2}})+2f({x_{i}}_{({x_{n-2}},{x_{n-1}})})+2f({x_{n-1}})+2f({x_{i}}_{({x_{n-1}},{x_{n}})})+f({x_{n}})\right)\Delta x/4$ .
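As a check on the algebra, the weights 1, 2, 1 over 4 can be compared with the plain mean of the one-point and two-point results. The sketch below does this for a single interval; the function names are hypothetical and the test function $exp(x)$ on $(0,1)$ is an assumption of the illustration.

```python
import math


def one_point(f, a, b):
    """First approximation: f at the middle of the interval."""
    return f(0.5 * (a + b)) * (b - a)


def two_point(f, a, b):
    """Second approximation: mean of f at the two ends."""
    return 0.5 * (f(a) + f(b)) * (b - a)


def weighted_1_2_1(f, a, b):
    """Fourth approximation: (f(a) + 2 f(middle) + f(b)) / 4."""
    return (f(a) + 2.0 * f(0.5 * (a + b)) + f(b)) / 4.0 * (b - a)


mean_of_both = 0.5 * (one_point(math.exp, 0.0, 1.0) + two_point(math.exp, 0.0, 1.0))
print(mean_of_both)
print(weighted_1_2_1(math.exp, 0.0, 1.0))  # same value, up to rounding
```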

Adding the results for the first and third approximation, and dividing by two,

$f(\gamma _{({x_{n-1}},{x_{n}})})\approx {\frac {f({x_{i}}_{({x_{n-1}},{x_{n}})})+{\frac {f({x_{n-1}})+f({x_{i}}_{({x_{n-1}},{x_{n}})})+f({x_{n}})}{3}}}{2}}={\frac {f({x_{n-1}})+4f({x_{i}}_{({x_{n-1}},{x_{n}})})+f({x_{n}})}{6}}$ ,

which leads to the result

$\int _{a}^{b}f(x)dx\approx \left(f({x_{0}})+4f({x_{i}}_{({x_{0}},{x_{1}})})+2f({x_{1}})+4f({x_{i}}_{({x_{1}},{x_{2}})})+2f({x_{2}})+...+2f({x_{n-2}})+4f({x_{i}}_{({x_{n-2}},{x_{n-1}})})+2f({x_{n-1}})+4f({x_{i}}_{({x_{n-1}},{x_{n}})})+f({x_{n}})\right)\Delta x/6$ .

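In practice, one can do no better with only these three evaluations in an interval (the two end points and the middle point), and this last result is commonly known as Simpson's rule. The results obtained are simple and often accurate enough, even in the case of one single interval.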
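The last formula, with weights 1, 4, 1 over 6 in each subinterval, can be sketched as follows. The name `weighted_1_4_1` is hypothetical and equal subintervals are assumed. The cubic test function is an extra assumption, chosen because this weighting is known to integrate cubic polynomials exactly.

```python
import math


def weighted_1_4_1(f, a, b, n):
    """Per subinterval: (f(start) + 4 f(middle) + f(end)) / 6, times dx."""
    dx = (b - a) / n
    total = 0.0
    for j in range(n):
        start = a + j * dx
        total += (f(start) + 4.0 * f(start + 0.5 * dx) + f(start + dx)) / 6.0
    return total * dx


# One single interval already gives about 1.71886 for exp(x) on (0, 1).
print(weighted_1_4_1(math.exp, 0.0, 1.0, 1))

# The integral of x**3 over (0, 2) is 4, recovered with one interval.
print(weighted_1_4_1(lambda x: x ** 3, 0.0, 2.0, 1))
```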

Let us illustrate the case by means of an example:

Let us suppose that we wish to calculate the definite integral of the function $f(x)=exp(x)$ in the interval $(0,1)$ , for which we know the exact value, $F(x)|_{0}^{1}=\int _{0}^{1}exp(x)dx=exp(x)|_{0}^{1}=exp(1)-exp(0)=2.71828-1.00000=1.71828$ .

Let us also keep the problem simple and do the calculations with a single interval, i.e., ${x_{0}}=a=0.0$ , ${x_{n}}=b=1.0$ , and $x_{i}=0.5$ .

So, we have:

First approximation:

$\int _{a}^{b}f(x)dx\approx f(x_{i})(b-a)=exp(0.5)(1.0-0.0)=1.64872$