CHAPTER XVII
APPROXIMATE INTEGRATION
1. In order to obtain the area of a curve by the methods of the integral calculus as given in the preceding chapters two conmust necessarily hold. These conditions are
- (i)the equation of the curve must be known; and in that event
the function y = us representing the equation of the curve must be integrable.
In the theory of life contingencies these conditions are rarely satisfied. Rates of mortality, marriage, etc. are generally obtained from actual observations and the functions derived from these rates are seldom capable of expression in the form of a mathematical expansion. For example, if lx be the number of persons attaining exact age x in any year of time, the number living between ages x and x + 1 is given by
Ls= J 1,s+t dt, .owhere x and t are independent.
Unless l~_t follows some definite mathematical law, we cannot evaluate the integral by the methods hitherto employed. By making certain assumptions, however, we can obtain approximations to the value of the integral which are accurate enough for practical purposes.Thus, if we expand lx+t in terms of lx and differences of lx, we have
lx+t = (I + ~1)tlx
= lx + tOlxto first differences.ButOl,. = lx+i l~and l if+l is d, , the number out of the lx persons who die before reaching age x + Z.
286 INTEGRAL CALCULUS
.. Lx = J 1 lx+t dt
0 r1
= J o (lx tdx) dt as far as first differences
= Ctlx 2tzdx
L o
=lx2dx
=lx(Ix1x+1) = 2 (Ix + lx+i)
This is an approximate value of Lx on the assumption that lx+e is a first difference function of lx .
It is evident that a simple approximation to first differences will not be justifiable except in a limited number of instances : the conof formulae for approximate integration will necessarily depend on the data available and on the degree of accuracy required.
2. Simpson's rule.
Suppose that we are given two values only of us and that we represent these two values by means of the points H and K whose co-ordinates are (a, ua) and (b, ub) respectively. Suppose also that we have sufficient information to justify our representing the funcy = us by a curve of small slope and small changes of slope,
not affected by periodic changes. An infinite number of such mathecurves can be drawn to pass through H and K. The simplest curve passing through the points is a straight line, and for the purof interpolation there is usually no justification for adopting any formula other than a first difference
formula, as was done for example
in obtaining the approximation to O N b M Y
-----------
Lx in the preceding paragraph. If Fig. 36.
the straight line be drawn, the area cut off by the curve, the x-axis and the ordinates x = a and x = b will be that of the trapezium HNIl7K. The area of the trapezium is (ua + ub) (b a).
SIMPSON'S RULE 287 b
Alternatively, this area is ( uxdx.
-a
An approximation to the integral is therefore (ua + ub) (b a). If y = ux = lx+s and the limits b and a are 1 and o respectively,
we have the approximation to J lx+t dt given above, namely
0
i (lx + lx+1).
Let us consider the effect on the approximation of introducing a third known value of ux. In other words, suppose our data to be ua, ub, ue. Then there is still
an unlimited number of curves which can be drawn to pass through the three points H, K, L. The simplest of these curves will now no longer be a straight line, but will be a second difference curve. The form of this curve will be y = A + Bx + Cx2, and since three points on the curve are given we
are in possession of sufficient data to
determine the coefficients A, B, C. Fig. 37.
In practice the data will generally be available at equidistant intervals, and since we may choose our origin where we please, the problem is therefore reduced to one of finding the second difference curve which passes through the points (o, uo), (I, u1),
(2, u2).
Having found the equation of the curve we can then find the area enclosed by the curve, the extreme ordinates and the x-axis, by integrating the function thus obtained between the limits o and 2.
Assume therefore that y = ux = a + bx + cx2 is the required equation.
Then, by substitution,
uo=a, u1=a+b -!- c,
u2=a+2b+4C,
z88 INTEGRAL CALCULUS
whence a = uo, b=(-ut+4u1-3uo),
r c= 2(u22161-+0). 2
Also J o ux dx = to (a + bx + cx2) dx
= ax + 2bx2 + icx3l 2 0
=za+2b+3c
= 3 (u0 + 4u1 + u2),
on substituting for a, b and c.
This is Simpson's rule for approximate integration. We can obtain this formula by alternative methods. For example, let
ux = (1 + 0)x u0 = u0 + x,uo + ix (x I) 02u0 as far as second differences.
2 2
Then Iouxdx=Jo{uo+x,uo+2x(x-I)O2uo}dx
J 2
= J.o {u0 + x (Au0 Z,2u0) + x2202 u0} dx
[xuo + ix2(Au, L,2u0) + ix3A2u0 0
2u0 + 2 (Xu0 2A2160) + 3L2u0 = 2u0 + 2A110 + 1.12U0
,
and since uo = uo,
ul = uo + :duo ,
u2 = u0 + 2:,u0 + r,=uo ,
this reduces to (11.o + 4u1 + u2) as before.
A third method is to assume that the integral can be expressed in the form
muo + nu1 + put .
Then if y = a + bx + cx2 we have eventually that J0x = za + zb
;-Vic,
as in the first method.
CHANGE OF UNIT 289 Substituting in the assumed expression for the integral: ma-j-n(a+b+c)+p(a+2b+ 4c)=2a+zb+Rc.
Whence, by equating coefficients of a, b, c and solving the resulting equations,
m=3; n= p=3.
3. Change of unit.
r2
The formula J uxdx = 3 (uo + 4u1 + u2) is an approximate 0
formula obtained by considering unit intervals. If we wish to transform the formula to a form in which the interval is changed to, say, n, then the given values of ux will be uo; un; u2n. Our new variable is z, where z = nx.
dz = n dx.
The new limits are evidently o and 2n.
The formula is
j2n 1 (
- tlZd2 = (uo + 4un + n2n)) o n 12n
i.e. J u,dz = An (uo + 4nn + u2n),
0
or, on changing the variable to x, r2n
J uxdx = An (1d0 + 4nn + u2n) 0 This principle is of universal application and any formula of
approximate integration can immediately be transferred from unit intervals to nthly intervals and vice versa.
For example, the formula
uxdx = 22 (ul + u4 + u6 + u9)
(20
becomes f uxdx = 5 (u2 + U9 + 1112 + u19)
0
on doubling the interval;
iOn
and Jo u
dx = 2n (u+ u4+ u-+ u9)
when the interval is n.
F X9
r10 0
290 INTEGRAL CALCULUS 4. Change of origin.
Consider again Simpson's rule:
usdx = (uo + 4U1 + 112). 0
We have obtained a formula which gives in terms of u0, ui and u2 the area of the curve cut off by the Y ordinates x = o and x = 2. If we Y'
change the origin so that the point (1, ui) becomes the point (o, vo), as in Fig. 38, we shall have a formula of integration between the limits -1
and I in terms of v_1, vo and vi. v u
The formula will be otherwise un- 110 zr, ° 2 v,
altered and we shall have found an approximation to the same area with reference to a new y-axis 01 Yi .
In its new form the formula be- 0 0, %
comes Fig. 38,
Jvdx = (v-1 + 4vo + vi), -i or, changing back to u's,
~ uxdx = (u-i+4110+111) _i
Now when it is desired to obtain an approximate integration formula it is evident that integration between r and I (or between n and n) involves much less labour than integration between o and 2 (or between o and 2n). Thus, if
| |
uy=a+bx+cx2, |
| then |
110=a, |
|
| |
11_1=ab+c, |
| and |
U1=a+b+c, |
|
(u_i + 4uo + 111) = |
{4uo + (u-1 + u1)} |
| |
= |
(4a + za + zc) |
= s (6a + 2C).
CHANGE OF ORIGIN 291
Also J11(a+bx+cx2)dx= [ax + 2bx2+ 3cx3J11 = 2a + sc,
which proves the approximation.
Again, since in this form the coefficients of odd powers of x disappear in the definite integral and also in the paired terms ut and u_, , we could equally well employ as our assumed expansion for ux the third difference function a + bx + cx2 + dx3.
We should have
4uo+(u_1+III) =4a+ab+cd+ a+b+c+d=6a+zc as before, and
[ax + 2bx2 + 3cx3 + 4dx4]1 _ = 2a + 3c. 1
This leads to the important fact that Simpson's rule is true to one more order of differences than was originally assumed, i.e. it is true to third differences.
For the above reasons, namely
- (i)for simplicity in working;
- (ii)to enable us to find the true order of differences to which the approximation is correct;
it is advisable to integrate between n and n in preference to any other limits.
This can always be done by a suitable change of limits; for example, integration between z and I can be simplified to integrabetween say z and 4 by increasing the interval from to 2. Then, by changing the origin, a formula can be obtained for intebetween i and 1. To express the formula in the required form the process can then be reversed.
Example 1.'2'5Show that } vx dx = 2¢ (v5 + 22112 + Z'3) approximately.
-25 1 5
v,. dx = - / u~ dx (if we double the interval).
.1'5 2.3
This will produce a formula in u2; u4; us.
In the first place we will obtain a formula for j ux dx. 1
19-2
292 INTEGRAL CALCULUS
This formula will have to be of the form mu_2 + nuo + pu2, since we have moved our origin to the point where x was originally 4.
| Let |
ux = a + bx + cx2 + dx3. |
| Then |
u0 = a,
u_2=a2b+4c 8d, |
and so that |
u2=a+zb+4c+8d, |
(u_2 + 22116 + u2) = _i} (22a + 2a + 8c) = a + lc. f1 uxdx = [ax + 2bx2 + 3cx3 + edx4J1 1 = za +ic. 24 (u_2 + 22u0 + u2) = 2 I_ 1 ux dx.
Changing the origin back again to the original origin:
5
24 (112 + 22114+ u6) = 2 /3 uy dx.
2.5
(711+22712+V3)= [ uxdx. 24 . 1.5
Moreover, although only three values of vx have been given (so that only second differences are known), the formula is true to at least third differences.
5. Some well-known approximate integration formulae. (1) Extension of Simpson's rule.
2 We have f
o uxdx = (uo + 4u1 + u2). J
Therefore by changing the origin
uxdx= 3 (u2+4113+114).
16
Similarly uxdx = (u4 + 4u5 + 116). 4
.................................... In general, by addition,
r2n
11xdx = (u0 + 4111 + 2112 + 4113 + 2114 + ... + u2n). .o
This is a formula for approximate integration used extensively in practical mathematics. In engineering problems, where the
j 4
THE "THREE-EIGHTHS" RULE 293
curve to be integrated has actually been sketched as the result of experiments, a small unit may be chosen and a large number of ordinates drawn. In that event the extended formula can be apThis will in general give better results than the single formula
12n
n
o uxdx = 3 (110 + 4U,, + u2n)
It should be noted that Simpson's rule, applied many times (as above), does not assume that a smooth curve can be drawn between all the points u0, u1, ... u>,,. The method of obtaining the formula has in fact been to draw a number of disjointed curves between u0, u1i u2; u2, u3, u4; u2,,i and the curve passing through three points such as uo, u1, u2 will not as a rule pass through the next series
u3 and u4.
(2) The "three-eighths" rule.
The following symmetrical formula can be derived by working on the above lines, when four consecutive points are given.
If the four points were u0, u1i u2, u3 we should obviously inbetween o and 3. Change the unit and origin so that we pass firstly to the limits o and 6 and secondly to the limits 3 and 3
3
We require j'- uxdx, given u_3 i u_1, u1, u3.
3
Let ux = a + bx + cx2 + dx3,
so that u_1+211=2(a+c)
and u_3 + u3 = 2 (a + 9c).
3
If Jr 3uxdx = m (u_1 + u1) + n (u_3 + u3), we have
C3
ax + zbx2 + cx3 + idx4 = m (u_1 + u1) + n (u_3 + u3);
-3
i.e. 6a + 18c = m (u_1 + u1) + n (21_3 + u3)
= m (2a + 2c) + n (2a ± 18c)
= a (2m + 2u) + c (zm + 18n), from which, easily, m = 18/8 and n = 6/8.
294 INTEGRAL CALCULUS
J - uxdx = {6 (u_1 + u1) + 2 (u-3 + i.e. 3
i.e. JO
uxdx = $ {6 (u2 + u4) + 2 (up + u6)},
0
or uxdx = s {3 (u1 + u2) + (uo + u3)} = $ (u0+3111+3u2+u3). (3) Weddle's rule.
Suppose that seven equidistant ordinates are given. In order to reduce the algebra to a minimum, we integrate between limits 3 and + 3, and write the assumed formula for ux thus :
ux=a+bx+cx2+dx(x2I)+ex2(x2I)+fx(x2I)(x2-4) 2 6 24 120
+gx2(x2I)(x24)
720
This is Stirling's formula with the constants a, b, c, ... g re-placing the differences of ux.
Then
Judx= bx2 cx3 d x4 x2 e x5 x3l
Cax+ z + 6 +6(42)+24(5 3/
f xs 5x4 4x21 g (x7 5x5 x3;1 3
+ 120 \ 6 q+ 2/+ 720 7 5 3 J 3 g = 6a + 54 (6 72 + 540) + 486 (120 720) + 280g =6a+9c+ e+ 4'6- g.
Replacing a, c, e, g by the differences of ux in Stirling's formula, we have
3
(_ 3 uxdx=6a+ 9c+ oe+ i4hug
= 6u0 + 9A2 11_1 + 04 u-2 + 14., A6 71_3
If now A2u_1, 04u_2 and A'u_3 are expressed in terms of u_3, u-2, ... U2, 113i we shall obtain a formula correct to seventh differences. It will be found, however, that the coefficients of the terms are large. This is due to the awkward fraction 1410 multi-
WEDDLE'S RULE 295
plying 611_3. As sixth differences are usually small, the error involved in replacing r`'o-06u_3 by 4?;06u_3, i.e. by -1'',06u_3i will, in general, be negligible. By this substitution the coefficients in the final formula will be much simplified. The modified formula will involve an error of 11,Q6u_3 and will therefore be correct to fifth differences only.
The terms in the expression thus adjusted, namely,
6uo + 90211_1 + 'It; A4u_2 + A4u-3, may be collected as shown in the following table:
| |
u_3 |
u_2 |
u_1 |
u0 u, |
u2 |
U3 |
|
6u, |
|
|
|
6o |
|
|
|
|
9J21L-1 |
|
|
9'0 |
i8o |
9.0 |
|
|
|
3'3-1'U2 |
|
3'3 |
13.2 |
19.8 |
13'2 |
3'3 |
|
|
3~eu~ |
3 |
-1.8 |
4.5 |
6o |
4.3 |
-1.8 |
'3 |
|
Total ... |
3 |
1.5 |
3 |
1.8 |
3 |
1.5 |
3 |
3
Therefore J_3uxdx is approximately equal to
311_3 + I.511_2 + /311_1 + I 8110 + 3111 + 1'5112 + 3113
113- [(u-3 + 113) + 5 (11-2 + u2) + (u-1 + ul) + 6u0~
.6
or I0 usdx = [(no + 116) + 5 (ul + 115) + (112 + u4) + 6u3].
This is WVeddle's rule.
(4) Hardy's formulae.
Certain approximate formulae due to G. F. Hardy have been used extensively in actuarial work.
Let ux = a + bx + cx2 + dx3 + ex4 + fx5
3
so that ( uxdx = 6a + 18c + !Is. -3
110 = a; 11_2 + 112 = 2a + 8c + 32e; 11_3 + 113 = 2a + 18c + 162e.
Solving for a, c and e, and substituting:
((3
J uxdx = 2.2110 + I.62 (u-2 + 112) + 28 (u_3 + u3) -3
296 INTEGRAL CALCULUS (6
or I uxdx = 2'2u3 + I.6z (u1 + u5) + 28 (u0 + ZZ6) o
= '28 (u0 + u6) + I.62 (u1 + u5) + 2.2u3 ,
which is Hardy's "formula (37)."
If the interval of differencing be n, this becomes
r6n l
J uxdx =n{'28(u0+n6n)+I.62(un+ u5n)+2.2%n). 0 Similarly
12n
uxdx = n {.28 (u6n ± u12n) + I.62 (u7n + u11,,) + 2'2ugn},
on
and so on. Since
6n 12, 18n
J uxdx = J uxdx + J uxdx + J uxdx + ... , 0 0 6n 12n Jusdx = n {28 (Ito +2u6n+2Z112n+ )
+ 1'62 (un + u5n + u7n + ...) + 2'2 (u3n + uyn + ...)}. This is Hardy's "formula (38)."
If we are dealing with functions derived from a mortality table, and we choose n so that 7n falls just within or just without the limits of the tableso that 7n is in fact about toowe can write the above formula thus :
J0uxdx = n (.28% + I.62un + z2u3n + I.62u5n + 56upn + I.62u7).
0
In this form the formula is known as Hardy's " formula (39a)." (See G. King, Life Contingencies, p. 488.)
Note. Since all these approximations give in effect the area of the curve bounded by ux, the limiting ordinates and the x-axis, they are often termed "quadrature formulae," the word "quadrabeing defined as the exact or approximate calculation of the area of the square equal in area to that of the given figure.
6. Practical applications of the formulae.
Integration formulae of the above type may be used to advantage in obtaining approximations to the values of certain complicated forms of integral which occur in the theory of life contingencies.
PRACTICAL APPLICATIONS 297
Where the functions involved are such that the summation extends to the end of the life table, it is customary to calculate the values of the functions by Hardy's formula (39a). If, on the other hand, the upper limit is at an age-point short of the limiting age of the life table, any of the simpler integration formulae can be employed to advantage. (See, for example, Spurgeon, Life Contingencies, pp. 262, 263.)
It is evident that we can neither integrate the function ux nor interbetween given values of ux if we are absolutely without informaregarding the value of the function between the ordinates. For example, in a mortality table we may integrate between, say, 133 and 131, assuming first differences constant, for we know that in general the decrements between ages 30 and 31 are small and may be fairly conas being evenly distributed over the interval. We could not reasonably adopt the same assumption for interpolation between to and 11, since the deaths of infants in the first year of age are not evenly distributed over the year. Again, a reliable estimate of the population of a seaside resort in June of any calendar year would not be obtained by a first difference interpolation between the population figures for January and December of the same year. In the two latter illustrations further information would be necessary before we could proceed to interpolation or integration.
In applying the formulae of approximate integration to the solution of a problem it is therefore essential that we have sufficient knowledge of the function to justify our assumptions regarding the nature of the curve passing through the given points.
When the function to be integrated is one in which the slope of the curve is gradual between the limits of the integration, almost any of the above formulae will give satisfactory results. If the values of the function are changing rapidly between the limits there may be considerable differences between the approximate results. The following examples will show this clearly.
Example 2.
The formula for a continuous annuity-certain is _ (
a,-.1 = m
I e-Ft dt,
.o
where 8 is the force of interest corresponding to the rate of interest i at which the annuity is valued.
298 INTEGRAL CALCULUS
By means of an approximate integration formula find the value of aE at 4 per cent.
Since we have to evaluate a,, we shall require some or all of the values
of ut from t = o to t = 6, where ut = e81.
By a well-known formula in the theory of interest,
eat = vt, where v = (1 + 01.
Values of vt at 4 per cent. are available from any tables of interest. We have, therefore:
u° = e° = 1,
111 = e_s = v = '96154,
112 = e21 = v2 = '92456, u, = e38 = v3 = '88900, u, = e41 = v4 = '8548o, us = e55 = v5 = '82193, 116 = e63 = vs = '79031.
The following results will be obtained:
- (i)Simpson's rule applied three times :
3 (u0+4111+ 2112+4113+ 2U4 +4115 + u6)
= 5.34630.
(ii)The three-eighths rule (with interval of differencing 2):
8 {Z (Tt° + 3112 + 3U4 + u6)}= 5'34629.
(iii)Weddle's rule:
'3 (u0 + 5111 + u2 + 6113 + 114 + 5U5 + u°)= 5'34631.The result obtained by integrating ~Ge 5t dt will be L 8 est]s orv°-v608 , and since the value of 8 at 4 per cent. as given by an interest table is '039221, the integral_ 1 '79031 _ '20969'039221039221 = 5'34637
The three approximate formulae all give results differing from this in the fourth or fifth place of decimals. The excellence of the approximais due to the fact that the slope of the curvey = ut = e -8t is gradual between the limits t = o and t = 6.
THE EULER-MACLAURIN EXPANSION 299
Example 3.
Evaluate 's dx (i) exactly, (ii) by an approximate integration
o (1+4x)2
(1) .10 (1+4x)2L (1+4x)]0 = 4(I21) =24. (ii) By Simpson's rule applied three times, we obtain 237458;
| the three-eighths rule |
2'43249 |
| Weddle's rule |
2.36382. |
It will be seen that the differences between these results and the accurate result are considerable. None of the values is correct to more than the first place of decimals, and, moreover, the errors are not all in the same direction. The function y = (1 + 4x)-2 is changing rapidly over the given interval, and it is not to be expected that any of the above formulae will give a near approximation.
7. The Euler-Maclaurin expansion.
We have considered quadrature formulae in which the number of ordinates used is a simple multiple of the first two or three natural numbers, 2, 3, 4. We will now proceed to investigate the problem of approximate integration more generally.
The basic formula for expressing a definite integral in terms of given ordinates is the Euler-Maclaurin expansion. This expansion is in effect of similar form to the expressions already obtained, a greater degree of accuracy being ensured by the addition of funcnot of the ordinates themselves, but of derivatives of certain of the ordinates.
The formula may be derived by the expansion of operators, thus : x=n-1
f(x)=f(o)+f(1)+...+f(ni)
x=0
= F (n) F (o), where f (x) is AF (x).
Since f (x) = AF (x),
F (x) = A-11-(x)
= (eD 1) 1 f (x) (since 1 + A = eD. Chap. xi, para. 13)
D2 D3
= [(I + D + 2~+-+...) I f(x) 3
formula.
300 INTEGRAL CALCULUS
I D D2 ...1
I+2,-3,+ f(x) D2
=D-1 1 ~ d
... f(x)
720 D D3
D-1 _+ 12 {
720 ... J (x)
D3
= D-1 f (x) if (x) + 12 f (x) 720 f (.x) ...
i = Jf(x) dx f (x) + i z df dx~) 72i 0 d (x) ... . Between limits o and n, we have therefore
F (n) F (o) = Jf(x) dx {f (n) f (o)} + { f' (n) f' (o)}
h If"' (n) -f'" (o)} ... .
For F (n) F (o) we may write
x=n-1
f (x) or f (0) +f (I) +f (2) + ... + f (n 1).
Jf(x)f(0)+f(1)+f(2)f(n_1){f(n)_f(0)}
~'i {f' (n) f' (0)} + -7-i u If , (n) -f' (0)1...
=2f(o)+f(I)+f(2)+...+f(nI) +f(n)
, If ' (n) -f' (0)} + If"' (n) -f "' (o)} ... .
This is a simple form of the Euler-Maclaurin expansion.
A more general form can be obtained by changing the origin to the point a and the unit of measurement to r, thus :
Ja+nr r a f(x)dx= f(a)+f(a+r)+f(a-; 2r)+... +f (a+n ir)+2f(a+nr) i2{f'(a+nr)f'(a)}
r3
+ o { f,,, (a + nr) f,,, (a)} ....
It will be noted that, since
s
x2 x4 = I Zx+B12~B2 , +B3hj ... 4
x
ex
(Chap. xi, p. 185),
ILLUSTRATIVE EXAMPLES 30I
we can express the coefficients in (eD 1)-1 in terms of Bernouilli's numbers. As, however, the resulting approximation formula is to be used for numerical computation, it is of advantage to give the coefficients their actual numerical values.
8. The following examples are illustrative of the use of the formula.
Example 4.
Evaluate I oI I dx x to five places of decimals.
Choose a convenient unit, say oI. Then in the Euler-Maclaurin
expansion we have
a = o, n = Io, r = oI, and ux =
I + x'
dux d3 u 6
dx - - (I + x)2' dx3 = (1 + x)4'
I 11 dx I I I I I I I I 0'I- .IO I +x 2.I+II+I22+~ 3 +...+ I9+2 2
- —*°O2 -22+j2J+0200I[ 6 +6]...
1I3I1550000 120 ' 4 + 72000016 '90909
83333
.76923
71429
- rIdx = 69315 to five places of decimals. 01+x
This agrees with the true result (loge 2) to the required degree of accuracy.I
302 INTEGRAL CALCULUS
Example 5.
Find the sum of the fourth powers of the first n natural numbers by means of the Euler-Maclaurin formula.
In the formula
1 fa+nr
ux dx = sua + ua+r + ua+2r + r
r
+ ?ua+nr 12 (ua+nr 11'a) + 720 (1la+nr 11 a) ,
put a =r 0, r = I, ux = x4; then
1 ~t x4dx = 14 + 24 + 34 + ...
I
+ (n 1)4 + ~n4 1- L4x3]x=n + 720 [4.3.2x]xan
since higher differential coefficients of ux will be zero.
x,5 n r=n I I
I.e. [ ] = E r4 1-n4 4n3 + 24n.
5 o r=1 12 720
r=n I I I I
.. E Y4=-n5+ - n4+-n3--n.
r=1 5 2 3 30
By proceeding on the above lines it can easily be seen that the general formula for the sum of the pth powers of the first n natural numbers is
r=n
E YP= n2'+1+2nP+ pnp-1 I p(pI)(p2)nV-3
r=1 p + 12 720
+p(P 1)(p2)(p -3)(P4)np_5
9. Lubbock's formula.
The previous formulae have been developed for the purpose of relating a definite integral to the sum of a number of ordinates at finite distances apart. We have, in effect, obtained approximate formulae for the value of
Lt h (ua + ua+h + ua+2h + + ub). h-~o
In addition to formulae of this type we can find expressions which enable us to find the value of the sum of a number of orat finite distances apart in terms of the sum of ordinates at greater or less intervals apart. Thus, if for the curve y = ux there are nt unit intervals, so that we have Eux from x = o to x = n1 1,
30,240
LUBBOCK'S FORMULA 303
we may develop a relationship between this sum and the sum when the intervals are, say, h, where nh = 1.
In place of the sum h (ua + ua+h + ua+2h + ... + us), we may consider, without loss of generality, the simpler form when the series commences with u0.
Let there be m ordinates at unit distance apart, and let each of these unit distances be divided into n equal parts, so that the new ordinates are
U0, Ul'n, U2/n, . ur/n, ... uml/n.
Replace 1/n by h for convenience in working.
Then h (uo + uh + Ugh + ... + Umh) =h(I+Eh+E2h+...+Em-h)uo
Em I h
Eh_ Iuo
= h Eh l i (um u0)
1
h1 (I + 0)h I (um u0)
= h (hA + h2A2 + 113:,3 + ...)1 (um Z10)
h _ h'
= h.h101 (1 2 I 0 + I2 102 h2 -1 3 `
I -2'4' Z ... I (um uo)
2 //
= 01 (um uo) 2 (um u0) + 12 I 0 (um u0)
Z2
I
- A2(umZLo)...
24
_ (uo + ul + u2 + ... + Um1) h 2 I (nm uo)
+ h212 I h
(,um Auo) ~4 (A2 um A2 uo) ... ,
24
since A(uo+ul+u2+... +um1) _ (11l+u2+... +um1+um) (u0+u1+... +um1) =UmU0.
I/n2 I
(A2 um A2u0)
24
304
But h = 1/n.
INTEGRAL CALCULUS
n(uo+ul'n+u2,n+... +um-i/n)=(uo+u1+u2+...+um-1)
1/n2 I (um u0) + 1/n
12 (Sum Duo)
or
uo + U1!n + u2,1, ± ... + um1/n = n (uo + u1 + ... + um1)
I n (um u0)+I n2 (Aum0110)I n2 (02 um A2 uo)... 2 I2n 24n
n _ I 2 /
= n (uo + ui + ... + um1) + 2 (um u0) (
nI2n I Um Duo)
This is Lubbock's formula.
The coefficients of higher differences than the second are cumber-some, the terms in A3u and A4u being respectively
(n2 I) (19112 I) (L.3um 03uo)
720 n3
and (n2 48onn2 I) (04 um 041(0).
If the interval of differencing be originally n instead of unity, Lubbock's formula becomes, on changing the unit, u0+u1+112+ ... +umn1
=n(u0 +u +u +...+u(m1)n)+- 2 (umn U0)
n 2n
n2 I (Dumn Duo) + n2 I Z2umn L121(o).... 12n 24n
10. Woolhouse's formula.
Although Lubbock's formula has the advantage that it may be used when the function is not capable of expression as a matheexpansionas for example when the data are based on a
n2 I
+ - (A2um L12uo) .... 2411
WOOLHOUSE'S FORMULA 305
mortality tablethere are disadvantages in adopting the formula. In the first place, if it is necessary to proceed further than second differences the calculations are heavy, and secondly it may happen that the differences converge slowly, so that if we stop at second or third differences we are likely to obtain a result differing considerably from the true value of the function. Where a mathematical function is available an alternative summation formula can be adopted in which differential coefficients replace the finite difin Lubbock's formula.
The formula involving differential coefficients is due to Wool-house, and may be developed directly from the Euler-Maclaurin expansion.
The Euler-Maclaurin expansion is
111
uxdx = 0
2u0 + n1 + 112 + ... + u,n1 +Zuni IZ (um' u0')
1 r ,,,
+720(unZ uo) ....
If the interval of differencing be I/n, the formula becomes
rm I
n J uxdx = (2110 + nl n + u2,n + ... + Zum) (um' u0')
o I2n 0
"' rrr
+72on(11 uo )....
If, however, we multiply both sides of the first expression by n we have
n
n uxdx=n( uo+u1+u2+.zur)(um'110')
j 0 12
n
+ (um"' a') ....
720 0
Equating the two values of n Jndx:
yu0+ul,n+u2+...+ urnIX11(ur'u0')+ 2 (urn"',rr)...
720n
=n(?u0+u1+112+...+gum) n (it '')+'~20(um,,, u0rrr)...
12 F 20
306 INTEGRAL CALCULUS
or
I uo+ul/n+u2/+...+um (uo+um) 12n(um'nog)
+ I (u nr uour)
720113 ...
m
n n(uo+ul+u2+...+um)2(uo+um) _ 12(nn,'nor)
+ ..n u rrr uo''')
~2O(m
Re-arranging : uo+uln+u2n+...+um=n(uo+nl+u2+...+um)
1Z (Ito + um) 112 -
- -(um' _ uo,) + 714 I (ulna' norm) ... .
2I2n720/13If the unit of measurement be changed to n, we haveuo+u,+u2+...+umn=n(uo+un+u2n±...+umn)n I (uo + umn) 112 I (u, ,, uor) + - - I (um,nrrr u0ur) ... ,212720the usual form of Woolhouse's formula.
It should be noted that, by replacing the derivatives of u by their values in finite differences, Lubbock's formula can be obdirectly from the above.
In applying these formulae to certain actuarial functions the values of u,du d3 udx ' (1,0 ' ... at the end of the mortality table will disappear. Woolhouse'sformula may then be written as nnn2n4I= (uo + ul + ...) uo + -uo' 3 uorrr....2n127172011This is a convenient form for expressing the value of a benefit paid at nthly intervals in terms of the values at intervals of a year.11. Other formulae for approximate integration.
It will have been observed that in the formulae of the type of Simpson's, Weddle's, etc., the function of the u's is symmetrical about the central value. If, however, a fixed number of ordinates be given and it is desired to obtain an approximation to the area
APPROXIMATE INTEGRATION FORMULAE 307
of a curve in terms of these ordinates, the resulting form will not necessarily be symmetrical. Again, the formula for the area of the curve may be related not only to ordinates falling within the area to be measured, but to ordinates outside the area. It may be noted therefore that although standard formulae are avail-able, it is not difficult to devise approximations to fit the particular problems under investigation.
Example 6.
If ux is of the form a + bx + cx2, find a convenient formula for
t
f uxdx in terms of Ito, u1 and u2.
The interpolation formula which involves the terms uo, ul and u2 is
ux = 110 + x,110 + 2x (x I) L.2 110.
Jot
r
uxdx = Lxuo + ix2Au0 + (bx3 x2) A2 uol0
=uo+?Atli 11,'02uo J
=110+ (u1u0)3 (u22111+110)
1
= i°-lto + ~,_ ltl 1. 112,
The required formula is therefore
t
10 uxdx=,x;(5110+8111u,).
It should be noted that (i) the expression is unsymmetrical in the u's, and (ii) the ordinate u2 falls without the area to be integrated.
It is obviously impossible to quote all the formulae that are in current use. Further examples and illustrations of various approximation integration formulae will be found in the following sources : Whittaker and Robinson's Calculus of Observations, chapter vli; C. H. Wickens, J.I.A. vol. LIv, pp. 20913 ; A. E. King, T.F.A. vol. Ix, pp. 21831.
12. Alternative methods of proof of the formulae.
It has been stated above that the Euler-Maclaurin expansion is the basic quadrature formula. It will be instructive to develop Simpson's formula from this expansion.
J 2mn n2
uxdx=n(zu0+un+ll2n + . +u(2m1)n +2u2mn) 12 (u'2mn u0') 0
approximately.
20-2
308 INTEGRAL CALCULUS
Writing zn for n, but preserving the same range o to 21nn
j 2nzn 0
4n2
, I2 (u 2mn u0 ).
Subtracting this from four times the first:
r 2)nn
3 J u,dx= 2n(zn0+tun+u2n+2113n+... +u(2m2)n 0
i.e.
f 2nen rz
J 0 uxdx = 3 (u0 + 4un + 2112n + 4113n + ... + 211(2,n_2) n
+ 2u(2m1)n + 2u2mn),
which is Simpson's formula.
Again, we have shown in para. 2 (p. 288) that we may adopt the expression for ux in terms of uo and differences of u0 as the assumed form of function. Since Lagrange's formula is based on the same assumption it is evident that we could obtain any approximate formula by the use of the Lagrange formula.
For example, given uo, u, , u2,
(x I) (x 2) x (x 2) x (x I)
Zl.c = u0 ( I) ( 2) + Zll 1.(- I) + u2 2. I
= 2u0 (x2 3X + 2) + u1 (2x x2) + 2162 (x2 x),
so that
J 2 (x3 ~2 x3 x3 x212
ouxdx= `
[u06 -34 +x)+u1(x2 3)+u2(6 41 0
= 3 (uo + 4u1 + u2), which is Simpson's rule. If we integrate between o and I we shall obtain
)I
uxdx=(5uo+8111u..),
the formula given in Example 6.
uxdx = 211 (2u0 + u2n + Zlln + .. + 11(2m2)n + lu2mn)
+ 4u(2,nI)n + lmn),
EXAMPLES 309
EXAMPLES 17 1. Prove that
i
e 10 dx.
1
- 3.Show that [ ux dx =(fiu1 + 8u° u__1) approximately. Find the approximate mileage travelled between 12.0 and 12.30 by use of the above formula, from the following:
TimeSpeed (m.p.h.)
| 11.50 |
24'2 |
| 12.0 |
35.0 |
| 12.10 |
41'3 |
| 12.20 |
42.8 |
| 12.30 |
39.2 |
4. Prove that, if ux is a rational integral function of x, then ~ea ux dx = aea (I aD + a2 D2 a3 D3 + ...) ux,
where D = d dx'
- Show that the area of a curve, divided into n parts by n + 1 equiordinates u°, ut, .., u,,, is given approximately by the series
nu n2 Au, + {n3 n2} 02 u0 + {n 7t3 + n-i A3 uy_°' z321.24~ 1.2.3to n + 1 terms.
- Between the limits x = o and x = n the functions ux and dux/dx are continuously increasing.
n Show that / u,,. dx is less than 2u° + nEt ux +gun.
- Obtain the approximate formula
/1 uxdx = 13 (ul + u-1) (u3 + u73) 12showing up to what order of differences it holds.2a(aux dx = I° (ux + u2ax) dx,and illustrate the result geometrically. 2. If ux = a + bx + cx2, prove that.43ux dx = 2U2 +(u° 2U2 + u4),and hence find an approximate value for
310 INTEGRAL CALCULUS
5
8. Assuming ux to be of the fourth degree in x, express [ uxdx in .o
terms of u0, u1iu2,u3andu4.
- A plane area is bounded by a curve, the axis of x, and two ordinates. The area is divided into five figures by equidistant ordinates 2 inches apart, the lengths of the ordinates being 21.65, 21.04, 20.35, 19.61, 18.75 and 17.8o inches respectively. Apply the method of integration to obtain an approximate value of the area.
Io. Prove the approximate formulafo(ii) Simpson's rule :Io 2uxdx = (110 + 4111 + 112), applied three times..Iz. Which of the two following formulae would you expect to give .Ithe better approximation for' ux dx? :o
- y {5 (u0 + u4) + 4 (uI + 113) + 18u2},
EXAMPLES 31I
312 INTEGRAL CALCULUS
24. AB is the base of a semicircle, centre 0 and radius unity. The points P and Q bisect OA and OB respectively. The area between the
semicircle, the base PQ and the ordinates at P and Q is 6 + -3. Use
4
Weddle's rule that
.f(x)dx=0'3{f(o)+5f(I)+f(2)+6.f(3)+f(4)+5f(5)+f(6)} to find an approximate value of 7T to three places of decimals.
- 25.The following values of zz . are given:
xoI23456ux.146161176190204217230Use an approximate integration formula to find the value off uxdx. .o
It is found that, for the values given, y = log10 ('05X + P4) fits the data. Verify that this is so by integrating log10 (05.x + 1.4) between the limits o and 6. (loglo e = .4343; log10 P7 = 2304; Iog10 1.4 = 1461.)
26.If ux is a function whose fifth differences are constant, i1 zzx dx
I can be expressed in the form
pu_ + qu0 + put.
Find the values of p, q and a.
Use this formula, after making the necessary changes in the origin and scale, to find the value of loge 2 to four places of decimals from the equation
1Il I + x dx =loge 2.
- 27.Prove that, if a= o,
ra+rJux dx = Iu0 + uI + zr2 + ... + ur_I + ~u,. 11_ (Dur_I Duo) a (02 ur-2 +u0)(. 3ur_3 Q3uo)i u u (A'ur_4 + A4u0) ... .If ux be the function (I + x2)I find an approximate value for i.'3
- 28.Obtain an approximate formula for' uxdx in the form
a (u_2 + u2) + b (u_3 + 1[3)
and find the values of a and b.