Fundamental Actuarial Mathematics/Mortality Models

Learning objectives

edit

The Candidate will understand key concepts concerning parametric and non-parametric mortality models for individual lives.

Learning outcomes

edit

The Candidate will be able to:

  1. Understand parametric survival models, life tables, and the relationships between them.
  2. Given a parametric survival model, calculate survival and mortality probabilities, the force of mortality function, and curtate and complete moments of the future lifetime random variable.
  3. Identify and apply standard actuarial notation for future lifetime distributions and moments, including select and ultimate functions.
  4. Given a life table, calculate survival and mortality probabilities, the force of mortality function, and curtate and complete moments of the future lifetime random variable, using appropriate fractional age assumptions where necessary.
  5. Understand and apply select life tables.
  6. Identify common features of population mortality curves.

Survival distributions

edit

In the model discussed in this chapter, it describes the length of survival (or time until death) of an individual. Thus, the time-until-death random variable will be the basic building block.

Age-at-death random variable

edit

In this section, we will discuss a special case for the time-until-death random variable, in which the time until death is applicable to newborn (i.e. people aged zero). We denote this kind of random variable by  . We can observe that   also represents the age at death, since the age is counted starting from the beginning of life.

Since time-until-death random variable is describing time, it is a continuous random variable. Also, time is nonnegative, so the support (or "domain") of time-until-death random variable is  .

To describe the time until death for newborn, we need to determine the distribution of   completely. There are several ways to do this.

  • cumulative distribution function (cdf):  
  • probability density function (pdf):   if   is differentiable.
  • survival function
  • force of mortality

You should have learnt about cdf and pdf when learning about probability, but may not have learnt about survival function and force of mortality. Thus, we will discuss them here.

Survival function

edit

As suggested by the name "survival function", we may guess that this function is somewhat related to survival. This is actually true. Take the time-until-death random variable   as an example, when the newborn survives for, say,   units of time, what is its probability? It is   (or  , but since   is continuous, it does not matter). This probability corresponding to the input   is actually the survival function, which is defined below:

Definition. (Survival function) The survival function of a random variable   is  .

Remark.

  • As a corollary,  .
  • We can see that   gives the probability for the newborn to die within   units of time.
  • Since   is a nondecreasing function,   is a nonincreasing function.
  • For the survival function of  , we denote it by  , which equals  .
  • If  ,   since the support is a subset of  , so this event "covers" the entire support already.

Example. Recall that the cdf of the exponential distribution with rate   is  . Thus, the survival function of the exponential distribution with rate   is  .

Force of mortality

edit

In financial mathematics, you should have learnt about force of interest, which can be interpreted as the relative rate of change of the amount function, and it is given by   in which the notation has its usual meaning in financial mathematics. Why do we call it force of interest? This is because the interest refers to an increase (or positive change) in the amount.

We may guess that the force of mortality is defined similarly, in the sense that it can also be interpreted as the relative rate of change of something. We know that the interest refers to change in amount, but what change does mortality refer to? Since mortality means the state of being susceptible to death, it refers to the decrease (or negative change) in survival rate, and the "higher" the mortality, the larger decrease in survival rate. Recall that the survival function is related to survival rate (probability of surviving for a certain time) in some sense. So, we can make use of survival function to define mortality.

However, there is a difference between force of interest and force of mortality, namely for force of interest, interest refers to an increase in amount, while for force of mortality, mortality refers to a decrease in survival rate. So the changes are in opposite direction, and thus if we define force of mortality in an exact analogous manner, its value will be negative (the relative rate of change will be negative). To make the force of mortality positive, we can define the force of mortality as follows:

Definition. (Force of mortality) The force of mortality of a random variable   is  .

Remark.

  • For force of mortality of the time-until-death random variable  , it is denoted by  , which equals  .
  • If  , since   always,   and thus  
  • If  , since   is nonincreasing, its derivative is nonpositive, and so the negative of it is nonnegative. Also,   (because the survival function is a probability). Therefore,  .
  • The force of mortality of   at time   shows the relative rate of decrease in the survival rate of the newborn at time  .

Example. Show that  , assuming   is differentiable.

Solution:  

Remark.

  • We can interpret   (roughly) as the conditional probability of the newborn to die "instantaneously" at time  , given that the newborn survives for   time units, since   (  is small and close to zero).
  • This interpretation is intuitive, since the "instantaneous" death rate at time   may be interpreted as the relative rate of decrease of the survival rate at time   (when the "instantaneous" death rate is high at that time point, the relative survival rate at that time point diminishes a lot).
 

Exercise.

What is the force of mortality   of the exponential distribution with rate  ?

 
 
1
-1


After that, we will introduce some propositions related to the survival function and force of mortality.

Proposition.  

Proof.   as desired.

 

Proposition.  

Proof. For simplicity of presentation, we will have some abuse of notations (the infinity is in limit sense) in the following, but the reasoning is still understandable.  

 

Example. Given that   follows the uniform distribution with support  , what is the force of mortality of  ,  ?

Solution: We have  . So,  

 

Exercise.

Suppose instead the force of mortality of  ,  . What is the probability for the newborn to die within 10 years?

0.368
0.393
0.607
0.632




Future lifetime of a life aged x

edit

Now, we extend our discussion from future lifetime of a life aged zero (a newborn) to a life aged   ( ). For simplicity of presentation, we denote a life aged   by  .

Remark.

  • When we say "a life aged  ", we mean the life is aged exactly  , i.e. the life just reaches age   (birthday of the life), but not, say aged  ,  , etc, which are usually also referred as "aged  " in our daily life.

Similarly, we denote the future lifetime of   by   (recall that we denote the future lifetime of   (newborn) by  ). We define the distribution of   mathematically (and quite naturally) as the conditional distribution of  , given that  .

To understand this, consider the following reasoning: Refer to the following timeline:

                  death
    x       T_x   |
|---------|-------v
-------------------------
0         x                 t
|-----------------|
      T_0

We can observe that   if   (or  , but since   is continuous, it does not matter). So, if  , then  .

On the other hand, if  , we have the following timeline:

      death       
    x   |       
|-------v-|
-------------------------
0         x                 t
|-------|
   T_0

In this case,   does not exist, since the person does not survive for   years, and thus will never be age  , so there is not  , and therefore there is not  , future lifetime of  . This shows the necessity of the condition  .

From this definition, we have  ,  , etc.. This is quite important since it is the basis for the calculations of probabilities related to  .

For the pdf, cdf and survival function of  , we have similar notations as follows:

  •  : pdf of  
  •  : cdf of  
  •  : survival function of  

In particular, we have some special actuarial notations for the cdf and survival function, as follows:

  •  
  •  

In actuarial notations, " " often refers to something related to death, while " " often refers to something related to survival. In this context, this holds since   refers to the probability for   to die within   time units, and   refers to the probability for   to survive for   time units.

For simplicity, if  , we write   as   and   as  .

Using the relationship between   and  , we can develop some useful formulas for   and  , as follows:

Proposition.   and  .

Proof. First, we have  , in which   since   ( ), and so  , and thus   is a subset of  .

It follows that  .

 

We can also express the pdf of   as follows:

Proposition.  .

Proof. We have  

 

Remark.

  • Intuitively (and roughly),   gives the probability for   to survive for   time units and after that,   becomes  , and   gives the probability for   to die "instantaneously" at time  , given that the person survives to   time units. Multiplying   and  , it means the same as the (rough) interpretation of  :   die very shortly after (or "exactly at") time  .

Example. It is given that the survival function of newborn is  .

(a) Calculate   and  . Hence, determine whether  .

(b) Calculate  . Hence, calculate  .

Solution:

(a)   and  . Since  ,  .

(b) Since  ,  . So,  .

 

Exercise.

Find  .

 
 
 
 



We have a special notation for the probability for   to die between ages   and   ( ), namely   (we use " " here since this is related to death). Thus, we have by definition  . We have the following proposition for another formula of  .

Proposition.  .

Proof.  

 

Remark.

  • For proving formulas like this, it is generally better to change all " " to " " in the intermediate steps since " " is usually better to work with than " ".
  • To understand this more intuitively,   can be interpreted as the probability for   to survive for   time units, given that   survives for   time units, and   can be interpreted as the probability for   to die within   time units, given that   survives for   time units. Therefore, multiplying these two probability yields the probability for   to die within   time units, and survive for   time units, given that   survives for   time units.
  • This argument corresponds to the   in the above proof.
  • If we denote the above blue event as  , orange event as  , and purple event as  , we can represent the above argument using probability notations:  .
  • When you try to prove this equality, you can observe that this is equivalent to the   in the above proof.
  • Similarly, we denote   by   for simplicity.

Example. It is given that the survival function of newborn is  .

(a) Calculate  .

(b) Calculate  .

(c) Are the answers in (a) and (b) the same?

Solution:

(a)  

(b)  

(c) They are not the same.

 

Exercise.

1 Which of the following expression(s) represent(s) the probability for   to die between age 10 and 12?

 
 
 
 
 

2 Which of the following expression(s) represent(s) the probability for   to survive between age 10 and 12?

 
 
 
 
 



Curtate-future-lifetime of a life aged x

edit

The curtate-future-lifetime is just like the future lifetime in previous sections, except that it is discrete.

Definition. (Curtate-future-lifetime) The curtate-future-lifetime of  , denoted by  , is  , which is the floor function of  .

Remark.

  • Hence, the support of   is set of all nonnegative integers.

Similarly, we would like to completely determine the distribution of  , as in the case for  . We can do this using cdf or probability mass function (pmf). Its pmf is given by the following proposition.

Proposition. (Pmf of  ) The pmf of   is  .

Proof. The pmf of   is  

 

Proposition. (Cdf of  ) The cdf of   is  .

Proof. The cdf of   is  

 

Example. It is given that the survival function of newborn is  .

(a) Calculate the probability for   to die within 10 years by considering  .

(b) Calculate the probability for   to die within 10 years by considering  .

(c) Which probability, that in (a) or that in (b), is larger?

Solution:

(a) The probability is  .

(b) The probability is  

(c) The probability in (b) is larger.

 

Exercise.

1 Which of the following is (are) the survival function of  ?

 
 
 
 

2 Calculate the probability for   to survive for 20 years by considering  .

0.65
0.7375
0.75
0.9875




Life tables

edit

In a life table, the values of   and other functions for different (integer) ages   are tabulated. The values are assumed to be based on the survival distribution discussed in previous sections. In this section, we will discuss more functions appearing in a life table.

In previous sections, we have discussed time-until-death random variable for one person, and we will consider multiple people here. Suppose there are   newborns. Let the indicator function   Also let   be the sum of all such indicator functions  , i.e.  . We can interpret   as the number of survivors to age   for the   newborns.

We denote the expected value of   by  .

Proposition.  .

Proof. Since   (this is true for each life  , since the survival distribution for the future lifetime of different lifes are assumed to be the same),   equals  .

 


As a corollary,   (  is constant with respect to  ). Also,  . Also, we can use   to calculate probabilities like   and  , as follows:  , and thus  . In a later section in which selection age is involved in the life table (select table), we will use these formulas to calculate these probabilities from such life table, to incorporate the effect of selection.

We have discussed about the number of survivors to age  , and we will discuss the "opposite thing" in the following, namely the number of deaths to age   (i.e. between age 0 and  ), or in general, between age   and  .

We denote the expected value of such number of deaths by  .

Proposition.  .

Proof. We can define another indicator function for this context similarly (with value 1 if life   dies between age   and age   and 0 otherwise). Then, the expected value of each indicator function equals the probability for one of the newborns to die between   and  , which is  . With similar reasoning as in above, we have  .

 

Remark. Similarly, we write   as   for simplicity.

Apart from the life table functions   and   which are related to the expectation of number of survivors and deaths respectively, we will also discuss two more life table functions, that is related to the expectation of lifetime.

Example. Suppose   and the survival function for newborn is  .

(a) What is  ?

(b) What are   and  ? Are they equal?

Solution:

(a)  . (That is, the expected number of survivors to age 80 is 0.)

(b)   and  . They are equal.

Remark.

  • For (b), indeed,   for each  , since   for each  .
 

Exercise. It is given that  .

1 What is  ?

0.143
0.333
0.667
0.857

2 Calculate (i)  ; (ii)  .

(i) 25; (ii) 20
(i) 20; (ii) 20
(i) 20; (ii) 25
(i) 25; (ii) 25


3. (a) Calculate   in this case ( ). (b) Hence, show that   in all cases. (c) Using (b) directly, show that   in all cases.

Solution

(a)  

(b)

Proof. From (a), we can observe that   in all cases. Hence,   where the last equality follows from a previous proposition.

 


(c)

Proof. From (b), we have  

 


There are two types of expectation of life: one is discrete and another is continuous, and they are called curtate-expectation-of-life and complete-expectation-of-life respectively.

Definition. (Complete-expectation-of-life) The complete-expectation-of-life of  , denoted by  , is  .

Definition. (Curtate-expectation-of-life) The curtate-expectation-of-life of  , denoted by  , is  .

Proposition.  .

Proof. We will use integration by parts.   Now, it suffices to prove that   and this is true since   and   as  , so this limit either equals   or 0. However, since the expected value exists (i.e. does not tend to infinity, or else the expectation of life does not make sense), this limit cannot equal  , and so this limit is 0.

 

Remark.

  • With a similar proof, we can also prove that  .
  • Thus, the variance  .

Proposition.  .

Proof. The previous proposition about   uses integration by parts in the proof, and we can analogously use summation by parts (may be interpreted as a discrete analogue of integration by parts) in the proof. However, there is a simpler way to prove this proposition, where the summation is "split" appropriately:   We can observe that this sum and the sum in the proposition represent the same thing, and thus the result follows.

Proof using summation by parts

The idea in this proof is similar to the proof for the proposition about  , except that the integration by parts is replaced by summation by parts (which can be interpreted as the "discrete version" of integration by parts). The proof for summation by parts is omitted here. The details is as follows: First, define the forward difference   for the summation by parts (this is analogous to " " in the integral). Then,   In particular, we have  .

Again, it suffices to prove that  , and this is true for a similar reason: existence of the expected value  .

 

Remark.

  • With a similar proof, we can also prove that   (for summation by parts method).
  • Thus, the variance  .
 

Exercise. Prove that   using the "sum splitting" method.

Solution

Proof.  

 


The following are recursion relations for   and  , which can be useful when we want to find the complete/curtate-expectation-of-life of   given the expectation of a life with some other ages, say   and  .

We will state the recursion relations as a form of proposition, and then prove them formally. After the proof, we will try to give some intuitive explanations about the recursion relation for  .

Proposition.  .

Proof.   In particular, we have  .

 

An intuitive explanation of this recursion relation is as follows:

  • for LHS,   is the curtate-expectation-of-life of  ;
  • for RHS,   is the curtate-expectation-of-life of  , and we want to "transform" it to the expectation of  . The first step is adding 1 to it, since this is the expectation with respect to  , but we want the expectation from the perspective of  , which is 1 year younger. But only this step is not enough, since " " assumes the life already lives for   years, but for  , the life is only assumed to live for   years. Hence, we also need to multiply the probability for   to live for one year,  , to "get to"  .
  • Now, "the expectation of life from age   onward" is done through  . How about "the expectation of life from age   to age  "? Indeed, when the life dies within age   and  ,  . This means such "expectation of life" is zero.

Proposition.  .

Proof.  

 

Example. Given that  , we have  . In particular,   has nonzero value when  , and   has nonzero value when  . Also,   holds only when  . It follows that the bounds on   and   are given by   and  .

 

Exercise. It is given that  .

1 Calculate  .

0.5
0.75
1
1.25
1.5

2 Calculate  .

0.5
0.75
1
1.25
1.5



Assumptions for fractional ages

edit

Previously, we have discussed the continuous random variable   and discrete random variable  . A life table can specify the distribution of   since the values of   for different integer   can be obtained from the life table. However, the life table is not enough to specify the distribution of  , since we do not know the value of   when   is not an integer. Thus, in order to specify a distribution of   using a life table, we need to make some assumptions about the fractional (non-integer) ages.

In actuarial science, three assumptions are widely used, namely uniform distribution of deaths (UDD) (or linear interpolation), constant force of mortality (or exponential interpolation), and hyperbolic (or Balducci) assumption (or harmonic interpolation). We will define them each using survival functions, as follows:

Definition. (Uniform distribution of deaths) Uniform distribution of deaths assumption assumes  

Definition. (Constant force of mortality) Constant force of mortality assumption assumes  

Remark.

  • The equation can be alternatively stated as  .

Definition. (Balducci assumption) Balducci assumption assumes  

Remark.

  • It is also called hyperbolic assumption.

Under UDD assumption, we have some "nice" and simple expressions for various probabilities related to mortality. We can obtain those expressions by substituting   by the RHS of the equation mentioned in the assumption.

Example. Under UDD assumption, we have, when  ,  

 

Exercise. Show that under UDD assumption, when  ,  .

Answer

Under UDD assumption, when  ,   In particular, we differentiate with respect to   instead of   since only   is varying, and   should be fixed at an selected integer. Also, we use the result in the previous example:   in the last step.


For the three assumption mentioned, there is a particularly "nice" and simple result for each of them, and we may use those "nice" results for the calculation in practice, rather than applying the definitions. The "nice" result for UDD assumption is mentioned in a previous example: when  ,  . The "nice" results for the other two assumptions are as follows:

Theorem. (Notable property for constant force assumption) Under constant force assumption,  .

Proof.  

 

Theorem. (Notable property for Balducci assumption) Under Balducci assumption,  .

Proof.  

 

An interesting result under the UDD assumption is related to the independence of two random variables.

To simplify the notations, from now on, we let   and   unless otherwise specified.

Define a continuous random variable   by  . That is,   is the random variable representing the fractional part of a year lived in the year of death of  . For example, if  , then   lives for half year in the year of death.

Then,   and   are independent under UDD assumption. This is because   under UDD assumption. Also, we can observe that the cdf of   is  , which is the cdf of uniform distribution with support  . This means   follows the uniform distribution on   under UDD assumption. Hence,   and  . This gives rise to results under UDD assumptions:

  •  .
  •  .

These two results give us an alternative way to calculate the mean and variance of   where only discrete things are used in the calculation. However, we should be careful that these results hold under UDD assumption, so we cannot use these results without UDD assumption.

 

Exercise. (Previous exercise revisit) From a previous exercise, we are asked to calculate   and   given  . The following questions are some further questions on this exercise.

(a) Show that the UDD assumption is satisfied with this survival function.

(b) Verify that  .


Solution
(a)

Proof. Since when  ,   and   the UDD assumption is satisfied with this survival function by definition.

 

(b) From the previous exercise, we have   and  . Hence,  .


Laws of mortality

edit

In this section, we will introduce some simple laws of mortality (i.e. some specified distributions for mortality). Some of these laws may be appropriate to model the human mortality for some ages, but it is commonly believed that none of these laws is appropriate to model the human mortality for all ages.

Indeed, if we want to model the human mortality using some probability distribution, we may need a mixture of distributions, since the mortality should be distributed in a different manner when human is in different ages, and thus different distributions should be used in different ages. To choose a suitable distribution for some ages, we may investigate the shape of corresponding empirical distribution based on the actual human mortality data, and pick a suitable distribution accordingly. For example, if the mortality increases exponentially for some ages, then we may select a distribution for which the force of mortality also increases exponentially.

In practice, to calculate the probabilities related to human mortality, we usually use life tables for the calculations. This is usually the case for insurance companies. Each insurance company has its own life table, based on the mortality data of, possibly its clients. Since such life table is constructed using the actual human mortality data based on past experiences, the life table is usually deemed to be more accurate than a specific distribution.

Nevertheless, having a law for mortality allows us to simplify the calculation of the probabilities related to mortality.

Definition. (de Moivre's law of mortality) The de Moivre's law of mortality has the force of mortality   and the survival distribution  , with   (  is called the limiting age, i.e. an human must die (strictly) before that age).

Remark.

  • This is the simplest law among the laws we discuss.
  • Indeed, in this law, the mortality follows uniform distribution on  , since the cdf is  , which is exactly the same as the cdf of uniform distribution on  .
  • Hence, the pdf  .
 

Exercise. Verify that if the force of mortality is   ( ), then the survival distribution   ( ).

Solution

When   where  , we have  

Example. Show that under de Moivre's law of mortality,   (where  ), with certain bounds on  .

Proof. Under de Movire's law of mortality, we have   with certain bounds on  .

 

 

Exercise.

1. Hence, show that   under de Moivre's law of mortality.

Solution

Proof. By above, we have   under de Moivre's law of mortality. It follows that  

 


2. What are the bounds on   in the above example?

Solution

The bounds on   are given by  , since for  , the bounds are  . But   since   represents time. So, the bounds on   are given by  .

3. Considering the above example, which distribution does   follow?

Solution

From the above example, the cdf  . So,   follows the uniform distribution on  .

Remark.

  • From this, we can see that when the age   increases, the type of distribution does not change (still uniform), but the right endpoint of the support decreases.


4. Hence, show that  .

Solution

Proof. Since   follows the uniform distribution on  , it follows that the mean  

 



Definition. (Gompertz's law of mortality) The Gompertz's law of mortality has the force of mortality   and the survival distribution  , with   and  .

Definition. (Makeham's law of mortality) The Makeham's law of mortality has the force mortality   and the survival distribution  , with   and  .

Remark.

  • The Makeham's law of mortality is a generalization of the Gompertz's law of mortality, since these two laws are exactly the same when  .

Definition. (Weibull's law of mortality) The Weibull's law of mortality has the force mortality   and the survival distribution  , with   and  .


Select and ultimate table

edit

When a person purchases a life insurance policy offered by an insurance company, he needs to give some personal information to the insurance company, e.g. some information about his health status. For the insurance company to decide whether it should sell the policy to the person, those information provided by that person is accessed through the process of underwriting. For underwriting, the underwriter check the information to see whether the risk in insuring that person is appropriate.

Without underwriting, it is likely that people will only purchase life insurance policy when they think they will die soon (e.g. they have a very serious disease), so that they will likely have early claims. In this case, the insurance company may need to pay a large amount of money and suffer a great loss in a short time, and then bankrupt. This shows the necessity of underwriting.

Basically, the "select" in the section title arises from the underwriting process, and when we say an individual is "selected" at age  , he is underwritten at age   (so the most recent information about the individual is known). Since there are some new information about the individual when he is underwritten (or selected), we will expect there is some update in his survival distribution, and therefore his probabilities related to mortality will change as well. Because of this, we need to have some changes in the actuarial notations, depending on the selection age.

In such actuarial notations, we usually add square brackets around the age at selection, and the numbers in the subscript change accordingly. For example,   becomes   if the selection age is 25,   if the selection age is 12.

Since when a person is underwritten long time ago, he may have poorer health condition (e.g. getting older having some new diseases) from the time at which he is underwritten to now, we will intuitively expect that the longer time passed from the time at which a person aged   is underwritten, the more likely that person will be die in the coming year. That is,   [1].

The impact on the survival distribution from the selection age may decrease when the time passed from the selection is longer. Beyond a certain time period, say   years, the " "'s at the same attained age (i.e. selection age plus the time passed from it to now) but different selection ages will be very close. In other words,   (the condition on   is to ensure that the selection age at LHS  ) [2]. Such   years is called the select period. Because the " "'s mentioned above will be very close, all such " "'s will only be written as  , without any square brackets (since the effect of selection is basically "gone", and so the square brackets are gone). For example, if the selection period is 2 years, then   and   will be both written as   simply. However, we will not write   and   as  , since these two " "'s are "quite different" because of the selection impact is still "quite large".

The following are some terms related to life table.

  • An aggregate table is a life table in which the functions are only given for attained ages.
  • A select table is a life table in which some function involves the age at selection.
  • An ultimate table is usually appended to a select table as a last column to reflect the setting of select table. The combination of a select table and an ultimate table in such a way is called a select-and-ultimate table.

For example, an excerpt of select-and-ultimate table with select period 2 years may look like:

Select-and-ultimate table with 2-year select period
Age at selection,        
0      
1      
2      
... ... ... ...

The last column is the ultimate table. We can observe that we do not need additional columns for   etc., since we can already get such values in the " " value in a different row (with different  ).

Given a select-and-ultimate table, we can do various calculations based on it.

Example. You are given the following except of a (hypothetical) select-and-ultimate table with 4-year select period:   Then,  ,  , and   [3] (since the select period is 4 years).

 

Exercise.

Which of the following probability (or probabilities) cannot be calculated from the above select-and-ultimate table?

 
 
 
 




  1. The number inside square brackets is integer since it refers to age.
  2. (Optional) To be more precise, the selection period is the smallest integer   such that   (  is a small positive constant. When it is smaller, the requirement is "stricter" and the " "'s will be "closer".) for each  .
  3. The value of   is from the   in the row for  .