Fundamental Actuarial Mathematics/Mortality Models
Learning objectives
editThe Candidate will understand key concepts concerning parametric and non-parametric mortality models for individual lives.
Learning outcomes
editThe Candidate will be able to:
- Understand parametric survival models, life tables, and the relationships between them.
- Given a parametric survival model, calculate survival and mortality probabilities, the force of mortality function, and curtate and complete moments of the future lifetime random variable.
- Identify and apply standard actuarial notation for future lifetime distributions and moments, including select and ultimate functions.
- Given a life table, calculate survival and mortality probabilities, the force of mortality function, and curtate and complete moments of the future lifetime random variable, using appropriate fractional age assumptions where necessary.
- Understand and apply select life tables.
- Identify common features of population mortality curves.
Survival distributions
editIn the model discussed in this chapter, it describes the length of survival (or time until death) of an individual. Thus, the time-until-death random variable will be the basic building block.
Age-at-death random variable
editIn this section, we will discuss a special case for the time-until-death random variable, in which the time until death is applicable to newborn (i.e. people aged zero). We denote this kind of random variable by . We can observe that also represents the age at death, since the age is counted starting from the beginning of life.
Since time-until-death random variable is describing time, it is a continuous random variable. Also, time is nonnegative, so the support (or "domain") of time-until-death random variable is .
To describe the time until death for newborn, we need to determine the distribution of completely. There are several ways to do this.
- cumulative distribution function (cdf):
- probability density function (pdf): if is differentiable.
- survival function
- force of mortality
You should have learnt about cdf and pdf when learning about probability, but may not have learnt about survival function and force of mortality. Thus, we will discuss them here.
Survival function
editAs suggested by the name "survival function", we may guess that this function is somewhat related to survival. This is actually true. Take the time-until-death random variable as an example, when the newborn survives for, say, units of time, what is its probability? It is (or , but since is continuous, it does not matter). This probability corresponding to the input is actually the survival function, which is defined below:
Definition. (Survival function) The survival function of a random variable is .
Remark.
- As a corollary, .
- We can see that gives the probability for the newborn to die within units of time.
- Since is a nondecreasing function, is a nonincreasing function.
- For the survival function of , we denote it by , which equals .
- If , since the support is a subset of , so this event "covers" the entire support already.
Example. Recall that the cdf of the exponential distribution with rate is . Thus, the survival function of the exponential distribution with rate is .
Force of mortality
editIn financial mathematics, you should have learnt about force of interest, which can be interpreted as the relative rate of change of the amount function, and it is given by in which the notation has its usual meaning in financial mathematics. Why do we call it force of interest? This is because the interest refers to an increase (or positive change) in the amount.
We may guess that the force of mortality is defined similarly, in the sense that it can also be interpreted as the relative rate of change of something. We know that the interest refers to change in amount, but what change does mortality refer to? Since mortality means the state of being susceptible to death, it refers to the decrease (or negative change) in survival rate, and the "higher" the mortality, the larger decrease in survival rate. Recall that the survival function is related to survival rate (probability of surviving for a certain time) in some sense. So, we can make use of survival function to define mortality.
However, there is a difference between force of interest and force of mortality, namely for force of interest, interest refers to an increase in amount, while for force of mortality, mortality refers to a decrease in survival rate. So the changes are in opposite direction, and thus if we define force of mortality in an exact analogous manner, its value will be negative (the relative rate of change will be negative). To make the force of mortality positive, we can define the force of mortality as follows:
Definition. (Force of mortality) The force of mortality of a random variable is .
Remark.
- For force of mortality of the time-until-death random variable , it is denoted by , which equals .
- If , since always, and thus
- If , since is nonincreasing, its derivative is nonpositive, and so the negative of it is nonnegative. Also, (because the survival function is a probability). Therefore, .
- The force of mortality of at time shows the relative rate of decrease in the survival rate of the newborn at time .
Example. Show that , assuming is differentiable.
Solution:
Remark.
- We can interpret (roughly) as the conditional probability of the newborn to die "instantaneously" at time , given that the newborn survives for time units, since ( is small and close to zero).
- This interpretation is intuitive, since the "instantaneous" death rate at time may be interpreted as the relative rate of decrease of the survival rate at time (when the "instantaneous" death rate is high at that time point, the relative survival rate at that time point diminishes a lot).
Exercise.
After that, we will introduce some propositions related to the survival function and force of mortality.
Proposition.
Proof. as desired.
Proposition.
Proof. For simplicity of presentation, we will have some abuse of notations (the infinity is in limit sense) in the following, but the reasoning is still understandable.
Example. Given that follows the uniform distribution with support , what is the force of mortality of , ?
Solution: We have . So,
Exercise.
Future lifetime of a life aged x
editNow, we extend our discussion from future lifetime of a life aged zero (a newborn) to a life aged ( ). For simplicity of presentation, we denote a life aged by .
Remark.
- When we say "a life aged ", we mean the life is aged exactly , i.e. the life just reaches age (birthday of the life), but not, say aged , , etc, which are usually also referred as "aged " in our daily life.
Similarly, we denote the future lifetime of by (recall that we denote the future lifetime of (newborn) by ). We define the distribution of mathematically (and quite naturally) as the conditional distribution of , given that .
To understand this, consider the following reasoning: Refer to the following timeline:
death x T_x | |---------|-------v ------------------------- 0 x t |-----------------| T_0
We can observe that if (or , but since is continuous, it does not matter). So, if , then .
On the other hand, if , we have the following timeline:
death x | |-------v-| ------------------------- 0 x t |-------| T_0
In this case, does not exist, since the person does not survive for years, and thus will never be age , so there is not , and therefore there is not , future lifetime of . This shows the necessity of the condition .
From this definition, we have , , etc.. This is quite important since it is the basis for the calculations of probabilities related to .
For the pdf, cdf and survival function of , we have similar notations as follows:
- : pdf of
- : cdf of
- : survival function of
In particular, we have some special actuarial notations for the cdf and survival function, as follows:
In actuarial notations, " " often refers to something related to death, while " " often refers to something related to survival. In this context, this holds since refers to the probability for to die within time units, and refers to the probability for to survive for time units.
For simplicity, if , we write as and as .
Using the relationship between and , we can develop some useful formulas for and , as follows:
Proposition. and .
Proof. First, we have , in which since ( ), and so , and thus is a subset of .
It follows that .
We can also express the pdf of as follows:
Proposition. .
Proof. We have
Remark.
- Intuitively (and roughly), gives the probability for to survive for time units and after that, becomes , and gives the probability for to die "instantaneously" at time , given that the person survives to time units. Multiplying and , it means the same as the (rough) interpretation of : die very shortly after (or "exactly at") time .
Example. It is given that the survival function of newborn is .
(a) Calculate and . Hence, determine whether .
(b) Calculate . Hence, calculate .
Solution:
(a) and . Since , .
(b) Since , . So, .
Exercise.
We have a special notation for the probability for to die between ages and ( ), namely (we use " " here since this is related to death). Thus, we have by definition . We have the following proposition for another formula of .
Proposition. .
Proof.
Remark.
- For proving formulas like this, it is generally better to change all " " to " " in the intermediate steps since " " is usually better to work with than " ".
- To understand this more intuitively, can be interpreted as the probability for to survive for time units, given that survives for time units, and can be interpreted as the probability for to die within time units, given that survives for time units. Therefore, multiplying these two probability yields the probability for to die within time units, and survive for time units, given that survives for time units.
- This argument corresponds to the in the above proof.
- If we denote the above blue event as , orange event as , and purple event as , we can represent the above argument using probability notations: .
- When you try to prove this equality, you can observe that this is equivalent to the in the above proof.
- Similarly, we denote by for simplicity.
Example. It is given that the survival function of newborn is .
(a) Calculate .
(b) Calculate .
(c) Are the answers in (a) and (b) the same?
Solution:
(a)
(b)
(c) They are not the same.
Exercise.
Curtate-future-lifetime of a life aged x
editThe curtate-future-lifetime is just like the future lifetime in previous sections, except that it is discrete.
Definition. (Curtate-future-lifetime) The curtate-future-lifetime of , denoted by , is , which is the floor function of .
Remark.
- Hence, the support of is set of all nonnegative integers.
Similarly, we would like to completely determine the distribution of , as in the case for . We can do this using cdf or probability mass function (pmf). Its pmf is given by the following proposition.
Proposition. (Pmf of ) The pmf of is .
Proof. The pmf of is
Proposition. (Cdf of ) The cdf of is .
Proof. The cdf of is
Example. It is given that the survival function of newborn is .
(a) Calculate the probability for to die within 10 years by considering .
(b) Calculate the probability for to die within 10 years by considering .
(c) Which probability, that in (a) or that in (b), is larger?
Solution:
(a) The probability is .
(b) The probability is
(c) The probability in (b) is larger.
Exercise.
Life tables
editIn a life table, the values of and other functions for different (integer) ages are tabulated. The values are assumed to be based on the survival distribution discussed in previous sections. In this section, we will discuss more functions appearing in a life table.
In previous sections, we have discussed time-until-death random variable for one person, and we will consider multiple people here. Suppose there are newborns. Let the indicator function Also let be the sum of all such indicator functions , i.e. . We can interpret as the number of survivors to age for the newborns.
We denote the expected value of by .
Proposition. .
Proof. Since (this is true for each life , since the survival distribution for the future lifetime of different lifes are assumed to be the same), equals .
As a corollary, ( is constant with respect to ). Also, .
Also, we can use to calculate probabilities like and , as follows:
,
and thus .
In a later section in which selection age is involved in the life table (select table), we will use these formulas to calculate these probabilities from such life table, to incorporate the effect of selection.
We have discussed about the number of survivors to age , and we will discuss the "opposite thing" in the following, namely the number of deaths to age (i.e. between age 0 and ), or in general, between age and .
We denote the expected value of such number of deaths by .
Proposition. .
Proof. We can define another indicator function for this context similarly (with value 1 if life dies between age and age and 0 otherwise). Then, the expected value of each indicator function equals the probability for one of the newborns to die between and , which is . With similar reasoning as in above, we have .
Remark. Similarly, we write as for simplicity.
Apart from the life table functions and which are related to the expectation of number of survivors and deaths respectively, we will also discuss two more life table functions, that is related to the expectation of lifetime.
Example. Suppose and the survival function for newborn is .
(a) What is ?
(b) What are and ? Are they equal?
Solution:
(a) . (That is, the expected number of survivors to age 80 is 0.)
(b) and . They are equal.
Remark.
- For (b), indeed, for each , since for each .
Exercise. It is given that .
3. (a) Calculate in this case ( ). (b) Hence, show that in all cases. (c) Using (b) directly, show that in all cases.
(a)
(b)Proof. From (a), we can observe that in all cases. Hence, where the last equality follows from a previous proposition.
Proof. From (b), we have
There are two types of expectation of life: one is discrete and another is continuous, and they are called curtate-expectation-of-life and complete-expectation-of-life respectively.
Definition. (Complete-expectation-of-life) The complete-expectation-of-life of , denoted by , is .
Definition. (Curtate-expectation-of-life) The curtate-expectation-of-life of , denoted by , is .
Proposition. .
Proof. We will use integration by parts. Now, it suffices to prove that and this is true since and as , so this limit either equals or 0. However, since the expected value exists (i.e. does not tend to infinity, or else the expectation of life does not make sense), this limit cannot equal , and so this limit is 0.
Remark.
- With a similar proof, we can also prove that .
- Thus, the variance .
Proposition. .
Proof. The previous proposition about uses integration by parts in the proof, and we can analogously use summation by parts (may be interpreted as a discrete analogue of integration by parts) in the proof. However, there is a simpler way to prove this proposition, where the summation is "split" appropriately: We can observe that this sum and the sum in the proposition represent the same thing, and thus the result follows.
The idea in this proof is similar to the proof for the proposition about , except that the integration by parts is replaced by summation by parts (which can be interpreted as the "discrete version" of integration by parts). The proof for summation by parts is omitted here. The details is as follows: First, define the forward difference for the summation by parts (this is analogous to " " in the integral). Then, In particular, we have .
Again, it suffices to prove that , and this is true for a similar reason: existence of the expected value .
Remark.
- With a similar proof, we can also prove that (for summation by parts method).
- Thus, the variance .
Exercise. Prove that using the "sum splitting" method.
Proof.
The following are recursion relations for and , which can be useful when we want to find the complete/curtate-expectation-of-life of given the expectation of a life with some other ages, say and .
We will state the recursion relations as a form of proposition, and then prove them formally. After the proof, we will try to give some intuitive explanations about the recursion relation for .
Proposition. .
Proof. In particular, we have .
An intuitive explanation of this recursion relation is as follows:
- for LHS, is the curtate-expectation-of-life of ;
- for RHS, is the curtate-expectation-of-life of , and we want to "transform" it to the expectation of . The first step is adding 1 to it, since this is the expectation with respect to , but we want the expectation from the perspective of , which is 1 year younger. But only this step is not enough, since " " assumes the life already lives for years, but for , the life is only assumed to live for years. Hence, we also need to multiply the probability for to live for one year, , to "get to" .
- Now, "the expectation of life from age onward" is done through . How about "the expectation of life from age to age "? Indeed, when the life dies within age and , . This means such "expectation of life" is zero.
Proposition. .
Proof.
Example. Given that , we have . In particular, has nonzero value when , and has nonzero value when . Also, holds only when . It follows that the bounds on and are given by and .
Exercise. It is given that .
Assumptions for fractional ages
editPreviously, we have discussed the continuous random variable and discrete random variable . A life table can specify the distribution of since the values of for different integer can be obtained from the life table. However, the life table is not enough to specify the distribution of , since we do not know the value of when is not an integer. Thus, in order to specify a distribution of using a life table, we need to make some assumptions about the fractional (non-integer) ages.
In actuarial science, three assumptions are widely used, namely uniform distribution of deaths (UDD) (or linear interpolation), constant force of mortality (or exponential interpolation), and hyperbolic (or Balducci) assumption (or harmonic interpolation). We will define them each using survival functions, as follows:
Definition. (Uniform distribution of deaths) Uniform distribution of deaths assumption assumes
Definition. (Constant force of mortality) Constant force of mortality assumption assumes
Remark.
- The equation can be alternatively stated as .
Definition. (Balducci assumption) Balducci assumption assumes
Remark.
- It is also called hyperbolic assumption.
Under UDD assumption, we have some "nice" and simple expressions for various probabilities related to mortality. We can obtain those expressions by substituting by the RHS of the equation mentioned in the assumption.
Example. Under UDD assumption, we have, when ,
Exercise. Show that under UDD assumption, when , .
Under UDD assumption, when , In particular, we differentiate with respect to instead of since only is varying, and should be fixed at an selected integer. Also, we use the result in the previous example: in the last step.
For the three assumption mentioned, there is a particularly "nice" and simple result for each of them, and we may use those "nice" results for the calculation in practice, rather than applying the definitions. The "nice" result for UDD assumption is mentioned in a previous example: when , . The "nice" results for the other two assumptions are as follows:
Theorem. (Notable property for constant force assumption) Under constant force assumption, .
Proof.
Theorem. (Notable property for Balducci assumption) Under Balducci assumption, .
Proof.
An interesting result under the UDD assumption is related to the independence of two random variables.
To simplify the notations, from now on, we let and unless otherwise specified.
Define a continuous random variable by . That is, is the random variable representing the fractional part of a year lived in the year of death of . For example, if , then lives for half year in the year of death.
Then, and are independent under UDD assumption. This is because under UDD assumption. Also, we can observe that the cdf of is , which is the cdf of uniform distribution with support . This means follows the uniform distribution on under UDD assumption. Hence, and . This gives rise to results under UDD assumptions:
- .
- .
These two results give us an alternative way to calculate the mean and variance of where only discrete things are used in the calculation. However, we should be careful that these results hold under UDD assumption, so we cannot use these results without UDD assumption.
Exercise. (Previous exercise revisit) From a previous exercise, we are asked to calculate and given . The following questions are some further questions on this exercise.
(a) Show that the UDD assumption is satisfied with this survival function.
(b) Verify that .
Proof. Since when , and the UDD assumption is satisfied with this survival function by definition.
(b) From the previous exercise, we have and . Hence, .
Laws of mortality
editIn this section, we will introduce some simple laws of mortality (i.e. some specified distributions for mortality). Some of these laws may be appropriate to model the human mortality for some ages, but it is commonly believed that none of these laws is appropriate to model the human mortality for all ages.
Indeed, if we want to model the human mortality using some probability distribution, we may need a mixture of distributions, since the mortality should be distributed in a different manner when human is in different ages, and thus different distributions should be used in different ages. To choose a suitable distribution for some ages, we may investigate the shape of corresponding empirical distribution based on the actual human mortality data, and pick a suitable distribution accordingly. For example, if the mortality increases exponentially for some ages, then we may select a distribution for which the force of mortality also increases exponentially.
In practice, to calculate the probabilities related to human mortality, we usually use life tables for the calculations. This is usually the case for insurance companies. Each insurance company has its own life table, based on the mortality data of, possibly its clients. Since such life table is constructed using the actual human mortality data based on past experiences, the life table is usually deemed to be more accurate than a specific distribution.
Nevertheless, having a law for mortality allows us to simplify the calculation of the probabilities related to mortality.
Definition. (de Moivre's law of mortality) The de Moivre's law of mortality has the force of mortality and the survival distribution , with ( is called the limiting age, i.e. an human must die (strictly) before that age).
Remark.
- This is the simplest law among the laws we discuss.
- Indeed, in this law, the mortality follows uniform distribution on , since the cdf is , which is exactly the same as the cdf of uniform distribution on .
- Hence, the pdf .
Exercise. Verify that if the force of mortality is ( ), then the survival distribution ( ).
When where , we have
Example. Show that under de Moivre's law of mortality, (where ), with certain bounds on .
Proof. Under de Movire's law of mortality, we have with certain bounds on .
Exercise.
1. Hence, show that under de Moivre's law of mortality.
Proof. By above, we have under de Moivre's law of mortality. It follows that
2. What are the bounds on in the above example?
The bounds on are given by , since for , the bounds are . But since represents time. So, the bounds on are given by .
3. Considering the above example, which distribution does follow?
From the above example, the cdf . So, follows the uniform distribution on .
Remark.
- From this, we can see that when the age increases, the type of distribution does not change (still uniform), but the right endpoint of the support decreases.
4. Hence, show that .
Proof. Since follows the uniform distribution on , it follows that the mean
Definition. (Gompertz's law of mortality) The Gompertz's law of mortality has the force of mortality and the survival distribution , with and .
Definition. (Makeham's law of mortality) The Makeham's law of mortality has the force mortality and the survival distribution , with and .
Remark.
- The Makeham's law of mortality is a generalization of the Gompertz's law of mortality, since these two laws are exactly the same when .
Definition. (Weibull's law of mortality) The Weibull's law of mortality has the force mortality and the survival distribution , with and .
Select and ultimate table
editWhen a person purchases a life insurance policy offered by an insurance company, he needs to give some personal information to the insurance company, e.g. some information about his health status. For the insurance company to decide whether it should sell the policy to the person, those information provided by that person is accessed through the process of underwriting. For underwriting, the underwriter check the information to see whether the risk in insuring that person is appropriate.
Without underwriting, it is likely that people will only purchase life insurance policy when they think they will die soon (e.g. they have a very serious disease), so that they will likely have early claims. In this case, the insurance company may need to pay a large amount of money and suffer a great loss in a short time, and then bankrupt. This shows the necessity of underwriting.
Basically, the "select" in the section title arises from the underwriting process, and when we say an individual is "selected" at age , he is underwritten at age (so the most recent information about the individual is known). Since there are some new information about the individual when he is underwritten (or selected), we will expect there is some update in his survival distribution, and therefore his probabilities related to mortality will change as well. Because of this, we need to have some changes in the actuarial notations, depending on the selection age.
In such actuarial notations, we usually add square brackets around the age at selection, and the numbers in the subscript change accordingly. For example, becomes if the selection age is 25, if the selection age is 12.
Since when a person is underwritten long time ago, he may have poorer health condition (e.g. getting older having some new diseases) from the time at which he is underwritten to now, we will intuitively expect that the longer time passed from the time at which a person aged is underwritten, the more likely that person will be die in the coming year. That is, [1].
The impact on the survival distribution from the selection age may decrease when the time passed from the selection is longer. Beyond a certain time period, say years, the " "'s at the same attained age (i.e. selection age plus the time passed from it to now) but different selection ages will be very close. In other words, (the condition on is to ensure that the selection age at LHS ) [2]. Such years is called the select period. Because the " "'s mentioned above will be very close, all such " "'s will only be written as , without any square brackets (since the effect of selection is basically "gone", and so the square brackets are gone). For example, if the selection period is 2 years, then and will be both written as simply. However, we will not write and as , since these two " "'s are "quite different" because of the selection impact is still "quite large".
The following are some terms related to life table.
- An aggregate table is a life table in which the functions are only given for attained ages.
- A select table is a life table in which some function involves the age at selection.
- An ultimate table is usually appended to a select table as a last column to reflect the setting of select table. The combination of a select table and an ultimate table in such a way is called a select-and-ultimate table.
For example, an excerpt of select-and-ultimate table with select period 2 years may look like:
Age at selection, | |||
---|---|---|---|
0 | |||
1 | |||
2 | |||
... | ... | ... | ... |
The last column is the ultimate table. We can observe that we do not need additional columns for etc., since we can already get such values in the " " value in a different row (with different ).
Given a select-and-ultimate table, we can do various calculations based on it.
Example. You are given the following except of a (hypothetical) select-and-ultimate table with 4-year select period: Then, , , and [3] (since the select period is 4 years).
Exercise.
- ↑ The number inside square brackets is integer since it refers to age.
- ↑ (Optional) To be more precise, the selection period is the smallest integer such that ( is a small positive constant. When it is smaller, the requirement is "stricter" and the " "'s will be "closer".) for each .
- ↑ The value of is from the in the row for .