Griffith Feeney's Demography Website

The DEMOGRAPHY-STATISTICS-INFORMATION TECHNOLOGY Letter

#10 Sep 2016 Download PDF

The life table is a tool for describing levels and age patterns of mortality in a population using a standard set of descriptive statistics. Mastery of life table concepts and methods is essential for serious analysis of mortality. The importance of life table methods is more general, however, because the ideas that apply to the analysis of length of life apply in many other settings.

Life tables may present a stumbling block even to serious and determined students of demography. It is worth considering why this is so, and several possibilities may be suggested. One is the sheer volume of material to be absorbed. Reasonable mastery requires weeks of effort, not hours or days.

The notation used for life table columns and formulas requires a familiarity with basic mathematics that comes more easily for some than for others. A solid understanding of the distinction between period and cohort life tables requires experience using life table methods to study mortality in actual human populations.

A realistic appreciation of the effort required to learn life table methods may be the best way to avoid discouragement. That said, let’s begin.

A ** cohort** life table describes the mortality experience of a birth cohort, a group of persons born during a particular year or other time period. A cohort life table cannot be constructed until all or nearly all of the persons in the cohort have died. This means, firstly, that the persons must have been born a very long time ago, and secondly, that the mortality risks described by the table amalgamate mortality conditions over a very long period of time, nearly a century.

If we are interested in mortality during particular year or time period, we want a “period” life table. A ** period** life table looks just like a cohort life table, but instead of describing what happened in a birth cohort over the past 100 or so years, it describes the possible

Most life tables are period life tables, partly because few countries have the data required to construct cohort life tables, and partly because there is more interest in current than in past mortality.

The importance of cohort life tables is mainly conceptual. Life table concepts are more easily explained and understood for cohort life tables. Learning how cohort life tables are defined and constructed is the best preparation for learning period life tables.

The following section presents an example of a life table. Like most life tables encountered in practice, it is a period table, but to explain the meaning of the table the reader is asked to imagine that it is a cohort life table constructed from a list of the ages at death of persons born during a year in the distant past.

Table 1 shows a life table for Austrian males based on mortality risks observed in 1992. Understanding the table requires a few definitions. A person’s ** age** at any given time is the time elapsed since the person’s birth.

Age may be referred to as ** exact** age when it is unclear whether age or age in completed years is intended. In practice, “age” tends to be used without qualification, the reader being expected to figure out from context what meaning is intended. This is almost always possible, and not difficult, but it requires experience and practice.

Rows of the table correspond to age intervals. The number in the *x* column is the (exact) age at which the age interval begins. The number in the *n* column is the width of the interval. The age intervals are understood to be what mathematicians call “left closed, right open”, meaning that they include persons whose exact age is *x* but exclude persons whose exact age is *x* + *n*. The last interval is nearly always “open-ended”, in this case age 85 years and over.

The subscript notation in the column headings was invented by actuaries and has been in use for well over a century. The following subscript *x* denotes exact age. If the values in the column refer to an age interval, the leading subscript, sometimes called a “prescript”, gives the width of the interval. The prescript is omitted if the width is one year. Putting the width of the open-ended interval to ∞ is a convenient formalism.

The * _{n}m_{x}* column is not strictly a life table column, rather it is the primary input from which most period life tables are constructed. The

The remaining six columns of Table 1 tend to appear in all life tables. They are discussed in the following three sections.

To learn the meaning of the life table columns it is useful to imagine that Table 1 is a cohort life table for persons born during a year in the distant past and was constructed from a list of the exact ages at death of all persons born during this year. The * _{n}m_{x}* column is not needed if the life table is constructed in this way.

The * _{n}q_{x}*,

If we were really calculating a cohort life table from a list of ages at death, we would tally the number of deaths in each age interval and divide by the number of persons in the cohort to get the * _{n}d_{x}* column values, cumulate the

The different order of the columns in Table 1 is due to its being, in fact, a period life table. As explained in section 8, construction of a period life table begins with calculation of the * _{n}q_{x}* from age-specific death rates

The * _{n}d_{x}* and

The *l _{x}* column is the cumulative frequency distribution, except that the direction of cumulation is reversed: instead of the proportion of persons dying before age

The solid dots in Figure 1 show the frequency distribution of deaths (* _{n}d_{x}*) in Table 1. A common procedure for plotting values that refer to intervals is to plot the point at the midpoint of the interval. A slightly refined alternative is to plot the point for an interval at the mean of the values in the interval. This has the advantage of providing an age at which to plot values for the open-ended interval, for which the midpoint is undefined. For the distribution in Table 1, the means of the values in each age interval are given by

Figure 1 shows that many deaths occur at ages above 85 years, far too many to justify truncating the table at this age. Life tables that truncate mortality experience too early are common, partly because conventional upper age limits were adopted long ago when expectations of life were lower.

The hollow circles in Figure 1 show *d _{x}* values from a life table constructed by extrapolating the age-specific death rates (

The distribution of ages at death of members of a birth cohort is distinctive in that deaths occur at particular times, as well as at a particular ages. For the study of mortality, the time is as important as the age. This characteristic of the distribution of cohort deaths by age is not shared by frequency distributions in general, and it is essential to understanding the meaning of the * _{n}q_{x}* values.

The formula defining * _{n}q_{x}*, (1c), shows that its numerator is the same as the numerator of

To see the significance of this difference, suppose for simplicity that persons in the cohort are all born at the same time t, so that deaths in the age interval [*x, x + n*) occur between time t + x and time t + x + n. What does * _{n}d_{x}* tell us about the level of mortality for this age group during this time period?

The answer turns out to be—*nothing*. The value of * _{5}d_{60}* (say) may be low because most cohort members have died at earlier ages (high mortality), or because most cohort members die at older ages (low mortality). The

Dividing the number of deaths to the cohort in the age interval [*x, x + n*) by the number of survivors at age *x* controls for the history of mortality in the cohort by eliminating the dependence on the numbers of deaths below age x. The value of * _{n}d_{x}* cannot be larger than the number of persons surviving to age x. The value of

Another way is to ask what is the maximum number of people in the cohort who could die in the interval [*x, x + n*). The most fundamental principle of demographic analysis is that numbers of events tend to be larger when the numbers of persons who might experience these events are larger. Dividing the number of deaths by the number of persons who might have died gives a relative number that controls for large differences in numbers of persons. The influence of small differences in numbers of persons may be outweighed by other influences on numbers of deaths.

The number of persons in the cohort who die in the interval [*x, x + n*) is * _{n}d_{x}* times the number in the cohort. The number who might die is the proportion

A third way to understand * _{n}q_{x}* is as a conditional probability. By definition, the conditional probability of event

Let *A* be the event “died in the age interval [*x, x + n*)” and *B* the event “survived to age *x*”, which is the same as “died in the interval [*x, ∞*)”. The intersection of these two events is death in the interval [*x, x + n*), so * _{n}q_{x}* is the conditional probability of dying in the interval [

To personalize this, suppose that I am a cohort member, that I have just reached age *x*, and that I want to know how likely it is that I will die before reaching age *x* + 5 years. The * _{n}q_{x}* column of the life table tells me. If I am 30 years old, the chance of dying before age 35 is 0.7%. If I am 70 years old, the chance of dying before age 75 is nearly 20%—this of course on the assumption that my mortality risks are represented by the life table shown in Table 1.

Given any one of * _{n}q_{x}, l_{x}*, and

Note finally that the * _{n}q_{x}, l_{x}*, and

The values in these columns are defined as follows.

The “Person years lived” in the numerator of (3a) is the sum the time in years lived in the age interval by each person who survives to age *x*. This sum may be divided into two components.

(a) Persons who survive to age *x + n* contribute n person years to the sum. Multiplying the number of these persons by n and dividing by the total number of persons in the cohort gives *n×l _{x+n}*.

(b) Persons who die at age *x ≤ x _{i} < x + n* contribute

Combining these two expressions gives

This formula may be used to calculate the * _{n}L_{x}* column from the

Person years lived is additive over age intervals, so the *T _{x}* column may be calculated from the

The value of *e _{x} = T_{x} / l_{x}* is

Readers familiar with introductory statistics may be puzzled by the definition of life expectancy at birth in the preceding section. If it is simply the mean age at death for cohort members, why not calculate it using one of the formulas for the mean found in introductory statistics textbooks? Why “person years lived” and the cumbersome and apparently unnecessary three column calculation?

The answer may surprise readers who grew up with personal computers and spreadsheet programs. Life tables go back more than 350 years, beginning with John Graunt’s Natural and Political Observations Made Upon the Bills of Mortality, a study of statistics of deaths in contemporary London. The life table in its modern form is newer, but the columns in Table 1 go back more than a century.

This means that life tables were developed when arithmetic was done using pencil and paper—no calculating machines, to say nothing of computers. When arithmetic is done by hand, addition is simplest and least error-prone. Subtraction, to say nothing of multiplication and division, is more difficult and to be avoided if at all possible.

Calculations are therefore organized to minimize the use of subtraction, multiplication and division. This explains the seemingly peculiar manner in which life expectancy is calculated. Manual calculation also explains the layout of the life table, the first purpose of which was not to present results, but to organize the calculations in a way that minimizes the chance of careless error.

The difficulties of manual arithmetic explain the origin of the life table columns, but two other factors explain their persistence. Insurance and annuity calculations use *e _{x}* values at all ages, and if these are wanted, the traditional organization of their calculation is as good as any other. It has the advantage, moreover, of providing

Life expectancy at birth and mean age at death for a cohort are two expressions for the same quantity, but “mean age at death” for a *period* might reasonably be understood as the mean age at which deaths to the population during a time period. This mean reflects the population age distribution as well as the age-specific risks of mortality and is an extremely poor indicator of mortality. “Expectation of life at birth” avoids this confusion.

The * _{n}a_{x}* column shows the average years lived in the age interval [

Substituting *l _{x} − l_{x + n}* for

If a life table shows all three columns, this formula provides a check of the consistency between them. If a life table does not show * _{n}a_{x}* values, they may calculated from the

Formula 4 also implies

Since *l*_{0} = 1, this shows that the *l _{x}* column may be iteratively calculated from the

Formulas 4, 6, and 7 together show that the information in the *l _{x}*,

Preceding sections discussed the life table columns in Table 1 as though they were calculated from a list of the ages at death of a cohort of persons born in the distant past, so distant that all of the cohort members have died. This pedagogical device faciliates understanding of the meaning of the columns and their interrelationships.

As noted in the introduction, however, most life tables encountered in practice are period life tables. They show what would happen to a cohort of persons born during a recent time period if these persons experienced the mortality risks observed during the period in which they were born. The idea is not that mortality risks will remain in fact constant, but that this assumption provides a useful representation of current mortality risks.

Construction of period life tables involves many technicalities. The methods used vary with the data available. A thorough discussion is far beyond the scope of this letter. It will be useful, however, to present a generic procedure for calculating a period life table from a set of age-specific death rates.

Current mortality risks are represented by * _{n}m_{x}*, defined as the number of deaths of persons in an age interval [

The simplest way to calculate a period life table is to use a formula that translates age-specific death rates into * _{n}q_{x}* values. The

Calculation of the * _{n}L_{x}, T_{x}, and e_{x}* columns requires information on the age distribution of deaths within age ingervals. This is not provided by the

Given a set of age-specific death rates * _{n}m_{x}* and a set of

A derivation of this formula is given in section 3.1 of [3].

[1] Guillot, Michelle. 2003. “Life tables” entry in Volume 2 of the Encyclopedia of Population, Ed. Paul Demeny and Geoffrey McNicoll. New York: Macmillan Reference USA.

[2] King, George. 1902. Statistical applications of the mortality table. Excerpted in David P. Smith and Nathan Keyfitz, Mathematical Demography: Selected Papers, second, revised edition, Ed. Kenneth W. Wachter and Hervé Le Bras. Demographic Research Monographs: A Series of the Max Planck Institute for Demographic Research. New York: Springer. Available at www.demogr.mpg.de/books/, accessed 13 July 2016.

[3] Preston, Samuel H., Patrick Heuveline, and Michel Guillot. 2000. Demography: Measuring and Modeling Population Processes. Oxford, United Kingdom: Blackwell Publishers Inc.

[4] Wilmoth, John R., Sarah Zureick, Vladimir Canudas-Romo, Mie Inoue, and Cheryl Sawyer. 2012. A flexible two-dimensional mortality model for use in indirect estimation. Population Studies **66**(2):1-28.

Griffith Feeney Ph.D.

Scarsdale, New York, USA

EMAIL feeney@gfeeney.com

OFFICE +1 914 595 1916

MOBILE +1 914 721 3950

SKYPE gfeeney

**DOWNLOAD CV Paper Size A4 or
LETTER**

DSITL is an occasional
email letter on demography, statistics, and information technology by
Griffith Feeney. License: Creative Commons Attribution-Noncommercial-No Derivative Works 3.0 United States License.
To subscribe (unsubscribe), send email to
feeney *at* gfeeney *dot* com
with “subscribe” (“unsubscribe”) in the subject line.