LIFE TABLES
The life table is a device that describes a cohort's or a population's mortality experience. The life table, sometimes called table of mortality (table de mortalité in French, and similarly labeled in most languages other than English) is one of the oldest and probably the single most valuable tool in demography. It has many applications that range well beyond mortality analysis. It is called a table, because in its classical presentation it is made of a number of numerical columns representing various indicators of mortality, but the information it contains can be also conveyed in graphical form.
The concept of a life table comes from a cohort perspective. If one could follow a birth cohort of 100,000 individuals through time in a closed population (a population without in-and outmigration), the number of survivors at various ages would correspond to the "number left alive at age x," in the lifetable its lx column. The difference between survivors at two consecutive ages, x and x+n, would correspond to the number of deaths between x and x+n, in the life table denoted by ndx. The ratio of ndx to lx produces another column of the life table, the probability of dying between age x and x+n, denotedby nqx. Thus, data on cohort survival can be readily converted into a life table.
In its classical form, the life table includes the following columns:
x = exact age
lx = number still alive at age x
ndx = number dying between ages x and x+n = lx-lx + n
nqx = probability of dying between ages x and x+n, conditional on survival to age x =
ndx/l
npx = probability of surviving from age x to age x+n, conditional on survival to age x = lx+n/lx
nax = average number of person-years lived in the interval between age x and x+n by those dying in the interval
nLx = number of person-years lived between ages x and x+n = nlx+nax. ndx
nmx = death rate between ages x and x+n = ndx/L
Tx = person-years lived above age x (sum of nL for ages x and higher)
ex = expectation of life at age x = T /l
Traditionally, a life table where n = 1 is called an "unabridged" or a "single-year" life table, and the left subscript can be omitted. A common alternative is to use 5-year age groups starting with age 5, and to present information for the age group 0–4 (that is, below exact age 5) in two groups: age 0 (i.e., less than age 1) and ages 1–4. In this case, the life table is called "abridged." It formerly was common practice to end a life table with an open-ended age interval starting at age 85. With the large proportions of survivors to age 85 in many low-mortality populations, though, the preferred practice is to present more detailed information above age 85, carrying it up to age 100 and concluding with an open-ended interval above that age. Because of the distinctive mortality differences between the sexes, life tables are commonly also presented separately for males and for females.
Although the life table is in principle a cohort concept, it is not commonly used in this fashion. The reason for this is both practical and substantive. Exhaustive cohort mortality data are rarely available, because that would require systematic death registration for a period spanning 100 years or more. In other words, it would be necessary to wait for a cohort to be extinct or near extinct to be able to construct a full cohort life table and obtain an accurate estimate of its life expectancy at birth. Accuracy would be compromised if the population was not closed–that is, if it were depleted not only by death but also outmigration–and if death registration included deaths of immigrants. A further and more substantive drawback of cohort life tables is that they refer to a period stretching over a century, during which mortality conditions are likely to have changed. For descriptive and policy purposes, typically less heterogenous and more timely information is necessary.
The limitations of cohort life tables have led demographers to design period life tables, based on the concept of "synthetic" cohorts. A synthetic cohort is a hypothetical cohort of persons subject through their life to the age-specific mortality rates of one specific period. This contrasts with a real cohort where each age-specific rate pertains to a different year. The theoretical construct of synthetic cohorts allows one to construct period life tables and compute life-cycle indexes (such as the life expectancy at birth) on the basis of observations relating to, and hence reflecting the mortality conditions for, a well-defined and relatively short time period. In practice, the period chosen is most often a single calendar year, or a two-year period bracketing a census count, or a quinquennium.
Life Table Construction
The construction of a period life table is not as straightforward as for a cohort, because many of the life table functions, including survivors at various ages, are not directly observable. One related life table function that can be estimated from actual data is nMx, the set of age-specific death rates in the population. This is commonly calculated as nDx/nNx, where nDx is the number of deaths between age x and x+n observed in a population during a specific period, and nNx is the population aged x to x+n at the middle of that period. The basic step in constructing a period life table is to estimate death probabilities, nqx, for the synthetic cohort, from nMx the age-specific death rates observed in a population for a particular period.
Strictly speaking, nMx is not exactly equivalent to nmx, the mortality rate observed in the corresponding synthetic cohort. This is because in a cohort (real or synthetic), nmx results from d and L, both of which are entirely produced by mortality conditions. That is, in a cohort, the number of survivors at various ages (and thus the corresponding person-years lived) is fully a product of underlying mortality conditions. Similarly, the number of deaths in the age-interval is also the product of mortality conditions applied to the number of survivors. Thus nmx is an unbiased mortality measure. By contrast, nMx is the product of specific mortality conditions applied to the population in the corresponding age-group, nNx. Unlike L, is not entirely produced by the mortality conditions to which the synthetic cohort is subject. It is also affected by the age distribution of the actual population within that age group, which is itself the product of past fluctuations in the number of births, past variations in mortality, and past migration. Thus nMx and m can differ. For most purposes, however, the difference is not large enough to produce significant differences in life table indexes such as life expectancy at birth. In life table construction, it is common to assume that nMx= nmx; there are, however, more involved methods of construction that do not require this simplifying assumption.
The second operation in constructing a period life table involves converting the set of nmx to q. This is done by using the following exact equation, derived from the equation for nLx specified earlier:
This equation relates an age-specific mortality rate in a cohort (real or synthetic) to the corresponding probability of dying in the age interval x to x+n. After this operation, one can readily calculate the number of survivors at various ages in the synthetic cohort and the remaining columns of the life table.
This equation shows, however, that nmx is not the only input for the nmx to q conversion. The conversion also involves nax, the average number of person-years lived between x and x+n by persons dying during the age interval. These values are usually not readily available in a population. When dealing with an unabridged life table, it is not consequential to make the assumption that life table deaths occur on average at the middle of the single-year interval (1ax = 0.5).
When dealing with five-year age groups, there are a number of methods for estimating nax in a population. One method involves graduation techniques that yield reliable results but that are not easy to implement. Another strategy consists of borrowing naxfrom another population with comparable mortality levels and patterns and for which nax values have been accurately estimated. The assumption that nax= n /2 is also used sometimes in abridged life tables. For the first two age groups (for ages 0 and 1–4), however, this assumption is seriously inadequate because mortality risks decrease rapidly within this age range, and thus deaths are more concentrated toward the beginning of the age interval rather than equally distributed. At these ages, it is possible to use estimation equations based on empirical populations at various mortality levels. These equations permit the estimation of 1a0 and 4 from the recorded level of 1m0. Similar adjustments may be necessary in very old age groups, in which the reverse phenomenon occurs: deaths are more concentrated toward the upper end of the age interval.
With nNx, nDx, and nax in hand, all life table columns can be derived. It is also necessary to choose an arbitrary value for l0, called the radix of the life table, to which the columns lx,ndL, and T are proportional. It is common to choose 100,000 as the value for l0. At the other end of the life table, for an open-ended age interval starting at age x*, it is usually assumed that ∞Lx* = lx* /∞Mx*.
An abridged period life table for the female population of Austria in 1992 is presented in Table 1. In its first two columns, this table also shows the empirical data–population numbers by age, derived from census statistics and numbers of deaths by age, derived from vital statistics–from which the life table was calculated.
The functions of the life table describe the level and age-pattern of mortality of a population. The nqxand nmx columns are two related ways of showing how the risk of mortality varies by age, conditional on survival to age x. The ndx column, if divided by l0, can be viewed as the probability for a newborn to die in a particular age group. The lx column, dividedagain by l0, corresponds to the probability of surviving from birth to age x. More generally, ly/lx (with y>x) corresponds to the probability of surviving from age x to age y, conditional on survival to age x, and 1 -ly/lx is the probability that a person who survived to age x will die between age and y. Furthermore, (lx-ly) /l0 is the probability that a newborn will die between ages x and y,(Tx-Ty) /l0 is the number of years that a newborn can expect to live between ages x and y. Figure 1 shows lx,1qx, and 1dx functions for the female population of Sweden. These data, presented for four different years during the twentieth century, illustrate how the age-pattern of life table functions varies as a country experiences mortality decline.
Perhaps the most widely-reported life table measure is e0, the life expectancy at birth. In a period life table, it corresponds to the average number of years that a newborn cohort would live if subjected through the life course to the age-specific mortality rates of the particular period to which the life table pertains. By extension, this number can be said to represent the length of life an average individual can expect at birth under the given mortality conditions. In a life table pertaining to an actual cohort that is now extinct, e0 corresponds to the observed mean age at death for that particular cohort. Life expectancy, as was indicated above, is also calculated for ages other than zero. For age x, ex can be interpreted as the number of years that an individual can expect to live above age x, conditional on survival to age x.
Life Table Applications
The life table has many applications beyond the measurement of mortality. One of these applications
TABLE 1
FIGURE 1
concerns the relationship between the mortality level of a population and its age structure. In particular, a life table can be conceived as a stationary population, which is a population closed to migration, where the annual number of births (B) and mortality conditions (embodied in the life table) are constant over time. If these conditions apply, the population will have an age distribution, nNx, that is proportional by a factor B/l0 to the nLx column of the life table, and a total population size, P = B e0. The annual number of deaths in this population will equal the annual number of births, and the crude death and birth rates will have the value of 1/e0. In this population all demographic parameters, including total population size, are constant over time. The growth rate of this population is zero–hence the designation "stationary."
It follows that if a population can be assumed to be stationary, population parameters can be directly translated into life table parameters. For example, in a stationary population, e0 = P /B, and nLx= nNx. l0 /B. Also, in a stationary population, the observed distribution of deaths, nDx, is proportional to the distribution of deaths in the life table, ndx. Thus, the observed mean age at death equals e0, which is not the case in non-stationary populations. These equalities are useful for estimating mortality or other single-decrement processes in any population or sub-population that can be assumed to be stationary.
The life table can also be used to project a population in the future. If a particular life table can be assumed to represent mortality conditions of a specific population between time t and +5, and if there is no migration between t and +5, the populationaged x to +5 at time t,5Nx(t) can be projected to time +5 using the 5Lx column of the life table and the following equation:
This equation rests on the assumption that the population is stationary between age x and +5, an assumption that is not very problematic for this purpose. The births that occurred between t and +5 can also be projected to time t +5 to calculate the population under age 5 at time +5 using the following equation:
Naturally, an important component of population projections involves making assumptions about the future course of mortality and the construction of corresponding life tables. The above equations describe the mechanical use of life tables for projection purposes, once life tables for future time periods have been estimated.
Mortality Models
Comparison of life tables in various populations has led demographers to observe regularities of age patterns of mortality. In particular, across populations and time periods, age-specific mortality risks tend to follow a U-shape curve, with higher mortality risks at younger and older ages, with the lowest risks in the neighborhood of age 10 (see Figure 1). Another observation is that mortality rates are highly intercorrelated within a population. When mortality rates are higher at particular ages, they also tend to be higher at other ages. Although with a given level of mortality–as indexed, for example, by the value of the expectation of life at birth–somewhat different age-specific mortality rates may be associated in different populations, this variability is within relatively narrow bounds.
These observations have led demographers to search for parsimonious representations of mortality patterns, or mortality "models." The purpose of mortality models is the representation of complex age/level variations in life table columns with a small number of parameters. Mortality models can permit the estimation of mortality indicators in settings where the data ideally required for life table construction (an accurate census count of the population and accurate death registration for years fairly close to the time of the census, or corresponding statistics derived from an accurate population register) are absent. In such situations it may be still possible to estimate a limited number of population parameters, which, along with a model, can be useful for estimating a full life table.
Mathematical mortality models are the oldest parsimonious representations of mortality patterns. The purpose of such models is to present a functional form relating mortality risks to age, with a small number of parameters adjusting for varying levels and patterns across populations. Mathematical mortality models have not been very successful because the shape of the mortality curve is too complex to be easily summarized in a functional form. An important contribution of mathematical representations of mortality, however, is the Gompertz "law" of mortality, according to which the logarithm of the death rate is a linear function of age. The Gompertz equation is frequently used to represent or estimate mortality rates past middle adult ages, although it tends to overestimate mortality rates at older ages.
The most widely-used mortality models are model life tables. In their classical form, model life tables are a set of life tables indexed along two dimensions, family and level. A family is a group of model life tables with similar age patterns of mortality, often based on the experience of populations that are geographically close. Within a family, tables are indexed by level, from low to high life expectancy at birth. These model life tables are estimated by grouping high-quality empirical life tables with similar age patterns of mortality, and by observing how age-specific mortality rates typically vary as the over-all mortality level changes. Regression equations are then used to construct model life tables for various families and mortality levels.
The most commonly used sets of model life tables were developed by A. J. Coale and P. Demeny and by the United Nations. Coale and Demeny model life tables are mostly based on the experience of European populations during the first half of the twentieth century, whereas United Nations tables are based on the experience of developing countries during the second half the twentieth century.
Model life tables are convenient to use, because within a family, a unique life table can be selected on the basis of only one life table indicator. The use of model life tables thus can flexibly accommodate various data configurations of the populations under study. However, the choice of the family of life tables appropriate for application to actual populations is not always straightforward, and the full range of age variations in actual mortality experience may not be fully represented in the currently available model life tables.
Another category of mortality models is the relational model, to which the logit model developed by William Brass belongs. Brass (1971) observed that any two life tables can be related to each other in a linear way after performing a "logit" transformation of q(x), the probability of dying before age x. The logit system is thus a system in which a "standard" life table can be adjusted for varying levels and agepatterns of mortality across populations after solving for the two linear parameters relating empirical life table values to the standard. The Brass logit system is often used to smooth an empirical life table or to complete a life table with missing values.
Multiple-Decrements
Although life tables were originally developed to study mortality, the same logic can be used to study many other processes. The only requirement is that the process must be a single-decrement process, which means that there is only one mode of exiting a defined state of interest with no possible return to that state. When studying mortality, the state of interest is being alive, and the only way to leave that state, with no return permitted, is through death. The same logic can be applied, for example, to first marriages, in which case the state of interest is being never married and the event of interest is first marriage. Other single-decrement processes include first migration from place of birth, marital survival, or entry to the labor force.
In reality, single-decrement processes are rare. For a real cohort, mortality is in fact the only true single-decrement process. Other states such as "being never married" can be left not only by marriage, but also by death. Mortality always operates in addition to other processes, and thus multiple-decrement processes are far more common than single-decrement processes. Nonetheless, single-decrement life table logic can be applied to multiple decrements if the different sources of exits can be merged into one combined source of decrement, or if sources of decrements other than the one of interest can be ignored because they are very infrequent during the age-range under study. Also, if information is available only for individuals who did not experience other sources of decrement, the process of interest can be studied as a single-decrement process. For example, data on first marriages reported by individuals aged 85 can be used to reconstruct a life table in the single state where first marriage is the only source of decrement. Such strategy will provide an unbiased single-decrement life table if survivors had the same risks of experiencing a first marriage as those who did not survive (independence of probabilities).
In a multiple-decrement environment, it is sometimes useful to present information on the various modes of exits. Such analysis is commonly performed in a multiple-decrement life table, which is a life table where several sources of decrement affect the number of people still alive, and where each source of decrement is specified in various columns. For example, in a cause-specific life table, there are as many ndxi columns as there are causes of death i. The sum of ndxi in an age group is equal to , the number of deaths from all causes in the corresponding single-decrement life table.
There are also methods that permit construction of a single-decrement life table in the absence of, or net of, other sources of decrement. Such life tables are sometimes called "associated single-decrement life tables" or "cause-deleted life tables." This approach involves information on a population subject to various sources of decrement, but it models the process of interest by reconstructing a hypothetical cohort where that process would be the only possible source of attrition. For example, if various causes of death are taken to be different modes of exiting the cohort, an associated single-decrement life table would permit estimation of the survival of a cohort in the absence of a particular cause (or group of causes).
Life table analysis is a powerful tool that has regained interest in the social sciences with recent developments in statistical techniques. If data on cohort survival is available at the individual level and includes covariates, it is possible to use a set of techniques termed "survival analysis" or "event-history analysis." The main purpose of these techniques is to estimate the influence of individual-level characteristics on the age-specific risk of attrition, the socalled "hazard rate." Because of their roots in classical life-table analysis, some of these procedures have been termed "life tables with covariates."
As noted earlier, the underlying assumption of single-and multiple-decrement life tables is that once the event of interest is experienced, individuals can never return to their original state. In reality, there are many processes where individuals can experience reverse flows. For example, if the state of interest is being "currently married," there are return flows as individuals who experience divorce or widowhood remarry and become "currently married" again.
Such processes can be analyzed through increment-decrement life tables, sometimes called multi-state life tables. In an increment-decrement life table, there are as many lx columns as there are states (excluding "being dead"), and these lx columns can increase or decrease depending on the observed rates of transition from one state to the other. Among other uses, increment-decrement life tables allow estimation of the expected number of years in a particular state. For example, they can permit the estimation of the number of years that a newborn can expect to live in the "currently married" state, which could not be done using classical, single-decrement life tables.
Historical Note
The concept of life tables was first created in 1662 by John Graunt (1620–1674) in his Natural and political observations made upon the bills of mortality (1662). Further developments were made by the Huygens brothers and Gottfried Leibniz (1646–1716). The first systematic construction of a life table is credited to the astronomer Edmund Halley (1656–1742) in 1693. During the seventeenth century, life table construction was significantly improved due to the work of Willem Kersseboom (1691–1771), Nicolaas Struyck (1686–1769), and Antoine Deparcieux (1703–1768). However, these scientists lacked the two necessary data sources for the construction of a period life table–deaths and population by age. Therefore, early life tables were accurate only if the population under study was a closed cohort or a stationary population, and thus they had limited applicability. Taking advantage of the exhaustive Swedish data, Pehr Wargentin (1717–1783) constructed the first scientifically correct period life table in 1766.
Before the seventeenth century, death was believed to be either a magical or sacred phenomenon that could not and should not be quantified. The invention of life tables was thus an important scientific breakthrough, not only because of the technical aspects of life table construction, but also because of the concept of mortality as a measurable phenomenon following observable regularities.