Population modelling
Population data play a range of vital roles in integrated assessment of environmental risks to health:
- As proxies for source activity and intensity data in models of emissions or environmental concentrations (e.g. in terms of domestic waste production, traffic density, atmospheric emissions, air pollutant concentrations)
- As a basis for modelling and estimating population exposures (e.g. by intersection of population data with environmental data, by extrapolation from personal monitoring or biomonitoring data)
- As a source of information on vulnerability (e.g. related to age, sex or socio-economic status).Population density and composition are both highly variable, often over short distances.
Substantial changes in population can also occur over periods of a few years not only as a result of natural changes of birth and death but also through migration and urban development. The spatial and temporal resolution of these data is therefore often critical in determining their utility for healtjh impact assessment. While some assessments can cope with aggregated data, based for example on national decennial population counts, for many purposes the data need to be available at the resolution of 1 km or less, in order to reflect local variations in environmental conditions, and for time periods of no greater than one or two years. Additionally, for many applications data are needed not as totals for the whole population, but for specific population groups, defined in terms of age, sex, ethnic origin etc. The ability to provide data at these levels of detail (i.e. cross-stratified by area, year, and population group) is likely to be one of the main limiting factors in the utility of available population data.
Two forms of population modelling are therefore likely to be necessary in many integrated assessments:
- temporal modelling (population projections), to calculate likely changes in population numbers and structure for future years, under the various policy scenarios being assessed;
- spatial modelling, to estimate the local distribution of these population groups under the various scenarios.
Population projections
Population projections are an estimate of a future population. Unlike censuses, in which information about the members of the population is systematically collected and recorded, population projections typically involve mathematical models based on pre-existing data. The projections are actually scenarios based on assumptions about the factors which influence population changes, such as fertility, mortality and migration. Projections of this type are often needed in assessments, for example, to indicate the size (and susceptibility) of the population for future years, to compute rates and DALYs for health outcomes with long lag periods (e.g. some cancers), and to take account of changes in population distribution and structure caused by the policy or other factors being analysed.
Estimates and projections for population size, growth and structure are available from various sources including the United Nations (world), Eurostat (for Europe) and national statistics offices. Depending on the data provider, projections may be available as world-wide, regional, national or sub-national estimates and include sex and/or age breakdown. The typical time horizon is the mid 21st century, but long-range projections to 2150 and 2300 are also available from, for example, the United Nations (UN 2004).
Regional (NUTS 3) projections to the year 2030 are currently available from Eurostat for European Member States. For some countries, more detailed projections may be obtained through the national statistics office. For example, the “2008-based Subnational Population Projections for England” (ONS 2010) provides local authority level estimates with age and sex breakdown to the year 2033. These are trend based projections, meaning that assumptions for future levels of births, deaths and migration are based on observed levels over the previous five years (2004 to 2008). The projections thus show what the population structure will be assuming the recent trends continue.
Spatial population modelling
Reliable information on the location and distribution of the population is essential for risk assessment in order to differentiate between, and calculate, population-level exposures. Census data provide routinely available information on population numbers and composition, generally on a decennial basis. By their design, however, census tracts - arranged to facilitate enumeration rather than to reflect the underlying geographic distributions of population - are of limited value for exposure assessment.
Census tracts are rarely uniform in size or population numbers, and typically cover large geographic areas including unpopulated lands (e.g. rural land, parks and industrial developments). The smallest geographic unit for the EU administrative regions, for example, is the Local Administrative Unit 2. Outside densely built-up urban centres most LAU2 regions are around 5-10 km2. In rural areas, they may be in excess of 100km2. Where the LAU2 regions are this large, the population tends to be dispersed or clustered in small towns with inhabitants typically residing in only a small proportion of the region.
In response to these problems, several spatial disaggregation methods have emerged to estimate small-area population for various science and policy applications including risk assessment. The methods range in complexity from simple area weighting (Goodchild and Lam1980) to dasymetric mapping techniques (Briggs et al. 2007; Flowerdew and Green 1994; Harris and Longley 2000; Mennis 2003).
Area weighting, for example, is used to construct the gridded population of the world (GPW-3) database (CIESIN 2004). It is an approximate approach, however, which relies on the (often false) assumption that the population is evenly distributed within each census district. To develop more reliable models of population distribution requires more sophisticated techniques incorporating additional data on exogenous variables that can be assumed to reflect lower level (i.e. within census tract) variations in population density. These more complex dasymetric mapping or disaggregation techniques often use land cover data derived from satellite imagery to provide the necessary ancillary information for modelling.
Briggs D. J., Fecht, D., Gulliver, J., and Vienneau, D. 2007 Dasymetric modelling of small-area population distribution using land cover and light emissions data. Remote Sensing of Environment 108, 451-466.
Flowerdew, R., and Green, M. 1994 Areal interpolation and types of data. In Spatial analysis and GIS (Fotheringham, S. and Rogerson, P. eds.). London: Taylor and Francis, pp. 121−145.
Goodchild, M.F. and Lam, N.S-N. 1980 Spatial interpolation methods, a review. American Cartographer 10, 129−149.

