The data presented below represent the predicted number of people per ~100 m pixel as estimated using the random forest (RF) model as described in Stevens, et al. (2015). The following pages contain a description of the RF model and its covariates, their sources and any metadata collected for each covariate. The prediction weighting layer is used to dasymetrically redistribute the census counts and project counts to match estimated populations based on UN estimates for the final population maps provided by AfriPop, AsiaPop and AmeriPop.
These data are the population density values used to estimate the RF model used to create the prediction weighting layer you see above. Values represent population density as measured by people per hectare and calculated from population counts within each census unit. These values are used as the dependent variable during model estimation.
Folder: Census
File Name: CHN_2010_wgs84.shp
Source: China CDC, acquired by Gaughan, et al. for use in AsiaPop data products.
Description: These census data are 2010 China Country Population Census Data. Required fields for map production are ADMINID and ADMINPOP.
Class: polygon
Derived Covariates:
area, buff, zones,   
class       : SpatialPolygonsDataFrame 
features    : 2925 
extent      : -2579239, 2095909, 925378, 6387627  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
variables   : 52
  
These output and figures outline the estimated RF model that is used to predict the population density weighting layer. The model is fitted to the population density values for the preceding census data using covariates aggregatedfrom the ancillary data sources summarized following the model diagnostics.
Call:
 randomForest(x = x_data, y = y_data, ntree = popfit$ntree, mtry = popfit$mtry,      nodesize = length(y_data)/1000, importance = TRUE, proximity = TRUE) 
               Type of random forest: regression
                     Number of trees: 500
No. of variables tried at each split: 14
          Mean of squared residuals: 0.17
                    % Var explained: 95
Call:
 randomForest(x = x_data, y = y_data, ntree = popfit$ntree, mtry = popfit$mtry,      nodesize = length(y_data)/1000, importance = TRUE, proximity = TRUE) 
               Type of random forest: regression
                     Number of trees: 500
No. of variables tried at each split: 14
          Mean of squared residuals: 0.17
                    % Var explained: 95
Call:
 randomForest(x = x_data, y = y_data, ntree = popfit$ntree, mtry = popfit$mtry,      nodesize = length(y_data)/1000, importance = TRUE, proximity = TRUE) 
               Type of random forest: regression
                     Number of trees: 500
No. of variables tried at each split: 14
          Mean of squared residuals: 0.17
                    % Var explained: 95
Call:
 randomForest(x = x_data, y = y_data, ntree = popfit$ntree, mtry = popfit$mtry,      nodesize = length(y_data)/1000, importance = TRUE, proximity = TRUE) 
               Type of random forest: regression
                     Number of trees: 500
No. of variables tried at each split: 14
          Mean of squared residuals: 0.17
                    % Var explained: 95
Folder: Landcover
File Name: chn_lc_full_corr.tif
Source: MDA GeoCover Landcover Product, 30m
Description: Landcover from the Landsat-derived MDA GeoCover product, reclassified to match AfriPop coding and eventually broken down into binary classifications by aggregated land cover type (see Linard, et al., 2010 and Gaughan, et al. 2013 for category information).
Class: raster
Derived Covariates:
cls011, dst011, cls040, dst040, cls130, dst130, cls140, dst140, cls150, dst150, cls160, dst160, cls190, dst190, cls200, dst200, cls210, dst210, cls230, dst230, cls240, dst240, cls250, dst250, clsBLT, dstBLT,   
class       : RasterBrick 
dimensions  : 56689, 73971, 4193342019, 1  (nrow, ncol, ncell, nlayers)
resolution  : 0.00083, 0.00083  (x, y)
extent      : 73, 135, 6.3, 54  (xmin, xmax, ymin, ymax)
coord. ref. : +proj=longlat +datum=WGS84 +no_defs +ellps=WGS84 +towgs84=0,0,0 
data source : D:\Users\jnieves\Research\Population\Data\RF\data\CHN\Landcover\Derived\landcover.tif 
names       : landcover 
min values  :         0 
max values  :       240 
  
Folder: NPP
File Name: DEFAULT: MODIS 17A3 2010
Source: United States Geological Survey (USGS)
Description: MODIS 17A3 version-55 derived estimates of net primary productivity for the year 2010, estimated for 1km pixel sizes and subset and resampled to match the available land cover and final population map output requirements.
Class: raster
Derived Covariates:
,   
class       : RasterBrick 
dimensions  : 54823, 46953, 2574104319, 1  (nrow, ncol, ncell, nlayers)
resolution  : 100, 100  (x, y)
extent      : -2589386, 2105914, 915329, 6397629  (xmin, xmax, ymin, ymax)
coord. ref. : +proj=lcc +lat_1=30 +lat_2=62 +lat_0=0 +lon_0=105 +x_0=0 +y_0=0 +datum=WGS84 +units=m +no_defs +ellps=WGS84 +towgs84=0,0,0 
data source : D:\Users\jnieves\Research\Population\Data\RF\data\CHN\NPP\Derived\npp.tif 
names       :   npp 
min values  :     0 
max values  : 19212 
  
Folder: Lights
File Name: DEFAULT: VIIRS 2012
Source: http://ngdc.noaa.gov/eog/viirs/download_viirs_ntl.html
Description: These 'Lights at Night' data were derived from imagery collected by the Suomi National Polar-orbiting Partnership (NPP) Visible Infrared Imaging Radiometer Suite (VIIRS) sensor.  Data were collected in 2012 on moonless nights and though background noise associated with fires, gas-flares, volcanoes or aurora have not been removed it represents the best-available data for night-time light production.
Class: raster
Derived Covariates:
,   
class       : RasterBrick 
dimensions  : 11842, 16156, 191319352, 1  (nrow, ncol, ncell, nlayers)
resolution  : 0.0042, 0.0042  (x, y)
extent      : 70, 137, 5.6, 55  (xmin, xmax, ymin, ymax)
coord. ref. : +proj=longlat +datum=WGS84 +no_defs +ellps=WGS84 +towgs84=0,0,0 
data source : D:\Users\jnieves\Research\Population\Data\RF\data\CHN\Lights\Derived\lights.tif 
names       : lights 
min values  :      0 
max values  :   5621 
  
Folder: Temp
File Name: DEFAULT: BIO1
Source: http://www.worldclim.org/current
Description: WorldClim/BioClim 1950-2000 mean annual precipitation (BIO12) and mean annual temperature (BIO1) estimates (Hijmans et al., 2005) were downloaded, mosaicked and subset to match the extent of our land cover data for the mapping of this region.
Class: raster
Derived Covariates:
,   
class       : RasterBrick 
dimensions  : 5922, 8078, 47837916, 1  (nrow, ncol, ncell, nlayers)
resolution  : 0.0083, 0.0083  (x, y)
extent      : 70, 137, 5.6, 55  (xmin, xmax, ymin, ymax)
coord. ref. : +proj=longlat +datum=WGS84 +no_defs +ellps=WGS84 +towgs84=0,0,0 
data source : D:\Users\jnieves\Research\Population\Data\RF\data\CHN\Temp\Derived\temp.tif 
names       : temp 
min values  : -227 
max values  :  285 
  
Folder: Precip
File Name: DEFAULT: BIO12
Source: http://www.worldclim.org/current
Description: WorldClim/BioClim 1950-2000 mean annual precipitation (BIO12) and mean annual temperature (BIO1) estimates (Hijmans et al., 2005) were downloaded, mosaicked and subset to match the extent of our land cover data for the mapping of this region.
Class: raster
Derived Covariates:
,   
class       : RasterBrick 
dimensions  : 5922, 8078, 47837916, 1  (nrow, ncol, ncell, nlayers)
resolution  : 0.0083, 0.0083  (x, y)
extent      : 70, 137, 5.6, 55  (xmin, xmax, ymin, ymax)
coord. ref. : +proj=longlat +datum=WGS84 +no_defs +ellps=WGS84 +towgs84=0,0,0 
data source : D:\Users\jnieves\Research\Population\Data\RF\data\CHN\Precip\Derived\precip.tif 
names       : precip 
min values  :     12 
max values  :  11401 
  
Folder: Roads
File Name: roads.shp
Source: Open Street Map, Downloaded 2014-07-10, http://extract.bbbike.org/
Description: These data were downloaded as part of a per-country package of data layers made availalble as shapefiles through the http://extract.bbbike.org website, extracted from the Open Street Map (OSM) database.
Class: linear
Derived Covariates:
cls, dst,   
class       : SpatialLinesDataFrame 
features    : 469843 
extent      : 73, 135, 18, 54  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
variables   : 8
  
Folder: Rivers
File Name:
Source: Open Street Map, Downloaded 2014-07-10, http://extract.bbbike.org/
Description: These data were downloaded as part of a per-country package of data layers made availalble as shapefiles through the http://extract.bbbike.org website, extracted from the Open Street Map (OSM) database.
Class: linear
Derived Covariates:
cls, dst,   
class       : SpatialLinesDataFrame 
features    : 57194 
extent      : -2589209, 2104026, 2373678, 6397187  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
variables   : 9
  
Folder: Populated
File Name: DEFAULT: Merged pop/builtupp, pop/builtupa, pop/mispopp
Source: National Geospatial-Intelligence Agency (NGA), http://geoengine.nga.mil/geospatial/SW_TOOLS/NIMAMUSE/webinter/rast_roam.html
Description: The VMAP0 data area downloaded as separate files, grouped roughly by continent, and merged into individual shapefiles for subsetting and further processing for population mapping efforts.  These data were obtained directly from the original VMAP0 data sources provided by the NGA and pre-processed using Military Analyst in ArcGIS 10.0.  Point data sources are buffered to 100 m and then all polygon data sources are merged to a single shapefile prior to processing.
Class: polygon
Derived Covariates:
cls, dst,   
class       : SpatialPolygonsDataFrame 
features    : 16190 
extent      : -2575793, 2105465, 2372957, 6392344  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
variables   : 12
  
Folder: Waterbodies
File Name: DEFAULT: hydro/watrcrsl
Source: National Geospatial-Intelligence Agency (NGA), http://geoengine.nga.mil/geospatial/SW_TOOLS/NIMAMUSE/webinter/rast_roam.html
Description: The VMAP0 data area downloaded as separate files, grouped roughly by continent, and merged into individual shapefiles for subsetting and further processing for population mapping efforts.  These data were obtained directly from the original VMAP0 data sources provided by the NGA and pre-processed using Military Analyst in ArcGIS 10.0.
Class: polygon
Derived Covariates:
cls, dst,   
class       : SpatialPolygonsDataFrame 
features    : 10141 
extent      : -2493402, 2096906, 2375124, 6387717  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
variables   : 10
  
Folder: Elevation
File Name: DEFAULT: Void-Filled DEM.gdb
Source: HydroSHEDS Void-Filled DEM (Lehnert, et al., 2006), http://hydrosheds.cr.usgs.gov/dataavail.php
Description: The HydroSHEDS data are the result of an effort to provide a globally consistent dataset consisting of NASA's Shuttle Radar Topography Mission (SRTM) data and have been processed, void-filled and corrected for use at large scales.
Class: raster
Derived Covariates:
, slope,   
class       : RasterBrick 
dimensions  : 54823, 46953, 2574104319, 1  (nrow, ncol, ncell, nlayers)
resolution  : 100, 100  (x, y)
extent      : -2589386, 2105914, 915329, 6397629  (xmin, xmax, ymin, ymax)
coord. ref. : +proj=lcc +lat_1=30 +lat_2=62 +lat_0=0 +lon_0=105 +x_0=0 +y_0=0 +datum=WGS84 +units=m +no_defs +ellps=WGS84 +towgs84=0,0,0 
data source : D:\Users\jnieves\Research\Population\Data\RF\data\CHN\Elevation\Derived\elevation.tif 
names       : elevation 
min values  :      -268 
max values  :      8618 
  
Folder: Buildings
File Name: buildings.shp
Source: Open Street Map, Downloaded 2014-07-10, http://extract.bbbike.org/
Description: These data were downloaded as part of a per-country package of data layers made availalble as shapefiles through the http://extract.bbbike.org website, extracted from the Open Street Map (OSM) database.
Class: polygon
Derived Covariates:
cls, dst,   
class       : SpatialPolygonsDataFrame 
features    : 64335 
extent      : 74, 135, 18, 53  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
variables   : 3
  
Folder: Places
File Name: places.shp
Source: Open Street Map, Downloaded 2014-07-10, http://extract.bbbike.org/
Description: These data were downloaded as part of a per-country package of data layers made availalble as shapefiles through the http://extract.bbbike.org website, extracted from the Open Street Map (OSM) database.
Class: point
Derived Covariates:
cls, dst,   
class       : SpatialPointsDataFrame 
features    : 40536 
extent      : 74, 135, 18, 53  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
variables   : 4
  
Folder: Points
File Name: points.shp
Source: Open Street Map, Downloaded 2014-07-10, http://extract.bbbike.org/
Description: These data were downloaded as part of a per-country package of data layers made availalble as shapefiles through the http://extract.bbbike.org website, extracted from the Open Street Map (OSM) database.
Class: point
Derived Covariates:
cls, dst,   
class       : SpatialPointsDataFrame 
features    : 62573 
extent      : 74, 135, 18, 53  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
variables   : 4
  
Folder: Railways
File Name: railways.shp
Source: Open Street Map, Downloaded 2014-07-10, http://extract.bbbike.org/
Description: These data were downloaded as part of a per-country package of data layers made availalble as shapefiles through the http://extract.bbbike.org website, extracted from the Open Street Map (OSM) database.
Class: linear
Derived Covariates:
cls, dst,   
class       : SpatialLinesDataFrame 
features    : 38110 
extent      : 76, 135, 18, 54  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
variables   : 3
  
Folder: Residential
File Name: residential.shp
Source: Open Street Map, Downloaded 2014-07-10, http://extract.bbbike.org/
Description: These data were downloaded as part of a per-country package of data layers made availalble as shapefiles through the http://extract.bbbike.org website, extracted from the Open Street Map (OSM) database.
Class: polygon
Derived Covariates:
cls, dst,   
class       : SpatialPolygonsDataFrame 
features    : 10408 
extent      : 74, 135, 19, 53  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
variables   : 3
  
Folder: Wda
File Name: wda.shp
Source: World Database on Protected Areas, Downloaded September, 2012, UNEP, http://www.wdpa.org, http://protectedplanet.net
Description: These data are compiled by UNEP and distributed via the Protected Planet website.  All protected areas were downloaded regardless of International Union for Conservation of Nature (IUCN) or any other designation, so they include sanctuaries, national parks, game reserves, World Heritage Sites, etc.
Class: polygon
Derived Covariates:
cls, dst,   
class       : SpatialPolygonsDataFrame 
features    : 806 
extent      : 74, 135, 18, 52  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
variables   : 26
  
Folder: D90_urb
File Name: urb_1990fine_prj_polyline_dist_neg.tif
Source: Calculated from settlement extents (Wang, et al. 2012)
Description: These data are calculated distances to urban edges as processed using Landsat TM/ETM+ using base years of 1990, 2000, and 2010, to calculate all urban built-up areas in China; based on 663 cities
Class: raster
Derived Covariates:
,   
class       : RasterBrick 
dimensions  : 57808, 79666, 4605332128, 1  (nrow, ncol, ncell, nlayers)
resolution  : 100, 100  (x, y)
extent      : -4073086, 3893514, 868429, 6649229  (xmin, xmax, ymin, ymax)
coord. ref. : +proj=lcc +lat_1=30 +lat_2=62 +lat_0=0 +lon_0=105 +x_0=0 +y_0=0 +datum=WGS84 +units=m +no_defs +ellps=WGS84 +towgs84=0,0,0 
data source : D:\Users\jnieves\Research\Population\Data\RF\data\CHN\D90_urb\Derived\d90_urb.tif 
names       : d90_urb 
min values  :   -7518 
max values  : 3847624 
  
Folder: D00_urb
File Name: urb_2000fine_prj_polyline_dist_neg.tif
Source: Calculated from settlement extents (Wang et al., 2012)
Description: These data are calculated distances to urban edges as processed using Landsat TM/ETM+ using base years of 1990, 2000, and 2010, to calculate all urban built-up areas in China; based on 663 cities
Class: raster
Derived Covariates:
,   
class       : RasterBrick 
dimensions  : 57808, 79666, 4605332128, 1  (nrow, ncol, ncell, nlayers)
resolution  : 100, 100  (x, y)
extent      : -4073086, 3893514, 868429, 6649229  (xmin, xmax, ymin, ymax)
coord. ref. : +proj=lcc +lat_1=30 +lat_2=62 +lat_0=0 +lon_0=105 +x_0=0 +y_0=0 +datum=WGS84 +units=m +no_defs +ellps=WGS84 +towgs84=0,0,0 
data source : D:\Users\jnieves\Research\Population\Data\RF\data\CHN\D00_urb\Derived\d00_urb.tif 
names       : d00_urb 
min values  :   -8677 
max values  : 3847624 
  
Folder: D10_urb
File Name: urb_2010fine_prj_polyline_dist_neg.tif
Source: Calculated from settlement extents (Wang et al., 2012)
Description: These data are calculated distances to urban edges as processed using Landsat TM/ETM+ using base years of 1990, 2000, and 2010, to calculate all urban built-up areas in China; based on 663 cities
Class: raster
Derived Covariates:
,   
class       : RasterBrick 
dimensions  : 57808, 79666, 4605332128, 1  (nrow, ncol, ncell, nlayers)
resolution  : 100, 100  (x, y)
extent      : -4073086, 3893514, 868429, 6649229  (xmin, xmax, ymin, ymax)
coord. ref. : +proj=lcc +lat_1=30 +lat_2=62 +lat_0=0 +lon_0=105 +x_0=0 +y_0=0 +datum=WGS84 +units=m +no_defs +ellps=WGS84 +towgs84=0,0,0 
data source : D:\Users\jnieves\Research\Population\Data\RF\data\CHN\D10_urb\Derived\d10_urb.tif 
names       : d10_urb 
min values  :  -10902 
max values  : 3847458 
  
Folder: Intersections
File Name: CHN_OSMRoad_Intersections_Mu.shp
Source: Open Street Map, Downloaded 2014-07-10, http://extract.bbbike.org/
Description: These data were downloaded as part of a per-country package of data layers made availalble as shapefiles through the http://extract.bbbike.org website, extracted from the Open Street Map (OSM) database.
Class: point
Derived Covariates:
cls, dst,   
class       : SpatialPointsDataFrame 
features    : 2298282 
extent      : 74, 135, 18, 54  (xmin, xmax, ymin, ymax)
coord. ref. : NA 
variables   : 2