↓ Skip to main content

GridSample: an R package to generate household survey primary sampling units (PSUs) from gridded population data

Overview of attention for article published in International Journal of Health Geographics, July 2017
Altmetric Badge

About this Attention Score

  • In the top 5% of all research outputs scored by Altmetric
  • Among the highest-scoring outputs from this source (#26 of 654)
  • High Attention Score compared to outputs of the same age (94th percentile)
  • High Attention Score compared to outputs of the same age and source (83rd percentile)

Mentioned by

twitter
67 X users
facebook
3 Facebook pages

Citations

dimensions_citation
40 Dimensions

Readers on

mendeley
101 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
GridSample: an R package to generate household survey primary sampling units (PSUs) from gridded population data
Published in
International Journal of Health Geographics, July 2017
DOI 10.1186/s12942-017-0098-4
Pubmed ID
Authors

Dana R. Thomson, Forrest R. Stevens, Nick W. Ruktanonchai, Andrew J. Tatem, Marcia C. Castro

Abstract

Household survey data are collected by governments, international organizations, and companies to prioritize policies and allocate billions of dollars. Surveys are typically selected from recent census data; however, census data are often outdated or inaccurate. This paper describes how gridded population data might instead be used as a sample frame, and introduces the R GridSample algorithm for selecting primary sampling units (PSU) for complex household surveys with gridded population data. With a gridded population dataset and geographic boundary of the study area, GridSample allows a two-step process to sample "seed" cells with probability proportionate to estimated population size, then "grows" PSUs until a minimum population is achieved in each PSU. The algorithm permits stratification and oversampling of urban or rural areas. The approximately uniform size and shape of grid cells allows for spatial oversampling, not possible in typical surveys, possibly improving small area estimates with survey results. We replicated the 2010 Rwanda Demographic and Health Survey (DHS) in GridSample by sampling the WorldPop 2010 UN-adjusted 100 m × 100 m gridded population dataset, stratifying by Rwanda's 30 districts, and oversampling in urban areas. The 2010 Rwanda DHS had 79 urban PSUs, 413 rural PSUs, with an average PSU population of 610 people. An equivalent sample in GridSample had 75 urban PSUs, 405 rural PSUs, and a median PSU population of 612 people. The number of PSUs differed because DHS added urban PSUs from specific districts while GridSample reallocated rural-to-urban PSUs across all districts. Gridded population sampling is a promising alternative to typical census-based sampling when census data are moderately outdated or inaccurate. Four approaches to implementation have been tried: (1) using gridded PSU boundaries produced by GridSample, (2) manually segmenting gridded PSU using satellite imagery, (3) non-probability sampling (e.g. random-walk, "spin-the-pen"), and random sampling of households. Gridded population sampling is in its infancy, and further research is needed to assess the accuracy and feasibility of gridded population sampling. The GridSample R algorithm can be used to forward this research agenda.

X Demographics

X Demographics

The data shown below were collected from the profiles of 67 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 101 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Unknown 101 100%

Demographic breakdown

Readers by professional status Count As %
Researcher 22 22%
Student > Master 21 21%
Student > Ph. D. Student 16 16%
Student > Postgraduate 6 6%
Student > Doctoral Student 3 3%
Other 7 7%
Unknown 26 26%
Readers by discipline Count As %
Social Sciences 15 15%
Agricultural and Biological Sciences 10 10%
Medicine and Dentistry 7 7%
Environmental Science 6 6%
Earth and Planetary Sciences 5 5%
Other 23 23%
Unknown 35 35%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 45. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 27 July 2023.
All research outputs
#918,867
of 25,389,520 outputs
Outputs from International Journal of Health Geographics
#26
of 654 outputs
Outputs of similar age
#18,543
of 319,044 outputs
Outputs of similar age from International Journal of Health Geographics
#3
of 12 outputs
Altmetric has tracked 25,389,520 research outputs across all sources so far. Compared to these this one has done particularly well and is in the 96th percentile: it's in the top 5% of all research outputs ever tracked by Altmetric.
So far Altmetric has tracked 654 research outputs from this source. They typically receive more attention than average, with a mean Attention Score of 9.7. This one has done particularly well, scoring higher than 96% of its peers.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 319,044 tracked outputs that were published within six weeks on either side of this one in any source. This one has done particularly well, scoring higher than 94% of its contemporaries.
We're also able to compare this research output to 12 others from the same source and published within six weeks on either side of this one. This one has done well, scoring higher than 83% of its contemporaries.