↓ Skip to main content

Addressing data privacy in matched studies via virtual pooling

Overview of attention for article published in BMC Medical Research Methodology, September 2017
Altmetric Badge

Mentioned by

twitter
1 X user

Citations

dimensions_citation
7 Dimensions

Readers on

mendeley
30 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
Addressing data privacy in matched studies via virtual pooling
Published in
BMC Medical Research Methodology, September 2017
DOI 10.1186/s12874-017-0419-0
Pubmed ID
Authors

P. Saha-Chaudhuri, C.R. Weinberg

Abstract

Data confidentiality and shared use of research data are two desirable but sometimes conflicting goals in research with multi-center studies and distributed data. While ideal for straightforward analysis, confidentiality restrictions forbid creation of a single dataset that includes covariate information of all participants. Current approaches such as aggregate data sharing, distributed regression, meta-analysis and score-based methods can have important limitations. We propose a novel application of an existing epidemiologic tool, specimen pooling, to enable confidentiality-preserving analysis of data arising from a matched case-control, multi-center design. Instead of pooling specimens prior to assay, we apply the methodology to virtually pool (aggregate) covariates within nodes. Such virtual pooling retains most of the information used in an analysis with individual data and since individual participant data is not shared externally, within-node virtual pooling preserves data confidentiality. We show that aggregated covariate levels can be used in a conditional logistic regression model to estimate individual-level odds ratios of interest. The parameter estimates from the standard conditional logistic regression are compared to the estimates based on a conditional logistic regression model with aggregated data. The parameter estimates are shown to be similar to those without pooling and to have comparable standard errors and confidence interval coverage. Virtual data pooling can be used to maintain confidentiality of data from multi-center study and can be particularly useful in research with large-scale distributed data.

X Demographics

X Demographics

The data shown below were collected from the profile of 1 X user who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 30 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Unknown 30 100%

Demographic breakdown

Readers by professional status Count As %
Student > Master 5 17%
Researcher 5 17%
Student > Ph. D. Student 4 13%
Other 3 10%
Unspecified 3 10%
Other 6 20%
Unknown 4 13%
Readers by discipline Count As %
Medicine and Dentistry 7 23%
Unspecified 3 10%
Psychology 3 10%
Nursing and Health Professions 3 10%
Computer Science 2 7%
Other 7 23%
Unknown 5 17%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 1. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 12 September 2017.
All research outputs
#18,571,001
of 23,001,641 outputs
Outputs from BMC Medical Research Methodology
#1,750
of 2,028 outputs
Outputs of similar age
#242,080
of 315,656 outputs
Outputs of similar age from BMC Medical Research Methodology
#24
of 32 outputs
Altmetric has tracked 23,001,641 research outputs across all sources so far. This one is in the 11th percentile – i.e., 11% of other outputs scored the same or lower than it.
So far Altmetric has tracked 2,028 research outputs from this source. They typically receive a lot more attention than average, with a mean Attention Score of 10.2. This one is in the 6th percentile – i.e., 6% of its peers scored the same or lower than it.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 315,656 tracked outputs that were published within six weeks on either side of this one in any source. This one is in the 12th percentile – i.e., 12% of its contemporaries scored the same or lower than it.
We're also able to compare this research output to 32 others from the same source and published within six weeks on either side of this one. This one is in the 21st percentile – i.e., 21% of its contemporaries scored the same or lower than it.