Report for: Addressing data privacy in matched studies via virtual pooling

You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.

Title	Addressing data privacy in matched studies via virtual pooling
Published in	BMC Medical Research Methodology, September 2017
DOI	10.1186/s12874-017-0419-0
Pubmed ID	28882105
Authors	P. Saha-Chaudhuri, C.R. Weinberg
Abstract	Data confidentiality and shared use of research data are two desirable but sometimes conflicting goals in research with multi-center studies and distributed data. While ideal for straightforward analysis, confidentiality restrictions forbid creation of a single dataset that includes covariate information of all participants. Current approaches such as aggregate data sharing, distributed regression, meta-analysis and score-based methods can have important limitations. We propose a novel application of an existing epidemiologic tool, specimen pooling, to enable confidentiality-preserving analysis of data arising from a matched case-control, multi-center design. Instead of pooling specimens prior to assay, we apply the methodology to virtually pool (aggregate) covariates within nodes. Such virtual pooling retains most of the information used in an analysis with individual data and since individual participant data is not shared externally, within-node virtual pooling preserves data confidentiality. We show that aggregated covariate levels can be used in a conditional logistic regression model to estimate individual-level odds ratios of interest. The parameter estimates from the standard conditional logistic regression are compared to the estimates based on a conditional logistic regression model with aggregated data. The parameter estimates are shown to be similar to those without pooling and to have comparable standard errors and confidence interval coverage. Virtual data pooling can be used to maintain confidentiality of data from multi-center study and can be particularly useful in research with large-scale distributed data.

View on publisher site Alert me about new mentions

X Demographics

The data shown below were collected from the profile of 1 X user who shared this research output. Click here to find out more about how the information was compiled.

Geographical breakdown

Country	Count	As %
United Kingdom	1	100%

Demographic breakdown

Type	Count	As %
Members of the public	1	100%

Mendeley readers

The data shown below were compiled from readership statistics for 30 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country	Count	As %
Unknown	30	100%

Demographic breakdown

Readers by professional status	Count	As %
Student > Master	5	17%
Researcher	5	17%
Student > Ph. D. Student	4	13%
Other	3	10%
Unspecified	3	10%
Other	6	20%
Unknown	4	13%

Readers by discipline	Count	As %
Medicine and Dentistry	7	23%
Unspecified	3	10%
Psychology	3	10%
Nursing and Health Professions	3	10%
Computer Science	2	7%
Other	7	23%
Unknown	5	17%

Attention Score in Context

This research output has an Altmetric Attention Score of 1. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 12 September 2017.

All research outputs

#18,571,001

of 23,001,641 outputs

Outputs from BMC Medical Research Methodology

#1,750

of 2,028 outputs

Outputs of similar age

#242,080

of 315,656 outputs

Outputs of similar age from BMC Medical Research Methodology

#24

of 32 outputs

Altmetric has tracked 23,001,641 research outputs across all sources so far. This one is in the 11th percentile – i.e., 11% of other outputs scored the same or lower than it.

So far Altmetric has tracked 2,028 research outputs from this source. They typically receive a lot more attention than average, with a mean Attention Score of 10.2. This one is in the 6th percentile – i.e., 6% of its peers scored the same or lower than it.

Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 315,656 tracked outputs that were published within six weeks on either side of this one in any source. This one is in the 12th percentile – i.e., 12% of its contemporaries scored the same or lower than it.

We're also able to compare this research output to 32 others from the same source and published within six weeks on either side of this one. This one is in the 21st percentile – i.e., 21% of its contemporaries scored the same or lower than it.

Addressing data privacy in matched studies via virtual pooling

Mentioned by

Citations

Readers on

X Demographics

Geographical breakdown

Demographic breakdown

Mendeley readers

Geographical breakdown

Demographic breakdown

Attention Score in Context