↓ Skip to main content

Clinical records anonymisation and text extraction (CRATE): an open-source software system

Overview of attention for article published in BMC Medical Informatics and Decision Making, April 2017
Altmetric Badge

About this Attention Score

  • Average Attention Score compared to outputs of the same age
  • Average Attention Score compared to outputs of the same age and source

Mentioned by

twitter
3 X users

Citations

dimensions_citation
43 Dimensions

Readers on

mendeley
106 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
Clinical records anonymisation and text extraction (CRATE): an open-source software system
Published in
BMC Medical Informatics and Decision Making, April 2017
DOI 10.1186/s12911-017-0437-1
Pubmed ID
Authors

Rudolf N. Cardinal

Abstract

Electronic medical records contain information of value for research, but contain identifiable and often highly sensitive confidential information. Patient-identifiable information cannot in general be shared outside clinical care teams without explicit consent, but anonymisation/de-identification allows research uses of clinical data without explicit consent. This article presents CRATE (Clinical Records Anonymisation and Text Extraction), an open-source software system with separable functions: (1) it anonymises or de-identifies arbitrary relational databases, with sensitivity and precision similar to previous comparable systems; (2) it uses public secure cryptographic methods to map patient identifiers to research identifiers (pseudonyms); (3) it connects relational databases to external tools for natural language processing; (4) it provides a web front end for research and administrative functions; and (5) it supports a specific model through which patients may consent to be contacted about research. Creation and management of a research database from sensitive clinical records with secure pseudonym generation, full-text indexing, and a consent-to-contact process is possible and practical using entirely free and open-source software.

X Demographics

X Demographics

The data shown below were collected from the profiles of 3 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 106 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Unknown 106 100%

Demographic breakdown

Readers by professional status Count As %
Researcher 22 21%
Student > Master 18 17%
Student > Ph. D. Student 13 12%
Other 11 10%
Student > Bachelor 7 7%
Other 14 13%
Unknown 21 20%
Readers by discipline Count As %
Medicine and Dentistry 26 25%
Computer Science 26 25%
Nursing and Health Professions 6 6%
Psychology 4 4%
Biochemistry, Genetics and Molecular Biology 3 3%
Other 12 11%
Unknown 29 27%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 2. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 02 May 2017.
All research outputs
#15,424,842
of 23,881,329 outputs
Outputs from BMC Medical Informatics and Decision Making
#1,252
of 2,030 outputs
Outputs of similar age
#184,769
of 311,808 outputs
Outputs of similar age from BMC Medical Informatics and Decision Making
#24
of 36 outputs
Altmetric has tracked 23,881,329 research outputs across all sources so far. This one is in the 34th percentile – i.e., 34% of other outputs scored the same or lower than it.
So far Altmetric has tracked 2,030 research outputs from this source. They receive a mean Attention Score of 4.9. This one is in the 38th percentile – i.e., 38% of its peers scored the same or lower than it.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 311,808 tracked outputs that were published within six weeks on either side of this one in any source. This one is in the 40th percentile – i.e., 40% of its contemporaries scored the same or lower than it.
We're also able to compare this research output to 36 others from the same source and published within six weeks on either side of this one. This one is in the 36th percentile – i.e., 36% of its contemporaries scored the same or lower than it.