Title |
De-identifying a public use microdata file from the Canadian national discharge abstract database
|
---|---|
Published in |
BMC Medical Informatics and Decision Making, August 2011
|
DOI | 10.1186/1472-6947-11-53 |
Pubmed ID | |
Authors |
Khaled El Emam, David Paton, Fida Dankar, Gunes Koru |
Abstract |
The Canadian Institute for Health Information (CIHI) collects hospital discharge abstract data (DAD) from Canadian provinces and territories. There are many demands for the disclosure of this data for research and analysis to inform policy making. To expedite the disclosure of data for some of these purposes, the construction of a DAD public use microdata file (PUMF) was considered. Such purposes include: confirming some published results, providing broader feedback to CIHI to improve data quality, training students and fellows, providing an easily accessible data set for researchers to prepare for analyses on the full DAD data set, and serve as a large health data set for computer scientists and statisticians to evaluate analysis and data mining techniques. The objective of this study was to measure the probability of re-identification for records in a PUMF, and to de-identify a national DAD PUMF consisting of 10% of records. |
X Demographics
Geographical breakdown
Country | Count | As % |
---|---|---|
India | 2 | 67% |
Canada | 1 | 33% |
Demographic breakdown
Type | Count | As % |
---|---|---|
Practitioners (doctors, other healthcare professionals) | 2 | 67% |
Scientists | 1 | 33% |
Mendeley readers
Geographical breakdown
Country | Count | As % |
---|---|---|
Germany | 2 | 3% |
United Kingdom | 1 | 2% |
United Arab Emirates | 1 | 2% |
United States | 1 | 2% |
Unknown | 55 | 92% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Student > Ph. D. Student | 15 | 25% |
Researcher | 13 | 22% |
Other | 7 | 12% |
Student > Master | 6 | 10% |
Librarian | 3 | 5% |
Other | 10 | 17% |
Unknown | 6 | 10% |
Readers by discipline | Count | As % |
---|---|---|
Computer Science | 16 | 27% |
Medicine and Dentistry | 13 | 22% |
Nursing and Health Professions | 5 | 8% |
Social Sciences | 4 | 7% |
Psychology | 3 | 5% |
Other | 6 | 10% |
Unknown | 13 | 22% |