Back to Search
Start Over
DataSHIELD: Taking the analysis to the data, not the data to the analysis
- Source :
- Int. J. Epidemiol. 43, 1929-1944 (2014), International Journal of Epidemiology, 43(6), 1929-1944. Oxford University Press, Gaye, A, Marcon, Y, Isaeva, J, Laflamme, P, Turner, A, Jones, E M, Minion, J, Boyd, A W, Newby, C J, Nuotio, M L, Wilson, R, Butters, O, Murtagh, B, Demir, I, Doiron, D, Giepmans, L, Wallace, S E, Budin-ljøsne, I, Oliver schmidt, C, Boffetta, P, Boniol, M, Bota, M, Carter, K W, Deklerk, N, Dibben, C, Francis, R W, Hiekkalinna, T, Hveem, K, Kvaløy, K, Millar, S, Perry, I J, Peters, A, Phillips, C M, Popham, F, Raab, G, Reischl, E, Sheehan, N, Waldenberger, M, Perola, M, Van den heuvel, E, Macleod, J, Knoppers, B M, Stolk, R P, Fortier, I, Harris, J R, Woffenbuttel, B H R, Murtagh, M J, Ferretti, V & Burton, P R 2014, ' DataSHIELD : taking the analysis to the data, not the data to the analysis ', International Journal of Epidemiology, vol. 43, no. 6, dyu188, pp. 1929-1944 . https://doi.org/10.1093/ije/dyu188, International Journal of Epidemiology, Int J Epidemiol, Int J Epidemiol, 2014, 43 (6), pp.1929-44. ⟨10.1093/ije/dyu188⟩
- Publication Year :
- 2014
-
Abstract
- Gaye, Amadou Marcon, Yannick Isaeva, Julia LaFlamme, Philippe Turner, Andrew Jones, Elinor M Minion, Joel Boyd, Andrew W Newby, Christopher J Nuotio, Marja-Liisa Wilson, Rebecca Butters, Oliver Murtagh, Barnaby Demir, Ipek Doiron, Dany Giepmans, Lisette Wallace, Susan E Budin-Ljosne, Isabelle Oliver Schmidt, Carsten Boffetta, Paolo Boniol, Mathieu Bota, Maria Carter, Kim W deKlerk, Nick Dibben, Chris Francis, Richard W Hiekkalinna, Tero Hveem, Kristian Kvaloy, Kirsti Millar, Sean Perry, Ivan J Peters, Annette Phillips, Catherine M Popham, Frank Raab, Gillian Reischl, Eva Sheehan, Nuala Waldenberger, Melanie Perola, Markus van den Heuvel, Edwin Macleod, John Knoppers, Bartha M Stolk, Ronald P Fortier, Isabel Harris, Jennifer R Woffenbuttel, Bruce H R Murtagh, Madeleine J Ferretti, Vincent Burton, Paul R eng MC_UP_A540₁021/Medical Research Council/United Kingdom MR/K006525/1/Medical Research Council/United Kingdom Medical Research Council/United Kingdom Wellcome Trust/United Kingdom Research Support, Non-U.S. Gov't England 2014/09/30 06:00 Int J Epidemiol. 2014 Dec;43(6):1929-44. doi: 10.1093/ije/dyu188. Epub 2014 Sep 26.; International audience; BACKGROUND: Research in modern biomedicine and social science requires sample sizes so large that they can often only be achieved through a pooled co-analysis of data from several studies. But the pooling of information from individuals in a central database that may be queried by researchers raises important ethico-legal questions and can be controversial. In the UK this has been highlighted by recent debate and controversy relating to the UK's proposed 'care.data' initiative, and these issues reflect important societal and professional concerns about privacy, confidentiality and intellectual property. DataSHIELD provides a novel technological solution that can circumvent some of the most basic challenges in facilitating the access of researchers and other healthcare professionals to individual-level data. METHODS: Commands are sent from a central analysis computer (AC) to several data computers (DCs) storing the data to be co-analysed. The data sets are analysed simultaneously but in parallel. The separate parallelized analyses are linked by non-disclosive summary statistics and commands transmitted back and forth between the DCs and the AC. This paper describes the technical implementation of DataSHIELD using a modified R statistical environment linked to an Opal database deployed behind the computer firewall of each DC. Analysis is controlled through a standard R environment at the AC. RESULTS: Based on this Opal/R implementation, DataSHIELD is currently used by the Healthy Obese Project and the Environmental Core Project (BioSHaRE-EU) for the federated analysis of 10 data sets across eight European countries, and this illustrates the opportunities and challenges presented by the DataSHIELD approach. CONCLUSIONS: DataSHIELD facilitates important research in settings where: (i) a co-analysis of individual-level data from several studies is scientifically necessary but governance restrictions prohibit the release or sharing of some of the required data, and/or render data access unacceptably slow; (ii) a research group (e.g. in a developing nation) is particularly vulnerable to loss of intellectual property-the researchers want to fully share the information held in their data with national and international collaborators, but do not wish to hand over the physical data themselves; and (iii) a data set is to be included in an individual-level co-analysis but the physical size of the data precludes direct transfer to a new site for analysis.
- Subjects :
- Biomedical Research
Datashield
Elsi
Bioinformatics
Confidentiality
Disclosure
Distributed Computing
Intellectual Property
Pooled Analysis
Privacy
Databases, Factual
Epidemiology
Pooling
Datasets as Topic
Information Storage and Retrieval
DISEASE
Firewall (construction)
Medicine
pooled analysi
Biomedical Research Computational Biology *Computer Security *Confidentiality Databases
Data processing
PRIVACY
Intellectual property
General Medicine
bioinformatics
confidentiality
3142 Public health care science, environmental and occupational health
3. Good health
ELSI
INDIVIDUAL-LEVEL
pooled analysis
disclosure
Data Matters
education
MODELS
EPIDEMIOLOGIC RESEARCH
Pooled analysis
RS
distributed computing
DataSHIELD
Humans
QUALITY
GENOME-WIDE ASSOCIATION
Computer Security
METAANALYSIS
Data collection
bioinformatic
business.industry
DATASHAPER APPROACH
Computational Biology
intellectual property
Factual *Datasets as Topic Great Britain Humans *Information Storage and Retrieval DataSHIELD Elsi bioinformatics confidentiality disclosure distributed computing intellectual property pooled analysis privacy
Data science
United Kingdom
Distributed computing
Data set
Data access
COHORT PROFILE
[SDV.SPEE]Life Sciences [q-bio]/Santé publique et épidémiologie
business
Subjects
Details
- Language :
- English
- ISSN :
- 03005771
- Database :
- OpenAIRE
- Journal :
- Int. J. Epidemiol. 43, 1929-1944 (2014), International Journal of Epidemiology, 43(6), 1929-1944. Oxford University Press, Gaye, A, Marcon, Y, Isaeva, J, Laflamme, P, Turner, A, Jones, E M, Minion, J, Boyd, A W, Newby, C J, Nuotio, M L, Wilson, R, Butters, O, Murtagh, B, Demir, I, Doiron, D, Giepmans, L, Wallace, S E, Budin-ljøsne, I, Oliver schmidt, C, Boffetta, P, Boniol, M, Bota, M, Carter, K W, Deklerk, N, Dibben, C, Francis, R W, Hiekkalinna, T, Hveem, K, Kvaløy, K, Millar, S, Perry, I J, Peters, A, Phillips, C M, Popham, F, Raab, G, Reischl, E, Sheehan, N, Waldenberger, M, Perola, M, Van den heuvel, E, Macleod, J, Knoppers, B M, Stolk, R P, Fortier, I, Harris, J R, Woffenbuttel, B H R, Murtagh, M J, Ferretti, V & Burton, P R 2014, ' DataSHIELD : taking the analysis to the data, not the data to the analysis ', International Journal of Epidemiology, vol. 43, no. 6, dyu188, pp. 1929-1944 . https://doi.org/10.1093/ije/dyu188, International Journal of Epidemiology, Int J Epidemiol, Int J Epidemiol, 2014, 43 (6), pp.1929-44. ⟨10.1093/ije/dyu188⟩
- Accession number :
- edsair.doi.dedup.....09ba4a13ce06c3c7bc507002847a2515