Promoting Open Science through ethical and efficient Sharing of Genomic Data for Research
Are you looking for (sharing) genome data? Discover more data with REPOSITIVE platform (supported by DNA Digest) that contributes to speeding up genetic diagnostics and research through efficient data access solutions. Moreover, this essay brings to your attention 10 Rules (presented in BioRxiv) that have been developed from combined experiences of working with human genomic data, data repositories and data users.
“Genomic research promises major advances in our understanding of health and disease. In parallel, sharing genomic data offers encouraging prospects to accelerate research by generating information-rich genome datasets…”, - Open sharing of genomic data: Who does it and why? |
REPOSITIVE - the company that created the world’s largest portal for accessing human genomic research data - is expanding the range of data available with the launch of a Specialist Data Collection for the Personal Genome Project. This Collection presents all of the data collected for the Personal Genome Project available in one place, combining data currently held in the US, UK, and Austria, and thus enabling researchers to quickly and easily find the data they need.
While endorsing the release of software as Open Source and doing this whenever practically and technically possible, the REPOSITIVE platform contributes to speeding up genetic diagnostics and powering research through efficient data access solutions.
REPOSITIVE enables you to build powerful search queries using predicated and boolean terms, filters to slice & dice results by accessibility, data source or assay type. Create account and browse in REPOSITIVE Data Discovery through multiple repositories for the human genomic data (organised into collections).
With REPOSITIVE platform you can:
- share datasets on the platform with colleagues,
- annotate them with public tags,
- add them to your favourites to quickly return to them later,
- discuss your work,
- meet new collaborators and
- actively promote Open Science (REPOSITIVE aims to make genomic data more accessible and make results more reproducible & citable).
"Open Science commonly refers to efforts to make the output of publicly funded research more widely accessible in digital format to the scientific community, the business sector, or society more generally. |
Considering that both enabling ethical data sharing and ensuring genetic privacy remains challenging, REPOSITIVE respects the individual's right to privacy, advocates full adherence to patient consent and data privacy laws, and encourages responsible custodianship of genomic data.
There are a number of international efforts underway to establish standards for good practice in sharing human genomic data:
|
Rule 1: Recognise the intrinsic value of the data |
Rule 2: Choose the appropriate patient consent framework |
Rule 3: Check whether support for data sharing is available |
Rule 4: Understand the datasets you generate |
Rule 5: Context is king: adhere to metadata standard descriptions |
Rule 6: Check the accuracy of your metadata |
Rule 7: Maximise the machine readability of your metadata |
Rule 8: Choose the most appropriate repository for your data |
Rule 9: Upload both raw and processed data |
Rule 10: Make it easy to cite your data |
can help researchers to increase the reusability of their human genomic data, whilst also ensuring that the privacy of their subjects is maintained according to their consent frameworks.
Many of the principles presented in the pre-print "10 Simple Rules for Sharing Human Genomic Data" are also applicable to other types of clinical research data, where participant privacy is a concern.
_____________________________________________________________________________________________________
Related content:
- RDA BioSharing Registry and RDA Wheat Data Interoperability Guidelines
- XCMS Online: a free metabolomics and systems biology data analysis platform
- Liberating species records from open data repositories
- Open Repositories 2017 CONFERENCE (26 to 30 June 2017)
- DCAT application profile for data portals in Europe: Towards an open government data ecosystem in Europe using common standards (European Commission)
- DNAdigest and Repositive: Connecting the World of Genomic Data (PLOS Biology)
- openSNP (licensed under the MIT License, the code is at GitHub) allows customers of direct-to-customer genetic tests to publish their test results, find others with similar genetic variations. The data is donated into the public domain using CC0 1.0. ù
- Developing an Information Asset Catalogue at ECDC (European Centre for Disease Prevention and Control) [see all presentations of the SEMIC2017 conference]
- Open science (Rationale and objectives, Major aspects), OECD
- Policies to promote Open Science: evidence from OECD Countries
- Open Science Training