New Results

CanDIG: Secure Federated Genomic Queries and Analyses Across Jurisdictions

L. Jonathan Dursi, Zoltan Bozoky, Richard de Borja, Jimmy Li, David Bujold, Adam Lipski, Shaikh Farhan Rashid, Amanjeev Sethi, Neelam Memon, Dashaylan Naidoo, Felipe Coral-Sasso, Matthew Wong, P-O Quirion, Zhibin Lu, Samarth Agarwal, Kat Pavlov, Andrew Ponomarev, Mia Husic, Krista Pace, Samantha L. Palmer, Stephanie A. Grover, Sevan Hakgor, Lillian L. Siu, David Malkin, Carl Virtanen, Trevor J. Pugh, Pierre-Étienne Jacques, Yann Joly, Steven J. M. Jones, Guillaume Bourque, Michael Brudno
doi: https://doi.org/10.1101/2021.03.30.434101
L. Jonathan Dursi
1DATA Team, University Health Network, Toronto, ON, Canada
  • For correspondence: jonathan.dursi@uhn.ca brudno@cs.toronto.edu
Zoltan Bozoky
2Canada’s Michael Smith Genome Sciences Centre, BC Cancer Research Institute, Provincial Health Services Authority, Vancouver, BC, Canada
16Digital Products, Providence Health Care, Vancouver, BC, Canada
Richard de Borja
3Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada
17Ontario Institute of Cancer Research, Toronto, ON, Canada
Jimmy Li
2Canada’s Michael Smith Genome Sciences Centre, BC Cancer Research Institute, Provincial Health Services Authority, Vancouver, BC, Canada
David Bujold
4McGill University, Montreal, Québec, Canada
5Canadian Centre for Computational Genomics, Montréal, QC, Canada
Adam Lipski
2Canada’s Michael Smith Genome Sciences Centre, BC Cancer Research Institute, Provincial Health Services Authority, Vancouver, BC, Canada
18Zymeworks, Vancouver, BC, Canada
Shaikh Farhan Rashid
1DATA Team, University Health Network, Toronto, ON, Canada
Amanjeev Sethi
1DATA Team, University Health Network, Toronto, ON, Canada
Neelam Memon
2Canada’s Michael Smith Genome Sciences Centre, BC Cancer Research Institute, Provincial Health Services Authority, Vancouver, BC, Canada
Dashaylan Naidoo
2Canada’s Michael Smith Genome Sciences Centre, BC Cancer Research Institute, Provincial Health Services Authority, Vancouver, BC, Canada
Felipe Coral-Sasso
2Canada’s Michael Smith Genome Sciences Centre, BC Cancer Research Institute, Provincial Health Services Authority, Vancouver, BC, Canada
Matthew Wong
6Centre for Computational Medicine, Hospital for Sick Children, Toronto, ON, Canada
P-O Quirion
4McGill University, Montreal, Québec, Canada
5Canadian Centre for Computational Genomics, Montréal, QC, Canada
Zhibin Lu
9University Health Network, Toronto, ON, Canada
Samarth Agarwal
1DATA Team, University Health Network, Toronto, ON, Canada
Kat Pavlov
1DATA Team, University Health Network, Toronto, ON, Canada
Andrew Ponomarev
2Canada’s Michael Smith Genome Sciences Centre, BC Cancer Research Institute, Provincial Health Services Authority, Vancouver, BC, Canada
19Sunbay B.V. Amsterdam, North Holland, Netherlands
Mia Husic
6Centre for Computational Medicine, Hospital for Sick Children, Toronto, ON, Canada
Krista Pace
1DATA Team, University Health Network, Toronto, ON, Canada
Samantha L. Palmer
1DATA Team, University Health Network, Toronto, ON, Canada
Stephanie A. Grover
7Hospital for Sick Children, University of Toronto, Toronto, Ontario, Canada
Sevan Hakgor
3Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada
Lillian L. Siu
3Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada
David Malkin
8Division of Haematology/Oncology, The Hospital for Sick Children, Department of Pediatrics, University of Toronto, Toronto, ON, Canada
Carl Virtanen
9University Health Network, Toronto, ON, Canada
Trevor J. Pugh
10Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada
3Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada
11Ontario Institute of Cancer Research, Toronto, ON, Canada
Pierre-Étienne Jacques
12Département de biologie, Université de Sherbrooke, Sherbrooke, QC, Canada
Yann Joly
13Centre of Genomics and Policy, Department of Human Genetics, McGill University, Montreal, QC, Canada
Steven J. M. Jones
2Canada’s Michael Smith Genome Sciences Centre, BC Cancer Research Institute, Provincial Health Services Authority, Vancouver, BC, Canada
Guillaume Bourque
14Department of Human Genetics, McGill University, Montreal, Québec, Canada
Michael Brudno
15Department of Computer Science, University of Toronto, Toronto, ON, Canada
1DATA Team, University Health Network, Toronto, ON, Canada
  • For correspondence: jonathan.dursi@uhn.ca brudno@cs.toronto.edu

Abstract

Rapid expansions of bioinformatics and computational biology have broadened the collection and use of -omics data including genomic, transcriptomic, methylomic and a myriad of other health data types, in the clinic and the laboratory. Both clinical and research uses of such data require co-analysis with large datasets, for which participant privacy and the need for data custodian controls must remain paramount. This is particularly challenging in multi-jurisdictional settings, such as Canada, where health privacy and security requirements are often heterogeneous. Data federation presents a solution to this, allowing for integration and analysis of large datasets from various sites while abiding by local policies.

The Canadian Distributed Infrastructure for Genomics platform (CanDIG) enables federated querying and analysis of -omics and health data while keeping that data local and under local control. It builds upon existing infrastructures to connect five health and research institutions across Canada, relies heavily on standards and tooling brought together by the Global Alliance for Genomics and Health (GA4GH), implements a clear division of responsibilities among its participants and adheres to international data sharing standards. Participating researchers and clinicians can therefore contribute to and quickly access a critical mass of -omics data across a national network in a manner that takes into account the multi-jurisdictional nature of our privacy and security policies. In this way, CanDIG gives medical and research communities the tools needed to use and analyze the ever-growing amount of -omics data available to them, in order to improve our understanding and treatment of various conditions and diseases. CanDIG is being used to make genomic and phenotypic data available for querying across Canada as part of data sharing for five leading pan-Canadian projects, including the Terry Fox Comprehensive Cancer Care Centre Consortium Network (TF4CN) and Terry Fox PRecision Oncology For Young peopLE (PROFYLE), and to make data from provincial projects such as POG (Personalized Onco-Genomics) more widely available.
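
The general idea of federated querying under local control can be illustrated with a minimal sketch. This is not CanDIG's implementation or API: the names `Site`, `federated_count` and `min_count`, and the small-count suppression rule, are hypothetical and chosen only to show how each site can answer a query from its own data, apply its own policy, and return an aggregate rather than the underlying records.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional

# Hypothetical illustration only; none of these names come from CanDIG.


@dataclass
class Site:
    name: str
    records: List[dict]   # held locally; never returned to the caller
    min_count: int = 0    # assumed local policy: hide counts below this value

    def count(self, predicate: Callable[[dict], bool]) -> Optional[int]:
        """Evaluate the query locally and return only an aggregate count."""
        n = sum(1 for record in self.records if predicate(record))
        return n if n >= self.min_count else None  # None means "not disclosed"


def federated_count(sites: List[Site], predicate: Callable[[dict], bool]) -> Dict:
    """Send the same query to every site and combine the disclosed counts."""
    per_site = {site.name: site.count(predicate) for site in sites}
    total = sum(c for c in per_site.values() if c is not None)
    return {"per_site": per_site, "total_disclosed": total}


sites = [
    Site("site_a", [{"gene": "TP53"}, {"gene": "BRCA1"}, {"gene": "TP53"}], min_count=2),
    Site("site_b", [{"gene": "TP53"}], min_count=2),
]
result = federated_count(sites, lambda record: record["gene"] == "TP53")
print(result)
```

In this sketch the coordinating function sees only per-site counts, and a site whose policy forbids disclosing a small count simply declines to answer, which is one simple way to let heterogeneous local rules coexist within a single query.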

Competing Interest Statement

The authors have declared no competing interest.

Footnotes

  • http://github.com/candig

  • 2 https://www.genomecanada.ca/en/cancogen/cancogen-hostseq

  • 18 http://github.com/ga4gh/ga4gh-server

  • 19 https://www.ebi.ac.uk/ols/ontologies/duo

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Posted March 31, 2021.
bioRxiv 2021.03.30.434101; doi: https://doi.org/10.1101/2021.03.30.434101
Subject Area

  • Genomics