Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Large-Scale Uniform Analysis of Cancer Whole Genomes in Multiple Computing Environments

View ORCID ProfileChristina K. Yung, Brian D. O’Connor, Sergei Yakneen, Junjun Zhang, Kyle Ellrott, Kortine Kleinheinz, Naoki Miyoshi, Keiran M. Raine, Romina Royo, Gordon B. Saksena, Matthias Schlesner, Solomon I. Shorser, Miguel Vazquez, Joachim Weischenfeldt, Denis Yuen, Adam P. Butler, Brandi N. Davis-Dusenbery, Roland Eils, Vincent Ferretti, Robert L. Grossman, Olivier Harismendy, Youngwook Kim, Hidewaki Nakagawa, Steven J. Newhouse, David Torrents, Lincoln D. Stein, on behalf of the PCAWG Technical Working Group, Javier Bartolomé Rodriguez, Keith A. Boroevich, Rich Boyce, Angela N. Brooks, Alex Buchanan, Ivo Buchhalter, Niall J. Byrne, Andy Cafferkey, Peter J. Campbell, Zhaohong Chen, Sunghoon Cho, Wan Choi, Peter Clapham, Francisco M. De La Vega, Jonas Demeulemeester, Michelle T. Dow, Lewis J. Dursi, Juergen Eils, Claudiu Farcas, Francesco Favero, Nodirjon Fayzullaev, Paul Flicek, Nuno A. Fonseca, Josep L.l. Gelpi, Gad Getz, Bob Gibson, Michael C. Heinold, Julian M. Hess, Oliver Hofmann, Jongwhi H. Hong, Thomas J. Hudson, Daniel Huebschmann, Barbara Hutter, Carolyn M. Hutter, Seiya Imoto, Sinisa Ivkovic, Seung-Hyup Jeon, Wei Jiao, Jongsun Jung, Rolf Kabbe, Andre Kahles, Jules Kerssemakers, Hyunghwan Kim, Hyung-Lae Kim, Jihoon Kim, Jan O. Korbel, Michael Koscher, Antonios Koures, Milena Kovacevic, Chris Lawerenz, Ignaty Leshchiner, Dimitri G. Livitz, George L. Mihaiescu, Sanja Mijalkovic, Ana Mijalkovic Lazic, Satoru Miyano, Hardeep K. Nahal, Mia Nastic, Jonathan Nicholson, David Ocana, Kazuhiro Ohi, Lucila Ohno-Machado, Larsson Omberg, B.F. Francis Ouellette, Nagarajan Paramasivam, Marc D. Perry, Todd D. Pihl, Manuel Prinz, Montserrat Puiggròs, Petar Radovic, Esther Rheinbay, Mara W. Rosenberg, Charles Short, Heidi J. Sofia, Jonathan Spring, Adam J. Struck, Grace Tiao, Nebojsa Tijanic, Peter Van Loo, David Vicente, Jeremiah A. Wala, Zhining Wang, Johannes Werner, Ashley Williams, Youngchoon Woo, Adam J. Wright, Qian Xiang, the PCAWG Network
doi: https://doi.org/10.1101/161638
Christina K. Yung
1Informatics and Biocomputing Program, Ontario Institute for Cancer Research, Toronto, Ontario, M5G 0A3, Canada.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Christina K. Yung
Brian D. O’Connor
1Informatics and Biocomputing Program, Ontario Institute for Cancer Research, Toronto, Ontario, M5G 0A3, Canada.
2UC Santa Cruz Genomics Institute, University of California Santa Cruz, Santa Cruz, California, 95065, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Sergei Yakneen
1Informatics and Biocomputing Program, Ontario Institute for Cancer Research, Toronto, Ontario, M5G 0A3, Canada.
3Genome Biology Unit, European Molecular Biology Laboratory, Heidelberg, Baden-Württemberg, 69120, Germany.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Junjun Zhang
1Informatics and Biocomputing Program, Ontario Institute for Cancer Research, Toronto, Ontario, M5G 0A3, Canada.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Kyle Ellrott
4Department of Computational Biology, Oregon Health and Science University, Portland, Oregon, 97239, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Kortine Kleinheinz
5Division of Theoretical Bioinformatics, German Cancer Research Center (DKFZ), Heidelberg, Baden-Württemberg, 69120, Germany.
6Department for Bioinformatics and Functional Genomics, Institutefor Pharmacy and Molecular Biotechnology and BioQuant, Heidelberg University, Heidelberg, Baden-Württemberg, 69120, Germany.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Naoki Miyoshi
7Human Genome Center, Institute of Medical Science, University of Tokyo, Tokyo,108-8639, Japan.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Keiran M. Raine
8Cancer Ageing and Somatic Mutation Programme, Wellcome Trust Sanger Institute, Hinxton, Cambridgeshire, CB10 1SA,United Kingdom.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Romina Royo
9Department of Life Sciences, Barcelona Supercomputing Center, Barcelona, Catalunya, 8034, Spain.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Gordon B. Saksena
10Cancer Program, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, 02142, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Matthias Schlesner
5Division of Theoretical Bioinformatics, German Cancer Research Center (DKFZ), Heidelberg, Baden-Württemberg, 69120, Germany.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Solomon I. Shorser
1Informatics and Biocomputing Program, Ontario Institute for Cancer Research, Toronto, Ontario, M5G 0A3, Canada.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Miguel Vazquez
11Structural Computational Biology Group, Centro Nacional de Investigaciones Oncologicas, Madrid, Madrid, 28029, Spain.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Joachim Weischenfeldt
3Genome Biology Unit, European Molecular Biology Laboratory, Heidelberg, Baden-Württemberg, 69120, Germany.
12BRIC/Finsen Laboratory, Rigshospitalet, Copenhagen, 2200, Denmark.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Denis Yuen
1Informatics and Biocomputing Program, Ontario Institute for Cancer Research, Toronto, Ontario, M5G 0A3, Canada.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Adam P. Butler
8Cancer Ageing and Somatic Mutation Programme, Wellcome Trust Sanger Institute, Hinxton, Cambridgeshire, CB10 1SA,United Kingdom.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Brandi N. Davis-Dusenbery
13Seven Bridges, Cambridge, Massachusetts, 02142, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Roland Eils
14Theoretical Bioinformatics, German Cancer Research Center (DKFZ), Heidelberg, Baden-Württemberg, 69120, Germany.
6Department for Bioinformatics and Functional Genomics, Institutefor Pharmacy and Molecular Biotechnology and BioQuant, Heidelberg University, Heidelberg, Baden-Württemberg, 69120, Germany.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Vincent Ferretti
1Informatics and Biocomputing Program, Ontario Institute for Cancer Research, Toronto, Ontario, M5G 0A3, Canada.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Robert L. Grossman
15Center for Data Intensive Science, University of Chicago, Chicago, Illinois,60637, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Olivier Harismendy
16Department of Medicine, University of California San Diego, San Diego, California, 92093, USA.
17Moores Cancer Center, Department of Medicine, Division of Biomedical Informatics, University of California San Diego, San Diego, California, 92093, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Youngwook Kim
18Samsung Advanced Institute of Health Science and Technology, Sungkyunkwan University, School of Medicine, Seoul, 135-710, South Korea.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Hidewaki Nakagawa
19Laboratory for Genome Sequencing Analysis, RIKEN Center for Integrative Medical Sciences, Tokyo, 108-8639, Japan.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Steven J. Newhouse
20Technical Services Cluster, European Molecular Biology Laboratory, European Bioinforamtics Institute, Hinxton, Cambridge, CB10 1SD,United Kingdom.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
David Torrents
9Department of Life Sciences, Barcelona Supercomputing Center, Barcelona, Catalunya, 8034, Spain.
21Institució Catalana de Recerca i Estudis Avançats, Barcelona, Catalunya, 8010, Spain.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Lincoln D. Stein
1Informatics and Biocomputing Program, Ontario Institute for Cancer Research, Toronto, Ontario, M5G 0A3, Canada.
22Department of Molecular Genetics, University of Toronto, Toronto, Ontario, M5S 1A1, Canada.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: lincoln.stein@gmail.com
Javier Bartolomé Rodriguez
1Department of Operations, Barcelona Supercomputing Center, Barcelona, Catalunya, 8034, Spain.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Keith A. Boroevich
2Laboratory for Medical Science Mathematics, RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, 230-0045, Japan.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Rich Boyce
3European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge, CB10 1SD, United Kingdom.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Angela N. Brooks
4Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, California, 95065, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Alex Buchanan
5Department of Computational Biology, Oregon Health and Science University, Portland, Oregon,97239, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ivo Buchhalter
6Division of Theoretical Bioinformatics, German Cancer Research Center (DKFZ), Heidelberg, Baden-Württemberg, 69120, Germany.
7Department for Bioinformatics and Functional Genomics, Institute for Pharmacy and Molecular Biotechnology and BioQuant, Heidelberg University, Heidelberg, Baden-Württemberg, 69120, Germany.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Niall J. Byrne
8Informatics and Biocomputing Program, Ontario Institute for Cancer Research, Toronto, Ontario, M5G 0A3, Canada.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Andy Cafferkey
9Technical Services Cluster, European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge, CB10 1SD, United Kingdom.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Peter J. Campbell
10Cancer Genome Project, Wellcome Trust Sanger Institute, Hinxton, Cambridgeshire, CB10 1SA, United Kingdom.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Zhaohong Chen
11Department of Medicine, University of California San Diego, San Diego, California, 92093, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Sunghoon Cho
12PDXen Biosystems Inc., Seoul, 4900, South Korea.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Wan Choi
13Electronics and Telecommunications Research Institute, Daejon, 34129, South Korea.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Peter Clapham
14Informatics Support Group, Wellcome Trust Sanger Institute, Hinxton, Cambridgeshire, CB10 1SA, United Kingdom.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Francisco M. De La Vega
15Department of Biomedical Data Science, Stanford University School of Medicine, Stanford, California, 94305, USA.
16Annai Systems, Inc., Carlsbad, California, 92011, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jonas Demeulemeester
17The Francis Crick Institute, London, NW1 1AT, United Kingdom.
18Department of Human Genetics, University of Leuven, B-3000 Leuven, Belgium.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Michelle T. Dow
19Biomedical Informatics, University of California San Diego, San Diego, California, 92093, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Lewis J. Dursi
8Informatics and Biocomputing Program, Ontario Institute for Cancer Research, Toronto, Ontario, M5G 0A3, Canada.
20The Centre for Computational Medicine, The Hospital for Sick Children, Toronto, Ontario, M5G 0A4, Canada.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Juergen Eils
21Theoretical Bioinformatics, German Cancer Research Center (DKFZ), Heidelberg, Baden-Württemberg, 69120, Germany.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Claudiu Farcas
22Health System Department of Biomedical Informatics, University of California San Diego, La Jolla, California, 92093, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Francesco Favero
23BRIC/Finsen Laboratory, Rigshospitalet, Copenhagen, 2200, Denmark.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Nodirjon Fayzullaev
8Informatics and Biocomputing Program, Ontario Institute for Cancer Research, Toronto, Ontario, M5G 0A3, Canada.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Paul Flicek
3European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge, CB10 1SD, United Kingdom.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Nuno A. Fonseca
3European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge, CB10 1SD, United Kingdom.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Josep L.l. Gelpi
24Department of Life 528 Sciences, Barcelona Supercomputing Center, Barcelona, Catalunya, 8034, Spain.
25Department of Biochemistry and Molecular Biomedicine, University of Barcelona, Barcelona, Catalunya, 8028, Spain.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Gad Getz
26Cancer Program, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, 02142, USA.
27Cancer Center and Department of Pathology, Massachusetts General Hospital, Boston, Massachusetts, 02114, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Bob Gibson
8Informatics and Biocomputing Program, Ontario Institute for Cancer Research, Toronto, Ontario, M5G 0A3, Canada.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Michael C. Heinold
6Division of Theoretical Bioinformatics, German Cancer Research Center (DKFZ), Heidelberg, Baden-Württemberg, 69120, Germany.
7Department for Bioinformatics and Functional Genomics, Institute for Pharmacy and Molecular Biotechnology and BioQuant, Heidelberg University, Heidelberg, Baden-Württemberg, 69120, Germany.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Julian M. Hess
26Cancer Program, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, 02142, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Oliver Hofmann
28Center for Cancer Research, University of Melbourne, Melbourne, VIC 3001, Australia.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jongwhi H. Hong
29Genome Data Integration Center, Syntekabio Inc., Daejon, 34025, South Korea.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Thomas J. Hudson
30Genomics Program, Ontario Institute for Cancer Research, Toronto, Ontario, M5G 534 0A3, Canada.
31Oncology Discovery and Early Development, AbbVie, Redwood City, California, 94063, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Daniel Huebschmann
6Division of Theoretical Bioinformatics, German Cancer Research Center (DKFZ), Heidelberg, Baden-Württemberg, 69120, Germany.
7Department for Bioinformatics and Functional Genomics, Institute for Pharmacy and Molecular Biotechnology and BioQuant, Heidelberg University, Heidelberg, Baden-Württemberg, 69120, Germany.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Barbara Hutter
32Division of Applied Bioinformatics, German Cancer Research Center (DKFZ), Heidelberg, Baden-Württemberg, 536 69120, Germany.
33Division of Applied Bioinformatics, National Center for Tumor Diseases, Heidelberg, Baden-Württemberg, 69120, Germany.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Carolyn M. Hutter
34Division of Genomic Medicine, National Human Genome Research Institute Bethesda, Maryland, 20852, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Seiya Imoto
35Health Intelligence Center, Institute of Medical Science, University of Tokyo, Tokyo, 108-8639, Japan.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Sinisa Ivkovic
36Seven Bridges, Cambridge, Massachusetts, 02142, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Seung-Hyup Jeon
13Electronics and Telecommunications Research Institute, Daejon, 34129, South Korea.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Wei Jiao
8Informatics and Biocomputing Program, Ontario Institute for Cancer Research, Toronto, Ontario, M5G 0A3, Canada.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jongsun Jung
37Genome Data Integration Center, Syntekabio Inc., Daejon, 34025, South Korea.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Rolf Kabbe
6Division of Theoretical Bioinformatics, German Cancer Research Center (DKFZ), Heidelberg, Baden-Württemberg, 69120, Germany.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Andre Kahles
38Department of Computer Science, ETH Zurich, Zurich, Zurich, 8092, Switzerland.
39Computational Biology Center, Memorial Sloan Kettering Cancer Center, New York, New York, 10065, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jules Kerssemakers
40German Cancer Research Center (DKFZ), Heidelberg, Baden-Württemberg, 69120, Germany.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Hyunghwan Kim
13Electronics and Telecommunications Research Institute, Daejon, 34129, South Korea.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Hyung-Lae Kim
41Department of Biochemistry, Ewha Womans University, Seoul, O7985, South Korea.
42PGM21, Seoul, O7985, South Korea.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jihoon Kim
11Department of Medicine, University of California San Diego, San Diego, California, 92093, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jan O. Korbel
3European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge, CB10 1SD, United Kingdom.
43Genome Biology Unit, European Molecular Biology Laboratory, Heidelberg, Baden-Württemberg, 69120, Germany.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Michael Koscher
40German Cancer Research Center (DKFZ), Heidelberg, Baden-Württemberg, 69120, Germany.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Antonios Koures
11Department of Medicine, University of California San Diego, San Diego, California, 92093, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Milena Kovacevic
36Seven Bridges, Cambridge, Massachusetts, 02142, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Chris Lawerenz
6Division of Theoretical Bioinformatics, German Cancer Research Center (DKFZ), Heidelberg, Baden-Württemberg, 69120, Germany.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ignaty Leshchiner
26Cancer Program, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, 02142, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Dimitri G. Livitz
26Cancer Program, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, 02142, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
George L. Mihaiescu
8Informatics and Biocomputing Program, Ontario Institute for Cancer Research, Toronto, Ontario, M5G 0A3, Canada.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Sanja Mijalkovic
36Seven Bridges, Cambridge, Massachusetts, 02142, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ana Mijalkovic Lazic
36Seven Bridges, Cambridge, Massachusetts, 02142, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Satoru Miyano
44Human Genome Center, Institute of Medical Science, University of Tokyo, Tokyo, 108-8639, Japan.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Hardeep K. Nahal
8Informatics and Biocomputing Program, Ontario Institute for Cancer Research, Toronto, Ontario, M5G 0A3, Canada.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Mia Nastic
36Seven Bridges, Cambridge, Massachusetts, 02142, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jonathan Nicholson
14Informatics Support Group, Wellcome Trust Sanger Institute, Hinxton, Cambridgeshire, CB10 1SA, United Kingdom.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
David Ocana
3European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge, CB10 1SD, United Kingdom.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Kazuhiro Ohi
44Human Genome Center, Institute of Medical Science, University of Tokyo, Tokyo, 108-8639, Japan.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Lucila Ohno-Machado
22Health System Department of Biomedical Informatics, University of California San Diego, La Jolla, California, 92093, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Larsson Omberg
45Systems Biology, Sage Bionetworks, Seattle, Washington, 98112, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
B.F. Francis Ouellette
8Informatics and Biocomputing Program, Ontario Institute for Cancer Research, Toronto, Ontario, M5G 0A3, Canada.
46Department of Cell and Systems Biology, University of Toronto, Toronto, Ontario, M5S 3G5, Canada.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Nagarajan Paramasivam
6Division of Theoretical Bioinformatics, German Cancer Research Center (DKFZ), Heidelberg, Baden-Württemberg, 69120, Germany.
47Medical Faculty Heidelberg, Heidelberg University, Heidelberg, Baden-Württemberg, 69120, Germany.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Marc D. Perry
8Informatics and Biocomputing Program, Ontario Institute for Cancer Research, Toronto, Ontario, M5G 0A3, Canada.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Todd D. Pihl
48CSRA Incorporated, Fairfax, Virginia, 22042, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Manuel Prinz
6Division of Theoretical Bioinformatics, German Cancer Research Center (DKFZ), Heidelberg, Baden-Württemberg, 69120, Germany.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Montserrat Puiggròs
24Department of Life 528 Sciences, Barcelona Supercomputing Center, Barcelona, Catalunya, 8034, Spain.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Petar Radovic
36Seven Bridges, Cambridge, Massachusetts, 02142, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Esther Rheinbay
26Cancer Program, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, 02142, USA.
49Cancer Center, Massachusetts General Hospital, Boston, Massachusetts, 02114, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Mara W. Rosenberg
26Cancer Program, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, 02142, USA.
49Cancer Center, Massachusetts General Hospital, Boston, Massachusetts, 02114, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Charles Short
3European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge, CB10 1SD, United Kingdom.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Heidi J. Sofia
50National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland, 20892-551 9305, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jonathan Spring
51Center for Data Intensive Science, University of Chicago, Chicago, Illinois, 60637, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Adam J. Struck
5Department of Computational Biology, Oregon Health and Science University, Portland, Oregon,97239, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Grace Tiao
26Cancer Program, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, 02142, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Nebojsa Tijanic
36Seven Bridges, Cambridge, Massachusetts, 02142, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Peter Van Loo
17The Francis Crick Institute, London, NW1 1AT, United Kingdom.
18Department of Human Genetics, University of Leuven, B-3000 Leuven, Belgium.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
David Vicente
1Department of Operations, Barcelona Supercomputing Center, Barcelona, Catalunya, 8034, Spain.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jeremiah A. Wala
26Cancer Program, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, 02142, USA.
52Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, Massachusetts, 02115, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Zhining Wang
53TCGA 553 Program Office, National Cancer Institute, Bethesda, Maryland, 20892, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Johannes Werner
6Division of Theoretical Bioinformatics, German Cancer Research Center (DKFZ), Heidelberg, Baden-Württemberg, 69120, Germany.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ashley Williams
11Department of Medicine, University of California San Diego, San Diego, California, 92093, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Youngchoon Woo
13Electronics and Telecommunications Research Institute, Daejon, 34129, South Korea.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Adam J. Wright
8Informatics and Biocomputing Program, Ontario Institute for Cancer Research, Toronto, Ontario, M5G 0A3, Canada.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Qian Xiang
8Informatics and Biocomputing Program, Ontario Institute for Cancer Research, Toronto, Ontario, M5G 0A3, Canada.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Preview PDF
Loading

Abstract

The International Cancer Genome Consortium (ICGC)’s Pan-Cancer Analysis of Whole Genomes (PCAWG) project aimed to categorize somatic and germline variations in both coding and non-coding regions in over 2,800 cancer patients. To provide this dataset to the research working groups for downstream analysis, the PCAWG Technical Working Group marshalled ~800TB of sequencing data from distributed geographical locations; developed portable software for uniform alignment, variant calling, artifact filtering and variant merging; performed the analysis in a geographically and technologically disparate collection of compute environments; and disseminated high-quality validated consensus variants to the working groups. The PCAWG dataset has been mirrored to multiple repositories and can be located using the ICGC Data Portal. The PCAWG workflows are also available as Docker images through Dockstore enabling researchers to replicate our analysis on their own data.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission.
Back to top
PreviousNext
Posted July 10, 2017.
Download PDF
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Large-Scale Uniform Analysis of Cancer Whole Genomes in Multiple Computing Environments
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Large-Scale Uniform Analysis of Cancer Whole Genomes in Multiple Computing Environments
Christina K. Yung, Brian D. O’Connor, Sergei Yakneen, Junjun Zhang, Kyle Ellrott, Kortine Kleinheinz, Naoki Miyoshi, Keiran M. Raine, Romina Royo, Gordon B. Saksena, Matthias Schlesner, Solomon I. Shorser, Miguel Vazquez, Joachim Weischenfeldt, Denis Yuen, Adam P. Butler, Brandi N. Davis-Dusenbery, Roland Eils, Vincent Ferretti, Robert L. Grossman, Olivier Harismendy, Youngwook Kim, Hidewaki Nakagawa, Steven J. Newhouse, David Torrents, Lincoln D. Stein, on behalf of the PCAWG Technical Working Group, Javier Bartolomé Rodriguez, Keith A. Boroevich, Rich Boyce, Angela N. Brooks, Alex Buchanan, Ivo Buchhalter, Niall J. Byrne, Andy Cafferkey, Peter J. Campbell, Zhaohong Chen, Sunghoon Cho, Wan Choi, Peter Clapham, Francisco M. De La Vega, Jonas Demeulemeester, Michelle T. Dow, Lewis J. Dursi, Juergen Eils, Claudiu Farcas, Francesco Favero, Nodirjon Fayzullaev, Paul Flicek, Nuno A. Fonseca, Josep L.l. Gelpi, Gad Getz, Bob Gibson, Michael C. Heinold, Julian M. Hess, Oliver Hofmann, Jongwhi H. Hong, Thomas J. Hudson, Daniel Huebschmann, Barbara Hutter, Carolyn M. Hutter, Seiya Imoto, Sinisa Ivkovic, Seung-Hyup Jeon, Wei Jiao, Jongsun Jung, Rolf Kabbe, Andre Kahles, Jules Kerssemakers, Hyunghwan Kim, Hyung-Lae Kim, Jihoon Kim, Jan O. Korbel, Michael Koscher, Antonios Koures, Milena Kovacevic, Chris Lawerenz, Ignaty Leshchiner, Dimitri G. Livitz, George L. Mihaiescu, Sanja Mijalkovic, Ana Mijalkovic Lazic, Satoru Miyano, Hardeep K. Nahal, Mia Nastic, Jonathan Nicholson, David Ocana, Kazuhiro Ohi, Lucila Ohno-Machado, Larsson Omberg, B.F. Francis Ouellette, Nagarajan Paramasivam, Marc D. Perry, Todd D. Pihl, Manuel Prinz, Montserrat Puiggròs, Petar Radovic, Esther Rheinbay, Mara W. Rosenberg, Charles Short, Heidi J. Sofia, Jonathan Spring, Adam J. Struck, Grace Tiao, Nebojsa Tijanic, Peter Van Loo, David Vicente, Jeremiah A. Wala, Zhining Wang, Johannes Werner, Ashley Williams, Youngchoon Woo, Adam J. Wright, Qian Xiang, the PCAWG Network
bioRxiv 161638; doi: https://doi.org/10.1101/161638
Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
Citation Tools
Large-Scale Uniform Analysis of Cancer Whole Genomes in Multiple Computing Environments
Christina K. Yung, Brian D. O’Connor, Sergei Yakneen, Junjun Zhang, Kyle Ellrott, Kortine Kleinheinz, Naoki Miyoshi, Keiran M. Raine, Romina Royo, Gordon B. Saksena, Matthias Schlesner, Solomon I. Shorser, Miguel Vazquez, Joachim Weischenfeldt, Denis Yuen, Adam P. Butler, Brandi N. Davis-Dusenbery, Roland Eils, Vincent Ferretti, Robert L. Grossman, Olivier Harismendy, Youngwook Kim, Hidewaki Nakagawa, Steven J. Newhouse, David Torrents, Lincoln D. Stein, on behalf of the PCAWG Technical Working Group, Javier Bartolomé Rodriguez, Keith A. Boroevich, Rich Boyce, Angela N. Brooks, Alex Buchanan, Ivo Buchhalter, Niall J. Byrne, Andy Cafferkey, Peter J. Campbell, Zhaohong Chen, Sunghoon Cho, Wan Choi, Peter Clapham, Francisco M. De La Vega, Jonas Demeulemeester, Michelle T. Dow, Lewis J. Dursi, Juergen Eils, Claudiu Farcas, Francesco Favero, Nodirjon Fayzullaev, Paul Flicek, Nuno A. Fonseca, Josep L.l. Gelpi, Gad Getz, Bob Gibson, Michael C. Heinold, Julian M. Hess, Oliver Hofmann, Jongwhi H. Hong, Thomas J. Hudson, Daniel Huebschmann, Barbara Hutter, Carolyn M. Hutter, Seiya Imoto, Sinisa Ivkovic, Seung-Hyup Jeon, Wei Jiao, Jongsun Jung, Rolf Kabbe, Andre Kahles, Jules Kerssemakers, Hyunghwan Kim, Hyung-Lae Kim, Jihoon Kim, Jan O. Korbel, Michael Koscher, Antonios Koures, Milena Kovacevic, Chris Lawerenz, Ignaty Leshchiner, Dimitri G. Livitz, George L. Mihaiescu, Sanja Mijalkovic, Ana Mijalkovic Lazic, Satoru Miyano, Hardeep K. Nahal, Mia Nastic, Jonathan Nicholson, David Ocana, Kazuhiro Ohi, Lucila Ohno-Machado, Larsson Omberg, B.F. Francis Ouellette, Nagarajan Paramasivam, Marc D. Perry, Todd D. Pihl, Manuel Prinz, Montserrat Puiggròs, Petar Radovic, Esther Rheinbay, Mara W. Rosenberg, Charles Short, Heidi J. Sofia, Jonathan Spring, Adam J. Struck, Grace Tiao, Nebojsa Tijanic, Peter Van Loo, David Vicente, Jeremiah A. Wala, Zhining Wang, Johannes Werner, Ashley Williams, Youngchoon Woo, Adam J. Wright, Qian Xiang, the PCAWG Network
bioRxiv 161638; doi: https://doi.org/10.1101/161638

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genomics
Subject Areas
All Articles
  • Animal Behavior and Cognition (4119)
  • Biochemistry (8828)
  • Bioengineering (6532)
  • Bioinformatics (23484)
  • Biophysics (11805)
  • Cancer Biology (9223)
  • Cell Biology (13336)
  • Clinical Trials (138)
  • Developmental Biology (7442)
  • Ecology (11425)
  • Epidemiology (2066)
  • Evolutionary Biology (15173)
  • Genetics (10453)
  • Genomics (14056)
  • Immunology (9187)
  • Microbiology (22199)
  • Molecular Biology (8823)
  • Neuroscience (47626)
  • Paleontology (351)
  • Pathology (1431)
  • Pharmacology and Toxicology (2493)
  • Physiology (3736)
  • Plant Biology (8090)
  • Scientific Communication and Education (1438)
  • Synthetic Biology (2224)
  • Systems Biology (6042)
  • Zoology (1254)