The Institute for Systems Biology Cancer Genomics Cloud (ISB-CGC) provides access to datasets hosted on Google Cloud. These datasets are based on data from The Cancer Genome Atlas (TCGA) project.
The ISB-CGC also provides GitHub repositories for trying out sample queries and analyses using R, Python, and Datalab.
Dataset access
BigQuery datasets
You can access the following datasets in BigQuery for data exploration and querying. For more information, see ISB-CGC BigQuery tables.
- isb-cgc:TCGA_bioclin_v0
- isb-cgc:TCGA_hg19_data_v0
- isb-cgc:TCGA_hg38_data_v0
- isb-cgc:TARGET_bioclin_v0
- isb-cgc:TARGET_hg38_data_v0
- isb-cgc.metadata
- isb-cgc.GDC_metadata
- isb-cgc.tcga_seq_metadata
- isb-cgc.tcga_cohorts
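As a starting point for exploring these datasets, the following is a minimal Python sketch that lists the tables in one of them. It is an illustration under stated assumptions, not an official ISB-CGC example: it assumes the `google-cloud-bigquery` package is installed and that Application Default Credentials with a default project are configured. The names `DATASETS`, `to_standard_ref`, and `list_table_ids` are hypothetical helpers introduced here. Because the list above writes some references with a colon (`project:dataset`) and others with a dot (`project.dataset`), the sketch normalizes both to the dotted form that the Python client accepts.

```python
# Dataset references copied from the list above (colon and dot forms mixed).
DATASETS = [
    "isb-cgc:TCGA_bioclin_v0",
    "isb-cgc:TCGA_hg19_data_v0",
    "isb-cgc:TCGA_hg38_data_v0",
    "isb-cgc:TARGET_bioclin_v0",
    "isb-cgc:TARGET_hg38_data_v0",
    "isb-cgc.metadata",
    "isb-cgc.GDC_metadata",
    "isb-cgc.tcga_seq_metadata",
    "isb-cgc.tcga_cohorts",
]


def to_standard_ref(ref: str) -> str:
    """Convert 'project:dataset' to 'project.dataset'; dotted refs pass through."""
    return ref.replace(":", ".", 1)


def list_table_ids(dataset_ref: str) -> list:
    """Return the sorted table IDs in a dataset.

    Assumption: google-cloud-bigquery is installed and credentials are
    configured. The import is kept local so the helpers above remain
    usable without the library.
    """
    from google.cloud import bigquery

    client = bigquery.Client()  # picks up the default project and credentials
    tables = client.list_tables(to_standard_ref(dataset_ref))
    return sorted(table.table_id for table in tables)


# Example call (requires network access and credentials):
# print(list_table_ids("isb-cgc:TCGA_bioclin_v0"))
```

Listing tables first is a low-cost way to see what a dataset contains before writing queries against fully qualified names of the form `project.dataset.table`.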
About the data
Use: This dataset is publicly available for anyone to use under the terms provided by the dataset source (https://cancergenome.nih.gov/) and is provided "AS IS" without any warranty, express or implied, from Google. Google disclaims all liability for any damages, direct or indirect, resulting from the use of the dataset.