Therapeutically Applicable Research to Generate Effective Treatments (TARGET) data

The Therapeutically Applicable Research to Generate Effective Treatments (TARGET) program applies a comprehensive genomic approach to determine the molecular changes driving childhood cancers.

The Institute for Systems Biology Cancer Gateway to the Cloud (ISB-CGC) provides TARGET data and metadata in BigQuery tables. These tables consolidate information scattered across tens of thousands of open-access TARGET XML and tabular files into a queryable format organized by data type (such as clinical, biospecimen, gene expression, and mutation) for ease of access and analysis.

Similarly, ISB-CGC has created BigQuery tables for other cancer programs; see the ISB-CGC Programs documentation for a full list.

ISB-CGC also provides notebook examples in both R and Python, ranging from simple to complex query building and analysis with ISB-CGC BigQuery tables.

Dataset access

Cloud Storage folders

ISB-CGC stores the Cloud Storage paths to TARGET data hosted by the National Cancer Institute's Genomic Data Commons in the BigQuery dataset isb-cgc-bq.GDC_case_file_metadata. To learn how to access these file locations, see the ISB-CGC TARGET documentation.
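As a starting point for finding those file locations, you can list the tables in the metadata dataset before querying any one of them. The following is a minimal Python sketch, assuming the google-cloud-bigquery client library is installed and application default credentials are configured. The helper names are illustrative and not part of any ISB-CGC tooling, and my-billing-project is a placeholder for your own project ID.

```python
"""List the tables in the ISB-CGC file metadata dataset (illustrative sketch)."""
from typing import List

# Dataset named in the text above, split into project and dataset IDs.
METADATA_PROJECT = "isb-cgc-bq"
METADATA_DATASET = "GDC_case_file_metadata"


def build_table_listing_sql(project: str, dataset: str) -> str:
    """Return a query that lists table names through INFORMATION_SCHEMA."""
    return (
        "SELECT table_name\n"
        f"FROM `{project}.{dataset}.INFORMATION_SCHEMA.TABLES`\n"
        "ORDER BY table_name"
    )


def list_metadata_tables(billing_project: str) -> List[str]:
    """Run the listing query; the query job runs in billing_project."""
    # Imported here so the SQL helper works without the client library.
    from google.cloud import bigquery

    client = bigquery.Client(project=billing_project)
    sql = build_table_listing_sql(METADATA_PROJECT, METADATA_DATASET)
    return [row["table_name"] for row in client.query(sql).result()]


if __name__ == "__main__":
    # Prints the query only. To run it against BigQuery, call
    # list_metadata_tables("my-billing-project") with your own project ID.
    print(build_table_listing_sql(METADATA_PROJECT, METADATA_DATASET))
```

Listing the tables first avoids guessing at table names; which table holds the Cloud Storage paths is described in the ISB-CGC documentation.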

BigQuery datasets

The TARGET datasets are available in BigQuery for data exploration and querying.

You can find the TARGET data in the isb-cgc-bq project in Google BigQuery. To explore other ISB-CGC cancer datasets, use the ISB-CGC BigQuery Search Tool. For more information about ISB-CGC and its data, see the ISB-CGC documentation.
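One way to see which datasets in the isb-cgc-bq project relate to TARGET is to list the project's datasets and filter them by name. The following is a minimal Python sketch, assuming the google-cloud-bigquery client library and configured credentials. It also assumes that TARGET dataset IDs contain the string "TARGET", which is a guess for illustration and not something stated by ISB-CGC. The helper names and my-billing-project are placeholders.

```python
"""Find datasets in the isb-cgc-bq project by name (illustrative sketch)."""
from typing import Iterable, List

ISB_CGC_PROJECT = "isb-cgc-bq"


def filter_dataset_ids(dataset_ids: Iterable[str], keyword: str) -> List[str]:
    """Return the sorted dataset IDs that contain keyword, ignoring case."""
    needle = keyword.lower()
    return sorted(d for d in dataset_ids if needle in d.lower())


def list_target_datasets(billing_project: str) -> List[str]:
    """List datasets in the ISB-CGC project whose ID mentions TARGET."""
    # Imported here so the filter helper works without the client library.
    from google.cloud import bigquery

    client = bigquery.Client(project=billing_project)
    ids = (
        item.dataset_id
        for item in client.list_datasets(project=ISB_CGC_PROJECT)
    )
    # Assumption: TARGET dataset IDs contain the string "TARGET".
    return filter_dataset_ids(ids, "TARGET")


if __name__ == "__main__":
    # Offline demonstration with made-up dataset IDs, not real ISB-CGC names.
    sample = ["example_TARGET_data", "other_program", "target_archive"]
    print(filter_dataset_ids(sample, "TARGET"))
```

To run the live listing, call list_target_datasets("my-billing-project") with your own project ID. The ISB-CGC BigQuery Search Tool remains the documented route for browsing tables.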

About the data

Use: This dataset is publicly available for anyone to use under the terms provided by the dataset source (https://cancergenome.nih.gov/) and is provided "AS IS" without any warranty, express or implied, from Google. Google disclaims all liability for any damages, direct or indirect, resulting from the use of the dataset.