National Institutes of Health The Cancer Genome Atlas (TCGA)
The Cancer Genome Atlas (TCGA) is a comprehensive and coordinated effort to accelerate our understanding of the molecular basis of cancer through the application of genome analysis technologies, including large-scale genome sequencing. TCGA is a joint effort of the National Cancer Institute (NCI) and the National Human Genome Research Institute (NHGRI), which are both part of the National Institutes of Health, U.S. Department of Health and Human Services.
TCGA projects are organized by cancer type or subtype. Click here for a current list of cancer types selected for study in TCGA.
Data from TCGA (e.g., gene expression, copy number variation and clinical information), are available via the Genomic Data Commons (GDC).
Data from TCGA projects are organized into two tiers: Open Access and Controlled Access.
- Open Access data tier contains data that cannot be attributed to an individual research participant. The Open Access data tier does not require user certification. Data in Open Access tier are available in the TCGA Data Portal.
- Controlled Access data tier contains individual-level genotype data that are unique to an individual. Access to data in the Controlled Access data tier requires user certification through dbGaP Authorized Access.
- Controlled Access data types consist of the following:
- Individual germline variant data (SNP .cel files)
- Primary sequence data (.bam files), which are available at GDC
- Clinical free text fields
- Exon Array files (for Glioblastoma and Ovarian projects only)
NOTE: TCGA strives to release most data in the open access tier. Individual genotype or sequence files are prominent exceptions. Commonly requested files such as descriptions of somatic mutations or clinical data are open access.
Please go to this page: https://tcga-data.nci.nih.gov/docs/publications/ to access all data associated with TCGA tumor specific publications.
The TCGA study is utilized in the following dbGaP substudies. To view genotypes and other molecular data collected in these substudies, please click on the following substudies below or in the "Substudies" section of this top-level study page phs000178 TCGA study.
- phs000854 Genome-wide Analysis of Noncoding Regulatory Mutations in Cancer
- Type: Tumor vs. Matched-Normal
- Archiver: The database of Genotypes and Phenotypes (dbGaP)