DNA Methylation and Microbiome Profiles from Colorectal Cancer Patients and Healthy Controls

This dataset contains DNA methylation data and gut microbiome profiles derived from tumor biopsies, matched blood samples, and stool samples collected from 200 Swedish participants (150 colorectal cancer patients and 50 healthy controls) between 2020 and 2023.

For DNA methylation profiling, genomic DNA was extracted from tumor tissues and blood samples using the Qiagen AllPrep DNA/RNA Kit and analyzed with the Illumina Infinium MethylationEPIC BeadChip, which captures more than 850,000 methylation sites across the genome. In parallel, stool samples were collected prior to treatment, and microbial DNA was extracted using the Qiagen QIAamp Fast DNA Stool Kit. 16S rRNA gene sequencing was performed on the Illumina MiSeq platform to characterize the gut microbiome composition in patients and controls.

The resulting dataset includes:
  • Methylation array data in IDAT format (~120 GB total)
  • Normalized methylation matrices in CSV format (~15 GB)
  • 16S rRNA microbiome data in FASTQ format (~250 GB total)
  • Microbiome abundance tables in TSV format (~2 GB)

All data are stored in FEGA Sweden under controlled access, and access requests will be reviewed by the Lund University RDO as the data controller. Ethical approval was obtained from the Swedish Ethical Review Authority (Dnr: 2023-04567).
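For planning storage before requesting access, the approximate component sizes quoted in the description can be totalled. The following is a minimal sketch: the dictionary keys are illustrative labels, not file names from the dataset, and the figures are the rounded values stated above.

```python
# Approximate sizes in GB, taken from the dataset description.
# The keys are illustrative labels, not actual file or directory names.
components_gb = {
    "methylation_idat": 120,
    "methylation_normalized_csv": 15,
    "microbiome_16s_fastq": 250,
    "microbiome_abundance_tsv": 2,
}

# Total storage needed for the full dataset.
total_gb = sum(components_gb.values())

# Raw data (IDAT and FASTQ) versus processed tables (CSV and TSV).
raw_gb = components_gb["methylation_idat"] + components_gb["microbiome_16s_fastq"]
processed_gb = total_gb - raw_gb

print(f"Total: ~{total_gb} GB (raw ~{raw_gb} GB, processed ~{processed_gb} GB)")
```

By these figures the full dataset is roughly 387 GB, of which the processed matrices and abundance tables account for about 17 GB.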

DUO:0000025
version: 2021-02-23

time limit on use

This data use modifier indicates that use is approved for a specific number of months.
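As an illustration of how a time limit expressed in months might be tracked, here is a minimal sketch. The function names, the approval date, and the rule that the window closes on the same calendar day N months later are all assumptions made for this example; the DUO term itself does not specify how the months are counted, and the actual terms are set by the Data Access Agreement.

```python
import calendar
from datetime import date


def add_months(start: date, months: int) -> date:
    """Return the date `months` calendar months after `start`.

    The day is clamped to the last day of the target month
    (for example, 31 January plus one month gives the end of February).
    """
    month_index = start.month - 1 + months
    year = start.year + month_index // 12
    month = month_index % 12 + 1
    last_day = calendar.monthrange(year, month)[1]
    return date(year, month, min(start.day, last_day))


def use_is_permitted(approval_date: date, approved_months: int, today: date) -> bool:
    """Assumed rule: use is allowed strictly before the expiry date."""
    return today < add_months(approval_date, approved_months)


# Hypothetical example: access approved on 1 January 2024 for 12 months.
expiry = add_months(date(2024, 1, 1), 12)
print(expiry.isoformat())
print(use_is_permitted(date(2024, 1, 1), 12, date(2024, 12, 31)))
```

Only the standard library is used here, so the month arithmetic is done by hand rather than with a third-party date package.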

BC Cancer, part of the Provincial Health Services Authority - Data Access Policy

Access to this data is controlled. There are a number of steps that a researcher must take to obtain access to this data, including execution of a Data Access Agreement between the institutions. The process is overseen by the Technology Development Office; please contact our general email address TDOadmin@phsa.ca. Please only click the "request data" button on the EGA website after a Data Access Agreement is fully executed.

Studies are experimental investigations of a particular phenomenon, e.g., case-control studies on a particular trait or cancer research projects reporting matching cancer normal genomes from patients.

Study ID Study Title Study Type
EGAS50000000158 Whole Genome Sequencing
  • ilum press2
  • I have updated the experiment description; this does not affect the datasets in any way
  • Checked the description and I am satisfied
  • Dataset Released

This table displays only public information pertaining to the files in the dataset. If you wish to access this dataset, please submit a request. If you already have access to these data files, please consult the download documentation.

ID               File Type  Size       Quality Report  Located in
EGAF50000105293  fastq      306 Bytes                  Federated EGA
EGAF50000105294  fastq      306 Bytes                  Federated EGA

2 Files (612 Bytes)