Description of runs dataset of submission 174
Despite the vast diversity of its populations, genetic studies in Africa have been limited. African populations, Malians in particular, have a high rate of intra-ethnic and consanguineous marriage, resulting in increased prevalence of autosomal recessive diseases. Family-based genetic studies can be limited in developed countries due to small sibships. The average fertility rate in Mali is over 6 births per woman, offering a unique opportunity to find new disease genes or mutations that can then be studied in other populations. Hereditary neurological diseases are very debilitating diseases, and developing countries, particularly in Africa, pay a high price in terms of disability-adjusted life years. Although most are currently untreatable, increasing awareness about hereditary neurological disorders can reduce this burden. This study will help to identify and characterize novel hereditary neurological disease genes in Mali. These genes are likely to be important in the normal function of the nervous system and to have important pathophysiological implications for African and other populations. This study will also train physicians and students in the characterization of neurodegenerative diseases as well as in genetic technology and molecular biology, and build a suitable research environment that will retain them.
There is a need for quantitative measurements of evolutionary metrics in controlled clinical trials with long term follow-up information. This is particularly true in advanced localised prostate cancer, which can recur more than a decade after diagnosis. Here we mapped genomic intra-tumour heterogeneity in 642 tumour samples from 114 patients who took part in the IMRT and DELINEATE clinical trials, for which full clinical information and 12y median follow-up was available. We concomitantly assessed phenotypic (morphological) heterogeneity using Deep Learning in 1,923 histological sections from 250 IMRT patients (fully overlapping with the genetic set). This study shows that combining genomics with AI-aided histopathology in clinical trials leads to novel clinical biomarkers. This EGA repository contains data produced from tumour samples using low coverage whole genome sequencing and a prostate cancer specific gene panel data following compression of unique molecular identifiers.
Description of runs dataset of submission 163
The Age-Related Eye Disease Study (AREDS) was initially designed as a long-term multi-center, prospective study of the clinical course of age-related macular degeneration (AMD) and age-related cataract. In addition to collecting natural history data, AREDS included a clinical trial of high-dose vitamin and mineral supplements for AMD and a clinical trial of high-dose vitamin supplements for cataract. AREDS participants were 55 to 80 years of age at enrollment and had to be free of any illness or condition that would make long-term follow-up or compliance with study medications unlikely or difficult. On the basis of fundus photographs graded by a central reading center, best-corrected visual acuity and ophthalmologic evaluations, 4,757 participants were enrolled in one of several AMD categories, including persons with no AMD. The clinical trials for AMD and cataract were conducted concurrently. AREDS participants were followed on the clinical trials for a median time of 6.5 years. Subsequent to the conclusion of the clinical trials, participants were followed for an additional 5 years and natural history data were collected. The AREDS research design is detailed in AREDS Report 1. AREDS Report 8 contains the mainline results from the AMD trial; AREDS Report 9 contains the results of the cataract trial. Blood samples were also collected from 3,700+ AREDS participants for genetic research. Genetic samples from 600 AREDS participants (200 controls, 200 Neovascular AMD cases, and 200 Geographic Atrophy cases) were selected using data available in March 2005 and then were evaluated with a genome-wide scan. These data, as well as selected phenotypic data, were made available in the dbGaP. DNA from AREDS participants, which is currently being stored in the AREDS Genetic Repository, is available for research purposes. However, not all of the 3,700+ AREDS participants who submitted a blood sample currently have DNA available. In addition to including the data from the genome-wide scan on the 600 original samples, this second version of the AREDS dbGaP database provides a comprehensive set of data tables with extensive clinical information collected for the 4,757 participants who participated in AREDS. The tables include information collected at enrollment/baseline, during study follow-up, fundus and lens pathology, nutritional estimates, quality of life measures and measures of morbidity and mortality. In November 2010, over 72,000 high quality fundus and lens photographs of 595 AREDS participants (of the original 600 selected for the genome-wide scan) were made available in the AREDS dbGaP. In addition to the genome-wide scan data, the fundus and lens grading data for these participants are also available through the AREDS dbGaP. Details about the ocular photographs that are available may be found in the document "Age-Related Eye Disease Study (AREDS) Ocular Photographs". In January 2012, a measure of daily sunlight exposure was added in a separate "sunlight" table. Furthermore, the "followup" table has been revised. The visual acuity for the right eye was inadvertently missing at odd-numbered visits (01, 03, 05, etc.). This data is now part of the table. In February 2014 over 134,500 high-quality fundus photographs (macular field F2) of 4613 AREDS participants were added to the existing AREDS dbGaP resource. The AREDS dbGaP image archive already contains over 72,000 high quality fundus and lens photographs of 595 AREDS participants for whom dbGaP-accessible genotype data exist. Information about the available ocular photographs found in the document "Age-Related Eye Disease Study (AREDS) Ocular Photographs" has been updated with an addendum. It is hoped that this resource will better help researchers understand two important diseases that affect an aging population. These data may be applied to examination and inference on genetic and genetic-environmental bases for age-related diseases of public health significance and may also help elucidate the clinical course of both conditions, generate hypotheses, and aid in the design of clinical trials of preventive interventions. Definitions of Final AMD Phenotype Categories Please see phd001138.1 for a detailed description of how AREDS participants' final AMD phenotype was categorized. User's Guide for AREDS Phenotype Data A detailed User's Guide for the AREDS phenotype data is available. This User's Guide is meant to be a comprehensive document which explains the complexities of the AREDS data. It is recommended that all researchers using AREDS phenotype data make use of this User's Guide.
Startup of Framingham Heart Study. Cardiovascular disease (CVD) is the leading cause of death and serious illness in the United States. In 1948, the Framingham Heart Study (FHS) -- under the direction of the National Heart Institute (now known as the National Heart, Lung, and Blood Institute, NHLBI) -- embarked on a novel and ambitious project in health research. At the time, little was known about the general causes of heart disease and stroke, but the death rates for CVD had been increasing steadily since the beginning of the century and had become an American epidemic. The objective of the FHS was to identify the common factors or characteristics that contribute to CVD by following its development over a long period of time in a large group of participants who had not yet developed overt symptoms of CVD or suffered a heart attack or stroke. Design of Framingham Heart Study. In 1948, the researchers recruited 5,209 men and women between the ages of 30 and 62 from the town of Framingham, Massachusetts, and began the first round of extensive physical examinations and lifestyle interviews that they would later analyze for common patterns related to CVD development. Since 1948, the subjects have returned to the study every two years for an examination consisting of a detailed medical history, physical examination, and laboratory tests, and in 1971, the study enrolled a second-generation cohort -- 5,124 of the original participants' adult children and their spouses -- to participate in similar examinations. The second examination of the Offspring cohort occurred eight years after the first examination, and subsequent examinations have occurred approximately every four years thereafter. In April 2002 the Study entered a new phase: the enrollment of a third generation of participants, the grandchildren of the original cohort. The first examination of the Third Generation Study was completed in July 2005 and involved 4,095 participants. Thus, the FHS has evolved into a prospective, community-based, three generation family study. The FHS is a joint project of the National Heart, Lung and Blood Institute and Boston University. Research Areas in the Framingham Heart Study. Over the years, careful monitoring of the FHS population has led to the identification of the major CVD risk factors -- high blood pressure, high blood cholesterol, smoking, obesity, diabetes, and physical inactivity -- as well as a great deal of valuable information on the effects of related factors such as blood triglyceride and HDL cholesterol levels, age, gender, and psychosocial issues. Risk factors have been identified for the major components of CVD, including coronary heart disease, stroke, intermittent claudication, and heart failure. It is also clear from research in the FHS and other studies that substantial subclinical vascular disease occurs in the blood vessels, heart and brain that precedes clinical CVD. With recent advances in technology, the FHS has enhanced its research capabilities and capitalized on its inherent resources by the conduct of high resolution imaging to detect and quantify subclinical vascular disease in the major blood vessels, heart and brain. These studies have included ultrasound studies of the heart (echocardiography) and carotid arteries, computed tomography studies of the heart and aorta, and magnetic resonance imaging studies of the brain, heart, and aorta. Although the Framingham cohort is primarily white, the importance of the major CVD risk factors identified in this group have been shown in other studies to apply almost universally among racial and ethnic groups, even though the patterns of distribution may vary from group to group. In the past half century, the Study has produced approximately 1,200 articles in leading medical journals. The concept of CVD risk factors has become an integral part of the modern medical curriculum and has led to the development of effective treatment and preventive strategies in clinical practice. In addition to research studies focused on risk factors, subclinical CVD and clinically apparent CVD, Framingham investigators have also collaborated with leading researchers from around the country and throughout the world on projects involving some of the major chronic illnesses in men and women, including dementia, osteoporosis and arthritis, nutritional deficiencies, eye diseases, hearing disorders, and chronic obstructive lung diseases. Genetic Research in the Framingham Heart Study. While pursuing the Study's established research goals, the NHLBI and the Framingham investigators has expanded its research mission into the study of genetic factors underlying CVD and other disorders. Over the past two decades, DNA has been collected from blood samples and from immortalized cell lines obtained from Original Cohort participants, members of the Offspring Cohort and the Third Generation Cohort. Several large-scale genotyping projects have been conducted in the past decade. Genome-wide linkage analysis has been conducted using genotypes of approximately 400 microsatellite markers that have been completed in over 9,300 subjects in all three generations. Analyses using microsatellite markers completed in the original cohort and offspring cohorts have resulted in over 100 publications, including many publications from the Genetics Analysis Workshop 13. Several other recent collaborative projects have completed thousands of SNP genotypes for candidate gene regions in subsets of FHS subjects with available DNA. These projects include the Cardiogenomics Program of the NHLBI's Programs for Genomics Applications, the genotyping of ~3000 SNPs in inflammation genes, and the completion of a genome-wide scan of 100,000 SNPs using the Affymetrix 100K Genechip. Framingham Cohort Phenotype Data. The phenotype database contains a vast array of phenotype information available in all three generations. These will include the quantitative measures of the major risk factors such as systolic blood pressure, total and HDL cholesterol, fasting glucose, and cigarette use, as well as anthropomorphic measures such as body mass index, biomarkers such as fibrinogen and CRP, and electrocardiography measures such as the QT interval. Many of these measures have been collected repeatedly in the original and offspring cohorts. Also included in the SHARe database will be an array of recently collected biomarkers, subclinical disease imaging measures, clinical CVD outcomes as well as an array of ancillary studies. The phenotype data is located here in the top-level study phs000007 Framingham Cohort. To view the phenotype variables collected from the Framingham Cohort, please click on the "Variables" tab above. The Framingham Cohort is utilized in the following dbGaP substudies. To view genotypes, analysis, expression data, other molecular data, and derived variables collected in these substudies, please click on the following substudies below or in the "Substudies" section of this top-level study page phs000007 Framingham Cohort. phs000342 Framingham SHARe phs000282 Framingham CARe phs000363 Framingham SABRe phs000307 Framingham Medical Resequencing phs000401 Framingham ESP Heart-GO phs000651 Framingham CHARGE-S phs000724 Framingham DNA Methylation phs001610 Framingham T2D-GENES phs002558 Framingham ADSP phs002559 Framingham BRIDGET phs002560 Framingham Cholesterol phs002611 Framingham Post-Mortem Brain Tissue The unflagging commitment of the research participants in the NHLBI FHS has made more than a half century of research success possible. For decades, the FHS has made its data and DNA widely available to qualified investigators throughout the world through the Limited Access Datasets and the FHS DNA Committee, and the SHARe database will continue that tradition by allowing access to qualified investigators who agree to the requirements of data access. With the SHARe database, we continue with an ambitious research agenda and look forward to new discoveries in the decades to come.