Genome Asia 100K Project

The underrepresentation of non-European individuals in human genetic studies so far has limited the diversity of individuals in genomic datasets and led to reduced medical relevance for a large proportion of the world’s population. Population-specific reference genome datasets as well as genome-wide association studies in diverse populations are needed to address this issue. Here we describe the pilot phase of the GenomeAsia 100K Project. This includes a whole-genome sequencing reference dataset from 1,739 individuals of 219 population groups and 64 countries across Asia. We catalogue genetic variation, population structure, disease associations and founder effects. We also explore the use of this dataset in imputation, to facilitate genetic studies in populations across Asia and worldwide.

Click on a Dataset ID in the table below to learn more and to find out whom to contact about access to these data.

Dataset ID         Description    Technology    Samples
EGAD00001005975    -              -             1163
Publications

The GenomeAsia 100K Project enables genetic discoveries across Asia. Nature 576: 106-111 (2019). Citations: 166

Understanding signatures of positive natural selection in human zinc transporter genes. Sci Rep 12: 4320 (2022). Citations: 1

Population-specific positive selection on low CR1 expression in malaria-endemic regions. PLoS One 18: e0280282 (2023). Citations: 0

Prehistoric human migration between Sundaland and South Asia was driven by sea-level rise. Commun Biol 6: 150 (2023). Citations: 4