Back

The Multiethnic Cohort: A Resource for the study of Genetic and non-Genetic Cancer Risk Across Populations

Bogumil, D.; Sheng, X.; Wan, P.; Xia, L.; Pooler, L.; Cheng, I.; Streicher, S.; Huang, B. Z.; Chen, F.; Stram, D.; Shen, S.; King, G.; Chiang, C. W. K.; Ongaco, C.; Adams, M.; McMullen, I.; Zhang, P.; Ling, H.; Mawhinney, M.; Doheny, K. F.; Le Marchand, L.; Wilkens, L. R.; Haiman, C. A.; Conti, D. V.

2025-06-11 epidemiology
10.1101/2025.06.09.25328993 medRxiv
Show abstract

IntroductionThe Multiethnic Cohort Study (MEC) is a U.S. prospective cohort of over 215,000 participants, designed to investigate variation in risk factors and disease across diverse racial and ethnic groups. Over 74,000 participants contributed biospecimens for genetic studies. We describe this sub-cohort and demonstrate the types of analyses it enables. MethodsThe MEC recruited adults aged 45-75 in California and Hawaii between 1993 and 1996. Cancer diagnoses were identified via state tumor registries. The MEC Genetics Database includes 73,139 participants with germline genotype data. We evaluated genetic similarity, its relationship with self-reported race/ethnicity, and baseline characteristics, including neighborhood socioeconomic status. Using breast, colorectal, and prostate cancer as examples, the database supports multi-ancestry genome-wide association studies (GWAS), evaluation of non-genetic factors, and time-to-event analyses. ResultsParticipants included 10,962 African Americans, 24,234 Japanese Americans, 17,242 Latinos, 5,488 Native Hawaiians, 14,649 Whites, and 564 other. Principal component analysis revealed substantial diversity in ancestry. Multiethnic GWAS demonstrated effective control of population stratification while replicating many previously discovered variants. Polygenic risk score (PRS) effects varied by racial and ethnic group. Time-to-event analysis showed associations between cancer incidence and neighborhood socioeconomic status, population descriptors, and genetic similarity. DiscussionThe MEC Genetics Database enables comprehensive assessment of genetic and non-genetic cancer risk, revealing differences in absolute risk by race and ethnicity. Studying both types of risk factors in diverse and admixed populations is critical for improving risk characterization and reducing disparities. This resource supports future research in polygenic traits, gene-environment interactions, and integrated risk prediction.

Matching journals

The top 2 journals account for 50% of the predicted probability mass.

1
Cancer Epidemiology, Biomarkers & Prevention
17 papers in training set
Top 0.1%
34.0%
2
International Journal of Epidemiology
74 papers in training set
Top 0.1%
23.2%
50% of probability mass above
3
American Journal of Epidemiology
57 papers in training set
Top 0.2%
5.0%
4
PLOS ONE
4510 papers in training set
Top 30%
5.0%
5
Nature Communications
4913 papers in training set
Top 38%
3.7%
6
eLife
5422 papers in training set
Top 29%
3.2%
7
Genome Medicine
154 papers in training set
Top 4%
1.9%
8
International Journal of Cancer
42 papers in training set
Top 0.6%
1.7%
9
JNCI: Journal of the National Cancer Institute
16 papers in training set
Top 0.3%
1.7%
10
JNCI Cancer Spectrum
10 papers in training set
Top 0.3%
1.4%
11
Scientific Reports
3102 papers in training set
Top 63%
1.4%
12
JAMA Network Open
127 papers in training set
Top 4%
0.8%
13
Epidemiology
26 papers in training set
Top 0.5%
0.8%
14
Bioinformatics
1061 papers in training set
Top 10%
0.7%
15
Genetic Epidemiology
46 papers in training set
Top 0.9%
0.7%
16
Circulation: Genomic and Precision Medicine
42 papers in training set
Top 1%
0.7%
17
Human Mutation
29 papers in training set
Top 0.9%
0.5%
18
Nature
575 papers in training set
Top 18%
0.5%
19
Genetics in Medicine
69 papers in training set
Top 1%
0.5%
20
Journal of the American Medical Informatics Association
61 papers in training set
Top 2%
0.5%
21
BMC Medical Genomics
36 papers in training set
Top 2%
0.5%
22
Breast Cancer Research
32 papers in training set
Top 0.6%
0.5%
23
European Journal of Human Genetics
49 papers in training set
Top 2%
0.5%