Back

Mapping lineage and functional diversity in the high-risk human mammary epithelium

Waas, M.; Zhang, B.; Govindarajan, M.; Tharmapalan, P.; Kuttanamkuzhi, A.; Drummond Guy, O.; Woolman, M.; Berman, H. K.; Waterhouse, P. D.; Khokha, R.; Kislinger, T.

2026-01-13 cancer biology
10.64898/2026.01.12.699075 bioRxiv
Show abstract

BackgroundBreast cancer risk is shaped by the vast heterogeneity of mammary epithelial cells, comprising basal, luminal progenitor, and mature luminal populations. While transcriptional variation among these lineages has been extensively studied, protein-level features - particularly in high-risk women - remain underexplored, limiting insight into early cellular and molecular determinants of susceptibility. Moreover, little is known about how clinical covariates influence clonogenic capacity, proteomic states, and epithelial proportions, complicating the design of properly controlled human studies of early breast cancer risk. ResultsWe combined low-input proteomics with functional clonogenic assays to profile mammary epithelial cell subpopulations from a cohort of 21 breast tissues encompassing different germline mutation backgrounds, parity status and age. We quantified over 5,555 proteins and observed marked inter-donor variation in epithelial composition, proteomic programs, and colony-forming capacity. Multivariable modeling revealed that clinical covariates - including age, parity, and germline mutation status - modulate both global proteomic architecture and lineage-specific pathway activity. Parity was associated with reduced basal cell abundance, altered luminal progenitor and mature luminal proteomes, and changes in clonogenicity. Pathway analyses identified both conserved and lineage-restricted responses to shared risk factors. Projection of clonogenic signatures onto METABRIC and TCGA tumors further linked functional programs to tumor subtypes and clinical outcomes. ConclusionsThis study provides the most comprehensive proteomic atlas of cell-type resolved diversity in the high-risk breast to date. By defining how clinical covariates remodel epithelial composition and molecular state, it clarifies key sources of biological variability that challenge controlled study design and offers a resource for improving mechanistic insight, risk assessment, and prevention strategies.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Breast Cancer Research
32 papers in training set
Top 0.1%
37.9%
2
Nature Communications
4913 papers in training set
Top 26%
6.8%
3
Genome Medicine
154 papers in training set
Top 2%
4.3%
4
Cell Reports Medicine
140 papers in training set
Top 1%
3.6%
50% of probability mass above
5
Cell Reports
1338 papers in training set
Top 14%
3.6%
6
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 24%
2.7%
7
Cancer Research
116 papers in training set
Top 1%
2.4%
8
JCI Insight
241 papers in training set
Top 3%
1.9%
9
npj Breast Cancer
18 papers in training set
Top 0.1%
1.9%
10
Science Advances
1098 papers in training set
Top 16%
1.8%
11
Scientific Reports
3102 papers in training set
Top 58%
1.7%
12
PLOS ONE
4510 papers in training set
Top 54%
1.7%
13
The Journal of Pathology
22 papers in training set
Top 0.2%
1.5%
14
PLOS Computational Biology
1633 papers in training set
Top 19%
1.3%
15
Developmental Cell
168 papers in training set
Top 10%
1.2%
16
Cell Genomics
162 papers in training set
Top 4%
1.2%
17
Clinical Cancer Research
58 papers in training set
Top 1%
1.1%
18
Cancers
200 papers in training set
Top 4%
1.0%
19
Laboratory Investigation
13 papers in training set
Top 0.2%
0.9%
20
Molecular Cancer Research
42 papers in training set
Top 0.6%
0.9%
21
eBioMedicine
130 papers in training set
Top 3%
0.9%
22
Clinical Proteomics
10 papers in training set
Top 0.2%
0.8%
23
eLife
5422 papers in training set
Top 58%
0.7%
24
JNCI Cancer Spectrum
10 papers in training set
Top 0.5%
0.7%
25
Cell Systems
167 papers in training set
Top 13%
0.7%
26
Cancer Research Communications
46 papers in training set
Top 1%
0.7%
27
Human Molecular Genetics
130 papers in training set
Top 4%
0.6%
28
Nature Metabolism
56 papers in training set
Top 3%
0.6%
29
Nature Materials
21 papers in training set
Top 1%
0.5%
30
Nucleic Acids Research
1128 papers in training set
Top 21%
0.5%