Epigenome-informed prioritization of bivalent chromatin SNPs enhances genomic prediction robustness: a proof-of-concept study in Pacific white shrimp (Litopenaeus vannamei)

Shi, J.; Lu, Z.; Sui, M.; Mu, M.; Zhang, D.; Bao, Z.; Hu, J.; Zeng, Q.; Ye, Z.

2026-02-17 genetics
10.64898/2026.02.14.705940 bioRxiv

Background: Genomic selection (GS) has revolutionized animal breeding, spanning livestock sectors such as pigs and cattle to aquatic species like fish and shrimp. However, its broader application across these industries is often constrained by high genotyping costs and reduced predictive reliability across divergent populations or generations. Developing cost-effective, biologically informed genotyping strategies to overcome these limitations remains a critical goal in animal agriculture. Epigenetic annotations, particularly histone modifications, provide direct functional insights into regulatory elements underlying complex trait variation and represent a promising but underexplored resource for marker prioritization.

Results: Here, using the Pacific white shrimp (Litopenaeus vannamei) as a model organism, we conducted a proof-of-concept study integrating resequencing and phenotypic data from 972 individuals. We generated high-resolution epigenomic maps by profiling four histone marks (H3K4me1, H3K4me3, H3K27me3, and H3K27ac) across multiple embryonic stages and adult muscle tissue using CUT&Tag. These functional annotations were then leveraged to prioritize single nucleotide polymorphism (SNP) subsets for genomic prediction. Among the tested strategies, SNPs located in the muscle-specific bivalent promoter/enhancer (E6) state, which is characterized by the co-occurrence of active and repressive marks, consistently maximized prediction accuracy under the BayesA model. Notably, even at a moderate density (15k), E6-derived SNPs achieved prediction accuracies exceeding those obtained using substantially larger genome-wide SNP sets. Most importantly, in a challenging cross-population validation using an independent strain, the E6-derived SNP subset significantly improved prediction accuracy by 47.6% (increasing from 0.21 ± 0.05 to 0.31 ± 0.04, p < 0.05) compared to random subsets at equivalent density.

Conclusions: These results demonstrate that epigenetic annotation-guided SNP prioritization provides a biologically informed and cost-effective strategy to enhance genomic prediction accuracy and stability. This framework is broadly transferable across species and offers a practical strategy for designing low-density genotyping panels that reduce costs while maintaining reliable selection outcomes in large-scale breeding programs.
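
The core idea in the abstract, keeping only SNPs that fall inside regions annotated with a given chromatin state, can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the interval coordinates, SNP positions, chromosome names, and the helper `in_state` are all hypothetical, and intervals are assumed to be sorted, non-overlapping, and half-open.

```python
from bisect import bisect_right

# Hypothetical E6 (bivalent state) intervals per chromosome.
# Assumed sorted, non-overlapping, half-open: [start, end).
e6_intervals = {
    "chr1": [(100, 200), (500, 650)],
    "chr2": [(50, 120)],
}

# Hypothetical SNPs as (chromosome, position) pairs.
snps = [("chr1", 150), ("chr1", 300), ("chr1", 649), ("chr2", 120), ("chr3", 10)]


def in_state(chrom, pos, intervals):
    """Return True if pos lies inside one of the chromosome's intervals."""
    ivs = intervals.get(chrom, [])
    starts = [start for start, _ in ivs]
    # Index of the last interval whose start is <= pos.
    i = bisect_right(starts, pos) - 1
    return i >= 0 and pos < ivs[i][1]


# Keep only SNPs that overlap the annotated state.
prioritized = [snp for snp in snps if in_state(*snp, e6_intervals)]
print(prioritized)
```

In practice the retained subset would then be thinned to a target density (the abstract mentions 15k) and compared against random subsets of the same size. That step is omitted here.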

Matching journals

The top 4 journals account for 50% of the predicted probability mass.
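
One way to read this statement is as a cumulative threshold: 4 is the smallest number of top-ranked journals whose summed probabilities reach 50%. The short Python sketch below checks that reading against the four highest percentages in the listing. The function name `journals_to_reach` is illustrative and not part of the page.

```python
# Predicted probabilities (in percent) for the top-ranked journals,
# taken from the listing on this page, in rank order.
top_probabilities = [18.4, 18.4, 12.2, 6.7, 4.8, 3.5]


def journals_to_reach(probabilities, threshold):
    """Return how many top entries are needed for the running sum to reach threshold."""
    running = 0.0
    for count, p in enumerate(probabilities, start=1):
        running += p
        if running >= threshold:
            return count
    return None  # threshold never reached with the given entries


count = journals_to_reach(top_probabilities, 50.0)
print(count)
```

With these values the first three journals sum to about 49.0%, just short of the threshold, and the fourth brings the total to about 55.7%.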

1. Molecular Ecology Resources; 161 papers in training set; Top 0.1%; 18.4%
2. Genetics Selection Evolution; 33 papers in training set; Top 0.1%; 18.4%
3. BMC Genomics; 328 papers in training set; Top 0.1%; 12.2%
4. Frontiers in Genetics; 197 papers in training set; Top 0.7%; 6.7%
50% of probability mass above
5. Scientific Reports; 3102 papers in training set; Top 25%; 4.8%
6. PLOS ONE; 4510 papers in training set; Top 40%; 3.5%
7. Bioinformatics Advances; 184 papers in training set; Top 2%; 2.0%
8. G3 Genes|Genomes|Genetics; 351 papers in training set; Top 1%; 2.0%
9. The Plant Genome; 53 papers in training set; Top 0.4%; 1.7%
10. BMC Bioinformatics; 383 papers in training set; Top 5%; 1.6%
11. PLOS Genetics; 756 papers in training set; Top 9%; 1.6%
12. Aquaculture; 29 papers in training set; Top 0.4%; 1.5%
13. Nature Communications; 4913 papers in training set; Top 55%; 1.3%
14. Genome Biology; 555 papers in training set; Top 5%; 1.3%
15. G3: Genes, Genomes, Genetics; 222 papers in training set; Top 0.5%; 1.3%
16. Bioinformatics; 1061 papers in training set; Top 8%; 0.9%
17. BMC Biology; 248 papers in training set; Top 3%; 0.9%
18. Molecular Ecology; 304 papers in training set; Top 4%; 0.8%
19. NAR Genomics and Bioinformatics; 214 papers in training set; Top 4%; 0.7%
20. Communications Biology; 886 papers in training set; Top 25%; 0.7%
21. Human Molecular Genetics; 130 papers in training set; Top 4%; 0.7%
22. Human Genetics and Genomics Advances; 70 papers in training set; Top 0.9%; 0.7%
23. Genomics; 60 papers in training set; Top 3%; 0.7%
24. Journal of Heredity; 35 papers in training set; Top 0.2%; 0.7%
25. Genes; 126 papers in training set; Top 3%; 0.7%
26. Genetics; 225 papers in training set; Top 5%; 0.6%
27. iScience; 1063 papers in training set; Top 38%; 0.6%
28. Genome Medicine; 154 papers in training set; Top 9%; 0.6%
29. Cell Genomics; 162 papers in training set; Top 8%; 0.6%