Genomic sampling and population structure of farmer-maintained varieties reveal previously uncharacterized diversity of Theobroma cacao L. in Costa Rica
Herrighty, E. M.; Specht, C. D.; Gore, M. A.; Solano, L.; Estrada-Gamboa, J.; Hernandez, C. E.; Tufan, H. A.; Landis, J. B.
Show abstract
Understanding crop genetic diversity is essential for conservation and breeding, yet farmer-maintained germplasm remains largely underrepresented in genomic studies. Theobroma cacao L. has a complex domestication history and extensive global diversity, and cacao currently cultivated in Central America, particularly in Costa Rica, has been understudied compared to South American and Mexican cultivars despite cultural and historical importance. In this study, we investigate the genetic diversity of cacao from farmer-managed systems across Costa Rica to search for Criollo germplasm and identify and characterize any unique local genetic groups. Ninety-four trees were sampled from 17 farms across four regions of the country and sequenced using whole genome resequencing. Farmer materials were analyzed alongside 166 previously characterized reference accessions representing major cacao genetic groups. Population structure analyses, phylogenetic reconstruction, and network approaches revealed that Costa Rican cacao encompasses multiple known genetic groups, including Criollo-derived lineages, while also harboring locally distinct diversity not fully represented in current global reference collections. Analyses revealed close kinship between many accessions with no clear geographic patterns corresponding to the observed population differentiation, reflecting the effects of farmers in creating dominant patterns of gene flow through seed-saving, clonal propagation, and sharing genotypes among farms. Heterozygosity levels varied substantially among individuals, consistent with a mixture of highly inbred Criollo trees and more heterozygous, admixed genotypes. We find that farmer-managed cacao systems are reservoirs of genetic diversity, including possibly rare or historically important lineages, underscoring the value of these farming systems for effective conservation and management of genomic resources for cacao resilience and improvement.
Matching journals
The top 5 journals account for 50% of the predicted probability mass.