Back

Tracking malaria parasite lineages through de novo mutations in highly related Plasmodium falciparum genomes

Fogang, B.; Guery, M.-A.; Cheeseman, I. H.; Conway, D. J.; Claessens, A.

2026-05-12 genetics
10.64898/2026.05.08.723589 bioRxiv
Show abstract

Where malaria transmission declines, the remaining infections are increasingly low-density and asymptomatic, forming a persistent reservoir that is difficult to track using conventional epidemiological approaches. However, genomic data from such community-level infections remain scarce, limiting the ability to track parasite lineages, detect clonal expansions, and identify persistent chronic infections in pre-elimination settings. Here, 78 single-genotype P. falciparum genome sequences are analysed from community infections within a small area of The Gambia, where malaria transmission has substantially declined over recent decades. Pairwise identity-by-descent (IBD) analysis revealed generally low genetic relatedness among parasites, consistent with ongoing recombination and genetic mixing at the community scale. Nevertheless, eight clusters of near-identical genomes (IBD > 0.9) were identified, enabling the inference of recent de novo mutations that differentiate these genomes. Across these clusters, 43 de novo single-nucleotide polymorphisms and 19 short indels were identified using long-read-derived reference genomes. The observed pattern of mutation in natural infections broadly resembled that previously reported from laboratory mutation-accumulation experiments, including a strong transition bias and enrichment of G:C[->]A:T substitutions. These results demonstrate that combining IBD analysis with de novo mutation detection enables fine-scale resolution of parasite relatedness and recent transmission history. As malaria transmission continues to decline, such approaches may become increasingly valuable for tracking local transmission, identify parasite lineages, and potentially distinguish persistent infections from reintroduction events.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Nature Communications
4913 papers in training set
Top 2%
25.5%
2
eLife
5422 papers in training set
Top 12%
6.7%
3
Genome Medicine
154 papers in training set
Top 1%
6.3%
4
Scientific Reports
3102 papers in training set
Top 20%
6.2%
5
The Lancet Microbe
43 papers in training set
Top 0.3%
3.5%
6
PLOS Pathogens
721 papers in training set
Top 4%
3.5%
50% of probability mass above
7
PLOS Computational Biology
1633 papers in training set
Top 10%
3.5%
8
mBio
750 papers in training set
Top 5%
3.2%
9
Communications Biology
886 papers in training set
Top 3%
3.0%
10
Nucleic Acids Research
1128 papers in training set
Top 7%
3.0%
11
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 30%
1.9%
12
Malaria Journal
48 papers in training set
Top 0.8%
1.9%
13
Nature Microbiology
133 papers in training set
Top 2%
1.7%
14
Cell Genomics
162 papers in training set
Top 3%
1.7%
15
PLOS Biology
408 papers in training set
Top 10%
1.7%
16
The American Journal of Human Genetics
206 papers in training set
Top 2%
1.7%
17
PLOS Genetics
756 papers in training set
Top 10%
1.3%
18
mSphere
281 papers in training set
Top 4%
1.2%
19
Molecular Biology and Evolution
488 papers in training set
Top 3%
1.1%
20
PLOS ONE
4510 papers in training set
Top 62%
0.9%
21
EMBO Molecular Medicine
85 papers in training set
Top 3%
0.9%
22
Science Translational Medicine
111 papers in training set
Top 5%
0.9%
23
Genome Research
409 papers in training set
Top 4%
0.9%
24
Science Advances
1098 papers in training set
Top 27%
0.9%
25
Genome Biology
555 papers in training set
Top 7%
0.8%
26
Cell
370 papers in training set
Top 17%
0.7%
27
Cell Reports
1338 papers in training set
Top 34%
0.7%
28
The ISME Journal
194 papers in training set
Top 2%
0.7%
29
The Journal of Infectious Diseases
182 papers in training set
Top 5%
0.7%
30
Microbial Genomics
204 papers in training set
Top 2%
0.7%