Back

Point cloud local ancestry inference (PCLAI): continuous coordinate-based ancestry along the genome

Geleta, M.; Mas Montserrat, D.; Ioannidis, N. M.; Ioannidis, A. G.

2026-03-25 genomics
10.64898/2026.03.23.713813 bioRxiv
Show abstract

Local ancestry inference (LAI) predicts a discrete ancestry label for each segment of an individuals genome and has become integral to studying population history, genetic variation, and polygenic trait association. We present a new local ancestry paradigm that eschews discrete categorical labels and instead performs inference in a continuous coordinate space. We call this method "point cloud local ancestry inference" (PCLAI), since it represents an individuals genetic ancestry as a point cloud with each point corresponding to a small haplotypic segment in their genome. This formulation works in any co-ordinate space (for instance, geographic or principal components) permitting the representation of continuous genetic variation at the haplotypic-segment level without resorting to artificially constructed discrete labels. We illustrate PCLAI by training on ancient samples from multiple time periods separately, yielding chromosome paintings based on geography that are time-stratified and provide insight into how individuals genomic segments moved across space and time.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Nature Genetics
240 papers in training set
Top 0.4%
14.0%
2
Genome Biology
555 papers in training set
Top 0.3%
10.2%
3
The American Journal of Human Genetics
206 papers in training set
Top 0.5%
9.9%
4
Science
429 papers in training set
Top 5%
6.2%
5
Nature Biotechnology
147 papers in training set
Top 2%
4.7%
6
Nature Communications
4913 papers in training set
Top 37%
3.9%
7
Nature
575 papers in training set
Top 6%
3.9%
50% of probability mass above
8
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 21%
3.5%
9
Bioinformatics
1061 papers in training set
Top 6%
3.5%
10
Genome Research
409 papers in training set
Top 1%
2.7%
11
Cell Genomics
162 papers in training set
Top 3%
2.0%
12
Genome Medicine
154 papers in training set
Top 4%
1.8%
13
Cell
370 papers in training set
Top 10%
1.8%
14
PLOS Computational Biology
1633 papers in training set
Top 15%
1.7%
15
Nature Computational Science
50 papers in training set
Top 0.7%
1.7%
16
Nature Methods
336 papers in training set
Top 4%
1.7%
17
Nucleic Acids Research
1128 papers in training set
Top 12%
1.5%
18
GENETICS
189 papers in training set
Top 0.8%
1.5%
19
BMC Bioinformatics
383 papers in training set
Top 5%
1.5%
20
Frontiers in Genetics
197 papers in training set
Top 6%
1.5%
21
PLOS ONE
4510 papers in training set
Top 59%
1.3%
22
eLife
5422 papers in training set
Top 50%
1.2%
23
European Journal of Human Genetics
49 papers in training set
Top 0.9%
1.2%
24
Scientific Reports
3102 papers in training set
Top 67%
1.2%
25
Cell Systems
167 papers in training set
Top 10%
0.9%
26
PLOS Genetics
756 papers in training set
Top 13%
0.9%
27
Molecular Biology and Evolution
488 papers in training set
Top 4%
0.8%
28
Science Advances
1098 papers in training set
Top 31%
0.7%
29
iScience
1063 papers in training set
Top 34%
0.7%
30
Communications Biology
886 papers in training set
Top 25%
0.7%