Genetics — Latest Matching Preprints

1

Bias in diversity estimators and neutrality tests induced by neutral polymorphic structural variants

Ramos-Onsins, S. E.; Ross-Ibarra, J.; Caceres, M.; Ferretti, L.

2026-02-28 genetics 10.64898/2026.02.26.708357 medRxiv

Top 0.1%

89.7%

Show abstract

Estimators of genetic diversity and neutrality tests derived from the site frequency spectrum (SFS), such as Wattersons{theta} W, nucleotide diversity{pi} , Tajimas D, and Fay and Wus H, are designed to be interpreted relative to a baseline defined by the standard neutral SFS. In genomic regions strongly linked to a polymorphic structural variant (SV), deviations from these baselines occur even under strict neutrality: conditioning on an SV at known frequency partitions samples into SV and non-SV haplotypes and distorts the SFS for linked neutral mutations. These deviations are well understood for genomic inversions under long-term balancing selection. However, not all SVs are under strong selection, and the evolution of some SVs may be better approximated as neutral. Here we derive analytical expectations for the unfolded (and, when necessary, folded) SFS of single nucleotide polymorphisms conditional on neutral linked polymorphic SVs, including inversions, deletions, insertions, and introgressions. We use these expectations to quantify the resulting bias in standard diversity estimators and neutrality tests as a function of SV frequency and type. Finally, we discuss approaches to build corrected estimators of diversity and neutrality tests that are unbiased/centered after accounting for the presence and frequency of the SV.

2

Taming Strong Selection with Large Sample Sizes

Gravel, S.; Krukov, I.

2021-03-30 genetics 10.1101/2021.03.30.437711 medRxiv

Top 0.1%

70.2%

Show abstract

1The fate of mutations and the genetic load of populations depend on the relative importance of genetic drift and natural selection. In addition, the accuracy of numerical models of evolution depends on the strength of both selection and drift: strong selection breaks the assumptions of the nearly neutral model, and drift coupled with large sample sizes breaks Kingmans coalescent model. Thus, the regime with strong selection and large sample sizes, relevant to the study of pathogenic variation, appears particularly daunting. Surprisingly, we find that the interplay of drift and selection in that regime can be used to define asymptotically closed recursions for the distribution of allele frequencies that are accurate well beyond the strong selection limit. Selection becomes more analytically tractable when the sample size n is larger than twice the population-scaled selection coefficient: n [≥] 2Ns (4Ns in diploids). That is, when the expected number of coalescent events in the sample is larger than the number of selective events. We construct the relevant transition matrices, show how they can be used to accurately compute distributions of allele frequencies, and show that the distribution of deleterious allele frequencies is sensitive to details of the evolutionary model.

3

Robustness of Selection and Timing Inference under Model Variation in Population Genetics

Escabi, J.; Hormoz, S.

2025-01-10 genetics 10.1101/2025.01.08.631974 medRxiv

Top 0.1%

68.8%

Show abstract

1In population genetics, accurately inferring the selection coefficient and the time of onset of advantageous mutations from genetic data is fundamental for understanding evolutionary processes. Here, we investigate how mismatches between the true evolutionary process and the inference model--specifically in the reproductive variance ({sigma}2) and the number of generations (L)--affect the posterior distributions of the selection coefficient and the time of onset. Using the Kolmogorov forward and backward equations, we model the stochastic dynamics of gene frequencies under selection and drift. We show that while the posterior distribution of the selection coefficient remains unaffected by changes in{sigma} 2 and L, this invariance does not apply to the time of onset. By framing the problem as a first passage time issue, we derive explicit expressions for the offsets in the posterior mean and variance of the time of onset that result from incorrect assumptions about{sigma} 2 and L. Our analysis reveals that these offsets are related to the mean and variance of the first passage time required for the allele frequency to reach a certain threshold, starting from an initial frequency determined by the model parameters. Under the assumption of a uniform prior for the time of onset, we find that the offset in the inferred mean is given by the difference in the effective generation duration ({Delta} = 1/{sigma}2) between the true process and the inference model. We validate our theoretical findings through simulations, demonstrating that the empirical offsets closely match our predictions. Furthermore, we generalize our results to accommodate non-uniform prior distributions, such as exponential priors, and provide numerical methods for calculating offsets under arbitrary priors. Stochastic fluctuations due to genetic drift, which are influenced by the reproductive variance and generational structure, can introduce significant biases in the posterior distribution of time of onset of advantageous mutations. By quantifying these biases, our framework enables more accurate adjustments to inferences drawn from genetic data, thereby enhancing our understanding of evolutionary dynamics and improving the reliability of population genetic analyses.

4

The effect of long-range linkage disequilibrium on allele-frequency dynamics under stabilizing selection

Negm, S.; Veller, C.

2025-09-13 genetics 10.1101/2024.06.27.601075 medRxiv

Top 0.1%

68.8%

Show abstract

Stabilizing selection on a polygenic trait reduces the traits genetic variance by (i) generating correlations (linkage disequilibria) between opposite-effect alleles throughout the genome, and (ii) selecting against rare alleles at loci that affect the trait, eroding heterozygosity at these loci. Here, we show that the linkage disequilibria, which stabilizing selection generates on a rapid timescale, slow down the subsequent allele-frequency dynamics at individual loci, which proceed on a much longer timescale. Exploiting this separation of timescales, we obtain expressions for the expected per-generation change in minor-allele frequency at individual loci, as functions of the effect sizes at these loci, the strength of selection on the trait, its variance and heritability, and the linkage relations among loci. Using whole-genome simulations, we show that our expressions predict allele-frequency dynamics under stabilizing selection more accurately than the formulae that have previously been used for this purpose. Our results have implications for understanding the genetic architecture of complex traits.

5

Reduced histone gene copy number disrupts Drosophila Polycomb function

McPherson, J.-M. E.; Grossmann, L. C.; Salzler, H. R.; Armstrong, R. L.; Kwon, E.; Matera, A. G.; McKay, D. J.; Duronio, R. J.

2023-03-28 genetics 10.1101/2023.03.28.534544 medRxiv

Top 0.1%

66.6%

Show abstract

The chromatin of animal cells contains two types of histones: canonical histones that are expressed during S phase of the cell cycle to package the newly replicated genome, and variant histones with specialized functions that are expressed throughout the cell cycle and in non-proliferating cells. Determining whether and how canonical and variant histones cooperate to regulate genome function is integral to understanding how chromatin-based processes affect normal and pathological development. Here, we demonstrate that variant histone H3.3 is essential for Drosophila development only when canonical histone gene copy number is reduced, suggesting that coordination between canonical H3.2 and variant H3.3 expression is necessary to provide sufficient H3 protein for normal genome function. To identify genes that depend upon, or are involved in, this coordinate regulation we screened for heterozygous chromosome 3 deficiencies that impair development of flies bearing reduced H3.2 and H3.3 gene copy number. We identified two regions of chromosome 3 that conferred this phenotype, one of which contains the Polycomb gene, which is necessary for establishing domains of facultative chromatin that repress master regulator genes during development. We further found that reduction in Polycomb dosage decreases viability of animals with no H3.3 gene copies. Moreover, heterozygous Polycomb mutations result in de-repression of the Polycomb target gene Ubx and cause ectopic sex combs when either canonical or variant H3 gene copy number is also reduced. We conclude that Polycomb-mediated facultative heterochromatin function is compromised when canonical and variant H3 gene copy number falls below a critical threshold.

6

Laboratory yeast crosses reveal limited epistasis in the genetic basis of complex traits

Gupta, M.; Holmes, C. M.; Belousova, J.; Gopalakrishnan, S.; Rego-Costa, A.; Desai, M. M.

2026-04-06 genetics 10.64898/2026.04.04.716439 medRxiv

Top 0.1%

66.0%

Show abstract

Mapping the genetic basis of complex traits is complicated by the presence of epistatic interactions between loci. While work in molecular genetics identifies numerous specific genetic interactions, statistical analyses of quantitative traits frequently conclude that additive (nonepistatic) models explain most heritable variation. However, these conclusions are typically limited by the narrow range of genetic relatedness(e.g. in F1 offspring of a biparental or circular cross). Here, we use a barcoded panel of Saccharomyces cerevisiae genotypes with a broad range of relatedness to quantify the effects of epistasis on the genetic architecture of seven complex traits. We find limited contributions of epistasis to the genetic basis of these traits. These results indicate that epistasis beyond that detected in standard yeast crosses may exist, yet it contributes little to phenotypic variance in these systems.

7

Genetic diversity during selective sweeps in non-recombining populations

Kaushik, S.; Jain, K.; Johri, P.

2024-09-18 genetics 10.1101/2024.09.12.612756 medRxiv

Top 0.1%

65.9%

Show abstract

Selective sweeps, resulting from the spread of beneficial, neutral, or deleterious mutations through a population, shape patterns of genetic variation at linked neutral sites. While many theoretical, computational, and statistical advances have been made in understanding the genomic signatures of selective sweeps in recombining populations, relatively less is understood in populations with little/no recombination, and arbitrary dominance and inbreeding. Using diffusion theory, we obtain the full expression for the expected site frequency spectrum (SFS) at linked neutral sites immediately post and during the fixation of moderately or strongly beneficial mutations. When a single hard sweep occurs, the SFS decays as 1/x for low derived allele frequencies (x), similar to the neutral SFS at equilibrium, whereas at higher derived allele frequencies, it follows a 1/x2 power law as also seen in a rapidly expanding neutral population. We show that these power laws are universal in the sense that they are independent of the dominance and inbreeding coefficients, and also characterize the SFS during the sweep. Additionally, we find that the derived allele frequency where the SFS shifts from the 1/x to 1/x2 power law is inversely proportional to the selection strength; thus under strong selection, the SFS follows the 1/x2 dependence for most allele frequencies. When clonal interference is pervasive, the SFS immediately post-fixation becomes U-shaped and can be approximated by the equilibrium SFS of selected sites. Our results will be important in developing statistical methods to infer the timing and strength of recent selective sweeps in asexual populations, genomic regions that lack recombination, and clonally propagating tumor populations.

8

A population genetics model explaining overdispersion in active transposable elements

Omole, A. D.; Czuppon, P.

2025-12-01 genetics 10.1101/2025.11.27.691047 medRxiv

Top 0.1%

65.7%

Show abstract

The number of transposable elements (TEs) per host genome varies within natural populations, with variance much greater than the mean. This pattern, known as overdispersion, conflicts with classical population genetic models based on the Poisson distribution, which predict equal mean and variance. To address this gap, we develop a stochastic model of TE dynamics using a bi-parental Moran process with recombination that explicitly accounts for core evolutionary forces: transposition, excision, and purifying selection. From this model, we derive analytical expressions for the mean and variance of the TE copy number. Our results show that overdispersion arises naturally when the transposition rate exceeds the product of the selection coefficient and the mean copy number, and that overdispersion increases with higher transposition rates. Additionally, we show that maintaining positive TE copy numbers at equilibrium, and thus sustaining overdispersion, requires a net transposition rate below approximately 0.5 insertions per copy per generation, a constraint satisfied by observed TE families to maintain genome stability. The derived overdispersion also accounts for the right-skewed, heavy-tailed distribution of copy numbers, capturing features that classical models fail to account for. A qualitative comparison of these predictions with data from 18 active TE families in 85 Drosophila melanogaster strains confirms these patterns: all active TEs in the data exhibited overdispersion, with variances 2-10 times the mean and the distribution showing positive, or right skewness. Collectively, our findings reveal that TE distributions deviate from Poisson expectations and establish overdispersion as an inherent feature of TE population dynamics, providing a mechanistic framework for understanding the full distributional properties of TE copy numbers.

9

Histone H3 availability is more important for development than H3.2 versus H3.3 subtype identity

McPherson, J.-M. E.; Sykes, C.; Grossmann, L. C.; Hill, C. H.; Leatham-Jensen, M. P.; Duronio, R. J.; McKay, D. J.

2026-02-10 genetics 10.64898/2026.02.09.704946 medRxiv

Top 0.1%

65.1%

Show abstract

The distinct contributions of replication-dependent and replication-independent histones to development and genome function remain unclear. In this study, we investigate how the distinct protein identities of the histone H3.2 and H3.3 subtypes contribute to development and gene regulation in Drosophila. Comparing animals in which the replication-independent H3.3 genes were mutated to produce the replication-dependent H3.2 protein with those carrying deletions of the replication-independent H3.3 genes revealed that replication-independent H3.3 is essential for fertility, adult locomotor behavior, and normal longevity. However, development to adulthood does not depend on which replication-independent H3 subtype is expressed from the H3.3 loci. Moreover, replication-independent H3.3 is not required to establish or maintain global patterns of chromatin accessibility or gene expression in the adult brain. Surprisingly, we find that expression of H3.2 from the replication-dependent HisC locus is essential in post-replicative cells in the absence of replication-independent H3.3, and we uncover a critical role for the HIRA histone chaperone complex in preserving genome function when replication-independent H3.3 is deleted. We conclude that an available pool of H3 is more critical than the specific identity of H3 in the pool.

10

Tobacco twin and single spots result from genome instability, chromosome breaks, and translocations

Demarco, I. J.; Amundson, K. R.; Phan, A.; Henry, I. M.; Comai, L.

2025-11-10 genetics 10.1101/2025.11.09.687472 medRxiv

Top 0.1%

65.0%

Show abstract

Somatic recombination can have profound consequences including clonal evolution. In plants, however, the contribution of homologous recombination (HR) to genomic instability remains unclear. The semidominant sulfur mutant of tobacco, named for its yellow foliage resulting from a chlorophyll defect, spontaneously develops green and white twin spots that have been attributed to mitotic crossovers. To test this hypothesis, we sequenced DNA from twin spots and surrounding tissues in su/+ F1 hybrids, expecting somatic crossovers to result in reciprocal chromosome arm exchange in the paired sectors. Contrary to expectations, 20 of 22 twin spots exhibited aneuploidy or reciprocal translocations between non-homologous chromosomes. One twin spot showed a pattern consistent with a reciprocal crossover. Single white or green spots were [~]10X more frequent than twin spots. Among 26 single spots, 18 resulted from terminal arm deletion, 2 from aneuploidy, and 4 were associated with translocations. Deletion and reciprocal exchange breakpoints clustered near the centromere. Together, these findings indicate that nearly all spots arise from deletions, translocations, and other rearrangements likely mediated by chromosome missegregation or nonhomologous repair rather than by HR. These rearrangements altered the ratio of mutant to wild-type alleles and consequently chlorophyll levels. We conclude that mitotic recombination between homologous chromosomes is rare, and that genome instability is the dominant driver of somatic chimerism in this system.

11

RNA editing controls meiotic drive by a Neurospora Spore killer

Rhoades, N.; Hammond, T. M.

2021-01-01 genetics 10.1101/2020.12.30.424869 medRxiv

Top 0.1%

64.5%

Show abstract

Neurospora Sk-2 is a complex meiotic drive element that is transmitted to offspring through sexual reproduction in a biased manner. Sk-2s biased transmission mechanism involves spore killing, and recent evidence has demonstrated that spore killing is triggered by a gene called rfk-1. However, a second gene, rsk, is also critically important for meiotic drive by spore killing because it allows offspring with an Sk-2 genotype to survive the toxic effects of rfk-1. Here, we present evidence demonstrating that rfk-1 encodes two protein variants: a 102 amino acid RFK-1A and a 130 amino acid RFK-1B, but only RFK-1B is toxic. We also show that expression of RFK-1B requires an early stop codon in rfk-1 mRNA to undergo adenosine-to-inosine (A-to-I) mRNA editing. Finally, we demonstrate that RFK-1B is toxic when expressed within vegetative tissue of Spore killer sensitive (SkS) strains, and that this vegetative toxicity can be overcome by co-expressing Sk-2s version of RSK. Overall, our results demonstrate that Sk-2 uses RNA editing to control when its spore killer is produced, and that the primary killing and resistance functions of Sk-2 can be conferred upon an SkS strain by the transfer of only two genes.

12

Mathematical constraints on FST: multiallelic markers in arbitrarily many populations

Alcala, N.; Rosenberg, N. A.

2021-07-25 genetics 10.1101/2021.07.23.453474 medRxiv

Top 0.1%

63.6%

Show abstract

Interpretations of values of the FST measure of genetic differentiation rely on an understanding of its mathematical constraints. Previously, it has been shown that FST values computed from a biallelic locus in a set of multiple populations and FST values computed from a multiallelic locus in a pair of populations are mathematically constrained as a function of the frequency of the allele that is most frequent across populations. We generalize from these cases to report here the mathematical constraint on FST given the frequency M of the most frequent allele at a multiallelic locus in a set of multiple populations. Using coalescent simulations of an island model of migration with an infinitely-many-alleles mutation model, we argue that the joint distribution of FST and M helps in disentangling the separate influences of mutation and migration on FST. Finally, we show that our results explain a puzzling pattern of microsatellite differentiation: the lower FST in an interspecific comparison between humans and chimpanzees than in the comparison of chimpanzee populations. We discuss the implications of our results for the use of FST.

13

A functionally, structurally, and evolutionarily distinct set of genes are linked to phenome wide variation in maize

Liang, Z.; Qiu, Y.; Schnable, J.

2019-08-20 bioinformatics 10.1101/534503 medRxiv

Top 0.1%

62.8%

Show abstract

Naturally occurring functionally variable alleles in specific genes within a population allows the identification of which genes are involved in the determination of which phenotypes. The omnigenetic model proposes that essentially all genes which are expressed in relevant contexts likely play some role in determining phenotypic outcomes. Here, we develop an approach to identify genes where natural functional variation plays a role in shaping many phenotypic traits simultaneously. We demonstrate that this approach identifies a distinct set of genes relative to conventional genome wide association using data for 260 traits scored a maize diversity panel, and the genes identified using this approach are more likely to be independently validated than genes identified by convetional genome wide association. Genes identified by the new approach share a number of features with a gold standard set of genes characterized through forward genetics which separate them from both genes identified by conventional genome wide association and the overall population of annotated gene models. These features include evidence of significantly stronger purifying selection, positional conservation across the genomes of related species, greater length, and a scarcity of presence absence variation for these loci in natural populations. Genes identified by phenome-wide analyses also showed much stronger signals of GO enrichment and purification than genes identified by conventional genome wide association. Overall these findings are consistent with large subset of annotated gene models - despite support from transcriptional and homology evidence - being unlikely to play any role in determining organismal phenotypes.

14

Dual roles of histone H3 lysine-4 in antagonizing Polycomb group function and promoting target gene expression

Anyetei-Anum, C. S.; Leatham-Jensen, M. P.; Fox, G. C.; Smith, B. R.; Krajewski, K.; Strahl, B. D.; Dowen, J.; Matera, A. G.; Duronio, R. J.; McKay, D. J.

2024-06-29 genetics 10.1101/2024.06.25.600669 medRxiv

Top 0.1%

62.6%

Show abstract

Tight control over cell identity gene expression is necessary for proper adult form and function. The opposing activities of Polycomb and trithorax complexes determine the ON/OFF state of targets like the Hox genes. Trithorax encodes a methyltransferase specific to histone H3 lysine-4 (H3K4). However, there is no direct evidence that H3K4 regulates Polycomb group target genes in vivo. Here, we demonstrate two key roles for replication-dependent histone H3.2K4 in target control. We find that H3.2K4 antagonizes Polycomb group catalytic activity and that it is required for proper target gene activation. We conclude that H3.2K4 directly regulates expression of Polycomb targets.

15

A new experimental system to study meiotic recurrent non-allelic homologous recombination in budding yeast

Sedam, H. N. C.; Argueso, J. L.

2020-06-25 genetics 10.1101/2020.06.25.164855 medRxiv

Top 0.1%

61.9%

Show abstract

In humans, de novo recurrent copy number variations (CNVs) often arise during meiosis from non-allelic homologous recombination (NAHR) between low copy repeat elements (LCRs). These chromosomal rearrangements are responsible for a wide variety of genomic disorders involving duplication or deletion of dose-sensitive genes. The precise factors that steer meiotic cells toward this detrimental recombination pathway are not fully understood. To create a model for the investigation of LCR-mediated CNV mechanisms, we developed a diploid experimental system in Saccharomyces cerevisiae. We modified the right arm of chromosome V through the introduction of engineered LCRs: duplicated 5 to 35 kb segments of yeast DNA flanking single copy interstitial spacers, analogously to the meiotic NAHR substrates that exist in humans. Phenotypic markers, including a copy number reporter, were inserted within the interstitial spacer. Their segregation in the haploid meiotic progeny was used to phenotypically identity and classify recurrent CNV events. This system allowed us to measure the effects of LCR size on the frequency of meiotic de novo recurrent CNV formation, and to determine the relative proportions of each of the three main NAHR classes: interhomolog, intersister, and intrachromatid. The frequency of CNV increased as the LCRs became larger, and interhomolog NAHR was overrepresented relative to the two other classes. We showed that this experimental system directly mimics the features of de novo recurrent CNVs reported in human disease, thus it represents a promising tool for the discovery and characterization of conserved cellular factors and environmental exposures that can modulate meiotic NAHR.

16

The effect of a reduction in population size on mean fitness and inbreeding depression

Lopez-Cortegano, E.; Charlesworth, B.

2026-05-21 genetics 10.64898/2026.05.15.725556 medRxiv

Top 0.1%

61.8%

Show abstract

A sudden reduction in population size increases the rate of genetic drift, reducing variability and increasing the mean level of homozygosity. The resulting increased exposure of recessive or partially recessive, strongly deleterious alleles to selection against homozygotes may lead to their being purged from the population, potentially allowing mean fitness to increase after an initial decline, and accelerating the decline in inbreeding depression associated with reduced variability. However, detailed population genetic theory on the effects of population bottlenecks on mean fitness and inbreeding depression remains limited. We develop a theoretical framework for small, randomly mating populations founded from a large population near mutation-selection-drift equilibrium, using both simulations and approximate analytical predictions. These provide quantitative predictions for the dynamics of the populations mean fitness and level of inbreeding depression following a bottleneck. In particular, we derive an approximate expression for the time needed for mean fitness to recover after an initial decline; such a recovery requires selection to be sufficiently strong relative to drift and mutations to be sufficiently recessive. In contrast, weakly deleterious mutations cause reductions in mean fitness and inbreeding depression that are similar in size to those predicted from increases in neutral homozygosity.

17

Redesigning the Drosophila histone gene cluster: An improved genetic platform for spatiotemporal manipulation of histone function

Crain, A. T.; Nevil, M.; Leatham-Jensen, M. P.; Reeves, K. B.; Matera, A. G.; McKay, D. J.; Duronio, R. J.

2024-04-28 genetics 10.1101/2024.04.25.591202 medRxiv

Top 0.1%

61.5%

Show abstract

Mutating replication-dependent (RD) histone genes is an important tool for understanding chromatin-based epigenetic regulation. Deploying this tool in metazoan models is particularly challenging because RD histones in these organisms are typically encoded by many genes, often located at multiple loci. Such RD histone gene arrangements make the ability to generate homogenous histone mutant genotypes by site-specific gene editing quite difficult. Drosophila melanogaster provides a solution to this problem because the RD histone genes are organized into a single large tandem array that can be deleted and replaced with transgenes containing mutant histone genes. In the last [~]15 years several different RD histone gene replacement platforms have been developed using this simple strategy. However, each platform contains weaknesses that preclude full use of the powerful developmental genetic capabilities available to Drosophila researchers. Here we describe the development of a newly engineered platform that rectifies many of these weaknesses. We used CRISPR to precisely delete the RD histone gene array (HisC), replacing it with a multifunctional cassette that permits site-specific insertion of either one or two synthetic gene arrays using selectable markers. We designed this cassette with the ability to selectively delete each of the integrated gene arrays in specific tissues using site-specific recombinases. We also present a method for rapidly synthesizing histone gene arrays of any genotype using Golden Gate cloning technologies. These improvements facilitate generation of histone mutant cells in various tissues at different stages of Drosophila development and provide an opportunity to apply forward genetic strategies to interrogate chromatin structure and gene regulation.

18

Genome-wide regulatory effects of STRs stabilized by elevated expression of antioxidant genes in C. elegans

Zhang, G.; Andersen, E.

2022-09-28 genetics 10.1101/2022.09.27.509703 medRxiv

Top 0.1%

59.2%

Show abstract

Genetic variation can cause significant differences in gene expression among individuals. Although quantitative genetic mapping techniques provide ways to identify genome-wide regulatory loci, they almost entirely focus on single nucleotide variants (SNVs). Short tandem repeats (STRs) represent a large source of genetic variation with potential regulatory effects. Here, we leverage the recently generated expression and STR variation data among wild Caenorhabditis elegans strains to conduct a genome-wide analysis of how STRs affect gene expression variation. We identify thousands of expression STRs (eSTRs) showing regulatory effects and demonstrate that they explain missing heritability beyond SNV-based expression quantitative trait loci. We illustrate specific regulatory mechanisms such as how eSTRs affect splicing sites and alternative splicing efficiency. We also show that differential expression of antioxidant genes might affect STR variation systematically. Overall, we reveal the interplay between STRs and gene expression variation in a tractable model system to ultimately associate STR variation with differences in complex traits.

19

Haldane’s Probability of Mutant Survival is Not the Probability of Allele Establishment

Krukov, I.; de Koning, A. P. J.

2019-07-16 genetics 10.1101/704577 medRxiv

Top 0.1%

59.2%

Show abstract

Haldane notably showed in 1927 that the probability of fixation for an advantageous allele is approximately 2s, for selective advantage s. This widely known result is variously interpreted as either the fixation probability or the establishment probability, where the latter is considered the likelihood that an allele will survive long enough to have effectively escaped loss by drift. While Haldane was concerned with escape from loss by drift in the same paper, in this short note we point out that: 1) Haldanes probability of survival is analogous to the probability of fixation in a Wright-Fisher model (as also shown by others); and 2) This result is unrelated to Haldanes consideration of how common an allele must be to probably spread through the species. We speculate that Haldanes survival probability may have become misunderstood over time due to a conflation of terminology about surviving drift and ultimately surviving (i.e., fixing). Indeed, we find that the probability of establishment remarkably appears to have been overlooked all these years, perhaps as a consequence of this misunderstanding. Using straightforward diffusion and Markov chain methods, we show that under Haldanes assumptions, where establishment is defined by eventual fixation being more likely that extinction, the establishment probability is actually 4s when the fixation probability is 2s. Generalizing consideration to deleterious, neutral, and adaptive alleles in finite populations, if establishment is defined by the odds ratio between eventual fixation and extinction, k, the general establishment probability is (1 + k)/k times the fixation probability. It is therefore 4s when k = 1, or 3s when k = 2 for beneficial alleles in large populations. As k is made large, establishment becomes indistinguishable from fixation, and ceases to be a useful concept. As a result, we recommend establishment be generally defined as when the odds of ultimate fixation are greater than for extinction (k = 1, following Haldane), or when fixation is twice as likely as extinction (k = 2).

20

Analysis of meiotic recombination in Drosophila simulans shows heterozygous inversions do not cause an interchromosomal effect

Man, B.; Kim, E.; Vadlakonda, A.; Stern, D. L.; Crown, N.

2024-03-12 genetics 10.1101/2024.03.09.584235 medRxiv

Top 0.1%

58.6%

Show abstract

Chromosome inversions are of unique importance in the evolution of genomes and species because when heterozygous with a standard arrangement chromosome, they suppress meiotic crossovers within the inversion. In Drosophila species, heterozygous inversions also cause the interchromosomal effect, whereby the presence of a heterozygous inversion induces a dramatic increase in crossover frequencies in the remainder of the genome within a single meiosis. To date, the interchromosomal effect has been studied exclusively in species that also have high frequencies of inversions in wild populations. We took advantage of a recently developed approach for generating inversions in Drosophila simulans, a species that does not have inversions in wild populations, to ask if there is an interchromosomal effect. We used the existing chromosome 3R balancer and generated a new chromosome 2L balancer to assay for the interchromosomal effect genetically and cytologically. We found no evidence of an interchromosomal effect in D. simulans. To gain insight into the underlying mechanistic reasons, we qualitatively analyzed the relationship between meiotic double-strand break formation and synaptonemal complex assembly. We find that the synaptonemal complex is assembled prior to double-strand break formation as in D. melanogaster; however, we show that the synaptonemal complex is assembled prior to localization of the oocyte determination factor Orb, whereas in D. melanogaster, synaptonemal complex formation does not begin until Orb is localized. Together, our data show heterozygous inversions in D. simulans do not induce an interchromosomal effect and that there are differences in the developmental programming of the early stages of meiosis.