Back

Predicting trait phenotypes from knowledge of the topology of gene networks

Beatty, A.; Winkler, C. R.; Hagen, T.; Cooper, M.

2021-07-01 genetics
10.1101/2021.06.29.450449 bioRxiv
Show abstract

In many fields there is interest in manipulating genes and gene networks to realize improved trait phenotypes. The practicality of doing so, however, requires accepted theory on the properties of gene networks that is well-tested by empirical results. The extension of quantitative genetics to include models that incorporate properties of gene networks expands the long tradition of studying epistasis resulting from gene-gene interactions. Here we consider NK models of gene networks by applying concepts from graph theory and Boolean logic theory, motivated by a desire to model the parameters that influence predictive skill for trait phenotypes under the control of gene networks; N defines the number of graph nodes, the number of genes in the network, and K defines the number of edges per node in the graph, representing the gene-gene interactions. We define and consider the attractor period of an NK network as an emergent trait phenotype for our purposes. A long-standing theoretical treatment of the dynamical properties of random Boolean networks suggests a transition from long to short attractor periods as a function of the average node degree K and the bias probability P in the applied Boolean rules. In this paper we investigate the appropriateness of this theory for predicting trait phenotypes on random and real microorganism networks through numerical simulation. We show that: (i) the transition zone between long and short attractor periods depends on the number of network nodes for random networks; (ii) networks derived from metabolic reaction data on microorganisms also show a transition from long to short attractor periods, but at higher values of the bias probability than in random networks with similar numbers of network nodes and average node degree; (iii) the distribution of phenotypes measured on microorganism networks shows more variation than random networks when the bias probability in the Boolean rules is above 0.75; and (iv) the topological structure of networks built from metabolic reaction data is not random, being best approximated, in a statistical sense, by a lognormal distribution. The implications of these results for predicting trait phenotypes where the genetic architecture of a trait is a gene network are discussed.

Matching journals

The top 6 journals account for 50% of the predicted probability mass.

1
Physical Biology
43 papers in training set
Top 0.1%
19.9%
2
PLOS ONE
4510 papers in training set
Top 15%
12.6%
3
PLOS Computational Biology
1633 papers in training set
Top 5%
6.5%
4
Scientific Reports
3102 papers in training set
Top 22%
5.0%
5
Biosystems
18 papers in training set
Top 0.1%
5.0%
6
Journal of The Royal Society Interface
189 papers in training set
Top 1%
3.7%
50% of probability mass above
7
Journal of Mathematical Biology
37 papers in training set
Top 0.1%
3.7%
8
Biology
43 papers in training set
Top 0.2%
3.1%
9
Royal Society Open Science
193 papers in training set
Top 1.0%
2.8%
10
Mathematical Biosciences
42 papers in training set
Top 0.4%
2.4%
11
BioSystems
11 papers in training set
Top 0.1%
1.9%
12
IFAC-PapersOnLine
12 papers in training set
Top 0.1%
1.7%
13
Bulletin of Mathematical Biology
84 papers in training set
Top 1%
1.7%
14
Frontiers in Genetics
197 papers in training set
Top 5%
1.7%
15
Theoretical Population Biology
47 papers in training set
Top 0.1%
1.5%
16
Frontiers in Bioengineering and Biotechnology
88 papers in training set
Top 2%
1.4%
17
Bioinformatics
1061 papers in training set
Top 8%
1.4%
18
Journal of Biosciences
12 papers in training set
Top 0.1%
1.4%
19
Applied Sciences
24 papers in training set
Top 0.4%
1.4%
20
Archives of Clinical and Biomedical Research
28 papers in training set
Top 1%
1.3%
21
Physical Review E
95 papers in training set
Top 1.0%
1.0%
22
PeerJ
261 papers in training set
Top 11%
1.0%
23
BMC Bioinformatics
383 papers in training set
Top 6%
0.9%
24
Frontiers in Plant Science
240 papers in training set
Top 5%
0.8%
25
Physical Review Research
46 papers in training set
Top 0.7%
0.8%
26
Genetics
225 papers in training set
Top 4%
0.7%
27
Frontiers in Cell and Developmental Biology
218 papers in training set
Top 9%
0.7%
28
Infectious Disease Modelling
50 papers in training set
Top 1%
0.7%
29
Molecular Genetics and Genomics
11 papers in training set
Top 0.6%
0.5%
30
Ecology and Evolution
232 papers in training set
Top 5%
0.5%