Back

dnaudit + Pydnaweb: A lightweight text-based planning and documentation workflow for genetic cloning with automatic verification

Fernandes, P.; Ghasemi, F.; Silva, P. C.; Paiva, S.; Johansson, B.

2025-06-03 bioinformatics
10.1101/2025.05.31.657172 bioRxiv
Show abstract

Life science research often depends on the construction and analysis of recombinant DNA molecules, where sequence accuracy is critical. However, the field continues to face a reproducibility crisis, partly due to the lack of comprehensive, systematic, and verifiable documentation of genetic constructions. Although most cloning procedures are deterministic and theoretically describable in a complete and unambiguous way, published methods are typically described in a form free narrative, making them laborious to reproduce and assess for completeness. Tools like the Python package Pydna support programmable and reproducible cloning strategies but require coding expertise, which can be a barrier for some users. To address this, we developed Pydnaweb and dnaudit, two open-source and complementary web tools that build on Pydna. Pydnaweb offers simulation of unit operations such as PCR and restriction digestion providing results in text format. These results can be collected and combined to form complex cloning strategies in a bottom-up approach. Dnaudit can verify such collections for internal consistency and that the cloning strategy meet a specific goal such as the expression of a protein sequence. The tools are design for a low barrier of entry, and they can be used separately. This workflow enables fully automated validation, providing a no-code, reproducible solution for documenting and sharing molecular cloning workflows. These tools ease compliance with FAIR principles and align with emerging standards for the transparent and reproducible sharing of scientific methods and data.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
Bioinformatics
1061 papers in training set
Top 2%
14.1%
2
BMC Bioinformatics
383 papers in training set
Top 1%
8.0%
3
Nucleic Acids Research
1128 papers in training set
Top 3%
6.7%
4
NAR Genomics and Bioinformatics
214 papers in training set
Top 0.3%
6.2%
5
Nature Communications
4913 papers in training set
Top 30%
6.2%
6
ACS Synthetic Biology
256 papers in training set
Top 0.8%
4.8%
7
Nature Biotechnology
147 papers in training set
Top 2%
4.8%
50% of probability mass above
8
PLOS ONE
4510 papers in training set
Top 32%
4.8%
9
Nature Protocols
30 papers in training set
Top 0.1%
4.8%
10
GigaScience
172 papers in training set
Top 0.3%
4.2%
11
Cell Reports Methods
141 papers in training set
Top 1.0%
3.5%
12
Protein Science
221 papers in training set
Top 0.4%
3.5%
13
Genome Biology
555 papers in training set
Top 3%
3.0%
14
Journal of Molecular Biology
217 papers in training set
Top 1%
2.0%
15
PLOS Computational Biology
1633 papers in training set
Top 18%
1.5%
16
Journal of Open Source Software
22 papers in training set
Top 0.1%
1.5%
17
eLife
5422 papers in training set
Top 48%
1.3%
18
STAR Protocols
15 papers in training set
Top 0.1%
1.2%
19
Methods
29 papers in training set
Top 0.4%
0.9%
20
Bioinformatics Advances
184 papers in training set
Top 4%
0.8%
21
Computational and Structural Biotechnology Journal
216 papers in training set
Top 9%
0.8%
22
Nature Methods
336 papers in training set
Top 6%
0.7%
23
Cell Systems
167 papers in training set
Top 14%
0.6%
24
Scientific Reports
3102 papers in training set
Top 79%
0.6%