Back

Culsma: A Formal Language for Laboratory Protocols

Chen, Y.; Sun, M.; Tadepally, L.; Wang, J.; Barcenilla, H.; Gonzalez, L.; Brodin, P.

2026-05-12 bioinformatics
10.64898/2026.05.07.723509 bioRxiv
Show abstract

The application of artificial intelligence to biomedical research increasingly depends on iterative cycles in which AI systems analyze experimental data, propose follow-up conditions, and drive automated execution at scale, a paradigm central to Bio-AI and autonomous laboratory science. For such cycles to operate, laboratory protocols must be expressed in a form that is simultaneously human-readable and machine-executable. Natural-language descriptions, the current standard in laboratory practice, do not satisfy this dual requirement. We present Culsma, a formal language and execution framework that elevates laboratory protocols from informal prose to semantically explicit workflow programs that can be analyzed, validated, executed, and transferred across settings. The same protocol can be read and verified by a bench scientist, and parsed, validated, and executed by an automated pipeline without re-translation. We demonstrate an end-to-end implementation providing concrete evidence of practical viability.

Matching journals

The top 9 journals account for 50% of the predicted probability mass.

1
Cell Systems
167 papers in training set
Top 1%
8.5%
2
npj Digital Medicine
97 papers in training set
Top 0.7%
7.2%
3
Nature Methods
336 papers in training set
Top 2%
6.9%
4
Nature Communications
4913 papers in training set
Top 28%
6.4%
5
PLOS ONE
4510 papers in training set
Top 27%
6.4%
6
eLife
5422 papers in training set
Top 20%
4.3%
7
Nature
575 papers in training set
Top 6%
4.0%
8
Genome Medicine
154 papers in training set
Top 2%
3.6%
9
Nature Machine Intelligence
61 papers in training set
Top 0.9%
3.6%
50% of probability mass above
10
Bioinformatics
1061 papers in training set
Top 6%
3.1%
11
Scientific Reports
3102 papers in training set
Top 43%
2.8%
12
PLOS Computational Biology
1633 papers in training set
Top 12%
2.8%
13
PLOS Biology
408 papers in training set
Top 8%
1.9%
14
iScience
1063 papers in training set
Top 11%
1.9%
15
Molecular Systems Biology
142 papers in training set
Top 0.5%
1.8%
16
BMC Bioinformatics
383 papers in training set
Top 4%
1.8%
17
Nature Medicine
117 papers in training set
Top 2%
1.7%
18
Journal of the American Medical Informatics Association
61 papers in training set
Top 1%
1.7%
19
Bioinformatics Advances
184 papers in training set
Top 3%
1.5%
20
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 35%
1.5%
21
Nucleic Acids Research
1128 papers in training set
Top 12%
1.5%
22
Nature Biotechnology
147 papers in training set
Top 6%
1.2%
23
Genome Biology
555 papers in training set
Top 6%
1.0%
24
Frontiers in Physiology
93 papers in training set
Top 5%
0.9%
25
Cell
370 papers in training set
Top 16%
0.8%
26
GigaScience
172 papers in training set
Top 3%
0.8%
27
Communications Biology
886 papers in training set
Top 23%
0.8%
28
Science
429 papers in training set
Top 21%
0.6%
29
ACS Synthetic Biology
256 papers in training set
Top 3%
0.6%
30
Briefings in Bioinformatics
326 papers in training set
Top 7%
0.6%