Back

An AI/ML-Powered Workflow for End-to-End Cell Line Development

Raj Unnikandam Veettil, S.; Donatelli, J.; Kalra, G.; Veronica Ljubetic San Martin, C.; Ramakrishnan, S.; McGregor, C.; Wallace, M.; Ankala, R.; Rodrigues de Souza Pinto, L.; Dhama, A.; Regens, C.; Li, Y.; Smith, D.

2026-02-07 cell biology
10.64898/2026.02.04.703387 bioRxiv
Show abstract

The generation of clonal CHO cell lines is foundational to biologics manufacturing; however, labor-intensive cell culture workflows predominate in the field. We created the CLAIRE (Cell Line AI Recognition and Evaluation) tool to streamline end-to-end cell line development by integrating deep-learning image analysis with automated liquid handling. We benchmarked three object detection models for monoclonality verification and found DETR provides superior accuracy (>0.90 F1-score) in identifying single cells. To quantify the outgrowth of cell lines, we evaluated multiple zero-shot SAM2 segmentation models against a feature-based estimation method. Feature-based detection successfully identified diverse cell colony types while less robust performance was observed for SAM2 models, particularly for sparse density colonies. The pre-trained DETR and feature-based detection models were wrapped in a task-focused user interface that outputs cell line hitpick lists compatible with a Lynx LM1800 liquid handler in addition to custom scripts automating cell passaging and sampling. This approach yielded an end-to-end 36 day CLD workflow capable of generating high-titer cell lines for multiple complex antibody structures. Here, we open-access our trained models, user interface, and Lynx automation scripts to provide a modular toolkit useful for clonal cell line engineering projects. Graphical Abstract O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=153 SRC="FIGDIR/small/703387v1_ufig1.gif" ALT="Figure 1"> View larger version (51K): org.highwire.dtl.DTLVardef@1f72e70org.highwire.dtl.DTLVardef@109c54dorg.highwire.dtl.DTLVardef@7867b1org.highwire.dtl.DTLVardef@dfa61e_HPS_FORMAT_FIGEXP M_FIG C_FIG

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
mAbs
28 papers in training set
Top 0.1%
26.9%
2
Cell Reports Methods
141 papers in training set
Top 0.2%
8.7%
3
SLAS Technology
11 papers in training set
Top 0.1%
5.0%
4
iScience
1063 papers in training set
Top 2%
5.0%
5
Lab on a Chip
88 papers in training set
Top 0.3%
4.3%
50% of probability mass above
6
PLOS ONE
4510 papers in training set
Top 37%
3.7%
7
Computational and Structural Biotechnology Journal
216 papers in training set
Top 3%
2.2%
8
Analytical Chemistry
205 papers in training set
Top 1%
2.0%
9
Communications Biology
886 papers in training set
Top 6%
2.0%
10
Patterns
70 papers in training set
Top 0.8%
1.8%
11
Frontiers in Bioengineering and Biotechnology
88 papers in training set
Top 1%
1.8%
12
Cells
232 papers in training set
Top 2%
1.8%
13
HardwareX
16 papers in training set
Top 0.1%
1.5%
14
Development
440 papers in training set
Top 2%
1.4%
15
Nature Machine Intelligence
61 papers in training set
Top 2%
1.3%
16
Bioengineering
24 papers in training set
Top 0.7%
1.3%
17
ACS Synthetic Biology
256 papers in training set
Top 2%
1.2%
18
Nature Communications
4913 papers in training set
Top 58%
1.0%
19
eLife
5422 papers in training set
Top 51%
1.0%
20
npj Systems Biology and Applications
99 papers in training set
Top 2%
0.8%
21
SLAS Discovery
25 papers in training set
Top 0.1%
0.8%
22
Frontiers in Cell and Developmental Biology
218 papers in training set
Top 8%
0.8%
23
Integrative Biology
13 papers in training set
Top 0.2%
0.7%
24
PNAS Nexus
147 papers in training set
Top 2%
0.7%
25
Biotechnology and Bioengineering
49 papers in training set
Top 1%
0.7%
26
Journal of Visualized Experiments
30 papers in training set
Top 0.9%
0.7%
27
Cytometry Part A
30 papers in training set
Top 0.4%
0.7%
28
Advanced Biology
29 papers in training set
Top 1%
0.7%
29
ACS Applied Bio Materials
21 papers in training set
Top 1%
0.7%
30
Antibody Therapeutics
16 papers in training set
Top 0.7%
0.5%