Back

A Data-Driven Image Extraction and Analysis Pipeline for Plant Phenotyping in Controlled Environments

Orvati Nia, F.; Peeples, J.; Murray, S. C.; McFarland, A.; Vann, T.; Salehi, S.; Hardin, R.; Baltensperger, D. D.; Ibrahim, A.; Thomasson, J. A.; Fadamiro, H.; Subramanian, N. K.; Oladepo, N.; Vysyaraju, U.

2026-02-27 plant biology
10.64898/2026.02.25.707797 bioRxiv
Show abstract

Advances in automation, imaging, and artificial intelligence have enabled researchers to capture large volumes of high-quality plant data for understanding crop growth, stress, and genotype-by-environment interactions. While genomics has achieved remarkable throughput, phenotypic data acquisition remains a critical bottleneck for accelerating crop improvement and biological discovery. To address this challenge, an integrated multispectral phenotyping framework was developed using imagery from the Texas A&M AgriLife Precision Automated Phenotyping Greenhouse, a fully controlled facility designed for reproducible plant monitoring throughout the entire growth cycle of most crops. The framework expands the Plant Growth and Phenotyping (PGP v2) dataset and establishes a standardized system for continuous image acquisition, segmentation, deep feature extraction, and temporal analysis across multiple crop species. The project was organized around five coordinated areas: Administration and Coordination, Imaging and Sensor Operations, Data Processing and Management, Artificial Intelligence and Analytics, and Plant Science and Discovery. This structure ensured consistent data quality, version-controlled workflows, and communication across disciplines. The analytical pipeline integrates pseudo-RGB generation, deep learning-based detection and segmentation, image stitching, and temporal (longitudinal) tracking to isolate individual plants and analyze changes in morphology, spectral reflectance, and texture over time. Beyond technical innovation, the framework provides a replicable model for interdisciplinary collaboration and administrative integration in plant phenomics. The combined dataset, workflow, and management framework enable scalable, reproducible, and data-driven plant science research that bridges engineering and biological discovery. Plain Language SummaryTemporal imaging of plants in controlled environments helps scientists better understand growth and biological processes. However, analyzing large volumes of images has been limited by a lack of automated tools. Multispectral imagery captures additional information about plant pigments, structure, and stress beyond standard color images. We developed an automated analysis pipeline that identifies individual plants, tracks their growth over time, and measures traits such as height, area, shape, texture, and vegetation indices. Using artificial intelligence, the system efficiently processes thousands of images to provide consistent and repeatable measurements. By integrating engineering and plant biology, this work supports data-driven decisions for crop improvement and agricultural research.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
The Plant Phenome Journal
14 papers in training set
Top 0.1%
21.7%
2
Plant Direct
81 papers in training set
Top 0.1%
11.9%
3
Plant Phenomics
17 papers in training set
Top 0.1%
10.1%
4
Plant Physiology
217 papers in training set
Top 0.4%
9.7%
50% of probability mass above
5
The Plant Journal
197 papers in training set
Top 0.9%
6.1%
6
Scientific Data
174 papers in training set
Top 0.3%
6.1%
7
PLOS ONE
4510 papers in training set
Top 30%
6.1%
8
Frontiers in Plant Science
240 papers in training set
Top 3%
2.5%
9
Plant Methods
39 papers in training set
Top 0.3%
2.3%
10
Applications in Plant Sciences
21 papers in training set
Top 0.1%
1.8%
11
Nature Communications
4913 papers in training set
Top 52%
1.6%
12
Cell Systems
167 papers in training set
Top 8%
1.4%
13
Plant Communications
35 papers in training set
Top 1%
1.3%
14
GigaScience
172 papers in training set
Top 2%
1.3%
15
The Plant Cell
141 papers in training set
Top 2%
1.1%
16
Development
440 papers in training set
Top 3%
1.1%
17
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 40%
0.9%
18
New Phytologist
309 papers in training set
Top 4%
0.9%
19
eLife
5422 papers in training set
Top 57%
0.8%
20
Plant Biotechnology Journal
56 papers in training set
Top 1%
0.7%
21
Developmental Biology
134 papers in training set
Top 3%
0.7%
22
Scientific Reports
3102 papers in training set
Top 77%
0.7%
23
Remote Sensing in Ecology and Conservation
10 papers in training set
Top 0.3%
0.6%