Back

FRED: a universal tool to generate FAIR metadata for omics experiments

Walter, J.; Kuenne, C.; Knoppik, N.; Goymann, P.; Looso, M.

2026-02-04 bioinformatics
10.64898/2026.02.03.703529 bioRxiv
Show abstract

Scientific research relies on transparent dissemination of data and its associated interpretations. This task encompasses accessibility of raw data, its metadata, details concerning experimental design, along with parameters and tools employed for data interpretation. Production and handling of these data represents an ongoing challenge, extending beyond publication into individual facilities, institutes and research groups, often termed Research Data Management (RDM). It is foundational to scientific discovery and innovation, and can be paraphrased as Findability, Accessibility, Interoperability and Reusability (FAIR). Although the majority of peer-reviewed journals require the deposition of raw data in public repositories in alignment with FAIR principles, metadata frequently lacks full standardization. This critical gap in data management practices hinders effective utilization of research findings and complicates sharing of scientific knowledge. Here we present a flexible design of a machine-readable metadata format to store experimental metadata, along with an implementation of a generalized tool named FRED. It enables i) dialog based creation of metadata files, ii) structured semantic validation, iii) logical search, iv) an external programming interface (API), and v) a standalone web-front end. The tool is intended to be used by non-computational scientists as well as specialized facilities, and can be seamlessly integrated in existing RDM infrastructure.

Matching journals

The top 4 journals account for 50% of the predicted probability mass.

1
Nucleic Acids Research
1128 papers in training set
Top 0.6%
18.4%
2
Bioinformatics
1061 papers in training set
Top 2%
14.5%
3
GigaScience
172 papers in training set
Top 0.1%
12.2%
4
Scientific Data
174 papers in training set
Top 0.3%
6.3%
50% of probability mass above
5
Nature Communications
4913 papers in training set
Top 33%
4.8%
6
PLOS ONE
4510 papers in training set
Top 34%
4.2%
7
Database
51 papers in training set
Top 0.2%
3.0%
8
BMC Bioinformatics
383 papers in training set
Top 3%
3.0%
9
Genome Biology
555 papers in training set
Top 3%
2.3%
10
Nature Methods
336 papers in training set
Top 4%
2.0%
11
PLOS Computational Biology
1633 papers in training set
Top 15%
1.9%
12
NAR Genomics and Bioinformatics
214 papers in training set
Top 2%
1.9%
13
Journal of Molecular Biology
217 papers in training set
Top 2%
1.7%
14
Nature Biotechnology
147 papers in training set
Top 5%
1.6%
15
Computational and Structural Biotechnology Journal
216 papers in training set
Top 5%
1.6%
16
Peer Community Journal
254 papers in training set
Top 2%
1.3%
17
Bioinformatics Advances
184 papers in training set
Top 3%
1.3%
18
Genomics, Proteomics & Bioinformatics
171 papers in training set
Top 5%
0.9%
19
iScience
1063 papers in training set
Top 27%
0.9%
20
Scientific Reports
3102 papers in training set
Top 71%
0.9%
21
Patterns
70 papers in training set
Top 2%
0.8%
22
International Journal of Molecular Sciences
453 papers in training set
Top 16%
0.7%
23
Communications Biology
886 papers in training set
Top 25%
0.7%
24
Journal of Proteome Research
215 papers in training set
Top 2%
0.7%
25
Briefings in Bioinformatics
326 papers in training set
Top 7%
0.7%
26
Frontiers in Molecular Biosciences
100 papers in training set
Top 6%
0.7%
27
Cell Systems
167 papers in training set
Top 14%
0.6%