Back

NGBO: Introducing -omics metadata to biobanking ontology.

Alghamdi, D.; Dooley, D.; Samman, M.; Hsiao, W.

2023-05-10 bioinformatics
10.1101/2023.05.09.539725 bioRxiv
Show abstract

BackgroundWith improvements in high throughput sequencing technologies and the constant generation of large biomedical datasets, biobanks increasingly take on the role of managing and delivering not just specimens but also data. However, re-using data from different biobanks is challenged by incompatible data representations. Contextual data describing biobank digital resources often contain unstructured textual information incompatible with computational processes such as automated data discovery and integration. Therefore, a consistent and comprehensive contextual data framework is needed to increase discovery, reusability, and integrability across data sources. MethodsBased on available genomics standards (e.g., Minimum information about a microarray experiment (MIAME)), the College of American Pathologists (CAP) laboratory accreditation requirements, and the Open Biological and Biomedical Ontologies Foundry principles, we developed the Next Generation Biobanking Ontology (NGBO). In addition, we created new terms and re-used concepts from the Ontology for Biomedical Investigations (OBI) and the Ontology for Biobanking (OBIB) to build NGBO. ResultsThe Next Generation Biobanking Ontology https://www.ebi.ac.uk/ols4/ontologies/ngbo is an open application ontology representing omics contextual data, licensed under the Apache License 2.0. The ontology focuses on capturing information about three main activities: wet bench analysis used to generate omics data, bioinformatics analysis used to process and interpret data, and data management. In this paper, we demonstrated the use of the NGBO to add semantic statements to real-life use cases and query data previously stored in unstructured textual format.

Matching journals

The top 2 journals account for 50% of the predicted probability mass.

1
Database
51 papers in training set
Top 0.1%
27.5%
2
Bioinformatics
1061 papers in training set
Top 0.9%
24.0%
50% of probability mass above
3
PLOS ONE
4510 papers in training set
Top 25%
6.8%
4
BMC Bioinformatics
383 papers in training set
Top 1%
6.7%
5
GigaScience
172 papers in training set
Top 0.3%
4.6%
6
Journal of the American Medical Informatics Association
61 papers in training set
Top 1.0%
2.2%
7
Scientific Data
174 papers in training set
Top 0.8%
2.0%
8
JAMIA Open
37 papers in training set
Top 0.7%
1.8%
9
Bioinformatics Advances
184 papers in training set
Top 3%
1.8%
10
BMC Medical Informatics and Decision Making
39 papers in training set
Top 1%
1.8%
11
Scientific Reports
3102 papers in training set
Top 61%
1.6%
12
Computational and Structural Biotechnology Journal
216 papers in training set
Top 6%
1.3%
13
JCO Clinical Cancer Informatics
18 papers in training set
Top 0.6%
1.0%
14
Nucleic Acids Research
1128 papers in training set
Top 15%
0.9%
15
BMC Medical Genomics
36 papers in training set
Top 1%
0.8%
16
Cell Systems
167 papers in training set
Top 12%
0.8%
17
Orphanet Journal of Rare Diseases
18 papers in training set
Top 0.6%
0.8%
18
GENETICS
189 papers in training set
Top 1%
0.8%
19
PLOS Digital Health
91 papers in training set
Top 3%
0.8%
20
Med
38 papers in training set
Top 1.0%
0.7%
21
PLOS Computational Biology
1633 papers in training set
Top 26%
0.7%
22
Nature Communications
4913 papers in training set
Top 66%
0.5%