Back

Optimizing the detection of biological signals through a semi-automated feature selection tool

Arini, G.; Mencucini, L. G. S.; de Felicio, R.; Feitosa, L. G. P.; Rezende-Teixeira, P.; Tsuji, H.; Pilon, A.; Pinto, D. R.; Costa-Lotufo, L. V.; Lopes, N. P.; Trivella, D. B.; da Silva, R. R.

2024-08-09 bioinformatics
10.1101/2024.08.07.607073 bioRxiv
Show abstract

Untargeted metabolomics is often used in studies that aim to trace the metabolic profile in a broad context, with the data-dependent acquisition (DDA) mode being the most commonly used method. However, this approach has the limitation that not all detected ions are fragmented in the data acquisition process, in addition to the lack of specificity regarding the process of fragmentation of biological signals. The present work aims to extend the detection of biological signals and contribute to overcoming the fragmentation limits of the DDA mode with a dynamic procedure that combines experimental and in silico approaches. Metabolomic analysis was performed on three different species of actinomycetes using liquid chromatography coupled to mass spectrometry. The data obtained were preprocessed by the MZmine software and processed by the custom package, RegFilter. RegFilter allowed the coverage of the entire chromatographic run and the selection of precursor ions for fragmentation that were previously missed in DDA mode. Most of the ions selected by the tool could be annotated through three levels of annotation, presenting biological relevant candidates. In addition, the tool offers the possibility of creating local spectral libraries curated according to the users interests. Thus, the adoption of a dynamic analysis flow using RegFilter allowed for detection optimization of biological signals, previously absent in the DDA mode. In addition, this workflow enables the creation and search of in-house tailored custom libraries.

Matching journals

The top 9 journals account for 50% of the predicted probability mass.

1
PLOS ONE
4510 papers in training set
Top 19%
10.2%
2
Journal of Proteome Research
215 papers in training set
Top 0.4%
8.5%
3
Analytical Chemistry
205 papers in training set
Top 0.3%
8.5%
4
Journal of Proteomics
27 papers in training set
Top 0.1%
8.5%
5
Metabolites
50 papers in training set
Top 0.2%
4.0%
6
Scientific Reports
3102 papers in training set
Top 36%
3.6%
7
Analytica Chimica Acta
17 papers in training set
Top 0.2%
3.1%
8
Peer Community Journal
254 papers in training set
Top 1%
3.1%
9
Computational and Structural Biotechnology Journal
216 papers in training set
Top 2%
3.1%
50% of probability mass above
10
Frontiers in Plant Science
240 papers in training set
Top 3%
2.9%
11
Metabolomics
11 papers in training set
Top 0.1%
2.6%
12
Analytical and Bioanalytical Chemistry
17 papers in training set
Top 0.1%
2.6%
13
BMC Bioinformatics
383 papers in training set
Top 4%
2.1%
14
PROTEOMICS
35 papers in training set
Top 0.3%
1.9%
15
Frontiers in Molecular Biosciences
100 papers in training set
Top 2%
1.7%
16
Bioinformatics
1061 papers in training set
Top 7%
1.7%
17
Frontiers in Microbiology
375 papers in training set
Top 5%
1.7%
18
Journal of the American Society for Mass Spectrometry
33 papers in training set
Top 0.2%
1.7%
19
Food Chemistry
12 papers in training set
Top 0.2%
1.7%
20
Molecular & Cellular Proteomics
158 papers in training set
Top 1%
1.5%
21
PeerJ
261 papers in training set
Top 11%
1.0%
22
BMC Microbiology
35 papers in training set
Top 1%
0.8%
23
International Journal of Molecular Sciences
453 papers in training set
Top 14%
0.8%
24
mSystems
361 papers in training set
Top 7%
0.8%
25
SoftwareX
15 papers in training set
Top 0.5%
0.7%
26
Molecules
37 papers in training set
Top 2%
0.7%
27
PLOS Neglected Tropical Diseases
378 papers in training set
Top 6%
0.6%
28
Microbiome
139 papers in training set
Top 3%
0.6%
29
mSphere
281 papers in training set
Top 7%
0.6%
30
Limnology and Oceanography: Methods
11 papers in training set
Top 0.5%
0.5%