Back

Correlation network analysis based on untargeted LC-MS profiles of cocoa reveals processing stage and origin country

Kumar, S.; D'Souza, R. N.; Corno, M.; Ullrich, M. S.; Kuhnert, N.; Huett, M.-T.

2020-02-10 systems biology
10.1101/2020.02.09.940585 bioRxiv
Show abstract

In order to implement quality control measures and create fine flavor products, an important objective in cocoa processing industry is to realize standards for characterization of cocoa raw materials, intermediate and finished products with respect to their processing stages and countries of origin. Towards this end, various works have studied separability or distinguishability of cocoa samples belonging to various processing stages in a typical cocoa processing pipeline or to different origins. Limited amount of success has been possible in this direction in that unfermented and fermented cocoa samples have been shown to group into separate clusters in PCA. However, a clear clustering with respect to the country of origin has remained elusive. In this work we suggest an alternative approach to this problem through the framework of correlation networks. For 140 cocoa samples belonging to eight countries and three progressive stages in a typical cocoa processing pipeline we compute pairwise Spearman and Pearson correlation coefficients based on the LC-MS profiles and derive correlation networks by retaining only correlations higher than a threshold. Progressively increasing this threshold reveals, first, processing stage (or sample type) modules (or network clusters) at low and intermediate values of correlation threshold and then country specific modules at high correlation thresholds. We present both qualitative and quantitative evidence through network visualization and node connectivity statistics. Besides demonstrating separability of the two data properties via this network-based method, our work suggests a new approach for studying classification of cocoa samples with nested attributes of processing stage sample types and country of origin along with possibility of including additional factors, e.g., hybrid variety, etc. in the analysis.

Matching journals

The top 5 journals account for 50% of the predicted probability mass.

1
Analytica Chimica Acta
17 papers in training set
Top 0.1%
18.0%
2
Analytical Chemistry
205 papers in training set
Top 0.1%
15.1%
3
Journal of Proteome Research
215 papers in training set
Top 0.4%
7.0%
4
PLOS ONE
4510 papers in training set
Top 26%
6.5%
5
Scientific Reports
3102 papers in training set
Top 26%
4.4%
50% of probability mass above
6
Analytical and Bioanalytical Chemistry
17 papers in training set
Top 0.1%
4.1%
7
Molecular Omics
21 papers in training set
Top 0.1%
3.1%
8
Frontiers in Plant Science
240 papers in training set
Top 3%
2.8%
9
Computational and Structural Biotechnology Journal
216 papers in training set
Top 3%
2.4%
10
PROTEOMICS
35 papers in training set
Top 0.3%
2.1%
11
SLAS Technology
11 papers in training set
Top 0.1%
1.8%
12
BMC Bioinformatics
383 papers in training set
Top 5%
1.5%
13
Metabolites
50 papers in training set
Top 0.6%
1.5%
14
Frontiers in Molecular Biosciences
100 papers in training set
Top 2%
1.4%
15
Metabolomics
11 papers in training set
Top 0.2%
1.4%
16
Heliyon
146 papers in training set
Top 4%
1.0%
17
Bioinformatics
1061 papers in training set
Top 9%
0.9%
18
npj Systems Biology and Applications
99 papers in training set
Top 2%
0.9%
19
ACS Omega
90 papers in training set
Top 3%
0.9%
20
Journal of Proteomics
27 papers in training set
Top 0.3%
0.9%
21
Nature Communications
4913 papers in training set
Top 59%
0.9%
22
Biology
43 papers in training set
Top 2%
0.8%
23
Archives of Clinical and Biomedical Research
28 papers in training set
Top 2%
0.8%
24
Journal of the American Society for Mass Spectrometry
33 papers in training set
Top 0.5%
0.8%
25
iScience
1063 papers in training set
Top 33%
0.7%
26
Chemosphere
15 papers in training set
Top 0.6%
0.7%
27
Molecular & Cellular Proteomics
158 papers in training set
Top 2%
0.7%
28
Communications Biology
886 papers in training set
Top 28%
0.7%
29
International Journal of Molecular Sciences
453 papers in training set
Top 19%
0.5%