Back

SPARC Data Structure: Rationale and Design of a FAIR Standard for Biomedical Research Data

Bandrowski, A.; Grethe, J. S.; Pilko, A.; Gillespie, T. H.; Pine, G.; Patel, B.; Surles-Zeiglera, M.; Martone, M. E.

2021-03-19 bioinformatics
10.1101/2021.02.10.430563 bioRxiv
Show abstract

The NIH Common Funds Stimulating Peripheral Activity to Relieve Conditions (SPARC) initiative is a large-scale program that seeks to accelerate the development of therapeutic devices that modulate electrical activity in nerves to improve organ function. Integral to the SPARC program are the rich anatomical and functional datasets produced by investigators across the SPARC consortium that provide key details about organ-specific circuitry, including structural and functional connectivity, mapping of cell types and molecular profiling. These datasets are provided to the research community through an open data platform, the SPARC Portal. To ensure SPARC datasets are Findable, Accessible, Interoperable and Reusable (FAIR), they are all submitted to the SPARC portal following a standard scheme established by the SPARC Curation Team, called the SPARC Data Structure (SDS). Inspired by the Brain Imaging Data Structure (BIDS), the SDS has been designed to capture the large variety of data generated by SPARC investigators who are coming from all fields of biomedical research. Here we present the rationale and design of the SDS, including a description of the SPARC curation process and the automated tools for complying with the SDS, including the SDS validator and Software to Organize Data Automatically (SODA) for SPARC. The objective is to provide detailed guidelines for anyone desiring to comply with the SDS. Since the SDS are suitable for any type of biomedical research data, it can be adopted by any group desiring to follow the FAIR data principles for managing their data, even outside of the SPARC consortium. Finally, this manuscript provides a foundational framework that can be used by any organization desiring to either adapt the SDS to suit the specific needs of their data or simply desiring to design their own FAIR data sharing scheme from scratch.

Matching journals

The top 1 journal accounts for 50% of the predicted probability mass.

1
Scientific Data
174 papers in training set
Top 0.1%
66.4%
50% of probability mass above
2
PLOS ONE
4510 papers in training set
Top 37%
3.7%
3
Neuroinformatics
40 papers in training set
Top 0.3%
2.4%
4
NeuroImage
813 papers in training set
Top 4%
1.9%
5
Database
51 papers in training set
Top 0.3%
1.9%
6
Frontiers in Neuroinformatics
38 papers in training set
Top 0.3%
1.9%
7
GigaScience
172 papers in training set
Top 1%
1.7%
8
Frontiers in Physiology
93 papers in training set
Top 3%
1.7%
9
Scientific Reports
3102 papers in training set
Top 69%
1.0%
10
Aperture Neuro
18 papers in training set
Top 0.3%
1.0%
11
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 2%
0.9%
12
eLife
5422 papers in training set
Top 52%
0.9%
13
Physics in Medicine & Biology
17 papers in training set
Top 0.4%
0.8%
14
Computers in Biology and Medicine
120 papers in training set
Top 4%
0.8%
15
Human Brain Mapping
295 papers in training set
Top 4%
0.8%
16
BMC Medical Research Methodology
43 papers in training set
Top 1%
0.8%
17
Frontiers in Psychiatry
83 papers in training set
Top 3%
0.8%
18
Brain and Behavior
37 papers in training set
Top 1%
0.8%
19
Biology Open
130 papers in training set
Top 3%
0.8%
20
Imaging Neuroscience
242 papers in training set
Top 3%
0.8%
21
Frontiers in Neuroscience
223 papers in training set
Top 9%
0.5%
22
Computational and Structural Biotechnology Journal
216 papers in training set
Top 12%
0.5%