Back

Implementing a data infrastructure for precision oncology projects leveraging REDCap

Vesteghem, C.; Dahl, S. C.; Broendum, R. F.; Soenderkaer, M.; Boedker, J. S.; Schmitz, A.; Weischenfeldt, J.; Pedersen, I. S.; Sommer, M.; Rytter, A. S.; Nielsen, M. M.; Ladekarl, M.; Severinsen, M. T.; Dybkaer, K.; Groenbaek, K.; El-Galaly, T.; Roug, A. S.; Boegsted, M.

2022-05-12 health informatics
10.1101/2022.05.09.22274599
Show abstract

ObjectivesTo facilitate clinical implementation and research in precision oncology, notably the pairing of patients, variants and treatments to identify candidates for clinical trials, we have built a data infrastructure to 1) capture and store data, 2) reduce manual tasks for clinical and genomic data collection and management, 3) combine data for quality controls, reporting and findability. InfrastructureThe infrastructure uses REDCap repositories to capture and store data. The structure of these repositories is customized for each project. Additionally, a cross-project web platform was developed using software development best practices and state-of-the-art web technologies to circumvent REDCaps limitations and integrate other third-party resources. Using REDCaps application programming interfaces, this platform allowed validation of data across multiple repositories, easy import of data from external sources, generation of overviews of included patients and available data, combination of genomic and clinical data to generate tumour board reports and the findability of data. Its design was driven by data stewardship best practices. UsageAcross four precision medicine projects, the infrastructure has been used to collect data for 1921 patients, including 453 genomic data files. The custom-built web platform made it possible to import, validate, and present data in a comprehensive manner. This included building tumour board reports for clinicians, combining clinical and genomic data, and search functionalities for researchers. DiscussionREDCap allowed us to capitalize on the numerous data capture and management features developed in this solution. Designing a cross-project platform guarantees long-term relevance where developments can be mutualised across projects and allowed us to make the overall solution more compliant with the FAIR (Findable, Accessible, Interoperable, Reusable) data principles. Further developments should be considered, notably automatic retrieval of data from electronic health records to limit the number of manual tasks. ConclusionThe proposed infrastructure allowed our precision oncology projects to gain efficiency in data collection and increase data quality by reducing manual work, and it gave a straightforward and customized access to data for researchers and clinicians.

Matching journals

The top 7 journals account for 50% of the predicted probability mass.

1
JCO Clinical Cancer Informatics
based on 14 papers
Top 0.1%
15.3%
2
JAMIA Open
based on 35 papers
Top 0.6%
10.1%
3
BMC Medical Informatics and Decision Making
based on 36 papers
Top 0.9%
10.1%
4
BMJ Health & Care Informatics
based on 13 papers
Top 0.1%
5.0%
5
Journal of the American Medical Informatics Association
based on 53 papers
Top 3%
4.7%
6
Scientific Data
based on 30 papers
Top 0.3%
4.5%
7
Journal of Medical Internet Research
based on 81 papers
Top 6%
2.8%
50% of probability mass above
8
PLOS ONE
based on 1737 papers
Top 80%
2.8%
9
Journal of Biomedical Informatics
based on 37 papers
Top 2%
2.8%
10
International Journal of Medical Informatics
based on 25 papers
Top 2%
2.5%
11
JMIR Medical Informatics
based on 16 papers
Top 2%
2.5%
12
Scientific Reports
based on 701 papers
Top 62%
2.4%
13
Frontiers in Digital Health
based on 18 papers
Top 1%
2.4%
14
Informatics in Medicine Unlocked
based on 11 papers
Top 0.8%
2.2%
15
Cancer Medicine
based on 17 papers
Top 2%
1.6%
16
Cancers
based on 57 papers
Top 5%
1.6%
17
Bioinformatics
based on 24 papers
Top 1%
1.3%
18
BMC Medical Research Methodology
based on 41 papers
Top 3%
1.3%
19
PLOS Digital Health
based on 88 papers
Top 10%
1.2%
20
BMJ Open
based on 553 papers
Top 49%
0.8%
21
JMIR Public Health and Surveillance
based on 45 papers
Top 11%
0.8%
22
npj Digital Medicine
based on 85 papers
Top 13%
0.8%
23
JMIR Formative Research
based on 31 papers
Top 7%
0.7%
24
BMC Medical Genomics
based on 12 papers
Top 2%
0.7%
25
Computers in Biology and Medicine
based on 39 papers
Top 8%
0.7%
26
Nature Medicine
based on 88 papers
Top 20%
0.7%
27
The Lancet Digital Health
based on 25 papers
Top 6%
0.7%