Back

A RAG Chatbot for Precision Medicine of Multiple Myeloma

Quidwai, M. A.; Lagana, A.

2024-03-18 genetic and genomic medicine
10.1101/2024.03.14.24304293 medRxiv
Show abstract

The advent of precision medicine has revolutionized cancer treatment by integrating individual genetic, lifestyle, and environmental factors to tailor patient care (Huang et al., 2020; Ginsburg and Phillips, 2018). However, the complexity and heterogeneity of diseases like Multiple Myeloma (MM) pose significant challenges in leveraging the vast amounts of genomic data and biomedical literature available for personalized treatment planning (Rajkumar, 2014; Rollig et al., 2015). To address this, we present an innovative Retrieval-Augmented Generation (RAG) based chatbot framework that harnesses the power of Natural Language Processing (NLP) and state-of-the-art language models to curate and analyze MM-specific literature and provide personalized treatment recommendations based on patient-specific genomic data (Lewis et al., 2020). Our framework integrates the BioMed-RoBERTa-base model for embedding generation (Gururangan et al., 2020) and the Mistral-7B language model for question answering (Anthropic, 2023), enabling effective understanding and response to complex clinical queries. The retrieval component is enhanced by Amazon OpenSearch Service, ensuring fast and accurate access to relevant information. A comprehensive data analysis pipeline, including exploratory data analysis, semantic search, clustering, and topic modeling, provides valuable insights into the MM research landscape, informing the chatbots knowledge base and uncovering potential research directions (Blei et al., 2003; Mikolov et al., 2013). Deployed using Amazon Kendra, our RAG chatbot offers a user-friendly and scalable platform for accessing MM information, incorporating features such as user authentication, customizable web interface, and continuous improvement based on user feedback. The framework aims to democratize access to precision medicine by providing clinicians with a sophisticated tool for interpreting complex genomic data in the context of MM, streamlining clinical workflows, and facilitating the development of personalized treatment plans (Patel et al., 2015). This paper presents the conceptualization, development, and potential impact of our RAG-based chatbot framework on the landscape of MM treatment and precision medicine. We argue that the synergistic integration of AI, NLP, and domain-specific knowledge marks a new era of healthcare, characterized by highly personalized, data-driven, and effective treatment modalities (Thong et al., 2021). Our framework not only advances the field of precision medicine in MM but also serves as a blueprint for the development of similar systems in other complex diseases, ultimately improving patient outcomes and quality of life.

Matching journals

The top 2 journals account for 50% of the predicted probability mass.

1
Bioinformatics Advances
184 papers in training set
Top 0.1%
34.2%
2
Bioinformatics
1061 papers in training set
Top 1.0%
23.3%
50% of probability mass above
3
Journal of the American Medical Informatics Association
61 papers in training set
Top 0.2%
10.5%
4
Frontiers in Genetics
197 papers in training set
Top 3%
2.4%
5
Journal of Biomedical Informatics
45 papers in training set
Top 0.7%
2.0%
6
iScience
1063 papers in training set
Top 13%
1.8%
7
Frontiers in Bioinformatics
45 papers in training set
Top 0.2%
1.7%
8
npj Digital Medicine
97 papers in training set
Top 2%
1.7%
9
BMC Medical Genomics
36 papers in training set
Top 0.4%
1.7%
10
Computers in Biology and Medicine
120 papers in training set
Top 2%
1.5%
11
Heliyon
146 papers in training set
Top 3%
1.4%
12
GigaScience
172 papers in training set
Top 2%
1.3%
13
Scientific Reports
3102 papers in training set
Top 68%
1.1%
14
Frontiers in Bioengineering and Biotechnology
88 papers in training set
Top 2%
1.0%
15
Artificial Intelligence in the Life Sciences
11 papers in training set
Top 0.1%
1.0%
16
Human Mutation
29 papers in training set
Top 0.7%
0.8%
17
Frontiers in Oncology
95 papers in training set
Top 4%
0.7%
18
Frontiers in Molecular Biosciences
100 papers in training set
Top 5%
0.7%
19
Proceedings of the National Academy of Sciences
2130 papers in training set
Top 48%
0.5%
20
BMC Genomics
328 papers in training set
Top 7%
0.5%
21
eLife
5422 papers in training set
Top 62%
0.5%