Back

The Orphanet Nomenclature of rare diseases: a standard terminology for improved patient recognition and data interoperability

Lucano, C.; Lagorce, D.; Olry, A.; Ali, H.; Lanneau, V.; De Carvalho, M.; Dilsizoglu Senol, A.; Fructuoso, M.; Gaillard, E.; Gaillard, M.-C.; Mihic, S.; Tannoury, M.; Sauvage, F.; Rodwell, C.; Maiella, S.; Hanauer, M.; Rath, A.

2025-08-12 health informatics
10.1101/2025.08.10.25333394 medRxiv
Show abstract

BackgroundAlthough individually uncommon, rare diseases (RD) collectively affect an estimated 329-624 million people worldwide There are over 6,500 known RD, 85% of which affect fewer than 1 person per million. As a result, the critical amount of data necessary to improve knowledge, care, and treatment can only be achieved through cumulative data collection across different countries in a standardized manner. However, RD are under-represented in medical terminologies and classification systems, hindering data sharing, interoperability, and public health monitoring. ObjectiveThe aim of this paper is to present the Orphanet Nomenclature and Classification of RD. We detail its content as well as its production and update methodology, and an overview of its mappings to other semantic resources. In addition, this work provides a clear and up-to-date count of RD based on the consensus operational definition of RD, and it details the distribution of RD by medical domain. MethodsThe Orphanet Nomenclature of RD is a multilingual standardized system composed of clinical entities, each defined by a unique and time-stable ORPHAcode, a preferred term, synonyms, a classification level, and a textual definition. This nomenclature is structured into three classification levels organized within a multi-hierarchical and multi-parental classification system by medical domain. Its production, updates, and mappings to major biomedical resources rely on standardized and published procedures, continuous literature review, manual curation and expert validation, reflecting advancements in RD knowledge and clinical practice. Presented data metrics were computed using the Orphanet July 2025 release to quantitatively characterize the content, structure, classification, and semantic alignments of the Orphanet Nomenclature and Classification system. ResultsAs of July 2025, the Orphanet Nomenclature of RD includes a total of 9,784 active clinical entities, including 6,527 disorders (corresponding to the RD definition), 1,084 subtypes of disorders, and 2,173 groups of disorders. Disorders are multiclassified into 29 classification hierarchies, each corresponding to a distinct medical domain, accurately representing the complex multisystemic nature of RD. Extensive qualified mappings ensure semantic interoperability: 97.4% of disorders are mapped to at least one ICD-10 code (6.4% with an exact proximity relationship), 71.8% are mapped to at least one ICD-11 MMS code (14.7% with an exact relationship) and 94.8% are mapped to SNOMED CT (all with an exact relationship). Genetic disorders represent 72.2% of all RD, and 63.4% are mapped to at least one phenotypic OMIM number. ConclusionsThe Orphanet Nomenclature and Classification of RD is the only RD-specific interoperable medical terminology meeting the needs of healthcare, research, and public health systems. By addressing the underrepresentation of RD in medical terminologies, it enables accurate RD identification, coding, and monitoring, supporting cross-border data interoperability, and contributing to improved knowledge, policy-making, and ultimately better care for people living with a RD.

Matching journals

The top 9 journals account for 50% of the predicted probability mass.

1
Orphanet Journal of Rare Diseases
18 papers in training set
Top 0.1%
12.5%
2
JMIR Medical Informatics
17 papers in training set
Top 0.1%
8.4%
3
Journal of Biomedical Informatics
45 papers in training set
Top 0.2%
6.8%
4
Journal of Personalized Medicine
28 papers in training set
Top 0.1%
6.4%
5
Scientific Reports
3102 papers in training set
Top 27%
4.3%
6
Journal of the American Medical Informatics Association
61 papers in training set
Top 0.7%
4.0%
7
JAMIA Open
37 papers in training set
Top 0.3%
4.0%
8
BMC Medical Informatics and Decision Making
39 papers in training set
Top 0.8%
3.6%
9
PLOS ONE
4510 papers in training set
Top 39%
3.6%
50% of probability mass above
10
International Journal of Medical Informatics
25 papers in training set
Top 0.4%
3.6%
11
eBioMedicine
130 papers in training set
Top 0.5%
2.6%
12
Frontiers in Public Health
140 papers in training set
Top 3%
2.4%
13
Database
51 papers in training set
Top 0.3%
2.1%
14
Scientific Data
174 papers in training set
Top 0.9%
1.9%
15
Computers in Biology and Medicine
120 papers in training set
Top 2%
1.7%
16
npj Digital Medicine
97 papers in training set
Top 2%
1.7%
17
Journal of Medical Internet Research
85 papers in training set
Top 3%
1.5%
18
Genetics in Medicine
69 papers in training set
Top 0.7%
1.5%
19
IEEE Journal of Biomedical and Health Informatics
34 papers in training set
Top 1%
1.3%
20
BMJ Health & Care Informatics
13 papers in training set
Top 0.6%
1.1%
21
PLOS Digital Health
91 papers in training set
Top 2%
1.0%
22
Informatics in Medicine Unlocked
21 papers in training set
Top 0.9%
0.9%
23
BMC Medical Research Methodology
43 papers in training set
Top 1%
0.9%
24
The Lancet Digital Health
25 papers in training set
Top 0.8%
0.9%
25
Nature Communications
4913 papers in training set
Top 60%
0.9%
26
Human Mutation
29 papers in training set
Top 0.7%
0.8%
27
Bioinformatics
1061 papers in training set
Top 9%
0.8%
28
Acta Neuropsychiatrica
12 papers in training set
Top 0.9%
0.8%
29
Med
38 papers in training set
Top 0.7%
0.8%
30
GENETICS
189 papers in training set
Top 2%
0.7%