TY - JOUR
T1 - Autoimmune Disease Classification Based on PubMed Text Mining
AU - Samuels, Hadas
AU - Malov, Malki
AU - Saha Detroja, Trishna
AU - Ben Zaken, Karin
AU - Bloch, Naamah
AU - Gal-Tanamy, Meital
AU - Avni, Orly
AU - Polis, Baruh
AU - Samson, Abraham O.
N1 - Publisher Copyright: © 2022 by the authors.
PY - 2022/7/26
Y1 - 2022/7/26
N2 - Autoimmune diseases (AIDs) are often co-associated, and about 25% of patients with one AID tend to develop other comorbid AIDs. Here, we employ the power of datamining to predict the comorbidity of AIDs based on their normalized co-citation in PubMed. First, we validate our technique in a test dataset using earlier-reported comorbidities of seven knowns AIDs. Notably, the prediction correlates well with comorbidity (R = 0.91) and validates our methodology. Then, we predict the association of 100 AIDs and classify them using principal component analysis. Our results are helpful in classifying AIDs into one of the following systems: (1) gastrointestinal, (2) neuronal, (3) eye, (4) cutaneous, (5) musculoskeletal, (6) kidneys and lungs, (7) cardiovascular, (8) hematopoietic, (9) endocrine, and (10) multiple. Our classification agrees with experimentally based taxonomy and ranks AID according to affected systems and gender. Some AIDs are unclassified and do not associate well with other AIDs. Interestingly, Alzheimer’s disease correlates well with other AIDs such as multiple sclerosis. Finally, our results generate a network classification of autoimmune diseases based on PubMed text mining and help map this medical universe. Our results are expected to assist healthcare workers in diagnosing comorbidity in patients with an autoimmune disease, and to help researchers in identifying common genetic, environmental, and autoimmune mechanisms.
AB - Autoimmune diseases (AIDs) are often co-associated, and about 25% of patients with one AID tend to develop other comorbid AIDs. Here, we employ the power of datamining to predict the comorbidity of AIDs based on their normalized co-citation in PubMed. First, we validate our technique in a test dataset using earlier-reported comorbidities of seven knowns AIDs. Notably, the prediction correlates well with comorbidity (R = 0.91) and validates our methodology. Then, we predict the association of 100 AIDs and classify them using principal component analysis. Our results are helpful in classifying AIDs into one of the following systems: (1) gastrointestinal, (2) neuronal, (3) eye, (4) cutaneous, (5) musculoskeletal, (6) kidneys and lungs, (7) cardiovascular, (8) hematopoietic, (9) endocrine, and (10) multiple. Our classification agrees with experimentally based taxonomy and ranks AID according to affected systems and gender. Some AIDs are unclassified and do not associate well with other AIDs. Interestingly, Alzheimer’s disease correlates well with other AIDs such as multiple sclerosis. Finally, our results generate a network classification of autoimmune diseases based on PubMed text mining and help map this medical universe. Our results are expected to assist healthcare workers in diagnosing comorbidity in patients with an autoimmune disease, and to help researchers in identifying common genetic, environmental, and autoimmune mechanisms.
KW - Alzheimer’s disease
KW - PubMed
KW - autoimmune disease
KW - classification
KW - frequency analysis
KW - list of autoimmune diseases
KW - taxonomy
KW - text mining
UR - http://www.scopus.com/inward/record.url?scp=85136794084&partnerID=8YFLogxK
U2 - https://doi.org/10.3390/jcm11154345
DO - https://doi.org/10.3390/jcm11154345
M3 - Article
C2 - 35893435
SN - 2077-0383
VL - 11
JO - Journal of Clinical Medicine
JF - Journal of Clinical Medicine
IS - 15
M1 - 4345
ER -