Characterization of long COVID temporal sub-phenotypes by distributed representation learning from electronic health record data: a cohort study

Research output: Contribution to journalArticlepeer-review

2 Scopus citations

Abstract

Background: Characterizing Post-Acute Sequelae of COVID (SARS-CoV-2 Infection), or PASC has been challenging due to the multitude of sub-phenotypes, temporal attributes, and definitions. Scalable characterization of PASC sub-phenotypes can enhance screening capacities, disease management, and treatment planning. Methods: We conducted a retrospective multi-centre observational cohort study, leveraging longitudinal electronic health record (EHR) data of 30,422 patients from three healthcare systems in the Consortium for the Clinical Characterization of COVID-19 by EHR (4CE). From the total cohort, we applied a deductive approach on 12,424 individuals with follow-up data and developed a distributed representation learning process for providing augmented definitions for PASC sub-phenotypes. Findings: Our framework characterized seven PASC sub-phenotypes. We estimated that on average 15.7% of the hospitalized COVID-19 patients were likely to suffer from at least one PASC symptom and almost 5.98%, on average, had multiple symptoms. Joint pain and dyspnea had the highest prevalence, with an average prevalence of 5.45% and 4.53%, respectively. Interpretation: We provided a scalable framework to every participating healthcare system for estimating PASC sub-phenotypes prevalence and temporal attributes, thus developing a unified model that characterizes augmented sub-phenotypes across the different systems. Funding: Authors are supported by National Institute of Allergy and Infectious Diseases, National Institute on Aging, National Center for Advancing Translational Sciences, National Medical Research Council, National Institute of Neurological Disorders and Stroke, European Union, National Institutes of Health, National Center for Advancing Translational Sciences.

Original languageEnglish
Article number102210
JournalEClinicalMedicine
Volume64
DOIs
StatePublished - Oct 2023

Bibliographical note

Publisher Copyright:
© 2023 The Author(s)

Funding

Authors are supported by National Institute of Allergy and Infectious Diseases, National Institute on Aging, National Center for Advancing Translational Sciences, National Medical Research Council, National Institute of Neurological Disorders and Stroke, European Union, National Institutes of Health, National Center for Advancing Translational Sciences.Authors were supported by National Institute of Allergy and Infectious Diseases R01AI165535, National Institute on Aging RF1AG074372, National Center for Advancing Translational Sciences UL1-TR001878, National Medical Research Council Research Training Fellowship MOH-00195-00, National Institute of Neurological Disorders and Stroke (NINDS) R01NS098023 and NINDS R01NS124882, European Union Periscope Project 101016233, National Institutes of Health R01HL151643, National Center for Advancing Translational Sciences (NCATS) UL1TR001857. Authors were supported by National Institute of Allergy and Infectious Diseases R01AI165535, National Institute on Aging RF1AG074372, National Center for Advancing Translational Sciences UL1-TR001878, National Medical Research Council Research Training Fellowship MOH-00195-00, National Institute of Neurological Disorders and Stroke (NINDS) R01NS098023 and NINDS R01NS124882, European Union Periscope Project 101016233, National Institutes of Health R01HL151643, National Center for Advancing Translational Sciences (NCATS) UL1TR001857.

FundersFunder number
National Institutes of Health (NIH)MOH-00195-00
National Institutes of Health (NIH)
National Institute on AgingRF1AG074372
National Institute on Aging
National Institute of Allergy and Infectious DiseasesR01AI165535
National Institute of Allergy and Infectious Diseases
Institute of Neurological Disorders and Stroke National Advisory Neurological Disorders and Stroke CouncilR01NS124882, R01HL151643, 101016233, R01NS098023, UL1TR001857
Institute of Neurological Disorders and Stroke National Advisory Neurological Disorders and Stroke Council
National Center for Advancing Translational Sciences (NCATS)UL1-TR001878
National Center for Advancing Translational Sciences (NCATS)
European Commission
National Medical Research Council Singapore

    Keywords

    • COVID-19
    • Electronic health records
    • PASC
    • Post-acute sequelae of SARS-CoV-2
    • SARS-CoV-2

    ASJC Scopus subject areas

    • General Medicine

    Fingerprint

    Dive into the research topics of 'Characterization of long COVID temporal sub-phenotypes by distributed representation learning from electronic health record data: a cohort study'. Together they form a unique fingerprint.

    Cite this