Omics-based computational approaches for biomarker identification, prediction, and treatment of Long COVID

Abstract:

Long COVID, or post-acute sequelae of COVID-19 (PASC), is a major global health problem, with cumulative estimates suggesting that around 400 million people worldwide have been affected. It is characterized by persistent or new symptoms such as fatigue, cognitive impairment, and breathlessness lasting beyond four weeks after acute infection. Diverse clinical manifestations, chronic course, and incompletely understood pathophysiology-including hypotheses involving viral persistence, immune dysregulation, autoimmunity, endothelial dysfunction, and metabolic reprogramming-impede the development of diagnostic criteria, biomarkers, and targeted therapies. We conducted a critical review of 101 Long COVID omics studies, focusing on the computational methods used and their methodological quality.

Using standardized criteria, we evaluated study design, statistical rigor, reproducibility, and clinical relevance across genomics, epigenomics, transcriptomics, proteomics, metabolomics, and multiomics integration, and mapped these findings onto regulatory and translational frameworks. Despite substantial methodological heterogeneity, convergent biological signals emerged.

Genomic studies implicate risk loci in immune and cardiopulmonary pathways. Epigenomic analyses identify differentially methylated regions in immune and circadian genes. Transcriptomic studies reveal persistent dysregulation of innate immune and coagulation pathways, as well as reproducible molecular endotypes. Proteomic studies consistently show abnormalities in the complement cascade and coagulation, with a small panel of complement proteins showing highly reproducible changes across independent cohorts. Metabolomic studies demonstrate sustained mitochondrial dysfunction and altered cellular bioenergetics for up to two years after infection.

Multiomics integration supports at least two major endotypes, characterized by predominant inflammatory versus metabolic dysregulation, and provides a basis for patient stratification and computational treatment discovery. Machine learning models frequently achieve high classification performance, but are rarely externally validated. Critical limitations restrict clinical translation. Most studies are underpowered relative to analytical complexity, use heterogeneous case definitions and controls, and report platform-specific signatures with limited overlap. External validation, preregistered analysis plans, and regulatory-aligned assay development are uncommon. To date, no regulatory-approved diagnostic assay or evidence-based therapeutic intervention has directly emerged from these computational findings.

Future progress requires harmonized phenotyping protocols, adequately powered longitudinal cohorts with external validation, integration of spatial omics and explainable artificial intelligence, and early engagement with regulatory and health-technology assessment pathways. This review provides a critical assessment and a translational roadmap, outlining how methodologically robust computational omics can be advanced toward clinically actionable tools for Long COVID.

Source: Pinero S, Li X, Zhang J, Winter M, Lee SH, Nguyen T, Liu L, Li J, Le TD. Omics-based computational approaches for biomarker identification, prediction, and treatment of Long COVID. Crit Rev Clin Lab Sci. 2026 Jun;63(4):332-358. doi: 10.1080/10408363.2025.2583083. Epub 2025 Dec 9. PMID: 41368891. https://pubmed.ncbi.nlm.nih.gov/41368891/

Toward a Molecular Reclassification of Myalgic Encephalomyelitis/Chronic Fatigue Syndrome: Integrating Multi-Omics, Machine Learning, and Precision Medicine

Abstract:

Myalgic Encephalomyelitis/Chronic Fatigue Syndrome (ME/CFS) is a complex, multi-system disease characterized by a multitude of symptoms across various organ systems. Diagnosis has relied heavily on heterogeneous clinical symptom presentation and evolving case definitions, with treatment focused on addressing presenting symptoms due to the paucity of validated biomarkers. Meanwhile, advances have been made in understanding the underlying pathophysiology through strong epidemiologic, clinical, and basic science studies. This narrative review synthesizes recent advances that are likely to drive a shift in understanding from symptom-based classification toward a molecularly defined understanding of the disease.

This shift in understanding will likely provide the foundation for future research efforts focused on targeting diagnosis and treatment more effectively. Specifically, we reference the identification of rare genetic risk variants through the HEAL2 deep learning framework, the large-scale DecodeME genome-wide association study, and dynamic epigenetic markers of disease state.

In addition, the findings revealed the downstream consequences of this genetic and epigenetic priming: chronic innate immune activation, CD8+ T cell exhaustion characterized by upregulation of the exhaustion-driving transcription factors Thymocyte Selection-Associated HMG Box (TOX) and Eomesodermin (EOMES), and a cellular energy crisis centered on mitochondrial dysfunction. Furthermore, results of recent studies have revealed sex-specific transcriptomic and proteomic signatures of maladaptive recovery.

We also highlight the role of machine learning and artificial intelligence integrations in translating high-dimensional multi-omics data into actionable biological insights, including the identification of monocyte subsets via Positive Unlabeled Learning, circulating cell-free RNA diagnostic signatures, and integrated multi-modal disease models such as BioMapAI.

The combination of these findings, which highlight multiple identifiable mechanisms of molecular activity, support the feasibility of molecular subtyping, precision diagnostics, and targeted therapeutic strategies for ME/CFS.

Source: Frank J, Nesterovitch N, Movva C, Klimas NG, Nathanson L. Toward a Molecular Reclassification of Myalgic Encephalomyelitis/Chronic Fatigue Syndrome: Integrating Multi-Omics, Machine Learning, and Precision Medicine. Int J Mol Sci. 2026 May 15;27(10):4436. doi: 10.3390/ijms27104436. PMID: 42196410; PMCID: PMC13207433. https://pmc.ncbi.nlm.nih.gov/articles/PMC13207433/ (Full text)

A multiomics recovery factor predicts long COVID in the IMPACC study

Abstract:

BACKGROUND: Following SARS-CoV-2 infection, approximately 10%–35% of patients with COVID-19 experience long COVID (LC), in which debilitating symptoms persist for at least 3 months. Elucidating the biologic underpinnings of LC could identify therapeutic opportunities.

METHODS: We utilized machine learning methods on biologic analytes provided over 12 months after hospital discharge from more than 500 patients with COVID-19 in the IMPACC cohort to identify a multiomics “recovery factor,” trained on patient-reported physical function survey scores. Immune profiling data included PBMC transcriptomics, serum O-link and plasma proteomics, plasma metabolomics, and blood mass cytometry by time of flight (CyTOF) protein levels. Recovery factor scores were tested for association with LC, disease severity, clinical parameters, and immune subset frequencies. Enrichment analyses identified biologic pathways associated with recovery factor scores.

RESULTS: Participants with LC had lower recovery factor scores compared with recovered participants. Recovery factor scores predicted LC as early as hospital admission, irrespective of acute COVID-19 severity. Biologic characterization revealed increased inflammatory mediators, elevated signatures of heme metabolism, and decreased androgenic steroids as predictive and ongoing biomarkers of LC. Lower recovery factor scores were associated with reduced lymphocyte and increased myeloid cell frequencies. The observed signatures are consistent with persistent inflammation driving anemia and stress erythropoiesis as major biologic underpinnings of LC.

CONCLUSION: The multiomics recovery factor identifies patients at risk of LC early after SARS-CoV-2 infection and reveals LC biomarkers and potential treatment targets.

TRIAL REGISTRATION: ClinicalTrials.gov NCT04378777.

FUNDING:

National Institute of Allergy and Infectious Diseases (NIAID), NIH (3U01AI167892-03S2, 3U01AI167892-01S2, 5R01AI135803-03, 5U19AI118608-04, 5U19AI128910-04, 4U19AI090023-11, 4U19AI118610-06, R01AI145835-01A1S1, 5U19AI062629-17, 5U19AI057229-17, 5U19AI057229-18, 5U19AI125357-05, 5U19AI128913-03, 3U19AI077439-13, 5U54AI142766-03, 5R01AI104870-07S1, 3U19AI089992-09, 3U19AI128913-03, and 5T32DA018926-1, 3U19AI1289130, U19AI128913-04S1, R01AI122220); NIH (UM1TR004528); and National Science Foundation (NSF) (DMS2310836).

Source: Gabernet G, Maciuch J, Gygi JP, Moore JF, Hoch A, Syphurs C, Chu T, Doni Jayavelu N, Corry DB, Kheradmand F, Baden LR, Sekaly RP, McComsey GA, Haddad EK, Cairns CB, Rouphael N, Fernandez-Sesma A, Simon V, Metcalf JP, Agudelo Higuita NI, Hough CL, Messer WB, Davis MM, Nadeau KC, Pulendran B, Kraft M, Bime C, Reed EF, Schaenman J, Erle DJ, Calfee CS, Atkinson MA, Brakenridge SC, Melamed E, Shaw AC, Hafler DA, Augustine AD, Becker PM, Ozonoff A, Bosinger SE, Eckalbar W, Maecker HT, Kim-Schulze S, Steen H, Krammer F, Westendorf K; IMPACC Network; Peters B, Fourati S, Altman MC, Levy O, Smolen KK, Montgomery RR, Diray-Arce J, Kleinstein SH, Guan L, Ehrlich LI. A multiomics recovery factor predicts long COVID in the IMPACC study. J Clin Invest. 2025 Sep 9;135(21):e193698. doi: 10.1172/JCI193698. PMID: 40924481; PMCID: PMC12582403. https://pmc.ncbi.nlm.nih.gov/articles/PMC12582403/ (Full text)

Multi-omics identifies lipid accumulation in Myalgic Encephalomyelitis/Chronic Fatigue Syndrome cell lines: a case-control study

Abstract:

Background: In recent years, evidence has indicated a metabolic shift towards increased demand for lipids in various lymphoid cell populations from people with Myalgic Encephalomyelitis/Chronic Fatigue Syndrome (ME/CFS). We previously screened the mitochondrial function and gene expression of B cell-derived lymphoblastoid cell lines (LCLs) generated from the blood of people with ME/CFS to characterise a model for hypothesis discovery and testing, observing elevated expression of gene products facilitating amino acid and fatty acid degradation for energy.

Method: In this follow-up study we have expanded this characterisation by profiling the polar metabolomes and non-polar lipidomes of an all-female cohort of 17 healthy control and 15 ME/CFS LCLs, and we integrated this new data with the previously generated proteomic and transcriptomic data.

Results: In the polar metabolome we detected no significantly altered individual features, while integrated multi-omic analysis by MetaboAnalyst indicated 15 dysregulated pathways. Next, in the non-polar lipidome, we identified that PC(O-38:4) had significantly reduced levels in ME/CFS LCLs and was almost entirely discriminative of ME/CFS status. Among all detected classes of lipids we found that triradylglycerolipids (“triglycerides”), diradylglycerolipids and fatty acids were the most significantly affected and were elevated, and that most lipids exhibited average levels higher than in healthy controls. BioPAN pathway analysis of the lipidomic data predicted a more-active gene product that we confirmed to be significantly elevated in both our proteomic and transcriptomic data, this being phosphatidylserine synthase 1 (PTDSS1), plus 7 other gene products that were concordantly altered in expression in the transcriptomic data. We also found that ME/CFS LCLs exhibited a significant tendency towards more saturated lipid content.

Conclusions: LCLs generated from circulating B cells from people with ME/CFS show accumulation of lipids, skewed lipid profiles and altered activity of related metabolic enzymes such as PTDSS1. These findings will inform future hypothesis-driven studies of primary lymphoid cell populations from people with ME/CFS to dissect specific immunometabolic mechanisms that may be involved in the syndrome, particularly relating to intersections between lipid abnormalities and potential effects on immune cell effector functions.

Source: Missailidis D, Armstrong CW, Anderson D, Allan CY, Sanislav O, Smith PK, Esmaili T, Creek DJ, Annesley SJ, Fisher PR. Multi-omics identifies lipid accumulation in Myalgic Encephalomyelitis/Chronic Fatigue Syndrome cell lines: a case-control study. J Transl Med. 2026 Jan 8. doi: 10.1186/s12967-025-07620-x. Epub ahead of print. PMID: 41508032. https://link.springer.com/article/10.1186/s12967-025-07620-x (Full text available as PDF file)

Exploring a genetic basis for the metabolic perturbations in ME/CFS using UK Biobank

Highlights:

  • ME/CFS shows distinct genetic influences on metabolic regulation.
  • Lipid and hormone-related pathways emerge as key areas of interest.
  • Many small genetic effects may collectively disrupt metabolic resilience in ME/CFS.

Summary:

Myalgic Encephalomyelitis/Chronic Fatigue Syndrome (ME/CFS) is a clinically heterogeneous disease lacking approved therapies. To assess genetic susceptibility towards a specific metabolic phenotype, we performed a genome-wide association study on plasma biomarker levels (mGWAS) in ME/CFS patients (n=875) and healthy controls (HCs) (n=36,033).
We identified 112 significant SNP–biomarker associations in ME/CFS, compared with 4,114 in HCs. Two SNPs specific to ME/CFS, mapping to HSD11B1 and SCGN, were associated to phospholipids in extra-large very low-density lipoproteins (VLDL) and total fatty acids respectively. Genetic effects of VLDL associations were among the least correlated between ME/CFS and HCs. Heterogeneity tests found differential effects for several lipid traits at ADAP1NR1H3 and CD40, which are involved in immune regulation.
ME/CFS mGWAS summary statistics were decomposed to uncover shared genetic-metabolic patterns, where enrichment analysis highlighted pathways in lipid metabolism, neurotransmitter transport, and inflammation. These findings provide a genetic and molecular rationale for patient heterogeneity and suggest a polygenic predisposition in which many small-effect variants may jointly perturb metabolic mechanisms.
Source: Katherine Huang, Muhammad Muneeb, Natalie Thomas, Elena K. Schneider-Futschik, Paul R. Gooley, David B. Ascher, Christopher W. Armstrong. Exploring a genetic basis for the metabolic perturbations in ME/CFS using UK Biobank. iScience, 2025, 114316 ISSN 2589-0042, https://doi.org/10.1016/j.isci.2025.114316. https://www.sciencedirect.com/science/article/pii/S2589004225025775 (Full text available as PDF file)

Urinary Peptidomic Profiling In Post-Acute Sequelae of SARS-CoV-2 Infection: A Case-Control Study

Abstract:

Post-acute sequelae of severe acute respiratory syndrome coronavirus 2-infection (PASC) is challenging to diagnose and treat, and its molecular pathophysiology remains unclear. Urinary peptidomics can provide valuable information on urine peptides that may enable improved and specified PASC diagnosis.
Using standardized capillary electrophoresis-MS, we examined the urinary peptidomes of 50 patients with PASC 10 months after COVID-19 and 50 controls, including healthy individuals (n = 42) and patients with non-COVID-19-associated myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS) (n = 8).Based on peptide abundance differences between cases and controls, we developed a diagnostic model using a support vector machine. The abundance of 195 urine peptides among PASC patients significantly differed from that in controls, with a predominant abundance of collagen alpha chains. This molecular signature (PASC195) effectively distinguished PASC cases from controls in the training set (AUC of 0.949 [95% CI 0.900–0.998; p < 0.0001]) and independent validation set (AUC of 0.962 [95% CI 0.897–1.00]; p < 0.0001]). In silico assessment suggested exercise, GLP-1RAs and mineralocorticoid receptor antagonists (MRAs) as potentially efficacious interventions. We present a novel and non-invasive diagnostic model for PASC. Reflecting its molecular pathophysiology, PASC195 has the potential to advance diagnostics and inform therapeutic interventions.

Statement of Significance of the Study

Despite the recent emergence of omics-derived candidates for post-acute sequelae of SARS-CoV-2 infection (PASC), the pending validation of proposed markers and lack of consensus result in the continuous reliance on symptom-based criteria, being subject to diagnostic uncertainties and potential recall bias. Building upon prior findings of renal involvement in acute COVID-19 pathophysiology and PASC-associated alterations, we hypothesized that the use of urinary peptides for PASC-specific biomarker discovery, unlike conventional specimens that have been utilized thus far, may offer complementary information on putative disease mechanisms.

In the present study, 195 significantly expressed peptides were used to form a classifier termed PASC195, which effectively discriminated PASC from non-PASC (p < 0.0001), including healthy individuals and non-COVID-19-associated myalgic encephalomyelitis/chronic fatigue syndrome, in both the derivation (n = 60) and an independent validation set (n = 40). The peptidome profile associated with PASC was consistent with a shift in collagen turnover, with most PASC195 peptides derived from alpha chains. Ongoing inflammatory responses, hemostatic imbalances, and endothelial damage were indicated by cross-sectional variations in endogenous peptide excretion.

Source: Gülmez D, Siwy J, Kurz K, Wendt R, Banasik M, Peters B, Dudoignon E, Depret F, Salgueira M, Nowacki E, Kurnikowski A, Mussnig S, Krenn S, Gonos S, Löffler-Ragg J, Weiss G, Mischak H, Hecking M, Schernhammer E, Beige J; UriCoV Working Group. Urinary Peptidomic Profiling In Post-Acute Sequelae of SARS-CoV-2 Infection: A Case-Control Study. Proteomics. 2025 Nov 21:e70074. doi: 10.1002/pmic.70074. Epub ahead of print. PMID: 41273049. https://analyticalsciencejournals.onlinelibrary.wiley.com/doi/10.1002/pmic.70074 (Full text)

The Role of Nuclear and Mitochondrial DNA in Myalgic Encephalomyelitis: Molecular Insights into Susceptibility and Dysfunction

Abstract:

Myalgic Encephalomyelitis (ME), also known as chronic fatigue syndrome (CFS), is a debilitating and heterogeneous disorder marked by persistent fatigue, post-exertional malaise, cognitive impairment, and multisystem dysfunction. Despite its prevalence and impact, the molecular mechanisms underlying ME remain poorly understood.
This review synthesizes current evidence on the role of DNA, both nuclear and mitochondrial, in the susceptibility and pathophysiology of ME. We examined genetic predispositions, including familial clustering and candidate gene associations, and highlighted emerging insights from genome-wide and multi-omics studies.
Mitochondrial DNA variants and oxidative stress-related damage are discussed in relation to impaired bioenergetics and symptom severity. Epigenetic modifications, particularly DNA methylation dynamics and transposable element activation, are explored as mediators of gene–environment interactions and immune dysregulation.
Finally, we explored the translational potential of DNA-based biomarkers and therapeutic targets, emphasizing the need for integrative molecular approaches to advance diagnosis and treatment. Understanding the DNA-associated mechanisms in ME offers a promising path toward precision medicine in post-viral chronic diseases.
Source: Elremaly W, Elbakry M, Vahdani Y, Franco A, Moreau A. The Role of Nuclear and Mitochondrial DNA in Myalgic Encephalomyelitis: Molecular Insights into Susceptibility and Dysfunction. DNA. 2025; 5(4):53. https://doi.org/10.3390/dna5040053 https://www.mdpi.com/2673-8856/5/4/53 (Full text)

Advancing Digital Precision Medicine for Chronic Fatigue Syndrome through Longitudinal Large-Scale Multi-Modal Biological Omics Modeling with Machine Learning and Artificial Intelligence

Abstract:

We studied a generalized question: chronic diseases like ME/CFS and long COVID exhibit high heterogeneity with multifactorial etiology and progression, complicating diagnosis and treatment. To address this, we developed BioMapAI, an explainable Deep Learning framework using the richest longitudinal multi-omics dataset for ME/CFS to date.

This dataset includes gut metagenomics, plasma metabolome, immune profiling, blood labs, and clinical symptoms. By connecting multi-omics to a symptom matrix, BioMapAI identified both disease- and symptom-specific biomarkers, reconstructed symptoms, and achieved state-of-the-art precision in disease classification.

We also created the first connectivity map of these omics in both healthy and disease states and revealed how microbiome-immune-metabolome crosstalk shifted from healthy to ME/CFS.

Source: Xiong R. Advancing Digital Precision Medicine for Chronic Fatigue Syndrome through Longitudinal Large-Scale Multi-Modal Biological Omics Modeling with Machine Learning and Artificial Intelligence. ArXiv [Preprint]. 2025 Jun 18:arXiv:2506.15761v1. PMID: 40980765; PMCID: PMC12447721. https://pmc.ncbi.nlm.nih.gov/articles/PMC12447721/ (Full text available as PDF file)

A multi-omics recovery factor predicts long COVID in the IMPACC study

Abstract:

Background. Following SARS-CoV-2 infection, ~10-35% of COVID-19 patients experience long COVID (LC), in which debilitating symptoms persist for at least three months. Elucidating biologic underpinnings of LC could identify therapeutic opportunities.

Methods. We utilized machine learning methods on biologic analytes provided over 12-months after hospital discharge from >500 COVID-19 patients in the IMPACC cohort to identify a multi-omics “recovery factor”, trained on patient-reported physical function survey scores. Immune profiling data included PBMC transcriptomics, serum O-link and plasma proteomics, plasma metabolomics, and blood CyTOF protein levels. Recovery factor scores were tested for association with LC, disease severity, clinical parameters, and immune subset frequencies. Enrichment analyses identified biologic pathways associated with recovery factor scores.

Results. LC participants had lower recovery factor scores compared to recovered participants. Recovery factor scores predicted LC as early as hospital admission, irrespective of acute COVID-19 severity. Biologic characterization revealed increased inflammatory mediators, elevated signatures of heme metabolism, and decreased androgenic steroids as predictive and ongoing biomarkers of LC. Lower recovery factor scores were associated with reduced lymphocyte and increased myeloid cell frequencies. The observed signatures are consistent with persistent inflammation driving anemia and stress erythropoiesis as major biologic underpinnings of LC.

Conclusion. The multi-omics recovery factor identifies patients at risk of LC early after SARS-CoV-2 infection and reveals LC biomarkers and potential treatment targets.

Trial Registration. ClinicalTrials.gov NCT04378777.

Funding. This study was funded by NIH, NIAID and NSF.

Source: Gisela Gabernet, Leying Guan, Lauren I.R. Ehrlich, et al. A multi-omics recovery factor predicts long COVID in the IMPACC study. J Clin Invest. September 9, 2025. https://doi.org/10.1172/JCI193698. https://www.jci.org/articles/view/193698/ (Full study available as PDF file)

Heightened innate immunity may trigger chronic inflammation, fatigue and post-exertional malaise in ME/CFS

Abstract:

Myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS) is characterized by unexplained fatigue, post-exertional malaise (PEM), and cognitive dysfunction. ME/CFS patients often report a prodrome consistent with infection. We present a multi-omics analysis based on plasma metabolomic and proteomic profiling, and immune responses to microbial stimulation, before and after exercise.

We report evidence of an exaggerated innate immune response after exposures to microbial antigens; impaired energy production involving the citric acid cycle, beta-oxidation of fatty acids, and urea cycle energy production from amino acids; systemic inflammation linked with lipid abnormalities; disrupted extracellular matrix homeostasis with release of endogenous ligands that promote inflammation; reduced cell-cell adhesion and associated gut dysbiosis; complement activation; redox imbalance reflected by disturbances in copper-dependent antioxidant pathways and dysregulation of the tryptophan-serotonin-kynurenine pathways.

Many of these underlying abnormalities worsened following exercise in ME/CFS patients, but not in healthy subjects; many abnormalities reinforced each other and several were correlated with the intensity of symptoms. Our findings may inform targeted therapeutic interventions for ME/CFS and PEM.

Source: Che X, Ranjan A, Guo C, Zhang K, Goldsmith R, Levine S, Moneghetti KJ, Zhai Y, Ge L, Mishra N, Hornig M, Bateman L, Klimas NG, Montoya JG, Peterson DL, Klein SL, Fiehn O, Komaroff AL, Lipkin WI. Heightened innate immunity may trigger chronic inflammation, fatigue and post-exertional malaise in ME/CFS. medRxiv [Preprint]. 2025 Jul 24:2025.07.23.25332049. doi: 10.1101/2025.07.23.25332049. PMID: 40778181; PMCID: PMC12330418. https://pmc.ncbi.nlm.nih.gov/articles/PMC12330418/ (Full text available as PDF file)