Single-cell Omics

Prognostic Marker Discovery Pipeline

  • Python
  • Metadata Harmonization
  • Survival Analysis
Overview

Developed a semi-automated pipeline to standardize heterogeneous public single-cell datasets, enabling the discovery of prognostic markers across multiple tissue types.

Key highlights
  • Curated and standardized 67 public scRNA-seq datasets across 11 tissue types.
  • Implemented a metadata harmonization module to resolve inconsistencies in public data reporting.
  • Applied Gene Set Variation Analysis (GSVA) and statistical testing to link cell type abundance with patient prognosis.