- Translate clinical rules designed by subject-matter experts (SMEs) into scalable, reproducible data pipelines that run against centralized data lakes.
- Engineer patient-level features using medical claims, pharmacy claims, laboratory results, and NLP-derived outputs.
- Build and maintain disease-specific datasets covering cohort construction, index dating, treatment sequencing, and clinical event labeling.
- Develop and apply Line of Therapy (LOT) algorithms that handle combination regimens, treatment gaps, dose modifications, and off-label usage.
- Integrate diagnosis mentions, staging information, biomarker results, and progression indicators from NLP outputs into structured datasets.
- Produce data quality reports and perform sample-level audits to validate clinical logic against source data.
- Collaborate closely with clinical SMEs to identify anomalies, refine business rules, and improve dataset accuracy.
Pay: ₹30,000.00 - ₹60,000.00 per month
Work Location: Remote