Introduction to the Role
We are looking for a Data Engineer (Life Sciences/Biomedical domain) with strong expertise in PySpark, Databricks, and SQL. The role involves building scalable data pipelines, integrating complex biomedical datasets, and enabling advanced analytics for genomics, clinical trials, and research.
Accountabilities
Design and build ETL/ELT pipelines to ingest, transform, and load data from clinical, omics, research, and operational sources.
Optimize performance and scalability of data flows using Apache Spark, Databricks, or AWS Glue.
Collaborate with domain experts in genomics, clinical trials, and lab science to implement robust data solutions.
Develop and maintain data models, schemas, and governance practices for structured and unstructured biomedical data.
Implement data quality checks, lineage, logging, and alerts to ensure reliability and reproducibility.
Work with cloud infrastructure teams to deploy pipelines on AWS, GCP, or Azure.
Contribute to data lake/warehouse solutions (Snowflake, Redshift, Synapse).
Essential Skills / Experience
Strong experience with PySpark, Databricks, and SQL.
Proven expertise in building ETL/ELT pipelines for large-scale datasets.
Experience with cloud platforms (AWS, GCP, Azure).
Solid knowledge of data modelling, schemas, and governance.
Hands-on experience with data lake/warehouse technologies (Snowflake, Redshift, Synapse).
Strong problem-solving and collaboration skills.
Desirable Skills / Experience
Experience working with clinical, genomics, or biomedical data.
Familiarity with data quality frameworks and reproducibility practices.
Exposure to workflow orchestration tools (Airflow, Prefect, dbt).
Understanding of research informatics and healthcare compliance standards.
About Agilisium:
Agilisium is a Life Sciences Industry’s Premier Autonomous Agentic AI Services Company for “Reimagining, developing, and co-developing AI-engineered business processes that are autonomous, scalable, and built for Life Sciences.”
Agilisium delivers AI services for life sciences, helping pharma reimagine how therapies are discovered, developed, and delivered. We build domain-infused, agentic AI solutions across R&D, Clinical, and Commercial to automate workflows and enhance oversight.
Agilisium continues to invest in top-tier talent to fuel innovation, scalability, and impact across the Life Sciences value chain