Overall, 3- 7 years of relevant experience in Data Warehousing, Data management projects with some experience in the Pharma domain.
We are hiring for following roles across Data management tech stacks -
Data Engineer - Advanced knowledge of PySpark ,python, pandas, numpy frameworks.
- Minimum 3 years of extensive experience in design, build and deployment of Spark/Pyspark - for data integration.
- Deep experience in developing data processing tasks using pySpark such as reading data from external sources, merge data, perform data enrichment and load in to target data destinations
- Create Spark jobs for data transformation and aggregation
AWS Infra - 5-8 years of experience & should have hands-on experience working on strong expertise in AWS IAM, EKS (deep expertise), S3, EC2, and Cost Monitoring.
Up-to-date with recent AWS service trends and best practices
Kubernetes/EKS cluster setup, scaling, monitoring, troubleshooting & Karpenter implementation
Experience with Terraform for infrastructure automation