Azure Data Engineer
Req number: R7830
Employment type: Full time
Worksite flexibility: Hybrid
CAI is a global services firm with over 9,000 associates worldwide and annual revenue of more than $1.3 billion. We have over 40 years of excellence in uniting talent and technology to power the possible for our clients, colleagues, and communities. As a privately held company, we have the freedom and focus to do what is right, whatever it takes. Our tailor-made solutions create lasting results across the public and commercial sectors, and we are trailblazers in bringing neurodiversity to the enterprise.
Job Summary
We are looking for a motivated Azure Data Engineer ready to take us to the next level! If you have experience building data products using Azure Databricks, Python, and PySpark and are looking for your next career move, apply now.
Job Description
We are looking for an Azure Data Engineer with 5 to 8 years of experience building data products using Databricks and related technologies. This is a full-time, hybrid/remote position based in Bangalore, India.
What You’ll Do
- Design, develop, and maintain data lakes and data pipelines on Azure using ETL frameworks and Databricks
- Integrate and transform large-scale data from multiple heterogeneous sources into a centralized data lake environment
- Implement and manage Delta Lake architecture using Databricks Delta or Apache Hudi
- Develop end-to-end data workflows using PySpark, Databricks Notebooks, and Python scripts for ingestion, transformation, and enrichment
- Design and develop data warehouses and data marts for analytical workloads using Snowflake, Redshift, or similar systems
- Design and evaluate data models (Star, Snowflake, Flattened) for analytical and transactional systems
- Optimize data storage, query performance, and cost across the Azure and Databricks ecosystem
- Build and maintain CI/CD pipelines for Databricks notebooks, jobs, and Python-based data processing scripts
- Collaborate with data scientists, analysts, and stakeholders to deliver high-performance, reusable data assets
- Maintain and manage code repositories (Git) and promote best practices in version control, testing, and deployment
- Participate in making major technical and architectural decisions for data engineering initiatives
- Monitor and troubleshoot Databricks clusters, Spark jobs, and ETL processes for performance and reliability
- Coordinate with business and technical teams through all phases of the software development life cycle
What You’ll Need
Required:
- 5+ years of experience building and managing data lake architecture on Azure Cloud
- 3+ years of experience with Azure Data services such as Data Factory, Azure Databricks, PySpark, Azure Synapse, and ADLS Gen2
- 3+ years of experience building Data Warehouses on Snowflake, Redshift, HANA, Teradata, or Exasol
- 3+ years of hands-on experience working with Apache Spark or PySpark on Databricks
- 3+ years of experience implementing Delta Lakes using Databricks Delta or Apache Hudi
- 3+ years of experience in ETL development using Databricks, AWS Glue, or other modern frameworks
- Proficiency in Python for data engineering, automation, and API integrations
- Experience in Databricks Jobs, Workflows, and Cluster Management
- Experience with CI/CD pipelines and Infrastructure as Code (IaC) tools like Terraform or CloudFormation is a plus
- Bachelor’s degree in Computer Science, Information Technology, Data Science, or a related field
Physical Demands
- Ability to safely and successfully perform the essential job functions
- Sedentary work that involves sitting or remaining stationary most of the time with occasional need to move around the office to attend meetings, etc.
- Ability to conduct repetitive tasks on a computer, utilizing a mouse, keyboard, and monitor
Reasonable accommodation statement
If you require a reasonable accommodation in completing this application, interviewing, completing any pre-employment testing, or otherwise participating in the employment selection process, please direct your inquiries to [email protected] or (888) 824-8111.