We are seeking a highly skilled Senior Data Engineer with 5+ years of experience in building scalable data solutions. The ideal candidate will have strong expertise in SQL, PySpark, ETL processes, Data Lakes, and the Azure data ecosystem.
-
Design, develop, and maintain robust data pipelines using Python/PySpark
-
Build and optimize ETL workflows for large-scale data processing
-
Work with Azure services such as:
-
Azure Blob Storage
-
Azure Data Lake
-
Azure Data Factory
-
Azure Synapse Analytics
-
Ensure high performance and reliability of data systems
-
Collaborate with cross-functional teams to understand data requirements
-
Implement best practices for data governance, security, and quality
-
Use version control tools like Git and manage tasks via Azure DevOps or Jira
-
Provide technical guidance and mentorship to junior team members.
-
Drive design decisions and contribute to data architecture planning.
-
5+ years of experience with SQL
-
4+ years of hands-on experience in building data pipelines using Python/PySpark
-
4+ years of experience with the Azure ETL stack
-
Strong understanding of:
-
Data modeling
-
Distributed computing
-
Data warehousing concepts
-
Experience with code versioning tools (Git)
-
Familiarity with Agile tools such as Azure DevOps or Jira
-
Bachelor’s degree in:
-
B.Sc / BCA / B.Tech / B.E (any specialization)
-
Strong verbal and written communication skills
-
Reliable internet connection (for remote/hybrid roles)
-
Azure certifications are a plus
-
Experience with real-time data processing
-
Knowledge of CI/CD pipelines for data engineering
-
Exposure to big data technologies beyond Azure ecosystem
-
Provide technical guidance and mentorship to junior team members.
-
Drive design decisions and contribute to data architecture planning.