Project Role : Data Engineer
Project Role Description : Design, develop and maintain data solutions for data generation, collection, and processing. Create data pipelines, ensure data quality, and implement ETL (extract, transform and load) processes to migrate and deploy data across systems.
Must have skills : Databricks Unified Data Analytics Platform
Good to have skills : Microsoft Azure Databricks, Microsoft Azure Analytics Services, PySpark
Minimum
3 year(s) of experience is required
Educational Qualification : 15 years full time education
Summary:
As a Data Engineer, a typical day involves designing, developing, and maintaining comprehensive data solutions that support the generation, collection, and processing of data. This role includes creating efficient data pipelines to facilitate smooth data flow, ensuring the accuracy and quality of data throughout its lifecycle, and implementing extract, transform, and load processes to enable seamless migration and deployment of data across various systems. The position requires continuous collaboration with different teams to optimize data handling and support organizational data needs effectively.
Roles & Responsibilities:
- Expected to perform independently and become an SME.
- Required active participation/contribution in team discussions.
- Contribute in providing solutions to work related problems.
- Collaborate with cross-functional teams to understand data requirements and deliver scalable data solutions.
- Monitor and troubleshoot data pipelines to ensure reliability and performance.
- Document data processes and workflows to maintain clarity and support knowledge sharing.
- Assist junior team members in understanding project requirements and best practices.
Professional & Technical Skills:
- Must To Have Skills: Proficiency in Databricks Unified Data Analytics Platform, Microsoft Azure Databricks, Microsoft Azure Analytics Services, PySpark.
- Good To Have Skills: Experience with Microsoft Azure Databricks, Microsoft Azure Analytics Services, PySpark.
- Strong knowledge of data pipeline architecture and ETL process design.
- Experience in optimizing data workflows for performance and scalability.
- Familiarity with cloud-based data platforms and analytics services.
- Ability to work with large datasets and ensure data integrity throughout processing.
Additional Information:
- The candidate should have minimum 3 years of experience in Databricks Unified Data Analytics Platform.
- This position is based at our Bengaluru office.
- A 15 years full time education is required.