Project Role : Data Engineer
Project Role Description : Design, develop and maintain data solutions for data generation, collection, and processing. Create data pipelines, ensure data quality, and implement ETL (extract, transform and load) processes to migrate and deploy data across systems.
Must have skills : PySpark
Good to have skills : NA
Minimum
5 year(s) of experience is required
Educational Qualification : 15 years full time education
Summary:
As a Data Engineer, a typical day involves designing, developing, and maintaining comprehensive data solutions that support the generation, collection, and processing of data. The role includes creating efficient data pipelines and ensuring the integrity and quality of data throughout its lifecycle. You will be responsible for implementing processes that extract, transform, and load data to facilitate seamless migration and deployment across various systems. This position requires continuous collaboration with different teams to optimize data workflows and support organizational data needs effectively.
Roles & Responsibilities:
- Expected to be an SME, collaborate and manage the team to perform.
- Responsible for team decisions.
- Engage with multiple teams and contribute on key decisions.
- Provide solutions to problems for their immediate team and across multiple teams.
- Lead efforts to identify and resolve data-related challenges to improve overall system performance.
- Mentor junior team members to enhance their technical skills and understanding of data engineering practices.
- Coordinate with stakeholders to align data engineering activities with business objectives.
Professional & Technical Skills:
- Must To Have Skills: Proficiency in PySpark.
- Experience in building scalable data pipelines and implementing ETL processes.
- Strong knowledge of distributed computing frameworks and big data technologies.
- Ability to optimize data workflows for performance and reliability.
- Familiarity with data storage solutions and data modeling techniques.
- Experience with debugging and troubleshooting complex data issues.
Additional Information:
- The candidate should have minimum 5 years of experience in PySpark.
- This position is based at our Bengaluru office.
- A 15 years full time education is required.