Project Role : Data Engineer
Project Role Description : Design, develop and maintain data solutions for data generation, collection, and processing. Create data pipelines, ensure data quality, and implement ETL (extract, transform and load) processes to migrate and deploy data across systems.
Must have skills : PySpark
Good to have skills : NA
Minimum 12 year(s) of experience is required
Educational Qualification : 15 years of full-time education
Summary:
As a Data Engineer, your typical day involves designing, developing, and maintaining comprehensive data solutions that support the generation, collection, and processing of data. You will be responsible for creating efficient data pipelines that facilitate smooth data flow and ensure the integrity and quality of data throughout its lifecycle. Your role includes implementing processes to extract, transform, and load data, enabling seamless migration and deployment across various systems. This position requires a proactive approach to managing data infrastructure and collaborating with different teams to support organizational data needs effectively.
Roles & Responsibilities:
- Expected to be an SME who collaborates with and manages the team to perform.
- Responsible for team decisions.
- Engage with multiple teams and contribute to key decisions.
- Expected to provide solutions to problems that apply across multiple teams.
- Lead the design and implementation of scalable data architectures to support business requirements.
- Mentor junior team members and support their professional growth within the data engineering domain.
- Coordinate with stakeholders to understand data needs and translate them into technical solutions.
Professional & Technical Skills:
- Must-Have Skills: Proficiency in PySpark.
- Strong experience in building and optimizing data pipelines and workflows using PySpark.
- In-depth knowledge of data processing frameworks and distributed computing concepts.
- Ability to troubleshoot and resolve complex data-related issues efficiently.
- Familiarity with cloud-based data platforms and storage solutions.
- Experience in performance tuning and ensuring data quality and consistency across systems.
Additional Information:
- The candidate should have a minimum of 12 years of experience in PySpark.
- This position is based at our Bengaluru office.
- 15 years of full-time education is required.