Project Role : Data Engineer
Project Role Description : Design, develop and maintain data solutions for data generation, collection, and processing. Create data pipelines, ensure data quality, and implement ETL (extract, transform and load) processes to migrate and deploy data across systems.
Must have skills : PySpark
Good to have skills : NA
Minimum 3 year(s) of experience is required
Educational Qualification : 15 years of full-time education
Summary:
As a Data Engineer, a typical day involves designing, developing, and maintaining comprehensive data solutions that support the generation, collection, and processing of data. This role includes creating efficient data pipelines and ensuring the seamless migration and deployment of data across various systems. The position requires continuous attention to data quality and the implementation of extract, transform, and load processes to facilitate smooth data flow and integration within the organization's infrastructure. Collaboration with different teams to support data-driven initiatives is also a key aspect of daily activities.
Roles & Responsibilities:
- Expected to perform independently and become an SME.
- Actively participate in and contribute to team discussions.
- Contribute to solving work-related problems.
- Collaborate with cross-functional teams to understand data requirements and deliver scalable data solutions.
- Monitor and optimize data pipelines to ensure high performance and reliability.
- Document data processes and workflows to maintain clarity and support knowledge sharing.
- Assist junior team members by providing guidance and support in their tasks.
Professional & Technical Skills:
- Must-have skills: Proficiency in PySpark.
- Experience in building and managing data pipelines and workflows.
- Strong knowledge of data processing frameworks and distributed computing.
- Ability to troubleshoot and optimize large-scale data processing jobs.
- Familiarity with data storage solutions and data integration techniques.
- Understanding of data quality assurance and validation methods.
Additional Information:
- The candidate should have a minimum of 3 years of experience in PySpark.
- This position is based at our Hyderabad office.
- 15 years of full-time education is required.