Project Role : Data Engineer
Project Role Description : Design, develop and maintain data solutions for data generation, collection, and processing. Create data pipelines, ensure data quality, and implement ETL (extract, transform and load) processes to migrate and deploy data across systems.
Must have skills : PySpark
Good to have skills : NA
Minimum
3 year(s) of experience is required
Educational Qualification : 15 years full time education
Summary:
As a Data Engineer, a typical day involves designing, developing, and maintaining comprehensive data solutions that support the generation, collection, and processing of data. The role includes creating efficient data pipelines to facilitate smooth data flow, ensuring the accuracy and quality of data throughout its lifecycle, and implementing processes to extract, transform, and load data across various systems. This position requires continuous collaboration with different teams to optimize data handling and support organizational data needs effectively.
Roles & Responsibilities:
- Expected to perform independently and become an SME.
- Required active participation/contribution in team discussions.
- Contribute in providing solutions to work related problems.
- Collaborate with cross-functional teams to understand data requirements and deliver scalable solutions.
- Monitor and troubleshoot data pipeline issues to maintain system reliability and performance.
- Document data processes and workflows to ensure transparency and knowledge sharing.
- Assist junior team members by providing guidance and support in their tasks.
Professional & Technical Skills:
- Must To Have Skills: Proficiency in PySpark.
- Experience in building and optimizing data pipelines and workflows using PySpark.
- Strong knowledge of data processing frameworks and distributed computing concepts.
- Familiarity with data storage solutions and data modeling techniques.
- Ability to work with large datasets and ensure data quality and integrity.
- Experience in debugging and performance tuning of data processing jobs.
Additional Information:
- The candidate should have minimum 3 years of experience in PySpark.
- This position is based at our Gurugram office.
- A 15 years full time education is required.