Project Role : Data Engineer
Project Role Description : Design, develop and maintain data solutions for data generation, collection, and processing. Create data pipelines, ensure data quality, and implement ETL (extract, transform and load) processes to migrate and deploy data across systems.
Must have skills : Databricks Unified Data Analytics Platform
Good to have skills : Generative AI
Minimum
5 year(s) of experience is required
Educational Qualification : 15 years full time education
Summary:
As a Data Engineer, you will engage in the design, development, and maintenance of data solutions that facilitate data generation, collection, and processing. Your typical day will involve creating efficient data pipelines, ensuring the integrity and quality of data, and implementing ETL processes to seamlessly migrate and deploy data across various systems. You will collaborate with cross-functional teams to understand data requirements and contribute to the overall data strategy, ensuring that the solutions you develop align with organizational goals and enhance data accessibility for stakeholders.
Roles & Responsibilities:
- Expected to perform independently and become an SME.
- Required active participation/contribution in team discussions.
- Contribute in providing solutions to work related problems.
- Assist in the design and implementation of data architecture to support data initiatives.
- Monitor and optimize data pipelines for performance and reliability.
- data processing and pipeline development, Databricks(Delta Lake, Spark jobs, notebooks ), Azure data services, vector databases, embedding pipelines strong for GenAI/RAG, agent orchestration frameworks, streaming (Kafka/Event Hubs) and real-time data products.
Professional & Technical Skills:
- Must To Have Skills: Proficiency in Databricks Unified Data Analytics Platform.
- Good To Have Skills: Experience with Generative AI.
- Strong understanding of data modeling and database design principles.
- Experience with ETL tools and data integration techniques.
- Familiarity with cloud platforms and services related to data storage and processing.
Additional Information:
- The candidate should have minimum 3 years of experience in Databricks Unified Data Analytics Platform.
- This position is based at our Pune office.
- A 15 years full time education is required.