Work Mode: Remote
Experience: 5+ Years
Notice Period: Immediate to 15 days
Role Summary: We are seeking an experienced Data Engineer with strong Databricks expertise to design, build, and optimize scalable data pipelines and analytics solutions. In this fully remote role, you will collaborate closely with data scientists, analysts, and engineering teams to deliver reliable, high-performance data infrastructure that powers business decisions.
Required Qualifications
- 5+ years of hands-on data engineering experience.
- Strong hands-on experience with Databricks and Apache Spark (PySpark/Scala).
- Proficiency in SQL and Python.
- Experience with Delta Lake and the Lakehouse architecture.
- Working knowledge of at least one cloud platform (AWS, Azure, or GCP).
- Experience with data modeling, warehousing, and pipeline orchestration.
- Familiarity with CI/CD practices and version control (Git).
- Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent experience).
Key Responsibilities
- Design, develop, and maintain ETL/ELT pipelines using Databricks and Apache Spark.
- Build and optimize Delta Lake architectures for both batch and streaming data.
- Develop scalable data models and data warehousing solutions.
- Implement data quality, validation, and monitoring frameworks.
- Optimize Spark jobs for performance and cost efficiency.
- Collaborate with stakeholders to translate business requirements into technical solutions.
- Manage and orchestrate workflows using Databricks Workflows or tools such as Airflow.
- Ensure data security, governance, and compliance across all pipelines.
- Troubleshoot and resolve data pipeline issues in production environments.