Job Description:
Design and develop scalable data pipelines using Databricks and SQL for analytics and reporting.
Experience: 5–8 years
Key Responsibilities:
Build ETL/ELT pipelines using Databricks (PySpark)
Write optimized SQL queries for data transformation and analysis
Work with Delta Lake / Lakehouse architecture
Perform data ingestion, cleansing, and validation
Optimize query performance and data workflows
Skills Required:
Databricks, Apache Spark
Strong SQL expertise
Python / PySpark
Data modeling concepts
Cloud platforms (Azure/AWS/GCP)
Good to Have:
Azure Data Factory / Airflow
Power BI / Tableau
Databricks certification