Databricks architect

Job details

Job Description: Key Role & Responsibilities

Define lakehouse architecture: medallion (bronze/silver/gold) patterns, batch/streaming designs, and multi-workspace strategies.
Design and implement data pipelines using Spark , Delta Lake , and Databricks workflows (Jobs/Workflows, DLT where applicable).
Establish governance and security using Unity Catalog , access controls, lineage, and data quality gates.
Optimise performance: cluster policies, autoscaling, partitioning, file sizing, caching, Spark tuning, and job orchestration.
Build CI/CD and release governance for notebooks, repos, jobs, and infrastructure-as-code.
Integrate Databricks with enterprise ecosystem (cloud storage, event streaming, data warehouse, BI tools).
Conduct solution workshops with customers; provide options and trade-offs; create phased implementation roadmaps aligned to business value.

Must Have

10+ years experience with a strong Data Engineering background (ETL/ELT, distributed compute, production-grade pipelines).
4+ years hands-on Databricks experience in architecture/technical leadership roles.
Strong experience in Apache Spark (PySpark/Scala), Delta Lake , pipeline design, and performance tuning.
Experience with data orchestration and DevOps practices (Git, CI/CD, testing frameworks).
Experience designing secure data platforms (RBAC, secrets, network/security integration, compliance considerations).

Good to Have

Streaming experience (Kafka/Event Hubs, Structured Streaming, CDC patterns).
ML/AI enablement experience (MLflow, feature engineering, model lifecycle) as it relates to platform design.

Education

Bachelor’s/Master’s in Computer Science, Engineering, or related fields.

Key Skills: Databricks, Python, Spark, Data Architecture, Data Pipelines

Responsibilities: Key Role & Responsibilities

Define lakehouse architecture: medallion (bronze/silver/gold) patterns, batch/streaming designs, and multi-workspace strategies.
Design and implement data pipelines using Spark , Delta Lake , and Databricks workflows (Jobs/Workflows, DLT where applicable).
Establish governance and security using Unity Catalog , access controls, lineage, and data quality gates.
Optimise performance: cluster policies, autoscaling, partitioning, file sizing, caching, Spark tuning, and job orchestration.
Build CI/CD and release governance for notebooks, repos, jobs, and infrastructure-as-code.
Integrate Databricks with enterprise ecosystem (cloud storage, event streaming, data warehouse, BI tools).
Conduct solution workshops with customers; provide options and trade-offs; create phased implementation roadmaps aligned to business value.

Must Have

10+ years experience with a strong Data Engineering background (ETL/ELT, distributed compute, production-grade pipelines).
4+ years hands-on Databricks experience in architecture/technical leadership roles.
Strong experience in Apache Spark (PySpark/Scala), Delta Lake , pipeline design, and performance tuning.
Experience with data orchestration and DevOps practices (Git, CI/CD, testing frameworks).
Experience designing secure data platforms (RBAC, secrets, network/security integration, compliance considerations).

Good to Have

Streaming experience (Kafka/Event Hubs, Structured Streaming, CDC patterns).
ML/AI enablement experience (MLflow, feature engineering, model lifecycle) as it relates to platform design.

Education

Bachelor’s/Master’s in Computer Science, Engineering, or related fields.

Key Skills: Databricks, Python, Spark, Data Architecture, Data Pipelines

Qualifications: Key Role & Responsibilities

Define lakehouse architecture: medallion (bronze/silver/gold) patterns, batch/streaming designs, and multi-workspace strategies.
Design and implement data pipelines using Spark , Delta Lake , and Databricks workflows (Jobs/Workflows, DLT where applicable).
Establish governance and security using Unity Catalog , access controls, lineage, and data quality gates.
Optimise performance: cluster policies, autoscaling, partitioning, file sizing, caching, Spark tuning, and job orchestration.
Build CI/CD and release governance for notebooks, repos, jobs, and infrastructure-as-code.
Integrate Databricks with enterprise ecosystem (cloud storage, event streaming, data warehouse, BI tools).
Conduct solution workshops with customers; provide options and trade-offs; create phased implementation roadmaps aligned to business value.

Must Have

10+ years experience with a strong Data Engineering background (ETL/ELT, distributed compute, production-grade pipelines).
4+ years hands-on Databricks experience in architecture/technical leadership roles.
Strong experience in Apache Spark (PySpark/Scala), Delta Lake , pipeline design, and performance tuning.
Experience with data orchestration and DevOps practices (Git, CI/CD, testing frameworks).
Experience designing secure data platforms (RBAC, secrets, network/security integration, compliance considerations).

Good to Have

Streaming experience (Kafka/Event Hubs, Structured Streaming, CDC patterns).
ML/AI enablement experience (MLflow, feature engineering, model lifecycle) as it relates to platform design.

Education

Bachelor’s/Master’s in Computer Science, Engineering, or related fields.

Key Skills: Databricks, Python, Spark, Data Architecture, Data Pipelines