Project Role : Data Engineer
Project Role Description : Design, develop and maintain data solutions for data generation, collection, and processing. Create data pipelines, ensure data quality, and implement ETL (extract, transform and load) processes to migrate and deploy data across systems.
Must have skills : Databricks Unified Data Analytics Platform
Good to have skills : NA
Minimum 7.5 year(s) of experience is required
Educational Qualification : 15 years full time education
Summary:
As a Data Platform Architect, you will be responsible for architecting the data platform blueprint and implementing the design, including the relevant components of the data platform. Your typical day will involve collaborating with Integration Architects and Data Architects to ensure cohesive integration between systems and data models, thereby enhancing the overall functionality and efficiency of the data architecture.
Job Description: Databricks Unity Catalog Lead
**Role Overview**
The Databricks Unity Catalog Lead will be responsible for architecting and managing centralized data governance across enterprise Databricks workspaces on **Azure Cloud**. This role ensures secure, compliant, and scalable data provisioning aligned with business requirements and enterprise standards.
**Key Responsibilities**
- Implement data provisioning patterns based on business requirements, following enterprise policies and metadata management rules.
- Design and configure Unity Catalog metastores with **Azure Data Lake Storage Gen2** as root storage.
- Define and enforce workspace policies, cluster provisioning, and infrastructure sizing on **Azure Databricks**.
- Implement fine-grained access control (catalogs, schemas, tables, volumes) using ANSI SQL.
- Manage external locations, storage credentials, and service principals for secure data access.
- Synchronize and manage identities (users, groups, service principals) with **Azure Active Directory**.
- Monitor audit logs, track data lineage, and ensure compliance with governance frameworks.
- Configure secure data sharing (Delta Sharing) with external partners.
- Collaborate with architecture, security, and governance teams to navigate enterprise approval processes.
- Operationalize ML models in batch and real-time pipelines with appropriate governance setups.
**Required Skills & Qualifications**
- **8–12 years of overall IT experience**, with at least **5 years of hands-on experience in Azure Databricks and Unity Catalog**.
- Strong expertise in **ANSI SQL** for access control and policy enforcement.
- Proficiency in **Python scripting** and automation for data pipeline management.
- Deep knowledge of **Azure cloud services** (ADLS Gen2, Key Vault, Azure AD, IAM).
- Familiarity with **Privacera** for data security and access control.
- Experience with metadata management, lineage, and cataloging frameworks (Collibra preferred).
- Experience with **Terraform** or ARM templates for Infrastructure as Code (IaC).
- Strong understanding of enterprise data governance, compliance (GDPR/PII), and security standards.
**Preferred Qualifications**
- Databricks Certified Data Engineer or Architect certification.
- Microsoft Certified: **Azure Solutions Architect Expert** or **Azure Data Engineer Associate**.
- Experience with Collibra or similar enterprise data governance tools.
- Prior experience leading enterprise-scale data governance initiatives.
- Strong stakeholder management and communication skills.
Roles & Responsibilities:
- Expected to be an SME.
- Collaborate with and manage the team to perform effectively.
- Responsible for team decisions.
- Engage with multiple teams and contribute to key decisions.
- Provide solutions to problems for their immediate team and across multiple teams.
- Facilitate knowledge sharing sessions to enhance team capabilities.
- Monitor and evaluate team performance to ensure alignment with project goals.
Professional & Technical Skills:
- Must Have Skills: Proficiency in Databricks Unity Catalog.
- Good To Have Skills: Experience with data governance frameworks.
- Strong understanding of data modeling techniques.
- Familiarity with cloud-based data storage solutions.
- Experience in implementing data security measures.
- 15 years full time education is required.