Databricks Engineer

METADESIGN SOLUTIONS -
Gurugram, Haryana

Apply Now

Job details

Full-time

Benefits

Paid time off

Qualifications

CI/CD
Performance tuning
Power BI
Azure
Computer Science
Big data
Spark
Microsoft SQL Server
Git
Tableau
Master's degree
SQL
AWS
Analysis skills
Bachelor's degree
Continuous integration
REST
Scripting
Unity
GitHub
APIs
ETL
S3
Kafka
Metadata
SSIS
Leadership
Jenkins
Data warehouse
Python
High availability
Analytics
Information Technology

Full job description

Position Overview

Job Title: Databricks Engineer (7+ Years Experience) — MetaDesignSolutions
Location: Gurugram, Haryana, India (On-site) — Employment Type: Full-Time

MetaDesignSolutions is a technology consulting and digital transformation firm focused on delivering cloud-native data engineering, analytics, and AI-enabled solutions to enterprise clients across industries. We partner with customers to modernize data platforms, accelerate time-to-insight, and enable governed, scalable data architectures. The Databricks Engineer will be a senior contributor responsible for designing, implementing, and optimizing large-scale data solutions on Databricks and associated cloud services to support the company’s analytics and cloud migration engagements.

Key Responsibilities

Design, develop, and maintain scalable, resilient data pipelines and ETL/ELT workflows using Databricks and Apache Spark to ingest, transform, and curate large volumes of structured and semi-structured data.
Migrate legacy SSIS packages and SQL-based ETL processes to cloud-native Databricks and managed data services (Azure/AWS), ensuring functional parity, performance improvements, and maintainability.
Develop and maintain Databricks notebooks, jobs, Delta Lake tables, and orchestration workflows to support data lakes, data warehouse ingestion, and downstream analytics.
Integrate data from multiple sources including relational databases, cloud object stores, REST APIs, message queues, and flat files; implement robust error handling and retry logic.
Perform performance tuning and cost optimization of Spark jobs, cluster configurations, and SQL queries; implement partitioning, caching, and resource controls for efficient processing.
Implement data quality frameworks, monitoring, alerting, and observability for pipelines; establish automated validation, reconciliation, and lineage tracking in collaboration with data governance teams.
Work independently on end-to-end delivery of data engineering projects, collaborate with architects, business analysts, and stakeholders to translate requirements into pragmatic technical solutions, and participate in design reviews.
Participate in release management, CI/CD for data artifacts, and provide production support, troubleshooting, and operational runbooks to ensure high availability and SLAs.

Required Qualifications

Minimum 7 years of experience in Data Engineering, ETL development, or related roles with proven delivery on enterprise data programs.
Strong hands-on experience with Databricks and Apache Spark; demonstrated ability to build production-grade Spark jobs and optimize job performance.
Proficiency in SQL and Python (PySpark) for data transformation, analysis, and scripting.
Experience with SSIS packages and proven experience migrating SSIS/legacy ETL to cloud-native platforms.
Solid understanding of data warehousing concepts, dimensional modeling, and best practices for data lakes and lakehouse architectures.
Hands-on experience with cloud data services such as Azure Data Factory, Azure Databricks, AWS Glue, S3/ADLS, and related security/configuration patterns.
Familiarity with Delta Lake, Unity Catalog, ACID transactions, and principles of data governance and access control.
Demonstrated skills in performance tuning Spark jobs, cluster sizing, and query optimization; experience with monitoring and profiling tools.
Bachelor’s or Master’s degree in Computer Science, Information Technology, Engineering, or a related field.
Strong analytical, problem-solving abilities and the capacity to work independently with minimal supervision while communicating effectively with technical and non-technical stakeholders.

Preferred Qualifications

Experience implementing CI/CD pipelines for data engineering artifacts using tools such as Azure DevOps, GitHub Actions, Jenkins, or Bitbucket Pipelines.
Exposure to streaming and real-time processing frameworks such as Kafka, Structured Streaming, or Kinesis.
Familiarity with BI tools such as Power BI or Tableau and an understanding of downstream reporting/consumption patterns.
Certifications such as Databricks Certified Associate/Professional, Microsoft Azure Data Engineer, or AWS Big Data/Analytics certifications.
Experience with data cataloging, metadata management, and data lineage tools.
Previous consulting or client-facing experience delivering cloud migration and data modernization engagements.

What We Offer

Competitive salary commensurate with experience and market standards; comprehensive compensation package.
Full-time, on-site role based in Gurugram with opportunities to work on strategic, high-impact data modernization projects across industries.
Health insurance and employee benefits aligned with company policy; paid time off and statutory benefits.
Professional development support including training, certification sponsorship, and mentorship to advance technical and leadership skills.
Exposure to modern data architectures, large-scale datasets, and best-in-class tools and platforms to accelerate career growth.
Collaborative and inclusive culture with emphasis on knowledge sharing, technical excellence, and client-focused delivery.
Opportunities to contribute to architecture design, influence technical direction, and mentor junior engineers.

Apply Now

Position Overview

Key Responsibilities

Required Qualifications

Preferred Qualifications

What We Offer

Jobseeker tools

Employer Tools

Browse

Stay Connected