Location: Hyderabad, India
Work Mode: Work from Office
Shift: 24x7 Rotational Shifts
Experience: 2–5 Years
We are looking for a proactive and detail-oriented Support Engineer to manage and support data platforms and client solutions across cloud and on-premises environments. The role involves working with modern data technologies such as AWS, Databricks, GCP, and Spark to ensure system reliability, performance, and uptime in a 24x7 support model.
- Provide L1/L2 production support for data platforms and applications in cloud and on-prem environments
- Monitor systems and ensure high availability, performance, and reliability
- Troubleshoot incidents, perform root cause analysis (RCA), and implement fixes
- Analyze logs and system metrics to identify and resolve issues proactively
- Support deployment, configuration, and enhancements of data solutions
- Collaborate with Cloud, DevOps, and Engineering teams for issue resolution and system improvements
- Participate in Major Incident Management, including real-time response, escalation, and stakeholder communication
- Maintain documentation for incidents, known errors, and operational procedures
- Ensure adherence to SLAs, KPIs, and operational processes
- 2+ years of experience in Production Support, Data Engineering, or Cloud Support roles
- Hands-on experience with at least one of the following: AWS, GCP, Databricks, Spark, or Snowflake
- Experience working in cloud-based and distributed systems environments
- Strong troubleshooting, debugging, and analytical skills
- Scripting experience in Python or Perl
- Familiarity with monitoring tools and incident management workflows
- Experience with Data Lakes, Data Warehousing, or Big Data ecosystems
- Exposure to workflow orchestration tools such as Airflow, dbt, or Dagster
- Understanding of data architecture and pipeline design
- Knowledge of CI/CD pipelines and DevOps practices
- Awareness of cloud cost optimization and operational efficiency
- Ability to work effectively in 24x7 rotational shifts (including weekends/holidays)
- Strong ownership mindset with the ability to handle critical production incidents
- Ability to work in a fast-paced, high-availability 24x7 support environment
- Excellent communication and stakeholder management skills
- Proactive problem-solving approach with attention to detail