Project Role : Application Support Engineer
Project Role Description : Act as software detectives, provide a dynamic service identifying and solving issues within multiple components of critical business systems.
Must have skills : Enterprise Systems Monitoring Tools
Good to have skills : Splunk Administration, Dynatrace Administration, Generative AI
Minimum
12 year(s) of experience is required
Educational Qualification : 15 years full time education
Summary:
Owns and governs enterprise production Run operations, ensuring service stability, SLA compliance, and operational resilience across application and infrastructure landscapes.
Applies an AI first Run mindset, leveraging GenAI, AIOps, and agentic automation to reduce incidents, eliminate noise, and enable self healing while operating strictly within ITSM and governance frameworks.
Focuses on measurable outcomes, not tool deployment alone.
Roles & Responsibilities:
- Expected to be an SME.
- Own end to end Run operations across production environments with client accountability
- Lead major incident management, including coordination, diagnosis, remediation, and recovery
- Operate within ITSM processes (Incident, Problem, Change) and change windows
- Apply observability led operations using metrics, logs, traces, and events to prevent incidents.
- Leverage AIOps and agentic AI for incident reduction, noise elimination, and self healing
- Break Run workflows (incident diagnosis remediation validation) into agent driven, orchestrated steps
- Drive outcome based improvements (MTTR reduction, ticket deflection, automation coverage)
Professional & Technical Skills:
- Strong hands on experience running enterprise production environments with SLAs and major incidents.
- Experience with enterprise observability / operations tools, such as:
o Splunk (Enterprise / ITSI / Observability Cloud)
o Dynatrace / AppDynamics
o SolarWinds
o Datadog / New Relic / Grafana / Prometheus ecosystem
o Metrics, logs, traces, events
o Alerting strategies and noise reduction
o Event correlation and predictive incident prevention
- Practical exposure to GenAI and AIOps in Run, including:
o Incident summarization
o RCA assistance
o Runbook and knowledge augmentation
- Ability to work with ambiguity and incomplete data in complex Run environments.
- Resource needs to be AI Ready.
Additional Information:
- The candidate should have minimum 12 years of experience in Enterprise Systems Monitoring Tools.
- This position is based at our Bengaluru office.
- A 15 years full time education is required.
- Operates independently with accountability for complex Run operations.
- Influences adoption of AI first operating models at scale.