Job Title: Production Engineer, AS
Location: Pune, India
Purpose Introduction
Deutsche Bank is the leading German bank with strong European roots and a global network. We’re driving digital transformation and have a long-term aspiration to be one of the best banks in the world. As a diverse and inclusive workplace, we’re cultivating a culture of collaboration, innovation, and a strong client focus.
We have a robust, hands-on engineering culture dedicated to continuous learning, knowledge-sharing, technical skill development and networking. We are an essential part of the Bank’s technology platform and develop applications for many important business areas.
Role Description
This role is part of the Finance (FPI)-RFT Production Integrity SRE portfolio. The FPI SRE team is based in India and Cary, US. This role will add to the follow-the-sun support model.
Finance (FPI) IT is an integral function of DB, providing end-to-end support for all Finance-related applications. The current team supports the following Finance domains:
Reporting Tribe
- Risk Close
- Local Reg
- CRC
- Group Reporting
Accounting Tribe
- Accounting Revenue
- Accounting Close
What we’ll offer you
As part of our flexible scheme, here are just some of the benefits that you’ll enjoy:
- Best in class leave policy
- Gender-neutral parental leave
- 100% reimbursement under childcare assistance benefit (gender neutral)
- Sponsorship for industry-relevant certifications and education
- Employee Assistance Program for you and your family members
- Comprehensive Hospitalization Insurance for you and your dependents
- Accident and Term life Insurance
- Complimentary health screening for those aged 35 years and above
Your key responsibilities
- System Reliability: Ensure the reliability, availability, and performance of production systems by implementing best practices in monitoring, alerting, and incident response.
- System Maintenance: Thoroughly understand the end-to-end application support process and escalation procedures, and become fully conversant with all support tools. Maintain an end-to-end view of the application and infrastructure landscape.
- Automation: Develop and maintain automation tools and scripts to streamline deployment, scaling, and operational tasks.
- Incident Management: Act as a primary responder to system outages and incidents, ensuring rapid resolution and thorough post-mortem analysis to prevent recurrence.
- Monitoring & Alerting: Design and implement robust monitoring and alerting systems to proactively identify and address potential issues.
- Performance Optimization: Identify and resolve performance bottlenecks across the stack, from application code to infrastructure.
- Collaboration: Work closely with development teams and other stakeholders to ensure that new features and services are designed with reliability and scalability in mind.
- Documentation: Maintain comprehensive documentation of systems, processes, and procedures to ensure knowledge sharing and continuity.
- Continuous Improvement: Continuously evaluate and improve our infrastructure, tools, and processes to enhance system reliability and operational efficiency.
Your skills and experience
- 5+ years of overall IT experience, with the ability to drive the right level of SRE production engagement and controls within the Change organization and from a production support standpoint.
- Ability to work in a fast-paced environment with competing and alternating priorities, with a constant focus on delivery.
- Ability to balance business demands and IT fulfilment in terms of standardization, reducing risk and increasing IT flexibility.
Tech Stack:
AXIOM Platform Support:
- Strong AXIOM tool knowledge, experience, and debugging skills.
- Provide 1st and 2nd line support for AXIOMSL ControllerView applications, including incident resolution, problem management, and service request fulfillment.
- Monitor AXIOM environments (production) to proactively identify and address potential issues, ensuring high availability and performance.
- Perform root cause analysis for AXIOM application and data-related issues, and support the implementation of permanent fixes and preventative measures.
- Manage and troubleshoot AXIOM data ingestions, transformations (AXIOM Workflow and Expression Builder), report generation, and submission processes.
- Collaborate with AXIOM developers, business analysts, and infrastructure teams to resolve complex technical issues.
- Execute and monitor batch processes, schedules, and dependencies within the AXIOM ecosystem.
- Experience in microservices, Spring Boot, and AngularJS.
- Strong working and scripting experience in Oracle and Unix shell scripting.
- Strong knowledge of Oracle management, SQL scripts and PL/SQL, and performance management.
- Strong understanding of Unix, Linux, and Windows
- Understanding of Agile and SAFe methodologies (preferred)
- Strong experience working with cloud technology, i.e. Google Cloud (preferred)
- Strong scripting experience in Java, Python and Shell
- Solid understanding of messaging middleware like Solace, TIBCO or MQ using JMS
- Solid understanding of monitoring systems like ITRS Geneos, Splunk.
- Spark (basic to intermediate): Know-how of the Spark framework on Hadoop, with some background in the Hadoop ecosystem (cluster, nodes, YARN, Oozie, etc.)
AI Usage & Innovation:
- Proactively identify opportunities to leverage AI/ML tools and techniques to improve production support processes, such as:
  - Predictive analytics for incident prevention: Analyzing historical incident data to anticipate potential system failures.
  - Intelligent log analysis: Utilizing AI-powered tools to quickly pinpoint anomalies and root causes in large log datasets.
  - Automated issue triage and routing: Implementing AI to classify and assign incidents to the appropriate support teams.
  - Knowledge base enhancement: Contributing to and utilizing AI-driven knowledge management systems for faster resolution.
- Work with data engineers and AI specialists to integrate AI solutions into existing support workflows.
- Stay abreast of emerging AI technologies and their potential application in production support.
Technical/Functional Skills:
- DevOps: Experience with Linux/Unix systems and scripting languages such as Python and shell
- Automation with tools such as Ansible, SSH, and Shell
- Monitoring: Experience with the design and implementation of AI/ML and RPA/automation tools, or tools like Geneos, Prometheus, Grafana, etc.
- Problem analysis and solving in multiple layers such as hardware, Linux, networking and application
- Hosting services (PaaS): DHSO, VHS, DAP, DWEB, etc.
- Working knowledge of networks, load balancing, and SSH keys.
- Expertise in Unix command line and shell scripting.
- Deep knowledge of the Incident, Problem and Change Management processes within the ITSM framework; at minimum, must be ITIL V3 Foundation certified. Proficient at using Service Management tools (e.g. ServiceNow, JIRA, etc.) and service monitoring tools.
- Exposure to GCP and monitoring tools such as New Relic is preferred.
- Exposure to the SRE model, and experience executing and maintaining deliveries within it.
Soft Skills:
- Excellent communication and collaboration skills
- Able to adapt to a changing environment and drive change
- Able to successfully interface with various stakeholders
- Self-motivated and delivery-focused, with the ability to work independently where required.
- Able to own and drive solutions by understanding the real issues behind business requirements.
- Committed, with a demonstrated strong sense of ownership.
How we’ll support you
- Training and development to help you excel in your career
- Coaching and support from experts in your team
- A culture of continuous learning to aid progression
- A range of flexible benefits that you can tailor to suit your needs
About us and our teams
Please visit our company website for further information:
https://www.db.com/company/company.html
We strive for a culture in which we are empowered to excel together every day. This includes acting responsibly, thinking commercially, taking initiative and working collaboratively.
Together we share and celebrate the successes of our people. Together we are Deutsche Bank Group.
We welcome applications from all people and promote a positive, fair and inclusive work environment.