Job Description
The IT Infrastructure Disaster Recovery (DR) & Business Continuity (BC) Engineer will be responsible for formulating, defining & implementing disaster recovery strategies for complex IT infrastructure to meet Recovery Point & Recovery Time objectives. This includes reviewing configuration data, capacity data, systems performance data, and architectural drawings, as well as assessing physical rack layouts and deploying disaster recovery technology. The engineer will also review every aspect of resiliency and disaster recovery capability, test results, and event post-mortems.
This position will provide the incumbent with an opportunity to accomplish contingency planning goals: Risk Management, Data Center Relocation, and Business Recovery.
The responsibilities also encompass analyzing recovery requirements for business units external to the data center, to ensure timely restoration of business operations or relocation to an adequate alternate facility in the event of a disruption of services.
Principal Responsibilities & Skills:
- Ensure Business Continuity and Disaster Recovery plans are developed in line with business requirements for accounts unit organizations.
- Analyze the impact on, and risk to, essential business functions or information systems to identify recovery time periods and resource requirements.
- Ensure enterprise IT Disaster Recovery capability for application, server, network, and database systems within the Recovery Time Objective.
- Successfully demonstrate, run POC / POT exercises for, and implement replication technology & disaster recovery orchestration.
- Coordinate and participate in customer IT DR drills to ensure timely recovery capability in the event of a disaster.
- Perform risk analyses for corporate functional areas to identify points of vulnerability, and recommend disaster avoidance and reduction strategies.
- Participate in Business Impact Analysis to help determine system criticality and Recovery Time Objectives (RTOs) for business applications.
- Collaborate with the Business Continuity function to test and/or execute Business Continuity Plans in conjunction with the DR plan.
- Write reports to summarize testing activities, including descriptions of goals, planning, execution, results, analysis, conclusions & recommendations.
- Maintain strong relationships with key contacts at the third-party DR data center hosting provider.
- Coordinate all IT DR activities with the contracted DR hosting provider.
Experience:
- 2 years of experience deploying infrastructure systems, backup software, and replication tools, and implementing RPO & RTO strategies
- Deep understanding of IT DR at both a technical and business level
- In-depth knowledge of current best practices and technologies and their DR applications
- Experience in planning and executing enterprise-wide DR testing
- Experience working with and managing relationships with 3rd party DR providers
- Good technical understanding of virtualization platforms such as VMware & MS Hyper-V
- Proven knowledge of installing and configuring snapshot-based and real-time backup & replication technologies
- Knowledge of API integration with database, ERP, and email workloads and virtualization platforms is required
- Basic understanding of storage deployments such as NFS, SAN, and CIFS is highly desirable
- Knowledge of hybrid cloud use cases for backup & DR integration with public cloud providers such as AWS, Azure, and IBM is an added advantage