-
Manage and maintain
Windows Server environments
, ensuring high availability, reliability, and security.
-
Develop, implement, and test
Disaster Recovery (DR)
and
Business Continuity
procedures to meet defined
RTO/RPO
objectives.
-
Perform
failover recovery operations
, DR drills, and post-event validation to ensure minimal downtime and data integrity.
-
Conduct regular
database sanity checks
and system health monitoring to prevent performance degradation.
-
Administer and optimize
load balancers
to ensure efficient distribution of network traffic and service availability.
-
Collaborate with
networking teams
to troubleshoot connectivity issues, maintain firewall and routing configurations, and optimize performance.
-
Use
Splunk
for system log analysis, alerting, and proactive issue detection.
-
Automate routine administrative and recovery tasks using
PowerShell, Bash, or Python
scripts.
-
Maintain detailed
documentation
of system configurations, recovery processes, and change management activities.
Operating Systems:
Windows Server (2012/2016/2019/2022)
Disaster Recovery Tools:
Veeam, Azure Site Recovery, Zerto (as applicable)
Monitoring & Logging:
Splunk, SCOM, SolarWinds
Networking:
TCP/IP, DNS, DHCP, Load Balancing, Firewalls
Automation:
PowerShell, Bash, Python
Databases:
MS SQL Server – health checks, backup, and restoration
Virtualization:
VMware, Hyper-V
Backup & Restore:
Implementation, validation, and testing of backup strategies