Role Overview:
We are seeking an experienced Team Lead – Data Centre Operations to manage and oversee day-to-day data centre activities, ensuring high availability, reliability, and performance of IT infrastructure.
The role involves leading a team of engineers, coordinating incident management, ensuring compliance with operational procedures, and maintaining data centre standards.
Location: Mumbai
Experience: 6+ Years
Key Responsibilities:
- Lead and manage a team of L1/L2 Data Centre Engineers and ensure smooth shift operations.
- Monitor and maintain data centre infrastructure including servers, storage, networking, power, and cooling systems.
- Ensure 24x7 uptime and availability of critical infrastructure services.
- Act as the first point of escalation for complex technical incidents and coordinate with L3 teams or vendors.
- Plan and oversee rack & stack activities, server deployments, and infrastructure upgrades.
- Monitor environmental systems such as power, UPS, cooling, and fire suppression.
- Maintain and enforce data centre operational procedures and compliance standards.
- Manage incident, problem, and change management processes.
- Coordinate with vendors and third-party service providers for maintenance and support.
- Ensure accurate documentation, asset management, and reporting of data centre operations.
- Conduct team training, mentoring, and performance monitoring.
- Support capacity planning, infrastructure optimization, and preventive maintenance activities.
Technical Expertise / Required Skills:
- Strong knowledge of Data Centre Operations and Infrastructure Management.
- Hands-on experience with server hardware, storage systems, and networking devices.
- Experience with data centre monitoring tools and incident management systems.
- Strong knowledge of Linux and Windows server environments.
- Good understanding of networking protocols, VLANs, routing, switching, and firewall basics.
- Experience in virtualization technologies such as VMware vSphere or Microsoft Hyper-V.
- Familiarity with remote management tools (iLO, iDRAC, IPMI).
- Experience with monitoring tools like Nagios, Zabbix, or SolarWinds.
- Understanding of data centre power systems, UPS, cooling systems, and physical infrastructure.
- Experience in incident escalation, root cause analysis, and problem resolution.
Leadership & Management Skills:
- Ability to lead and mentor technical teams.
- Strong incident management and decision-making ability.
- Excellent communication and coordination skills.
- Experience managing shift-based operations in a 24x7 environment.
- Ability to handle high-pressure situations and critical outages.
Preferred Certifications:
- Cisco Certified Network Associate (CCNA)
- VMware Certified Professional (VCP)
- Red Hat Certified System Administrator (RHCSA)
- ITIL Foundation Certification
- CompTIA Server+
Educational Qualification:
Bachelor’s Degree in Computer Science / Information Technology / Electronics / related discipline.
About Us:
Zybysis is a fast-growing technology company delivering enterprise infrastructure, data centre, and cybersecurity solutions to clients.
With a strong focus on reliability, security, and operational excellence, we support mission-critical environments for enterprise customers. Our teams combine technical expertise with strong service delivery to ensure high availability and performance across complex IT environments.