Roles & Responsibilities – Cloud & Infrastructure Operations (Azure + On-Prem)
Role name: Azure and Windows Infrastructure Specialist
Experience: 6-10 years
Mission Statement Enable reliable, secure, and cost-efficient IT services by owning the end-to-end operation of our Azure and on-premises infrastructure. As an Azure and Windows Infrastructure Specialist, this role ensures high availability, performance, and resilience of compute, storage, networking, and identity platforms while maintaining strong security, compliance, and disaster recovery posture. Through proactive monitoring, structured problem-solving, and close collaboration with service desk, security, and application teams, the role continuously optimizes our hybrid environment so that business users and development teams experience stable, scalable, and easy-to-consume IT services.
1. Infrastructure & Compute Management
Manage Azure Virtual Machines and on-premises servers (Windows/Linux)
Perform system sizing, optimization, patching, and lifecycle management
Maintain physical server hardware (CPU, storage, firmware, RAID)
Ensure high availability using Azure availability sets/zones and clustering
Monitor system performance, uptime, and capacity
Plan and execute infrastructure upgrades and migrations
2. Operating System Administration
Install, configure, and maintain Windows and Linux operating systems
Perform regular patching and security updates across environments
Manage services, processes, and system configurations
Apply OS hardening in line with security policies
Ensure consistency across hybrid environments
3. Networking & Hybrid Connectivity
Design and manage Azure VNets, subnets, and IP addressing
Configure VPN and ExpressRoute for hybrid connectivity
Manage DNS, DHCP, and name resolution services
Administer Azure Firewall, NSGs, Load Balancers, and network policies
Troubleshoot connectivity, performance, and routing issues
Ensure secure communication across cloud and on-prem environments
4. Identity & Access Management
Manage Active Directory and Azure AD (users, groups, GPOs)
Configure and maintain Azure AD Connect (hybrid identity)
Implement RBAC, PIM, and least-privilege access
Support SSO, MFA, and Conditional Access policies
Handle user lifecycle management and access governance
Troubleshoot identity and authentication issues
5. Security & Compliance
Implement security best practices across infrastructure platforms
Maintain compliance with organizational policies and standards
Manage endpoint protection, encryption, and certificate lifecycle
Monitor security alerts, logs, and vulnerabilities
Support audit activities and remediation initiatives
6. Backup, Restore & Disaster Recovery
Configure and maintain backup solutions (Azure and on-prem)
Perform periodic restore tests to validate recoverability
Implement disaster recovery and business continuity strategies
Ensure compliance with backup and retention policies
Maintain recovery documentation and procedures
7. Cloud Architecture & Solution Design
Support design of scalable and secure Azure architectures
Contribute to landing zones, hub-spoke network models, and governance
Align infrastructure designs with business requirements
Evaluate new technologies and recommend improvements
Ensure performance, resiliency, and scalability of solutions
8. Storage & Data Management
Manage Azure storage services and on-prem SAN/NAS systems
Monitor storage capacity, performance, and growth trends
Configure file shares, permissions, and quotas
Ensure data availability, integrity, and security
9. Monitoring & Performance Optimization
Use tools such as Azure Monitor, PRTG, and system logs
Analyze performance metrics across compute, network, and storage
Identify bottlenecks and implement optimization strategies
Maintain high availability and minimize downtime
Implement proactive alerting mechanisms
10. Cost Management & Optimization (Azure)
Monitor cloud consumption and usage trends
Optimize resources through right-sizing and scaling strategies
Identify unused or underutilized services
Prepare cost reports, forecasts, and recommendations
Implement tagging and governance for cost control
11. Incident Management & Troubleshooting
Monitor infrastructure and respond to incidents and alerts
Diagnose and resolve issues across cloud and on-prem platforms
Perform root cause analysis (RCA)
Manage escalations and coordinate with vendors
Maintain incident documentation and improvement actions
12. Documentation & Change Management
Maintain system documentation, architecture diagrams, and runbooks
Follow formal change management processes (RFCs, approvals, rollback plans)
Document configurations, procedures, and recovery steps
Ensure audit readiness and compliance documentation
Contribute to knowledge sharing and training initiatives
Required Skills & Qualifications Technical Skills
Strong experience with Microsoft Azure (VMs, Networking, Security, Backup)
Experience managing Windows/Linux servers in on-prem environments
Knowledge of Active Directory, Azure AD, and identity management
Hands-on experience with hybrid networking (VPN/ExpressRoute)
Experience with monitoring tools (Azure Monitor, PRTG, SCOM)
Understanding of security frameworks and best practices
Azure Fundamentals certification (Exam AZ-900) Preferred Skills
Experience with automation (PowerShell, ARM, Bicep, Terraform)
Knowledge of Azure cost management and optimization strategies
Familiarity with virtualization platforms (VMware/Hyper-V)
Experience with enterprise backup and DR solutions Soft Skills
Analytical thinking: Structured, root-cause-focused approach to problem-solving and continuous improvement.
Collaboration: Acts as a trusted sounding board and support function for teams working in Azure and adjacent domains.
Communication: Able to articulate complex technical topics for non-technical stakeholders. Teamwork: Works effectively across Service Desk, infrastructure, security, and application teams.
Attention to detail:
High standards in configuration, documentation, and change execution.
Prioritization: Effective time management with focus on initiatives that have clear impact on platform reliability and operational efficiency.
Customer focus: Genuine interest in making IT and cloud services simpler and better for end users and development teams.
Strategic mindset: Understands how cloud platform operations support business goals and how to evolve the operating model over time.
Informal leadership:
Drives initiatives, influences stakeholders, and serves as a role model in operational excellence.
Key Outcomes / Success Metrics
High availability and performance of infrastructure services
Reduced incident resolution time (MTTR)
Optimized cloud cost and resource utilization
Compliance with security and governance standards
Accurate and up-to-date documentation
Salary Budget 15.00 -18.00 lacs
Mode of operation - WFH (sometimes needs to visit Pune office)
Interview process: 3 interviews
Exp : 6-10 yrs
Timing: European timezone
Interested Candidates can apply with Updated CV on [email protected] OR can call / WhatsApp On 9850700213
Pay: ₹1,500,000.00 - ₹1,800,000.00 per year
Benefits:
Work Location: Remote