Staff Platform Engineer
Summary / Key Responsibilities:
The Staff Platform Engineer is responsible for ensuring the reliability, scalability, automation, security, and continuous delivery of enterprise applications and platforms. This role focuses on managing CI/CD pipelines, cloud and container platforms, infrastructure automation, deployment processes, monitoring solutions, and operational best practices while supporting development teams and maintaining system performance, availability, and security.
Key Functions & ResponsibilitiesPlatform Administration & Infrastructure Management:
- Configure, administer, and maintain Linux and Windows servers, cloud services, container platforms, and production environments.
- Manage and maintain Red Hat OpenShift Container Platform (OCP 4.x), ensuring platform stability, availability, performance, and scalability.
- Optimize infrastructure resources for scalability, cost efficiency, security, and operational excellence.
IBM Cloud Pak Administration:
- Deploy, configure, administer, and support IBM Cloud Pak platforms, including:
- IBM Cloud Pak for Business Automation (CP4BA)
- Manage:
- Cloud Pak Operators
- Custom Resources (CRs)
- Certificates
- Storage Classes
- Scaling & High Availability Configurations
IBM Platform Components Support
Administer and support the following IBM platform services:
- Business Automation Workflow (BAW)
- Business Automation Studio
- FileNet Content Platform Engine
- Operational Decision Manager (ODM)
- Automation Document Processing (ADP)
- API Connect
- App Connect Enterprise (ACE)
- IBM MQ
- Aspera
- DataPower
- Watson Studio
- Watson Machine Learning
- DataStage
- Db2 Warehouse
- Data Virtualization
- Knowledge Catalog
Integration & Deployment:
- Configure and manage integrations with external systems, APIs, databases, and enterprise applications.
- Provide technical support and operational guidance during deployments, releases, upgrades, and production incidents.
Monitoring & Observability:
- Implement and maintain monitoring, logging, and alerting solutions using:
- Prometheus
- Grafana
- ELK Stack
- IBM Monitoring Solutions
- Proactively monitor:
- Infrastructure
- Applications
- Compute resources
- Storage
- Networking
- Virtualization platforms
- Container platforms
Incident & Problem Management:
- Perform incident management, including:
- Troubleshooting
- Root Cause Analysis (RCA)
- Post-Incident Reviews
- Corrective & Preventive Actions
- Ensure service continuity and minimize platform downtime.
Documentation & Operational Excellence:
- Maintain accurate:
- Technical Documentation
- Operational Procedures
- Architecture Diagrams
- Runbooks
- Establish and follow platform operational best practices.
Qualification & Experience Education:
- Degree or Diploma in Information Technology, Computer Science, Engineering, or an equivalent technical discipline.
Experience:
- Minimum 4+ years of experience in a similar Platform Engineering / DevOps / Infrastructure Engineering role.
Required Technical Skills & Expertise Operating Systems:
- Strong experience in:
- Linux Administration
- Windows Server Administration
DevOps & Automation:
- Hands-on experience with:
- CI/CD Pipelines
- Jenkins
- GitLab CI/CD
- GitHub Actions
- Strong scripting skills in:
- Bash
- PowerShell
- Python
Containers & Orchestration:
- Strong experience with:
- Docker
- Kubernetes
- Red Hat OpenShift (Production Environments)
IBM Technologies:
- Strong hands-on experience deploying and supporting at least two IBM Cloud Pak solutions, preferably including:
- IBM Cloud Pak for Business Automation (CP4BA)
Monitoring & Observability:
- Experience with:
- Prometheus
- Grafana
- ELK Stack
- Enterprise monitoring platforms
Operations & Support:
- Strong skills in:
- Incident Management
- Troubleshooting
- Root Cause Analysis (RCA)
- Performance Tuning
- Capacity Planning
Soft Skills:
- Excellent communication and coordination skills.
- Ability to work effectively in a cross-functional team environment.
- Ability to perform under pressure and manage critical production environments.
- Good command of English (Reading, Writing, and Communication).
Pay: ₹1,684,478.78 - ₹2,878,933.53 per year
Benefits:
Application Question(s):
- Which IBM Cloud Pak products have you worked on?
- Have you managed CI/CD pipelines using Jenkins, GitLab CI or GitHub Actions?
- Which container technologies have you worked with?
- Have you supported production environments with 99.9%+ availability requirements?
- Have you implemented monitoring using any of the following?
(Multi-select)
Prometheus
Grafana
ELK
Splunk
Datadog
- Have you performed Root Cause Analysis (RCA) for production incidents?
- Are you comfortable writing automation scripts?
- Notice Period?
Immediate
<30 Days
30-60 Days
60-90 Days
90+ Days
Experience:
- Platform Engineer: 5 years (Preferred)
- Open Shift Administrator: 5 years (Preferred)
- DevOps Engineer: 5 years (Preferred)
- Red Hat OpenShift in a production environment: 5 years (Preferred)
- Kubernetes: 5 years (Preferred)
- administered IBM Cloud Pak solutions in production: 5 years (Preferred)
Location:
- Mumbai, Maharashtra (Mumbai) (Preferred)
Work Location: In person