Job Title: Staff Platform Engineer
Experience: 4 to 8 Years
Location: Mumbai
Employment Type: Full-time
Job Summary: Staff Platform Engineer ensures the reliability, scalability, automation, and continuous delivery of applications and platforms by managing CI/CD pipelines, cloud and container platforms, infrastructure automation, deployment processes, monitoring solutions, and operational best practices while supporting development teams and maintaining system security, performance, and availability. Responsibilities:
- Configure, administer, and maintain Linux/Windows Servers, Cloud Services, Container Platforms, and Production Environments.
- Manage and maintain RedHat OpenShift Container Platform (OCP 4.x), ensuring platform stability, availability, performance, and scalability.
- Deploy, configure, administer, and support IBM Cloud Pak Platforms, including: IBM Cloud Pak for Business Automation (CP4BA) j.
- Manage Cloud Pak Operators, Custom Resources (CRs), Certificates, Storage Classes, Scaling, and High-Availability Configurations.
- Administer and support Cloud Pak components and services, including but not limited to: a. Business Automation Workflow (BAW) b. Business Automation Studio c. FileNet Content Platform Engine d. Operational Decision Manager (ODM) e. Automation Document Processing (ADP) f. API Connect g. App Connect Enterprise (ACE) h. IBM MQ i. Aspera DataPower k. Watson Studio l. Watson Machine Learning m. DataStage n. Db2 Warehouse o. Data Virtualization p. Knowledge Catalog • Configure and manage integrations with External Systems, APIs, Databases, and Enterprise Applications. • Monitor, troubleshoot, tune, and optimize Platform Performance, Reliability, and Stability across all supported environments. • Implement and maintain Monitoring, Logging, and Alerting Solutions using tools such as Prometheus, Grafana, ELK Stack, and IBM Monitoring Solutions. • Proactively monitor Infrastructure, Applications, Compute, Storage, Networking, Virtualization, and Container Platforms to prevent outages and ensure service continuity.
- Perform Incident Management activities, including Troubleshooting, Root Cause Analysis (RCA), Post-Incident Reviews, and implementation of permanent corrective actions.
- Optimize infrastructure and platform resources for Scalability, Cost Efficiency, Security, and Operational Excellence.
- Maintain accurate Technical Documentation, Operational Procedures, Architecture Diagrams, and Runbooks. • Provide technical support and operational guidance during Deployments, Releases, Upgrades, and Production Incidents. Knowledge / Experience / Skills / Competency Required
- 4+ Years of Experience in a similar position.
- Degree or Diploma in Information Technology, BA or Commercial direction At least Engineering diploma level or equivalent in professional previous responsibility.
- Good communication skills to carry out coordination/liaison with other sections.
- Ability to work under pressure and effectively in a team environment.
- Good Experience in Linux/Windows Server Administration. • Good Experience with CI/CD Pipelines, automation tools (Jenkins, GitLab CI, GitHub Actions) and scripting (Bash, PowerShell, Python).
- Good Experience with RedHat OpenShift in production environments.
- Good Experience with Containers and Orchestration (Docker, Kubernetes).
- Strong experience deploying and supporting at least two IBM Cloud Paks (CP4BA).
- Hands-on experience with Monitoring & Observability Tools.
- Strong skills in Incident Management, Troubleshooting, and Root Cause Analysis (RCA).
- Good English Communication Skills (Read & Write). Key Technologies OpenShift | Kubernetes | Docker | IBM Cloud Pak (CP4BA) | BAW | FileNet | ODM | CI/CD | Jenkins | GitLab CI | GitHub Actions | Bash | PowerShell | Python | Prometheus | Grafana | ELK Stack | Linux | Windows | RCA | Production Support
Pay: ₹1,000,000.00 - ₹1,200,000.00 per year
Work Location: In person