Database Operations Engineer (DB Ops)
Location: Offshore – India Development Centers
Experience: 5–7 Years
Role Summary
We are seeking a motivated and detail-oriented Database Operations Engineer (DB Ops) to support database administration, monitoring, and operational activities across cloud and hybrid environments.
In this role, you will assist with database maintenance, performance monitoring, backup and recovery operations, incident management, and automation initiatives while collaborating closely with senior database engineers, application teams, and infrastructure teams.
The ideal candidate will have a solid foundation in database administration, Azure cloud technologies, and production support operations, along with a strong willingness to learn automation, DevOps, and cloud-native database technologies.
Key Responsibilities
- Support administration and maintenance of SQL Server, Azure SQL Database, PostgreSQL, and Azure Cosmos DB environments.
- Ensure database availability, reliability, and operational stability across Production, UAT, QA, and Development environments.
- Perform and monitor database backup, restore, and recovery activities.
- Execute routine database maintenance tasks, including index maintenance, statistics updates, consistency checks, and health assessments.
- Assist in database patching, upgrades, migrations, and environment refresh activities.
- Support database security administration, user access management, and compliance requirements.
- Monitor database performance and identify performance bottlenecks.
- Assist in troubleshooting blocking sessions, deadlocks, long-running queries, replication latency, and resource utilization issues.
- Support basic query tuning, indexing strategies, and performance optimization initiatives.
- Utilize monitoring and observability tools such as SentryOne, Dynatrace, and native database monitoring solutions to track database health and performance.
- Assist in implementing, maintaining, and monitoring High Availability (HA) and Disaster Recovery (DR) solutions.
- Gain exposure to technologies such as SQL Server Always On Availability Groups, replication, failover strategies, and backup validation processes.
- Participate in recovery testing and business continuity exercises.
- Support automation of routine operational activities using PowerShell, Python, or similar scripting technologies.
- Assist with database deployment processes and CI/CD pipelines using Azure DevOps.
- Contribute to Infrastructure-as-Code (IaC) initiatives using Terraform and related tools.
- Support release management and deployment automation activities.
- Participate in incident management, troubleshooting, and issue resolution activities.
- Perform initial root cause analysis (RCA) and escalate issues when required.
- Follow operational runbooks, standard operating procedures, and escalation processes.
- Participate in on-call support rotations and major incident management activities.
- Collaborate with application, infrastructure, and support teams to resolve production issues efficiently.
- Maintain and enhance operational documentation, knowledge articles, and runbooks.
- Contribute to continuous improvement initiatives aimed at increasing operational efficiency and reliability.
- Support implementation of best practices in database operations, monitoring, and automation.
Required Technical Skills
- Hands-on experience with SQL Server, Azure SQL Database, PostgreSQL, and Azure Cosmos DB.
- Database installation, configuration, and environment setup.
- Backup, restore, and recovery management.
- Understanding of High Availability (HA) and Disaster Recovery (DR) concepts.
- Exposure to SQL Server Always On Availability Groups and replication technologies.
- Performance tuning fundamentals and query optimization techniques.
- Indexing, statistics management, and execution plan analysis.
- Database security, user access control, and permissions management.
- Scripting experience using PowerShell and/or Python.
- Exposure to Azure DevOps and CI/CD pipelines.
- Understanding of Terraform and Infrastructure-as-Code (IaC) principles.
- Experience with database monitoring and performance analysis tools.
- Exposure to SentryOne, Dynatrace, and SQL monitoring solutions.
- Database health monitoring, alerting, and reporting.
- Understanding of blocking, deadlock analysis, and replication monitoring.
- Windows Server administration fundamentals.
- Basic Linux administration knowledge.
- Understanding of Azure infrastructure services, including Virtual Machines, Storage Accounts, Networking, Firewalls, and IIS.
- Strong analytical and troubleshooting skills.
- Ability to perform initial issue analysis and escalation.
- Basic understanding of application architecture and .NET environments.
- Ability to troubleshoot across database, infrastructure, and application layers.
Preferred Qualifications
- Exposure to Microsoft Azure cloud services.
- Understanding of Site Reliability Engineering (SRE) concepts, including monitoring, alerting, incident response, and operational excellence.
- Experience with ServiceNow or other ITSM/ticketing platforms.
- Exposure to Ansible or configuration management tools.
- Understanding of software release management and deployment processes.
- Familiarity with Agile and DevOps methodologies.
Soft Skills
- Strong desire to learn and grow within database, cloud, and DevOps technologies.
- Excellent analytical and problem-solving abilities.
- Effective verbal and written communication skills.
- Strong collaboration and teamwork capabilities.
- Ability to work in a fast-paced production support environment.
- Ability to follow established processes and operate under guidance while demonstrating initiative.