Job summary
Senior Consultant role focusing on LogicMonitor- and Datadog-based monitoring for hybrid environments. The role delivers robust observability, incident reduction, and performance optimization for enterprise clients, working closely with cross-functional teams to design, implement, and maintain scalable monitoring solutions that align with organizational goals and compliance standards.
Responsibilities
Design and implement comprehensive LogicMonitor and Datadog monitoring strategies that align with enterprise observability objectives and service level targets
Configure advanced monitoring dashboards in LogicMonitor to provide meaningful visibility into infrastructure health, application performance, and capacity trends
Develop tailored Datadog metric collections, monitors, and visualization assets that highlight critical service indicators and enable proactive incident detection
Optimize alert rules and thresholds in LogicMonitor and Datadog to reduce noise, prioritize critical events, and improve response efficiency for support teams
Collaborate with application, infrastructure, and security teams to identify key telemetry requirements and translate them into actionable monitoring configurations
Conduct detailed root cause analyses using LogicMonitor and Datadog data to identify recurring issues and recommend sustainable remediation actions
Create and maintain reusable monitoring templates, runbooks, and standard operating procedures that support consistent deployment across hybrid environments
Integrate LogicMonitor and Datadog with ticketing and collaboration tools to streamline incident workflows and enhance transparency for stakeholders
Perform regular health checks, tuning exercises, and platform upgrades for LogicMonitor and Datadog to ensure resilience, availability, and optimal performance
Provide expert guidance to internal teams on observability best practices and help shape monitoring standards that support reliability and scalability
Document monitoring architectures, configurations, and operational guidelines clearly and comprehensively to support onboarding and knowledge sharing
Coordinate with client partners to gather requirements, present monitoring insights, and translate feedback into iterative improvements for observability solutions
Measure and report on the impact of monitoring enhancements by tracking incident reduction, mean time to resolution, and performance improvements across services
Qualifications
Possess eight to twelve years of professional experience in enterprise monitoring and observability, with a strong focus on LogicMonitor- and Datadog-based solutions
Demonstrate advanced proficiency in configuring LogicMonitor collectors, data sources, dashboards, and alerting components for complex hybrid environments
Demonstrate strong expertise in Datadog, including custom metric creation, monitor configuration, log analytics, and visualization of service performance
Exhibit a solid understanding of infrastructure components such as servers, networks, cloud platforms, and containers to design effective monitoring coverage
Demonstrate the ability to interpret performance data and logs to identify trends, capacity risks, and optimization opportunities that support business continuity
Display strong skills in documenting technical solutions, creating runbooks, and communicating complex topics clearly and concisely to diverse audiences
Demonstrate experience working in hybrid work models and collaborating with distributed teams using modern communication and project management tools
Exhibit familiarity with incident management and problem management practices to align monitoring configurations with operational processes and governance
Demonstrate a strong analytical mindset with a focus on reducing incidents, improving service uptime, and enhancing customer experience through observability improvements
Display knowledge of scripting or automation tools that support integration and configuration of LogicMonitor and Datadog within enterprise environments
Demonstrate a commitment to continuous learning and to staying current with emerging monitoring practices, tools, and industry trends relevant to observability