Observability Manager

  • Permanent
  • Madrid
  • Negotiable EUR / Year

Levy Search

Skills: Observability Management, Automation, AWS/GCP and CI/CD pipelines

Location : Madrid Spain (Hybrid)

Type : Permanent

Lead Observability Vision: Develop and implement the company’s observability strategy, aligning it with both IT operational goals and broader business objectives.

Cross-functional Collaboration: Work closely with IT, DevOps, and business stakeholders to ensure observability initiatives support the company’s strategic goals and operational efficiency.

Senior Leadership: Lead and mentor a team of observability engineers and specialists responsible for configuration, deployment, and optimization of observability tools and processes.

Drive Performance Optimization: Collaborate with IT and SRE teams to identify and resolve performance bottlenecks, optimize system performance, and enhance the customer experience using Dynatrace insights.

Innovate and Automate: Integrate observability into automated processes, including CI/CD pipelines, to ensure proactive monitoring and faster incident detection and resolution.

Reporting and Insights: Provide stakeholders regular reports on system performance, observability metrics, and actionable insights, making data-driven recommendations for improvement.

Incident Management: Oversee major performance-related incidents, ensuring quick resolution and post-incident analysis to prevent recurrence.

What We Expect From You:

Educational Background: Bachelor’s Degree in Computer Science, Information Technology, or a related field (or equivalent practical experience).

Experience: 7+ years of experience in IT Operations, Performance Engineering, or Site Reliability Engineering, with at least 3+ years of hands-on experience with Dynatrace.

Leadership Skills: Proven track record of leading teams and developing a vision for observability in alignment with business and operational goals.

Technical Expertise: Strong knowledge of cloud platforms (AWS, Azure, GCP), containerized environments (Kubernetes, Docker), and observability tools like Prometheus, Grafana, Splunk, or similar.

Automation Experience: Familiarity with CI/CD pipelines, infrastructure-as-code, and automation tools to streamline monitoring and incident management processes.

Preferred Qualifications:

Dynatrace Certification and experience with AIOps, automated incident response, or machine learning-driven performance monitoring.

Familiarity with ITIL frameworks and incident/problem management processes.

Upload your CV/resume or any other relevant file. Max. file size: 98 MB.