Engineering Lead
NEW CONTRACT ROLE - ENGINEERING LEAD (OBSERVABILITY / SRE)
ASAP START
UK (Remote / Hybrid)
6-Month Contract
Possible Extension
London, Manchester, Birmingham or Edinburgh

THE OPPORTUNITY
We're looking for an experienced Engineering Lead to support a critical enterprise observability and operational resilience programme. This role is focused on leading the uplift of monitoring, alerting, and end-to-end service visibility across business-critical applications. It's ideal for a senior, hands-on engineering lead with deep Prometheus and Grafana expertise, capable of guiding best practices across SRE, platform, and application teams.

THE ROLE
- Lead collaboration with Application Stewards and Site Reliability Engineers (SREs) to confirm critical services and assets in scope for monitoring verification and uplift
- Work with EMAS to analyse Prometheus scrape coverage, exporter deployment, and Grafana dashboard availability for critical applications
- Drive improvements across monitoring configuration, alert quality, metrics, dashboards, KPIs, SLIs, and SLOs
- Lead the optimisation of alerting to ensure alerts are reliable, actionable, and noise-optimised, applying Alertmanager best practices
- Oversee delivery of automated end-to-end business flow visibility through Grafana service maps, dependency visualisation, and topology integrations
- Review observability roles and responsibilities and recommend improvements aligned to Operational Resilience standards
- Champion automation and API-driven app