Senior Azure SaaS Reliability and Support Engineer
Senior Azure SaaS Reliability andamp; Support Engineer - Hybrid (2 days a week in Kingston) - ASAP Start You will be the bridge between support, engineering, and cloud operations Investigating and fixing complex application and infrastructure issues. Monitoring capacity, performance, and error budgets across all deployments. Designing automation and tooling to improve reliability and reduce manual work. Your Responsibilities and Tasks 1. Environment Health andamp; Incident Response Monitor ST and MT environments for server performance, response times, error rates, and application health. Detect and resolve database issues, stalled file processing, or misplaced storage objects. Use Azure diagnostics and telemetry to troubleshoot and resolve complex incidents. Provide third-line support for escalated customer cases, collaborating with development for code-level fixes. 2. Reliability Engineering (Fleet Level) Maintain uptime, performance, and scalability across all ST and MT deployments. Define and track service-level objectives (SLOs) and error budgets for different environment types. Perform capacity planning for Servers, databases, and storage, scaling resources before issues occur. Identify systemic patterns causing downtime and implement fixes at scale. 3. Automation andamp; Tooling Build scripts and automation (PowerShell, C#, Azure Functions, Logic Apps) to detect and remediate common application or infrastructure issues. Automate environment health checks a
Other jobs of interest...
Perform a fresh search...
-
Create your ideal job search criteria by
completing our quick and simple form and
receive daily job alerts tailored to you!