LiveOps Engineer
Contract
Job Summary
We are looking for a LiveOps Engineer to support and enhance the reliability, availability, performance, and security of our Azure-based SaaS platforms. The role combines Cloud Operations and Site Reliability Engineering (SRE), focusing on production support, incident response, monitoring, automation, and continuous improvement. You will work closely with Engineering, Platform, Security, and Support teams to ensure stable and scalable cloud operations.
Roles & Responsibilities:
Manage and support Azure-based SaaS platforms and production environments.
Monitor system health, troubleshoot incidents, and participate in on-call support.
Support IIS-hosted .NET applications, SQL Server, and Azure SQL Managed Instance.
Improve reliability through monitoring, logging, alerting, and observability practices.
Work with Kubernetes, containerised environments, and CI/CD pipelines.
Collaborate with cross-functional teams to deliver reliable operational outcomes.
Maintain runbooks, operational documentation, and automation processes.
Support continuous improvement, operational efficiency, and platform stability.
Requirements:
Strong experience with Microsoft Azure SaaS platform operations.
Hands-on experience with IIS and .NET application troubleshooting.
Good knowledge of SQL Server and Azure SQL Managed Instance.
Experience with monitoring/logging tools (Datadog preferred).
Experience with Kubernetes, containerisation, and related tools.
Knowledge of CI/CD pipelines, GitHub, Argo CD, and version control systems.
Familiarity with Infrastructure-as-Code tools, especially Terraform.
Scripting or automation experience (C# is a plus).
Basic Linux administration knowledge.
Experience with Remote Desktop Architecture is an advantage.
Strong troubleshooting, analytical, and incident management skills.
Good communication and stakeholder management abilities.
Exposure to security, compliance, and customer-facing incident communication is preferred.