IT Operations Manager with 10+ years of experience supporting 24/7 production environments across mission-critical enterprise infrastructure. Led Network and Server Operations teams of 16 engineers managing 1,000+ network devices and 300+ servers across 30+ sites, achieving 100% SLA compliance and sustaining 99.5% availability across 21 systems. Experienced in Major Incident Management, ITIL-based Problem Management and data-driven Continual Service Improvement (600-800 incidents/month). Contributed to Business Continuity Planning, participated in internal audit reviews and strengthened operational governance through structured escalation, change control and vendor coordination.
Summary
Education
University of Wollongong Bachelor of Computer Science, Digital Systems Security Sep 2020 - Mar 2022
Singapore Polytechnic Diploma, Infocomm Security Management Sep 2013 - May 2015
Experience
- Infrastructure Service Delivery Manager MINDEF Jun 2023 - Present • 2 yrs 10 mos
Lead enterprise IT service delivery across 21 mission-critical infrastructure systems, ensuring production stability and 99.5% availability performance. Own end-to-end L3 escalation governance and ITIL-based Problem Management in ServiceNow, directing RCA and CAPA to reduce recurring incidents and strengthen system resilience. Direct cross-functional incident response across technical teams, stakeholders and vendors to restore services, reduce MTTR and minimise business impact. Drive data-driven Continual Service Improvement (CSI), analysing 600-800 monthly incidents using trend and variance analysis to identify recurring failure patterns and prioritise risk mitigation actions. Participate in internal audit reviews assessing adherence to incident management controls and technical SOPs, reinforcing governance discipline and audit readiness. Contribute engineering inputs to Business Continuity Planning (BCP), supporting operational readiness through budget forecasting, resource planning and vendor coordination. Exercise governance oversight across more than 30 distributed sites, reviewing and approving infrastructure changes to meet fire safety requirements and ensure operational integrity.
- Infrastructure Team Lead MINDEF Sep 2020 - Jun 2023 • 2 yrs 10 mos
Led Network and Server Operations teams (16 engineers) supporting 1,000+ network devices and 300+ servers across 30+ sites islandwide, overseeing day-to-day operations, workforce planning and team performance management. Owned Major Incident Management (P1/P2) within an ITIL-aligned framework, coordinating cross-functional teams and vendors to restore services within SLA targets and minimise business disruption. Maintained 100% SLA compliance through KPI monitoring, structured escalation processes and shift coverage planning. Directed on-premise infrastructure operations across distributed sites, covering routing, switching, firewalls, load balancing and virtualisation platforms (Cisco, Alcatel), including high availability (HA), disaster recovery (DR) and change management. Implemented operational governance standards, including SOPs and incident playbooks, to improve service reliability and strengthen Continuous Service Improvement (CSI). Developed structured onboarding and technical training programmes in networking and incident management to improve response quality and operational readiness.
Skills
- IT Operations Management (ITOM)Expert
- Major Incident ManagementExpert
Languages
- EnglishNative speaker
Certifications
- CompTIA Network+ CompTIAIssued Feb 2024 • Expired Feb 2027