Service Support Engineer

Service Support Engineer (Senior)

Permanent – Hybrid

Manchester

Join a global technology-driven organisation as part of a high-performing Cloud Infrastructure & Platform Engineering function, supporting complex digital platforms used across financial markets worldwide. As a Senior Service Support Engineer, you will embed directly with development squads across the UK and US, driving reliability, scalability, and operational excellence across critical production services, including new AI-powered platforms.

This is an opportunity to combine deep technical engineering with strategic influence, enabling cloud-native, modern deployment patterns and AI-enhanced operations within a mature, forward-looking engineering environment.

________________________________________

About the Team

Service Reliability brings together Site Reliability, DevOps, Security and Applications Support engineering into one group working within a broader Cloud Infrastructure & Platform Engineering function. The team provides subject-matter expertise across cloud technologies, systems engineering, DevOps tooling, infrastructure automation, and platform reliability.

Your work will focus on improving service quality for major production systems, while partnering closely with engineering squads delivering new AI capabilities and mission-critical digital services.

________________________________________

Key Responsibilities

  • Lead the delivery of reliable, scalable, mission-critical services across cloud and containerised environments.
  • Advise engineering squads on Kubernetes, modern deployment patterns, and best-practice operational design.
  • Mentor associate engineers, establish engineering standards, and uplift service maturity across teams.
  • Partner with development and operations teams to evolve service architectures, DevOps tooling, and CI/CD practices.
  • Architect and govern GitHub Actions pipelines with quality gates and automated validation.
  • Own observability (Datadog), defining SLIs/SLOs, dashboards, alerting, MS Teams integrations, and telemetry-based improvements.
  • Drive AI-enabled operations using Bedrock/SageMaker and the Model Context Protocol (MCP) for log analysis, anomaly detection, and incident workflows.
  • Define cloud guardrails and enforce security controls, including IAM boundaries, SCPs, OPA policies, tagging, centralised logging, and audit tooling.
  • Act as an escalation point for complex issues and participate in the on-call rotation.
  • Shape cross-functional roadmaps and contribute to major platform initiatives across the global engineering organisation.

________________________________________

Skills & Experience Required

  • Deep hands-on experience in SRE / DevOps / Platform Engineering across AWS and Azure.
  • Strong production experience running Docker and Kubernetes at scale.
  • Proficiency in administering Linux and Windows.
  • Experience supporting .NET/IIS and Java Spring Boot applications.
  • Hands-on experience building and maintaining GitHub Actions CI/CD pipelines (Bamboo).
  • Competence scripting with Python, PowerShell, or Bash.
  • Experience with cloud security: IAM, secrets management, and image/container scanning.
  • Strong understanding of core infrastructure: networking, storage, DNS, APM, and telemetry.

________________________________________

Highly desired experience includes:

  • Experience with AI-driven operational tooling (triage, runbooks, change management) with strong governance controls.
  • Supporting large-scale AI/ML workloads: SageMaker, GPU autoscaling, and Kubernetes-based model serving.
  • Experience implementing policy as code (OPA).
  • Experience implementing compliance frameworks (CIS, NIST, ISO 27001).
  • Applying AI within CI/CD, observability, and incident response using Bedrock / SageMaker and MCP.

KEY INFO:

**Visa sponsorship is not available at this time.

**Hybrid working will require the successful candidate to live within a reasonable commuting distance.

Rewards & Benefits

In addition to a strong salary linked to experience:

  • Flexible annual leave
  • Strong pension and life-assurance schemes
  • Health insurance
  • Access to wellbeing, mental-health, and lifestyle support programmes
  • Employee discounts across a range of health and related services

________________________________________

Keyword Terminology

SRE, Service Reliability Engineering, Applications Support, Applications Engineering, DevOps, SecDevOps, Platform Engineering, AWS, Azure, Kubernetes, Docker, Linux, Windows, IIS, .NET, Java, Spring Boot, CI/CD, GitHub Actions, Bamboo, Python, PowerShell, Bash, Datadog, Observability, SLIs, SLOs, Bedrock, SageMaker, MCP, OPA, IAM, Cloud Security, Infrastructure Automation, AI enabled Operations, Container Security

Job Details

Company: Be-IT Resourcing
Location: Manchester, Lancashire, England, United Kingdom (Hybrid / Remote Options)
Employment Type: Full-Time
Salary: Negotiable
Posted