Permanent Site Reliability Engineering Jobs in East London

17 of 17 Permanent Site Reliability Engineering Jobs in East London

Site Reliability Engineer

South East London, England, United Kingdom
Hybrid / WFH Options
Explore Group
Site Reliability Engineer (Hybrid – London) | RegTech Innovator | AWS, Terraform, Kubernetes Location: London (Hybrid – 2-3 days in office) Are you passionate about scalable infrastructure and modern DevOps practices … Want to make a tangible impact in a fast-growing RegTech company that’s transforming how businesses navigate regulatory compliance? Join us as a Site Reliability Engineer (SRE) and help build and operate the infrastructure that powers cutting-edge compliance solutions used by global financial institutions. What You'll Do Maintain and improve our AWS-based infrastructure using … Docker, Kubernetes (EKS) CI/CD: GitHub Actions, Argo CD, Helm Monitoring: Prometheus, Grafana, CloudWatch, OpenTelemetry Languages: Python, Bash, Go (bonus) What We're Looking For Strong experience in SRE, DevOps, or Production Engineering roles Proven hands-on skills with AWS , Terraform , and Kubernetes Experience with production support, incident management, and RCA practices Comfortable working in a fast-paced More ❯
Posted:

DevOps Engineer - AWS

South East London, England, United Kingdom
Hybrid / WFH Options
Cognitive Group | Part of the Focus Cloud Group
AWS DevOps Engineer Senior Site Reliability Engineer - Infrastructure Contract - Inside IR35 - Fulltime Location: London - Hybrid (3 days per week onsite) SC Cleared or Eligible for SC Clearance Your responsibilities: Deploy, configure, and monitor AWS services ensuring high availability, scalability, and security. Respond to and resolve infrastructure and service incidents with root cause analysis and preventive measures. Handle change More ❯
Posted:

Site Reliability Engineer

South East London, England, United Kingdom
Xcede
looking for a Site Reliability Engineer to join their highly skilled, innovative team. Essential skills: Strong proficiency in Python for infrastructure and automation Hands-on experience in SRE, DevOps or production engineering roles Deep understanding of monitoring, incident response workflows, and system architecture Productive approach to improving systems and reducing technical debt Strong collaboration and communication skills … working closely with developers, quants, and platform engineers Experience designing and delivering scalable, reliable production systems Proficiency with Linux/Unix systems Bachelor’s degree in CS, Engineering or a related field Familiarity with Kubernetes, Docker, or container orchestration technologies Experience with automation tools such as Terraform or Ansible Background in Go, Bash or other system-level languages Exposure … design and implement automation for operations, deployments, monitoring and incident management, as well as owning the observability stack (metrics, logs, traces and alerting). You will also: apply core SRE principles (SLIs, SLOs, error budgets) to enhance system reliability; build, document, and improve high-performance system designs; lead incident response and implement improvements; collaborate closely with quant developers/ More ❯
Posted:

Senior Site Reliability Engineer

South East London, England, United Kingdom
TRIA
Lead Site Reliability Engineer Central London (Hybrid) Up to £95k + Car Allowance & Bonus TRIA are working with a leading hospitality client for a Lead SRE, where they are investing heavily in the performance, stability, and reliability of its digital platforms. This is a hands-on leadership role - you won’t just guide others, you’ll be … uptime The stack includes Kubernetes , Terraform , AWS , Python , and modern CI/CD tools, and it's evolving. If you're confident in a crisis, understand what a good SRE practice looks like, and want to leave systems in a better place than you found them, please apply to be considered and learn more! What you’ll bring : Experience in … high-traffic digital or eCommerce platforms 5+ years in SRE/DevOps roles; strong background in incident response Observability, automation, and infrastructure as code expertise Leadership skills - mentoring others or leading from the front More ❯
Posted:

Senior Production Engineer - Fintech/Digital Assets

South East London, England, United Kingdom
Tempest Vane Partners
is redefining how digital assets are secured and managed. As part of their expansion, they’re looking to bring on a Senior Production Engineer to lead the charge on reliability, resilience, and operational excellence within a complex, high-uptime platform environment. What You’ll Get A superb opportunity to join an institutionally backed, cutting edge Crypto Fintech at the … salary and annual discretionary bonus. Pension contributions, in addition to Health Insurance, Life Assurance. 25 Annual Leave. What You’ll Be Doing This is a hands-on and strategic engineering role where you’ll be responsible for ensuring production stability across a highly dynamic microservices architecture hosted in Azure . You’ll have end-to-end ownership over reliability … and monitoring across distributed systems. Collaborating with cross-functional teams to align platform strategy and reliability goals. What You’ll Bring: 5+ years in software engineering or SRE/production infrastructure roles. Strong experience with Java (Spring) and cloud platforms (ideally Azure ). Proven track record in building and maintaining mission-critical systems. Deep understanding of Kubernetes, observability More ❯
Posted:

Senior DevOps Engineer AWS - SC Clearance

South East London, England, United Kingdom
Hybrid / WFH Options
Client Server
WFH: You can work from home most of the time, meeting up with colleagues in the London office once a week. About you: You have experience in similar DevOps, SRE or Infrastructure engineering positions You have expertise with Kubernetes and Helm, having built mission critical systems in production You have strong IaC, Terraform experience You have strong CI/ More ❯
Posted:

Lead Cloud Engineer

South East London, England, United Kingdom
developrec
Lead Cloud Engineer As a Lead Cloud Engineer, you’ll have SME level knowledge across AWS and Platform Engineering disciplines. You’ll build AWS Cloud Solutions to ensure the organisation can take full advantage of Cloud based technologies. You’ll contribute to standards, guardrails and best practices, and implement improvements to processes and tooling to ensure engineering excellence. … You’ll have a strong understanding of operational requirements, and ensure Scalability, Resiliency, Observability, Security, Cost and Maintainability are at the forefront of all engineering activities. This specific project will involve Real Time Payments value stream, Form 3 gateway set-up and setting up the infrastructure for connectivity. What you’ll bring SME Level knowledge in AWS and Platform … and secure code delivery (ie SCA, SAST, DAST Networks/Security/Middleware & Apps Scripting/Coding (Bash, Python) End to End Observability solutions (logging, monitoring, alerting) Knowledge of SRE principles and practices More ❯
Posted:

Observability Engineer - Grafana Dashboarding

South East London, England, United Kingdom
Levy Global
and tooling (e.g., Terraform) as data sources for observability. Solid programming ability in Golang (preferred) or Python for automation and integration. Strong collaboration skills to work with cross-functional engineering teams. Experience working in Linux-based environments. Bonus/Nice … to-Have Skills: Experience deploying Grafana instances via code (provisioning dashboards, datasources). Familiarity with OpenTelemetry, metric instrumentation, and telemetry pipelines. Background in data center environments, infrastructure monitoring, or SRE practices. Exposure to CI/CD workflows, containers (Podman/Docker), and cloud-native systems. More ❯
Posted:

Solution Architect - Azure

South East London, England, United Kingdom
Birlasoft
and optimizing cloud-modernization solutions that drive our business forward Extensive experience in modernizing Java based monolith applications to Microservices based architecture on Azure Extensive experience in DevSecOps and SRE Primary Responsibilities: Lead the design and implementation of microservices architecture on Azure. Architect and deploy scalable, reliable, and secure solutions using AKS. Design and manage PostgreSQL databases in a cloud … Conduct assessments of existing applications and recommend modernization strategies. Develop and maintain architectural documentation and guidelines. Ensure compliance with security and governance policies. Provide technical leadership and mentorship to engineering teams. Stay updated with the latest industry trends and technologies. Implement Infrastructure as Code (IaC) using tools like Terraform or ARM templates. Oversee DevOps practices and CI/CD More ❯
Posted:

Software Engineer - Infrastructure / Observability

South East London, England, United Kingdom
SGI
engineering? Join a small, high-performing engineering team on a project at a top-tier quant fund, helping to build out observability capabilities. This is not an SRE or sysadmin role – it would suit a software engineer who cares about clean, testable code and good software practices, but prefers working in the infra/tooling space. What you … monitoring pipelines in Grafana and Prometheus Developing infrastructure-as-code tooling (Terraform, Ansible) Designing well-structured, testable software that improves system visibility What they’re looking for: Strong software engineering skills (Go or Python preferred) Experience working in … or alongside platform engineering teams Familiarity with modern observability tools (Grafana, Prometheus, etc.) Comfort working across both code and infrastructure – but this is not a pure ops/SRE role If you've worked in finance that would be great but not mandatory The role offers an initial 6-month initial contract, Inside IR35, up to £750/day More ❯
Posted:

Platform Engineer

South East London, England, United Kingdom
Ascendion
Document configurations, workflows, and best practices for team and organizational use. Requirements: Must-Have: Bachelor’s degree in computer science, Engineering, or a related field. Experience in DevOps, SRE, or platform/infrastructure roles. Hands-on experience with Kong Mesh (or other service mesh solutions like Istio, Linkerd, Kuma). Solid understanding of Kubernetes, Envoy proxy, and container orchestration … . Familiarity with CI/CD pipelines, Git, and Infrastructure as Code tools like Terraform or Helm. Proficient in scripting languages (e.g., Bash, Python, or Go). Join our engineering team to build and scale service mesh architecture using Kong Mesh , Envoy , and Kubernetes . We're looking for someone with deep experience in service discovery , zero-trust networking More ❯
Posted:

Head of DevOps

South East London, England, United Kingdom
Oliver Bernard
and operational excellence, supporting a global client base of financial institutions. You’ll define and implement a modern operations roadmap—driving automation, CI/CD, managed services, and platform reliability—while enabling high-performing engineering and delivery teams. Key Responsibilities Lead DevOps and infrastructure strategy across cloud/on-prem environments Oversee CI/CD, automation, and platform … reliability Align operations with business goals, client delivery … and engineering standards Support ISO27001/SOC2 compliance and secure operational models Drive continuous improvement through KPIs and operational metrics Build and lead a multidisciplinary operations team (DevOps, SRE, Infra) Working predominantly with AWS Requirements Proven experience in Ops/Platform/DevOps leadership within tech or software Deep knowledge of DevOps tools, infrastructure-as-code, and cloud architecture More ❯
Posted:

Risk Analyst - Market/Counterparty Risk

South East London, England, United Kingdom
Lorien
with SQL and Python Data Visualisation skills with PowerBI, other Automation and Metrics knowledge handy. Proficiency with tools like Jira, Confluence, Excel, and SharePoint Familiarity with Agile, DevOps, and Site Reliability Engineering Excellent communication and stakeholder management skills More ❯
Posted:

Head of DevOps - FinTech

South East London, England, United Kingdom
Hybrid / WFH Options
Client Server
software enables customers to build high-performance web trading apps that deliver real-time information securely and reliably. As Head of DevOps you will oversee the Infrastructure and Build Engineering Team, taking ownership of defining and implementing the company's DevOps strategy, including streamlining CI/CD pipelines, automation, observability and release processes to support reliable and … scalable software delivery. There's currently a focus on transforming inefficient CI/CD pipelines using Gitlab as well as making improvements to monitoring, uptime and incident response via SRE and DevOps principles. You lead a team of three which you'll help to grow, providing technical leadership and direction. Location/WFH: There's a hybrid work from home More ❯
Posted:

Principal Network Engineer

South East London, England, United Kingdom
WNTD
A leading financial institution seeks a Principal Network Engineer Position Summary As a highly skilled principal network engineer, you will be responsible for the ongoing support and reliability of all components within the company's ecosystem, encompassing platforms, networks, applications, and services. In this role, you will provide escalation support for important network services, ensuring the operational stability and … performance of the company's global network. You will also find opportunities and improvements, then define requirements for software automation to enhance the features, functions and reliability of the network, underpinning company's digital platforms. Role Responsibilities We will play a pivotal role in designing, building, and maintaining systems, software, and applications across various domains, ensuring the highest quality … RSVP. Comprehensive knowledge of multicast concepts, including PIM-SM, PIM-SSM, IGMP, MVPN, and MLDP. Proficiency in product-aligned, service-focused work, with a solid grasp of network automation, SRE, and DevOps or equivalent experience. Working with Agile methodologies (Scrum, Kanban) and project management tools like JIRA. Excellent skills in network packet analysis and packet capture tools for troubleshooting. Familiarity More ❯
Posted:

Site Reliability Engineer | Inside IR35 | Remote - UK | 6 Month Contract

East London, London, United Kingdom
Hybrid / WFH Options
RP International
Site Reliability Engineer | Inside IR35 | Remote - UK | 6 Month Contract Our client a multinational and respected consultancy is hiring for a Site Reliability Engineer with expertise in GCP and DevOps Tools for a new project in the Communication Sector. Duration: 6 Months + Extensions Location: Remote (Ideally UK Based) Rate: £300-350 p/d (Inside … IR35) This role has multiple headcount. Technical Skills/Exp. GCP, Gitlab, Terraform Scripts, HassCorp, Env0, Okta, Security Architecture Nice to have: Platform Engineering If you are interested please apply with your updated CV, we can then arrange a call to further your application. More ❯
Posted:

Grafana Consultant - UK Remote - £55,000

East London, London, United Kingdom
Hybrid / WFH Options
Opus Recruitment Solutions
AI Ops | Grafana | Observability | Pagerduty | Prometheus | SRE | Site Reliability Engineer | Telecommunications | Consultant | Dashboard | Systems Engineer Looking to make a step into SRE? Excited by the prospect of AI Ops? I've partnered an exciting business who've recently been acquired by a European leader in the AI Ops Consultancy space. Taking on their UK market to replicate their … to large scale Enterprise right through to SME size businesses. Working with clients across the world, this one will be scaling a team of 4 to join their expert SRE's in the Observability domain. This one, in particular is hiring multiple people such as Grafana and PagerDuty to take on modern state of the art AI Ops projects and … streamline businesses Cloud Operations. If you've got a couple years experience working as a System Engineer or SRE and are looking for somewhere to really get hands on with AI Ops then get in touch. In return the role offers £55k and an opportunity to work remotely within the UK. AI Ops | Grafana | Observability | Pagerduty | Prometheus | SRE | Site More ❯
Posted: