Site Reliability Engineering Jobs in the UK

76 to 100 of 436 Site Reliability Engineering Jobs in the UK

Platform Engineer

Stroud, England, United Kingdom
Hybrid / WFH Options
Ecotricity
Visibility & Observability: Help create and maintain a single source of truth for our technology ecosystem, including a central application directory, system diagrams, and standardized monitoring and alerting dashboards. Enhancing Reliability: Work to improve system reliability by helping to define key metrics, implement robust monitoring solutions, and support the development of a company-wide backup and business continuity plan. … Agile development environment. Refining tickets, estimating work and breaking user stories down into smaller tasks. What Success Looks Like (The First Year) As a founding member of the Platform Engineering team, your impact will be a key part of our department's transformation. Within your first year, we expect to see tangible progress on our core initiatives, specifically: Centralized … the entire department. Infrastructure Governance: You will be heavily involved in the effort to identify and eliminate all inefficient applications and infrastructure, improving overall security and cost management. Enhanced Reliability: You will play a key role in developing a Business Continuity Plan (BCP) playbook for the company's core applications, significantly improving our resilience. Desirable Skills (Bonus) Prior experience More ❯
Posted:

Platform Engineer

Stroud, south east england, united kingdom
Hybrid / WFH Options
Ecotricity
Visibility & Observability: Help create and maintain a single source of truth for our technology ecosystem, including a central application directory, system diagrams, and standardized monitoring and alerting dashboards. Enhancing Reliability: Work to improve system reliability by helping to define key metrics, implement robust monitoring solutions, and support the development of a company-wide backup and business continuity plan. … Agile development environment. Refining tickets, estimating work and breaking user stories down into smaller tasks. What Success Looks Like (The First Year) As a founding member of the Platform Engineering team, your impact will be a key part of our department's transformation. Within your first year, we expect to see tangible progress on our core initiatives, specifically: Centralized … the entire department. Infrastructure Governance: You will be heavily involved in the effort to identify and eliminate all inefficient applications and infrastructure, improving overall security and cost management. Enhanced Reliability: You will play a key role in developing a Business Continuity Plan (BCP) playbook for the company's core applications, significantly improving our resilience. Desirable Skills (Bonus) Prior experience More ❯
Posted:

Site Reliability Engineer SC Cleared

Wokingham, Berkshire, South East, United Kingdom
Hybrid / WFH Options
Sanderson Government and Defence
Position: Senior & Principal Site Reliability Engineer Location: Wokingham/Home-Based Salary: £500-£590 p.d inside IR35 (Depending on seniority) Security Clearance: SC Clearance Required We're looking for a sharp-minded Site Reliability Engineer to join our cloud-native mission in Azure. If you thrive in Agile teams, live for automation, and know your way … around observability stacks and CI/CD pipelines - this is your playground. What you'll be doing: Automating deployment, monitoring & infrastructure with precision Owning platform reliability, performance & SLAs Building IaC with Helm/… Kustomize Managing CI/CD pipelines via Tekton & ArgoCD Supporting OpenShift Operators (ServiceMesh, ODF, ACS, ACM, AMQ) Implementing security controls aligned with national infra standards Mentoring junior engineers & championing SRE best practices Collaborating across dev teams, vendors & IT to drive operational excellence If you're interested in hearing more, please do send your profile to Reasonable Adjustments: Respect and equality More ❯
Employment Type: Contract, Work From Home
Posted:

DevOps Engineer

Welwyn Garden City, England, United Kingdom
Hybrid / WFH Options
PayPoint plc
and production environments • Manage containerized applications using Docker and Kubernetes orchestration • Implement security scanning and compliance checks within deployment pipelines • Collaborate with development teams to optimize application performance and reliability • Maintain and improve backup, recovery, and disaster recovery procedures • Support incident response and root cause analysis for production issues • Manage configuration management and secrets management solutions • Optimize cloud resource … microservices architecture and deployment patterns • Experience with API management and gateway solutions • Knowledge of network security and firewall management • Certification in Azure, AWS, or DevOps methodologies • Experience with chaos engineering and resilience testing • Familiarity with service mesh technologies (Istio, Linkerd, cilium) Really Nice to Have: • PCI-DSS compliance experience • Experience in … financial services or regulated industries • Knowledge of ITIL or similar service management frameworks • Experience with automated testing frameworks and test automation • Understanding of site reliability engineering (SRE) principles What can we do for you? Holiday purchase scheme, with 25 days holiday plus bank holidays as standard. On-site gym at our office, and nationwide corporate rate More ❯
Posted:

DevOps Engineer

welwyn garden city, east anglia, united kingdom
Hybrid / WFH Options
PayPoint plc
and production environments • Manage containerized applications using Docker and Kubernetes orchestration • Implement security scanning and compliance checks within deployment pipelines • Collaborate with development teams to optimize application performance and reliability • Maintain and improve backup, recovery, and disaster recovery procedures • Support incident response and root cause analysis for production issues • Manage configuration management and secrets management solutions • Optimize cloud resource … microservices architecture and deployment patterns • Experience with API management and gateway solutions • Knowledge of network security and firewall management • Certification in Azure, AWS, or DevOps methodologies • Experience with chaos engineering and resilience testing • Familiarity with service mesh technologies (Istio, Linkerd, cilium) Really Nice to Have: • PCI-DSS compliance experience • Experience in … financial services or regulated industries • Knowledge of ITIL or similar service management frameworks • Experience with automated testing frameworks and test automation • Understanding of site reliability engineering (SRE) principles What can we do for you? Holiday purchase scheme, with 25 days holiday plus bank holidays as standard. On-site gym at our office, and nationwide corporate rate More ❯
Posted:

DevOps Engineer

watford, hertfordshire, east anglia, united kingdom
Hybrid / WFH Options
PayPoint plc
and production environments • Manage containerized applications using Docker and Kubernetes orchestration • Implement security scanning and compliance checks within deployment pipelines • Collaborate with development teams to optimize application performance and reliability • Maintain and improve backup, recovery, and disaster recovery procedures • Support incident response and root cause analysis for production issues • Manage configuration management and secrets management solutions • Optimize cloud resource … microservices architecture and deployment patterns • Experience with API management and gateway solutions • Knowledge of network security and firewall management • Certification in Azure, AWS, or DevOps methodologies • Experience with chaos engineering and resilience testing • Familiarity with service mesh technologies (Istio, Linkerd, cilium) Really Nice to Have: • PCI-DSS compliance experience • Experience in … financial services or regulated industries • Knowledge of ITIL or similar service management frameworks • Experience with automated testing frameworks and test automation • Understanding of site reliability engineering (SRE) principles What can we do for you? Holiday purchase scheme, with 25 days holiday plus bank holidays as standard. On-site gym at our office, and nationwide corporate rate More ❯
Posted:

Senior Site Reliability Engineer

Birmingham, West Midlands, United Kingdom
Hybrid / WFH Options
Inspire People
team at the heart of the global economy! The Department for Business and Trade ('DBT') and Inspire People are partnering together to bring you an exciting opportunity for Senior Site Reliability Engineers to join a team that ensures DBT's digital services work as users expect, working with development teams giving them the tools for their job, including … service-level objectives. - Participate in an on-call rota (with allowance), helping to keep DBT services resilient and reliable. - Mentor junior engineers and contribute to the growth of the SRE function. Technologies you will work with include AWS, Azure, Terraform/CloudFormation, Docker, ECS, ECR, ElasticSearch, Python/Django, PostgreSQL (RDS), Redis, and more. Essential Criteria - Cloud experience with AWS … application will be assessed against these requirements before being progressed to DBT. Shortlisted candidates will then be invited to interview and technical exercise. If you are a DevOps Engineer, SRE, or Systems Administrator looking to make a real impact across government digital services, apply today or contact Keesha Paulsen at Inspire People in confidence for more information. More ❯
Employment Type: Permanent, Work From Home
Salary: £80,000
Posted:

DevOps Engineer

Liverpool, England, United Kingdom
Hybrid / WFH Options
Love2shop
and production environments • Manage containerized applications using Docker and Kubernetes orchestration • Implement security scanning and compliance checks within deployment pipelines • Collaborate with development teams to optimize application performance and reliability • Maintain and improve backup, recovery, and disaster recovery procedures • Support incident response and root cause analysis for production issues • Manage configuration management and secrets management solutions • Optimize cloud resource … microservices architecture and deployment patterns • Experience with API management and gateway solutions • Knowledge of network security and firewall management • Certification in Azure, AWS, or DevOps methodologies • Experience with chaos engineering and resilience testing • Familiarity with service mesh technologies (Istio, Linkerd, cilium) Really Nice to Have: • PCI-DSS compliance experience • Experience in … financial services or regulated industries • Knowledge of ITIL or similar service management frameworks • Experience with automated testing frameworks and test automation • Understanding of site reliability engineering (SRE) principles Our benefits if you decide to join us: 25 days’ holiday per year, plus bank holidays Company sick pay from day 1 Company pension scheme UK health care cover More ❯
Posted:

DevOps Engineer

birkenhead, north west england, united kingdom
Hybrid / WFH Options
Love2shop
and production environments • Manage containerized applications using Docker and Kubernetes orchestration • Implement security scanning and compliance checks within deployment pipelines • Collaborate with development teams to optimize application performance and reliability • Maintain and improve backup, recovery, and disaster recovery procedures • Support incident response and root cause analysis for production issues • Manage configuration management and secrets management solutions • Optimize cloud resource … microservices architecture and deployment patterns • Experience with API management and gateway solutions • Knowledge of network security and firewall management • Certification in Azure, AWS, or DevOps methodologies • Experience with chaos engineering and resilience testing • Familiarity with service mesh technologies (Istio, Linkerd, cilium) Really Nice to Have: • PCI-DSS compliance experience • Experience in … financial services or regulated industries • Knowledge of ITIL or similar service management frameworks • Experience with automated testing frameworks and test automation • Understanding of site reliability engineering (SRE) principles Our benefits if you decide to join us: 25 days’ holiday per year, plus bank holidays Company sick pay from day 1 Company pension scheme UK health care cover More ❯
Posted:

DevOps Engineer

old swan, north west england, united kingdom
Hybrid / WFH Options
Love2shop
and production environments • Manage containerized applications using Docker and Kubernetes orchestration • Implement security scanning and compliance checks within deployment pipelines • Collaborate with development teams to optimize application performance and reliability • Maintain and improve backup, recovery, and disaster recovery procedures • Support incident response and root cause analysis for production issues • Manage configuration management and secrets management solutions • Optimize cloud resource … microservices architecture and deployment patterns • Experience with API management and gateway solutions • Knowledge of network security and firewall management • Certification in Azure, AWS, or DevOps methodologies • Experience with chaos engineering and resilience testing • Familiarity with service mesh technologies (Istio, Linkerd, cilium) Really Nice to Have: • PCI-DSS compliance experience • Experience in … financial services or regulated industries • Knowledge of ITIL or similar service management frameworks • Experience with automated testing frameworks and test automation • Understanding of site reliability engineering (SRE) principles Our benefits if you decide to join us: 25 days’ holiday per year, plus bank holidays Company sick pay from day 1 Company pension scheme UK health care cover More ❯
Posted:

DevOps Engineer

warrington, cheshire, north west england, united kingdom
Hybrid / WFH Options
Love2shop
and production environments • Manage containerized applications using Docker and Kubernetes orchestration • Implement security scanning and compliance checks within deployment pipelines • Collaborate with development teams to optimize application performance and reliability • Maintain and improve backup, recovery, and disaster recovery procedures • Support incident response and root cause analysis for production issues • Manage configuration management and secrets management solutions • Optimize cloud resource … microservices architecture and deployment patterns • Experience with API management and gateway solutions • Knowledge of network security and firewall management • Certification in Azure, AWS, or DevOps methodologies • Experience with chaos engineering and resilience testing • Familiarity with service mesh technologies (Istio, Linkerd, cilium) Really Nice to Have: • PCI-DSS compliance experience • Experience in … financial services or regulated industries • Knowledge of ITIL or similar service management frameworks • Experience with automated testing frameworks and test automation • Understanding of site reliability engineering (SRE) principles Our benefits if you decide to join us: 25 days’ holiday per year, plus bank holidays Company sick pay from day 1 Company pension scheme UK health care cover More ❯
Posted:

Principal Site Reliability Engineer | Stealth Fintech | London | Up to £180k + Equity

City of London, London, United Kingdom
Maze
Founding Site Reliability Engineer | Stealth Fintech | London | Up to £200k + Equity | London (Min 4 days per week in office) Maze is partnering with a stealth-mode startup that's rebuilding core banking from the ground up. They’re creating the world’s first open-source, AI-native "Thin Ledger"—set to replace legacy infrastructure at Tier … for our next-gen ledger infrastructure Scale multi-region Kubernetes environments across cloud & on-prem Harden distributed systems (Kafka, Redis, CockroachDB) for global banking workloads Lead our AI-powered SRE approach: observability, remediation, and auto-response Enforce zero-trust, multi-tenant security and compliance (SOC2, ISO 27001) Define IaC foundations (Terraform, GitOps, Helm) What We're Looking For: Expert with … Kubernetes and Distributed Systems Experience building production infrastructure at scale (multi-region, high-availability) Extensive experience building both on-Prem & Cloud infrastructure at scale from scratch. Strong SRE mindset: SLOs, SLIs, incident response AI-curious or AI-native: excited to build agent-powered ops Someone who is currently hands on (not someone primarily focused on strategy & people management) Passion for More ❯
Posted:

Principal Site Reliability Engineer | Stealth Fintech | London | Up to £180k + Equity

London Area, United Kingdom
Maze
Founding Site Reliability Engineer | Stealth Fintech | London | Up to £200k + Equity | London (Min 4 days per week in office) Maze is partnering with a stealth-mode startup that's rebuilding core banking from the ground up. They’re creating the world’s first open-source, AI-native "Thin Ledger"—set to replace legacy infrastructure at Tier … for our next-gen ledger infrastructure Scale multi-region Kubernetes environments across cloud & on-prem Harden distributed systems (Kafka, Redis, CockroachDB) for global banking workloads Lead our AI-powered SRE approach: observability, remediation, and auto-response Enforce zero-trust, multi-tenant security and compliance (SOC2, ISO 27001) Define IaC foundations (Terraform, GitOps, Helm) What We're Looking For: Expert with … Kubernetes and Distributed Systems Experience building production infrastructure at scale (multi-region, high-availability) Extensive experience building both on-Prem & Cloud infrastructure at scale from scratch. Strong SRE mindset: SLOs, SLIs, incident response AI-curious or AI-native: excited to build agent-powered ops Someone who is currently hands on (not someone primarily focused on strategy & people management) Passion for More ❯
Posted:

Principal Site Reliability Engineer | Stealth Fintech | London | Up to £180k + Equity

london, south east england, united kingdom
Maze
Founding Site Reliability Engineer | Stealth Fintech | London | Up to £200k + Equity | London (Min 4 days per week in office) Maze is partnering with a stealth-mode startup that's rebuilding core banking from the ground up. They’re creating the world’s first open-source, AI-native "Thin Ledger"—set to replace legacy infrastructure at Tier … for our next-gen ledger infrastructure Scale multi-region Kubernetes environments across cloud & on-prem Harden distributed systems (Kafka, Redis, CockroachDB) for global banking workloads Lead our AI-powered SRE approach: observability, remediation, and auto-response Enforce zero-trust, multi-tenant security and compliance (SOC2, ISO 27001) Define IaC foundations (Terraform, GitOps, Helm) What We're Looking For: Expert with … Kubernetes and Distributed Systems Experience building production infrastructure at scale (multi-region, high-availability) Extensive experience building both on-Prem & Cloud infrastructure at scale from scratch. Strong SRE mindset: SLOs, SLIs, incident response AI-curious or AI-native: excited to build agent-powered ops Someone who is currently hands on (not someone primarily focused on strategy & people management) Passion for More ❯
Posted:

Principal Site Reliability Engineer | Stealth Fintech | London | Up to £180k + Equity

slough, south east england, united kingdom
Maze
Founding Site Reliability Engineer | Stealth Fintech | London | Up to £200k + Equity | London (Min 4 days per week in office) Maze is partnering with a stealth-mode startup that's rebuilding core banking from the ground up. They’re creating the world’s first open-source, AI-native "Thin Ledger"—set to replace legacy infrastructure at Tier … for our next-gen ledger infrastructure Scale multi-region Kubernetes environments across cloud & on-prem Harden distributed systems (Kafka, Redis, CockroachDB) for global banking workloads Lead our AI-powered SRE approach: observability, remediation, and auto-response Enforce zero-trust, multi-tenant security and compliance (SOC2, ISO 27001) Define IaC foundations (Terraform, GitOps, Helm) What We're Looking For: Expert with … Kubernetes and Distributed Systems Experience building production infrastructure at scale (multi-region, high-availability) Extensive experience building both on-Prem & Cloud infrastructure at scale from scratch. Strong SRE mindset: SLOs, SLIs, incident response AI-curious or AI-native: excited to build agent-powered ops Someone who is currently hands on (not someone primarily focused on strategy & people management) Passion for More ❯
Posted:

Principal Site Reliability Engineer | Stealth Fintech | London | Up to £180k + Equity

london (city of london), south east england, united kingdom
Maze
Founding Site Reliability Engineer | Stealth Fintech | London | Up to £200k + Equity | London (Min 4 days per week in office) Maze is partnering with a stealth-mode startup that's rebuilding core banking from the ground up. They’re creating the world’s first open-source, AI-native "Thin Ledger"—set to replace legacy infrastructure at Tier … for our next-gen ledger infrastructure Scale multi-region Kubernetes environments across cloud & on-prem Harden distributed systems (Kafka, Redis, CockroachDB) for global banking workloads Lead our AI-powered SRE approach: observability, remediation, and auto-response Enforce zero-trust, multi-tenant security and compliance (SOC2, ISO 27001) Define IaC foundations (Terraform, GitOps, Helm) What We're Looking For: Expert with … Kubernetes and Distributed Systems Experience building production infrastructure at scale (multi-region, high-availability) Extensive experience building both on-Prem & Cloud infrastructure at scale from scratch. Strong SRE mindset: SLOs, SLIs, incident response AI-curious or AI-native: excited to build agent-powered ops Someone who is currently hands on (not someone primarily focused on strategy & people management) Passion for More ❯
Posted:

HPC Engineer

London, United Kingdom
Red - The Global SAP Solutions Provider
infrastructure using Infrastructure-as-Code (IaC) tools such as Terraform . Develop, deliver, and support advanced research computing services and applications . Apply Site Reliability Engineering (SRE) principles to ensure high availability, performance, and reliability across HPC environments. Troubleshoot and resolve complex technical challenges affecting both the platform and user workloads. Essential Skills and Experience 10+ … years of hands-on experience designing, operating, or engineering large-scale computing environments (HPC, HTC, or Big Compute). Proven ability to drive innovation and integrate emerging technologies into HPC solutions. Administration experience with cluster and workload management software (eg, Slurm , LSF , Grid Engine ). Strong knowledge of Linux system administration , TCP/IP Networking , and storage systems . More ❯
Employment Type: Contract
Rate: GBP Annual
Posted:

Platform Engineer Azure, Linux, DataDog Experience

London, South East, England, United Kingdom
Hybrid / WFH Options
Michael Page Technology
issues. Ensure adherence to SLAs and help improve operational support efficiency. Participate in on-call rotations to provide 24/7 platform coverage. Continuously optimize monitoring, alerting, and platform reliability processes. Demonstrate a "can do" attitude, with flexibility to work occasional overtime when incidents extend beyond normal working hours. Profile Required … Skills & Qualifications Bachelor's degree in Computer Science, Information Technology, or related field (or equivalent work experience). Proven experience in technical support, site reliability engineering (SRE), or platform operations. Strong knowledge of Linux/Unix and Windows environments. Familiarity with cloud platforms (AWS, Azure, GCP). Hands-on experience with CI/CD tools (Jenkins, GitHub More ❯
Employment Type: Full-Time
Salary: £60,000 - £90,000 per annum
Posted:

Senior Cloud Architect (GCP)

London Area, United Kingdom
Anson McCade
to join a leading technology and innovation consultancy, supporting UK public sector clients in their cloud transformation journeys. This role sits within a highly skilled team dedicated to designing, engineering, and optimising Google Cloud Platform (GCP ) solutions that power large-scale, mission-critical systems. The successful candidate will play a key role in shaping cloud strategy, driving architectural excellence … technical architecture and delivery of Google Cloud solutions for public sector organisations. Design, deploy, and operate secure, scalable, and high-performing GCP environments. Provide technical leadership and mentorship to engineering teams to ensure successful project delivery. Apply deep knowledge of Google Cloud architecture and engineering to deliver enterprise-grade solutions that meet both functional and non-functional requirements. … networking (TCP/IP, subnets, load balancing, DNS). A track record of leading small technical teams, providing guidance and mentorship. Experience in site reliability engineering (SRE) or IT operations, including incident response and troubleshooting. Strong problem-solving and innovation skills, with evidence of delivering technical improvements or new ways of working. More ❯
Posted:

Senior Cloud Architect (GCP)

City of London, London, United Kingdom
Anson McCade
to join a leading technology and innovation consultancy, supporting UK public sector clients in their cloud transformation journeys. This role sits within a highly skilled team dedicated to designing, engineering, and optimising Google Cloud Platform (GCP ) solutions that power large-scale, mission-critical systems. The successful candidate will play a key role in shaping cloud strategy, driving architectural excellence … technical architecture and delivery of Google Cloud solutions for public sector organisations. Design, deploy, and operate secure, scalable, and high-performing GCP environments. Provide technical leadership and mentorship to engineering teams to ensure successful project delivery. Apply deep knowledge of Google Cloud architecture and engineering to deliver enterprise-grade solutions that meet both functional and non-functional requirements. … networking (TCP/IP, subnets, load balancing, DNS). A track record of leading small technical teams, providing guidance and mentorship. Experience in site reliability engineering (SRE) or IT operations, including incident response and troubleshooting. Strong problem-solving and innovation skills, with evidence of delivering technical improvements or new ways of working. More ❯
Posted:

Senior Cloud Architect (GCP)

london, south east england, united kingdom
Anson McCade
to join a leading technology and innovation consultancy, supporting UK public sector clients in their cloud transformation journeys. This role sits within a highly skilled team dedicated to designing, engineering, and optimising Google Cloud Platform (GCP ) solutions that power large-scale, mission-critical systems. The successful candidate will play a key role in shaping cloud strategy, driving architectural excellence … technical architecture and delivery of Google Cloud solutions for public sector organisations. Design, deploy, and operate secure, scalable, and high-performing GCP environments. Provide technical leadership and mentorship to engineering teams to ensure successful project delivery. Apply deep knowledge of Google Cloud architecture and engineering to deliver enterprise-grade solutions that meet both functional and non-functional requirements. … networking (TCP/IP, subnets, load balancing, DNS). A track record of leading small technical teams, providing guidance and mentorship. Experience in site reliability engineering (SRE) or IT operations, including incident response and troubleshooting. Strong problem-solving and innovation skills, with evidence of delivering technical improvements or new ways of working. More ❯
Posted:

Senior Cloud Architect (GCP)

slough, south east england, united kingdom
Anson McCade
to join a leading technology and innovation consultancy, supporting UK public sector clients in their cloud transformation journeys. This role sits within a highly skilled team dedicated to designing, engineering, and optimising Google Cloud Platform (GCP ) solutions that power large-scale, mission-critical systems. The successful candidate will play a key role in shaping cloud strategy, driving architectural excellence … technical architecture and delivery of Google Cloud solutions for public sector organisations. Design, deploy, and operate secure, scalable, and high-performing GCP environments. Provide technical leadership and mentorship to engineering teams to ensure successful project delivery. Apply deep knowledge of Google Cloud architecture and engineering to deliver enterprise-grade solutions that meet both functional and non-functional requirements. … networking (TCP/IP, subnets, load balancing, DNS). A track record of leading small technical teams, providing guidance and mentorship. Experience in site reliability engineering (SRE) or IT operations, including incident response and troubleshooting. Strong problem-solving and innovation skills, with evidence of delivering technical improvements or new ways of working. More ❯
Posted:

Senior Cloud Architect (GCP)

london (city of london), south east england, united kingdom
Anson McCade
to join a leading technology and innovation consultancy, supporting UK public sector clients in their cloud transformation journeys. This role sits within a highly skilled team dedicated to designing, engineering, and optimising Google Cloud Platform (GCP ) solutions that power large-scale, mission-critical systems. The successful candidate will play a key role in shaping cloud strategy, driving architectural excellence … technical architecture and delivery of Google Cloud solutions for public sector organisations. Design, deploy, and operate secure, scalable, and high-performing GCP environments. Provide technical leadership and mentorship to engineering teams to ensure successful project delivery. Apply deep knowledge of Google Cloud architecture and engineering to deliver enterprise-grade solutions that meet both functional and non-functional requirements. … networking (TCP/IP, subnets, load balancing, DNS). A track record of leading small technical teams, providing guidance and mentorship. Experience in site reliability engineering (SRE) or IT operations, including incident response and troubleshooting. Strong problem-solving and innovation skills, with evidence of delivering technical improvements or new ways of working. More ❯
Posted:

Site Reliability Engineer - High Frequency Trading

City Of London, England, United Kingdom
Hamilton Barnes 🌳
markets interests you, this could be the perfect opportunity to take your career to the next level! About the role: You will play a crucial role in ensuring the reliability, performance, and efficiency the companies trading platforms. This is not your average DevOps role - this position focuses on site reliability, where you'll be troubleshooting, supporting traders … support new trading systems, continuously improving the infrastructure. • Drive automation and operational excellence by leveraging your Linux expertise, Kubernetes, and Python scripting skills. • Monitor and ensure high availability and reliability of trading applications while being on top of system alerts and incidents. Key Requirements: • 1-5 years working experience • Background working in the financial services sector, ideally supporting traders … Solid experience with Linux Systems administration and troubleshooting. • Hands-on experience with Kubernetes for container orchestration. • Proficient in Python scripting for automation and system management. • A mindset focused on site reliability and performance. • Strong troubleshooting skills and a proactive approach to problem-solving. Salary: Up to £90,000 base salary Lucrative bonus scheme Company perks/benefits Location More ❯
Posted:

Site Reliability Engineer - High Frequency Trading

london, south east england, united kingdom
Hamilton Barnes 🌳
markets interests you, this could be the perfect opportunity to take your career to the next level! About the role: You will play a crucial role in ensuring the reliability, performance, and efficiency the companies trading platforms. This is not your average DevOps role - this position focuses on site reliability, where you'll be troubleshooting, supporting traders … support new trading systems, continuously improving the infrastructure. • Drive automation and operational excellence by leveraging your Linux expertise, Kubernetes, and Python scripting skills. • Monitor and ensure high availability and reliability of trading applications while being on top of system alerts and incidents. Key Requirements: • 1-5 years working experience • Background working in the financial services sector, ideally supporting traders … Solid experience with Linux Systems administration and troubleshooting. • Hands-on experience with Kubernetes for container orchestration. • Proficient in Python scripting for automation and system management. • A mindset focused on site reliability and performance. • Strong troubleshooting skills and a proactive approach to problem-solving. Salary: Up to £90,000 base salary Lucrative bonus scheme Company perks/benefits Location More ❯
Posted:
Site Reliability Engineering
10th Percentile
£57,500
25th Percentile
£68,125
Median
£77,500
75th Percentile
£91,250
90th Percentile
£115,000