enabling large corporations to manage complex infrastructure projects, we provide exceptional service while staying at the forefront of cloud technology advancements. Role Description This is a full-time on-site role 3 days a week minimum in Kings Cross London. We are seeking a skilled SiteReliabilityEngineer with a strong focus on Google Cloud Platform … and respond to cloud incidents using incident.io, ensuring timely resolution. Use JIRA to log, track, and prioritize support tickets and workflow tasks. Monitor and maintain cloud infrastructure for performance, reliability, and security. Collaborate with teams to identify and implement solutions to technical challenges. Assist in deploying, configuring, and optimising GCP resources. Create and maintain documentation for troubleshooting processes and More ❯
enabling large corporations to manage complex infrastructure projects, we provide exceptional service while staying at the forefront of cloud technology advancements. Role Description This is a full-time on-site role 3 days a week minimum in Kings Cross London. We are seeking a skilled SiteReliabilityEngineer with a strong focus on Google Cloud Platform … and respond to cloud incidents using incident.io, ensuring timely resolution. Use JIRA to log, track, and prioritize support tickets and workflow tasks. Monitor and maintain cloud infrastructure for performance, reliability, and security. Collaborate with teams to identify and implement solutions to technical challenges. Assist in deploying, configuring, and optimising GCP resources. Create and maintain documentation for troubleshooting processes and More ❯
enabling large corporations to manage complex infrastructure projects, we provide exceptional service while staying at the forefront of cloud technology advancements. Role Description This is a full-time on-site role 3 days a week minimum in Kings Cross London. We are seeking a skilled SiteReliabilityEngineer with a strong focus on Google Cloud Platform … and respond to cloud incidents using incident.io, ensuring timely resolution. Use JIRA to log, track, and prioritize support tickets and workflow tasks. Monitor and maintain cloud infrastructure for performance, reliability, and security. Collaborate with teams to identify and implement solutions to technical challenges. Assist in deploying, configuring, and optimising GCP resources. Create and maintain documentation for troubleshooting processes and More ❯
enabling large corporations to manage complex infrastructure projects, we provide exceptional service while staying at the forefront of cloud technology advancements. Role Description This is a full-time on-site role 3 days a week minimum in Kings Cross London. We are seeking a skilled SiteReliabilityEngineer with a strong focus on Google Cloud Platform … and respond to cloud incidents using incident.io, ensuring timely resolution. Use JIRA to log, track, and prioritize support tickets and workflow tasks. Monitor and maintain cloud infrastructure for performance, reliability, and security. Collaborate with teams to identify and implement solutions to technical challenges. Assist in deploying, configuring, and optimising GCP resources. Create and maintain documentation for troubleshooting processes and More ❯
london (city of london), south east england, united kingdom
WALT Labs
enabling large corporations to manage complex infrastructure projects, we provide exceptional service while staying at the forefront of cloud technology advancements. Role Description This is a full-time on-site role 3 days a week minimum in Kings Cross London. We are seeking a skilled SiteReliabilityEngineer with a strong focus on Google Cloud Platform … and respond to cloud incidents using incident.io, ensuring timely resolution. Use JIRA to log, track, and prioritize support tickets and workflow tasks. Monitor and maintain cloud infrastructure for performance, reliability, and security. Collaborate with teams to identify and implement solutions to technical challenges. Assist in deploying, configuring, and optimising GCP resources. Create and maintain documentation for troubleshooting processes and More ❯
Lead Site Relability Engineer – EdTech – AWS, Kubernetes, Terraform Oliver Bernard are currently working with an established EdTech, based in London, looking to expand their SRE function with a Lead level engineer. The incoming profile will have the chance to work on a variety of greenfield projects, and be able to help grow and scale their SRE practices whilst … competes heavily with their sizeable competitors. To be considered for this opening you’ll need at least 7-8 years’ experience, encompassing the following: Recent experience in a Lead SRE capacity, coaching/mentoring other engineers Hands-On Cloud experience with AWS and AWS Services Expert knowledge of Containerisation with Docker and Kubernetes Strong Infrastructure as Code experience with Terraform … Engineers, able to offer £80-90K, and operates a remote first model (with only quarterly visits required). Please apply here to register interest in this opportunity. Lead Site Relability Engineer – EdTech – AWS, Kubernetes, Terraform More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Oliver Bernard
Lead Site Relability Engineer – EdTech – AWS, Kubernetes, Terraform Oliver Bernard are currently working with an established EdTech, based in London, looking to expand their SRE function with a Lead level engineer. The incoming profile will have the chance to work on a variety of greenfield projects, and be able to help grow and scale their SRE practices whilst … competes heavily with their sizeable competitors. To be considered for this opening you’ll need at least 7-8 years’ experience, encompassing the following: Recent experience in a Lead SRE capacity, coaching/mentoring other engineers Hands-On Cloud experience with AWS and AWS Services Expert knowledge of Containerisation with Docker and Kubernetes Strong Infrastructure as Code experience with Terraform … Engineers, able to offer £80-90K, and operates a remote first model (with only quarterly visits required). Please apply here to register interest in this opportunity. Lead Site Relability Engineer – EdTech – AWS, Kubernetes, Terraform More ❯
Position : SiteReliabilityEngineer Manager Location : Manchester Industry : Financial Services Salary : Up to £72,000 + discretionary bonus Key Responsibilities: Lead and develop a high-performing SRE team, driving collaboration and continuous improvement. Oversee system reliability, incident management, and root cause analysis. Define and implement automation, monitoring, and alerting strategies. Partner globally to align on uptime … and resiliency goals. Promote SRE best practices and operational excellence. About You: Proven leadership in production support or software delivery. Strong knowledge of cloud (preferably AWS) and automation. Excellent communicator and problem solver with a client-focused mindset. AWS certifications and software engineering experience are a plus. For more information, contact Maria Ciprini at Harrington Starr, or click "Apply" to More ❯
warrington, cheshire, north west england, united kingdom
Harrington Starr
Position : SiteReliabilityEngineer Manager Location : Manchester Industry : Financial Services Salary : Up to £72,000 + discretionary bonus Key Responsibilities: Lead and develop a high-performing SRE team, driving collaboration and continuous improvement. Oversee system reliability, incident management, and root cause analysis. Define and implement automation, monitoring, and alerting strategies. Partner globally to align on uptime … and resiliency goals. Promote SRE best practices and operational excellence. About You: Proven leadership in production support or software delivery. Strong knowledge of cloud (preferably AWS) and automation. Excellent communicator and problem solver with a client-focused mindset. AWS certifications and software engineering experience are a plus. For more information, contact Maria Ciprini at Harrington Starr, or click "Apply" to More ❯
bolton, greater manchester, north west england, united kingdom
Harrington Starr
Position : SiteReliabilityEngineer Manager Location : Manchester Industry : Financial Services Salary : Up to £72,000 + discretionary bonus Key Responsibilities: Lead and develop a high-performing SRE team, driving collaboration and continuous improvement. Oversee system reliability, incident management, and root cause analysis. Define and implement automation, monitoring, and alerting strategies. Partner globally to align on uptime … and resiliency goals. Promote SRE best practices and operational excellence. About You: Proven leadership in production support or software delivery. Strong knowledge of cloud (preferably AWS) and automation. Excellent communicator and problem solver with a client-focused mindset. AWS certifications and software engineering experience are a plus. For more information, contact Maria Ciprini at Harrington Starr, or click "Apply" to More ❯
Position : SiteReliabilityEngineer Manager Location : Financial Services Salary : Up to £72,000 + discretionary bonus Oversee system reliability, incident management, and root cause analysis. Define and implement automation, monitoring, and alerting strategies. Proven leadership in production support or software delivery. Strong knowledge of cloud (preferably AWS) and automation. AWS certifications and software engineering experience More ❯
and secure, while continuously pushing performance boundaries across multiple cloud and bare-metal environments. Skills & Experience Degree in Engineering, Computer Science, or a related quantitative field. 5+ years in SiteReliability, DevOps, or Platform Engineering. Deep experience with Kubernetes Proven AWS expertise across multi-region environments. Strong Ansible experience, including OS-independent roles and dynamic inventories. Excellent Linux More ❯
and secure, while continuously pushing performance boundaries across multiple cloud and bare-metal environments. Skills & Experience Degree in Engineering, Computer Science, or a related quantitative field. 5+ years in SiteReliability, DevOps, or Platform Engineering. Deep experience with Kubernetes Proven AWS expertise across multi-region environments. Strong Ansible experience, including OS-independent roles and dynamic inventories. Excellent Linux More ❯
and secure, while continuously pushing performance boundaries across multiple cloud and bare-metal environments. Skills & Experience Degree in Engineering, Computer Science, or a related quantitative field. 5+ years in SiteReliability, DevOps, or Platform Engineering. Deep experience with Kubernetes Proven AWS expertise across multi-region environments. Strong Ansible experience, including OS-independent roles and dynamic inventories. Excellent Linux More ❯
and secure, while continuously pushing performance boundaries across multiple cloud and bare-metal environments. Skills & Experience Degree in Engineering, Computer Science, or a related quantitative field. 5+ years in SiteReliability, DevOps, or Platform Engineering. Deep experience with Kubernetes Proven AWS expertise across multi-region environments. Strong Ansible experience, including OS-independent roles and dynamic inventories. Excellent Linux More ❯
london (city of london), south east england, united kingdom
Cititec
and secure, while continuously pushing performance boundaries across multiple cloud and bare-metal environments. Skills & Experience Degree in Engineering, Computer Science, or a related quantitative field. 5+ years in SiteReliability, DevOps, or Platform Engineering. Deep experience with Kubernetes Proven AWS expertise across multi-region environments. Strong Ansible experience, including OS-independent roles and dynamic inventories. Excellent Linux More ❯
reliable, scalable, and secure, while continuously pushing performance boundaries across multiple cloud and bare-metal environments. Degree in Engineering, Computer Science, or a related quantitative field. 5+ years in SiteReliability, DevOps, or Platform Engineering. Proven AWS expertise across multi-region environments. Strong Ansible experience, including OS-independent roles and dynamic inventories. Excellent Linux (Ubuntu/Debian) administration More ❯
response. Preferred qualifications: Master's degree or PhD in Computer Science, or a related technical field. Experience as a cloud customer. About the job SiteReliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services-both our internally critical and our externally … an ever-watchful eye on our systems capacity and performance. Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. On the SRE team, you'll have the opportunity to manage the complex challenges of scale which are unique to Google Cloud, while using your expertise in coding, algorithms, complexity analysis and large … scale system design. SRE's culture of intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive More ❯
with a long-standing client in the data and analytics space, an organisation known for its technical excellence, collaborative culture, and meaningful impact across sectors. They re scaling their SRE function and looking for a seasoned engineer to join a high-performing team delivering internal applications that power critical operations. This is a 6-month contract (with strong potential … Experience with scalable, cloud-native applications This is a brilliant opportunity to work with a forward-thinking tech team that values autonomy, innovation, and impact. If you re an SRE who thrives in fast-paced environments and loves solving complex problems, I d love to hear from you. Apply now to learn more. More ❯
with a long-standing client in the data and analytics space, an organisation known for its technical excellence, collaborative culture, and meaningful impact across sectors. They’re scaling their SRE function and looking for a seasoned engineer to join a high-performing team delivering internal applications that power critical operations. This is a 6-month contract (with strong potential … Experience with scalable, cloud-native applications This is a brilliant opportunity to work with a forward-thinking tech team that values autonomy, innovation, and impact. If you’re an SRE who thrives in fast-paced environments and loves solving complex problems, I’d love to hear from you. Apply now to learn more. More ❯
My client, a successful quantitative investment manager, is looking for a Senior DevOps engineer to join their ML ops team and to implement testing, development, automation tools, and IT infrastructure for the ML platform team and its users. They are looking for a senior with 10+ years of experience and proficient in AWS and Terraform. Key Responsibilities: Implement testing More ❯
My client, a successful quantitative investment manager, is looking for a Senior DevOps engineer to join their ML ops team and to implement testing, development, automation tools, and IT infrastructure for the ML platform team and its users. They are looking for a senior with 10+ years of experience and proficient in AWS and Terraform. Key Responsibilities: Implement testing More ❯
My client, a successful quantitative investment manager, is looking for a Senior DevOps engineer to join their ML ops team and to implement testing, development, automation tools, and IT infrastructure for the ML platform team and its users. They are looking for a senior with 10+ years of experience and proficient in AWS and Terraform. Key Responsibilities: Implement testing More ❯
Manufacturing, Life Sciences and Healthcare, Technology and Services, Telecom and Media, Retail and CPG, and Public Services. Consolidated revenues as of 12 months ending December 2024 totaled $13.8 billion. SRE for production support of mission critical tokenization platform. Candidate should be strong in ITSM process and hands-on with automation scripting and cloud technologies. Good to have proficiency with: Programming … Java, vert x, Python, Shell Scripting, GO, REST SRE - Kubernetes, Splunk/ELF, Openshift, CI/CD DB Postgres/Couchbase/Oracle Technical Skill Managing production support for mission-critical platforms Implementing and following ITSM processes for incident handling Writing automation scripts using Shell, Python , or Go Deploying and managing Kubernetes clusters in production Operating and troubleshooting OpenShift environments More ❯
crawley, west sussex, south east england, united kingdom
HCLTech
Manufacturing, Life Sciences and Healthcare, Technology and Services, Telecom and Media, Retail and CPG, and Public Services. Consolidated revenues as of 12 months ending December 2024 totaled $13.8 billion. SRE for production support of mission critical tokenization platform. Candidate should be strong in ITSM process and hands-on with automation scripting and cloud technologies. Good to have proficiency with: Programming … Java, vert x, Python, Shell Scripting, GO, REST SRE - Kubernetes, Splunk/ELF, Openshift, CI/CD DB Postgres/Couchbase/Oracle Technical Skill Managing production support for mission-critical platforms Implementing and following ITSM processes for incident handling Writing automation scripts using Shell, Python , or Go Deploying and managing Kubernetes clusters in production Operating and troubleshooting OpenShift environments More ❯