City of London, London, United Kingdom Hybrid / WFH Options
Thurn Partners
Their cutting-edge technology enables rapid response and efficient opportunity capture. The company fosters innovation and encourages hands-on implementation of infrastructure-supporting applications. Role Summary: As a Senior SiteReliabilityEngineer , you will play a pivotal role in greenfield application platforms. Your responsibilities include designing, developing, and maintaining these platforms. You’ll collaborate with various departments … including business, development, IT, and trading support teams. Staying up-to-date with emerging technologies and industry trends is essential. Required Experience: 4+ years working in a SiteReliability Engineering, DevOps or Platform engineering environment. Proficiency in one or more of the following languages: Python, Go, C++, C#. Experience with Linux administration . Familiarity with observability tools such More ❯
london (city of london), south east england, united kingdom Hybrid / WFH Options
Thurn Partners
Their cutting-edge technology enables rapid response and efficient opportunity capture. The company fosters innovation and encourages hands-on implementation of infrastructure-supporting applications. Role Summary: As a Senior SiteReliabilityEngineer , you will play a pivotal role in greenfield application platforms. Your responsibilities include designing, developing, and maintaining these platforms. You’ll collaborate with various departments … including business, development, IT, and trading support teams. Staying up-to-date with emerging technologies and industry trends is essential. Required Experience: 4+ years working in a SiteReliability Engineering, DevOps or Platform engineering environment. Proficiency in one or more of the following languages: Python, Go, C++, C#. Experience with Linux administration . Familiarity with observability tools such More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Digital Realty (UK) Limited
Position Title: SiteReliabilityEngineer, Interconnection Service and Network Delivery Location: Hybrid: Austin, Dallas, Boston, Ashburn, Atlanta, London, or Amsterdam Your role In this role, you will be responsible for deploying and maintaining all Digital Realty interconnection fabric network infrastructure. The ideal candidate can demonstrate a unique blend of network engineering, network operations, and software understanding through More ❯
markets interests you, this could be the perfect opportunity to take your career to the next level! About the role: You will play a crucial role in ensuring the reliability, performance, and efficiency the companies trading platforms. This is not your average DevOps role - this position focuses on sitereliability, where you'll be troubleshooting, supporting traders … support new trading systems, continuously improving the infrastructure. • Drive automation and operational excellence by leveraging your Linux expertise, Kubernetes, and Python scripting skills. • Monitor and ensure high availability and reliability of trading applications while being on top of system alerts and incidents. Key Requirements: • 1-5 years working experience • Background working in the financial services sector, ideally supporting traders … Solid experience with Linux Systems administration and troubleshooting. • Hands-on experience with Kubernetes for container orchestration. • Proficient in Python scripting for automation and system management. • A mindset focused on sitereliability and performance. • Strong troubleshooting skills and a proactive approach to problem-solving. Salary: Up to £90,000 base salary Lucrative bonus scheme Company perks/benefits Location More ❯
london (city of london), south east england, united kingdom
Hamilton Barnes 🌳
markets interests you, this could be the perfect opportunity to take your career to the next level! About the role: You will play a crucial role in ensuring the reliability, performance, and efficiency the companies trading platforms. This is not your average DevOps role - this position focuses on sitereliability, where you'll be troubleshooting, supporting traders … support new trading systems, continuously improving the infrastructure. • Drive automation and operational excellence by leveraging your Linux expertise, Kubernetes, and Python scripting skills. • Monitor and ensure high availability and reliability of trading applications while being on top of system alerts and incidents. Key Requirements: • 1-5 years working experience • Background working in the financial services sector, ideally supporting traders … Solid experience with Linux Systems administration and troubleshooting. • Hands-on experience with Kubernetes for container orchestration. • Proficient in Python scripting for automation and system management. • A mindset focused on sitereliability and performance. • Strong troubleshooting skills and a proactive approach to problem-solving. Salary: Up to £90,000 base salary Lucrative bonus scheme Company perks/benefits Location More ❯
enabling large corporations to manage complex infrastructure projects, we provide exceptional service while staying at the forefront of cloud technology advancements. Role Description This is a full-time on-site role 3 days a week minimum in Kings Cross London. We are seeking a skilled SiteReliabilityEngineer with a strong focus on Google Cloud Platform … and respond to cloud incidents using incident.io, ensuring timely resolution. Use JIRA to log, track, and prioritize support tickets and workflow tasks. Monitor and maintain cloud infrastructure for performance, reliability, and security. Collaborate with teams to identify and implement solutions to technical challenges. Assist in deploying, configuring, and optimising GCP resources. Create and maintain documentation for troubleshooting processes and More ❯
london (city of london), south east england, united kingdom
WALT Labs
enabling large corporations to manage complex infrastructure projects, we provide exceptional service while staying at the forefront of cloud technology advancements. Role Description This is a full-time on-site role 3 days a week minimum in Kings Cross London. We are seeking a skilled SiteReliabilityEngineer with a strong focus on Google Cloud Platform … and respond to cloud incidents using incident.io, ensuring timely resolution. Use JIRA to log, track, and prioritize support tickets and workflow tasks. Monitor and maintain cloud infrastructure for performance, reliability, and security. Collaborate with teams to identify and implement solutions to technical challenges. Assist in deploying, configuring, and optimising GCP resources. Create and maintain documentation for troubleshooting processes and More ❯
with a modern, full-stack platform that delivers logs, metrics, traces, and security monitoring — cutting costs by up to 70% while boosting efficiency. They are looking for a Lead SRE to own and elevate our Alerting & Incident Management platform . You’ll be the driving force behind reliability, customer satisfaction, and product excellence — ensuring smooth alert management, fewer engineering … experience by speeding up alert resolution and reducing interruptions for engineers. Build solutions to common pain points, shaping roadmaps, documentation, and technical knowledge. Develop benchmarking tools to improve performance, reliability, and scalability. Stay ahead of incident management trends to drive new workflows and product improvements. Mentor teams and lead with clear, impactful communication. What We’re Looking For 5+ … platform experience (PagerDuty, OpsGenie, etc. a plus). Solid technical foundation with cloud/distributed systems. Excellent communicator, comfortable working across US/IL time zones. Bonus: leadership experience, SRE/DevOps background, knowledge of SLO/SLA practices. More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Fintech (in Stealth Mode)
and enhancements, while also being responsible for their reliability, performance, and stability in production. This hybrid role merges software engineering with a deep SiteReliability Engineering (SRE) mindset. You'll combine software, systems, and cloud engineering expertise to solve complex production challenges, drive automation, and ensure our services are both innovative and exceptionally dependable. You should have … this role. Own the availability, scalability, latency, and performance of mission-critical services, managing and upholding a 99.999% SLA. Design, write, and deliver software and automation to improve system reliability and reduce manual procedures. Build robust automation for monitoring, and proactive issue detection to prevent problem recurrence. Lead the incident management process, conduct blameless post-mortems, and drive corrective More ❯
london (city of london), south east england, united kingdom Hybrid / WFH Options
Fintech (in Stealth Mode)
and enhancements, while also being responsible for their reliability, performance, and stability in production. This hybrid role merges software engineering with a deep SiteReliability Engineering (SRE) mindset. You'll combine software, systems, and cloud engineering expertise to solve complex production challenges, drive automation, and ensure our services are both innovative and exceptionally dependable. You should have … this role. Own the availability, scalability, latency, and performance of mission-critical services, managing and upholding a 99.999% SLA. Design, write, and deliver software and automation to improve system reliability and reduce manual procedures. Build robust automation for monitoring, and proactive issue detection to prevent problem recurrence. Lead the incident management process, conduct blameless post-mortems, and drive corrective More ❯