SiteReliability Engineer (SRE) Are you ready to shape the future of a cutting-edge platform? We're looking for a SiteReliability Engineer (SRE) to join our squad dedicated to ensuring our foundation is scalable and robust. Role can be based in either Edinburgh or … scale, optimize performance, and ensure efficient maintenance. SLO/SLA Concepts: Implement and manage Service Level Objectives and Agreements to guarantee our platform's reliability and performance. Infrastructure Management: Use Terraform to manage infrastructure and deployments, ensuring everything runs smoothly and efficiently. CI/CD Proficiency: Work with a more »
London, England, United Kingdom Hybrid / WFH Options
Fastmarkets
equity firm Astorg, a specialist investor in healthcare, software, technology, business services and technology-based industrial companies. Job Description Fastmarkets requires an experienced Senior SiteReliability Engineer with great DevOps and Stake holder management skills. To compliment an worldwide existing team we're looking … for someone to help us modernise our Azure cloud platforms to a cloud native, containerised fully automated deployment pipelines. Reporting to the Head of SRE, the correct candidate will have extensive experience in modernising Azure platforms, excel in Infrastructure and code, as well as being comfortable in more traditional DevOps more »
all infrastructure components. • Continuously evaluate and adopt new technologies to improve efficiency and productivity. You will have: • Proven years of experience in a DevOps, SRE or similar role. • Proficiency in scripting languages such as Python, Bash, or PowerShell. • Hands-on experience with CI/CD tools such as GitLab CI. more »
Berkeley Square - Talent Specialists in IT & Engineering
Network Developer/Network Automation Specialist/Network SRE - Python - Salary to £200k per annum! My client, a leading algorithmic trading company is seeking an experienced Network SRE who understands how networks function at a fundamental level, considers how problems can be solved using a broad array of tools and … running trading and post-trade systems among others. Prior finance knowledge is not required. Candidates must have the following skills/experience: Strong network engineering and architecture skills. Knowledge of switch internals, Ethernet and IP routing, including but not limited to: VPN and tunnelling protocols (IPSec, GRE, MPLS). more »
Introduction In this role, you'll work in one of our IBM Consulting Client Innovation Centers (Delivery Centers), where we deliver deep technical and industry expertise to a wide range of public and private sector clients around the world. Our more »
Manchester Area, United Kingdom Hybrid / WFH Options
bet365
Who we are looking for A SiteReliability Engineer who will develop software solutions, consult with development teams and work with modern telemetry data to maintain and improve the performance of key systems. The sitereliability team provide an increasingly important service to our technology department. … Focusing on application performance, reliability, availability, capacity and health, you will work with other teams across the platform department to help ensure our critical systems are reliable and observable. You will be working to provide solutions to help minimise toil and provide operational efficiency at scale on our critical … systems. This role is eligible for inclusion in the Company’s hybrid working from home policy. Preferred Skills, Qualifications and Experience Excellent knowledge of SRE principles, including the creation and management of effective SLI’s and SLO’s for reliability and customer satisfaction. Knowledge of contemporary observability tools, techniques more »
Hello SiteReliability Engineers! Having an average day? Well, luckily you've come across an opportunity that might just change that. For this one - you will be part of a team that is building & designing a new serverless architecture. Therefore, you will be comfortable deploying with Terraform, while more »
Reigate, England, United Kingdom Hybrid / WFH Options
Client Server
SiteReliability Engineer/SRE Reigate/WFH to £85k Global FinTech is seeking a skilled SiteReliability Engineer/SRE to collaborate across product focussed Agile engineering teams to ensure the reliability, availability and performance of client facing services. Responsibilities will include managing … week for team meet-ups and stakeholder meetings with the other three days work from home. About you: You have experience in a similar SRE/SiteReliability Engineer position You have experience of running 24x7 services in the public cloud - Azure preferred You have experience with observability … and happy to collaborate with senior stakeholders and mentor others What's in it for you: As a SiteReliability Engineer/SRE you will receive a competitive salary plus a range of perks and benefits: Up to £85k salary plus bonus Hybrid working (3 days a week more »
the freedom to try the latest tech and drive adoption of new tools as you see fit. Requirements: Strong experience in a DevOps/SRE/Cloud Engineering position. Great AWS knowledge and experience Experience of large-scale, complex on-prem and Cloud systems Excellent experience of tech such … as Kubernetes and Terraform etc. Solid grounding in software engineering and Linux Excellent communication skills more »
SiteReliability Engineer/SRE (Docker &/or Kubernetes) Duration: Permanent Location: Oxfordshire Package: Competitive salary and package As a SiteReliability Engineer, you will be at the heart of ground-breaking projects, ensuring operational reliability and accelerating code velocity for highly innovative devices. If … you’re passionate about blending systems engineering with software development in a collaborative environment, this role is your gateway to innovation. Why You Should Apply Innovative Environment : Work on the frontier of innovative research and device development. Competitive Salary : Growth Opportunities : Be part of a growing team with room more »
We are seeking a talented and experienced SiteReliability Engineer (SRE) to join our team. As a SRE, you will be instrumental in helping engineer, implement, and maintain our infrastructure to ensure its reliability, scalability, and security. Your role will focus on leveraging infrastructure as code principles … resolve complex infrastructure issues, minimizing downtime and improving system reliability. Mentor and provide guidance to junior team members, fostering their growth and development in SRE practices and principles. Experience Required Must have: Strong proficiency in scripting languages like Ruby, Python or Go for automation and tooling. Experience with infrastructure as more »
A Tier 1 bank is looking for multiple Java Developers with significant cloud infrastructure and SRE experience to join a very high-impact project, working on a pioneering platform affecting millions of customers 🚀 📍 Location: 1 day/week in London 💰 Salary on offer: £85-100k + up to … k. ✅ Must have requirements: Java software engineering background - ideally you would have worked as a Java Developer before shifting your focus to SRE/Platform Engineering, or are still working as a Java Developer extensive experience with AWS, Kubernetes, Terraform, CI/CD tools strong observability experience, ideally more »
of cost-effectiveness, performance and reliability Always motivated team work for the best possible business outcomes Readiness to expand the horizon into adjacent SRE and production monitoring areas Potential infrequent trips to the US facilities for on-site solution deployments Role Requirements: Proven track record of successful software more »
SiteEngineering Manager | Cross-Border Payment Fintech We are working with the leading cross-border payments provider that went through an IPO last year and is now completing an extensive digital transformation. They are looking for a SiteReliability Engineer to join their greenfield team. You … rapidly growing technology function. You will be responsible for keeping their new technology platforms available 24/7/365 by monitoring the Performance, Reliability, Change Management, Incident Response, and Capacity Planning for a number of their core services. Some of their key technologies are: AWS Cloud, Dynatrace, Terraform … to performance, reliability, and scalability and lead the coordination across technology to resolve them Experience Required: 3+ years of platform operations engineering, SRE, DevOps, or similar relevant experience in a B2B environment Experience with application performance monitoring e.g Dynatrace, DataDog Experience of Cloud Migration e.g AWS and Terraform more »
My client, an investment manager specialising in systematic trading are looking to hire a Lead SiteReliability Engineer to help form and lead a new team that will be responsible … for ensuring best in class reliability and performance for their low latency market making systems. This is a unique opportunity to shape the SRE function within the organisation and drive business continuity through the principles of Chaos Engineering to enable any issues to be resolved across their stack. … If you have previous experience driving SRE programs forward within low latency environments and can take on the challenge of leading a brand new team then this is the role for you. Requirements Proven experience in driving SRE programs within low latency environments Experience with trading systems ideally within a more »
SiteReliability Engineer London (Hybrid 2 days a week on site) Permanent £75,000 - £85,000 p/a The Background We are partnered with an innovative IT consultancy based in London but with a global presence who are leading advisors in their industry by creating lasting … a flexible benefits fund. You… In order to be a successful SiteReliability Engineer you will have… Previous experience working as an SRE/at system administrator level In-depth knowledge of Windows Operating Systems and VMware with a good understanding of Linux Operating Systems In depth knowledge … VLAN’s, Routing, Switching) Security (Splunk, APM, SIEM) Login/Monitoring (Splunk, Elastic, Prometheus, PRTG, Netbox, IPAM, CMDB) Mattermost, Atlassian The role As a SiteReliability Engineer you will work on projects relating to application software, operating systems and system management tools as well as maintaining new and more »
junior members of the team and learn from industry leaders. Requirements for the Lead DevOps Engineer 5+ years’ experience working in a DevOps/SRE/Platforms/Infrastructure roles at a technology driven organization Experience with Azure Experience with IaaC such as Terraform Experience with Azure DevOps Experience with more »
implement, and manage scalable AWS environments Automate CI/CD pipelines and deployment processes Collaborate with cross-functional teams to enhance system performance and reliability What We’re Looking For: Proven experience with AWS services and DevOps practices Strong scripting skills (Python, Bash, etc.) Excellent problem-solving and communication … Join: £60k-£75k Innovative projects Adult approach to working hours/Flexibility Ready to take your career to the next level? Apply now! 🌟 DevOps | SRE | Platform Engineer #AWS #DevOps #Engineering #Cloud more »
Leeds, England, United Kingdom Hybrid / WFH Options
Staffworx
Lead Scrum Master for full stack software engineering group of leading blue chip. Home based with occasional office days, 1-2 days week in West Yorkshire, otherwise remote working. Serve as a Chief Scrum Master on cloud based identity program, consisting of multiple scrum teams Mentor and coach the … a professional setting, preferably in consulting or technology as a Senior or Lead Scrum Master Has led Scrum Teams within large Agile Transformations Support SRE teams and operate as a Senior Ideally, you'll also have 2 5 years of SAFe Agile coaching experience and an active SAFe Program Consultant more »
Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to SiteReliabilityEngineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and more »
Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to SiteReliabilityEngineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and more »
Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to SiteReliabilityEngineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and more »
Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to SiteReliabilityEngineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and more »
Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to SiteReliabilityEngineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and more »
Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to SiteReliabilityEngineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and more »