an exciting new opportunity for a Site Reliability Engineer who will work alongside the development, architecture and service management teams. The SRE champions proactive measures and automation to pre-empt disruptions and optimize system performance. This role is instrumental in bridging the gap between development and operations … applying engineering principles to operational challenges to drive continuous improvement and innovation. What you'll be doing: Leading the enhancement of system reliability and scalability, architecting robust solutions that meet the dynamic needs of the business while ensuring security and architectural excellence. Responsible for facilitating technical roadmaps by ensuring … looking for: AWS certification (e.g. AWS Solutions Architect Associate or Professional) or other industry certification is beneficial and desired. Has significant experience in DevOps, SRE implementation and in evolving practices and ways of working through multi-disciplinary teams, business frameworks and culture. Advanced knowledge and hands-on experience with cloud …
About the role: Loftware is expanding its worldwide 24x7 Cloud Operations Team and we are looking for a technically motivated, English-speaking Cloud Operations Site Reliability Engineer with strong cloud-based Linux and Windows knowledge. The Cloud Operations Site Reliability Engineer will be … troubleshooting customer environments for mission-critical application use across the range of cloud platforms used by Loftware, including AWS and Azure. The Cloud Operations Site Reliability Engineer is a team player with a passion for modern technology who is keen to take on … large-scale responsibility for the cloud environment. The Cloud Operations Site Reliability Engineer will work with the rest of the Cloud Operations team and alongside QA and Development to continually improve automated infrastructure and application deployment, to build and maintain reliable cloud infrastructure and services and to …
London, England, United Kingdom Hybrid / WFH Options
Bayside Solutions
Site Reliability Engineer Contract Salary Range: £91,400 - £108,000 per year Location: London, England - Hybrid Role Job Summary: We seek a Site Reliability Engineer to join our team and play a crucial role in ensuring our applications and services' reliability, availability, and … Willingness to adapt and learn new tools and technologies as needed Availability to participate in on-call rotations as required Desired Skills and Experience Site Reliability, Java, AWS, Azure, Kubernetes, Git, CD Bayside Solutions, Inc. may collect your personal information during the position application process. Please reference Bayside …
Site Reliability Engineer – High Frequency Trading – London - £120-150k base + bonus + share options It’s a very cool time to join this high frequency trading firm. They are past the shaky uncertainty phase of a start-up. The core blocks have been built. They …
Site Reliability Engineer – Google Cloud London Excellent Salary & Package including Bonus Key Skills – SRE, GCP (Enterprise Deployments), Helm, Python/Golang/Java, IaC/Automation, Blockchain Technologies, Node Infrastructure, Security Hardening Overview An influential member of a team of highly skilled engineers building out cloud native … as an enabler for the developers and business. Predominantly supporting Java, TypeScript and Python workloads which are built upon open-source software. As an SRE subject matter expert you will: Enable cross-functional teams to rapidly code, build and deliver. Own critical parts of the software development life cycle such … accountable for the cloud native deployment environments across dev, staging and production. Expertise Required: At least 5 years' professional experience in a DevOps/SRE role Google Cloud Expertise - GCP Enterprise Level Deployments, Helm etc. Experience building tooling, scripts or applications to enhance the developer experience. 2+ years current experience …
the projects where you’ll embed with different application teams to roll out new features. From free food to private trainers in their on-site gym, there’s an in-office focus here. However, for the four days on-site, they’ll ensure you’re comfortable. While they … like teams to collaborate on-site, they follow the sun from an incident perspective, so you won’t be needed on-call. Joining a lean but global team, if you want the impact and freedom of a smaller environment, without the out-of-office on-call, happy to provide …
processing data at a scale comparable to Meta and Google! They are on the lookout for multiple Senior Site Reliability Engineers (SRE) to join one of their incredibly talented teams. As a Site Reliability Engineer (SRE), you will play a crucial role in ensuring … the reliability, scalability, and performance of our systems and infrastructure. You will work closely with cross-functional teams to design, implement, and maintain robust and resilient systems, with a focus on automation, monitoring, and incident response. The role: • Working arrangements: Flexible – can be fully remote (UK residents only – unfortunately … automation tools and scripts for deployment, monitoring, and management of infrastructure components. Collaborate with software engineering teams to ensure that applications are designed with reliability, scalability, and performance in mind. Implement and maintain monitoring, alerting, and logging systems to proactively identify and resolve issues before they impact customers. Participate …
Site Reliability Engineer to join them on a major government project that's based 2 days per week in Wokingham. The SRE team have L2 support responsibilities and will lead the triages. You will be trained in and exposed to many different modern technologies (OpenShift/Kubernetes …
roles where you can make a significant impact on the availability, performance, and efficiency of critical services? If you've previously excelled in an SRE or similar operations environment and are looking for your next challenge, we want to hear from you! These opportunities require you to work one day … Role Overview: As part of our client's dedicated Mortgages team, you'll be instrumental in working within a new Site Reliability Engineering (SRE) Function, focusing on enhancing system reliability across key areas such as availability, performance, latency, efficiency, capability, and incident response. This role is crucial as … manage risks effectively. What They're Looking For: Proven experience in software engineering with a strong background in Java or C#. Experience in an SRE function or similar operations environment, excluding purely DevOps, infrastructure, or deployment analyst roles. Familiarity with AWS, Kubernetes, and moving systems from data centers to cloud …
City of London, London, United Kingdom Hybrid / WFH Options
Akkodis
Azure Site Reliability Engineer Akkodis are currently working in partnership with a leading service provider to recruit an experienced Azure Site Reliability Engineer to join a growing team of talented Cloud Engineers providing high-level support and project delivery for a large customer base. … fully remote role and you must be eligible to gain security clearance (do not need to hold currently). The Role As an Azure Site Reliability Engineer you will support the cloud infrastructure used to deliver cloud hosted managed services to customers. You will have a high …
Site Reliability Engineer (x2) Permanent/Remote £50,000 - £85,000 We have an exciting opportunity to join a leading FinTech business as a Site Reliability Engineer. There are currently two available roles. The two Site Reliability Engineers will work closely and collaboratively …
Site Reliability Engineer - Lead, Mentoring, Kubernetes, PaaS, IaaS, SQL, Azure DevOps, CI/CD A leading provider of financial services is seeking two Site Reliability Engineer Leads with a solid and proven background in Azure or GCP. This position will also be based onsite in … Will consider candidates from any of the key vendors across the Cloud: Azure, GCP, and AWS. Kubernetes & troubleshooting, managed services like AKS Using your SRE Attitude (understanding SLI, SLO & SLA) Container Image Management & Security like Aquasec Code Quality & repository Management like SonarQube & NexusQ Service Mesh (Istio) traffic shaping, canary, blue … Unit/Integration/Load Testing Azure Application Gateway & API Management Azure IAM - Identity & Access Management Azure Policy Management & Cloud Security Azure ExpressRoute Site Reliability Engineer - Lead, Mentoring, Kubernetes, PaaS, IaaS, SQL, Azure DevOps, CI/CD McGregor Boyall is an equal opportunity employer and do …
Cheltenham, Gloucestershire, South West, United Kingdom Hybrid / WFH Options
Searchability NS&D Ltd
project) Skills required in Java Spring Boot, Kubernetes & Docker, Elastic, Helm, Linux, Git, CI/CD Who are we? We are recruiting a Senior SRE with enhanced DV Clearance for a prestigious client to work on a portfolio of public and private sector projects. Our client is a global leader … and platforms. You'll experience excellent career progression opportunities to develop your skillset and personal profile in an inclusive culture. What will the Senior SRE be doing? Monitor system metric dashboards using Kibana Diagnosing problems Remedy and debug any issues from system deployment environments Track issues and carry out releases … on LinkedIn, just search for Henry Clay-Davies. I look forward to hearing from you. KEY SKILLS: Site Reliability Engineer/SRE/Senior SRE/Kubernetes/Ansible/Elastic Stack/Elastic/Kibana/Linux/Git/Helm/CI/CD/ …
Site Reliability Engineer The successful candidate will be based in the United Kingdom and must have at least good-years residency to be eligible for security vetting. Some level of travel to the client site in central London or Corsham can be expected, in line with … AWS stack for optimal platform performance. Automation Focus: Patch, update, and automate tasks for maximum efficiency. Incident Lead: Coordinate incident response with L2 and SRE teams. Handover and Reviews: Facilitate daily SRE handovers and post-incident reviews. Reporting and Improvement: Monitor queues, create reports, and implement automations. AWS Knowledge: Expertise …
Nottingham, Nottinghamshire, East Midlands, United Kingdom
Microlise
Lead Engineer SRE When registering to this job board you will be redirected to the online application form. Please ensure that this is completed in full in order that your application can be reviewed. Our Engineering Team is 200 strong, from Apprentice Engineers through to Enterprise Architects, and we're … currently in an exciting period of growth! As our new Lead Engineer, you would be key to maximising this growth through coaching in terms of technical performance, achieving technical evangelism and acting in a leadership role in terms of design review. We provide clear career ladders for each employee … you are looking for a new challenge and have a strong technical background, then we want to hear from you! As our new Lead Site Reliability Engineer, you will be key to maximising the observability of our infrastructure and applications, and to resolving error-prone manual processes …
which involves spending at least two days per week currently, or 40% of our time, at our Bristol office. ABOUT THIS OPPORTUNITY Our Cloud SRE (Site Reliability Engineering) team is looking for an experienced and passionate Engineer with strong hands-on development experience. As a Cloud SRE … the Bank's vision for 2023 and beyond! Specific activities might include: Working with service teams to directly influence and drive the adoption of SRE best practices and ways of working within our microservices; Collaborating with infrastructure engineers to ensure resilience and scalability across the platform; Observing, investigating & fixing service … following to consider you for interview: Background Ideally you'll come from a software engineering or telemetry background and have now moved into an SRE role. Technical Skills Experience working with a broad set of GCP products (or extensive experience with another Public Cloud platform, such as Azure or AWS …
Global technology company is expanding its EMEA customer operations support team and is looking to hire a further 2 Linux operations/SRE Engineers for their Bristol office. Key skills/experience required: A degree in Systems Engineering, Computer Science or related subjects 5+ years of experience administering Linux systems …
Site Reliability Engineer I am seeking … a Site Reliability Engineer for one of the world's fastest-growing social media platforms, with over 900 million daily users. The SRE group come from diverse technical backgrounds (Reliability, Software Engineering and Security Engineering) and have a broad remit ensuring high availability and performance, and currently …
Site Reliability Engineer/SRE/London/Hybrid Remote My client do amazing things with data. They consider themselves experts in all things consumer and location, bringing together cutting-edge analytical techniques, creative thinking and diverse perspectives to drive growth for their client base. They … highly regarded, innovative datasets in the market and their people are the best at manipulating that data to provide insight. Working as the DevOps engineer, you will play a critical role in the development, deployment, and management of software infrastructure. You will collaborate closely with cross-functional teams to … will have: Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience). 4+ years of experience in a DevOps, SRE or similar role. Proficiency in scripting languages such as Python, Bash, or PowerShell. Hands-on experience with CI/CD tools such as GitLab CI. …
Company | Health and Fitness 📏 Size | 400 🧢 Role | Senior Site Reliability Engineer 🪜 Level | Senior ✨ Skills | K8s, Terraform, Honeycomb, AWS 📍 Based | Manchester City Centre 💻 Hybrid | 2 days a week in-office 💰 Offer | up to £70k + Benefits A Scale-up Tech for good business based in Manchester City … Centre is looking for an experienced Site Reliability Engineer to assist with the growing demand for their services. If you're an advocate for monitoring and observability practices who enjoys working closely with product teams to ensure systems are secure, scalable and reliable then this could be the perfect …
infrastructure site reliability engineer who primarily has experience in Windows environments and a specialism in storage. You'd be joining an SRE team that underpins the entirety of the fund's systems, meaning you'll have direct impact on the success of the company. You can also expect … a broad range of exposure and responsibilities from scaling large volumes of research-related data to improving the reliability and speed of the application estate. Primarily we're looking for strong experience in Windows infrastructure engineering, storage, Kubernetes and Python/PowerShell automation. Any additional experience around Prometheus/ …
Site Reliability/Platform Engineer - Leading SaaS provider - Hybrid (London) - Up to £77k p.a. - Really exciting opportunity to join a global organisation at a point of technical change A leading global SaaS provider is seeking a Site Reliability/Platform Engineer to join their …
SRE Engineer should have knowledge of alerting and monitoring tools. The tools can be Splunk, LogDNA, Grafana, AWS CloudWatch. Should have knowledge of CI/CD tools. The tools can be TeamCity, Jenkins, IBM Tool Chain etc. Should have knowledge of APM and observability tools. The …
Stoke-On-Trent, England, United Kingdom Hybrid / WFH Options
bet365
Who we are looking for A Site Reliability Engineering Team Leader, who will help facilitate and drive activity and efforts of the team to deliver effective technical solutions to operational problems. The Site Reliability team works with several sections across the business, ensuring that our critical …
London (city), London, England Hybrid / WFH Options
T Rowe Price
invite you to explore the opportunity to join us and grow your career with us. Job Title: Principal Site Reliability Engineer (SRE) Department: CDO Technology Group Summary: We are seeking a highly motivated and experienced Principal Site Reliability Engineer (SRE) to join the CDO … Technology leadership team to stand up and lead the SRE function within CDO Technology. In this role, you will be responsible for ensuring the availability, latency, performance, efficiency, and stability of our critical infrastructure, which supports a range of data platforms, applications, and services. You will collaborate closely with development … infrastructure, and anticipate significant risks. Work with development teams to review architecture design to ensure high availability and proper disaster recovery strategy Collaborate with reliability and infrastructure engineering team at T Rowe Price to build synergy in tooling for the implementation of observability, tracing, and alerting Qualifications: Bachelor's …