Join our team as a MongoDB SiteReliability Engineer, where you'll be at the forefront of designing and maintaining robust, high-performance systems that power critical financial services. In this dynamic and fast-paced environment, your role will be essential to ensuring our infrastructure remains resilient, secure, and scalable. You'll work on automating operations, enhancing system … If you're motivated by solving, multi-layered problems and building systems that perform reliably amid shifting priorities, we encourage you to apply. To be successful as a MongoDB SiteReliability Engineer, you should have experience with: Working in SiteReliabilityEngineering, DevOps, and MongoDB administration in financial services. Using MongoDB features like replicaset, sharding More ❯
Join us as a Senior SiteReliability Engineer - Oracle where you'll spearhead the evolution of our digital landscape, driving innovation and excellence.This role will include: applying software engineering techniques, automation, and best practices in incident response, ensuring the reliability, availability, and scalability of the systems, platforms, and technology through themTo be successful as a Senior … SiteReliability Engineer - Oracle you should have experience with:Oracle Enterprise manager (OEM), Oracle Internet Directory (OID),Oracle database Performance Tuning – SMEDeep understanding of LDAP protocols and directory services.SQL OptimizationStrong skills in scripting languages (e.g., Python, Bash) to automate repetitive tasks and knowledge of configuration management tools (e.g., Ansible, Puppet, Chef).Expertise in setting up and maintaining monitoring … business acumen strategic thinking and digital and technology, as well as job-specific technical skillsThis role will be based in our Knutsford campus. Purpose of the roleTo apply software engineering techniques, automation, and best practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. AccountabilitiesAvailability, performance, and scalability of More ❯
Join us as a Senior SiteReliability Engineer - Oracle where you'll spearhead the evolution of our digital landscape, driving innovation and excellence. This role will include: applying software engineering techniques, automation, and best practices in incident response, ensuring the reliability, availability, and scalability of the systems, platforms, and technology through them To be successful as … a Senior SiteReliability Engineer - Oracle you should have experience with: Oracle Enterprise manager (OEM), Oracle Internet Directory (OID),Oracle database Performance Tuning - SME Deep understanding of LDAP protocols and directory services. SQL Optimization Strong skills in scripting languages (e.g., Python, Bash) to automate repetitive tasks and knowledge of configuration management tools (e.g., Ansible, Puppet, Chef). Expertise … strategic thinking and digital and technology, as well as job-specific technical skills This role will be based in our Knutsford campus. Purpose of the role To apply software engineering techniques, automation, and best practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. Accountabilities Availability, performance, and scalability More ❯
Nottingham, Nottinghamshire, United Kingdom Hybrid / WFH Options
Commify Group
us and be part of our success story! Role Summary In the role of SiteReliability Engineer at Commify, you will be an integral part of our SRE team. Your focus will be on ensuring that our products and platforms perform at their best, understanding how our software interacts with both physical and Cloud infrastructure to deliver exceptional … Maintaining high levels of system performance through monitoring and performance tuning Implementing scalability and fault tolerance Automating processes and improving operational efficiencies Troubleshooting application and middleware challenges Collaborating with engineering teams to support high-throughput production environments Building and maintaining robust deployment pipelines What essentials are we looking for? Proficiency with Microsoft Azure Strong expertise in Terraform, App Services … and Kubernetes Fluent in both written and spoken English A genuine passion for reliability in systems Experience in creating and modifying Terraform deployments Prior experience in an operations role, ideally as a SiteReliability Engineer Ability to work cross-functionally, take ownership of tasks, and prioritize effectively Excellent communication and collaboration skills Experience with monitoring solutions (e.g. More ❯
Splunk SiteReliability Engineer/Migration Specialist - Fixed Term Contract 6-12 Months Job Summary: The Splunk SRE/Migration Specialist is responsible for leading and executing the migration of data, dashboards, alerts, and configurations from Splunk systems to Elasticsearch. This role involves deep technical expertise in Splunk architecture, data ingestion, and observability tools, along with strong project … models and recreate in Kibana. Incident Response Ensure the smooth functioning of Splunk platform across BT maintaining the Splunk’s infrastructure in Production & Non-Production environments. To support Splunk SRE & Application teams in investigating incidents following established procedures. Upgrades: Keep the Splunk components to the latest version applicable and carry out the necessary pre & post upgrade checks accordingly. Change Requests … security measures and ensure compliance with relevant standards and best practices. Skills and Qualifications: Hands-on experience with enterprise-level monitoring tools and applications, and familiarity with DevOps/SRE’s best practices. Proven experience with Splunk and Elasticsearch (ELK Stack). Familiarity with containerized environments (Docker, Kubernetes). Proficiency in Unix/Linux systems, Networking protocols, and possess strong More ❯
Birmingham, West Midlands, United Kingdom Hybrid / WFH Options
Anson Mccade
security accreditation standards in the public sector (e.g. ISO27001, Cyber Essentials+). SC Clearance (must be active and valid at time of starting). Experience with platform reliability (SRE), service resilience, and scaling services in secure or sensitive environments. Desirable: Experience contributing to and leading large public sector or defence bids/tenders (e.g. G-Cloud, DOS, RM6116) . More ❯
Embark on a transformative journey as an Engineering Manager Production Network Engineering at Barclays, where you'll spearhead the evolution of our digital landscape, driving innovation and excellence. You'll harness describe an aspect of the team or role technology to revolutionise our digital offerings, ensuring unparalleled customer experiences. Production Network Engineering is a new mission aligned … operating model within the newly formed Network Product Design & Engineering domain at Barclays. Network Product Design & Engineering is responsible for the design, build and run of the underlying strategic network infrastructure platforms and the over-the-top connectivity products and services that enable colleagues, guests and partners. To be successful in this role as an Engineering Manager … including requirements analysis, platform selection, technical architecture design, service design & development, testing and deployment at scale, including proficiency in Agile Methodologies Scrum/Kanban, backlog and workflow mgmt. and SRE specific reporting (MTTR, deployment frequency, SLO). You may be assessed on the key critical skills relevant for success in role, such as risk and controls, change and transformation, business More ❯
Embark on a transformative journey as an Infrastructure Engineer Production Network Engineering at Barclays, where you spearhead the evolution of our digital landscape, driving innovation and excellence. You'll harness technology to revolutionise our digital offerings, ensuring unparalleled customer experiences. Production Network Engineering is a new mission aligned operating model within the newly formed Network Product Design & Engineering domain at Barclays. Network Product Design & Engineering is responsible for the design, build and run of the underlying strategic network infrastructure platforms and the over-the-top connectivity products and services that enable systems, colleagues, guests and partners To be successful in this role as an Infrastructure Engineer - Production Network Engineering, you should possess the following skillsets … including requirements analysis, platform selection, technical architecture design, service design & development, testing and deployment at scale, including proficiency in Agile Methodologies Scrum/Kanban, backlog and workflow mgmt. and SRE specific reporting (MTTR, deployment frequency, SLO). You may be assessed on the key critical skills relevant for success in role, such as risk and controls, change and transformation, business More ❯
Embark on a transformative journey as an Infrastructure Engineer Production Network Engineering at Barclays, where you spearhead the evolution of our digital landscape, driving innovation and excellence. You'll harness technology to revolutionise our digital offerings, ensuring unparalleled customer experiences.Production Network Engineering is a new mission aligned operating model within the newly formed Network Product Design & Engineering domain at Barclays. Network Product Design & Engineering is responsible for the design, build and run of the underlying strategic network infrastructure platforms and the over-the-top connectivity products and services that enable systems, colleagues, guests and partnersTo be successful in this role as an Infrastructure Engineer – Production Network Engineering, you should possess the following skillsets:Extensive … including requirements analysis, platform selection, technical architecture design, service design & development, testing and deployment at scale, including proficiency in Agile Methodologies Scrum/Kanban, backlog and workflow mgmt. and SRE specific reporting (MTTR, deployment frequency, SLO.You may be assessed on the key critical skills relevant for success in role, such as risk and controls, change and transformation, business acumen strategic More ❯
Join us as a Senior PostgreSQL SRE at Barclays where you'll effectively monitor and maintain the bank's critical technology infrastructure and resolve more complex technical issues, whilst minimizing disruption to operations. In this role you will assume a key technical leadership role. You will shape the direction of our database administration, ensuring our technological approaches are innovative and … aligned with the Bank's business goals. You will guide high-impact projects to completion, collaborate with management, and implement SRE practices using software engineering and database administration to address infrastructure and operational challenges at scale. To be successful as a Senior PostgreSQL SRE , you should have: Strong experience as a Principal Level Database Administrator, with a focus on … PostgreSQL A Proven track record of implementing and leading SRE practices across large organizations or complex teams. Extensive hands-on experience on Containers and Kubernetes In depth experience with DevOps automation tools such as Code versioning (git), JIRA, Ansible, database CI/CD tools and their implementation. Some other highly valued skills may include: Expert expertise with scripting languages (e.g. More ❯
Design and implement scalable, secure cloud environments (primarily Azure) using Terraform and modern DevOps tools. Contribute to automation, monitoring, and system reliability improvements. Guide incident management and promote SRE best practices (SLAs, SLOs, SLIs). Mentor engineers and collaborate on platform roadmaps. Evolve CI/CD pipelines, GitHub workflows, and operational playbooks. Foster a culture of automation, security, and … accountability. Ensure effective Agile delivery practices. Qualifications Proven leadership in DevOps or SRE teams. Deep expertise in Azure cloud services and infrastructure provisioning. Strong Terraform and IaC experience. Knowledge of Docker, Kubernetes, AKS. Proficient in CI/CD tools (e.g., GitHub Actions). Experience with distributed systems, monitoring tools (e.g., Prometheus, Grafana), and caching (e.g., Redis). Strong communication and … cross-functional collaboration skills. Desirable Skills Familiarity with SRE principles and incident management tools (e.g., PagerDuty). Understanding of DevOps security practices and FinOps. Experience with MACH architecture, CDN optimization, and service mesh/API gateways. If you're interested please get in touch ASAP More ❯
Caching technologies - Redis, GridGain, Apache Ignite Programming languages: Java, Python, Go Lang Container orchestration/Cloud platform: RedHat Openshift/AWS/Azure DevOps tools - Ansible, Chef, Kubernetes, GitLab SRE logging & Monitoring Tools - ELK stack, Grafana, Prometheus, Open Telemetry Other highly valued skills include: Strong understanding of Agile application development methodology. Strong knowledge of API development/principles Collaborating with … the development teams to build scalable and resilient systems. Aligning cross functional teams on priorities and deliverables. Driving automation to enhance operational efficiency Promoting the culture of SRE practices to continuously measure and improve You may be assessed on the key critical skills relevant for success in role, such as risk and controls, change and transformation, business acumen strategic thinking … and digital and technology, as well as job-specific technical skills. This role will be based out of our Knutsford campus. Purpose of the role To apply software engineering techniques, automation, and best practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. Accountabilities Availability, performance, and scalability of More ❯
API development/principlesCollaborating with the development teams to build scalable and resilient systems.Aligning cross functional teams on priorities and deliverables.Driving automation to enhance operational efficiencyPromoting the culture of SRE practices to continuously measure and improveYou may be assessed on the key critical skills relevant for success in role, such as risk and controls, change and transformation, business acumen strategic … thinking and digital and technology, as well as job-specific technical skills.This role will be based out of our Knutsford campus. Purpose of the roleTo apply software engineering techniques, automation, and best practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. AccountabilitiesAvailability, performance, and scalability of systems and … system resilience.Monitoring and optimisation of system performance and resource usage, identify and address bottlenecks, and implement best practices for performance tuning.Collaboration with development teams to integrate best practices for reliability, scalability, and performance into the software development lifecycle, and work closely with other teams to ensure smooth and efficient operations.Stay informed of industry technology trends and innovations, and actively More ❯
Join us as a PostgreSQL SRE at Barclays where you'll effectively monitor and maintain the bank’s critical technology infrastructure and resolve more complex technical issues, whilst minimizing disruption to operations. In this role you will assume a key technical leadership role. You will shape the direction of our database administration, ensuring our technological approaches are innovative and aligned … with the Bank’s business goals.To be successful as a PostgreSQL SRE, you should have:Experience as a Database Administrator, with a focus on PostgreSQL and similar database technologies such as Oracle or MS-SQL.A background in implementing and leading SRE practices across large organizations or complex teams.Hands-on experience on Containers and KubernetesExperience with DevOps automation tools such as … thinking and digital and technology, as well as job-specific technical skillsThis role can be based in our London, Knutsford or Glasgow, locations. Purpose of the roleTo apply software engineering techniques, automation, and best practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. AccountabilitiesAvailability, performance, and scalability of More ❯
Join us as a PostgreSQL SRE at Barclays where you'll effectively monitor and maintain the bank's critical technology infrastructure and resolve more complex technical issues, whilst minimizing disruption to operations. In this role you will assume a key technical leadership role. You will shape the direction of our database administration, ensuring our technological approaches are innovative and aligned … with the Bank's business goals. To be successful as a PostgreSQL SRE, you should have: Experience as a Database Administrator, with a focus on PostgreSQL and similar database technologies such as Oracle or MS-SQL. A background in implementing and leading SRE practices across large organizations or complex teams. Hands-on experience on Containers and Kubernetes Experience with DevOps … and digital and technology, as well as job-specific technical skills This role can be based in our Knutsford, or Glasgow, locations. Purpose of the role To apply software engineering techniques, automation, and best practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. Accountabilities Availability, performance, and scalability More ❯
effectively monitor and maintain the bank’s critical technology infrastructure and resolve more multi-faceted technical issues, whilst minimizing disruption to operations. In this role you will apply software engineering techniques, automation, and best practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them.To be successful as a Database … should have:Experience in Ansible for database server configurationsExperience in devops automation tools such as git, JIRA and database CI/CD tools A good working knowledge of coaching SRE practices across large organisation or multifaceted teamsA working knowledge of MongoDBSome other highly valued skills may include:Practical experience of containers and KubernetesExperience with scripting languages (e.g. python, bash) for … applications and data systems, using hardware, software, networks, and cloud computing platforms as required with the aim of ensuring that the infrastructure is reliable, scalable, and secure. Ensure the reliability, availability, and scalability of the systems, platforms, and technology through the application of software engineering techniques, automation, and best practices in incident response. AccountabilitiesBuild Engineering: Development, delivery More ❯
effectively monitor and maintain the bank’s critical technology infrastructure and resolve more multi-faceted technical issues, whilst minimizing disruption to operations. In this role you will apply software engineering techniques, automation, and best practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them.To be successful as a Database … should have:Experience in Ansible for database server configurationsExperience in devops automation tools such as git, JIRA and database CI/CD tools A good working knowledge of coaching SRE practices across large organisation or multifaceted teamsA working knowledge of MongoDBSome other highly valued skills may include:Practical experience of containers and KubernetesExperience with scripting languages (e.g. python, bash) for … business acumen strategic thinking and digital and technology, as well as job-specific technical skillsThis role will be based in our Knutsford location. Purpose of the roleTo apply software engineering techniques, automation, and best practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. AccountabilitiesAvailability, performance, and scalability of More ❯
effectively monitor and maintain the bank's critical technology infrastructure and resolve more multi-faceted technical issues, whilst minimizing disruption to operations. In this role you will apply software engineering techniques, automation, and best practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. To be successful as a … have: Experience in Ansible for database server configurations Experience in devops automation tools such as git, JIRA and database CI/CD tools A good working knowledge of coaching SRE practices across large organisation or multifaceted teams A working knowledge of MongoDB Some other highly valued skills may include: Practical experience of containers and Kubernetes Experience with scripting languages (e.g. … strategic thinking and digital and technology, as well as job-specific technical skills This role will be based in our Knutsford location. Purpose of the role To apply software engineering techniques, automation, and best practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. Accountabilities Availability, performance, and scalability More ❯
a solid grasp of testing tools, observability solutions to drive the integration and automation of capacity controls across our technology estate. This role blends solution design with hands-on engineering and technical communications. The ideal candidate brings knowledge of risk and controls frameworks, diverse application integration solutions, solution design and driving technology best practice through communities of practice and … disperse testing and observability systems into a governance and reporting platform. Assess existing tooling, data flows and operational practices to identify control gaps and propose remediation strategies. Collaborate with engineering, infrastructure, architecture, and risk teams to validate technical designs and implementation plans. Define reusable technical patterns and tooling strategies that enhance operational readiness and control sustainability. Lead tooling assessment … execution activities. Support release, warranty and operational handover of platforms. Required Skills & Experience: 10+ years in engineering, infrastructure, or technical architecture roles in complex technology environments. Familiarity with SRE disciplines such as observability, service-level indicators/objectives (SLIs/SLOs), and automation of operational tasks. Demonstrated ability to interpret and apply control requirements in technical design contexts. Experience More ❯