deployments as well as accurate health monitoring through all our clients, both new and old. The person in this role will join the SiteReliability Engineering team (SRE). The main role of the SRE team is to facilitate the scalability of Dayshape and allow us to meet the demands of an increasing client base. What you'll … do Lead initiatives to enhance Dayshape's ability to scale our cloud platform Maintain and improve our cloud estate in Azure Improve SRE and other teams' working lives through automation of manual tasks Lead in making the deployment of Dayshape more scalable Increase our knowledge sharing of SRE across the organisation Improve the observability of Dayshape through reporting and tool More ❯
Cambourne, Cambridgeshire, United Kingdom Hybrid / WFH Options
Remotestar
to gemstone supplies They have a presence in London, Hong Kong, Amsterdam, and as well in Mumbai and now in New York in 2001. About the role : As the SRE Manager, you will play a critical role in ensuring the reliability, scalability, and performance of our infrastructure and services through both direct technical contribution along with team building and … tooling. Drive automation initiatives to streamline operational workflows and improve efficiency. Develop and maintain tools, scripts, and dashboards to monitor system health, performance, and reliability. Build a first class SRE team. Through a combination of leading by example, coaching and mentoring, mould the team would want to have around you. Provide leadership and guidance to the SRE team, fostering a … culture of collaboration, innovation, and continuous improvement. RESPONSIBILITIES: Proven experience in a senior or lead SRE role, with a strong track record of building and maintaining highly reliable infrastructure and services. Expertise in incident management, including incident response, resolution, and post-mortem analysis. Proficiency in monitoring, alerting, and observability tools such as Prometheus, Grafana, ELK stack or Datadog. Experience with More ❯
along the way! Job Summary We have built Curve Dental into an industry-leading provider of beautiful cloud software for the dental industry. Who We're Looking For Our SiteReliability Engineers (SREs) are passionate about automation and its power to streamline the deployment and operation of software. They collaborate closely with developers to support a wide range More ❯
hybrid environment. This opportunity is ideal for candidates who thrive in fast-paced environments and are eager to contribute to a growing organisation. If you have a passion for SiteReliability Engineering and the desire to make a meaningful impact, we encourage you to apply. The role is set to commence immediately, and while core benefits are not More ❯
Saffron Walden, Essex, South East, United Kingdom Hybrid / WFH Options
EMBL-EBI
ground-breaking research that improves human and planetary health. As part of our small but highly skilled IT Operations team, youll play a critical role in ensuring the availability, reliability and efficiency of services that support scientists and collaborators worldwide. This is a hands-on, varied position where youll combine deep technical expertise with a service-oriented mindset. If … migration to O365. Core Services Jointly develop and maintain services such as transfer services, software-defined object storage, authentication/authorisation tools, and our Request Tracker ticketing system. Monitoring & Reliability Maintain and evolve distributed Check_mk monitoring while helping shape a long-term monitoring strategy. Automation & Orchestration Work with Gerrit, Foreman, RPM repositories and Puppet to deploy, update and … days annual leave per year, in addition to eight bank holidays Relocation package including installation grant (as applicable) Campus life: Free shuttle bus to and from work, on-site library, subsidised on-site gym and cafeteria, casual dress code, extensive sports and social club activities (on campus and remotely) Family benefits: On-site nursery, child sick leave More ❯
Description Summary : We're looking for an experienced Platform/Infrastructure Engineer with a strong Microsoft Azure background and deep knowledge of Kubernetes. You'll play a key role in designing, deploying, and maintaining infrastructure and services that power our products. This role requires hands-on experience with automation, modern IaC practices, CI/CD, and maintaining production-grade … and maintain Infrastructure as Code using Terraform or OpenTofu Develop scripts and automation to support infrastructure and deployment workflows - PowerShell is preferred Collaborate with engineering teams to support platform reliability and enable delivery Maintain visibility and awareness through monitoring and logging tools such as Datadog, Azure Monitor, App Insights etc. Support incident resolution and participate in an on-call … such as Azure Monitor, App Insights, or similar Clear communicator with the ability to collaborate across cross-functional teams Nice to Have: Azure certifications (e.g. Azure Administrator, Azure DevOps Engineer) Experience with GitOps and tools such as ArgoCD or Flux Familiarity with Configuration as Code tools like Ansible or Puppet Exposure to large-scale distributed systems or high-volume More ❯
Leeds, Yorkshire, United Kingdom Hybrid / WFH Options
William Hill PLC
bets per second, accommodate 20 million users, and process 160 terabytes a day. You can be sure there are many challenges waiting for you. The Leeds-based, highly skilled SRE team are primarily managing the Kubernetes clusters within the organisation for multiple departments, and through a DevOps culture enabling those departments with observability and pipelines for their business applications. Their … job is to guarantee system reliability, performance, and supportability with a strong engineering emphasis on building autonomous solutions that deliver value to end-users early, often, and fast. We are also open to candidates that come from a Software Engineering background - As long as you show the willingness to learn, we are more than happy to invest the time … Storage Platforms, developing any necessary integration Supporting Incidents - Assist Incident Management in Production all the way through impact assessment, service restoration and post-mortems, including being part of the SRE on call rotation Sharing Knowledge - Enabling development teams within the DevOps Culture, promoting best practice, documenting runbooks, presenting talks, working with production engineering teams Who we are looking for: We More ❯
requirements, being transactional, analytical, non-relational, or data warehouse. The wider DBA team is the technology owner of multiple RDBMS and NoSQL technologies, is responsible to strategize, advance, and engineer enterprise solution for automated build and patching and efficient administration, that meet security, availability, performance, fast delivery and reporting requirements, and to support projects and products using these technologies. … JOB SCOPE As an engineer in this team, the individual will be involved in the build and run activities related to NoSQL database technology and infrastructure. The role will contribute to solution engineering and support as well as being responsible for delivering database projects, maintaining running systems and performing problem analysis and troubleshooting. The individual should be well versed More ❯
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
Arm Limited
enable innovation across the business. To support that mission, we're growing our Data Engineering Platform team and investing deeply in modern, reliable infrastructure. We're seeking a DevOps engineer with hands-on expertise in containerisation, orchestration, cloud platforms, continuous-delivery pipelines, and cloud at scale. In this role, you'll partner with the team to develop new functionality … cloud deployments (AWS-first) using Terraform and platform tooling Improve security posture across IAM, secrets, and networking Help the team ship faster and safer by mentoring on DevOps and SRE practices We're solving for reliability, compliance, performance, and speed - at once. You'll be key to making it work. Required Skills: Knowledge of one or more programming languages … highly leveraged platform, enabling hundreds of engineers to use critical data systems with confidence. You'll have ownership, impact, and a seat at the table as we define how SRE and platform thinking shape our next-generation data infrastructure. If you're looking to scale not just systems but the capabilities of the engineers around you, this is your team. More ❯
As an SRE/Infrastructure Engineer, is responsible for designing, implementing, and maintaining the cloud infrastructure our platform sits on, as well as the monitoring and deployment services that enable the rest of engineering to develop, deliver and maintain our platform services. You will also be instrumental in both monitoring and incident response, playing a key role in ensuring … maximum reliability and minimal downtime. You will collaborate with teams across the company, including developers, customer support, product owners and sales, to ensure the reliability, scalability, and performance of our platform. Infrastructure Design and Implementation: assist or lead in the design, deployment, and operation of the infrastructure components required to support our applications and services. This includes managed More ❯
Tewkesbury, Gloucestershire, England, United Kingdom
Sanderson
SC Cleared Platform Engineer/SRE - Permanent Location: Tewkesbury or Bromsgrove, need to be in Tewkesbury 2-3 times a week if Bromsgrove The role is 5 days on-site in the office Salary: £60,000 - £80,000 + Package Clearance: Must have active SC Clearance and be eligible for DV This role is with a growing technology More ❯