Manchester, Lancashire, United Kingdom Hybrid / WFH Options
WorksHub
the infrastructure and deployment of those applications. We are actively expanding our Manchester born SRE function, which aims to advance our knowledge and innovation globally in areas such as Observability, Reliability and Availability. We have the autonomy to choose the technologies and processes that help us achieve our objectives. So each team leverages the technology that fits their needs best. More ❯
Some Other Highly Valued Skills May Include Extensive experience in deploying and operating SaaS products on premise with deep understanding of SRE principles and expertise in leveraging monitoring and observability tools like ELK and/or AWS cloud watch. Understanding of configuration management tools like Chef. Understanding of Agile Software Engineering practices and experience of working in dynamic teams following More ❯
Social network you want to login/join with: At Ansys, we are reimagining the way complex simulation software is deployed—across cloud, on-prem, and hybrid environments. We’re looking for a Cloud Platform Engineer who specializes in Infrastructure More ❯
Social network you want to login/join with: At Ansys, we are reimagining the way complex simulation software is deployed—across cloud, on-prem, and hybrid environments. We’re looking for a Cloud Platform Engineer who specializes in Infrastructure More ❯
Social network you want to login/join with: At Ansys, we are reimagining the way complex simulation software is deployed—across cloud, on-prem, and hybrid environments. We’re looking for a Cloud Platform Engineer who specializes in Infrastructure More ❯
Social network you want to login/join with: At Ansys, we are reimagining the way complex simulation software is deployed—across cloud, on-prem, and hybrid environments. We’re looking for a Cloud Platform Engineer who specializes in Infrastructure More ❯
Social network you want to login/join with: At Ansys, we are reimagining the way complex simulation software is deployed—across cloud, on-prem, and hybrid environments. We’re looking for a Cloud Platform Engineer who specializes in Infrastructure More ❯
Social network you want to login/join with: col-narrow-left Client: Ansys Location: portsmouth, hampshire, United Kingdom Job Category: Other - EU work permit required: Yes col-narrow-right Job Views: 3 Posted: 31.05.2025 Expiry Date: 15.07.2025 col-wide More ❯
At Ansys, we are reimagining the way complex simulation software is deployed—across cloud, on-prem, and hybrid environments. We’re looking for a Cloud Platform Engineer who specializes in Infrastructure as Code (IaC) to lead the design, implementation, and More ❯
Social network you want to login/join with: At Ansys, we are reimagining the way complex simulation software is deployed—across cloud, on-prem, and hybrid environments. We’re looking for a Cloud Platform Engineer who specializes in Infrastructure More ❯
Social network you want to login/join with: Senior Cloud Engineer, crawley, west sussex col-narrow-left Client: Ansys Location: crawley, west sussex, United Kingdom Job Category: Other - EU work permit required: Yes col-narrow-right Job Views: 3 More ❯
At Ansys, we are reimagining the way complex simulation software is deployed—across cloud, on-prem, and hybrid environments. We’re looking for a Cloud Platform Engineer who specializes in Infrastructure as Code (IaC) to lead the design, implementation, and More ❯
Social network you want to login/join with: At Ansys, we are reimagining the way complex simulation software is deployed—across cloud, on-prem, and hybrid environments. We’re looking for a Cloud Platform Engineer who specializes in Infrastructure More ❯
Software Engineer – Infrastructure/Observability Focus (Inside IR35, up to £750/day) Are you a software engineer who enjoys solving infrastructure problems - someone who sits comfortably between development and platform engineering? Join a small, high-performing engineering team on a project at a top-tier quant fund, helping to build out observability capabilities. This is not an SRE or … software that improves system visibility What they’re looking for: Strong software engineering skills (Go or Python preferred) Experience working in or alongside platform engineering teams Familiarity with modern observability tools (Grafana, Prometheus, etc.) Comfort working across both code and infrastructure – but this is not a pure ops/SRE role If you've worked in finance that would be More ❯
standards, guardrails and best practices, and implement improvements to processes and tooling to ensure engineering excellence. You’ll have a strong understanding of operational requirements, and ensure Scalability, Resiliency, Observability, Security, Cost and Maintainability are at the forefront of all engineering activities. This specific project will involve Real Time Payments value stream, Form 3 gateway set-up and setting up … Knowledge of software engineering including testing frameworks, and secure code delivery (ie SCA, SAST, DAST Networks/Security/Middleware & Apps Scripting/Coding (Bash, Python) End to End Observability solutions (logging, monitoring, alerting) Knowledge of SRE principles and practices More ❯
South East London, England, United Kingdom Hybrid / WFH Options
SiSU Health UK
compliance to relevant standards. ️ In this role, you'll: Own and optimise our AWS stack and CI/CD pipelines Maintain our Health Station platform, networks and infrastructure Improve observability, resilience, compliance, security and uptime Collaborate across product and tech teams to resolve issues and ensure platform stability Be part of the team designing our cloud platform Work at the …/recovery, security, RTO, RPO) Experience with IaC tools (CDK, Cloudformation, Terraform) Experience with shell scripting and using JavaScript or Python Experience with GitHub, CloudWatch, CodePipeline Experience with system observability, managing SLAs and resolving issues Excellent problem-solving skills and passion to learn Strong communication skills and critical thinking ability Ability to work effectively in a Team, confidently, autonomously and More ❯
issues like memory leaks, outages, and flaky services. Your focus will include : Leading incident management, post-mortems, and blameless RCAs Building scalable, resilient microservices with the dev teams Uplifting observability Improving alerting, monitoring, and system-level metrics Driving better SLOs, SLIs, and overall uptime The stack includes Kubernetes , Terraform , AWS , Python , and modern CI/CD tools, and it's … be considered and learn more! What you’ll bring : Experience in high-traffic digital or eCommerce platforms 5+ years in SRE/DevOps roles; strong background in incident response Observability, automation, and infrastructure as code expertise Leadership skills - mentoring others or leading from the front More ❯
Leeds, England, United Kingdom Hybrid / WFH Options
Boost Talent ltd
scalable compute, and a rapidly evolving platform — all in a high-growth, high-performance environment. Role Highlights: Salary: Up to £85,000 Location: Hybrid (Leeds) Tech: AWS, Kubernetes, IaC, Observability, CI/CD Focus: Infrastructure scalability, automation, performance, and reliability What You’ll Be Doing: Designing and scaling cloud-native infrastructure with a strong emphasis on AWS (EKS, MSK, DynamoDB … RDS) Driving automation with Terraform/OpenTofu , scripting (Python, PowerShell), and GitLab CI Enabling observability across services using Prometheus , OpenTelemetry , and custom tooling Implementing secure deployment practices, performance tuning, and cost optimisation strategies Collaborating with engineers and data scientists to ensure platform reliability and rapid iteration What We’re Looking For: Solid experience as an SRE or DevOps Engineer in More ❯
area of the product component or the system in aggregate and at scale. Specific domains include Workload Management (Kubernetes, Ray, and so on); Cloud Development (Cloud Infrastructure Automation); Management & Observability (open source and commercial monitoring, observability and DCIM solutions) Skills and Experience Essential Strong relevant programming experience Python/Go/C infrastructure-as-code scripting or related to the … of the products under test: Containerisation (e.g. Docker), Virtualisation and Provisioning, Workload and job scheduling (e.g. Kubernetes, Ray) on high core-count machines and rack-scale installations, Management and Observability (e.g. Prometheus, OpenTelemetry, DataDog, Splunk, etc.). 10+ years of relevant experience related to quality assurance/testing teams. Experience with the Atlassian suite and CI/CD platforms such More ❯
area of the product component or the system in aggregate and at scale. Specific domains include Workload Management (Kubernetes, Ray, and so on); Cloud Development (Cloud Infrastructure Automation); Management & Observability (open source and commercial monitoring, observability and DCIM solutions) Skills and Experience [Essential] Strong relevant programming experience Python/Go/C infrastructure-as-code scripting or related to the … of the products under test: Containerisation (e.g. Docker), Virtualisation and Provisioning, Workload and job scheduling (e.g. Kubernetes, Ray) on high core-count machines and rack-scale installations, Management and Observability (e.g. Prometheus, OpenTelemetry, DataDog, Splunk, etc.). 10+ years of relevant experience related to quality assurance/testing teams. Experience with the Atlassian suite and CI/CD platforms such More ❯
configuration and version control with Bitbucket and Artifactory. • Ensure seamless integration of monitoring solutions into CI/CD pipelines. • Develop and maintain alerting, logging, and tracing solutions to support observability best practices. • Optimize monitoring configurations for performance, cost, and scalability. • Troubleshoot monitoring issues and provide root cause analysis for system incidents. • Document monitoring architectures, automation scripts, and best practices. • Stay …/CD Pipelines: Proficiency in Jenkins and version control tools like Bitbucket. •Configuration Management: Experience with Ansible for automated deployments. •Artifact Management: Knowledge of Artifactory for package management. •Monitoring & Observability: Experience configuring and managing Splunk, Dynatrace, and OpenTelemetry (OTel). • Scripting & Automation: Proficiency in Python, and bash shell for automation. •Problem-Solving: Strong troubleshooting skills for diagnosing monitoring and performance More ❯
Sheffield, England, United Kingdom Hybrid / WFH Options
Ada Meher
will involve working across multiple projects simultaneously using Agile practices. The ideal candidate will also have knowledge around or an interest in learning other key DevOps areas such as observability, CI/CD pipeline development and config management. The company have a personal development budget available to all staff for such courses and accreditations, to help aid the rounded development … Expert experience working with AWS services and architecture Strong experience working with Terraform (or other IaC technology) Proven team leadership experience Experience working with CI/CD pipelines (Jenkins), Observability (Grafana) & Configuration Management (Ansible, Chef, Puppet) Excellent communication skills are a must Along with an excellent work/life balance, this company also offer a strong financial incentive to those More ❯
practices into network design and operations. Performance & Optimization Optimise infrastructure for low-latency, high-availability trading. Collaborate with developers on performance tuning and troubleshoot cross-region network issues. Monitoring & Observability Set up and manage monitoring, alerting, and logging using OpenSearch, Prometheus, and Grafana. Develop dashboards for real-time network insights. Partner with developers, traders, and data engineers to align infrastructure … and Ansible for infrastructure and network automation Experience with GitLab CI/CD pipelines and GitOps principles Knowledge of container orchestration platforms like Kubernetes (EKS) Experience with monitoring and observability tools including OpenSearch, Prometheus, and Grafana Understanding of security best practices and AWS CIS Benchmark standards Experience with low-latency network design and optimization Strong verbal communication and documentation skills More ❯
across multiple squads to ensure our platform is scalable, secure, and designed for rapid deployment and operational excellence. You'll contribute to the development and automation of cloud infrastructure, observability systems, CI/CD pipelines, and event-based services that power key parts of our product ecosystem. About Suits Me Suits Me is a multi-award-winning, ethical fintech dedicated … pipelines (e.g. GitHub Actions) to enable rapid and reliable delivery of services Contributing to the design of scalable and secure platform components that enable developer productivity Building and improving observability tooling (e.g. CloudWatch, Grafana) to support rapid detection and resolution of issues Collaborating with developers and stakeholders across squads to understand infrastructure needs and ensure best practices are applied Writing More ❯
with business objectives to meet evolving customer needs. As an influential figure in our company, the Systems Reliability Engineering Senior Lead will spearhead initiatives to automate infrastructure, enhance system observability, and drive the transformation of our IT operations. What are we looking for? Bachelor’s degree in Information Technology, Computer Science, Business Management, or a related field 7+ years of … with a proven track record in issue and problem management in a multicultural and global environment. Proficiency with cloud platforms and experience in configuration management, scripting, and monitoring and observability tools. Understanding of business processes, change management, and ITSM processes, including service level management and reporting. Excellent communication skills and the ability to work collaboratively with cross-functional teams. What More ❯