management for Windows workloads Create tooling and automation around the deployment of a customer-specific Windows-based SaaS product Ensure high availability, reliability, and scalability of Windows services. Integrate observability tooling (metrics, logs, traces) into IIS-hosted services Harden Windows infrastructure for security, compliance, and operational best practices Lead incident response for Windows-related systems Contribute to internal documentation and … Windows internals Proven ability to build infrastructure-as-code and CI/CD for Windows environments Comfort wrapping a Windows software product with the surrounding infrastructure, services, automation, and observability required to run it as a SaaS offering. Hands-on experience administering cloud infrastructure or building cloud-native applications (preferably on AWS) Comfortable using AWS EC2 Proficiency with command-line More ❯
and shared infrastructure Identify and resolve architectural bottlenecks in the current data platform and propose improvements that reduce complexity and boost performance Drive initiatives that improve data quality, lineage, observability, and system reliability Influence and Collaborate Across Teams Act as a technical liaison between engineering, product, and analytics teams, ensuring alignment on architecture and data strategy Provide technical leadership and … workloads Familiarity with data governance, privacy, and compliance frameworks Background in customer-centric or product-driven environments (e.g., digital, eCommerce, SaaS) Experience with infrastructure-as-code and data platform observability (e.g., Terraform) What You Can Expect Interesting work - working in a fast-paced and ever-changing industry, new problems and exciting solutions are never too far away. There are always More ❯
Position Summary We are looking for an experienced Systems Engineer with strong Linux and Kubernetes experience to join our Group Engineering - Systems team. You will help design, build and operate modern infrastructure platforms that support continually evolving applications and services. More ❯
in implementing good practice with regards to accessibility (Keyboard support, screen readers, form usability) Knowledge of various front-end architectural patterns E2E Testing experience (Cypress/Playwright) Experience with Observability as a practice (logging, GA tagging, TrackJS, App Insights) If you would be interested please apply below! INDMANS More ❯
in implementing good practice with regards to accessibility (Keyboard support, screen readers, form usability) Knowledge of various front-end architectural patterns E2E Testing experience (Cypress/Playwright) Experience with Observability as a practice (logging, GA tagging, TrackJS, App Insights) If you would be interested please apply below! INDMANS More ❯
Bracknell, Berkshire, South East, United Kingdom Hybrid / WFH Options
Halian Technology Limited
within the last 2 years Strong understanding of CI/CD , Continuous Testing , and Shift Left/Right principles Hands-on technical skills, including TDD , pairing , and experience with observability practices (e.g. logs, metrics, APM) Able to coach and mentor developers in testing and quality ownership Comfortable working in cross-functional teams embedded with engineers Excellent grasp of modern quality More ❯
Reading, Berkshire, South East, United Kingdom Hybrid / WFH Options
Halian Technology Limited
within the last 2 years Strong understanding of CI/CD , Continuous Testing , and Shift Left/Right principles Hands-on technical skills, including TDD , pairing , and experience with observability practices (e.g. logs, metrics, APM) Able to coach and mentor developers in testing and quality ownership Comfortable working in cross-functional teams embedded with engineers Excellent grasp of modern quality More ❯
Oxford, Oxfordshire, South East, United Kingdom Hybrid / WFH Options
Halian Technology Limited
within the last 2 years Strong understanding of CI/CD , Continuous Testing , and Shift Left/Right principles Hands-on technical skills, including TDD , pairing , and experience with observability practices (e.g. logs, metrics, APM) Able to coach and mentor developers in testing and quality ownership Comfortable working in cross-functional teams embedded with engineers Excellent grasp of modern quality More ❯
london, south east england, united kingdom Hybrid / WFH Options
Stax - Deeptech Talent
human and AI-native querying (e.g., Text-to-SQL). Productize the PLuG SDK as a Data Platform: Leverage in-app instrumentation (session replays, logs, engagement events) to power observability, anomaly detection, and customer experience analytics. Build AI-Enabled Interactions: Drive the development of agentic features including natural language queries, AI-generated dashboards, and real-time recommendations—bridging structured data More ❯
slough, south east england, united kingdom Hybrid / WFH Options
Stax - Deeptech Talent
human and AI-native querying (e.g., Text-to-SQL). Productize the PLuG SDK as a Data Platform: Leverage in-app instrumentation (session replays, logs, engagement events) to power observability, anomaly detection, and customer experience analytics. Build AI-Enabled Interactions: Drive the development of agentic features including natural language queries, AI-generated dashboards, and real-time recommendations—bridging structured data More ❯
london (city of london), south east england, united kingdom Hybrid / WFH Options
Stax - Deeptech Talent
human and AI-native querying (e.g., Text-to-SQL). Productize the PLuG SDK as a Data Platform: Leverage in-app instrumentation (session replays, logs, engagement events) to power observability, anomaly detection, and customer experience analytics. Build AI-Enabled Interactions: Drive the development of agentic features including natural language queries, AI-generated dashboards, and real-time recommendations—bridging structured data More ❯
fund administrators, and institutional investors around the world. Working closely with our engineering teams, you'll design and maintain cloud infrastructure, improve our CI/CD pipelines, and enhance observability so we can ship high-quality features quickly and confidently. You'll bring your initiative as well as your technical skills to solve real operational challenges, ensuring our systems are … production and staging environments Build and improve CI/CD pipelines to support rapid, high-quality deployments Monitor and improve system availability, performance, and cost-efficiency Implement and manage observability tools (logging, metrics, tracing) Enhance infrastructure-as-code using AWS CDK and related tools Collaborate with engineers to streamline development workflows and deployment strategies Champion DevOps best practices across the More ❯
quality KPIs that drive accountability and continuous improvement Act as a mentor and coach for engineers and QA professionals, upskilling the org in modern testing practices Improve visibility and observability of test execution and failures Support initiatives to enhance our staging and test environments for reliable internal testing A third level degree in an Information Technology or Computer Science related … a testing or quality engineering capacity Experience with performance and load testing frameworks (e.g., k6, JMeter) Familiarity with cloud-based test environments and infrastructure (AWS preferred) Working knowledge of observability and test reporting tools (e.g., Datadog, Grafana) Experience improving test data strategies and test isolation techniques Contributions to internal tooling or open-source testing frameworks Background in building out quality More ❯
CI/CD pipelineswithGitLab CI or Jenkinsto enable fast, secure, and reliable software delivery. o Champion Kubernetes-based platformsusingAmazon EKSandIstio Service Meshto build scalable, service-oriented architectures. o Drive observability and reliability engineeringthrough proactive monitoring, alerting, and incident response strategies. o Mentor and guide DevOps engineers, fostering a culture of continuous improvement, automation, and operational excellence. o Collaborate cross-functionallywith … We're looking for someone with deep expertise in: oInfrastructure as Code: Terraform, CloudFormation o Security best practices: IAM, KMS, encryption in transit/at rest, DevSecOps o Monitoring & observability: Datadog, Prometheus, Grafana, ELK, or similar What You Bring o 6+ years in DevOps or platform engineering, with experience in a technical lead role. o Proven experience designing and operating More ❯
and maintain shared flows and reusable proxy patterns for authentication, logging, error handling, and traffic control. Monitor and troubleshoot platform issues using Kubernetes tools (kubectl, helm) and integrate with observability platforms (e.g., Cloud Ops, Prometheus, ELK). Collaborate with backend teams, security teams, and infrastructure teams to ensure seamless API adoption and runtime stability. Provide technical leadership, documentation, and mentorship … design, high availability, and TLS configuration. Familiarity with Cassandra (for Apigee Hybrid runtime), cluster scaling strategies , and troubleshooting synchronization or runtime issues . Working knowledge of log management and observability solutions (e.g., Fluent Bit, Splunk, ELK, or Google Cloud Logging). Exposure to multi-region or hybrid cloud deployments , and knowledge of best practices around IP address planning and firewalling More ❯
area of the product component or the system in aggregate and at scale. Specific domains include Workload Management (Kubernetes, Ray, and so on); Cloud Development (Cloud Infrastructure Automation); Management & Observability (open source and commercial monitoring, observability and DCIM solutions) Skills and Experience Essential Strong relevant programming experience Python/Go/C infrastructure-as-code scripting or related to the … of the products under test: Containerisation (e.g. Docker), Virtualisation and Provisioning, Workload and job scheduling (e.g. Kubernetes, Ray) on high core-count machines and rack-scale installations, Management and Observability (e.g. Prometheus, OpenTelemetry, DataDog, Splunk, etc.). 10+ years of relevant experience related to quality assurance/testing teams. Experience with the Atlassian suite and CI/CD platforms such More ❯
in implementing good practice with regards to accessibility (Keyboard support, screen readers, form usability) Knowledge of various front-end architectural patterns E2E Testing experience (Cypress/Playwright) Experience with Observability as a practice (logging, GA tagging, TrackJS, App Insights) If you would be interested please apply below! INDMANS More ❯
in implementing good practice with regards to accessibility (Keyboard support, screen readers, form usability) Knowledge of various front-end architectural patterns E2E Testing experience (Cypress/Playwright) Experience with Observability as a practice (logging, GA tagging, TrackJS, App Insights) If you would be interested please apply below! INDMANS More ❯
Cambridge (onsite travel required) Job Type: 12-Month Contract (Inside IR35) Experience Level: Mid to Senior Level Role Overview We are seeking an experienced Dynatrace Consultant to join our Observability Team on a 12-month engagement. This role is critical in driving the adoption and integration of Dynatrace across a complex enterprise environment. You will work closely with platform teams … application owners, and DevOps engineers to enable full observability, implement best practices, and ensure successful platform rollout as part of our new Center of Excellence initiative. Key Responsibilities Provide technical consulting and enablement to internal engineering teams for effective use of Dynatrace. Build dashboards, alerts, and service flow mappings aligned with application performance needs. Develop and optimize Dynatrace Query Language … DQL) queries for actionable insights. Support observability design and migration from tools such as Prometheus, Grafana, and AWS CloudWatch to Dynatrace. Advise on RBAC models, data access strategies , and security best practices for multi-team environments. Design monitoring strategies for Kubernetes workloads in hybrid cloud/on-prem environments. Promote observability-as-code using tools like Terraform and GitLab for More ❯
ll dig into logs, traces and code to explain behaviour, patch bugs or raise backlog stories when deeper product work is needed. Often these investigations will result in improving observability or stability of the platform. High-impact feature work. Between investigations we deliver focused enhancements and platform improvements that don't slot neatly into long-term road-maps. Because our … team's workload is unpredictable, delivery dates are flexible and scoped by the team. Platform observability & performance. Your team members continually raise the bar on monitoring, metrics and efficiency. Joining as our newest engineer, you'll pair with seasoned Go/TypeScript/Python devs, owning real tasks from week one. Expect a dynamic mix of bug hunting, green field … and basic cloud/Linux fundamentals. Curiosity and the confidence to ask questions in a fast-moving team. Nice-to-haves Exposure to Kubernetes, Docker or Terraform. Experience with observability stacks (Grafana, Prometheus, OpenTelemetry). Familiarity with Postgres. Interest in data-privacy, AdTech/MarTech or large-scale data processing. Familiarity with Kafka, gRPC or Apache Spark. As well as More ❯
of a secure, cloud-native SaaS platform Partner with Product, UX, and scientific teams to translate genomic needs into scalable software features Oversee full engineering lifecycle – infra, DevOps, QA, observability, and application layer Build and mentor a high-performing engineering team, setting standards and best practices Maintain regulatory alignment and readiness for healthcare/genomics SaaS products Drive innovation by … evolution What You Bring Proven software engineering leadership, including strategy, hiring, delivery, and technical oversight Deep experience building and scaling SaaS platforms (cloud-native, Kubernetes, Terraform, CI/CD, observability) Expertise in modern stacks (Python, TypeScript/Node.js, React) and major clouds (AWS, GCP, Azure, Oracle) Knowledge of security and privacy frameworks: RBAC, encryption, secure API design, identity/auth More ❯
Leeds, Yorkshire, United Kingdom Hybrid / WFH Options
William Hill PLC
you. The Leeds-based, highly skilled SRE team are primarily managing the Kubernetes clusters within the organisation for multiple departments, and through a DevOps culture enabling those departments with observability and pipelines for their business applications. Their job is to guarantee system reliability, performance, and supportability with a strong engineering emphasis on building autonomous solutions that deliver value to end … to be. Please note the interviews for this role will be face-to-face in our central Leeds office. What you will be doing: Ensuring Reliability - Best in class Observability and Security, applying the Four Golden Signals, with appropriate Testing and Disaster Recovery Plans Improving Productivity - Automate rapid delivery through software delivery pipelines using Infrastructure as Code Maintaining and Developing … people who can support our ethos. To apply to this post, you will have: A base in Leeds with working experience of an incident response model and fluency with observability and monitoring (Prometheus, Grafana) Experience defining alerts and implementing dashboards from existing monitoring and logging data Relentless focus on customer experience with good understanding of security best practice Fluency in More ❯
of a secure, cloud-native SaaS platform Partner with Product, UX, and scientific teams to translate genomic needs into scalable software features Oversee full engineering lifecycle – infra, DevOps, QA, observability, and application layer Build and mentor a high-performing engineering team, setting standards and best practices Maintain regulatory alignment and readiness for healthcare/genomics SaaS products Drive innovation by … evolution What You Bring Proven software engineering leadership, including strategy, hiring, delivery, and technical oversight Deep experience building and scaling SaaS platforms (cloud-native, Kubernetes, Terraform, CI/CD, observability) Expertise in modern stacks (Python, TypeScript/Node.js, React) and major clouds (AWS, GCP, Azure, Oracle) Knowledge of security and privacy frameworks: RBAC, encryption, secure API design, identity/auth More ❯
as an Oracle Site Reliability Engineer to help us build and maintain resilient, high-performing systems in a fast-paced financial services environment. If you're passionate about automation, observability, and continuous improvement, we'd love to hear from you. To be successful as a Oracle Site Reliability Engineer, you should have experience with: Significant experience in Site Reliability Engineering … tools that support system setup and automation, such as Ansible, Puppet, or Chef. Experience designing and maintaining CI/CD pipelines to support seamless deployments. Knowledge of monitoring and observability tools such as Prometheus, Grafana, and the ELK stack. You may be assessed on the key critical skills relevant for success in role, such as risk and controls, change and More ❯
as an Oracle Site Reliability Engineer to help us build and maintain resilient, high-performing systems in a fast-paced financial services environment. If you're passionate about automation, observability, and continuous improvement, we'd love to hear from you. To be successful as a Oracle Site Reliability Engineer, you should have experience with: Significant experience in Site Reliability Engineering … tools that support system setup and automation, such as Ansible, Puppet, or Chef. Experience designing and maintaining CI/CD pipelines to support seamless deployments. Knowledge of monitoring and observability tools such as Prometheus, Grafana, and the ELK stack. You may be assessed on the key critical skills relevant for success in role, such as risk and controls, change and More ❯