engineering teams Guide system design for complex, cloud-native, containerised environments (Kubernetes, Terraform, Helm, Flux) Own the evolution of our data model and processing platforms Embed best practices in observability , security , and testing across the organisation Anticipate dependencies and unblock teams before issues arise Partner with product and business leaders to align technology with strategic goals Mentor engineers and technical … Expertise in modern backend architectures (Node.js, Python, Go) and fluency with frontend ecosystems (React, TypeScript) Deep experience with cloud-native infrastructure (Terraform, Kubernetes, Helm, Flux) Strong grasp of security, observability, and operational excellence Proven ability to influence and guide without formal authority Track record of mentoring and developing high-performing engineering teams Why Youll Love It Here Join a mission More ❯
engineering teams Guide system design for complex, cloud-native, containerised environments (Kubernetes, Terraform, Helm, Flux) Own the evolution of our data model and processing platforms Embed best practices in observability , security , and testing across the organisation Anticipate dependencies and unblock teams before issues arise Partner with product and business leaders to align technology with strategic goals Mentor engineers and technical … Expertise in modern backend architectures (Node.js, Python, Go) and fluency with frontend ecosystems (React, TypeScript) Deep experience with cloud-native infrastructure (Terraform, Kubernetes, Helm, Flux) Strong grasp of security, observability, and operational excellence Proven ability to influence and guide without formal authority Track record of mentoring and developing high-performing engineering teams Why You’ll Love It Here Join a More ❯
About NewStore NewStore provides Omnichannel-as-a-Service for retail brands worldwide that want to accelerate their digital transformation. Built for speed and flexibility, NewStore allows brands to easily deliver amazing shopping experiences that store associates and consumers love. Its More ❯
a Great Place to Work, ITRS is an Enterprise SaaS provider with industry-leading solutions. Our mission is to make society's critical technology work via automated & holistic IT observability solutions that safeguard critical applications and enable innovation. With our prestigious customer base includes 90% of the world's top investment banks. We are backed by leading private equity investors … sit in the Platform Team and form part of a wider global Engineering Team. The Core Platform layer is a collection of distributed services which ingest, transform and materialise observability data to make it available to several similarly distributed visualisation, integration, analytics and other domain specific applications to provide solutions to a range of observability problems. We are looking to … tuned garbage collection, fixed CPU hotspots and analysed heap dumps to find memory leaks on JVMs. You have likely instrumented custom metrics in your application and used monitoring/observability tools to understand how they behave in different environments and to identify the root cause of issues. Health Insurance and Dental Health Cover for you and your dependants Employee Assistance More ❯
newport, wales, united kingdom Hybrid / WFH Options
Travel Chapter
clear, actionable reports and communicate findings to technical and non-technical stakeholders Contribute to our Cypress UI automation suite and embed quality checks early in the development lifecycle Use observability tools such as DataDog to monitor metrics and investigate performance trends Support our CI/CD processes and ensure smooth, reliable automation execution Share best practices and champion continuous improvement … for UI automation and familiarity with BDD tools such as Cucumber and Gherkin Solid API testing experience using tools such as Postman Experience with DataDog or similar tools for observability and bottleneck analysis Good understanding of performance metrics and collaboration with DevOps to optimise systems Experience with CI/CD pipelines and version control (GitLab or similar) Practical experience with More ❯
City Of Bristol, England, United Kingdom Hybrid / WFH Options
Travel Chapter
clear, actionable reports and communicate findings to technical and non-technical stakeholders Contribute to our Cypress UI automation suite and embed quality checks early in the development lifecycle Use observability tools such as DataDog to monitor metrics and investigate performance trends Support our CI/CD processes and ensure smooth, reliable automation execution Share best practices and champion continuous improvement … for UI automation and familiarity with BDD tools such as Cucumber and Gherkin Solid API testing experience using tools such as Postman Experience with DataDog or similar tools for observability and bottleneck analysis Good understanding of performance metrics and collaboration with DevOps to optimise systems Experience with CI/CD pipelines and version control (GitLab or similar) Practical experience with More ❯
bath, south west england, united kingdom Hybrid / WFH Options
Travel Chapter
clear, actionable reports and communicate findings to technical and non-technical stakeholders Contribute to our Cypress UI automation suite and embed quality checks early in the development lifecycle Use observability tools such as DataDog to monitor metrics and investigate performance trends Support our CI/CD processes and ensure smooth, reliable automation execution Share best practices and champion continuous improvement … for UI automation and familiarity with BDD tools such as Cucumber and Gherkin Solid API testing experience using tools such as Postman Experience with DataDog or similar tools for observability and bottleneck analysis Good understanding of performance metrics and collaboration with DevOps to optimise systems Experience with CI/CD pipelines and version control (GitLab or similar) Practical experience with More ❯
bradley stoke, south west england, united kingdom Hybrid / WFH Options
Travel Chapter
clear, actionable reports and communicate findings to technical and non-technical stakeholders Contribute to our Cypress UI automation suite and embed quality checks early in the development lifecycle Use observability tools such as DataDog to monitor metrics and investigate performance trends Support our CI/CD processes and ensure smooth, reliable automation execution Share best practices and champion continuous improvement … for UI automation and familiarity with BDD tools such as Cucumber and Gherkin Solid API testing experience using tools such as Postman Experience with DataDog or similar tools for observability and bottleneck analysis Good understanding of performance metrics and collaboration with DevOps to optimise systems Experience with CI/CD pipelines and version control (GitLab or similar) Practical experience with More ❯
proactive incident management. Key Responsibilities: . Translate high-level monitoring non-functional requirements (NFRs) into actionable configurations across tools such as Splunk, Dynatrace, and AppDynamics. . Deliver full-stack observability solutions, including application-aware network performance monitoring (NPM), synthetics, log analytics, and infrastructure metrics. . Provide live support for monitoring technologies and assist with live service support, including key business … and tooling exploitation to enhance operational efficiency efficiency within immature teams Required Skills and Experience: . Strong understanding and experience in SRE principals and methodologies . Strong understanding of Observability within a complex tech stack . Hands-on experience with monitoring tools such as Splunk, Splunk ITSI, Dynatrace, AppDynamics, and synthetic monitoring platforms. . Strong understanding and experience with implementing More ❯
Providing hands-on technical guidance on data design and integration to Data Engineering, BI, and Data Science teams. Driving the adoption of tools like Alation (cataloguing) and Monte Carlo (observability) to ensure data quality and lineage. What we're looking for in you (your superpowers!) We're looking for someone who is a creative problem-solver and has a strong … Strategy and Data Governance Strong communication and stakeholder management skills. The Nice-to-Haves (bonus points if you have these!): Experience with Monte Carlo, Alation, or similar metadata/observability tools. Knowledge of event streaming, API integration, and MLOps . Experience in regulated, high-volume industries (gaming, finance, or e-commerce). Proficiency with integration/orchestration tools like Airflow More ❯
maintain large-scale PostgreSQL databases running on AWS RDS. Monitor database health, diagnose bottlenecks, and implement durable fixes. Troubleshoot complex production issues and improve query/index performance. Implement observability and monitoring tools such as CloudWatch and pganalyze. Guide engineers on database best practices and contribute to long-term architecture decisions. Collaborate with backend and infrastructure teams to ensure stability … Fanvue’s databases. Strong communicator who can explain complex database concepts to both technical and non-technical audiences. Nice-to-haves Experience with Prisma or other ORMs. Familiarity with observability tools like CloudWatch and pganalyze. Exposure to IaC tools (Terraform, CDK, etc.). ✨ You’ll Thrive Here If You thrive on ownership and being the go-to expert for critical More ❯
maintain large-scale PostgreSQL databases running on AWS RDS. Monitor database health, diagnose bottlenecks, and implement durable fixes. Troubleshoot complex production issues and improve query/index performance. Implement observability and monitoring tools such as CloudWatch and pganalyze. Guide engineers on database best practices and contribute to long-term architecture decisions. Collaborate with backend and infrastructure teams to ensure stability … of Fanvues databases. Strong communicator who can explain complex database concepts to both technical and non-technical audiences. Nice-to-haves Experience with Prisma or other ORMs. Familiarity with observability tools like CloudWatch and pganalyze. Exposure to IaC tools (Terraform, CDK, etc.). Youll Thrive Here If You thrive on ownership and being the go-to expert for critical infrastructure. More ❯
Alphen aan den Rijn, Zuid-Holland, Netherlands Hybrid / WFH Options
ITRS
with clear technical and business context. Drive innovation around optional DEM capabilities such as thick client monitoring, mobile SDK-based monitoring, internet path analysis, and integrations with APM and observability platforms. Stay ahead of industry trends and competitive offerings in digital experience and performance monitoring, turning insights into product enhancements and differentiation. Support go-to-market activities, release communications, and … sales enablement to position Uptrends effectively in the monitoring and observability ecosystem. Requirements To be successful, we will look for the following experience and behaviours: An interest in digital performance, end-user monitoring, and observability, with an understanding of web technologies, APIs, and cloud service architectures. 5+ years of experience as a Product Manager or Technical Product Owner in a … needs Agile Product Execution and Prioritization Data-Informed Decision-making Technical Depth in Monitoring/Telemetry Collaboration Across Product, Engineering & Go-To-Market Teams Market Awareness in DEM and Observability Benefits Employee Assistance Programme Pension allowance Flexible Hybrid Working Enhanced Maternity and Paternity Leave Commuting Reimbursement Referral Bonus Buy and Sell Holiday Training Reimbursement ITRS is an Equal Opportunity employer More ❯
Oxford, England, United Kingdom Hybrid / WFH Options
Amberes
product features. You will move fast from concept to customer, working across the stack to design APIs, build front-end interfaces, integrate AI models, and ensure performance, reliability, and observability in production. Key Responsibilities Build and ship AI-driven features end-to-end, from prototype to production Design, implement, and maintain inference services with strong observability Develop and optimise retrieval More ❯
Birmingham, England, United Kingdom Hybrid / WFH Options
Amberes
product features. You will move fast from concept to customer, working across the stack to design APIs, build front-end interfaces, integrate AI models, and ensure performance, reliability, and observability in production. Key Responsibilities Build and ship AI-driven features end-to-end, from prototype to production Design, implement, and maintain inference services with strong observability Develop and optimise retrieval More ❯
product features. You will move fast from concept to customer, working across the stack to design APIs, build front-end interfaces, integrate AI models, and ensure performance, reliability, and observability in production. Key Responsibilities Build and ship AI-driven features end-to-end, from prototype to production Design, implement, and maintain inference services with strong observability Develop and optimise retrieval More ❯
banbury, south east england, united kingdom Hybrid / WFH Options
Amberes
product features. You will move fast from concept to customer, working across the stack to design APIs, build front-end interfaces, integrate AI models, and ensure performance, reliability, and observability in production. Key Responsibilities Build and ship AI-driven features end-to-end, from prototype to production Design, implement, and maintain inference services with strong observability Develop and optimise retrieval More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Hunter Bond
Running FinOps forums to improve cost transparency, accountability, and platform efficiency. Overseeing CI/CD pipelines and deployments, ensuring safe, reliable, and compliant change delivery. Driving monitoring, automation, and observability initiatives to improve uptime and reduce manual intervention. Developing runbooks, playbooks, and escalation protocols to strengthen resilience. Partnering closely with data engineering and analytics teams to align operations with business … Knowledge of automation and CI/CD practices to enhance deployment reliability. A FinOps mindset — comfortable managing costs, reporting usage, and running stakeholder forums. Experience with monitoring, alerting, and observability tools for proactive health management. Excellent communication and stakeholder management – able to translate technical detail into clear business impact. Calm, pragmatic problem solver with strong decision-making skills under pressure. More ❯
Running FinOps forums to improve cost transparency, accountability, and platform efficiency. Overseeing CI/CD pipelines and deployments, ensuring safe, reliable, and compliant change delivery. Driving monitoring, automation, and observability initiatives to improve uptime and reduce manual intervention. Developing runbooks, playbooks, and escalation protocols to strengthen resilience. Partnering closely with data engineering and analytics teams to align operations with business … Knowledge of automation and CI/CD practices to enhance deployment reliability. A FinOps mindset — comfortable managing costs, reporting usage, and running stakeholder forums. Experience with monitoring, alerting, and observability tools for proactive health management. Excellent communication and stakeholder management – able to translate technical detail into clear business impact. Calm, pragmatic problem solver with strong decision-making skills under pressure. More ❯
Running FinOps forums to improve cost transparency, accountability, and platform efficiency. Overseeing CI/CD pipelines and deployments, ensuring safe, reliable, and compliant change delivery. Driving monitoring, automation, and observability initiatives to improve uptime and reduce manual intervention. Developing runbooks, playbooks, and escalation protocols to strengthen resilience. Partnering closely with data engineering and analytics teams to align operations with business … Knowledge of automation and CI/CD practices to enhance deployment reliability. A FinOps mindset - comfortable managing costs, reporting usage, and running stakeholder forums. Experience with monitoring, alerting, and observability tools for proactive health management. Excellent communication and stakeholder management - able to translate technical detail into clear business impact. Calm, pragmatic problem solver with strong decision-making skills under pressure. More ❯
london, south east england, united kingdom Hybrid / WFH Options
Hunter Bond
Running FinOps forums to improve cost transparency, accountability, and platform efficiency. Overseeing CI/CD pipelines and deployments, ensuring safe, reliable, and compliant change delivery. Driving monitoring, automation, and observability initiatives to improve uptime and reduce manual intervention. Developing runbooks, playbooks, and escalation protocols to strengthen resilience. Partnering closely with data engineering and analytics teams to align operations with business … Knowledge of automation and CI/CD practices to enhance deployment reliability. A FinOps mindset comfortable managing costs, reporting usage, and running stakeholder forums. Experience with monitoring, alerting, and observability tools for proactive health management. Excellent communication and stakeholder management able to translate technical detail into clear business impact. Calm, pragmatic problem solver with strong decision-making skills under pressure. More ❯
london (city of london), south east england, united kingdom Hybrid / WFH Options
Hunter Bond
Running FinOps forums to improve cost transparency, accountability, and platform efficiency. Overseeing CI/CD pipelines and deployments, ensuring safe, reliable, and compliant change delivery. Driving monitoring, automation, and observability initiatives to improve uptime and reduce manual intervention. Developing runbooks, playbooks, and escalation protocols to strengthen resilience. Partnering closely with data engineering and analytics teams to align operations with business … Knowledge of automation and CI/CD practices to enhance deployment reliability. A FinOps mindset comfortable managing costs, reporting usage, and running stakeholder forums. Experience with monitoring, alerting, and observability tools for proactive health management. Excellent communication and stakeholder management able to translate technical detail into clear business impact. Calm, pragmatic problem solver with strong decision-making skills under pressure. More ❯
slough, south east england, united kingdom Hybrid / WFH Options
Hunter Bond
Running FinOps forums to improve cost transparency, accountability, and platform efficiency. Overseeing CI/CD pipelines and deployments, ensuring safe, reliable, and compliant change delivery. Driving monitoring, automation, and observability initiatives to improve uptime and reduce manual intervention. Developing runbooks, playbooks, and escalation protocols to strengthen resilience. Partnering closely with data engineering and analytics teams to align operations with business … Knowledge of automation and CI/CD practices to enhance deployment reliability. A FinOps mindset — comfortable managing costs, reporting usage, and running stakeholder forums. Experience with monitoring, alerting, and observability tools for proactive health management. Excellent communication and stakeholder management – able to translate technical detail into clear business impact. Calm, pragmatic problem solver with strong decision-making skills under pressure. More ❯
At ITRS, we make society's critical technology work. Our mission is to deliver automated and holistic IT observability solutions that safeguard critical applications and enable innovation. We are the only monitoring and observability platform designed for the most demanding and regulated industries - trusted by 90% of Tier 1 capital markets firms. We believe when our team thrives, so do … Refine and improve Customer Success practices to ensure they are aligned with the evolving needs of clients. Requirements A blend of technical understanding and customer-facing skills, from an observability, monitoring, or enterprise software environment. Experience in a Customer Success, Technical Account Management, or related customer-facing role (non-sales). Hands-on knowledge of observability platforms or monitoring tools More ❯
high-performance connectivity for over 50,000 users and their devices through the implementation of VPN, SASE, and Zero-Trust Network Access (ZTNA) technologies. With a focus on automation, observability, and user experience, this position plays a vital role in ensuring seamless and secure connectivity across third-party, hybrid, and internal environments. Key Responsibilities: Strategy & Architecture Leadership: Define and lead … provisioning, policy enforcement, and configuration management through Infrastructure as Code (IaC) and zero-touch deployment practices. Apply Site Reliability Engineering (SRE) principles to improve performance, availability, and troubleshooting. Establish observability practices across all access points with real-time metrics, logs, and telemetry. Security, Compliance & Governance: Ensure compliance with corporate security and privacy policies through proactive monitoring and enforcement. Partner with … trust architecture, network segmentation, and endpoint-based access control. Proven ability to scale remote connectivity solutions to tens of thousands of users and devices. Experience with IaC, network automation, observability tooling, and SRE methodologies. Preferred Qualifications: Certifications such as CCNP, CCIE, PCNSE, Zscaler Certified, or equivalent. Familiarity with secure hybrid work and cloud networking models. Background in network performance optimization More ❯