London, South East, England, United Kingdom Hybrid / WFH Options
Pontoon
for technology professionals. Overview: Client is looking for a Dynatrace Subject Matter Expert to join their IT Infrastructure team. This role will be key in improving performance monitoring and observability across business-critical applications. You will work closely with Site Reliability Engineers, Application Stewards, and external software vendors to ensure a reliable and efficient monitoring setup. Key Responsibilities: Build and … maintain automated monitoring of key business processes using Dynatrace. Collaborate with internal teams to configure alerts and monitoring rules. Join workshops with third-party vendors to align on observability best practices. Technical Requirements: Strong hands-on experience with Dynatrace in the following areas: Application Performance Monitoring; anomaly detection and alerting; synthetic monitoring and log monitoring; Real User Monitoring across web and mobile More ❯
shape the future! Why this team? This team is responsible for building world-class infrastructure that is critical to all of Cohere's success. Stability, scalability, and observability are all paramount, as this work acts as the foundation for all members of technical staff. Our team optimizes for a wide range of technical skillsets (some of which are … a Software Engineer, you will: Build self-service systems that automate managing, deploying and operating services. This includes our custom Kubernetes operators that support language model deployments. Automate environment observability and resilience. Enable all developers to troubleshoot and resolve problems. Take steps required to ensure we hit defined SLOs, including participation in an on-call rotation. Build strong relationships with More ❯
as security, compliance, cost and reliability. As a Senior SRE, you will: Take part in the design, development, deployment and management of infrastructure products Evangelize the best practices around observability, reliability, security and performance Help the company grow faster by abstracting the infrastructure layer Your Story Responsibilities: Experience with the software development lifecycle, preferably using Golang Understanding of CI/… flow of network packets across a container orchestration solution is super interesting Secret management is a rabbit hole, protecting secrets is much more than throwing technology at the problem Observability is now trending, although it's usually reduced to logging, monitoring and tracing The Perks We trust you, so we offer flexible working hours, as long as it suits both you More ❯
will have proven experience in the field, focusing on kernel development and cluster automation (build, OS/Kubernetes upgrade, and decommission). You will also drive the implementation of observability practices to monitor, troubleshoot, and ensure the reliability of our infrastructure at scale. What you will accomplish: Design, develop, and maintain a stable, high-performance Linux operating system optimized for …/BPF-based network segmentation and service mesh solutions. Collaborate with cross-functional teams to validate, adopt, and integrate optimized Linux OS distributions across diverse infrastructure environments. Implement robust observability frameworks to monitor system health, ensure performance, and support proactive issue resolution at scale. What you will bring: Bachelor's or Master's degree in Computer Science, Engineering, or a More ❯
mentorship on the practical application of AI/LLM techniques, defining best practices for everything from prompt engineering to fine-tuning and RAG. Ensure Reliability: Implement and own LLM observability and evaluation systems to ensure our AI features are reliable, performant, and continuously improving. Collaborate Cross-Functionally: Work closely with data scientists, platform engineers, and research teams to build cohesive … platforms, or reinforcement learning from human feedback (RLHF). An interest or experience in synthetic data, human-in-the-loop systems, or AI alignment. Experience building robust monitoring and observability for AI/ML systems in production. Why Prolific is a great place to work We've built a unique platform that connects researchers and companies with a global pool More ❯
Experience with Git and GitHub workflows Familiarity with CI/CD practices and tools A genuine interest in how modern applications are evolving to leverage Generative AI Experience with observability tools A basic understanding of TypeScript and frontend development At least 5 years of software engineering experience Nice to haves but not required: Knowledge of Rust or interest in learning … of. Why? Because we obsess over developer experience and write code we'd actually want to use ourselves. We're applying that same engineering mindset to Pydantic Logfire, our observability platform built for today's development reality: AI workloads, multi-language environments, and cloud infrastructure that's designed to be straightforward to set up and maintain. We build with technologies More ❯
to deployment and monitoring, balancing cutting-edge techniques with pragmatism to deliver measurable impact. • Apply strong software engineering principles, such as modularity, testing, code reviews, CI/CD and observability, to ensure AI systems are reliable, maintainable, production-ready and can be readily adapted to future developments. • Choose the right approach for the problem at hand, evaluating classical ML and … focused teams, collaborating with designers, engineers, and PMs, to scope and ship AI features iteratively • Ability to reason about system behavior end-to-end, including model performance, latency, and observability, and how these impact user experience. • Clear, structured communicator, comfortable documenting and defending architectural decisions and engaging in thoughtful technical debate. Not required, but it's a plus if you More ❯
for Generative AI. Develop end-to-end AI solutions, including agentic AI systems that dynamically adapt their interactions to evolving goals, solving real problems. Implement comprehensive evaluation methodologies and observability frameworks specifically tailored to assess performance, reliability, and safety of agentic AI solutions. Help to shape and develop Motorway's Generative AI strategy Work across the organisation to run workshops … of when each should be applied. Proven experience with Cloud technologies on GCP or AWS (GCP preferred) including serverless architectures and microservices development. Deep understanding of evaluation strategies and observability tools for assessing AI performance, reliability, robustness, cost, and overall effectiveness. Expertise with SOTA LLMs such as Gemini, GPT/o-series, Claude, Deepseek, Llama 4, etc. Experience with Google More ❯
Head of AI & Technology Shape the Future of AI Recruitment Are you passionate about AI, technical leadership, and building scalable, high-impact products? Join us as a Head of AI & Technology and take the reins of a cutting-edge AI More ❯
About Us The Company: Dotdigital is a thriving global community of passionate, dedicated professionals, committed to the collective success of the organization and its clients. Our core principles of innovation, teamwork, and client-focused solutions drive us to approach challenges More ❯
Engineering and innovation go hand in hand. Without trying, failing, trying again and failing again, trying something new, failing a third time and this repetitive cycle, many of the things we take for granted wouldn't exist. The thirst for More ❯
Bridging development and infrastructure is the true essence of DevOps. Success has led to expansion, creating a need to bridge the gap between these two well-established teams. It's time for DevOps. You would leverage your knowledge of the More ❯
Join the Exciting Journey at tombola! Are you ready to be a part of something extraordinary? As tombola continues its remarkable growth, we're on the hunt for dynamic and innovative Developers to work in our Platform division. This is your chance More ❯
Join the Exciting Journey at tombola! Are you ready to be part of something extraordinary? As we continue our remarkable growth, we're on the lookout for dynamic and innovative Developers to join our Platform division, and right now, we're More ❯
across global technology teams to design and implement monitoring solutions for the firm's core Line of Business ("LOB") applications and vital infrastructure, while providing input into monitoring and observability platform technical design and architectural decisions and changes, including the design and implementation of new monitoring systems integrations. This role will be located in our Manchester office. Please note that this role … may be eligible for a flexible working schedule that allows for a hybrid and in-office presence. Responsibilities & Qualifications Other key responsibilities include: Demonstrating understanding of Monitoring and Observability tools and core concepts Demonstrating understanding of Observability frameworks and tools that ingest telemetry data from multiple sources Ensuring platform health and stability Providing support to and acting as the main … and the NOC team Promoting the enterprise monitoring service through stakeholder engagement We'd love to hear from you if you: Display expert subject matter knowledge on Monitoring and Observability tools for critical Infrastructure and LOB services/applications Demonstrate good knowledge of working with Observability tools to leverage telemetry data (logs, metrics and trace data) to provide insight into More ❯
/Sheffield Start Date: ASAP End Date: 28/11/2025 Rate: £575 p/d via Umbrella Number of roles: 3 Job Description: Seeking an expert in building Observability level 4, including managing events, component-level metrics, metrics-logs, traces, time series topology, and AI/ML models. The role involves achieving: Alerts, comprehensive dashboards, root cause analysis, business … impact analysis, and predictive and preventative insights. Qualifications: 10+ years of experience in building observability level 4. Expertise in Splunk platform, including Essentials, Enterprise Security, and ITSI. Strong understanding of ITSM concepts (ITIL, ServiceNow) and service management processes. Proficiency in scripting and programming languages like Python, Shell Scripting, or Perl (beneficial). Excellent analytical and problem-solving skills. Solid understanding More ❯
with the Enterprise Monitoring & Alerting (EMAS) team to deliver a transformative initiative aimed at maximising Dynatrace capabilities. We are looking for a skilled Dynatrace Admin/Consultant to enable observability across complex, hybrid cloud environments. Essential Skills: To thrive in this role, you must demonstrate extensive experience in designing and configuring within Dynatrace, including: Application Performance Monitoring Proficiency in Dynatrace … assets for monitoring. Work with EMAS to analyse Dynatrace coverage of these critical assets. Identify opportunities for enhancement in monitoring configurations across crucial applications. Review roles and responsibilities concerning observability and propose improvements focused on Operational Resilience. Contribute to establishing an automated end-to-end business flow for key business processes within the Dynatrace toolset. Ensure optimal alerting configurations in … collaboration with Application Stewards and SREs. Participate in workshops with third-party software suppliers to review observability standards. Bonus Points: Skills in correlating events across the full stack for root cause analysis. Key Attributes: Ability to manage competing priorities in a fast-paced environment. Flexibility and a pragmatic approach to problem-solving. A delivery-oriented mindset coupled with a can More ❯
this role combines technical leadership with hands-on engineering across a modern stack: AWS (Lambda, Step Functions, DynamoDB, Postgres, CDK), TypeScript, React, Next.js, Jest, Playwright, CI/CD and observability tools. You'll help establish and grow the Core Services team, building scalable architecture, developer tooling, and platform services that power multiple product squads. Expect around 70-80% coding and … Manager, Technical Lead, Senior Full Stack Engineer, Staff Engineer, AWS Lambda, AWS Step Functions, AWS DynamoDB, AWS Postgres, AWS CDK, TypeScript, React, Next.js, Jest, Playwright, CI/CD, DevOps, Observability, Monitoring, SaaS, HRTech, Hybrid Jobs London, Remote, Scale-up, Scalable Architecture. More ❯
and the chance to act as a “mini-CEO” across Sales, Marketing, Product, and Engineering. TL;DR 📍 Location: Amsterdam Area/London (hybrid possible) 💼 Env: SaaS/monitoring/observability/devtools 💸 Salary: €300K–€400K + bonus/stock 🔧 Tech: DEM, observability, PLG/SLG, SaaS GTM, AM/CSM frameworks 🗣 Lingua franca: English Role Lead and grow a fully … product-led SaaS platform delivering next-gen, real-time IT monitoring and observability for complex, hybrid IT environments. You’ll own business strategy, product roadmap, GTM execution, and culture for a 45-person cross-functional team - engineering, product, sales, and marketing. Expect board-level visibility, global impact, and the ability to drive both strategic enterprise sales and high-volume SME … plus strategic selling to enterprise-level clients ✔ Skilled at leading cross-functional teams in high-growth environments ✔ Fluent in English (spoken and written) Nice to haves ➕ IT monitoring/observability industry background ➕ Founder/early-stage scale-up experience ➕ Worked with remote/international teams Hop onboard! You’ll be defining what “modern operations” means for thousands of customers, while More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Bluebird
and the chance to act as a “mini-CEO” across Sales, Marketing, Product, and Engineering. TL;DR 📍 Location: Amsterdam Area/London (hybrid possible) 💼 Env: SaaS/monitoring/observability/devtools 💸 Salary: €300K–€400K + bonus/stock 🔧 Tech: DEM, observability, PLG/SLG, SaaS GTM, AM/CSM frameworks 🗣 Lingua franca: English Role Lead and grow a fully … product-led SaaS platform delivering next-gen, real-time IT monitoring and observability for complex, hybrid IT environments. You’ll own business strategy, product roadmap, GTM execution, and culture for a 45-person cross-functional team - engineering, product, sales, and marketing. Expect board-level visibility, global impact, and the ability to drive both strategic enterprise sales and high-volume SME … plus strategic selling to enterprise-level clients ✔ Skilled at leading cross-functional teams in high-growth environments ✔ Fluent in English (spoken and written) Nice to haves ➕ IT monitoring/observability industry background ➕ Founder/early-stage scale-up experience ➕ Worked with remote/international teams Hop onboard! You’ll be defining what “modern operations” means for thousands of customers, while More ❯
London, South East England, United Kingdom Hybrid / WFH Options
Bluebird
and the chance to act as a “mini-CEO” across Sales, Marketing, Product, and Engineering. TL;DR 📍 Location: Amsterdam Area/London (hybrid possible) 💼 Env: SaaS/monitoring/observability/devtools 💸 Salary: €300K–€400K + bonus/stock 🔧 Tech: DEM, observability, PLG/SLG, SaaS GTM, AM/CSM frameworks 🗣 Lingua franca: English Role Lead and grow a fully … product-led SaaS platform delivering next-gen, real-time IT monitoring and observability for complex, hybrid IT environments. You’ll own business strategy, product roadmap, GTM execution, and culture for a 45-person cross-functional team - engineering, product, sales, and marketing. Expect board-level visibility, global impact, and the ability to drive both strategic enterprise sales and high-volume SME … plus strategic selling to enterprise-level clients ✔ Skilled at leading cross-functional teams in high-growth environments ✔ Fluent in English (spoken and written) Nice to haves ➕ IT monitoring/observability industry background ➕ Founder/early-stage scale-up experience ➕ Worked with remote/international teams Hop onboard! You’ll be defining what “modern operations” means for thousands of customers, while More ❯
Slough, South East England, United Kingdom Hybrid / WFH Options
Bluebird
and the chance to act as a “mini-CEO” across Sales, Marketing, Product, and Engineering. TL;DR 📍 Location: Amsterdam Area/London (hybrid possible) 💼 Env: SaaS/monitoring/observability/devtools 💸 Salary: €300K–€400K + bonus/stock 🔧 Tech: DEM, observability, PLG/SLG, SaaS GTM, AM/CSM frameworks 🗣 Lingua franca: English Role Lead and grow a fully … product-led SaaS platform delivering next-gen, real-time IT monitoring and observability for complex, hybrid IT environments. You’ll own business strategy, product roadmap, GTM execution, and culture for a 45-person cross-functional team - engineering, product, sales, and marketing. Expect board-level visibility, global impact, and the ability to drive both strategic enterprise sales and high-volume SME … plus strategic selling to enterprise-level clients ✔ Skilled at leading cross-functional teams in high-growth environments ✔ Fluent in English (spoken and written) Nice to haves ➕ IT monitoring/observability industry background ➕ Founder/early-stage scale-up experience ➕ Worked with remote/international teams Hop onboard! You’ll be defining what “modern operations” means for thousands of customers, while More ❯
London (City of London), South East England, United Kingdom Hybrid / WFH Options
Bluebird
and the chance to act as a “mini-CEO” across Sales, Marketing, Product, and Engineering. TL;DR 📍 Location: Amsterdam Area/London (hybrid possible) 💼 Env: SaaS/monitoring/observability/devtools 💸 Salary: €300K–€400K + bonus/stock 🔧 Tech: DEM, observability, PLG/SLG, SaaS GTM, AM/CSM frameworks 🗣 Lingua franca: English Role Lead and grow a fully … product-led SaaS platform delivering next-gen, real-time IT monitoring and observability for complex, hybrid IT environments. You’ll own business strategy, product roadmap, GTM execution, and culture for a 45-person cross-functional team - engineering, product, sales, and marketing. Expect board-level visibility, global impact, and the ability to drive both strategic enterprise sales and high-volume SME … plus strategic selling to enterprise-level clients ✔ Skilled at leading cross-functional teams in high-growth environments ✔ Fluent in English (spoken and written) Nice to haves ➕ IT monitoring/observability industry background ➕ Founder/early-stage scale-up experience ➕ Worked with remote/international teams Hop onboard! You’ll be defining what “modern operations” means for thousands of customers, while More ❯
Cheltenham, Gloucestershire, United Kingdom Hybrid / WFH Options
Gemba Advantage
processes to facilitate the work of the development team Build continuous integration, continuous delivery, and continuous deployment (CI/CD) pipelines for software delivery Set up operational tooling and observability infrastructure Identify security risks and deploy measures to mitigate them Identify problems and take steps to resolve and remove them Automate and improve development and release processes Your skills and … processes to facilitate the work of the development team Build continuous integration, continuous delivery, and continuous deployment (CI/CD) pipelines for software delivery Set up operational tooling and observability infrastructure Identify security risks and deploy measures to mitigate them Identify problems and take steps to resolve and remove them Automate and improve development and release processes Why join Gemba More ❯