help drive platform engineering maturity by delivering cloud-native infrastructure, scalable CI/CD tooling, and shared services that empower cross-functional teams. Your focus will be on enhancing observability, automating operational processes, and ensuring that systems are well-documented and supportable. Working with tools such as GCP, Kubernetes, Helm, Terraform, and Azure DevOps, you'll create efficient, reliable environments … Terraform for infrastructure-as-code delivery Experience building and maintaining CI/CD pipelines, preferably with Azure DevOps Solid grasp of Git version control and GitOps principles Familiarity with observability tooling such as Prometheus, Grafana, or GCP Operations Suite Scripting ability with tools like Bash or Python Understanding of shared service models, access control, and platform support processes Desirable: experience More ❯
efficiently and securely, using modern tooling such as Terraform, Docker, and cloud-native patterns. Drive cross-team initiatives , helping teams adopt and scale platform capabilities (e.g., CI/CD, observability, workflow orchestration) with a focus on reliability, security, and performance. Collaborate with stakeholders (data engineering, backend, DevOps, product) to identify platform gaps and design scalable, long-term solutions that unlock … with Terraform and infrastructure-as-code practices. Familiarity with tools like Airflow or DBT , and data platforms such as Snowflake or Databricks . Solid experience with CI/CD, observability, and platform reliability practices in cloud-native environments. Understanding of distributed computing concepts , and experience designing systems for scale, security, and availability . A proactive, collaborative mindset and demonstrated ability More ❯
Build and monitor data warehouse solutions (e.g. Redshift), ensuring data is accessible, clean, and well-modelled for analytics and product teams. Drive system performance and operational excellence by improving observability, uptime, and deployment processes across data and platform systems. What the Senior Data and Devops Engineerwill be responsible for: Design, implement and maintain scalable and reliable data pipelines Build, optimize … and manage cloud infrastructure Own and manage AWS-based systems, ensuring cost-effective, secure and high-performance operations Support and enhance the data and platform stack to ensure uptime, observability, and recoverability of key systems Collaborate with engineering and product teams to ensure data needs are met and infrastructure bottlenecks are identified early Support analytics and reporting workflows by making More ❯
global transportation agencies. As a senior engineer, you will play a critical role in designing, building, and scaling cloud services that enable remote device management, over-the-air updates, observability, and high-availability operations for our mobile perception platform. We tackle complex challenges related to scalability, performance, and security to enable smarter and safer cities through cutting-edge innovation. As … future of intelligent transportation systems. Responsibilities: Participate in incident prevention, response, and remediation efforts, learning and applying best practices. Design, build, and maintain scalable cloud services that support device observability, OTA updates, and fleet operations. Lead efforts to improve the reliability, security, and performance of multi-region AWS infrastructure using Infrastructure as Code (IaC) tools. Own CI/CD pipelines More ❯
Red Badger's high-performing engineering teams. In this role, you'll define and drive quality strategy for platform and infrastructure-level products - from container orchestration and microservices to observability tooling and CI/CD pipelines. This is a hands-on engineering position within cross-functional teams where quality is everyone's responsibility but you'll lead the way in … leads to ensure the platform behaves reliably under real-world conditions Be a Technical Leader in Quality Engineering Establish standards and practices for testing distributed, event-driven systems Enable observability-driven debugging by working closely with platform and service teams Automate validation of operational characteristics like availability, latency, throughput and recoverability Contribute to security posture through continuous validation of access … and fault-tolerant design in distributed systems Hands-on experience with infrastructure-as-code (Terraform), Kubernetes, cloud-native platforms (AWS), service meshes, CI/CD (e.g. GitHub Actions) and observability tooling (e.g. OpenTelemetry, Grafana) Strong programming skills in a modern backend language (e.g. Kotlin, Java, Go, or Python) with a test automation mindset Familiarity with resilience patterns, chaos testing, synthetic More ❯
Nottingham, Nottinghamshire, United Kingdom Hybrid / WFH Options
Capital One (Europe) plc
engineering solutions to make them more efficient, stable, and scalable. You'll lead on planning and implementing key SRE initiatives, optimise and automate how our systems operate, and improve observability through better monitoring and logging. You'll also work closely with your peers to drive consistency and high standards across SRE and the wider engineering community, so a real enthusiasm … vision set out by your Site Reliability Engineering Manager (SREM). Contribute to the major optimisation and improvement themes within the team. Identifying opportunities to reduce operational overheads through observability and service automation. Drive engineering best practice (e.g., Operational Excellence, Security, Quality, Resilience etc.) and set standards across the team and wider SRE community. Innovate within your team and contribute More ❯
results that matter. By taking advantage of all structured and unstructured data - securing and protecting private information more effectively - Elastic's complete, cloud-based solutions for search, security, and observability help organizations deliver on the promise of AI. What Is The Role: You will have the opportunity to work with a tremendous services, engineering, product, and sales team and wear … consultant will be focused on excellence, taking the initiative for self-improvement and possess great communication skills. Our customers' use cases extend across all the Elastic Solutions: Enterprise Search, Observability and Security, and beyond, and the scale of data in their environments ranges from gigabytes to petabytes. This diverse mix of a customer base means the challenges they face that More ❯
more than 10 individuals) Career coaching Skills & Mentoring I can mentor in: Technical communication, cross-functional collaboration, performance reviews, managing up Python, data structures, machine learning, LLM fine-tuning, observability, large-scale ML deployments Availability times are shown in your local timezone. Reviews Recent reviews highlight the effectiveness of coaching sessions in ML system design, behavioral interview preparation, career guidance More ❯
new AI/ML methods Deployment and serving of models at scale Infrastructure automation and cloud-native design Responsible AI, LLM safety, and interpretability tooling Data pipelines, versioning, and observability in production A glimpse of roles we recruit for: AI Research Scientist Machine Learning Engineer Data Engineer with ML experience Applied Scientist/Research Engineer DevOps for AI/AI More ❯
models for real-time inference. Responsibilities Develop APIs for AI inference used by internal and external clients Benchmark and optimize bottlenecks in our inference stack Enhance system reliability and observability, and manage system outages Research and implement optimizations for LLM inference Qualifications Experience with ML systems and deep learning frameworks such as PyTorch, TensorFlow, ONNX Knowledge of LLM architectures and More ❯
building, and maintaining secure, high-performance network platforms. Work closely with internal teams and external partners to deliver integrated, end-to-end solutions. Support and enhance existing monitoring and observability frameworks using tools like SNMP and syslog. Deliver against technical roadmaps, ensuring platforms remain aligned with product support lifecycles. Stay up to date with new technologies, expanding your knowledge to More ❯
Hemel Hempstead, Hertfordshire, Felden, United Kingdom
Meritus
building, and maintaining secure, high-performance network platforms. Work closely with internal teams and external partners to deliver integrated, end-to-end solutions. Support and enhance existing monitoring and observability frameworks using tools like SNMP and syslog. Deliver against technical roadmaps, ensuring platforms remain aligned with product support lifecycles. Stay up to date with new technologies, expanding your knowledge to More ❯
Hemel Hempstead, Hertfordshire, South East, United Kingdom
Yolk Recruitment
building, and maintaining secure, high-performance network platforms. Work closely with internal teams and external partners to deliver integrated, end-to-end solutions. Support and enhance existing monitoring and observability frameworks using tools like SNMP and syslog. Deliver against technical roadmaps, ensuring platforms remain aligned with product support lifecycles. Stay up to date with new technologies, expanding your knowledge to More ❯
in biotech, pharma, or AI-driven drug discovery Experience in both large organisations (with structured processes and metrics) and smaller/startup environments (delivering with limited resources) Knowledge of observability and reliability practices for product platforms Security or compliance experience Why Join? Be part of a world-class AI-first research environment shaping the future of drug discovery Work on More ❯
in biotech, pharma, or AI-driven drug discovery Experience in both large organisations (with structured processes and metrics) and smaller/startup environments (delivering with limited resources) Knowledge of observability and reliability practices for product platforms Security or compliance experience Why Join? Be part of a world-class AI-first research environment shaping the future of drug discovery Work on More ❯
Vector Search & Pipelines : Redesign and optimise RAG pipelines, create secure vector-based knowledge repositories, and build survey data transformation services. 🔧 MLOps & Evaluation : Own testing, evaluation harnesses, prompt management, and observability for AI systems in production. 🌍 Remote Flexibility : Work fully remotely with a team that values impact, collaboration, and continuous learning. What We’re Looking For: 2–4 years of experience More ❯
in biotech, pharma, or AI-driven drug discovery Experience in both large organisations (with structured processes and metrics) and smaller/startup environments (delivering with limited resources) Knowledge of observability and reliability practices for product platforms Security or compliance experience Why Join? Be part of a world-class AI-first research environment shaping the future of drug discovery Work on More ❯
in biotech, pharma, or AI-driven drug discovery Experience in both large organisations (with structured processes and metrics) and smaller/startup environments (delivering with limited resources) Knowledge of observability and reliability practices for product platforms Security or compliance experience Why Join? Be part of a world-class AI-first research environment shaping the future of drug discovery Work on More ❯
london (city of london), south east england, united kingdom
Hlx Life Sciences
in biotech, pharma, or AI-driven drug discovery Experience in both large organisations (with structured processes and metrics) and smaller/startup environments (delivering with limited resources) Knowledge of observability and reliability practices for product platforms Security or compliance experience Why Join? Be part of a world-class AI-first research environment shaping the future of drug discovery Work on More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Addition
complex sales cycles and build C-level relationships Confident presenter with a consultative sales approach Experience working with nearshore/offshore delivery models is a plus Knowledge of AIOps, observability, or platform engineering is advantageous What’s in It for You: Be part of a global team of 6,000+ technologists, with autonomy to shape a key growth sector Hybrid More ❯
AI solutions for high-stakes processes. Our platform enables teams to create AI co-workers that automate complex workflows while keeping humans central to decision-making. With robust governance, observability, and scalability tailored for regulated sectors like healthcare, finance, and aviation, Noxus drives confident, effective AI adoption. About the Role We are seeking a Junior Product Designer who will play More ❯
AI solutions for high-stakes processes. Our platform enables teams to create AI co-workers that automate complex workflows while keeping humans central to decision-making. With robust governance, observability, and scalability tailored for regulated sectors like healthcare, finance, and aviation, Noxus drives confident, effective AI adoption. About the Role We are seeking a Junior Product Designer who will play More ❯
AI solutions for high-stakes processes. Our platform enables teams to create AI co-workers that automate complex workflows while keeping humans central to decision-making. With robust governance, observability, and scalability tailored for regulated sectors like healthcare, finance, and aviation, Noxus drives confident, effective AI adoption. About the Role We are seeking a Junior Product Designer who will play More ❯
AI solutions for high-stakes processes. Our platform enables teams to create AI co-workers that automate complex workflows while keeping humans central to decision-making. With robust governance, observability, and scalability tailored for regulated sectors like healthcare, finance, and aviation, Noxus drives confident, effective AI adoption. About the Role We are seeking a Junior Product Designer who will play More ❯
london (city of london), south east england, united kingdom
Noxus
AI solutions for high-stakes processes. Our platform enables teams to create AI co-workers that automate complex workflows while keeping humans central to decision-making. With robust governance, observability, and scalability tailored for regulated sectors like healthcare, finance, and aviation, Noxus drives confident, effective AI adoption. About the Role We are seeking a Junior Product Designer who will play More ❯