This is a job posted by our partner Jooble. Below is a snippet of the job description. To read the full text, please click on the "Apply Now" link. Job Description: The role involves enhancing the company's monitoring capabilities More ❯
including Salesforce-specific pipelines. Build and maintain Infrastructure as Code (IaC) using Terraform and Ansible. Design highly reliable, scalable, and secure infrastructure supporting performance-critical workloads. Build proactive monitoring, observability, and alerting with Prometheus, Grafana, Azure Monitor, DataDog, and Dynatrace. Troubleshoot complex system issues spanning applications, networks, and infrastructure. Define platform SLAs, SLOs, and governance standards for self-service use. … Infrastructure as Code with Terraform and Ansible, along with scripting in PowerShell, Python, or Bash Experience implementing GitOps workflows and managing platform SLAs, SLOs, and governance standards Familiarity with observability and monitoring tools including Prometheus, Grafana, Azure Monitor, DataDog, or Dynatrace Preferred experience supporting Salesforce DevOps pipelines and working with Java, .NET, or Node.js application environments Exposure to AI/ More ❯
an initial 6 month contract. You'll be primarily responsible for working in a team that designs, builds, and maintains the organisations cloud infrastructure, with a focus on automation, observability and scalability. Essential skills/experience required: AWS Infrastructure as code using Terraform Cloudflare Developing CI/CD pipelines Incredibly beneficial: Snowflake MLOps Security best practices The role is confirmed More ❯
language models, or high-throughput data processing , Experience working collaboratively in cross-functional teams with diverse technical backgrounds , (Desirable) Experience with GitHub Actions, CI/CD pipelines, monitoring, and observability , Strong problem-solving skills with the ability to debug and optimize systems across different domains , (Desirable) Interest in market research, behavioral science, or business applications of AI , Excellent communication skills More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Opus Recruitment Solutions Ltd
transformation and governance Working closely with Engineering, Analytics, Product, and Pricing teams to ensure priorities are aligned Driving improvements in tooling, infrastructure, and engineering practices (CI/CD, testing, observability) Required Experience Proven experience leading Data Engineering teams Strong technical background (5+ years) in building scalable data platforms Excellent communication and stakeholder management skills Hands-on experience with modern data More ❯
endpoints Integrate AWS foundation models and optimise their performance across use cases Create abstraction layers so non-technical users can deploy AI agents easily Implement strong logging, monitoring, and observability Work closely with frontend developers to ensure seamless integration Set up and manage CI/CD pipelines using GitLab Contribute to containerisationand deployments on OpenShift What You'll Bring: 3+ More ❯
Telford, Shropshire, United Kingdom Hybrid / WFH Options
Experis - ManpowerGroup
proactive incident management. Key Skills/requirements Translate high-level monitoring non-functional requirements (NFRs) into actionable configurations across tools such as Splunk, Dynatrace, and AppDynamics. Deliver full-stack observability solutions, including application-aware network performance monitoring (NPM), synthetics, log analytics, and infrastructure metrics. Provide live support for monitoring technologies and assist with live service support, including key business events More ❯
interfaces using React, Next.js, and Vercel AI SDK Containerising services using Docker and deploying to AWS (ECS, Lambda) Collaborating with researchers to productionise transformer models (e.g. PyTorch, HF) Using observability tools like Langfuse to monitor prompt and model performance Writing clean, modular, testable code that scales in production environments What They're Looking For: 3+ years' experience as a full More ❯
innovation cycles. You will have the opportunity to take ambiguity and refine it into valuable outcomes, taking risks where justified by the reward.You will understand how CI/CD, observability, and SLOs form part of a mature product offering and push for best practices. Use your insight to prevent production issues before they happen. When issues do occur you will More ❯
data is delivered on time and without failure. The ideal candidate will have a strong experience working with streaming and batch data systems, a solid understanding of monitoring a observability, and hands-on experience working with AWS, Apache Flink, Kafka, and Python. This is a fantastic opportunity to step into a SRE role focused on data reliability in a modern More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Rise Technical Recruitment Limited
data is delivered on time and without failure. The ideal candidate will have a strong experience working with streaming and batch data systems, a solid understanding of monitoring a observability, and hands-on experience working with AWS, Apache Flink, Kafka, and Python. This is a fantastic opportunity to step into a SRE role focused on data reliability in a modern More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Rise Technical Recruitment Limited
reporting data is delivered on time and without failure.The ideal candidate will have a strong experience working with streaming and batch data systems, a solid understanding of monitoring a observability, and hands-on experience working with AWS, Apache Flink, Kafka, and Python.This is a fantastic opportunity to step into a SRE role focused on data reliability in a modern cloud More ❯
and Engineering background Proficient in writing infrastructure as code for public cloud Experience with Python coding/testing or any Cloud-based technology (AWS preferred) Good understanding of Data Observability Good understanding of Hosting Platform Linux/Unix (EKS and Container experience is a plus) Good understanding of Databases, Data Lakes, and Query Engines, SQL/DDLs is preferred We More ❯
leaks, and performance bottlenecks Turn research prototypes into robust, production-ready software modules Lead architecture discussions and enforce clean, scalable design patterns Drive engineering standards across CI/CD, observability, and system modularisation Mentor developers through code reviews, pair programming, and design walkthroughs Bridge the gap between research and deployable robotics software-across embedded and cloud platforms What we're More ❯
ELT/ETL pipelines Batch, streaming, and micro-batching architectures CDC patterns and incremental load strategies at scale Experience building ingestion pipelines from diverse external APIs (CRM, analytics, finance, observability tools) to consolidate all organisational data into a unified lakehouse Infrastructure & Networking Azure networking and security best practices for enterprise data platforms Experience with event-driven architectures and API integration More ❯
Bring * Proven experience designing and running Kubernetes-based systems, ideally in constrained or disconnected environments * Hands-on expertise with infrastructure-as-code tooling (Terraform, Helm), CI/CD, and observability stacks * Deep understanding of containerisation, service networking, and resource tuning for edge devices or VMs * Practical experience deploying and running machine learning workloads, including LLMs or transcription models * Comfort adapting More ❯
report - Protecting & growing your payments business - Are you passionate about building reliable, scalable, and high-performing systems? Do you thrive on solving complex infrastructure challenges while driving automation and observability best practices? If so, we want to hear from you! At Thredd, we're looking for a Site Reliability Engineer to act as a North Star for this evolving discipline. More ❯
Telford, Shropshire, West Midlands, United Kingdom
LA International Computer Consultants Ltd
insight, and proactive incident management. Key Responsibilities: * Translate high-level monitoring non-functional requirements (NFRs) into actionable configurations across tools such as Splunk, Dynatrace, and AppDynamics. * Deliver full-stack observability solutions, including application-aware network performance monitoring (NPM), synthetics, log analytics, and infrastructure metrics. * Provide live support for monitoring technologies and assist with live service support, including key business events More ❯
across the business. Collaborate across teams: Work closely with product, sales, finance and operations to understand business needs, define KPIs, and deliver scalable, accessible data products. Maintain quality and observability: Monitor data pipeline health, implement alerts, and ensure accuracy, documentation, and governance are embedded in everything you build. What We're Looking For: 2+ years of experience in data engineering More ❯
Bristol, Gloucestershire, United Kingdom Hybrid / WFH Options
British Veterinary Association
agile delivery workflows A passion for collaboration, clean code, and continuous improvement Bonus if you also have: Experience with hybrid rendering models (SSR, SSG, ISR) Exposure to monitoring/observability tools for frontend apps Familiarity with backend or full-stack development (though not essential) You Matter to Us At IVC Evidensia, we believe better engineering leads to better animal care. More ❯
over process and deliberation Great to haves Experience with .NET/C# Experience working in an agile development team with a focus on delivering value early Experience with building observability and alerting into systems Salary and benefits (the stuff you'd expect!) Salary is £78K - £100K (depending on experience) This is a full time opportunity, working Monday to Friday remotely More ❯
secure handling of sensitive operational data and compliance with relevant standards Developed and maintained robust APIs for system integration Drove operational excellence and continuous improvement Implemented and managed monitoring, observability, and troubleshooting tools for deployed systems Designed and handled containerised applications (e.g., Docker, Kubernetes) Qualifications Bachelor's degree in Computer Science , Engineering, or a related technical field 5+ years of More ❯
you have We're a multi-cloud team - experience with AWS and CDK is a big plus. Experience implementing/maintaining cloud compliance standards (SOC2, ISO-27001) Familiarity with observability tools like Datadog Background in implementing security best practices in cloud infrastructure Why Prolific is a great place to work We've built a unique platform that connects researchers and More ❯
you have We're a multi-cloud team - experience with AWS and CDK is a big plus. Experience implementing/maintaining cloud compliance standards (SOC2, ISO-27001) Familiarity with observability tools like Datadog Background in implementing security best practices in cloud infrastructure Why Prolific is a great place to work We've built a unique platform that connects researchers and More ❯