to design, build, and maintain the platforms and tooling that underpin our infrastructure provisioning and delivery lifecycle. You'll work collaboratively with cross-functional teams to automate infrastructure, enhance observability, and embed best practices in VMware Hypervisor and DevOps . Key Responsibilities: Build and maintain on-prem and cloud infrastructure (VMware Hypervisor, vSphere, OpenStack, AWS, GCP, Azure). Apply deep More ❯
to design, build, and maintain the platforms and tooling that underpin our infrastructure provisioning and delivery lifecycle. You'll work collaboratively with cross-functional teams to automate infrastructure, enhance observability, and embed best practices in VMware Hypervisor and DevOps . Key Responsibilities: Build and maintain on-prem and cloud infrastructure (VMware Hypervisor, vSphere, OpenStack, AWS, GCP, Azure). Apply deep More ❯
GitHub Actions, or GitLab CI. Solid understanding of containerization technologies (Docker, Kubernetes). Working knowledge of Python and SQL for automation and data pipeline development. Familiarity with monitoring and observability tools (Grafana, Prometheus, CloudWatch). Strong grasp of data architecture principles and ETL design patterns. Financial services or regulated industry experience (desirable). More ❯
Wokingham, Berkshire, United Kingdom Hybrid / WFH Options
Experis
Collaborate with Agile teams to automate deployment, monitoring, and infrastructure management. Ensure platform and business application reliability and performance against strict SLAs and KPIs. Implement and maintain cloud-native observability stacks (Prometheus, Grafana, Loki, Tempo). Develop and maintain Infrastructure as Code (IaC) using tools like Kustomize or Helm. Manage CI/CD pipelines using Tekton and ArgoCD. Support and More ❯
Wokingham, Berkshire, United Kingdom Hybrid / WFH Options
Experis
Collaborate with Agile teams to automate deployment, monitoring, and infrastructure management. Ensure platform and business application reliability and performance against strict SLAs and KPIs. Implement and maintain cloud-native observability stacks (Prometheus, Grafana, Loki, Tempo). Develop and maintain Infrastructure as Code (IaC) using tools like Kustomize or Helm. Manage CI/CD pipelines using Tekton and ArgoCD. Support and More ❯
consistency, repeatability, and auditability across environments Develop and maintain developer tooling and golden templates (CI/CD pipelines, scaffolds, environments) to standardize best practices across teams Design and implement observability frameworks (metrics, tracing, logging, alerting) that are easy to consume and part of the platform baseline Eliminate repetitive tasks through automation and opinionated defaults, so teams are not blocked by … and orchestration (Docker, Kubernetes) Familiarity with CI/CD systems (GitHub Actions, GitLab CI, Jenkins, etc.) Hands-on experience with infrastructure-as-code tools (e.g., Terraform, CloudFormation) Knowledge of observability tools (Prometheus, Grafana, ELK stack, Datadog, etc.). Solid grasp of Linux systems and networking fundamentals Strong problem-solving and debugging skills Your Package & Perks: A competitive salary Flexible working More ❯
consistency, repeatability, and auditability across environments Develop and maintain developer tooling and golden templates (CI/CD pipelines, scaffolds, environments) to standardize best practices across teams Design and implement observability frameworks (metrics, tracing, logging, alerting) that are easy to consume and part of the platform baseline Eliminate repetitive tasks through automation and opinionated defaults, so teams are not blocked by … and orchestration (Docker, Kubernetes) Familiarity with CI/CD systems (GitHub Actions, GitLab CI, Jenkins, etc.) Hands-on experience with infrastructure-as-code tools (e.g., Terraform, CloudFormation) Knowledge of observability tools (Prometheus, Grafana, ELK stack, Datadog, etc.). Solid grasp of Linux systems and networking fundamentals Strong problem-solving and debugging skills Your Package & Perks: A competitive salary Flexible working More ❯
Farnborough, Hampshire, South East, United Kingdom
Stott & May Professional Search Limited
play a key role in designing, implementing, and maintaining performance testing frameworks to ensure application reliability, scalability, and efficiency. This role combines expertise in performance engineering, infrastructure automation, and observability to optimise system performance across production and pre-production environments. Your Responsibilities * Design and implement performance testing strategies using tools such as JMeter, Gatling, or LoadRunner. * Monitor and analyse system … such as Jenkins or GitLab CI. * Proficiency in scripting languages such as Python or Bash. * Experience with Infrastructure as Code tools (Ansible, Terraform). * Strong knowledge of monitoring and observability tools (Prometheus, Grafana). * Familiarity with Linux environments and cloud platforms (AWS, Azure, or GCP). * Excellent analytical, problem-solving, and communication skills. Desirable Skills: * Experience with Kubernetes and container More ❯
Wokingham, Berkshire, South East, United Kingdom Hybrid / WFH Options
Sanderson Government and Defence
for a sharp-minded Site Reliability Engineer to join our cloud-native mission in Azure. If you thrive in Agile teams, live for automation, and know your way around observability stacks and CI/CD pipelines - this is your playground. What you'll be doing: Automating deployment, monitoring & infrastructure with precision Owning platform reliability, performance & SLAs Building IaC with Helm More ❯
insight, and proactive incident management. Key Responsibilities Translate high-level monitoring non-functional requirements (NFRs) into actionable configurations across tools such as Splunk, Dynatrace, and AppDynamics. Deliver full-stack observability solutions, including application-aware network performance monitoring (NPM), synthetics, log analytics, and infrastructure metrics. Provide live support for monitoring technologies and assist with live service support, including key business events More ❯
UKIC DV Cleared Site Reliability/DevOp Engineer London - 5 Days Onsite Up to £550 per day (Umbrella, Inside IR35) 12-Month Contract Must hold UKIC DV Clearance Are you passionate about reliability, automation, and supporting mission-critical systems? Join More ❯
South West London, London, United Kingdom Hybrid / WFH Options
Purview Consultancy Services Ltd
and agentic workflows Drive architectural reviews for LlamaParse/Azure Document Intelligence integration Design fault-tolerant, high-availability AI systems with automatic failover and load balancing Establish comprehensive monitoring, observability, and performance optimization strategies Mentor technical teams and establish AI engineering best practices using modern toolchains Oversee model performance evaluation using LangGraph evals and DeepEval frameworks More ❯
company's customer experience (CX) vision. You will collaborate closely with other software engineers, product teams, and AI specialists to develop LLM AI-powered applications, ensuring their scalability, security, observability and performance. This role is hands-on, with a primary focus on coding, testing, and deploying AI solutions in a fast-paced, agile environment. Responsibilities: Code Development and Testing Write More ❯
specialism in vulnerability management Self-starter, able to work in technical detail and motivate a diverse group of stakeholders to build sponsorship for significant and impactful change Desired: Establishing observability platforms Capabilities adjacent to exposure/vulnerability management capabilities (ie cyber security asset management, attack surface management, etc) Pragmatic application of zero-trust philosophies Cloud based security (GCP, AWS and More ❯
South West London, London, United Kingdom Hybrid / WFH Options
Purview Consultancy Services Ltd
Intelligence Implement advanced RAG systems with text-embedding-3-large and Azure DB for Postgres Lead hands-on development using Claude Code for rapid agentic workflow creation Establish AI observability and monitoring using Arize Phoenix and Azure AI Foundry Fine-tune and optimize Azure OpenAI GPT-5 models for financial document understanding Implement comprehensive evaluation strategies using LangGraph evals and More ❯
experiences. Proven experience as a Business Analyst in an Agile environment Strong knowledge of market data and market data supervision Financial Services experience is mandatory Strong understanding of monitoring, observability, and telemetry (metrics, logs, traces) Ability to translate technical concepts into actionable business requirements Hands-on experience with tools such as Datadog, BigPanda, Grafana would be desirable Excellent stakeholder management More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Devonshire Hayes Recruitment Specialists Limited
business processes. (LEAD) Familiarity with Microsoft Power Platform concepts, including Power Automate, Power Apps, and Dataverse. (LEAD) Experience applying Generative AI and prompting techniques. Strong understanding of AI governance, observability, and compliance frameworks. Proven ability to deliver secure, scalable, and responsible AI solutions. Excellent communication and presentation skills Extensive experience working collaboratively with diverse colleagues and stakeholders. Knowledge of the More ❯
Knutsford, Cheshire, United Kingdom Hybrid / WFH Options
Experis LTD
Role Title: Observability and telemetry Engineer Duration: contract to run until 31/12/2025 Location: Knutsford, Hybrid 2/3 days per week onsite Rate: up to £368 p/d Umbrella inside IR35 Key Skills/requirements EaaS Evolution Working Experience on PHP or Python Knowledge of Oracle and other relational Databases Well versed in working on More ❯
Knutsford, Cheshire, England, United Kingdom Hybrid / WFH Options
Tenth Revolution Group
and driven Security Engineer to join our small, focused team building a telemetry pipeline MVP. You'll play a key role in designing and securing our containerized environments, ensuring observability tools and infrastructure are built with security at their core. This role blends deep technical expertise with a hands-on, collaborative approach ideal for someone who thrives in fast-moving … documentation and response playbooks What You Bring Hands-on experience with Kubernetes, OpenShift, and secure production systems Strong GitLab and CI/CD security expertise Familiarity with telemetry and observability stacks Solid grasp of networking, firewalls, and core security principles Knowledge of container security tools (Aqua, Twistlock, Trivy) Understanding of frameworks like NIST or ISO 27001 Excellent analytical and communication More ❯
interfaces and deliver product features. Working with DevOps/Platform teams on CI/CD, containerisation and deployment (Docker, Kubernetes or managed alternatives). Troubleshooting production issues and improving observability (logging, metrics, tracing). Contributing to technical design discussions and driving improvements to reliability and performance. Tech Stack & Skills Core skills: Strong Python development experience (5+ years preferred) with production … Nice to have: Experience with async frameworks (FastAPI, Celery, or asyncio-based work). Exposure to event-driven architectures, message queues (Kafka, RabbitMQ) or pub/sub. Knowledge of observability tooling (Prometheus, Grafana, Sentry, ELK). Understanding of security best practices for web services (OWASP, authentication/authorization patterns). Experience working in product-led teams and mentoring junior engineers. More ❯
Knutsford, Cheshire, United Kingdom Hybrid / WFH Options
Octopus Computer Associates
Role Overview: We are seeking a highly capable Security Engineer to join a focused team developing a telemetry pipeline MVP. This role requires deep technical expertise in containerized environments, observability tooling, and secure infrastructure design. The ideal candidate will ensure that security is Embedded across the pipeline architecture, from deployment to data flow, while collaborating closely with DevOps and development … risk analysis for the telemetry pipeline Collaborate with DevOps engineers to embed security into infrastructure-as-code and deployment workflows Monitor and respond to security events and alerts from observability platforms Maintain documentation of security architecture, policies, and incident response procedures Required Skills & Experience: Strong hands-on experience with Kubernetes and OpenShift in secure production environments Proficiency in GitLab and More ❯
Glasgow, Lanarkshire, United Kingdom Hybrid / WFH Options
Octopus Computer Associates
Role Overview: We are seeking a highly capable Security Engineer to join a focused team developing a telemetry pipeline MVP. This role requires deep technical expertise in containerized environments, observability tooling, and secure infrastructure design. The ideal candidate will ensure that security is Embedded across the pipeline architecture, from deployment to data flow, while collaborating closely with DevOps and development … risk analysis for the telemetry pipeline Collaborate with DevOps engineers to embed security into infrastructure-as-code and deployment workflows Monitor and respond to security events and alerts from observability platforms Maintain documentation of security architecture, policies, and incident response procedures Required Skills & Experience: Strong hands-on experience with Kubernetes and OpenShift in secure production environments Proficiency in GitLab and More ❯
Telford, Shropshire, West Midlands, United Kingdom
Sanderson Government and Defence
insight, and proactive incident management. Key Responsibilities Translate high-level monitoring non-functional requirements (NFRs) into actionable configurations across tools such as Splunk, Dynatrace, and AppDynamics. Deliver full-stack observability solutions, including application-aware network performance monitoring (NPM), synthetics, log analytics, and infrastructure metrics. Provide live support for monitoring technologies and assist with live service support, including key business events … improvement initiatives and tooling exploitation to enhance operational efficiency efficiency within immature teams Required Skills and Experience Strong understanding and expereince in SRE principals and methodologies Strong understanding of Observability within a complex tech stack Hands-on experience with monitoring tools such as Splunk, Splunk ITSI, Dynatrace, AppDynamics, and synthetic monitoring platforms. Strong understanding and experience with implementing and using More ❯