Manchester, England, United Kingdom Hybrid / WFH Options
Suits Me
using IaC (e.g. Terraform, CDK) Owning and improving CI/CD pipelines (e.g. GitHub Actions, Jenkins) to streamline secure, automated deployments Building and managing observability tooling (e.g. CloudWatch, Grafana, OpenTelemetry) for proactive system monitoring and alerting Developing event-driven containerised and serverless systems using Lambda, ECS and EKS Championing reliability and security, embedding best practices in identity management, network design More ❯
bolton, greater manchester, north west england, united kingdom Hybrid / WFH Options
Suits Me
using IaC (e.g. Terraform, CDK) Owning and improving CI/CD pipelines (e.g. GitHub Actions, Jenkins) to streamline secure, automated deployments Building and managing observability tooling (e.g. CloudWatch, Grafana, OpenTelemetry) for proactive system monitoring and alerting Developing event-driven containerised and serverless systems using Lambda, ECS and EKS Championing reliability and security, embedding best practices in identity management, network design More ❯
warrington, cheshire, north west england, united kingdom Hybrid / WFH Options
Suits Me
using IaC (e.g. Terraform, CDK) Owning and improving CI/CD pipelines (e.g. GitHub Actions, Jenkins) to streamline secure, automated deployments Building and managing observability tooling (e.g. CloudWatch, Grafana, OpenTelemetry) for proactive system monitoring and alerting Developing event-driven containerised and serverless systems using Lambda, ECS and EKS Championing reliability and security, embedding best practices in identity management, network design More ❯
networking and security standards, protocols and best practices Proven experience in logging systems (e.g. ELK stack ) Proven experience in monitoring systems (e.g. Prometheus ) Proven experience in tracing systems (e.g. OpenTelemetry , Jaeger) Experience in performance optimization and resource management Relevant certifications (AWS, Google) Understanding of Agile methodologies Ability to diagnose and resolve service- affecting issues in a Broadcast/Livestream environment More ❯
networking and security standards, protocols and best practices Proven experience in logging systems (e.g. ELK stack ) Proven experience in monitoring systems (e.g. Prometheus ) Proven experience in tracing systems (e.g. OpenTelemetry , Jaeger) Experience in performance optimization and resource management Relevant certifications (AWS, Google) Understanding of Agile methodologies Ability to diagnose and resolve service- affecting issues in a Broadcast/Livestream environment More ❯
networking and security standards, protocols and best practices Proven experience in logging systems (e.g. ELK stack ) Proven experience in monitoring systems (e.g. Prometheus ) Proven experience in tracing systems (e.g. OpenTelemetry , Jaeger) Experience in performance optimization and resource management Relevant certifications (AWS, Google) Understanding of Agile methodologies Ability to diagnose and resolve service- affecting issues in a Broadcast/Livestream environment More ❯
Python) or JMeter, with data parameterization and correlation. Manage distributed load generation (containers, cloud workers) to simulate millions of concurrent users. Integrate performance metrics from CloudWatch, Prometheus, Grafana, and OpenTelemetry to analyze system bottlenecks. Develop SLA/SLO dashboards and integrate performance gates into CI/CD pipelines. Collaborate with DevOps and developers to tune JVM, Akka, thread pools, GC More ❯
principles) Familiarity with RESTful APIs and data integrations Proven experience in observability development and IT service management processes Excellent collaboration, communication, and documentation skills Desirable experience: Exposure to Prometheus, OpenTelemetry, OpenSearch, or similar tools Familiarity with Docker, Kubernetes, and distributed systems monitoring Experience with message brokers (Kafka, RabbitMQ) Front-end skills (Angular, JavaScript frameworks) for custom dashboards Knowledge of CI More ❯
of CI/CD pipelines using GitLab and ArgoCD. Design and operate containerised workloads with EKS, Fargate, and Kubernetes. Manage Kubernetes deployments using Helm charts. Implement observability solutions using OpenTelemetry (OTel), Grafana, and Splunk. Optimise infrastructure with Karpenter for autoscaling and cost efficiency. Ensure robust AWS networking (VPC, Transit Gateway, PrivateLink, Route 53) and enforce security best practices. Drive incident … response, monitoring, and performance tuning. Key Technologies: AWS (EKS, Fargate, EC2, S3), Terraform, CloudFormation, GitLab, ArgoCD, Docker, Kubernetes, Helm, Cassandra, OTel, Grafana, Splunk, Karpenter, Python, Bash. Desirable: Experience with Google Cloud Platform (GCP), Apigee Hybrid, and hybrid/multi-cloud environments. Carbon60, Lorien & SRG - The Impellam Group STEM Portfolio are acting as an Employment Business in relation to this vacancy. More ❯
experience in technical integrations and POCs Comfortable coding in any high-level programming language (Java, Go, Python) Strong hands-on knowledge of Kubernetes, AWS, Azure, GCP, Docker, Prometheus, and OpenTelemetry Industry knowledge and opinions on Monitoring, Observability, Log Management, SIEM Engineering/DevOps Background - advantage Experience in Technical Sales of Log Analytics/Monitoring/APM/SIEM - advantage Cultural More ❯
of ITSM/incident management processes and tools (Halo ITSM, ServiceNow, Jira Service Management) Cloud experience ( AWS, Azure, GCP ) and deploying observability tools in cloud-native environments Understanding of OpenTelemetry and modern observability standards Strong problem-solving skills and ability to work in a fast-paced start-up or consulting environment Why Join: Work with our exclusive client , a high More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Morela
of ITSM/incident management processes and tools (Halo ITSM, ServiceNow, Jira Service Management) Cloud experience ( AWS, Azure, GCP ) and deploying observability tools in cloud-native environments Understanding of OpenTelemetry and modern observability standards Strong problem-solving skills and ability to work in a fast-paced start-up or consulting environment Why Join: Work with our exclusive client , a high More ❯
ML lifecycle tools, model monitoring, and versioning Exposure to tools like KServe, Ray Serve, Triton, or vLLM a big plus Bonus Points: Experience with observability frameworks like Prometheus or OpenTelemetry Knowledge of ML libraries: TensorFlow, PyTorch, HuggingFace Exposure to Azure or GCP Passion for financial services Requirements: Degree in Computer Science, Engineering, Data Science or similar What We Offer A More ❯
technical experience in Cloud DevOps, SaaS, or observability, with 5+ years in leadership roles. Strong hands-on experience with AWS, GCP, Azure, K8S, Terraform and observability tools: Prometheus, Grafana, OpenTelemetry, ELK, Splunk, Datadog, and similar. Proficiency with metrics, logs, traces and APM. Leadership & Global Operations Proven success leading multi-regional or global technical teams with direct management of managers. Demonstrated More ❯
managing data handling, consent flows, and feature-gating based on user location. Partner with the DevOps Engineer to create comprehensive logging, monitoring, and analytics systems (eg, using Prometheus, Grafana, OpenTelemetry) to provide deep visibility into platform health, security events, and business KPIs. Required Qualifications Education & Experience Bachelor's degree in Computer Science or a related technical field. 5+ years of More ❯
development. Familiarity with testing frameworks (Vitest, Playwright) for both API and end-to-end testing. Experience with Docker, Helm, YAML, Kubernetes, and cloud-native deployments. Telemetry tools; Prometheus, Grafana, OpenTelemetry, DataDog, APM tools Understanding of infrastructure-as-code and CI/CD pipelines. Ability to improve codebases and influence architectural direction. Please send updated CV quoting availability and outside IR35 More ❯
development. Familiarity with testing frameworks (Vitest, Playwright) for both API and end-to-end testing. Experience with Docker, Helm, YAML, Kubernetes, and cloud-native deployments. Telemetry tools; Prometheus, Grafana, OpenTelemetry, DataDog, APM tools Understanding of infrastructure-as-code and CI/CD pipelines. Ability to improve codebases and influence architectural direction. Please send updated CV quoting availability and outside IR35 More ❯
development. Familiarity with testing frameworks (Vitest, Playwright) for both API and end-to-end testing. Experience with Docker, Helm, YAML, Kubernetes, and cloud-native deployments. Telemetry tools; Prometheus, Grafana, OpenTelemetry, DataDog, APM tools Understanding of infrastructure-as-code and CI/CD pipelines. Ability to improve codebases and influence architectural direction. Please send updated CV quoting availability and outside IR35 More ❯
london (city of london), south east england, united kingdom
Staffworx
development. Familiarity with testing frameworks (Vitest, Playwright) for both API and end-to-end testing. Experience with Docker, Helm, YAML, Kubernetes, and cloud-native deployments. Telemetry tools; Prometheus, Grafana, OpenTelemetry, DataDog, APM tools Understanding of infrastructure-as-code and CI/CD pipelines. Ability to improve codebases and influence architectural direction. Please send updated CV quoting availability and outside IR35 More ❯
development. Familiarity with testing frameworks (Vitest, Playwright) for both API and end-to-end testing. Experience with Docker, Helm, YAML, Kubernetes, and cloud-native deployments. Telemetry tools; Prometheus, Grafana, OpenTelemetry, DataDog, APM tools Understanding of infrastructure-as-code and CI/CD pipelines. Ability to improve codebases and influence architectural direction. Please send updated CV quoting availability and outside IR35 More ❯
West London, London, United Kingdom Hybrid / WFH Options
Staffworx Limited
development. Familiarity with testing frameworks (Vitest, Playwright) for both API and end-to-end testing. Experience with Docker, Helm, YAML, Kubernetes, and cloud-native deployments. Telemetry tools; Prometheus, Grafana, OpenTelemetry, DataDog, APM tools Understanding of infrastructure-as-code and CI/CD pipelines. Ability to improve codebases and influence architectural direction. Experience mentoring or coaching engineers. Please send updated CV More ❯
development. Familiarity with testing frameworks (Vitest, Playwright) for both API and end-to-end testing. Experience with Docker, Helm, YAML, Kubernetes, and cloud-native deployments. Telemetry tools; Prometheus, Grafana, OpenTelemetry, DataDog, APM tools Understanding of infrastructure-as-code and CI/CD pipelines. Ability to improve codebases and influence architectural direction. Experience mentoring or coaching engineers. Please send updated CV More ❯
west london, south east england, united kingdom Hybrid / WFH Options
Staffworx Limited
development. Familiarity with testing frameworks (Vitest, Playwright) for both API and end-to-end testing. Experience with Docker, Helm, YAML, Kubernetes, and cloud-native deployments. Telemetry tools; Prometheus, Grafana, OpenTelemetry, DataDog, APM tools Understanding of infrastructure-as-code and CI/CD pipelines. Ability to improve codebases and influence architectural direction. Experience mentoring or coaching engineers. Please send updated CV More ❯