or above Experience with Apache Kafka and containerization technology. Understanding of messaging platforms (e.g., Kafka, XFB, EMS, ActiveMQ and/or IBM MQ). Experience with monitoring and alerting (Prometheus/Grafana) and Elasticsearch. Experience with containers (Docker, K8S or OpenShift). Experience with handling concurrency in high-load applications. Oracle RDBMS knowledge - optimising performance and queries. Spring Core, Spring Boot API, Spring More ❯
as Azure, AWS or GCP. Experience with Kubernetes is desirable. You have a high degree of experience in observing the performance and health of applications via tools such as Grafana, Prometheus, Datadog, Sentry, etc. You care strongly about, and advocate for, performant applications. You have a flair for simplicity when problem solving. Excellent communication skills, with More ❯
Lexington, Massachusetts, United States Hybrid / WFH Options
Raft
DoD/Air Force AOC Weapon System and operating standards within cleared facilities (SIPR, IL6) - Familiarity with AWS and cloud technologies - Skill in operating observability tooling and alerting (Prometheus, Grafana, etc.) - Knowledge of Platform One Big Bang Clearance Requirements: Active Secret security clearance Work Type: Hybrid - Hanscom AFB, MA highly preferred (or local to Reston, VA or Hampton, VA or More ❯
Newport News, Virginia, United States Hybrid / WFH Options
Raft
DoD/Air Force AOC Weapon System and operating standards within cleared facilities (SIPR, IL6) - Familiarity with AWS and cloud technologies - Skill in operating observability tooling and alerting (Prometheus, Grafana, etc.) - Knowledge of Platform One Big Bang Clearance Requirements: Active Secret security clearance Work Type: Hybrid - Hanscom AFB, MA highly preferred (or local to Reston, VA or Hampton, VA or More ❯
Founded in 2017, Obsidian Security was created to close a critical gap: securing the SaaS applications where modern business happens (platforms like Microsoft 365, Salesforce, and hundreds more). Backed by top investors including Greylock, Norwest Venture Partners, and IVP, we More ❯
Gloucester, Gloucestershire, South West Hybrid / WFH Options
CGI
Automation Tester (DV Security Clearance) Position Description CGI was recognised in the Sunday Times Best Places to Work List 2025 and has been named one of the 'World's Best Employers' by Forbes magazine. We offer a competitive salary, excellent More ❯
infrastructure for CI/CD processes. Operate and maintain Kafka clusters for real-time data pipelines. Diagnose and resolve issues across systems, networks, containers, and applications. Use observability tools (Grafana, Prometheus, Kibana, Elasticsearch) to monitor system health. Automate system management tasks using Ansible. Participate in an on-call rotation to support global operations. Required Skills & Experience: Strong hands-on Linux … experience managing Kubernetes clusters. Proficiency with GitLab for version control and CI/CD workflows. Solid understanding of Kafka in high-throughput environments. Experience with observability tools such as Grafana, Prometheus, Kibana, and Elasticsearch. Expertise in Ansible for automation and configuration management. Strong problem-solving skills across infrastructure layers (compute, network, OS, containers). More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Inara
the norm. This is a hands-on engineering role with the opportunity to work on cutting-edge systems from day one. Tech Stack: Java 21, Spring Boot, Kubernetes, AWS, Grafana, GitHub, event-driven architecture, microservices, CI/CD & rapid delivery tooling What You’ll Be Doing: Build scalable and secure microservices using Java 21 + Spring Boot Collaborate with in … house engineers to augment and accelerate platform delivery Deploy and monitor services in AWS using Kubernetes Work in a high-frequency release environment, deploying multiple times per day Use Grafana (or similar) for observability and maintain production-grade reliability Work onsite 3 days/week in London for the first 4–6 weeks (hybrid flexibility beyond this) We’re Looking More ❯
maintaining backend services using Django Designing and implementing scalable REST APIs Ensuring security, performance, and reliability of the platform Managing our AWS partner Monitoring and troubleshooting issues using Prometheus, Grafana, and OpenSearch Supporting the team with infrastructure needs for new features and deployments Implementing Infrastructure as Code (IaC) using tools like Terraform and Ansible Handling high traffic volumes, up to More ❯
DEVOPS ENGINEER • On-call support may be required Required Skills: • Experience maintaining an operational environment and using monitoring tools and dashboard interfaces (e.g., Kibana, Grafana, Nagios) • Experience working with container images and platforms (Kubernetes/Docker/OpenShift) • Strong understanding of DevOps and software/application development processes • Understanding of GitLab, Jenkins, ArgoCD, and other DevOps/Continuous Integration More ❯
security - Proficiency with Ansible, Python, or Shell scripting to automate manual tasks - Experience maintaining web servers such as NGINX and Envoy - Familiarity with monitoring tools such as Prometheus and Grafana - Hands-on experience supporting data platforms such as PostgreSQL, Oracle, Cassandra, or Elasticsearch - Ability to troubleshoot complex system issues across OS, cloud, and network layers Preferred Skills: - Experience supporting hybrid More ❯
using programming languages. Python or Java is preferred. Full understanding of the end-to-end trade lifecycle (FX knowledge preferred). Experience using monitoring tools such as Splunk, Prometheus or Grafana. Expertise in containerization and related tooling such as Docker, Kubernetes, and CI/CD. Exposure to Linux/Unix and SQL. This is a great opportunity for a Production Engineer to More ❯
patterns and implementing best practices Exposure to secrets management platforms (e.g., HashiCorp Vault) Familiarity with infrastructure as code using Terraform Experience with monitoring, logging, and security tools (e.g., Prometheus, Grafana, and BQL) Expertise in containerization and orchestration using Kubernetes for deployments Experience working with high-availability systems architecture and the ability to support critical scalable and robust systems Bachelor's More ❯
Farnborough, Hampshire, England, United Kingdom Hybrid / WFH Options
Randstad Technologies
or private cloud platforms Proficient in Infrastructure as Code - Ansible, Terraform Skilled in CI/CD tools Solid scripting skills - PowerShell, Python, or equivalent Experience with monitoring tools - Prometheus, Grafana, Kibana Please note: Active SC Clearance is essential Hybrid working - Farnborough-based Day Rate: £450-£550/day Duration: 6 months | Inside IR35 If this seems of interest to you More ❯
on AWS and other providers Operating MongoDB (or other document database) clusters Operating Redis (or other key-value storage) clusters Administering Linux servers Maintaining distributed software Operating Prometheus and Grafana Operating logging collection and analysis systems Participating in the on-call rotation (04:00 - 16:00 UTC) Skills: Kubernetes & containers (advanced) AWS/EKS (advanced) Linux (advanced) Terraform and IaC … in general (proficient) Helm (proficient) Go and/or Python (familiar) MongoDB (or similar) Redis (or similar) Monitoring - Prometheus, Grafana, Thanos (familiar) Grasp of networking concepts (subnets, routing, peering, load balancing, NAT, etc.) Common networking protocols (DNS, TCP/IP, HTTP, TLS, UDP) Proactive, energetic, innovative and change oriented Nice to have: GCP or Azure Bare metal infrastructure engineering API More ❯
on AWS and other providers Operating MongoDB (or other document database) clusters Operating Redis (or other key-value storage) clusters Administering Linux servers Maintaining distributed software Operating Prometheus and Grafana Operating logging collection and analysis systems Working hours within 16:00 - 04:00 UTC Skills: Kubernetes & containers (advanced) AWS/EKS (advanced) Linux (advanced) Terraform and IaC in general (proficient … Helm (proficient) Go and/or Python (familiar) MongoDB (or similar) Redis (or similar) Monitoring - Prometheus, Grafana, Thanos (familiar) Grasp of networking concepts (subnets, routing, peering, load balancing, NAT, etc.) Common networking protocols (DNS, TCP/IP, HTTP, TLS, UDP) Proactive, energetic, innovative and change oriented Nice to have: GCP or Azure Bare metal infrastructure engineering API management experience Large More ❯
years of experience with either Python or Go Building CI/CD pipelines and automation of various parts of the stack Self-hosting and maintaining observability tools such as Grafana/Prometheus It would be great if you also have experience with one or more of: Edge/IoT infrastructure (Yocto, IoT device provisioning, over-the-air updates, etc.) Remote management of More ❯
NPM for application builds Cypress for automated testing Git for source control Terraform and Ansible for infrastructure configuration OpenShift, RHEL/CentOS, and Docker for deployment targets InfluxDB and Grafana for monitoring and observability Oracle (or equivalent RDBMS), AMQP, and S3-compatible object storage systems Please note this role requires active UK*C DV Clearance. Hold the necessary clearance but More ❯
Cheltenham, Gloucestershire, England, United Kingdom
iO Associates
NPM for application builds Cypress for automated testing Git for source control Terraform and Ansible for infrastructure configuration OpenShift, RHEL/CentOS, and Docker for deployment targets InfluxDB and Grafana for monitoring and observability Oracle (or equivalent RDBMS), AMQP, and S3-compatible object storage systems Please note this role requires active UK*C DV Clearance. Hold the necessary clearance but More ❯
Hounslow, London, United Kingdom Hybrid / WFH Options
Deerfoot Recruitment Solutions
and work independently across technical tasks What You'll Need Languages & Tools: Python, Ansible (C++, Go a plus), Git, Jira, Confluence Cloud & Infrastructure: Azure, Kubernetes, OpenShift Monitoring: Splunk, Prometheus, Grafana Databases: Oracle (OCA/OCP a plus) Environments: Linux/Unix Strong debugging, problem-solving, and collaboration skills Proven experience in DevOps and service reliability roles Interested? Apply now and More ❯