Integration services such as messaging and streams. Building RESTful API Services. Containerisation, Kubernetes, serverless functions. Microservices and distributed tracing. Enterprise logging, monitoring, and alerting frameworks (e.g., ELK, Splunk, Prometheus, Grafana). Automation scripting (using tools such as Terraform, Ansible, etc.). Experience of working with Continuous Integration (CI), Continuous Delivery (CD) and continuous testing tools. Experience working within an
London, South East, England, United Kingdom Hybrid / WFH Options
Become
platforms (AWS, Azure, or GCP) and containerisation (e.g., Docker, Kubernetes) Experience with Infrastructure as Code tools (e.g., Terraform, Ansible, CloudFormation) Familiarity with monitoring, logging, and observability tools (e.g., Prometheus, Grafana, ELK, Datadog) Experience working in regulated environments such as banking, fintech, or insurance Prior experience working in or contributing to a Centre of Excellence team Strong scripting skills (e.g., Bash
Agile teams using tools like Git, Jira, and Confluence Eligible for SC and NPPV3 clearance Desirable: Container orchestration with Kubernetes HashiCorp tools: Vault, Consul, Packer Monitoring and observability with Grafana, Prometheus, or similar Familiarity with cloud networking, VPCs, NAT Gateways, security groups, etc. Personal Attributes: Proactive and self-driven with a passion for technology Strong problem-solving mindset Collaborative team
IP, HTTP/S, DNS, VPNs). Expertise with container and orchestration technologies, including Docker and Kubernetes. Hands-on experience with Helm for packaging, deploying, and managing Kubernetes applications. Experience with monitoring and logging solutions like Prometheus, Grafana, ELK Stack, or similar. Knowledge of security best practices in DevOps and cloud environments. Terraform, Ansible or Chef experience is preferred. Nice to haves: knowledge of Concourse, Nexus, SonarQube, various
Flux) Knowledge of IaC and configuration management tools (Terraform, OpenTofu, Crossplane, Pulumi, Ansible, CloudFormation) Strong problem-solving experience, focusing on automation Production experience with Monitoring and Observability tools (Prometheus, Grafana, Datadog, Thanos, New Relic, OpenTelemetry) Understanding of Cloud Networking concepts (Mesh Networking, NAT, Load Balancers, SSL Certificates and TLS termination, API Gateways, proxies, etc.) Strong written and verbal communication
languages, such as C#, Python, Perl, Java, C++ CI/CD tools such as Azure DevOps, GitHub Actions, GitLab, Jenkins, TeamCity Scripting languages such as PowerShell, Bash Observability/Monitoring: Prometheus, Grafana, Splunk Containerisation tools such as Docker, K8S, OpenShift, EC, containers Hosting technologies such as IIS, nginx, Apache, App Service, LightSail Analytical and creative approach to problem solving We encourage you
C++. Experience with CI/CD tools like Azure DevOps, GitHub Actions, GitLab, Jenkins, TeamCity. Scripting skills in PowerShell, Bash. Familiarity with observability and monitoring tools such as Prometheus, Grafana, Splunk. Experience with containerization tools like Docker, Kubernetes, OpenShift, EC2 containers. Analytical and creative problem-solving skills. We encourage you to apply, even if you don't meet all the
languages, such as C#, Python, Perl, Java, C++ CI/CD tools such as Azure DevOps, GitHub Actions, GitLab, Jenkins, TeamCity Scripting languages such as PowerShell, Bash Observability/Monitoring: Prometheus, Grafana, Splunk Containerisation tools such as Docker, K8S, OpenShift, EC, containers Analytical and creative approach to problem solving We encourage you to apply, even if you don't meet all of
Azure, AWS or GCP Proficiency using Infrastructure as Code (IaC) tools such as Terraform (preferred), Ansible, or CloudFormation. Experience with monitoring, observability and logging tools such as Datadog, Prometheus, Grafana, or similar. Proven track record of maintaining highly available and performant production environments. Ability to identify and implement effective mitigation strategies and operational playbooks. Useful/Bonus Skills to have
Linux internals, and security best practices. • Deep understanding of CI/CD tools and practices (GitHub Actions, Jenkins, ArgoCD, etc.). • Strong observability mindset: experience with tools like Prometheus, Grafana, Loki, etc. • Experience with hybrid service meshes, multi-cluster Kubernetes, or edge computing, preferred. • Knowledge of Kafka, Redis, Elasticsearch, or RDBMS (MySQL/Postgres), preferred. As a global leader in
/Golang Provisioning software/frameworks (Elasticsearch/Spark/Hadoop/Airflow/PostgreSQL) Infrastructure Management - CasC, IasC (Ansible, Terraform, Packer) Log and metric aggregation with Fluentd, Prometheus, Grafana, Alertmanager Public Cloud, primarily GCP and Azure, but also AWS What do I need to have? Take pride in designing, building and delivering high quality well engineered solutions to complex
infrastructure Other duties as needed About You 5+ years' experience in Site Reliability Engineer roles Expert+ level Linux administration, scripting, and troubleshooting Demonstrable knowledge of Observability tools (Prometheus/Grafana, New Relic, Splunk, Datadog) Comprehensive experience with AWS (Amazon Web Services) and its core capabilities (VPC, EC2, ECS, Route53, Fargate, ALB/NLB distributions, etc.) Extensive experience with cloud automation
technologies such as Oracle SQL, Mongo, Postgres o Know your way around Linux and Windows command lines, e.g. Bash and PowerShell o Monitoring large systems using technologies such as Grafana, Prometheus, ELK, Splunk o Experience of working in Agile teams, and the tooling that supports it, e.g. Atlassian o Diagnosing and troubleshooting application issues resulting in service outages o Troubleshooting
Accounts - AWS Control Tower, GCP Resource Manager, etc. Network - AWS Transit Gateway, GCP Shared VPC, AWS Route53, GCP Cloud DNS, etc. Observability - AWS OpenSearch, GCP Monitoring/Traces, OpenTelemetry, Grafana, Prometheus, etc. Automation Prowess: Hands-on experience with modern Infrastructure as Code (IaC) automation tools and frameworks (Terraform, Jenkins, Ansible, etc.). Software Development Acumen: A software development background is
Node, RabbitMQ Databases - Postgres, MariaDB, MongoDB, ClickHouse, Redis, JupyterLab, Metabase Data Engineering & Orchestration - Python, Airflow, Kafka, DataHub Cloud & Infrastructure - AWS, K8s DevOps & CI/CD - Git, GitLab CI, DBS, Grafana, ELK, Prometheus, Docker, Docker Compose Why join us? Shape the future of a data business at the forefront of global payments insights A chance to work with a vibrant, friendly
Kubernetes). Experience with Infrastructure as Code (e.g., Terraform, CloudFormation). Experience in deploying and managing LLM-powered features in production environments. Bonus: experience with monitoring tools (e.g., Prometheus, Grafana), agent orchestration, or legaltech domain knowledge. Working for Opus 2 Opus 2 is a global leader in legal software and services, trusted partner of the world's leading legal teams. All
etc.) Database administration Infrastructure provisioning Process automation Respond to change requests Skills & Experience Oracle DB Docker (with Docker Swarm) Elastic Stack TypeScript/React/Node Go Prometheus/Grafana ESRI Maps Ansible Windows & Linux Jenkins Automation skills: Automation is a key skill domain for this role. Specific automation skills are: Continuous Integration - Skilled in the tooling and principles of
Kubernetes Service (AKS), Azure Synapse Analytics, or Azure Cognitive Services Azure certifications, such as Azure Solutions Architect Expert or DevOps Engineer Expert Experience with infrastructure monitoring tools like Prometheus, Grafana, or Azure Monitor at scale Background in implementing disaster recovery and high-availability solutions for critical systems Qualifications Bachelor's or Master's degree in Computer Science, Information Technology, or
cloud architecture IoT 'smart' edge devices (using NVIDIA AI chips) Linux-based embedded OS on our Edge devices Continuous Integration and Delivery using Jenkins, SonarQube Terraform for infrastructure management Grafana, Elasticsearch, Kibana & New Relic for metrics, logs and monitoring In the company we also use: VueJS, MySQL, Spring Boot, Apache Camel, AWS Redshift, AWS SageMaker, Pentaho, Balena, Serverless functions Winnow
London, South East, England, United Kingdom Hybrid / WFH Options
Lorien
to work independently or lead a small team Nice to Have: Experience with TYK API Gateway Exposure to microservices and event-driven architectures Familiarity with observability tools (e.g., Prometheus, Grafana) Carbon60, Lorien & SRG - The Impellam Group STEM Portfolio are acting as an Employment Business in relation to this vacancy.
London, South East, England, United Kingdom Hybrid / WFH Options
Harnham - Data & Analytics Recruitment
and cost optimisation Nice to Have Experience with ML tooling (MLflow, Kubeflow) Knowledge of FastAPI, Databricks, or Snowflake Exposure to SRE practices or cloud security certifications Familiarity with Prometheus, Grafana, or Datadog Interested? If you want to be part of a world-class AI team at an early stage, where your infrastructure decisions will directly shape the company's success
City of London, London, United Kingdom Hybrid / WFH Options
Searchability NS&D
Kubernetes, Docker, Helm Proficient in Terraform, CI/CD Pipelines (Drone/GitLab) Excellent understanding of Kafka internals, stream processing, and secure Kafka deployments Strong experience across monitoring (Prometheus, Grafana, CloudWatch) Knowledge of security hardening, IAM, WAF, Shield, Vault Working knowledge of Agile, Infrastructure-as-Code, and DevSecOps practices UK*C or Enhanced DV (eDV) Clearance is a must To