london (city of london), south east england, United Kingdom Hybrid / WFH Options
Future Talent Group
culture by leading knowledge-sharing sessions and supporting issue resolution. Skills Strong grounding in SRE principles and operational best practices. Proficient with observability tools (Prometheus, Grafana, OTEL, Cloudwatch) and telemetry pipelines. Solid programming skills in Python and/or Go; Java experience a plus. Hands-on AWS expertise (EKS, SQS More ❯
with Docker and container orchestration platforms like Kubernetes or Azure AKS. Monitoring & Alerting: Ability to implement and maintain monitoring, logging, and alerting solutions (e.g., Prometheus, Grafana, Azure Monitor). DevSecOps Practices & Toolchains: Understanding of secure software development lifecycle (SSDLC) and toolsets that integrate security into DevOps (e.g., Snyk, Aqua, SonarQube More ❯
IAM, MFA, data encryption, firewall configurations. Programming/Scripting: Python, Terraform, or similar languages. Event-Driven Architectures: Kafka. Monitoring and Logging: Datadog, ELK Stack, Prometheus, etc. Experience in agile methodologies and DevOps practices. Location: Hybrid. Office located in London. (Hayes area). Office presence required: Yes. Frequency: 2-3 times More ❯
london (city of london), south east england, United Kingdom Hybrid / WFH Options
Harrington Starr
with Python. Solid understanding of containerisation concepts and how they support scalability, isolation, and portability in modern application deployment. Familiarity with monitoring stacks (Grafana, Prometheus, etc.) Working knowledge of CI/CD pipelines and Git-based workflows. Exposure to Terraform and Infrastructure as Code principles. Ready to Take the Next More ❯
london (city of london), south east england, United Kingdom Hybrid / WFH Options
Fruition Group
delivery. Lead deployment strategies and ensure smooth feature rollouts with minimal downtime. Define and manage monitoring, logging, and telemetry using tools like AWS Cloudwatch, Prometheus, and Datadog. Lead incident response and production troubleshooting with a proactive and preventative mindset. Drive automation initiatives with tools like GitlabCI, Terraform/OpenTofu, Ansible More ❯
london (city of london), south east england, United Kingdom
Ncounter Technology Recruitment
and trading opportunities. Experience - 8+ years in Python (or Golang) in a DevOps or SRE capacity. Strong Linux experience Understanding of Kubernetes, Public Cloud, Prometheus, Grafana, Telemetry and general Observability Experience with Gitlab, Bitbucket and CI (GitHub/CI/Bamboo) Willingness to engage in technical discussion and commit to More ❯
Architecture using AWS services (SNS, SQS, EventBridge). Knowledge of GraphQL, WebSockets, or real-time data streaming. Exposure to DevOps and observability practices (e.g., Prometheus, Datadog, AWS CloudWatch, OpenTelemetry). Prior experience in leading distributed engineering teams. Carbon60, Lorien & SRG - The Impellam Group STEM Portfolio are acting as an Employment More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Sanderson Recruitment
root cause analysis programming experience Kubernetes and Docker Deploy and release services experience Experience with Greenfield projects ideally 6+ years relevant experience Grafana/Prometheus ideal Strong communication skills with the ability to proactively engage with a wide range of stakeholders If this sounds of interest to you, please ring More ❯
london (city of london), south east england, United Kingdom Hybrid / WFH Options
Tenth Revolution Group
experience of 5+ years DevOps expertise of 5+ years GenAI pilots (CICD pipelines, install LLM) LLM applications, Langchain, Conda Kubernetes CICD Pipeline build experience Prometheus To find out more apply with job post or contact o.king@tenthrevolution.com More ❯
tolerance Develop and maintain backup, disaster recovery, and data security strategies Collaborate with teams to align database solutions with business goals Monitor performance with Prometheus and Grafana , addressing issues proactively Implement best practices for data modelling and processing Guide and mentor junior developers in ClickHouse best practices Stay up-to … and ETL Experience with DevOps tools (Jenkins, Ansible) and CI/CD pipelines Solid understanding of data security , backup , and disaster recovery Familiarity with Prometheus , Grafana , and C++ for customisation Strong problem-solving , analytical , and communication skills Ability to collaborate effectively with cross-functional teams What's on Offer This More ❯