london (city of london), south east england, united kingdom
Ncounter Technology Recruitment
and trading opportunities. Experience - 8+ years in Python (or Golang) in a DevOps or SRE capacity. Strong Linux experience Understanding of Kubernetes, Public Cloud, Prometheus, Grafana, Telemetry and general Observability Experience with Gitlab, Bitbucket and CI (GitHub/CI/Bamboo) Willingness to engage in technical discussion and commit to More ❯
of DevOps tools Kubernetes, Jenkins, Gitlab, Terraform, and more. -Optimising automation and performance. -Champion containerisation and high performance base images. -Elevate monitoring systems Zabbix, Prometheus, Thanos ensuring 24/7 operational excellence. -Secure infrastructure access management, balancing innovation with ironclad security. They're offering a career defining role in a More ❯
of DevOps tools Kubernetes, Jenkins, Gitlab, Terraform, and more. -Optimising automation and performance. -Champion containerisation and high performance base images. -Elevate monitoring systems Zabbix, Prometheus, Thanos ensuring 24/7 operational excellence. -Secure infrastructure access management, balancing innovation with ironclad security. They're offering a career defining role in a More ❯
as a software engineer. Over 5 years in data engineering and pipeline development in high-volume production environments. Experience with monitoring systems such as Prometheus, Grafana, Zabbix, or Datadog. Experience in fintech or trading industries. Strong object-oriented development skills and software engineering fundamentals. Hands-on experience with cloud data More ❯
complex technical information to diverse stakeholders. Strong presentation abilities to establish credibility with executives. Preferred Qualifications: Certifications in AWS, Kubernetes, or observability platforms (e.g., Prometheus Certified). Experience in a data-driven or SaaS environment. MBA or relevant leadership experience. This role is located in London and is a hybrid More ❯
london, south east england, united kingdom Hybrid / WFH Options
ITR Partners
complex technical information to diverse stakeholders. Strong presentation abilities to establish credibility with executives. Preferred Qualifications: Certifications in AWS, Kubernetes, or observability platforms (e.g., Prometheus Certified). Experience in a data-driven or SaaS environment. MBA or relevant leadership experience. This role is located in London and is a hybrid More ❯
systems design and share responsibility with them in diagnosing, resolving, and preventing production issues. What We Value Experience with monitoring systems using tools like Prometheus and writing health checks Interest in learning and managing technologies like Spark, Hadoop, Elasticsearch, and Cassandra Familiarity with deploying GPUs Moderate experience with TCP/ More ❯
Search, Discovery & Insights, Company Profiles, Workflow & Efficiency, and many more. Our stack Java 17/21, Spring Boot (MVC, JDBC, Security). Postgres, Docker, Prometheus, K8s, Elastic. Team Stream Development Lead, 2 BE, 1 FE, 1 SDET As a qualified expert, You will Help architect, design, and develop complex, large More ❯
recovery and failover setups. Conduct regular testing of disaster recovery plans. Monitoring and Troubleshooting Set up and manage monitoring tools like CloudWatch, Datadog, or Prometheus to track database performance and availability. Troubleshoot and resolve complex database issues in real time. Collaboration and Documentation Work closely with developers to optimise application More ❯
network configurations and deployments using infrastructure-as-code (IaC) tools e.g. Ansible, Terraform, or Python scripts. Monitoring and logging network performance using tools like Prometheus, Grafana, or ELK stack. Experience with developing and maintaining air gapped networks. Experience with Voice over IP (VoIP) technologies including SIP, RTP protocols, and implementation More ❯
CDP/LLDP) and network engineering, management, and operations. Experience with search and analytics engines/big data tools (OpenSearch, Kafka, Kibana, Telegraf, InfluxDB, Prometheus). Our Preferred Qualifications for this role: Basic understanding of AI and ML algorithms, including model training, testing, and deployment. Hands-on project experience in More ❯
Leeds, England, United Kingdom Hybrid / WFH Options
KPMG UK
GCP) Knowledge of Database systems and models. Ability to use wide variety of open-source technologies. Experience with logging/monitoring tools (DataDog, StackDriver, Prometheus etc), Knowledge of test automation frameworks. To discuss this or wider Technology roles with our recruitment team, all you need to do is apply, create More ❯
functional teams, including DevOps, Engineering, Service Reliability, and Service Delivery teams. Technical Expertise: In-depth knowledge of open-source and commercial observability tools (e.g., Prometheus, Grafana, NewRelic). Expertise in cloud environments (e.g., AWS, Azure) and infrastructure as code (IaC) tools like Terraform. Monitoring and Observability: Experience in creating and More ❯
based deployments. Administer cloud environments (AWS, GCP, Azure) to ensure scalability, security, and high availability for applications. Implement monitoring and alerting using AWS CloudWatch, Prometheus, or similar tools to proactively identify and resolve issues in production systems. Work with development teams to optimize application performance, provide support for troubleshooting, and … in scripting languages like Python, Bash, or Shell. Experience with Docker and Kubernetes for container orchestration and management. Familiarity with monitoring tools like CloudWatch, Prometheus, or Grafana, and best practices in security for cloud environments. Strong communication skills and experience working with development, operations, and security teams. Preferred Qualifications AWS More ❯
Terraform or CloudFormation, and manage resources for optimal performance. Monitor, troubleshoot, and resolve incidents, optimizing systems to ensure reliability and minimize downtime. Implement monitoring (Prometheus, Grafana, Datadog) and set up alerting systems to proactively address issues and ensure scalability. Work with DevOps, engineering, and security teams to improve application deployment … networking services. Proficiency in using Terraform, CloudFormation, Ansible, or similar tools for automating infrastructure. Strong experience in monitoring and incident response using tools like Prometheus, Grafana, and ELK Stack. Strong scripting skills in Python, Bash, Go, or Ruby for automating tasks and building custom tools. Experience with CI/CD More ❯
Warwick, Warwickshire, United Kingdom Hybrid / WFH Options
ICEO
in at least one programming language (Python, GoLang, C++, or Java). Solid experience with Terraform for IaC. Hands-on skills with observability tools (Prometheus, Grafana, ELK stack, OpenTelemetry) and logging pipelines (Kibana, Elasticsearch). Expertise in Docker and container orchestration using Kubernetes (preferably on GCP) and Helm. Familiarity with … autonomy to make your own choices and explore new ideas. Our tech stack & methodologies: Automation & IaC : Bash, Python, GoLang, Terraform Observability : Elasticsearch, Kibana, FluentD; Prometheus, Grafana; Jaeger, Grafana Tempo CI/CD : Bitbucket Pipelines, ArgoCD Containerization & Orchestration : Docker, Kubernetes, Helm Security : SOPS, Okta, TFsec, Trivy, Istio Stateful Services : PostgreSQL, TimescaleDB More ❯
With our innovative software-defined connectivity technology, FloLIVE delivers seamless connectivity management across boundaries. Our stack: Bitbucket, Jenkins, Kubernetes, Docker, PostgreSQL, MongoDB, ClickHouse, Elastic, Prometheus/Grafana, Zabbix, Kafka, Zookeeper, Ansible, and Terraform. As a qualified expert you will: Manage and maintain Linux-based systems, administer and optimize Kubernetes, and … Postgres/MySQL/Clickhouse. Experience with deploying frameworks of software bus using stream-processing such as Kafka. Experience with monitoring and observability tools: Prometheus/Grafana, ELK. Experience with repository manager, preferably Artifactory/Nexus. What's in it for You Reveal great tech solutions Join the team of More ❯
delivery, and deployment of applications. Collaborate with the development team to optimise pipeline efficiency and ensure code quality. Implement monitoring solutions using AWS CloudWatch, Prometheus, Grafana, or similar tools to ensure visibility into application performance, health, and security. Troubleshoot production issues and provide resolution. Ensure the security of cloud infrastructure … automating infrastructure tasks using Infrastructure as Code (IaC) tools like Terraform or AWS CloudFormation. Experience with monitoring and logging tools such as AWS CloudWatch, Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana). Join a rapidly expanding start-up where personal growth is a part of our DNA. Benefit from a More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Sanderson Recruitment
root cause analysis programming experience Kubernetes and Docker Deploy and release services experience Experience with Greenfield projects ideally 6+ years relevant experience Grafana/Prometheus ideal Strong communication skills with the ability to proactively engage with a wide range of stakeholders If this sounds of interest to you, please ring More ❯
Architecture using AWS services (SNS, SQS, EventBridge). Knowledge of GraphQL, WebSockets, or real-time data streaming. Exposure to DevOps and observability practices (e.g., Prometheus, Datadog, AWS CloudWatch, OpenTelemetry). Prior experience in leading distributed engineering teams. Carbon60, Lorien & SRG - The Impellam Group STEM Portfolio are acting as an Employment More ❯
driven architectures. Deep understanding of data processing, analytics, and real-time event streaming. Expertise in PostgreSQL, AWS and Kubernetes. Proficiency in monitoring tools like Prometheus, Grafana, and Kibana. Knowledge of security best practices, including OAuth, JWT, and data encryption. Fluent in English with strong communication and collaboration skills. Preferred Qualifications More ❯
verbal communication skills Ability to work well on a team as well as independently What will make you stand out: Experience using Splunk, Grafana, Prometheus and other observability tools Experience using kubernetes to deploy and maintain systems Experience using Jsonnet or other templating tools to render complex yaml/json More ❯
version, and manage infrastructure as code across multiple environments. GitHub Actions & OIDC – build and maintain automated CI/CD pipelines with secure authentication. Datadog, Prometheus or similar – implement logging, metrics, and alerting for robust observability – the interim CTO is keen to hear your recommendation(s) on tooling and implementation strategy. More ❯