City of London, London, United Kingdom Hybrid / WFH Options
Searchability NS&D
Kubernetes, Docker, Helm Proficient in Terraform, CI/CD Pipelines (Drone/GitLab) Excellent understanding of Kafka internals, stream processing, and secure Kafka deployments Strong experience across monitoring (Prometheus, Grafana, CloudWatch) Knowledge of security hardening, IAM, WAF, Shield, Vault Working knowledge of Agile, Infrastructure-as-Code, and DevSecOps practices UK*C or Enhanced DV (eDV) Clearance is a must To More ❯
improvement Take pride in building and operating scalable, reliable, secure systems Are comfortable with ambiguity and rapid change Preferred skills and experience: Familiar with monitoring tools such as Prometheus, Grafana, or similar 5+ years building core infrastructure Experience running inference clusters at scale Experience operating orchestration systems such as Kubernetes at scale Benefits & perks (UK full-time employees): Generous PTO More ❯
IP, VLANs, routing). You will bring some of these skills, but more importantly you're interested in learning these things: • Hardware & physical infrastructure. • Data-driven monitoring and observability (Grafana, InfluxDB, Prometheus, Elastic). • Exposure to configuration management (Puppet, Ansible, Terraform). • Some exposure to scripting (Bash, Python). • Supporting CI/CD delivery pipelines (GitLab, GitHub). 25 days More ❯
Office365 platform and applications Understanding and experience with the administration of SQL databases Experience with task automation, leveraging Python, Bash and/or PowerShell Experience with monitoring tools - PRTG, Grafana, OpenSearch, Prometheus. Beneficial Experience: Hands-on experience with Amazon Web Services Hands-on experience with Kubernetes/containerised environments Experience with No-Code tools such as Retool or Appsmith Experience More ❯
throughput environment preferably within financial services or trading. Deep understanding of Linux systems, networking, and distributed systems at scale. Strong experience with infrastructure automation (Ansible, Terraform, etc.), monitoring (Prometheus, Grafana, ELK), and CI/CD pipelines. Familiarity with market data feeds, co-location, and low-latency network optimisation is highly desirable. Experience with hybrid cloud/on-prem environments and More ❯
and scripting languages such as Python, Go, or Bash. Experience with Kubernetes security, including workload isolation, RBAC, and network policies, containerisation, orchestration, and Kubernetes observability tools (e.g., Falco, Prometheus, Grafana). Experience with infrastructure-as-code and configuration management tools (e.g., Terraform, Helm, ArgoCD). United Kingdom Security Vetting Developed Vetting (DV) clearance. Preferred qualifications: Certifications in Security (e.g., GSEC More ❯
GitLab , GitHub Actions, or CircleCI Strong testing capabilities using JUnit , RestAssured , or similar frameworks Proactive with monitoring, observability, and system health Desirable Skills: Exposure to monitoring platforms like Datadog, Grafana, Prometheus , or PagerDuty Familiarity with Python scripting Experience with Kubernetes and deployment tools such as Helm Why Join H&B Tech? Help define the future of digital health & wellness in More ❯
welcome Proficiency in testing frameworks like JUnit and RestAssured A passion for monitoring, observability , and maintaining resilient systems Desirable Skills: Experience with monitoring and alerting tools like Datadog, Prometheus, Grafana, or PagerDuty Exposure to Python scripting Familiarity with deployment platforms such as Kubernetes and tools like Helm Why Join H&B Tech? Be part of a fast-moving, forward-thinking More ❯
resource allocation, network and/or internals. Experience working with cloud solutions (GCP or AWS). Deep understanding and demonstrable experience with modern monitoring tools such as Prometheus, Datadog, Grafana, Telegraf Experience with infrastructure as code tools. Experience with complex Terraform deployments is a plus. Solid background with configuration management tools. Experience with Saltstack is a plus. Experience with using More ❯
Perform detailed root cause analysis of defects and manage fix/retest cycles. Reporting:Produce regular Test Reports to communicate delivery health to key stakeholders and automated reporting to Grafana, JIRA and internal delivery reporting systems. Risk Mitigation:Mitigate roll-out risk through runbook/pipeline reviews Qualifications Strong Java developer with exposure to BDD/TDD based development processes More ❯
cloud architecture IoT 'smart' edge devices (using nVidia AI chips) Linux-based embedded OS on our Edge devices Continuous Integration and Delivery using Jenkins, SonarQube Terraform for infrastructure management Grafana, Elasticsearch, Kibana & New Relic for metrics, logs and monitoring In the company we also use: VueJS, MySQL, Spring Boot, Apache Camel, AWS Redshift, AWS SageMaker, Pentaho, Balena, Serverless functions Winnow More ❯
services including AWS CDK Knowledge of DevOps tools - CloudFormation/Terraform, Jenkins Strong understanding of containers - Docker, Kubernetes Understanding of network protocols and security Experience with Monitoring tools - Prometheus, Grafana Scripting experience - Preferably Python or Bash Please note you will need to travel to their office within central London a minimum of 3 days per week. By submitting your details More ❯
improving CI/CD pipelines with a strong focus on automation and security. Key Skills Required: CI/CD automation, Jenkins, Ansible, Docker, Kubernetes, Terraform Unix/shell scripting, Grafana, Azure Strong understanding of software delivery and Agile methodologies Desirable: Experience in the Power Transmission & Distribution domain Eligibility: Candidates must hold UK or NATO nationality . This role offers a More ❯
Preferred: Exposure to Infrastructure-as-Code tools (Terraform or CloudFormation) Experience working with CI/CD pipelines and Git workflows Knowledge of logging and monitoring tools (e.g., CloudWatch, ELK, Grafana) Exposure to container technologies like Docker Interest in financial markets or experience with trading system infrastructure If you are interested and looking to be part of a high-impact technology More ❯
/Jenkins/Golang Provisioning software/frameworks (Elasticsearch/Spark/Hadoop/PostgreSQL) Infrastructure Management - CasC, IasC (Ansible, Terraform, Packer) Log and metric aggregation with Fluentd, Prometheus, Grafana, Alertmanager Public Cloud, primarily GCP, but also AWS and Azure More ❯
Exposure to low-code platforms (e.g., Retool) for rapid application development. Experience in DevOps practices, including infrastructure-as-code (IaC), monitoring, alerting, and incident management. Familiarity with observability tools (Grafana, Prometheus) and APM tools (New Relic, Datadog). Knowledge of microservices architecture, event-driven design, and scalability best practices. Experience implementing data compliance standards (GDPR, ISO 27001). Find.co is More ❯
to See: Experience with AI tools such as Co-pilot, ChatGPT and Gemini. Project leadership experience Working experience in an Agile environment Experience with Log aggregation toolsets (e.g. SPLUNK, Grafana) Knowledge of Python Bloomberg is an equal opportunity employer and we value diversity at our company. We do not discriminate on the basis of age, ancestry, color, gender identity or More ❯
as Azure, AWS or GCP. Experience with Kubernetes is desirable. You have a high degree of experience in observing the performance and health of applications via tools such as Grafana, Prometheus, Data Dog, Sentry, etc. You have a strong desire and are an advocate for performant applications. You have a flair for simplicity when problem solving. Excellent communication skills, with More ❯
a bonus: Java experience Python experience Ruby experience Big data technologies: Spark, Trino, Kafka Financial Markets experience SQL: Postgres, Oracle Cloud-native deployments: AWS, Docker, Kubernetes Observability: Splunk, Prometheus, Grafana For more information about DRW's processing activities and our use of job applicants' data, please view our Privacy Notice at . California residents, please review the California Privacy Notice More ❯
we continuously improve Monitoring and Optimization: Monitor AWS infrastructure performance, troubleshoot issues, and implement optimizations for cost and performance. Implement logging, monitoring, and alerting solutions using AWS CloudWatch, Prometheus, Grafana, and other monitoring tools. Conduct periodic reviews of infrastructure to identify opportunities for optimization and cost reduction. Security and Compliance: Implement security best practices and compliance standards in AWS environments. More ❯
using programming languages. Python or Java is preferred. Full understanding of the end-to-end trade lifecycle (FX knowledge preferred) Experience using monitoring tools such as Splunk, Prometheus or Grafana etc. Expertise on containerization alongside tools like Docker, Kubernetes, and CI/CD. Exposure to Linux/Unix and SQL This is a great opportunity for a Production Engineer to More ❯
applications and optimizing fleet utilization - Strong understanding of network fundamentals (DNS, DHCP, TCP/IP, routing, load balancing, load shedding) and experience with monitoring frameworks (such as CloudWatch, Datadog, Grafana, Elastic or similar) - Experience scripting operating system tasks in Bash, Python, etc. and with Infrastructure as Code, (such as CDK, CloudFormation, Puppet, Chef, Ansible, or similar) - Experience operating services in More ❯
Python, Go, or similar languages for automation and scripting. Expert-level knowledge of AWS Networking, TLS, and security best practices. Experience with container orchestration (Kubernetes, EKS) and observability tools (Grafana, ELK). A passion for innovation, problem-solving, and delivering high-impact solutions. Experience leading/managing junior engineers Significant experience with Control Tower and deploying landing zones. For this More ❯
various methods such as unit, integration, contract and E2E testing. You have a high degree of experience in observing the performance and health of applications via tools such as Grafana, Prometheus, Data Dog, Sentry, etc. You have a strong desire and are an advocate for performant applications. Proactive in solving problems simply and effectively, with an eye for pragmatic solutions. More ❯
to read code and trace failures in one or more of the following application languages Java GoLang React .NET Python Solid understanding of modern observability tooling (e.g., Datadog, Loki, Grafana) Comfortable working on a shared on-call rotation Enhanced leave - 38 days inclusive of 8 UK Public Holidays Private Health Care including family cover Life Assurance - 5x salary Flexible working More ❯