Cheltenham, Gloucestershire, United Kingdom Hybrid / WFH Options
CACI Ltd
Management and Automation Proficiency with containers and container orchestration including Helm, Docker and Kubernetes Observability experience of designing and building monitoring and logging tools such as CloudWatch, ELK, and Grafana Basic programming skills in at least one language Experience with CI/CD pipeline development and management Due to the industries we work in, we require the successful candidate to More ❯
MySQL (Aurora DB), Redis (ElastiCache), MongoDB (AWS DocumentDB) Cloud & DevOps: AWS (20+ services), Kubernetes (EKS), Docker, Infrastructure as Code(CloudFormation, Terraform), CI/CD (Jenkins,GitHub Actions), Observability(AWS, Grafana) Development tools: GitHub, Jira, Notion, ChatGPT,Gemini,LangChain, AI-native IDE's (Cursor, JetBrains), LLM-powered internal tools. WHAT WE OFFER YOU A front-row seat in a fast scaling More ❯
Unix systems, SQL, and programming languages such as C++, Java or Python. Strong understanding of distributed systems and low-latency architectures Hands-on experience with observability stacks (e.g., Prometheus, Grafana, Splunk, Geneos, OpenTelemetry) and infrastructure automation (e.g., Ansible, Terraform, CI/CD pipelines) Strong understanding of the trade lifecycle, market data, and fixed income products, FX or algorithmic trading experience More ❯
technologies such as Oracle SQL, Mongo, Postgres o Know your way around Linux and Windows command lines, e.g. Bash and PowerShell o Monitoring large systems using technologies such as Grafana, Prometheus, ELK, Splunk o Experience of working in Agile teams, and the tooling that supports it, e.g. Atlassian o Diagnosing and troubleshooting application issues resulting in service outages o Troubleshooting More ❯
Accounts - AWS Control Tower, GCP Resource Manager, etc. Network - AWS Transit Gateway, GCP Shared VPC, AWS Route53, GCP Cloud DNS, etc. Observability - AWS OpenSearch, GCP Monitoring/Traces, OpenTelemetry, Grafana, Prometheus, etc. Automation Prowess: Hands-on experience with modern Infrastructure as Code (IaC) automation tools and frameworks (Terraform, Jenkins, Ansible, etc.). Software Development Acumen: A software development background is More ❯
Kubernetes). Experience with Infrastructure as Code (e.g., Terraform, CloudFormation). Experience in deploying and managing LLM-powered features in production environments. Bonus : experience with monitoring tools (e.g., Prometheus, Grafana), agent orchestration, or legaltech domain knowledge. Working for Opus 2 Opus 2 is a global leader in legal software and services, trusted partner of the worlds leading legal teams. All More ❯
Unix systems, SQL, and programming languages such as C++, Java or Python. Strong understanding of distributed systems and low-latency architectures Hands-on experience with observability stacks (e.g., Prometheus, Grafana, Splunk, Geneos, OpenTelemetry) and infrastructure automation (e.g., Ansible, Terraform, CI/CD pipelines) Strong understanding of the trade lifecycle, market data, and fixed income products, FX or algorithmic trading experience More ❯
years of professional experience, some of which should have focus on Observability. Excellent knowledge and hands-on experience with monitoring, logging, and tracing tools such as Prometheus, VictoriaMetrics, Grafana, Datadog, New Relic, OpenTelemetry, ELK Stack, or similar. Experience with high volume data storage (Structured and unstructured). A strong technical background, with current capabilities and willingness to get hands on More ❯
Accounts - AWS Control Tower, GCP Resource Manager, etc. Network - AWS Transit Gateway, GCP Shared VPC, AWS Route53, GCP Cloud DNS, etc. Observability - AWS OpenSearch, GCP Monitoring/Traces, OpenTelemetry, Grafana, Prometheus, etc. Automation Prowess: Hands-on experience with modern Infrastructure as Code (IaC) automation tools and frameworks (Terraform, Jenkins, Ansible, etc.). Software Development Acumen: A software development background is More ❯
Node, RabbitMQ Databases - Postgres, MariaDB, MongoDB, ClickHouse, Redis, JupyterLab, Metabase Data Engineering & Orchestration - Python, Airflow, Kafka, DataHub Cloud & Infrastructure - AWS, K8s DevOps & CI/CD - Git, GitLab CI, DBS, Grafana, ELK, Prometheus, Docker, Docker Compose Why join us? Shape the future of a data business at the forefront of global payments insights A chance to work with a vibrant, friendly More ❯
to work independently or lead a small team Nice to Have: Experience with TYK API Gateway Exposure to microservices and event-driven architectures Familiarity with observability tools (e.g., Prometheus, Grafana) Carbon60, Lorien & SRG - The Impellam Group STEM Portfolio are acting as an Employment Business in relation to this vacancy. More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Lorien
to work independently or lead a small team Nice to Have: Experience with TYK API Gateway Exposure to microservices and event-driven architectures Familiarity with observability tools (e.g., Prometheus, Grafana) Carbon60, Lorien & SRG - The Impellam Group STEM Portfolio are acting as an Employment Business in relation to this vacancy. More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
BOSS Professional Services LTD
the customer base and product offering. For the SRE Engineer role we are seeking: Technology stack: Kubernetes, MySQL, PostgreSQL, PHP, Python, Docker, AWS Lambda, AWS, Redis, ELK, monitoring: Prometheus, Grafana or Loki You have previous experience of working within SRE capacity or experience in DevOps and interest in moving into that field. Be responsible for the production environment. Improve the More ❯
Kubernetes, Docker, Helm Proficient in Terraform, CI/CD Pipelines (Drone/GitLab) Excellent understanding of Kafka internals, stream processing, and secure Kafka deployments Strong experience across monitoring (Prometheus, Grafana, CloudWatch) Knowledge of security hardening, IAM, WAF, Shield, Vault Working knowledge of Agile, Infrastructure-as-Code, and DevSecOps practices UK*C or Enhanced DV (eDV) Clearance is a must To More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Searchability NS&D
Kubernetes, Docker, Helm Proficient in Terraform, CI/CD Pipelines (Drone/GitLab) Excellent understanding of Kafka internals, stream processing, and secure Kafka deployments Strong experience across monitoring (Prometheus, Grafana, CloudWatch) Knowledge of security hardening, IAM, WAF, Shield, Vault Working knowledge of Agile, Infrastructure-as-Code, and DevSecOps practices UK*C or Enhanced DV (eDV) Clearance is a must To More ❯
Cambourne, Cambridgeshire, United Kingdom Hybrid / WFH Options
Remotestar
and maintaining highly reliable infrastructure and services. Expertise in incident management, including incident response, resolution, and post-mortem analysis. Proficiency in monitoring, alerting, and observability tools such as Prometheus, Grafana, ELK stack or Datadog. Experience with cloud platforms such as AWS, Azure, or GCP, including infrastructure as code tools like Terraform or CloudFormation. Strong scripting and automation skills, with proficiency More ❯
Annapolis Junction, Maryland, United States Hybrid / WFH Options
Codescratch LLC
Experience with asynchronous messaging systems (RabbitMQ, Apache Kafka, etc.) Experience creating and integrating with remote services via HTTP, Thrift, or gRPC Experience monitoring application performance with metrics (Prometheus, InfluxDB, Grafana) and logs with ELK Stack (ElsticSearch, Logstash, Kibana) Salary Range Pay range $165,000 - $205,000 . (Plus Benefits) The pay range for this job level is a general estimated More ❯
you'll need: Comfortable with automation, IaC, and CI/CD principles. Understand Network concepts, Infrastructure, and common protocols. Able to write basic scripts for automation Build dashboards in Grafana and understanding of Prometheus and PromQL. Knowledge of SDLC and experience integrating solutions into CI pipelines Experience with cloud (AWS, GCP) is beneficial, but not essential. Able to self-manage More ❯
resource allocation, network and/or internals. Experience working with cloud solutions (GCP or AWS). Deep understanding and demonstrable experience with modern monitoring tools such as Prometheus, Datadog, Grafana, Telegraf Experience with infrastructure as code tools. Experience with complex Terraform deployments is a plus. Solid background with configuration management tools. Experience with Saltstack is a plus. Experience with using More ❯
Perform detailed root cause analysis of defects and manage fix/retest cycles. Reporting:Produce regular Test Reports to communicate delivery health to key stakeholders and automated reporting to Grafana, JIRA and internal delivery reporting systems. Risk Mitigation:Mitigate roll-out risk through runbook/pipeline reviews Qualifications Strong Java developer with exposure to BDD/TDD based development processes More ❯
Salford, Manchester, United Kingdom Hybrid / WFH Options
Lloyds Bank plc
error budgets, and incident response. Experience with infrastructure as code (e.g., Terraform, Deployment Manager) and CI/CD pipelines. Proficiency in monitoring, logging, and observability tools (e.g., Stackdriver, Prometheus, Grafana). Knowledge of Linux systems, networking, and cloud security best practices. It would be great if you also had Experience working in DevOps environments, with a focus on automation, scalability More ❯
technical solutions Bring practical knowledge of designing for high availability, fault tolerance, and performance at scale in production-grade systems Demonstrated experience managing operational resilience, including monitoring (e.g. Prometheus, Grafana), security (e.g. OAuth2, API rate-limiting), and high availability deployments. Exposure to hybrid and cloud-native architecture patterns, especially in regulated environments. Are confident debugging and resolving deep technical issues More ❯
technical solutions Bring practical knowledge of designing for high availability, fault tolerance, and performance at scale in production-grade systems Demonstrated experience managing operational resilience, including monitoring (e.g. Prometheus, Grafana), security (e.g. OAuth2, API rate-limiting), and high availability deployments. Exposure to hybrid and cloud-native architecture patterns, especially in regulated environments. Are confident debugging and resolving deep technical issues More ❯
cloud architecture IoT 'smart' edge devices (using nVidia AI chips) Linux-based embedded OS on our Edge devices Continuous Integration and Delivery using Jenkins, SonarQube Terraform for infrastructure management Grafana, Elasticsearch, Kibana & New Relic for metrics, logs and monitoring In the company we also use: VueJS, MySQL, Spring Boot, Apache Camel, AWS Redshift, AWS SageMaker, Pentaho, Balena, Serverless functions Winnow More ❯