Reading, Berkshire, South East, United Kingdom Hybrid / WFH Options
Ignite Digital Search Ltd
Scripting expertise (Python, Bash, PowerShell) Highly Valued: Experience in regulated industries (healthcare, financial services, life sciences) AWS cost management and FinOps experience Monitoring tools expertise (CloudWatch, Datadog, New Relic, Prometheus) Security and compliance framework knowledge Experience with observability and APM solutions Why This Opportunity Stands Out: Real Impact - Your work directly improves healthcare outcomes Growth Trajectory - Join a scaling company More ❯
relevant tools. Security Best Practices: IAM, MFA, data encryption, firewall configurations. Programming/Scripting: Python, Terraform, or similar languages. Event-Driven Architectures: Kafka. Monitoring and Logging: Datadog, ELK Stack, Prometheus, etc. Experience in agile methodologies and DevOps practices. Location: Hybrid. Office located in London. (Hayes area). Office presence required: Yes. Frequency: 2-3 times a week at the office. More ❯
able to translate complexity into clarity Experience with Terraform, Helm, or GitOps tooling Familiarity with front-end technologies such as React and TypeScript Exposure to GraphQL, observability stacks (e.g., Prometheus, OpenTelemetry), or large-scale data platforms Prior work in regulated industries (BFSI, telecom, public sector) To succeed in this role, you'll bring more than just technical knowledge. You'll More ❯
code tools (e.g., Terraform , Helm , GitLab CI ) Solid understanding of cloud platforms (preferably AWS or Azure) Strong scripting abilities (e.g., Bash, Python, or Go) Familiarity with monitoring tools (e.g., Prometheus, Grafana) and security controls Nice to Have: Experience with service mesh technologies (Istio, Linkerd) Knowledge of networking fundamentals and zero trust architectures Prior involvement in large-scale infrastructure migrations Contract More ❯
Edinburgh, Midlothian, Scotland, United Kingdom Hybrid / WFH Options
McGregor Boyall
Hat Satellite Automate OS and application deployment using Ansible and Infrastructure as Code (IaC) principles Support Oracle 19c on Oracle Linux with KVM and CommVault integration Maintain observability stacks (Prometheus, Grafana, InfluxDB, Nagios, Fluentd, Elasticsearch/OpenSearch) Implement and monitor SLIs/SLOs; support disaster recovery testing Manage Docker containers and work in Kubernetes/OpenShift environments Configure and maintain More ❯
Burke, Virginia, United States Hybrid / WFH Options
ALTA IT Services
AWS GovCloud or Azure Government. Preferred Qualifications: • Elastic Certified Engineer or Elastic Certified Analyst. • Experience with Elasticsearch Service (Elastic Cloud). • Familiarity with other observability tools (e.g., Grafana, Splunk, Prometheus). • Experience with NIST RMF, DoD 8570 compliance, or CDM initiatives. • Prior experience supporting DoD, IC, or civilian agencies. More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Pathmere Partners Limited
Bash) Financial services or FinTech background is a plus but not essential Tech Stack Includes: Microsoft Azure Terraform, Bicep, ARM templates Docker, Kubernetes (AKS) Azure DevOps, GitHub Actions Helm, Prometheus, Grafana, App Insights PowerShell, Bash Benefits Competitive base salary (£90,000£110,000) Annual performance bonus Private medical insurance Pension scheme and flexible benefits Clear career path to Head of More ❯
healing systems etc.) Database administration Infrastructure provisioning Process automation Respond to change requests Skills & Experience Oracle DB Docker (with Docker Swarm) Elastic Stack Typescript/React/Node Go Prometheus/Grafana ESRI Maps Ansible Windows & Linux Jenkins Automation skills: Automation is a key skill domain for this role. Specific automation skills are: Continuous Integration - Skilled in the tooling and More ❯
Leeds, West Yorkshire, Yorkshire, United Kingdom Hybrid / WFH Options
Netcompany UK Limited
Azure Kubernetes Service (AKS), Azure Synapse Analytics, or Azure Cognitive Services Azure certifications, such as Azure Solutions Architect Expert or DevOps Engineer Expert Experience with infrastructure monitoring tools like Prometheus, Grafana, or Azure Monitor at scale Background in implementing disaster recovery and high-availability solutions for critical systems Qualifications Bachelor's or Master's degree in Computer Science, Information Technology More ❯
Cambourne, Cambridgeshire, United Kingdom Hybrid / WFH Options
Remotestar
building and maintaining highly reliable infrastructure and services. Expertise in incident management, including incident response, resolution, and post-mortem analysis. Proficiency in monitoring, alerting, and observability tools such as Prometheus, Grafana, ELK stack or Datadog. Experience with cloud platforms such as AWS, Azure, or GCP, including infrastructure as code tools like Terraform or CloudFormation. Strong scripting and automation skills, with More ❯
At least 5 years of professional experience, some of which should have focus on Observability. Excellent knowledge and hands-on experience with monitoring, logging, and tracing tools such as Prometheus, VictoriaMetrics, Grafana, Datadog, New Relic, OpenTelemetry, ELK Stack, or similar. Experience with high volume data storage (Structured and unstructured). A strong technical background, with current capabilities and willingness to More ❯
Gloucester, Gloucestershire, United Kingdom Hybrid / WFH Options
Navtech, Inc
and Git for version control. Scripting & Troubleshooting: Strong scripting skills (Python/Bash) for automation and ability to analyze logs and monitor performance using tools like AWS Cloudwatch, Datadog, Prometheus, Grafana, or pgBadger. Solid understanding of DevOps practices, including CI/CD pipelines (e.g., GitLab CI, Cloudbees, Jenkins, GitHub Actions), containerization with Docker, and monitoring/logging tools. Demonstrated experience More ❯
Cardiff, South Glamorgan, United Kingdom Hybrid / WFH Options
Navtech, Inc
and Git for version control. Scripting & Troubleshooting: Strong scripting skills (Python/Bash) for automation and ability to analyze logs and monitor performance using tools like AWS Cloudwatch, Datadog, Prometheus, Grafana, or pgBadger. Solid understanding of DevOps practices, including CI/CD pipelines (e.g., GitLab CI, Cloudbees, Jenkins, GitHub Actions), containerization with Docker, and monitoring/logging tools. Demonstrated experience More ❯
in managing complex infrastructure, CI/CD pipelines, and cloud networking Demonstrated ability to manage application observability, triage and monitoring Technical Competencies (Desirable): Familiarity with Auth0, AWS Cognito, Helm, Prometheus/Grafana, OpenTelemetry or Honeycomb Experience with CI/CD pipelines for containerised and serverless environments Knowledge of additional cloud platforms such as GCP or Azure Benefits Market-leading salary More ❯
applications and infrastructure Other duties as needed About You 5+ years' experience in Site Reliability Engineer roles Expert+ level Linux administration, scripting, and troubleshooting Demonstrable knowledge of Observability tools (Prometheus/Grafana, New Relic, Splunk, DataDog) Comprehensive experience with AWS (Amazon Web Services) and its core capabilities (VPC, EC2, ECS, Route53, Fargate, ALB/NLB distributions, etc) Extensive experience with More ❯
/Unix systems, SQL, and programming languages such as C++, Java or Python. Strong understanding of distributed systems and low-latency architectures Hands-on experience with observability stacks (e.g., Prometheus, Grafana, Splunk, Geneos, OpenTelemetry) and infrastructure automation (e.g., Ansible, Terraform, CI/CD pipelines) Strong understanding of the trade lifecycle, market data, and fixed income products, FX or algorithmic trading More ❯
with automation, IaC, and CI/CD principles. Understand Network concepts, Infrastructure, and common protocols. Able to write basic scripts for automation Build dashboards in Grafana and understanding of Prometheus and PromQL. Knowledge of SDLC and experience integrating solutions into CI pipelines Experience with cloud (AWS, GCP) is beneficial, but not essential. Able to self-manage Jira tickets and provide More ❯
Databases - Postgres, MariaDB, MongoDB, ClickHouse, Redis, JupyterLab, Metabase Data Engineering & Orchestration - Python, Airflow, Kafka, DataHub Cloud & Infrastructure - AWS, K8s DevOps & CI/CD - Git, GitLab CI, DBS, Grafana, ELK, Prometheus, Docker, Docker Compose Why join us? Shape the future of a data business at the forefront of global payments insights A chance to work with a vibrant, friendly team in More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Lorien
ability to work independently or lead a small team Nice to Have: Experience with TYK API Gateway Exposure to microservices and event-driven architectures Familiarity with observability tools (e.g., Prometheus, Grafana) Carbon60, Lorien & SRG - The Impellam Group STEM Portfolio are acting as an Employment Business in relation to this vacancy. More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Searchability NS&D
with Kubernetes, Docker, Helm Proficient in Terraform, CI/CD Pipelines (Drone/GitLab) Excellent understanding of Kafka internals, stream processing, and secure Kafka deployments Strong experience across monitoring (Prometheus, Grafana, CloudWatch) Knowledge of security hardening, IAM, WAF, Shield, Vault Working knowledge of Agile, Infrastructure-as-Code, and DevSecOps practices UK*C or Enhanced DV (eDV) Clearance is a must More ❯
with Kubernetes, Docker, Helm Proficient in Terraform, CI/CD Pipelines (Drone/GitLab) Excellent understanding of Kafka internals, stream processing, and secure Kafka deployments Strong experience across monitoring (Prometheus, Grafana, CloudWatch) Knowledge of security hardening, IAM, WAF, Shield, Vault Working knowledge of Agile, Infrastructure-as-Code, and DevSecOps practices UK*C or Enhanced DV (eDV) Clearance is a must More ❯
with ML lifecycle tools, model monitoring, and versioning Exposure to tools like KServe, Ray Serve, Triton, or vLLM is a big plus Bonus Points Experience with observability frameworks like Prometheus or OpenTelemetry Knowledge of ML libraries: TensorFlow, PyTorch, HuggingFace Exposure to Azure or GCP Passion for financial services Qualifications Degree in Computer Science, Engineering, Data Science, or similar What We More ❯
AWS Control Tower, GCP Resource Manager, etc. Network - AWS Transit Gateway, GCP Shared VPC, AWS Route53, GCP Cloud DNS, etc. Observability - AWS OpenSearch, GCP Monitoring/Traces, OpenTelemetry, Grafana, Prometheus, etc. Automation Prowess: Hands-on experience with modern Infrastructure as Code (IaC) automation tools and frameworks (Terraform, Jenkins, Ansible, etc.). Software Development Acumen: A software development background is highly More ❯
Annapolis Junction, Maryland, United States Hybrid / WFH Options
Codescratch LLC
Services (AWS) Experience with asynchronous messaging systems (RabbitMQ, Apache Kafka, etc.) Experience creating and integrating with remote services via HTTP, Thrift, or gRPC Experience monitoring application performance with metrics (Prometheus, InfluxDB, Grafana) and logs with ELK Stack (ElsticSearch, Logstash, Kibana) Salary Range Pay range $165,000 - $205,000 . (Plus Benefits) The pay range for this job level is a More ❯
AWS Control Tower, GCP Resource Manager, etc. Network - AWS Transit Gateway, GCP Shared VPC, AWS Route53, GCP Cloud DNS, etc. Observability - AWS OpenSearch, GCP Monitoring/Traces, OpenTelemetry, Grafana, Prometheus, etc. Automation Prowess: Hands-on experience with modern Infrastructure as Code (IaC) automation tools and frameworks (Terraform, Jenkins, Ansible, etc.). Software Development Acumen: A software development background is highly More ❯