experience building and deploying services with Java and Spring Boot. Comfort working in a cloud-native environment - Kubernetes (EKS), containers, scaling etc. An interest in observability, using tools like Prometheus and Grafana to keep services healthy and understand usage patterns. Familiarity with AWS services and how to integrate them into modern applications. A keen focus on quality and security, baking More ❯
Sheffield, England, United Kingdom Hybrid / WFH Options
Vallum Associates
Database Management Operations and automation workflows Troubleshoot automation issues across scripting, API s and containerized environments. Nice to have Exposure on enhancing observability with knowledge of tools such as Prometheus, Grafana, and OpenTelemetry. Advantageous to have enterprise tools knowledge (i.e., Control M, True sight, Guardium, Tenable Nessus, Delinea) Knowledge of Security and Software Development in a Highly regulated environment End More ❯
handsworth, yorkshire and the humber, united kingdom Hybrid / WFH Options
Vallum Associates
Database Management Operations and automation workflows Troubleshoot automation issues across scripting, API s and containerized environments. Nice to have Exposure on enhancing observability with knowledge of tools such as Prometheus, Grafana, and OpenTelemetry. Advantageous to have enterprise tools knowledge (i.e., Control M, True sight, Guardium, Tenable Nessus, Delinea) Knowledge of Security and Software Development in a Highly regulated environment End More ❯
Sheffield, South Yorkshire, Yorkshire, United Kingdom Hybrid / WFH Options
VANLOQ LIMITED
environments Required Skills: Proven experience in Python development & FastAPI Strong knowledge of PostgreSQL database administration Excellent problem-solving, debugging, and analytical skills Nice to Have: Exposure to observability tools ( Prometheus, Grafana, OpenTelemetry ) Experience with enterprise tools (Control M, True Sight, Guardium, Tenable Nessus, Delinea) Understanding of security and software development in highly regulated environments End-to-end experience with CI More ❯
Leeds, West Yorkshire, Yorkshire, United Kingdom Hybrid / WFH Options
Fruition Group
Working with developers and SREs to solve complex problems What we're looking for: Strong experience with AWS (EC2, ECS, Lambda, RDS etc.) Good knowledge of observability tools (Grafana, Prometheus, OpenTelemetry, Datadog, or similar) Background in software engineering (JavaScript/TypeScript & Node.js, although any language is fine) Experience with Infrastructure as Code (Terraform, CloudFormation, or similar) CI/CD pipelines More ❯
Lexington, Massachusetts, United States Hybrid / WFH Options
Raft
within DoD/Air Force AOC Weapon System and operating standards within cleared facilities (SIPR, IL6) - Familiarity with AWS and cloud technologies - Skill in operating observability tooling and alerting (Prometheus, Grafana, etc.) - Knowledge of Platform One Big Bang Clearance Requirements: Active Secret security clearance Work Type: Hybrid - Hanscom AFB, MA highly preferred (or local to Reston, VA or Hampton, VA More ❯
London, Bloomsbury, United Kingdom Hybrid / WFH Options
IntaPeople
CI, Jenkins, GitHub Actions, or AWS CodePipeline Support and train technical staff in upskilling necessary for ongoing operations Monitor and ensure system reliability, availability, and performance using tools likeCloudWatch, Prometheus, Icinga2, Grafana, and Datadog Automate deployment, scaling, and management of containerized applications using Docker and Kubernetes Desirable skills Travis CI Monitoring – Grafana, Icinga Prometheus Rabbit MQ/AMQP Working knowledge More ❯
Leeds, West Yorkshire, United Kingdom Hybrid / WFH Options
Corecom Consulting
influencing and negotiating in a highly regulated environment . Desirable skills include: RESTful design and API Gateway tools (Apigee or equivalent) Kubernetes/OpenShift Monitoring tools such as Grafana, Prometheus, Dynatrace CI/CD with GitLab or Artifactory Security practices (OWASP, JWT, certificates, encryption) Experience within IT support environments Benefits 25 days holiday + Bank Holidays , with the option to More ❯
Amazon EKS (Elastic Kubernetes Service) and Azure Kubernetes Service (AKS), ensuring their reliability, availability, and performance. Monitoring and Alerting: Monitor application performance and system health through observability tools (e.g., Prometheus, Grafana, ELK stack), proactively identifying and resolving issues to ensure high availability and rapid incident response. Security and IAM: Implement security best practices, managing Identity and Access Management (IAM) policies … with container orchestration technologies, particularly Kubernetes. Familiarity with version control systems (e.g., Git) and CI/CD pipelines for efficient code deployment. Knowledge of monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack) to ensure system observability. Strong experience with SQL databases and AWS DynamoDB, focusing on performance tuning and optimization. Proven ability to design and manage RESTful APIs, ensuring More ❯
Manchester, North West, United Kingdom Hybrid / WFH Options
Hays
Strong understanding of networking, virtualisation, and cloud security principles. Operate, maintain, and enhance the Azure Virtual Desktop (AVD) environment. Experience with monitoring and logging tools (e.g., Azure Monitor, CloudWatch, Prometheus). Expert in setting up and managing host pools, session hosts, user access, application layers, and FSLogix profiles. Strong knowledge of cloud architecture, design, and implementation principles and practices. Proficiency More ❯
Manchester, Lancashire, England, United Kingdom Hybrid / WFH Options
Hays Specialist Recruitment Limited
Strong understanding of networking, virtualisation, and cloud security principles. Operate, maintain, and enhance the Azure Virtual Desktop (AVD) environment. Experience with monitoring and logging tools (e.g., Azure Monitor, CloudWatch, Prometheus). Expert in setting up and managing host pools, session hosts, user access, application layers, and FSLogix profiles. Strong knowledge of cloud architecture, design, and implementation principles and practices. Proficiency More ❯
writing code that is intuitive, clear, and easy to test Developing observability for new and existing ML applications and GenAI/LLM integrations, making use of the Grafana Stack (Prometheus, Loki, Tempo) Working closely with Data Scientists and ML Engineers throughout the lifecycle of productionising their models Being responsive to incidents regarding ML applications - including an understanding of how to More ❯
team. Some things that may be a bonus (but learning these on the job will be fine): Experience in some of the other key technologies we use (Linux, Docker, Prometheus, Express, MongoDB, Redis, PostgreSQL). Experience integrating large scale pub-sub systems such as Google Cloud Pub/Sub. We expect you to: Work with us full time. Have strong More ❯
Wokingham, Berkshire, England, United Kingdom Hybrid / WFH Options
Opus Recruitment Solutions Ltd
and smart architecture are key. What You’ll Be Working With: MySQL , Vitess , and Linux in production (Dont worry if you haven't worked with Vitess) Monitoring tools like Prometheus and Grafana Shard allocation, replication tuning, disk performance Backup, restore, and DR testing Data migrations and custom table loads for NHS tenants Zero-downtime patching and performance baselining What You More ❯
Edinburgh & Lothians, Scotland, United Kingdom Hybrid / WFH Options
Bright Purple Resourcing
and hands-on cluster configuration. Youll be working across: Multi-node RKE2 clusters set-up, networking, RBAC, disaster recovery and failover Installation and configuration of open source components Including Prometheus, Grafana, Loki, Alloy, PostgreSQL, Rook Ceph, ActiveMQ Artemis, and Keycloak Kubernetes native deployment tooling - Helm and Kustomize (creation and use required) plus exposure to FluxCD pipelines Service integration and lifecycle More ❯
Annapolis Junction, Maryland, United States Hybrid / WFH Options
Codescratch LLC
Artifactory, and SonarQube Knowledgeable in Artificial Intelligence, specifically Large Language Models A strong understanding of cybersecurity best practices, encryption methods, and secure coding techniques Familiar with observability tools, including Prometheus, Grafana, and the ELK stack Ability to effectively communicate intricate technical information to individuals with non-technical backgrounds and to senior leadership Experience with Machine Learning Analytics Experience with Amazon … Web Services (AWS) Experience with asynchronous messaging systems (RabbitMQ, Apache Kafka, etc.) Experience monitoring application performance with metrics (Prometheus, InfluxDB, Grafana) and logs with ELK Stack (ElsticSearch, Logstash, Kibana) Excellent communication and collaboration abilities Experience working independently to solve complex problems Salary Range Pay range $165,000 - $225,000. (Plus Benefits) The pay range for this job level is a More ❯