Manchester, North West, United Kingdom Hybrid / WFH Options
Hays
Strong understanding of networking, virtualisation, and cloud security principles. Operate, maintain, and enhance the Azure Virtual Desktop (AVD) environment. Experience with monitoring and logging tools (e.g., Azure Monitor, CloudWatch, Prometheus). Expert in setting up and managing host pools, session hosts, user access, application layers, and FSLogix profiles. Strong knowledge of cloud architecture, design, and implementation principles and practices. Proficiency More ❯
Manchester, Lancashire, England, United Kingdom Hybrid / WFH Options
Hays Specialist Recruitment Limited
Strong understanding of networking, virtualisation, and cloud security principles. Operate, maintain, and enhance the Azure Virtual Desktop (AVD) environment. Experience with monitoring and logging tools (e.g., Azure Monitor, CloudWatch, Prometheus). Expert in setting up and managing host pools, session hosts, user access, application layers, and FSLogix profiles. Strong knowledge of cloud architecture, design, and implementation principles and practices. Proficiency More ❯
writing code that is intuitive, clear, and easy to test Developing observability for new and existing ML applications and GenAI/LLM integrations, making use of the Grafana Stack (Prometheus, Loki, Tempo) Working closely with Data Scientists and ML Engineers throughout the lifecycle of productionising their models Being responsive to incidents regarding ML applications - including an understanding of how to More ❯
team. Some things that may be a bonus (but learning these on the job will be fine): Experience in some of the other key technologies we use (Linux, Docker, Prometheus, Express, MongoDB, Redis, PostgreSQL). Experience integrating large scale pub-sub systems such as Google Cloud Pub/Sub. We expect you to: Work with us full time. Have strong More ❯
Wokingham, Berkshire, England, United Kingdom Hybrid / WFH Options
Opus Recruitment Solutions Ltd
and smart architecture are key. What You’ll Be Working With: MySQL , Vitess , and Linux in production (Dont worry if you haven't worked with Vitess) Monitoring tools like Prometheus and Grafana Shard allocation, replication tuning, disk performance Backup, restore, and DR testing Data migrations and custom table loads for NHS tenants Zero-downtime patching and performance baselining What You More ❯
Edinburgh & Lothians, Scotland, United Kingdom Hybrid / WFH Options
Bright Purple Resourcing
and hands-on cluster configuration. Youll be working across: Multi-node RKE2 clusters set-up, networking, RBAC, disaster recovery and failover Installation and configuration of open source components Including Prometheus, Grafana, Loki, Alloy, PostgreSQL, Rook Ceph, ActiveMQ Artemis, and Keycloak Kubernetes native deployment tooling - Helm and Kustomize (creation and use required) plus exposure to FluxCD pipelines Service integration and lifecycle More ❯
Annapolis Junction, Maryland, United States Hybrid / WFH Options
Codescratch LLC
Artifactory, and SonarQube Knowledgeable in Artificial Intelligence, specifically Large Language Models A strong understanding of cybersecurity best practices, encryption methods, and secure coding techniques Familiar with observability tools, including Prometheus, Grafana, and the ELK stack Ability to effectively communicate intricate technical information to individuals with non-technical backgrounds and to senior leadership Experience with Machine Learning Analytics Experience with Amazon … Web Services (AWS) Experience with asynchronous messaging systems (RabbitMQ, Apache Kafka, etc.) Experience monitoring application performance with metrics (Prometheus, InfluxDB, Grafana) and logs with ELK Stack (ElsticSearch, Logstash, Kibana) Excellent communication and collaboration abilities Experience working independently to solve complex problems Salary Range Pay range $165,000 - $225,000. (Plus Benefits) The pay range for this job level is a More ❯
features with long-term technical initiatives. Technologies: AI AWS Azure CI/CD Cloud CTO Cypress Docker FastAPI Flask GitLab Grafana IoT Support Java Kafka Kubernetes Mobile MQTT PostgreSQL Prometheus Python REST Security TypeScript microservices pytest More: About us: At aedifion you will actively shape the future of energy-efficient building operations. The mission is to make existing buildings smarter … Go, Java, Typescript • Python Frameworks: FastAPI, Flask • Cloud Environments: AWS, Azure • Databases: PostgreSQL, InfluxDB • CI/CD Tools: GitLab CI/CD, Sonarqube • Containerization: Docker, Kubernetes • Monitoring & Logging: Grafana, Prometheus, VictoriaMetrics • Architecture: Microservices, Event-driven Architecture, REST APIs • Testing: Pytest, Cypress, vitest, testify • Messaging: Apache Kafka, AMQP, MQTT This foundation enables the processing of billions of building data points, real More ❯
Newcastle Upon Tyne, Tyne and Wear, England, United Kingdom Hybrid / WFH Options
Lorien
on Azure or AWS. Driving Infrastructure as Code (IaC) practices using Terraform. Building and optimising CI/CD pipelines to accelerate delivery. Implementing and maintaining monitoring and observability with Prometheus and Grafana. Enabling team collaboration and incident response through Slack and other ChatOps tools. Leading, mentoring, and supporting engineers (or preparing to step into people management if you're progressing … hands-on experience with Azure or AWS. Solid background with Terraform and IaC. Proven use of CI/CD tools (Jenkins, GitHub Actions, GitLab CI, etc.). Knowledge of Prometheus and Grafana for monitoring. Familiarity with collaboration tools like Slack. Either: Prior management/team lead experience, or A Senior DevOps engineer ready to progress into a managerial role. (Bonus More ❯
Senior Software Engineer, Commerce Platform Remote - Ireland Who we are At Twilio, we're shaping the future of communications, all from the comfort of our homes. We deliver innovative solutions tohundreds of thousands of businesses and empower millions of developers More ❯
and maintaining backend services using Django Designing and implementing scalable REST APIs Ensuring security, performance, and reliability of the platform Managing our AWS partner Monitoring and troubleshooting issues using Prometheus, Grafana, and OpenSearch Supporting the team with infrastructure needs for new features and deployments Implementing Infrastructure as Code (IaC) using tools like Terraform and Ansible Handling high traffic volumes, up … and maintaining backend services using Django Designing and implementing scalable REST APIs Ensuring security, performance, and reliability of the platform Managing our AWS partner Monitoring and troubleshooting issues using Prometheus, Grafana, and OpenSearch Supporting the team with infrastructure needs for new features and deployments Implementing Infrastructure as Code (IaC) using tools like Terraform and Ansible Handling high traffic volumes, up More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Salt Search
overcome technical barriers. Contributing ideas and helping raise capability across the team. Taking part in an out-of-hours escalation rota. Tech Environment Core: Kubernetes (EKS on AWS), Karpenter, Prometheus, Terraform. Preferred: Service mesh (Cilium or similar), Flux/Argo, Ansible. Bonus: High-performance compute/GPUs in Kubernetes. What They're Looking For 5-10 years' hands-on Kubernetes … EKS on AWS) experience - ABSOLUTE MUST Strong skills with Terraform, Prometheus, and scaling infra. Collaborative and adaptable in a fast-paced environment where priorities shift quickly. Ability to solve technical challenges and mentor others through example. Culture The environment is fast-moving, with priorities that change quickly in line with business needs. It's collaborative, technical, and high-output - you More ❯