following (bonus): Java experience Python experience Ruby experience Big data technologies: Spark, Trino, Kafka Financial Markets experience SQL: Postgres, Oracle Cloud-native deployments: AWS, Docker, Kubernetes Observability: Splunk, Prometheus, Grafana California residents, please review the California Privacy Notice for information about certain legal rights at For more information about DRW's processing activities and our use of job applicants' data More ❯
Kubernetes-native platforms (ArgoCD, Prometheus, Istio, etc.) Prior work with CI/CD tools (GitHub Actions, Azure DevOps) in Kubernetes-based deployment pipelines Experience with UI telemetry and observability (Grafana, OpenTelemetry, etc.) Familiarity with SAP landscape operations or enterprise automation products AI/ML UI/UX feature integration experience is a strong plus Selected applicant will be subject to More ❯
plus. Capable of writing clean, maintainable and well-tested code. Comfortable working in on-prem and cloud-native environments with an interest in observability, using tools like Prometheus and Grafana to keep services healthy and maintainable. Familiarity with AWS services and how to integrate them into modern applications. A keen focus on quality and security, combining testing and scanning into More ❯
a bonus: Java experience Python experience Ruby experience Big data technologies: Spark, Trino, Kafka Financial Markets experience SQL: Postgres, Oracle Cloud-native deployments: AWS, Docker, Kubernetes Observability: Splunk, Prometheus, Grafana For more information about DRW's processing activities and our use of job applicants' data, please view our Privacy Notice at . California residents, please review the California Privacy Notice More ❯
Store, or Play Store. Familiarity with CI/CD pipelines, testing frameworks, and DevOps principles. Exposure to cloud-native environments (AWS, Docker, K8s) and observability tools (Prometheus, Grafana) is a plus. Passionate about mentoring, code reviews, and collaborative development. Energized by fast-paced, regulated environments where speed, quality, and adaptability matter. Your Technical Toolkit React/React More ❯
and deploying services with Java and Spring Boot. Comfort working in a cloud-native environment - Kubernetes (EKS), containers, scaling etc. An interest in observability, using tools like Prometheus and Grafana to keep services healthy and understand usage patterns. Familiarity with AWS services and how to integrate them into modern applications. A keen focus on quality and security, baking testing and More ❯
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
Manchester Digital
several microservices, also written in Python, utilising frameworks and libraries such as Celery, Eventlet, SQLAlchemy, etc. Additionally, GOV.UK Notify utilises AWS RDS (Postgres), AWS SQS, AWS ElastiCache, OpenTelemetry, Prometheus, Grafana and other related services. Concourse CI and Terraform are used to run build pipelines and manage our infrastructure. For the frontend, we follow the GOV.UK Design System, making use of GOV.UK More ❯
What you'll be doing: Building and maintaining a Kubernetes-hosted AI platform (AKS) Deploying and managing LLMOps tools such as LiteLLM, Langflow, and Langfuse Implementing observability with Prometheus, Grafana, and Loki Managing infrastructure through Terraform, ArgoCD, and GitHub Actions Supporting internal AI applications including RAG, document processing, and internal AI assistants What you'll need: 2-4 years in More ❯
London, South East England, United Kingdom Hybrid / WFH Options
Jump IT Recruitment Solutions Limited
Azure DevOps, IaC, Terraform, AKS, Grafana, Infrastructure, Ansible, HashiCorp, PowerShell, C#, CI/CD, DevOps, Kubernetes, SaaS, Git. Our client is looking to fill a brand new permanent role for a hands-on team lead/manager to lead and manage the day-to-day activities of a UK-based Azure DevOps team. Strong Azure DevOps More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Oliver Bernard
Expert knowledge of Containerisation with Docker and Kubernetes Strong Infrastructure as Code experience with Terraform History of working across CI/CD pipelines Monitoring and Observability experience with Prometheus, Grafana, and/or DataDog Prior experience overseeing Change and Incident Management processes Previous work in an Architectural capacity is also a massive bonus This position is open to Lead level More ❯
unit, API, e2e) Bonus: Proficiency in TypeScript (and Angular) Openness to learn new technologies and skills Our Tech Stack Java, Spring, Angular, TypeScript, Git, Gradle, Jenkins, Kubernetes, Rancher, OpenSearch, Grafana What we offer A relaxed, friendly, and collaborative atmosphere. The opportunity to work with international teams and gain experience in a large, enterprise-level company. Engaged in team-building events, quizzes More ❯
Skills: Infrastructure Specialist Configuration automation with Ansible Linux, preferably Red Hat/CentOS Good Network skills (Firewalls & Switches) AWS/Azure/GCP Containerisation technologies such as Kubernetes and Docker Grafana and/or Prometheus VMware Experience: Minimum 3 years. More ❯
and maintain Kubernetes manifests (Deployments, StatefulSets, PVCs, NetworkPolicies, etc.) Implement role-based access control (RBAC), service accounts, and admission policies Monitor cluster health and performance using tools like Prometheus, Grafana, and ELK/Loki. JupyterHub/Jupyter Notebook Expertise: Deploy, configure, and scale JupyterHub for multi-tenant use on Kubernetes/OpenShift Integrate JupyterHub with enterprise authentication (OAuth, SAML, LDAP More ❯
and serverless architectures. Deep understanding of CI/CD (GitHub Actions, Jenkins, or AWS CodePipeline). Proven ability to secure and scale production systems. Monitoring and observability tools (CloudWatch, Grafana, OpenTelemetry). Familiar with data exchange formats (JSON, YAML, Parquet) and API design. Leadership & Delivery 4-8 years in software development and/or DevOps, including 2+ in a management More ❯
Bolton, Greater Manchester, North West England, United Kingdom
RiskPod
serverless architectures. Deep understanding of CI/CD (GitHub Actions, Jenkins, or AWS CodePipeline). Proven ability to secure and scale production systems. Monitoring and observability tools (CloudWatch, Grafana, OpenTelemetry). Familiar with data exchange formats (JSON, YAML, Parquet) and API design. Leadership & Delivery 4–8 years in software development and/or DevOps, including 2+ in a management More ❯
Warrington, Cheshire, North West England, United Kingdom
RiskPod
serverless architectures. Deep understanding of CI/CD (GitHub Actions, Jenkins, or AWS CodePipeline). Proven ability to secure and scale production systems. Monitoring and observability tools (CloudWatch, Grafana, OpenTelemetry). Familiar with data exchange formats (JSON, YAML, Parquet) and API design. Leadership & Delivery 4–8 years in software development and/or DevOps, including 2+ in a management More ❯
Newcastle Upon Tyne, Tyne And Wear, United Kingdom
La Fosse Associates Limited
with the ability to interact effectively with both business users and technical teams. Strong understanding of: Application architecture and design Relational databases (SQL Server) Monitoring and alerting tools (e.g. Grafana, Prometheus, VictoriaMetrics) Scheduling tools (e.g. Control-M) Operating systems (Windows and Linux) Containerisation and orchestration (Kubernetes) Cloud platforms (Azure) Issue tracking and source control (JIRA, Git, Bitbucket) Familiarity with ITIL More ❯