pipelines with Jenkins, GitLab CI/CD or similar - Containerize applications using Docker and orchestrate with Kubernetes - Monitor systems with Prometheus, Grafana and apply observability best practices - Automate deployment processes and improve DevOps workflows - Ensure high availability, fault tolerance and disaster recovery of cloud infrastructure - Collaborate with developers and operational more »
for mentoring and technical coaching of developers Excellent verbal and written communication skills The following skills and experience are nice to have: Experience with observability and monitoring tools such as Datadog Familiarity with polyglot persistence and experience with a NoSql DB especially DocumentDB Experience with Kafka, HiveMQ, ElasticSearch Infrastructure as more »
Warwick, Warwickshire, West Midlands, United Kingdom
Tata Technologies Europe Ltd
for mentoring and technical coaching of developers Excellent verbal and written communication skills The following skills and experience are nice to have: Experience with observability and monitoring tools such as Datadog Familiarity with polyglot persistence and experience with a NoSql DB especially DocumentDB Experience with Kafka, HiveMQ, ElasticSearch Infrastructure as more »
infrastructure needs, ideally in a self-service environment. WHAT YOU'LL DO: Build and provide platform features for compute, authentication, service discovery and calls, observability and redundancy Deploy and maintain Ripple's multi-region, multi-provider service platforms (with an emphasis on security and resiliency) Develop tools for automation, monitoring more »
Leeds, West Yorkshire, Yorkshire, United Kingdom Hybrid / WFH Options
Experis
Skills/requirements AWS including AWS ECS Terraform CI/CD practices Jenkins Containerisation (Docker) Agile practices and DevOps methodologies Nice to have : Kubernetes Observability practices/tooling (eg. Prometheus, Dynatrace) Experience supporting Scala/Java applications All profiles will be reviewed against the required skills and experience. Due to more »
in GitHub as a Source Code Management (SCM) platform, with hands-on experience in version control and collaborative coding. Experience with Kubernetes Experience with observability tools such as Datadog, Grafana, and Elastic for monitoring, analytics, and proactive issue identification. Experience in incident management, showcasing the ability to respond promptly and more »
infrastructure as code (IaC) tools (Terraform) Solid understanding of CI/CD pipelines, version control systems, and release management practices. Familiarity with monitoring and observability tools (Prometheus, Grafana, Dynatrace) Knowledge of security best practices, compliance standards, and incident response protocols. Strong analytical and problem-solving skills, with the ability to more »
scaling Kubernetes clusters (EKS) for containerised application deployments is essential. Collaborating with development teams to streamline application deployment lifecycles and implementing security, monitoring and observability best practices are also priorities. The ideal candidate will have 5+ years of DevOps experience with extensive Kubernetes expertise, strong programming abilities in Python (Java more »
normal—and that’s where you come in! We are seeking a skilled Site Reliability Engineer (SRE) with experience in AWS, Serverless, Monitoring, and Observability to join our team. Responsibilities: Design, build and maintain scalable, and reliable cloud infrastructure in AWS Monitor and manage the performance, reliability, and security of more »
normal—and that’s where you come in! We are seeking a skilled Site Reliability Engineer (SRE) with experience in AWS, Serverless, Monitoring, and Observability to join our team. Responsibilities: Design, build and maintain scalable, and reliable cloud infrastructure in AWS Monitor and manage the performance, reliability, and security of more »
Solid understanding of the processes involved in release, integration, and deployment, along with an awareness of the promotion pathways within these processes. Expertise in observability principles and practices, encompassing monitoring, logging, tracing, and alerting systems to ensure transparency and actionable insights into system performance and health. Tools Dynatrace, Datadog Familiarity more »
Solid understanding of the processes involved in release, integration, and deployment, along with an awareness of the promotion pathways within these processes. Expertise in observability principles and practices, encompassing monitoring, logging, tracing, and alerting systems to ensure transparency and actionable insights into system performance and health. Tools Dynatrace, Datadog Familiarity more »
a Site Reliability Engineer you’ll: Work closely with application teams to ensure their products, service and tools follow SRE best practices relating to observability, reliability, scalability and resilience. Proactively look to improve day to day SRE tasks through automation and collaboration. Take ownership of incident management and participate in more »
MySQL, Postgres, Redis, etc.) •Experience with DevOps engineering and working with container orchestration, such as with Docker or Kubernetes •Experience with log monitoring and observability via platforms like Sumologic or Cloudwatch •Experience automating infrastructure, testing, and deployments using tools like CircleCI Configuration management tooling and infrastructure as code knowledge is more »
essential Experience diagnosing and troubleshooting connectivity issues Well-versed in security by design and security by default. Familiarity with SOC compliance Strong command of observability and monitoring in a multi-tenant cloud Experience working in high-performing Agile teams Proficient development in a major programming language such as C#, Python more »
essential Experience diagnosing and troubleshooting connectivity issues Well-versed in security by design and security by default. Familiarity with SOC compliance Strong command of observability and monitoring in a multi-tenant cloud Experience working in high-performing Agile teams Proficient development in a major programming language such as C#, Python more »
Responsibilities: Act as the overall owner of Monitoring services, ensuring their reliability, availability, and quality. Establish and enforce operational processes and procedures for effective use of monitoring tools. Coordinate with stakeholders to identify and prioritize monitoring requirements, and ensure appropriate more »
that’s dedicated to creating opportunities for our customers, partners, and employees. We hope you’ll join us. Let’s create something incredible together! Observability Engineer At Anaplan we are looking for a self-motivated Observability Engineer to join our dedicated Observability Infrastructure team. Anaplan is a high-growth company … working people who believe in simplicity, agility and performance and can choose and use the best tools for the job. In the role of Observability Engineer, you will be working on the tools used to collect and analyse Observability telemetry (Logs, Metrics and Traces). You will enable engineers across … What you’ll be doing: In this role, working a minimum of 2 days a week in our London Office, you will be: Administering observability infrastructure. Deploying and configuring OTEL agents to collect telemetry, and to visualise this data in Grafana. Pairing with your colleagues to build everything from rapid more »
london, south east england, United Kingdom Hybrid / WFH Options
Anaplan
that’s dedicated to creating opportunities for our customers, partners, and employees. We hope you’ll join us. Let’s create something incredible together! Observability Engineer At Anaplan we are looking for a self-motivated Observability Engineer to join our dedicated Observability Infrastructure team. Anaplan is a high-growth company … working people who believe in simplicity, agility and performance and can choose and use the best tools for the job. In the role of Observability Engineer, you will be working on the tools used to collect and analyse Observability telemetry (Logs, Metrics and Traces). You will enable engineers across … What you’ll be doing: In this role, working a minimum of 2 days a week in our London Office, you will be: Administering observability infrastructure. Deploying and configuring OTEL agents to collect telemetry, and to visualise this data in Grafana. Pairing with your colleagues to build everything from rapid more »
Birmingham, West Midlands, United Kingdom Hybrid / WFH Options
Interact Consulting Limited
on experience with Terraform for Infrastructure as Code (IaC). Proficiency with Docker, Kubernetes, and AKS. Familiarity with Azure cloud-native services. Knowledge of observability and site-reliability engineering principles. Proficiency in SQL and experience working with relational databases. more »
responsibilities include: Monitoring the performance and reliability of the company's global online platforms Troubleshooting issues via proactive and reactive monitoring and alerts using observability tooling and logging service requests Enhancing the tech stack/configurations of existing services to improve site performance and reduce issues Exploring and delivering on more »
infrastructure as code. Implement and maintain CI/CD pipelines using GitLab CI/CD and Jenkins. Manage and monitor SRE systems, including log observability, Application Performance Monitoring (APM), infrastructure monitoring, and security. Proficient in working with Kubernetes for container orchestration and management. Experienced with AWS Cloud services and infrastructure more »
london, south east england, United Kingdom Hybrid / WFH Options
McCabe & Barton
infrastructure as code. Implement and maintain CI/CD pipelines using GitLab CI/CD and Jenkins. Manage and monitor SRE systems, including log observability, Application Performance Monitoring (APM), infrastructure monitoring, and security. Proficient in working with Kubernetes for container orchestration and management. Experienced with AWS Cloud services and infrastructure more »
Greater London, England, United Kingdom Hybrid / WFH Options
Overcast HQ
years of real-world application of these concepts in a DevOps position. AWS Cloud skills & best practices Infrastructure as code CloudFormation Templates Continuous delivery, Observability (Application Performance Monitoring) Configuration management (Infrastructure as a Service) AWS product experience in high-levels Cloudwatch EC2, Lambda Containers - Docker, AWS ECR IT Operations & Production more »
london, south east england, United Kingdom Hybrid / WFH Options
Overcast HQ
years of real-world application of these concepts in a DevOps position. AWS Cloud skills & best practices Infrastructure as code CloudFormation Templates Continuous delivery, Observability (Application Performance Monitoring) Configuration management (Infrastructure as a Service) AWS product experience in high-levels Cloudwatch EC2, Lambda Containers - Docker, AWS ECR IT Operations & Production more »