in GitHub as a Source Code Management (SCM) platform, with hands-on experience in version control and collaborative coding. Experience with Kubernetes Experience with observability tools such as Datadog, Grafana, and Elastic for monitoring, analytics, and proactive issue identification. Experience in incident management, showcasing the ability to respond promptly and more »
normal—and that’s where you come in! We are seeking a skilled Site Reliability Engineer (SRE) with experience in AWS, Serverless, Monitoring, and Observability to join our team. Responsibilities: Design, build and maintain scalable, and reliable cloud infrastructure in AWS Monitor and manage the performance, reliability, and security of more »
Solid understanding of the processes involved in release, integration, and deployment, along with an awareness of the promotion pathways within these processes. Expertise in observability principles and practices, encompassing monitoring, logging, tracing, and alerting systems to ensure transparency and actionable insights into system performance and health. Tools Dynatrace, Datadog Familiarity more »
london, south east england, United Kingdom Hybrid / WFH Options
Anaplan
that’s dedicated to creating opportunities for our customers, partners, and employees. We hope you’ll join us. Let’s create something incredible together! Observability Engineer At Anaplan we are looking for a self-motivated Observability Engineer to join our dedicated Observability Infrastructure team. Anaplan is a high-growth company … working people who believe in simplicity, agility and performance and can choose and use the best tools for the job. In the role of Observability Engineer, you will be working on the tools used to collect and analyse Observability telemetry (Logs, Metrics and Traces). You will enable engineers across … What you’ll be doing: In this role, working a minimum of 2 days a week in our London Office, you will be: Administering observability infrastructure. Deploying and configuring OTEL agents to collect telemetry, and to visualise this data in Grafana. Pairing with your colleagues to build everything from rapid more »
responsibilities include: Monitoring the performance and reliability of the company's global online platforms Troubleshooting issues via proactive and reactive monitoring and alerts using observability tooling and logging service requests Enhancing the tech stack/configurations of existing services to improve site performance and reduce issues Exploring and delivering on more »
london, south east england, United Kingdom Hybrid / WFH Options
McCabe & Barton
infrastructure as code. Implement and maintain CI/CD pipelines using GitLab CI/CD and Jenkins. Manage and monitor SRE systems, including log observability, Application Performance Monitoring (APM), infrastructure monitoring, and security. Proficient in working with Kubernetes for container orchestration and management. Experienced with AWS Cloud services and infrastructure more »
london, south east england, United Kingdom Hybrid / WFH Options
Overcast HQ
years of real-world application of these concepts in a DevOps position. AWS Cloud skills & best practices Infrastructure as code CloudFormation Templates Continuous delivery, Observability (Application Performance Monitoring) Configuration management (Infrastructure as a Service) AWS product experience in high-levels Cloudwatch EC2, Lambda Containers - Docker, AWS ECR IT Operations & Production more »
the lead on projects to improve our DevOps: CI/CD pipeline (vulnerability scanning, static analysis, tests), blue/green deploys, auto load balancing, observability & instrumentation, infrastructure as code (eg Terraform) etc. Take the lead on projects to refactor our codebase, separating domain-specific logic, application logic and UI code more »
solutions including for compute, storage, messaging and networking Good understanding of Docker and experience with CI/CD tooling Good understanding of security and observability best practices and tooling Effective communication and collaboration skills What else about you would excite us? Experience building and maintaining high-traffic server-side web more »
to SRE/Platform Engineering, or are still working as a Java Developer extensive experience with AWS, Kubernetes, Terraform, CI/CD tools strong observability experience, ideally with more modern approaches like Prometheus, Grafana, Open Telemetry comfortable with databases exposure to Kafka would be ideal more »
london, south east england, United Kingdom Hybrid / WFH Options
Oliver Bernard
required: Strong Cloud experience with AWS and AWS Services Containerisation/Orchestration with Kubernetes Strong understanding of IaC with Terraform Wealth of Monitoring and Observability experience Knowledge of Security/DevSecOps practices This position can offer £90-110K, plus benefits, and operates a hybrid working model (with 3 office more »
a Lead DevOps Engineer Design and develop cutting edge computing infrastructure in a modern cloud environment - Azure and Kubernetes (AKS). Design and develop observability and monitoring solutions to optimize the product and end user experience. Leverage cutting edge technologies to ensure CI/CD in Azure DevOps Mentor junior more »
able to build automation tools (we use Python, but any language is fine) Extensive experience with AWS, K8s, Terraform Experience designing, building and maintaining observability tooling and CI/CD pipelines Understanding of Security, Networking and Databases 👍 Bonus points for: Having worked as a software engineer before shifting your focus more »
london, south east england, United Kingdom Hybrid / WFH Options
BPP
management, and the prowess of cloud-native solutions. In your pursuit of continuous improvement, you're not solely reliant on metrics; you dive into observability metrics and user feedback, steering our technical progress with insightful analysis. Staying ahead is not just a practice; it's inherent. You're not merely more »
similar frameworks/tech. Strong Automation & Config management tooling experience - preferred Ansible/Puppet/Terraform. Strong Linux troubleshooting skills Excellent practical experience with Observability systems. Excellent communicator - well versed in working directly with stakeholders. Excellent P&L linked bonus pay, generous base salary and comprehensive suite of benefits on more »
keep this complex platform operational. You will be task with managing both on premise and public cloud and containerised infrastructure, while increasing monitoring and observability, optimasing performance and automation etc. We are seeking a DevOps Engineer with the following: Experience maintaining and scaling hybrid cloud infrastructure Containerisation experience with Docker more »
london, south east england, United Kingdom Hybrid / WFH Options
ByteHire
understanding of Google Cloud (GCP) Deep understanding of SRE ethos and principles Vast amounts of Terraform experience Solid experience with Python Solid experience of Observability tooling. Good experience in dashboard creation/data visualisation using tools such as Google Looker, or Grafana Strong CI/CD experience Strong containerisation experience more »
london, south east england, United Kingdom Hybrid / WFH Options
Xpertise Recruitment
will involve ensuring that the design of new cloud services and features is optimal, considering various functional and non-functional attributes, including performance, availability, observability, security and cost. Whilst remaining hands-on, you’ll also maintain a strategic vision, ensuring that a fast-paced development cycle converges on your chosen more »
london (croydon), south east england, United Kingdom
Matchtech
to build and elaborate the backlog and sprints. * Scoping and participating in spikes and defining best practice. * Leading the build effort, including runbooks, automation, observability, and selfservice. * Guiding and mentoring other members of the squad. Skills and Experience * Designing solutions using AWS products and services. * Knowledge of the Well-Architected more »
the transition from design and construction to maintenance and monitoring has left you yearning for more. Requirements Proficient Python coder. Kubernetes experience. Expertise in Observability platforms and tooling such as Grafana and Prometheus. At least 5 years of experience, ideally in a start up or scale up. We understand that more »
models to production. Optimizing the platform runtime for maximum performance. This is largely C++ code with parts of the pipeline running on GPU. Building observability and telemetry. Requirements and experience we are looking for 3+ years of experience writing production software in C++ and Python of experience building applications processing more »
youll do Working as part of a team or as the engagement lead with experience in multicloud networking, SASE, datacentre networking, network simplification, automation, observability and application networking and a strong understanding and alignment of security as it relates to the previous technology areas. Leveraging this experience to deliver: Enterprise more »
quickly. Comfortable building highly-available APIs with a strong understanding of RPC frameworks Proven track record of building scalable and modern backend microservices with observability as a first-class citizen. You have expertise in coding with modern Python or at least one modern strongly-typed language. You’re an expert more »
able to articulate these decisions to promote wider adoption For each stage of the data management process, such as integration, orchestration, serialisation, project configuration, observability, monitoring, cataloguing etc you are aware of best-in-class industry solutions and can make considered choices between them depending on your use case and more »