Oxfordshire, South East, United Kingdom Hybrid / WFH Options
La Fosse Associates Ltd
Monitoring and Observability Engineer Salary - £50,000 - £55,000 - Fully remote role! Principal Accountabilities Design, implement, and manage monitoring solutions to ensure the availability, performance, and reliability of our systems. Collaborate with cross-functional teams to understand system requirements and implement effective monitoring strategies. Utilise expertise in Logic Monitor, OpenSearch … Proficient experience with other monitoring tools such as Dynatrace, New Relic, Splunk, Datadog, Nagios, Prometheus etc. Take ownership of the development of monitoring and observability practices Benefits include: 25 days holiday + statutory Competitive pension match Car allowance Family health care more »
in GitHub as a Source Code Management (SCM) platform, with hands-on experience in version control and collaborative coding. Experience with Kubernetes Experience with observability tools such as Datadog, Grafana, and Elastic for monitoring, analytics, and proactive issue identification. Experience in incident management, showcasing the ability to respond promptly and more »
Cheltenham, England, United Kingdom Hybrid / WFH Options
Northrop Grumman
the support for live (mission critical) systems, working with customers to fault find and resolve issues within strict time constraints. Experience using Industry standard observability tooling (ELK, Grafana, Prometheus), creating/maintaining these environments is a plus. You will have a strong understanding & navigation of both Windows and Linux operating more »
priorities. You are an expert in monitoring distributed systems while leveraging industry best practices. Analyse monitoring costs and drive cost saving whilst maintaining service observability and engineering productivity. Drive and contribute to error budget plans. Identify and lead development of automation that significantly reduce toil. Iteratively perform auditing of performance … and mentoring to junior team members. On-call rotation where required. What we're looking for in our next team mate: Proficient in application observability, DataDog preferred. Ability to build and maintain a system and culture that supports and implements SLOs. Experienced in AWS - everything from IAM, Lambda, Cloudfront, RDS more »
Monitoring and Observability Engineer Salary - £50,000 - £55,000 - Fully remote role! Principal Accountabilities Design, implement, and manage monitoring solutions to ensure the availability, performance, and reliability of our systems. Collaborate with cross-functional teams to understand system requirements and implement effective monitoring strategies. Utilise expertise in AWS CloudWatch, OpenSearch... more »
Manchester, North West, United Kingdom Hybrid / WFH Options
Travel Counsellors
close liaison with the Product, Software Engineering and Platform Engineering teams to ensure the swift and effective resolution of Platform issues. Using our Platform Observability and Support tooling, you will be responsible for proactively identifying challenges with platform performance or stability to ensure the platform operates within our service levels. more »
Birmingham, West Midlands, United Kingdom Hybrid / WFH Options
Interact Consulting Limited
on experience with Terraform for Infrastructure as Code (IaC). Proficiency with Docker, Kubernetes, and AKS. Familiarity with Azure cloud-native services. Knowledge of observability and site-reliability engineering principles. Proficiency in SQL and experience working with relational databases. more »
resilient and high performing to meet the evolving needs of the ParentPay group and will work in partnership with the IT Engineering , Monitoring and Observability, IT support, Application Support Service Ops teams ensuring systems, services & infrastructure work reliably and securely. Key Responsibilities Line management responsibilities for the team; providing support … personal development plans and undertaking 1-2-1 reviews. Working closely with the Monitoring and Observability, IT Support and Engineering teams to triage all infrastructure tickets and escalate to the Engineering team when SME resource is required. Develop the skills and competence within the infrastructure team to fulfil business requirements. more »
essential Experience diagnosing and troubleshooting connectivity issues Well-versed in security by design and security by default. Familiarity with SOC compliance Strong command of observability and monitoring in a multi-tenant cloud Experience working in high-performing Agile teams Proficient development in a major programming language such as C#, Python more »
essential Experience diagnosing and troubleshooting connectivity issues Well-versed in security by design and security by default. Familiarity with SOC compliance Strong command of observability and monitoring in a multi-tenant cloud Experience working in high-performing Agile teams Proficient development in a major programming language such as C#, Python more »
greater manchester, north west england, United Kingdom Hybrid / WFH Options
Clarus WMS
skills, and the capacity to work well in a team environment are often considered essential. Understanding of DevOps principles and practices, including monitoring, logging, observability and infrastructure management. Why Join Clarus Software? Here are some of the reasons you will love working here: A team of amazing people. We are more »
to SRE/Platform Engineering, or are still working as a Java Developer extensive experience with AWS, Kubernetes, Terraform, CI/CD tools strong observability experience, ideally with more modern approaches like Prometheus, Grafana, Open Telemetry comfortable with databases exposure to Kafka would be ideal more »
Northern Ireland, United Kingdom Hybrid / WFH Options
SoftwareOne
CI/CD pipeline development IaC, specifically Terraform or a wrapper Kubernetes, or a Cloud-managed container orchestrator, e.g. AKS, EKS Experience operating an observability stack preferred Bonus points for being security-minded About You: You have experience operating a mature, high volume stack in the cloud You have opinions more »
Manchester, England, United Kingdom Hybrid / WFH Options
bet365
Who we are looking for An Observability Engineer to be part of a team responsible for the provisioning and development of a modern observability platform. You will be administering, maintaining and evolving our observability platform while providing best practice and monitoring solutions to support all our critical products and systems. … remediation which are critical for maintaining the health and uptime of our on premise and cloud applications. Our recent move from a monitoring to observability first mind-set is a critical part of our DevOps strategy and as such, Observability Engineers will be expected to provide a suitable platform for … key insights, help to reduce mean time to detect (MTTD) and mean time to repair (MTTR) and introduce practices and standards for ensuring that observability is not an afterthought for new systems. This role is eligible for inclusion in the Company’s hybrid working from home policy. Preferred skills and more »
technical POC for the company. Work closely with key decision makers in the company. Requirements: Experience with cloud technologies and DevOps principles. Familiarity with Observability concepts and hands-on experience with tools like Prometheus, OpenSearch, etc. Proficiency in delivering presentations in fluent English; proficiency in additional languages is a plus. more »
a vital role in maximising the value of data. Our client is looking for a hands-on Data Governance Manager to work to improve observability and usability of data within the organisation and to ensure compliance with group policies. You’ll be expected to drive these outcomes, designing, and deploying more »
Manchester Area, United Kingdom Hybrid / WFH Options
bet365
for those that operate them. You will work with a wide range of technologies developing solutions, consulting with development teams and working with contemporary observability and incident management tools to assist the Business. You will be required to make effective decisions to improve the health and maintain the availability and … knowledge of SRE principles, including the creation and management of effective SLI’s and SLO’s for reliability and customer satisfaction. Knowledge of contemporary observability tools, techniques and best practice including Splunk, New Relic, Grafana and Pager Duty. Excellent knowledge of programming languages including Python, Golang and JavaScript. Knowledge and more »
youll do Working as part of a team or as the engagement lead with experience in multicloud networking, SASE, datacentre networking, network simplification, automation, observability and application networking and a strong understanding and alignment of security as it relates to the previous technology areas. Leveraging this experience to deliver: Enterprise more »
Warwick, Warwickshire, West Midlands, United Kingdom
Tata Technologies Europe Ltd
for mentoring and technical coaching of developers Excellent verbal and written communication skills The following skills and experience are nice to have: Experience with observability and monitoring tools such as Datadog Familiarity with polyglot persistence and experience with a NoSql DB especially DocumentDB Experience with Kafka, HiveMQ, ElasticSearch Infrastructure as more »
for mentoring and technical coaching of developers Excellent verbal and written communication skills The following skills and experience are nice to have: Experience with observability and monitoring tools such as Datadog Familiarity with polyglot persistence and experience with a NoSql DB especially DocumentDB Experience with Kafka, HiveMQ, ElasticSearch Infrastructure as more »
normal—and that’s where you come in! We are seeking a skilled Site Reliability Engineer (SRE) with experience in AWS, Serverless, Monitoring, and Observability to join our team. Responsibilities: Design, build and maintain scalable, and reliable cloud infrastructure in AWS Monitor and manage the performance, reliability, and security of more »
Solid understanding of the processes involved in release, integration, and deployment, along with an awareness of the promotion pathways within these processes. Expertise in observability principles and practices, encompassing monitoring, logging, tracing, and alerting systems to ensure transparency and actionable insights into system performance and health. Tools Dynatrace, Datadog Familiarity more »
Solid understanding of the processes involved in release, integration, and deployment, along with an awareness of the promotion pathways within these processes. Expertise in observability principles and practices, encompassing monitoring, logging, tracing, and alerting systems to ensure transparency and actionable insights into system performance and health. Tools Dynatrace, Datadog Familiarity more »
london, south east england, United Kingdom Hybrid / WFH Options
Anaplan
that’s dedicated to creating opportunities for our customers, partners, and employees. We hope you’ll join us. Let’s create something incredible together! Observability Engineer At Anaplan we are looking for a self-motivated Observability Engineer to join our dedicated Observability Infrastructure team. Anaplan is a high-growth company … working people who believe in simplicity, agility and performance and can choose and use the best tools for the job. In the role of Observability Engineer, you will be working on the tools used to collect and analyse Observability telemetry (Logs, Metrics and Traces). You will enable engineers across … What you’ll be doing: In this role, working a minimum of 2 days a week in our London Office, you will be: Administering observability infrastructure. Deploying and configuring OTEL agents to collect telemetry, and to visualise this data in Grafana. Pairing with your colleagues to build everything from rapid more »
london, south east england, United Kingdom Hybrid / WFH Options
McCabe & Barton
infrastructure as code. Implement and maintain CI/CD pipelines using GitLab CI/CD and Jenkins. Manage and monitor SRE systems, including log observability, Application Performance Monitoring (APM), infrastructure monitoring, and security. Proficient in working with Kubernetes for container orchestration and management. Experienced with AWS Cloud services and infrastructure more »